Good day all,
I finally found some time to make some changes to the analysis focusing on forecasting an accurate capacity reading based on all the data (which can now be found on the second page).
This method takes the average capacity readings for each user in each mileage category (every 5,000 miles) that has at least 3 entries and performs a linear regression to predict what the capacity will be. (nerd note: the correlation coefficient of this data is 0.97 so it's very well attuned to the linear regression method). if there were any categories that did not meet the criteria, the forecasted capacity is used to maintain the continuity of the forecast. This can be seen for reference in the upper left hand chart.
In short, you can now enter any mileage you wish and see what the projected capacity in Ah will be based on the aggregate data. You can also enter your current capacity in Ah and your current mileage to see how you compare to the the total population.
in the example below, I've entered my current capacity (40.38 Ah) and my current mileage (55,000) and see that my projected capacity should be 47.06Ah
This is depicted on the chart on the lower left as follows:
Yellow = Projected capacity/Entered mileage
Green = Your entered capacity in Ah
Red = the 36.6Ah warranty threshold
You can also see where you current capacity should fall on mileage based on the projection. In my case my capacity is far below what should be expected. What this shows is that based on the cumulative data the average person wouldn't expect to see my capacity until at least 125k miles (don't worry, I picked mine up used a couple months ago and am definitely an outlier in the data).
View attachment 19870
As MrFixIt mentions, this data will only become more accurate the more datapoints are entered, so please continue to contribute!
I’ll project that your methodology will produce inaccurate results. Your own vehicle is a perfect example. Why would you dismiss actual data as being an outlier? Sure, since you recently purchased the vehicle, you have no history of how it was operated. That unknown history holds the secrets of why the capacity is what it is today.
Battery degradation isn’t linear. Additionally, using a mileage interval as a variable to project future battery capacity in a PHEV is futile. One owner might drive 5000 miles on gas and another might drive the same distance on batteries. The difference between the two cars could be hundreds of charge cycles and you’d have no idea of discharge rates, operating temperatures, age of the batteries or storage conditions.
I get it that with enough data, there will eventually be an average to which others can compare, however, that average will have been derived from a simplistic methodology that isn’t likely to produce meaningful results.