This is not a post about semantics. Using “N=1” to refer to self-experimentation is okay, as long as one understands that self-experimentation is one of the most powerful ways to improve one’s health. Typically the term “N=1” is used in a demeaning way, as in: “It is just my N=1 experience, so it’s not worth much, but …” This is the reason behind this post. Using the “N=1” term to refer to self-experimentation in this way is both incorrect and misleading.
Calling self-experimentation N=1 is incorrect
The table below shows a dataset that is discussed in this YouTube video on HealthCorrelator for Excel (HCE). It refers to one single individual. Nearly all health-related datasets will look somewhat like this, with columns referring to health variables and rows referring to multiple measurements for the health variables. (This actually applies to datasets in general, including datasets about non-health-related phenomena.)
Often each individual measurement, or row, will be associated with a particular point in time, such as a date. This will characterize the measurement approach used as longitudinal, as opposed to cross-sectional. One example of the latter would be a dataset where each row referred to a different individual, with the data on all rows collected at the same point in time. Longitudinal health-related measurement is frequently considered superior to cross-sectional measurement in terms of the insights that it can provide.
As you can see, the dataset has 10 rows, with the top row containing the names of the variables. So this dataset contains nine rows of data, which means that in this dataset “N=9”, even though the data is for one single individual. To call this an “N=1” experiment is incorrect.
As a side note, an empty cell, like that on the top row for HDL cholesterol, essentially means that a measurement for that variable was not taken on that date, or that it was left out because of obvious measurement error (e.g., the value received from the lab was “-10”, which would be a mistake since nobody has a negative HDL cholesterol level). The N of the dataset as a whole would still be technically 9 in a situation like this, with only one missing cell on the row in question. But the software would typically calculate associations for that variable (HDL cholesterol) based on a sample of 8.
Calling self-experimentation N=1 is misleading
Calling self-experimentation “N=1”, meaning that the results of self-experimentation are not a good basis for generalization, is very misleading. But there is a twist. Those results may indeed not be a good basis for generalization to other people, but they provide a particularly good basis for generalization for you. It is often much safer to generalize based on self-experimentation, even with small samples (e.g., N=9).
The reason, as I pointed out in this interview with Jimmy Moore, is that data about oneself only tends to be much more uniform than data about a sample of individuals. When multiple individuals are included in an analysis, the number of sources of error (e.g., confounding variables, measurement problems) is much higher than when the analysis is based on one single individual. Thus analyses based on data from one single individual yield results that are more uniform and stable across the sample.
Moreover, analyses of data about a sample of individuals are typically summarized through averages, and those averages tend to be biased by outliers. There are always outliers in any dataset; you might possibly be one of them if you were part of a dataset, which would render the average results at best misleading, and at worst meaningless, to you. This is a point that has also been made by Richard Nikoley, who has been discussing self-experimentation for quite some time, in this very interesting video.
Another person who has been talking about self-experimentation, and showing how it can be useful in personal health management, is Seth Roberts. He and the idea of self-experimentation were prominently portrayed in this article on the New York Times. Check this video where Dr. Roberts talks about how he found out through self-experimentation that, among other things, consuming butter reduced his arterial plaque deposits. Plaque reduction is something that only rarely happens, at least in folks who follow the traditional American diet.
HCE generates coefficients of association and graphs at the click of a button, making it relatively easy for anybody to understand how his or her health variables are associated with one another, and thus what modifiable health factors (e.g., consumption of certain foods) could be causing health effects (e.g., body fact accumulation). It may also help you identify other, more counter-intuitive, links; such as between certain thought and behavior patterns (e.g., wealth accumulation thoughts, looking at the mirror multiple times a day) and undesirable mental states (e.g., depression, panic attacks).
Just keep in mind that you need to have at least some variation in all the variables involved. Without variation there is no correlation, and thus causation may remain hidden from view.
Showing posts with label health management software. Show all posts
Showing posts with label health management software. Show all posts
Health markers varying inexplicably? Do some detective work with HCE
John was overweight, out of shape, and experiencing fatigue. What did he do? He removed foods rich in refined carbohydrates and sugars from his diet. He also ditched industrial seed oils and started exercising. He used HealthCorrelator for Excel (HCE) to keep track of several health-related numbers over time (see figure below).
Over the period of time covered in the dataset, health markers steadily improved. For example, John’s HDL cholesterol went from a little under 40 mg/dl to just under 70; see chart below, one of many generated by HCE.
However, John’s blood pressure varied strangely during that time, as you can see on the chart below showing the variation of systolic blood pressure (SBP) against time. What could have been the reason for that? Salt intake is an unlikely culprit, as we’ve seen before.
As it turns out, John knew that heart rate could influence blood pressure somewhat, and he also knew that his doctor’s office measured his heart rate regularly. So he got the data from his doctor's office. When he entered heart rate as a column into HCE, the reason for his blood pressure swings became clear, as you can see on the figure below.
On the left part of the figure above are the correlations between SBP and each of the other health-related variables John measured, which HCE lists in order of strength. Heart rate shows up at the top, with a high 0.946 correlation with SBP. On the right part of the figure is the chart of SBP against heart rate.
As you can see, John's heart rate, measured at the doctor's office, varied from 61 to 90 bpm. Given that, John decided to measure his resting heart rate. John’s resting heart rate, measured after waking up using a simple wrist watch, was 61 bpm.
Mystery solved! John’s blood pressure fluctuations were benign, and caused by fluctuations in heart rate.
If John's SBP had been greater than 140, which did not happen, this could be seen as an unusual example of irregular white coat hypertension.
If you are interested, this YouTube video clip discusses in more detail the case above, from HCE’s use perspective. It shows how the heart rate column was added to the dataset in HCE, how the software generated correlations and graphs, and how they were interpreted.
Reference
Kock, N. (2010). HealthCorrelator for Excel 1.0 User Manual. Laredo, Texas: ScriptWarp Systems.
Over the period of time covered in the dataset, health markers steadily improved. For example, John’s HDL cholesterol went from a little under 40 mg/dl to just under 70; see chart below, one of many generated by HCE.
However, John’s blood pressure varied strangely during that time, as you can see on the chart below showing the variation of systolic blood pressure (SBP) against time. What could have been the reason for that? Salt intake is an unlikely culprit, as we’ve seen before.
As it turns out, John knew that heart rate could influence blood pressure somewhat, and he also knew that his doctor’s office measured his heart rate regularly. So he got the data from his doctor's office. When he entered heart rate as a column into HCE, the reason for his blood pressure swings became clear, as you can see on the figure below.
On the left part of the figure above are the correlations between SBP and each of the other health-related variables John measured, which HCE lists in order of strength. Heart rate shows up at the top, with a high 0.946 correlation with SBP. On the right part of the figure is the chart of SBP against heart rate.
As you can see, John's heart rate, measured at the doctor's office, varied from 61 to 90 bpm. Given that, John decided to measure his resting heart rate. John’s resting heart rate, measured after waking up using a simple wrist watch, was 61 bpm.
Mystery solved! John’s blood pressure fluctuations were benign, and caused by fluctuations in heart rate.
If John's SBP had been greater than 140, which did not happen, this could be seen as an unusual example of irregular white coat hypertension.
If you are interested, this YouTube video clip discusses in more detail the case above, from HCE’s use perspective. It shows how the heart rate column was added to the dataset in HCE, how the software generated correlations and graphs, and how they were interpreted.
Reference
Kock, N. (2010). HealthCorrelator for Excel 1.0 User Manual. Laredo, Texas: ScriptWarp Systems.
The China Study II: A look at mortality in the 35-69 and 70-79 age ranges
This post is based on an analysis of a subset of the China Study II data, using HealthCorrelator for Excel (HCE), which is publicly available for download and use on a free trial basis. You can access the original data on the HCE web site, under “Sample datasets”.
HCE was designed to be used with small and individual personal datasets, but it can also be used with larger datasets for multiple individuals.
This analysis focuses on two main variables from the China Study II data: mortality in the 35-69 age range, and mortality in the 70-79 range. The table below shows the coefficients of association calculated by HCE for those two variables. The original variable labels are shown.
One advantage of looking at mortality in these ranges is that they are more likely to reflect the impact of degenerative diseases. Infectious diseases likely killed a lot of children in China at the time the data was being collected. Heart disease, on the other hand, is likely to have killed more people in the 35-69 and 70-79 ranges.
It is also good to have data for both ranges, because factors that likely increased longevity were those that were associated with decreased mortality in both ranges. For example, a factor that was strongly associated with mortality in the 35-69 range, but not the 70-79 range, might simply be very deadly in the former range.
The mortalities in both ranges are strongly correlated with each other, which is to be expected. Next, at the very top for both ranges, is sex. Being female is by far the variable with the strongest, and negative, association with mortality.
While I would expect females to live longer, the strengths of the associations make me think that there is something else going on here. Possibly different dietary or behavioral patterns displayed by females. Maybe smoking cigarettes or alcohol abuse was a lot less prevalent among them.
Markedly different lifestyle patterns between males and females may be a major confounding variable in the China Study sample.
Some of the variables are redundant; meaning that they are highly correlated and seem to measure the same thing. This is clear when one looks at the other coefficients of association generated by HCE.
For example, plant food consumption is strongly and negatively correlated with animal food consumption; so strongly that you could use either one of these two variables to measure the other, after inverting the scale. The same is true for consumption of rice and white flour.
Plant food consumption is not strongly correlated with plant protein consumption; many plant foods have little protein in them. The ones that have high protein content are typically industrialized and seed-based. The type of food most strongly associated with plant protein consumption is white flour, by far. The correlation is .645.
The figure below is based on the table above. I opened a separate instance of Excel, and copied the coefficients generated by HCE into it. Then I built two bar charts with them. The variable labels were replaced with more suggestive names, and some redundant variables were removed. Only the top 7 variables are shown, ordered from left to right on the bar charts in order of strength of association. The ones above the horizontal axis possibly increase mortality in each age range, whereas the ones at the bottom possibly decrease it.
When you look at these results as a whole, a few things come to mind.
White flour consumption doesn’t seem to be making people live longer; nor does plant food consumption in general. For white flour, it is quite the opposite. Plant food consumption reflects white flour consumption to a certain extent, especially in counties where rice consumption is low. These conclusions are consistent with previous analyses using more complex statistics.
Total food is positively associated with mortality in the 35-69 range, but not the 70-79 range. This may reflect the fact that folks who reach the age of 70 tend to naturally eat in moderation, so you don’t see wide variations in food consumption among those folks.
Eating in moderation does not mean practicing severe calorie restriction. This post suggests that calorie restriction doesn't seem to be associated with increased longevity in this sample. Eating well, but not too much, is.
The bar for rice (consumption) on the left chart is likely a mirror reflection of the white flour consumption, so it may appear to be good in the 35-69 range simply because it reflects reduced white flour consumption in that range.
Green vegetables seem to be good when you consider the 35-69 range, but not the 70-79 range.
Neither rice nor green vegetables seem to be bad either. For overall longevity they may well be neutral, with the benefits likely coming from their replacement of white flour in the diet.
Dietary fat seems protective overall, particularly together with animal foods in the 70-79 range. This may simply reflect a delayed protective effect of animal fat and protein consumption.
The protective effect of dietary fat becomes clear when we look at the relationship between carbohydrate calories and fat calories. Their correlation is -.957, which essentially means that carbohydrate intake seriously displaces fat intake.
Carbohydrates themselves may not be the problem, even if coming from high glycemic foods (except wheat flour, apparently). This post shows that they are relatively benign if coming from high glycemic rice, even at high intakes of 206 to 412 g/day. The problem seems to be caused by carbohydrates displacing nutrient-dense animal foods.
Interestingly, rice does not displace animal foods or fat in the diet. It is positively correlated with them. Wheat flour, on the other hand, displaces those foods. Wheat flour is negatively and somewhat strongly correlated with consumption of animal foods, as well as with animal fat and protein.
There are certainly several delayed effects here, which may be distorting the results somewhat. Degenerative diseases don’t develop fast and kill folks right away. They often require many years of eating and doing the wrong things to be fatal.
HCE was designed to be used with small and individual personal datasets, but it can also be used with larger datasets for multiple individuals.
This analysis focuses on two main variables from the China Study II data: mortality in the 35-69 age range, and mortality in the 70-79 range. The table below shows the coefficients of association calculated by HCE for those two variables. The original variable labels are shown.
One advantage of looking at mortality in these ranges is that they are more likely to reflect the impact of degenerative diseases. Infectious diseases likely killed a lot of children in China at the time the data was being collected. Heart disease, on the other hand, is likely to have killed more people in the 35-69 and 70-79 ranges.
It is also good to have data for both ranges, because factors that likely increased longevity were those that were associated with decreased mortality in both ranges. For example, a factor that was strongly associated with mortality in the 35-69 range, but not the 70-79 range, might simply be very deadly in the former range.
The mortalities in both ranges are strongly correlated with each other, which is to be expected. Next, at the very top for both ranges, is sex. Being female is by far the variable with the strongest, and negative, association with mortality.
While I would expect females to live longer, the strengths of the associations make me think that there is something else going on here. Possibly different dietary or behavioral patterns displayed by females. Maybe smoking cigarettes or alcohol abuse was a lot less prevalent among them.
Markedly different lifestyle patterns between males and females may be a major confounding variable in the China Study sample.
Some of the variables are redundant; meaning that they are highly correlated and seem to measure the same thing. This is clear when one looks at the other coefficients of association generated by HCE.
For example, plant food consumption is strongly and negatively correlated with animal food consumption; so strongly that you could use either one of these two variables to measure the other, after inverting the scale. The same is true for consumption of rice and white flour.
Plant food consumption is not strongly correlated with plant protein consumption; many plant foods have little protein in them. The ones that have high protein content are typically industrialized and seed-based. The type of food most strongly associated with plant protein consumption is white flour, by far. The correlation is .645.
The figure below is based on the table above. I opened a separate instance of Excel, and copied the coefficients generated by HCE into it. Then I built two bar charts with them. The variable labels were replaced with more suggestive names, and some redundant variables were removed. Only the top 7 variables are shown, ordered from left to right on the bar charts in order of strength of association. The ones above the horizontal axis possibly increase mortality in each age range, whereas the ones at the bottom possibly decrease it.
When you look at these results as a whole, a few things come to mind.
White flour consumption doesn’t seem to be making people live longer; nor does plant food consumption in general. For white flour, it is quite the opposite. Plant food consumption reflects white flour consumption to a certain extent, especially in counties where rice consumption is low. These conclusions are consistent with previous analyses using more complex statistics.
Total food is positively associated with mortality in the 35-69 range, but not the 70-79 range. This may reflect the fact that folks who reach the age of 70 tend to naturally eat in moderation, so you don’t see wide variations in food consumption among those folks.
Eating in moderation does not mean practicing severe calorie restriction. This post suggests that calorie restriction doesn't seem to be associated with increased longevity in this sample. Eating well, but not too much, is.
The bar for rice (consumption) on the left chart is likely a mirror reflection of the white flour consumption, so it may appear to be good in the 35-69 range simply because it reflects reduced white flour consumption in that range.
Green vegetables seem to be good when you consider the 35-69 range, but not the 70-79 range.
Neither rice nor green vegetables seem to be bad either. For overall longevity they may well be neutral, with the benefits likely coming from their replacement of white flour in the diet.
Dietary fat seems protective overall, particularly together with animal foods in the 70-79 range. This may simply reflect a delayed protective effect of animal fat and protein consumption.
The protective effect of dietary fat becomes clear when we look at the relationship between carbohydrate calories and fat calories. Their correlation is -.957, which essentially means that carbohydrate intake seriously displaces fat intake.
Carbohydrates themselves may not be the problem, even if coming from high glycemic foods (except wheat flour, apparently). This post shows that they are relatively benign if coming from high glycemic rice, even at high intakes of 206 to 412 g/day. The problem seems to be caused by carbohydrates displacing nutrient-dense animal foods.
Interestingly, rice does not displace animal foods or fat in the diet. It is positively correlated with them. Wheat flour, on the other hand, displaces those foods. Wheat flour is negatively and somewhat strongly correlated with consumption of animal foods, as well as with animal fat and protein.
There are certainly several delayed effects here, which may be distorting the results somewhat. Degenerative diseases don’t develop fast and kill folks right away. They often require many years of eating and doing the wrong things to be fatal.
Labels:
China Study,
HCE,
health management software,
longevity,
research,
statistics
HealthCorrelator for Excel (HCE) is now publicly available for free trial
HealthCorrelator for Excel (HCE) is now publicly available for download and use on a free trial basis. For those users who decide to buy it after trying, licenses are available for individuals and organizations. If you are a gym member, consider asking your gym to buy an organizational site license; this would allow the gym to distribute individual licenses at no cost to you and your colleagues.
HCE is a user-friendly Excel-based software that unveils important associations among health variables at the click of a button. Here are some of its main features:
- Easy to use yet powerful health management software.
- Estimates associations among any number of health variables.
- Automatically orders associations by decreasing absolute strength.
- Graphs relationships between pairs of health variables, for all possible combinations.
The beta testing was successfully completed, with fairly positive results. (Thank you beta testers!) Among beta testers were Mac users. The main request from beta testers was for more illustrative material on how to use HCE for specific purposes, such as losing body fat or managing blood glucose levels. This will be coming in the future in the form of posts and linked material.
To download a free trial version, good for 30 use sessions (which is quite a lot!), please visit the HealthCorrelator.com web site. There you will also find the software’s User Manual and various links to demo YouTube videos. You can also download sample datasets to try the software’s main features.
HCE is a user-friendly Excel-based software that unveils important associations among health variables at the click of a button. Here are some of its main features:
- Easy to use yet powerful health management software.
- Estimates associations among any number of health variables.
- Automatically orders associations by decreasing absolute strength.
- Graphs relationships between pairs of health variables, for all possible combinations.
The beta testing was successfully completed, with fairly positive results. (Thank you beta testers!) Among beta testers were Mac users. The main request from beta testers was for more illustrative material on how to use HCE for specific purposes, such as losing body fat or managing blood glucose levels. This will be coming in the future in the form of posts and linked material.
To download a free trial version, good for 30 use sessions (which is quite a lot!), please visit the HealthCorrelator.com web site. There you will also find the software’s User Manual and various links to demo YouTube videos. You can also download sample datasets to try the software’s main features.
Labels:
HCE,
health management software,
statistics
HealthCorrelator for Excel 1.0 (HCE): Call for beta testers
This call is closed. Beta testing has been successfully completed. HealthCorrelator for Excel (HCE) is now publicly available for download and use on a free trial basis. For those users who decide to buy it after trying, licenses are available for individuals and organizations.
To download a free trial version – as well as get the User Manual, view demo YouTube videos, and download and try sample datasets – visit the HealthCorrelator.com web site.
To download a free trial version – as well as get the User Manual, view demo YouTube videos, and download and try sample datasets – visit the HealthCorrelator.com web site.
Labels:
HCE,
health management software,
statistics
Subscribe to:
Posts (Atom)