People frequently ask us why we are anonymous. The answer is that we’re not anonymous, we’re pseudonymous. Both of these approaches are useful ways to conduct your internet science, so here is a quick guide to choosing which one works best for you:
Anonymous
No name attached at all.
Highest level of separation between government identity and the individual.
Can be good for one-off research or theoretical pieces that don’t need to be connected to other things you’ve worked on, or things that might be better for a different audience.
For study participants, the best option to protect their privacy.
Literally you can do infinite anonymous identities.
Naming conventions: numbering is easy, but there’s lots of opportunity to be creative. Or just say “by Anonymous”. Actually Google Docs has a nice approach:
Anonymous NyanCat
Pseudonymous
A pseudonym is like a band name. It’s a name that you will use multiple times, and a name that can develop its own reputation.
A pseudonym may or may not be connected to a legal name — this can also change over time or in different contexts. Lots of people are known by a stage name or a pen name professionally, but you can still find their other names without too much trouble. You may not immediately know Jay Z’s government name, but it’s not hard to find out that he was born Shawn Corey Carter.
Pseudonyms are great for groups! Fighting for first authorship is stupid — just come up with a silly name instead.
You can also have multiple pseudonyms. Generally it makes sense to pick one name for a given project and stay with that for a while, so that people can get to know your style and see what you do as a body of work that should be considered. But using different pseudonyms for different projects, or when you’re working with different groups, is a good approach.
Conclusions: Really it’s a spectrum. There are lots of different options that can work, depending on the project and your specific concerns.
Jon was a participant in our Potato Diet Community Trial. He recently sent us an email with an update on how he’s doing, which is reproduced below with his permission.
I don’t know if you wanted a 1 year followup.
So last year at this time I’d just come off my first potato diet and it seemed like the weight was staying off at least partially out of sheer cussedness and a desire to see my much-touted diet work out as well as I’d hoped. Where am I at a year later?
At the end of that first potato diet I was at 168, having lost about 15 pounds from my start of 183.6. Last time I weighed myself I was at around 172. That’s practically within water-weight of that 168! And that 172 is approximately stable compared to a month ago or whenever I last weighed myself!
In the last year I’ve done a couple more tries at the potato diet, neither of which were as successful as that first one. But I think when it comes down to it the potato diet knocked my basic set-weight down by about 11 pounds! The biggest ongoing change in my diet is having tater tots and sausage for breakfast almost every day, generally replacing cereal in the old regime.
Anyway, absurdly pleased by that result–potato diet wasn’t a magic bullet for me but it halted the inexorable upward slide of my weight, got me down a little bit and has kept me stable for a year.
Even though it never worked as well again (probably lack of accountability from not being part of a study) the potato diet was still life-changing and has improved my health long term! Please let me know if you need any more data, I’m happy to provide it! Thanks,
THIRD PLACE: Have Attention Spans Been Declining? by niplav, writing as Cennfaeladh, who also blogs at niplav.site. Also, honorable mention to niplav for getting by far the most (30,000+) pageviews.
We’ve been in contact with all three winners and will be sending them their awards shortly. Congratulations!
In addition, TripleTaco gave us this proposed explanation to the list of Amish mysteries. For your consideration:
After I formed my own theory of what caused Autism, I looked hard to see if anybody else had come to the same conclusion. After a long time of looking, I finally found one researcher, Max McDowell, who explains what he (and I) believe likely causes autism in this video: https://www.youtube.com/watch?v=BHhAnCTVLG4
If I were naming this, I’d call it “face starvation”. Babies have a crucial window in which they need a certain amount of eye contact and starving them of that causes autism for many children. He goes on in that video to describe many of the same mysteries I described in my mystery post. The theory lines up with the mysteries perfectly.
The video has been out for over 2 years and at this moment it has 114 views. His idea was first published all the way back in 2004, so it’s not new at all. It’s just being utterly ignored, and for very strong reasons:
First of all, he calls himself a “Jungian Psychoanalyst”, which frankly makes him sound kind of woo-woo and makes it harder to take him seriously. Secondly, he’s well outside of the academic circles that have earned the right to talk about autism and its causes. He’s an outsider. Thirdly, it’s a shocking, awful idea, and anybody championing it will be pilloried from all sides. The idea is threatening to the neurodiversity camp who don’t want autism treated like a disorder at all. The idea is threatening to parents who aren’t eager to hear that their own technology habits may have contributed to severe lifelong difficulty in their child. The idea is threatening to existing researchers who are far along other research paths and stand to gain little from such a simple tidy explanation, especially when championing it would get them in all kinds of hot water from every direction. Frankly, this idea is way too controversial to put my own name on, which is why this is an anonymous contribution.
Even if we’re right, I’m not hopeful that people are going to start taking this idea seriously. There are some ideas that are just too controversial to be taken seriously.
There are lots of stories where an American goes on vacation for a few weeks, to Europe or Asia or wherever, and loses a significant amount of weight without any special effort.
There are also some stories that are exactly the opposite: someone from Europe or Asia or wherever goes on vacation to America for a few weeks, and GAINS a significant amount of weight without any changes.
Some stories describe what are almost simple ABABA-style experiments, where a person goes back and forth from America to other parts of the world and sees their weight reliably yo-yo as they move from country to country. We shared one such story in Part X of A Chemical Hunger, this account from Julius:
I currently live in Seattle but have moved around a lot. I’ve made 6 separate moves between places where I drank the tap water (mostly USA/UK/Hungary) and places I haven’t (South East Asia, India, Middle East). Whenever I’ve spent significant time in bottled water countries I lost weight (up to 50 lbs), and each time, save one 3 month stretch in Western Europe, I gained it back in tap water countries. I also lost weight for the first time in the States (20 lbs) this year around the time I switched to filtered water.
We can also share this story, from another reader (lightly edited per his request):
I quit my job and moved to northwest Thailand. I lived there for about 1.5 years and lost 100lbs. At the end, I was thinner than I’d ever been as an adult, even in Germany, just under 200lbs (or ~90kg). I didn’t do much if any exercise and I typically ate as much as I wanted.
Then I moved to SF and gained the 100lbs back within 2-3 years. I worked out while I was still thin, but, again, it didn’t stop the gaining and eventually became impractical as I was too fat to run. Peaked around 300lbs once again.
So then I thought, it’s the work stress. So I quit my job once more, and I moved to Las Vegas, where I lived for an ENTIRE YEAR without a job. And didn’t lose any weight.
During these last few years, I would often vacation in Thailand for 1-3 months, and always lose weight without trying. Typically, if I was there for at least 1 month, I’d see some noticeable fat loss on the order of a few pounds, and feel my belt get looser.
Even right now, as I’m visiting with my family back in Germany, I am visibly and belt-feel thinner after only 3 weeks of being outside the US, despite zero exercise and eating a ton of salami and cheeses.
Anecdotes are fun, but so far there hasn’t been any systematic study. If this vacation weight loss effect is real, it seems like it would be good to know. It would give us a powerful tool for causing weight loss (just take a trip to somewhere leaner than where you’re living right now), and it would help us get closer to finding out what causes obesity.
In particular, it would provide more evidence for the contamination hypothesis, since losing weight on vacation is a pretty strong hint that something about the environment is to blame. Most of the anecdotes above seem pretty confident their weight loss has something to do with the “unprocessed” foreign food, but there are reasons to think it might be some other part of the foreign environment. And if we could study vacation weight loss in a systematic way, we might be able to narrow things down.
But this data is not very systematic, and may not be representative. In particular, if you go on vacation to Europe and you lose no weight, you probably don’t post about it on social media, and you probably don’t email us to tell us about the weight you didn’t lose.
We can correct for this by trying to collect data in a more systematic way. We can also try to come up with a design that lets us account for some alternative explanations (more on this in a bit).
We have a few ideas for study designs, but we’re not sure what kind of design would be best. We’ll describe general ideas for designs below, and please, let us know what you think.
If any of the designs seem good, we might run one of these studies at some point. But it’s somewhat of a shame for us to be running so many of these internet studies. No one should have to take our word for these things — it would be better if the work/knowledge/expertise were spread around the community.
So if you want to help run one of these studies, or if you want to take point and run it yourself, let us know and we can talk about collaborating. Or you can just take the design and run with it, we’re not your mom.
Walking
But first: Some people are skeptical that there is something special about vacation. The most common alternative they offer is some form of, “it’s all the extra walking”.
But there are some problems with this response. For starters, it assumes that all these people are walking more, but you don’t always walk more on vacation than you do at home. Some people have very active day jobs, and there’s nothing stopping you from sitting quietly in your hotel room for a week straight.
It ignores the other details people often mention in these stories, like that they also ate much more and ate “worse” than normal on vacation. From a simplistic CICO perspective, this extra food should balance out the extra walking.
And if extra walking on vacation were the cause of vacation weight loss, then extra walking ANYWHERE should cause weight loss. This would mean that slightly more mild cardio is a good treatment for obesity in general. But “slightly more mild cardio” doesn’t work as a treatment for obesity, so it’s clearly not the cause of vacation weight loss.
More importantly, many people who lost (or gained) weight on vacation make it clear that they don’t think it was the extra walking. Yes, they are not totally neutral parties, but no one is a totally neutral party, and they are the only ones with firsthand knowledge of their own cases.
There may be other reasons to doubt that extra walking is to blame. Friend-of-the-blog dynomight is skeptical that the walking explanation survives back-of-the-napkin math:
But all this debate is really for nothing. We can just check.
For starters, it’s easy enough to ask people to count steps. Their phones make estimates automatically. We can see if people are walking more on vacation and if how much extra walking they do is at all related to how much weight they lose.
We can compare Americans who go on vacation to Europe with Europeans who go to vacation in America. Maybe both groups walk more than average while they’re on vacation, but if the Americans lose weight in Europe and the Europeans gain weight in America, that would suggest that the extra walking has little to do with the weight change.
We can also compare between states. Different US states vary widely in their obesity rates, from 25% in Colorado to more than 40% in Kentucky. If it’s something about the environment, then people from Kentucky should lose weight when they vacation in Spain, but people from Colorado might not. Similarly, we can look at people taking vacations in different US states. Europeans might gain weight in Mississippi but might not gain weight in Massachusetts.
We could even do something ludicrous like: “take identical obese twins, send one twin on vacation to Italy and have the other twin stay in America and try to match their walking distance as closely as possible.”
With the right design, it should be relatively easy to tell if walking is to blame. It’s fine if somehow it turns out that extra walking is the real cause of vacation weight loss, but clearly this is somewhat controversial — we should do a study and see.
Design #1: Existing Cohort Study
First, we could use data that someone else has already collected.
Programs like the Fulbright Program and the Peace Corps send young people to different countries around the world. These programs are already doing a controlled study on the vacation weight change effect, they just don’t know it. For example, the Philippine-American Fulbright Commission takes young people from a fat country (America) and sends them to a lean country (the Philippines), and also takes young people from a lean country (the Philippines) and sends them to a fat country (America), in both cases for about a full year.
These programs may already be recording students’ weights before and after their trip abroad, perhaps for medical reasons. If so, all the data we need already exists out there in some database. If not, all that would be required would be convincing the program, or even just some of the students, to start recording their weights before and after the exchange.
This is pretty much the ideal experiment. It’s systematic — we will get information from everyone, whether or not they gain or lose weight. The sample size is huge. And the data may already exist.
The problem is that we don’t know if there’s any way for us to GET this data. We haven’t had any luck trying to get in touch with Fulbright or any of these other programs, and we doubt they would be willing to make this data public.
If anyone thinks they could put us in touch with anyone on the Fulbright Philippines Board of Directors, or anyone at a similar organization (the American armed forces might have similar data), we would love to do this analysis. But barring that, we’ll have to come up with ways to collect our own data.
Design #2: Retrospective Vacation Study
Second, we could collect vacation information that already exists.
People have gone on many vacations over the course of human history [citation needed]. So for this design, we would systematically collect all the existing vacation stories we can.
This could be as simple as setting up a google form and asking people to submit stories about their past vacations (or other international trips, like exchange programs or briefly being an expat). The form would include just a few simple questions — how long the trip was, how much they weighed before, how much they weighed after, where they went, etc. Then we could look at this data to see if there’s any support for the vacation effect.
You might want to limit this (and the designs we talk about below) to trips that are at least two weeks long, because one week probably wouldn’t be enough to get a clear signal.
The upside of this design is that you might be able to get a lot of data pretty quickly. But there are some downsides.
First off, people may not remember how much they lost, how long they were gone, or other details of the trip. In short, the measurements will be noisy.
Second, you can’t ask people to collect specific data, or ask them to collect it in specific ways, because the vacation has already happened. If people happen to have measurements of how much they walked on their vacation, that would be great, but most people won’t have collected that data.
Finally, there’s bound to be a selection effect. People who lost weight on vacation are probably more likely to respond than people who gained weight. This isn’t totally damning since we can still see things like 1) whether people lost more weight on longer vacations, 2) whether people lost more weight when they traveled to leaner countries, or 3) whether younger people lost more weight than older people. But it is a limitation.
So it would be easy to run this study, and maybe informative if we got enough responses, but there are some problems with this kind of design.
Design #3: Prospective Vacation Study
Third, we could collect vacation data as it’s being created.
People continue to go on vacations today; in fact, many people are planning vacations right now. So another option would be to ask people to sign up to report on any vacations they’re about to take, and provide us a little data about the before and after.
The design would be pretty simple: first a pre-vacation survey, where people tell us about the vacation they’re about to go on, where they’re coming from, where they’re going, how long it will be, and how much they weigh right now. We’d also ask them to give us some measure of their average activity, maybe their average steps per day over the last three months — something easy that should already be captured by their phone.
Then when they get back, there’d be a post-vacation survey, where people could tell us what their weight was after the vacation. We’d also ask them for their average steps per day while on vacation, so we could get a sense of if they are walking more or less than normal. We could also ask them to report subjectively on measures like whether they exercised more or less than usual, and whether they ate more or less than usual.
This would help keep selection effects somewhat in check. No one knows before a vacation whether they will gain or lose weight — they can’t decide whether or not to sign up for the study based on their weight change, because it hasn’t happened yet.
If we have a high attrition rate, with lots of people filling out the first part but not filling out the second, this could be a sign that people who gain weight don’t come back and report their weight gain. But if the attrition rate is low, that suggests there isn’t much post-hoc selection.
We can also do comparisons just within the people who respond. Maybe the attrition rate is high, but if all the Americans going to Europe report losing weight, and all the Europeans coming to America report gaining weight, we can be fairly sure that’s not just selection.
The downside is that this approach would be pretty slow. There are lots of vacations, but we have to wait for people to come to us one by one as they jet off on their various trips.
This design is also not very controlled. We don’t get to pick the vacations and we don’t get to pick the vacationers. It’s a convenience sample, and while it might still be revealing, there are limitations.
Design #4: Controlled Vacation Experiment
Fourth, we can go out of our way to create the data we want.
It’s within our power to make vacations happen under controlled conditions — recruit some people who are obese and who can spend a month or two abroad (perhaps they have a remote job), and send them to a lean country for a couple weeks to see what happens. As with the other designs, you could have them track their exercise and other confounders if you want.
This is a version of what we have previously called “Slime Mold Time Mold’s Excellent Adventure”.
We were originally concerned that even if things were perfect, weight loss would be pretty slow. Even if you sent people to Japan for a month, they might not lose weight quickly enough for you to reliably detect the change. We thought that you might have to send people abroad for closer to a year, just to pick up on the effect.
But the results of the Potato Diet have convinced us that under the right conditions, quite a lot of weight loss can happen in just a few weeks. So now we think that a vacation study of 2-4 weeks might actually be informative.
And you may not need very many people. It’s still not clear how many people lose weight on vacations, but if the effect is consistent enough, you could find strong evidence with just a few participants (yet another n of small opportunity). If five obese people go to Italy for a month and all of them drop 10 lbs while sitting around and eating spaghetti alla carbonara ad libitum, that would be strong evidence, even with the small sample size.
The ideal countries to send people to would probably be Vietnam (leanest country in the world, about 2% obese), Bangladesh, India, Nepal, Japan, or South Korea (all less than 5% obese). There are many relatively lean countries in South America and Europe, but even Albania is more than 20% obese, so you would probably get a weaker effect.
This wouldn’t be too expensive. It costs around $4,000-$5,000 to send someone to a lean country for a month, so you could likely send around 10 people to one of these countries for a month for less than $50,000. Not pocket change, true, but cheap in the grand scheme of scientific studies. Given that many people enjoy vacations [citation needed], you might even be able to get people to pay for half of their trip.
It might make sense to run one of the other studies first to see if they back up the anecdotes, before spending all this money. But the other study designs aren’t nearly as controlled. If you’d be interested in funding this study, or helping us arrange all the travel details (booking hotels and flights is not our strong suit), please let us know. Or you can ignore us, recruit 10 obese volunteers, and send them to Myanmar all on your own. Just let us know what you find out!
Similarly, anyone who is at all overweight could run this as an N=1 self-experiment, or get a few friends and run it as an N of small. You just need to be in a position to spend a few weeks in one of these lean regions.
The main drawback is that the small sample size would keep you from making interesting comparisons. Sending 10 obese Americans on a one-month trip to a small Asian country could make a strong case for the vacation effect, but it wouldn’t let us answer questions like the following: do young people lose weight faster on vacations than old people? Do other demographics (like ethnicity) have an impact? Do people lose weight faster in Peru (19% obese) than in Spain (27% obese)? Do people with BMI < 25 also lose weight on vacations?
This would be another reason to run multiple studies with different designs. Sending people on a planned vacation gives you a lot of experimental control with the limitation of a small sample size, while other designs can give you a large sample size at the cost of experimental control. Both could tell us a lot, and they’d be stronger together — triangulation is the name of the game.
Final Notes
As far as we know, no one has ever done a vacation study like this, so any study at all would be interesting. It doesn’t have to be perfect — we should start by getting our toes wet.
We’re talking about these designs like they’re all about obesity, but this approach doesn’t have to be used to study obesity. You could also use it to study, say, migraines. Lots of people think they are gluten or lactose intolerant — but are they still gluten or lactose intolerant if they’re eating bread or milk in, say, Italy? Might be interesting to find out.
As always, we are happy to help with designs and methodology. If you want to run one of the studies described above, please contact us!
We will probably keep voting open until the end of the day on August 31 in case you want a chance to go back and re-read your favorites. In case you’ve forgotten, the finalists are:
[This is one of the finalists in the SMTM Mysteries Contest, by a reader writing under the pseudonym Lee S. Pubb. We’ll be posting about one of these a week until we have gotten through all the finalists. At the end, we’ll ask you to vote for a favorite, so remember which ones you liked.]
Background
Element 53, iodine, is a mineral essential to human health largely because it is utilized in the production of thyroid hormones.[1] For much of recorded human history, societies the world over suffered from the iodine deficiency disorder known as goiter–swelling in the neck resulting from an enlarged thyroid gland. It was common in regions where topsoil was regularly eroded (e.g., by flooding), as topsoil is rich in iodine, normally leading to its uptake in food crops. It’s also abundant in foods derived from the ocean, as ocean water contains iodine.
While Chinese doctors are said to have prescribed the consumption of animals’ thyroid glands to treat goiter as early as the 7th century, it was only in the early 20th century that large-scale research was done on iodine supplementation. It was found that sufficient levels of iodine consumption eliminated goiter in the vast majority of cases, and we began iodizing salt soon after. (Which is why I had to give this lengthy background on goiter, instead of just saying “that giant lump in your friend’s neck”.) The FDA recommends that Americans consume 150 mcg of iodine a day, and expect that the vast majority of Americans will achieve this through salt consumption.
(Very likely the reason you don’t have a goiter.)
That said, while the level of iodine supplementation present in modern-day table salt is clearly sufficient to greatly reduce the incidence of goiter, that doesn’t necessarily mean that it’s the ideal level for overall thyroid health.[2] (The FDA’s upper limit is set nearly 8 times higher than the RDA, at 1,100 mcg, suggesting that considerably more than the recommended amount isn’t expected to be deleterious to human health.) The thyroid largely regulates metabolism, and you may be familiar with some mysteries raised about human metabolism over the past century.[3] There are also a number of diseases directly associated with thyroid function, and these are pretty common in the present day (especially among women), with around 20 million patients in the United States estimated to have some kind of thyroid disorder. Thyroid functioning is screened for by doctors via blood tests measuring thyroid hormone concentration. Common disorders include hypothyroidism, in which an abnormally low level of thyroid hormone is produced, and hyperthyroidism, in which too much is produced.
INTERESTING!:
The Blue Zones are regions around the world renowned for their high proportion of centenarians, people living to the age of 100 or later. These are Okinawa, Japan; Sardinia, Italy; Nicoya, Costa Rica; Ikaria, Greece, and Loma Linda, California. These zones are now getting the shit studied out of them by anthropologists, doctors, etc., so we can try and figure out why they live so long and hopefully use it to improve human healthspan elsewhere.
You will not be surprised, given the background I just provided, to find out that iodine appears to be related to the mystery of the blue zones.
In the study Association of endemic goitre and exceptional longevity in Sardinia: evidence from an ecological study, the authors note “The spatial analysis revealed that the goitre rate (p < 0.0001), the proportion of inhabitants involved in pastoralism (p = 0.016), the terrain inclination (p = 0.008), and the distance from the workplace as a proxy for physical activity (p = 0.023) were consistently associated with population longevity at an aggregated level in the 377 municipalities.” Which is to say, a higher goiter rate in a municipality was the measure they studied that was most clearly associated with greater longevity. This association had an extremely–to my mind, an almost outrageously–low p-value.
The study goes on to say that, “from a worldwide perspective, the finding of an epidemiological association between goitre prevalence and longevity does not seem to be limited to Sardinia, but partially shared also by other populations where long-lived subjects are numerous. Most of the Longevity Blue Zones in the past were niches of endemic goitre as well. In Costa Rica, where the Nicoya LBZ was identified in 2007 (Rosero-Bixby 2008), high prevalence of endemic goitre has been reported since the 1950s (Perez et al. 1956) possibly aggravated by a gross excess of calcium ingested with drinking water (Boyle et al. 1966). In another LBZ, Ikaria island, the iodine level in spring water is remarkably low (Pes and Poulain 2014).”
So great, restrict iodine and you’ll live forever, right? AGING SOLVED! WE DID IT, YOU’RE WELCOME, aging is just iodine damage.
…Except.
Another Blue Zone is Okinawa, where most of the population routinely consumes nutritionally large quantities of seaweed–one of the richest natural sources of iodine. How much iodine are they consuming? I don’t have great numbers on Okinawa itself, but mean consumption of iodine in Japan is estimated at around 1-3 milligrams a day–that’s 1,000-3,000 micrograms of iodine. Remember that the FDA puts the RDA of iodine at just 150 micrograms, and the UL at 1,100 micrograms! Apparently, many Japanese people are consuming considerably more than the “tolerable upper intake level” for iodine. Centenarianhood is apparently their reward. Japan has 86,000 centenarians (.06% of their population, the highest percentage in the world), and a life expectancy of 84.62 years, second only to Hong Kong (the population of which, as of 2011, had a high risk of iodine deficiency).
These studies, taken together, suggest that having a moderate iodine intake leads to early death. As the Buddha said, “seek extremes in all things and maybe you can live to be a hundred”.
It is difficult not to propose any explanations for this, but that is the task that has been assigned to me, so, there you have it. Low iodine is correlated with longevity. High iodine is correlated with longevity. The thyroid is truly the most mysterious organ.
[1] If that first sentence surprised you, you’re in good company; iodine is “the heaviest element commonly needed by living organisms.”
[2] Plus, there’s bound to be population variance–I, for one, use kosher salt in my cooking, because Alton Brown told me to, and it’s not iodized. I’m surely exposed to iodized salt via processed foods and restaurant meals, but I do consume those somewhat rarely, at least compared to the number of meals I prepare at home.
[3] Iodine was once used to treat flour, oxidizing it to allow gluten to more easily form bonds and create the texture all us non-celiacs love in bread and pizza. But around the same time we began to study iodine supplementation, we invented potassium bromate, and started brominating flour instead to achieve the same effect–potassium bromate produced a very white flour that rapidly oxidized. Iodine is a necessary mineral, though, and bromine isn’t. So there was likely a point in recent American history where iodine consumption in bread decreased. You’re also really not supposed to consume the potassium bromate, but the thought is that baking temperatures sublimate the bromine–ah fuck I guess you really *shouldn’t* eat raw cookie dough? fuck.
We agree with ExFatLoss that the mysteries of obesity fit seed oils pretty well. This is part of why when we investigated seed oils in Interlude E (as ExFatLoss mentions) we took them seriously as a hypothesis.
There is room for multiple theories at once. Disagreement and uncertainty at this stage is normal/healthy and it just shows that this is an area where we still don’t understand everything. We can entertain seed oils as a possible explanation, whereas things like “diet and exercise” are non-explanations. And it’s quite possible that many things contribute at once. If any single cause could potentially explain half of the obesity increase since 1970, or even just 25%, that’s still a big deal.
We also really like this passage, which we think bears reproducing:
By the way, I find the way scientists talk about the “cafeteria diet” to be mystical to the point of being comical. Really, you couldn’t recreate lab chow that made the rats as obese as the human cafeteria diet? Weird, the cafeteria managed to do it.
They somehow imbue “human junk food” with a mystical, obesogenic property that cannot be explained by the sum of its parts, almost like the immortal soul. Please. You could start by differentiating fatty acids.
But despite the good fit with the first 8 mysteries, we’re still not convinced that the seed oil hypothesis is a fit for the evidence overall. In this post we try to explain why.
7 Out of 8 Ain’t Bad, But This is Out of 10
If we wrote A Chemical Hunger today, Part I would include 10 mysteries, not 8. And one problem with the seed oil hypothesis is that while it provides a pretty good fit to 7 of the 8 original mysteries, it doesn’t match the two new ones.
ExFatLoss pays special attention to the map of obesity in the US (below) which shows a relationship between obesity and elevation. He suggests that could be explained if lower levels of oxygen found at higher altitudes compensate for the damage seed oils do to your metabolism (presuming seed oils do damage to your metabolism, of course), but he admits that this is “not exactly a slam dunk”.
We take some issue with this interpretation. The mountain ranges definitely stand out as less obese, but the distribution of obesity is not actually a great match with altitude. Low-lying areas of Florida, Massachusetts, and California, for example, also have very low rates of obesity.
This is why we prefer an interpretation based on the length of the watershed, which we think is a proxy for the levels of some contaminant in the water. Longer watersheds have more of a chance to pick up the contaminant, and to pick up more of it, assuming there is any contamination in their higher reaches.
In this model, these regions of Florida, Massachusetts, and California are not obese because while they are low-elevation, they are also at the end of very short watersheds, which probably don’t contain much contamination. In some cases (like Boston and San Francisco), the water is piped from pristine wilderness reservoirs deep in the mountains.
But that map is one thing. A theory of obesity should also be able to explain this map:
Mystery 9 is that there is a lot of international variation — some countries are much more obese than others. The most obese nations on Earth are all tiny Pacific Islands. The Middle East is by far the most obese region in the world. The next runners-up are the USA, Canada, and Australia. And these are just the highlights — ideally a theory would be able to explain the whole pattern that appears on the map.
Seed oils would seem to have a hard time accounting for this pattern, though of course we’re open to explanations. But if Kuwait loves seed oils in a way that Pakistan doesn’t, well, we haven’t heard about it.
In comparison, this mystery is one reason to prefer the lithium hypothesis. You see, fossil fuels are often accompanied by horrible brines, brackish and sometimes radioactive water that is brought to the surface as the oil and gas are extracted. Many of these brines, especially in arid regions, are extremely high in lithium. Despite industry promises, huge volumes of brine are regularly spilled or otherwise improperly disposed of (or sometimes just intentionally used to irrigate crops), and this is a huge source of lithium contamination. We cover this all in more detail in Interlude G, if you want to read more.
So under the lithium hypothesis, the US, Middle East, Canada, and Australia are all unusually obese because they are all major oil producers, producing from oilfields in arid climates, which have brines high in lithium, leading to major contamination. Major oil producers like Russia and Norway are not very obese because their oilfields are not in arid climates, so their brines probably are not high in lithium, leading to much less contamination (and/or their oilfields are remote enough, or offshore enough, that the contamination doesn’t reach their populations). If this is the case, the map of obesity should roughly line up with the map of oil production, which it does:
The remainder of the international pattern can be explained by other fossil-fuel mining (coal, for example, is also often high in lithium), other forms of intensive mining that might also stir up lithium, countries importing most of their food from one of these high-lithium countries (remember that the US is a major food exporter), and possibly some other sources like desalination.
Mystery 10 is the variation of obesity rates between professions. Some professions are much more obese than others. Truck drivers, mechanics, firefighters, and transportation workers, for example, tend to be especially obese. Meanwhile teachers, lawyers, engineers, and “health diagnosing occupations”, as four more examples, are much less obese than average.
The two main sources for these patterns are this survey of more than 37,000 workers from the Washington State Department of Labor and Industries and this 2004-2011 NIH survey of US workers, if you want to take a closer look at the details. People often write this off as being about race or class, but the NIH analysis finds the same general patterns within each race, and many professions at similar class levels have very different rates of obesity.
Again, it seems difficult for the seed oil hypothesis to explain this pattern. Stereotypes about truck drivers aside, do we really think that mechanics consume that much more canola oil than lawyers?
And again, we see this as another reason to prefer the lithium hypothesis. There are some exceptions, but overall the ranking of professions by obesity rates looks like a pretty good proxy for “amount of exposure to vehicles and heavy machinery”, and lithium grease is the most common lubricating grease used in vehicles and heavy machinery.
We should be clear that this doesn’t leave the seed oil hypothesis totally dead in the water. A theory doesn’t have to fit all 10 mysteries to be correct. For example, the obesity epidemic could have multiple causes — maybe seed oils caused much of the baseline increase and something else is responsible for the variation internationally and between professions. It’s still possible that this is massively multicausal.
And as we’ve previously written about, reality is very weird and full of bizarre exceptions. Maybe truck drivers and other people who work around vehicles are more obese because ExFatLoss is right and oxygen levels make a difference, and people who are breathing exhaust all day get less oxygen. There’s always a way to add epicycles to save a theory, and sometimes those epicycles are actually correct.
But that said, the fact that seed oils don’t fit these two mysteries is still a strike against the hypothesis.
It Doesn’t Fit Case Studies
Our second gripe is that the seed oil hypothesis doesn’t fit a number of individual and population-level case studies.
N=1: Krinn & M
The seed oil hypothesis doesn’t fit the self-experiment case study of Krinn, who has lost weight taking large doses of potassium. Krinn hasn’t changed her seed oil intake.
The seed oil hypothesis doesn’t fit the successful half-tato dieters, the best example of which comes from our reader M. He ate more potatoes, but he was still consuming seed oils like normal, and he talks about this pretty explicitly in his report:
I tossed my diced potatoes in olive oil before air frying, and more generally used olive oil, duck fat and avocado oil to cook other potato preparations. I probably used 1-2 “glugs” of oil per 1-1.5lb potatoes across these preparations (“lightly greased”, call it). And of course in my non-potato meals, I consumed whatever oil – and other ingredients – restaurants would be using to cook their food. Given my diet was substantially made up of non-potato meals that I actively tried to keep “as before”, I think it is a safe bet that there wasn’t any particular type of food (diary, oil, red meat, etc.) I stopped consuming, or even materially reduced my consumption in, as a result of potatoes by default (beyond the generic ~1/3 reduction from swapping out a third of my meals to be mostly potato).
Pima
The seed oil hypothesis really doesn’t fit the case of the Pima (who we wrote about here and here). This group had a remarkably high rate of obesity in the 1960s, and we have no reason to believe they were exposed to any more seed oils than other Americans. However, we do know that their water contained 50-100x more lithium than the median dose in American municipal water sources at the time. At least one of their food plants was found to concentrate lithium in its berries, so they probably got an even higher dose from their food.
Frankly this case study is hard to account for by anything but the lithium hypothesis, which is part of why we so strongly prefer that hypothesis. Sievers & Cannon, writing in 1973, even say, “It is tempting to postulate that the lithium intake of Pimas may relate … to relative physical inactivity and high rates of obesity and diabetes mellitus.”
Fattest and Leanest Places in America
The seed oil hypothesis also doesn’t account for the pattern of fattest and leanest places in America. We can’t see any reason why Charleston, WV would be eating more seed oils than Bridgeport, CT. But we can see a reason why Charleston, WV would be exposed to unusual levels of lithium — because it’s a famous site for the prospecting of salt brines, including brines that as far as we can tell are unusually high in lithium.
We are still kind of freaked out by how well the pattern of fattest and leanest places fits the lithium hypothesis. The #1 most obese community in America is downstream from three coal power plants that are well-documented to be leaking lithium into the groundwater. The #2 most obese community sits on an aquifer that is unusually high in lithium. The #5 most obese community is in an area with many oilfield brine spills. The #6 most obese community (Charleston, WV) is, as we mentioned, a famous brine extraction site. The #7 most obese community is the site of a lithium plant that recently exploded. So is the #10 most obese community — it’s home to another lithium plant that, yes, also exploded.
Read our full report in Interlude I for all the gory details, but when you look at the most and least obese communities in America, you find a pattern that looks like fossil fuel activity and industrial lithium accidents. You don’t see a pattern that looks like canola oil taste testing.
It Would be Easy to Test
No one believes that carrot juice can cure cancer. If it did, anyone could give some cancer patients a bit of carrot juice, effortlessly cure them, use those case studies to raise money for a clinical trial, then pass go and collect their $200 billion and Nobel prize in Physiology or Medicine. We can dismiss out of hand anyone who says that carrot juice does have such wonderful qualities — if it did, they would be out there demonstrating those qualities, not arguing with us.
Our point is, the easier it would be to collect evidence that a theory is correct, the less seriously we should take the theory in the absence of that evidence. As far as we can tell, if the seed oil hypothesis were correct, it would be easy to get evidence for it. So until seed oil theorists can present that evidence, we’re not inclined to take the seed oil hypothesis very seriously.
We’re not saying this as a dig — we’re saying this to encourage seed oil believers to go out there and collect that sweet, strong evidence, if they think they can get it.
The easiest test that comes to mind would be a variation of the potato diet. We know that people lose weight on the potato diet, and the seed oil theorists presumably think that this is because the potato diet is also a seed-oil-elimination diet. People on the potato diet do take some cheat days, but they’re surely consuming a lot less seed oil than usual, maybe close to zero.
It would be easy to run a variation of the potato diet where half the participants are randomly assigned to eat their potatoes with butter, and the other half are randomly assigned to eat their potatoes with sunflower oil. (Or substitute these for whatever fats the seed oil theorists think are best and worst.) If the seed oil theory is correct, then the participants eating potatoes + butter should lose weight much faster than the participants eating potatoes + sunflower oil. If the seed oil theory is wrong, there should be basically no difference.
Ideally you would go on to test more than just two fats — butter and sunflower oil differ in more ways than just how much linoleic acid they contain! But starting with two would be fine, and it would give us an idea of whether or not there’s anything worth looking into here.
If the potato diet offends you for some reason, you could do the same thing with any other elimination diet, or any other weight-loss protocol that we know to be effective. For example, ExFatLoss could add various fats to his ex150 protocol, the same amount of a new fat every week, and see if some fats stall his weight loss and other fats don’t.
Again, WE don’t think that this would shake out in favor of the seed oil hypothesis, which is why we don’t want to run the study ourselves. But if you feel differently, you should try to prove us wrong.
We want to emphasize that even if one of these studies did find a difference between seed oils and other fats, that wouldn’t be evidence for a specific mechanism — it wouldn’t necessarily point to linoleic acid. For example, it could be pesticides; some of these crops like grapeseed and soybeans and sunflowers might be sprayed with the same pesticide, and that might be the cause of the difference. Hell, it could still be lithium. There’s some evidence that sunflowers concentrate lithium (though “the lowest concentrations [occur] in the seeds”), so it’s possible that there’s more lithium in sunflower oil than in butter. If we need to, we can test for these things.
So this kind of study should be able to point to seed oils pretty easily — and if you’re a seed oil believer, you should try to make one of these studies happen. But if it does point to seed oils, that still doesn’t provide strong evidence for why seed oils might cause obesity. That would remain to be seen, though we would certainly be closer to an answer.
To recap: lots of things make the seed oil hypothesis an attractive explanation, and we’re still open to the idea. But right now it doesn’t seem very consistent with the evidence, and changing our mind would require addressing some of these apparent contradictions.
[This is one of the finalists in the SMTM Mysteries Contest, by a reader writing under the pseudonym Cennfaeladh. We’ll be posting about one of these a week until we have gotten through all the finalists. At the end, we’ll ask you to vote for a favorite, so remember which ones you liked.]
[UPDATE: Now that the contest is over, we reveal that the author of this post is niplav, who blogs at niplav.site]
I investigate whether the attention span of individual humans has been falling over the last two decades (prompted by curiosity about whether the introduction of the internet may be harmful to cognitive performance). I find little direct work on the topic, despite its wide appeal. Reviewing related research indicates that individual attention spans might indeed have been declining65%.
In what might be just the age-old regular ephebiphobia, claims have been raised that individual attention spans have been declining—not just among adolescents, but among the general population. If so, this would be quite worrying: Much of the economy in industrialized societies is comprised of knowledge work, and knowledge work depends on attention to the task at hand: switching between tasks too often might prevent progress on complicated and difficult problems.
I became interested in the topic after seeing several claims that e.g. Generation Z allegedly has lower attention spans, observing myself and how I struggled to get any work done when connected to the internet, and hearing reports from others online and in person having the same problem.
The exact question being asked is:
“Have the attention spans of individuals on neutral tasks (that is, tasks that are not specifically intended to be stimulating) declined from 2000 to the present?”
(One might also formulate it as “Is there an equivalent of the “Reversed Flynn Effect” for attention span?”) I am not particularly wedded to the specific timeframe, though the worries mentioned above assert that this has become most stark during the last decade or so, attributing the change to widespread social media/smartphone/internet usage. Data from before 2000 or just the aughts would be less interesting. The near-global COVID-19 lockdows could provide an especially enlightening natural experiment: Did social media usage increase (my guess: yes90%), and if so, did attention spans decrease at the same time (or with a lag) (my guess: also yes75%), but I don’t think anyone has the data on that and wants to share it.
Ideally want to have experiments from ~2000 up to 2019: close enough to the present to see whether there is a downward trend (a bit more than a decade after the introduction of the iPhone in 2007), but before the COVID-19 pandemic which might be a huge confounder, or just have accelerated existing trends (which we can probably check in another 2 years).
I am mostly interested in the attention span of individual humans and not groups: Lorenz-Spreen et al. 2019 investigate the development of a construct they call “collective attention” (and indeed find a decline), but that seems less economically relevant than individual attention span. I am also far less interested in self-perception of attention span, give me data from a proper power- or speed-test, cowards!
So the question I am asking is not any of the following:
“Does more social media/internet usage cause decreased attention spans?”
“Does more social media/internet usage correlate with decreased attention spans?”
“Does more social media/internet usage correlate with people reporting having shorter attention spans?”
“Did collective attention spans decrease?”
“Are people on average spending less time on webpages than they used to?”
How Is Attention Span Defined?
Attention is generally divided into three distinct categories: sustained attention, which is the consistent focus on a specific task or piece of information over time (Wikipedia states that the span for sustained attention has a leprechaun figure of 10 minutes floating around, elaborated on in Wilson & Korn 2007); selective attention, which is the ability to resist distractions while focusing on important information while performing on a task (the thing trained during mindfulness meditation); and alternating or divided attention, also known as the ability to multitask.
When asking the question “have attention spans been declining”, we’d ideally want the same test measuring all those three aspects of attention (and not just asking people about their perception via surveys), performed annually on large random samples of humans over decades, ideally with additional information such as age, sex, intelligence (or alternatively educational attainment), occupation etc. I’m personally most interested in the development of sustained attention, and less so in the development of selective attention. But I have not been able to find such research, and in fact there is apparently no agreed upon test for measuring attention span in the first place:
She studies attention in drivers and witnesses to crime and says the idea of an “average attention span” is pretty meaningless. “It’s very much task-dependent. How much attention we apply to a task will vary depending on what the task demand is.”
Perhaps I should soften my tone, as this perspective appears elsewhere:
[…] Gould suggests the metaphor of a dense bush whose branches are periodically pruned by nature. This allows for parallel evolutionary sequences, some of which are adaptive and others not — at any moment in time only the tips of aseledaptive branches are in evidence, the pruned ones cannot be seen. Thus rather than being direct descendants of primitive hominids, for example, huankind would have evolved along a separate but parallel line from other primates.
Might the ontogeny of selective attention recapitulate this theme? That is, rather than selective attention comprising a single construct with a fixed ontogenic plan, might it be better conceptualized as a multidimensional construct with separat, parallel developmental trajectories for different components. To carry the analogy still further, might the specific developmental progression for a particular component of selective attention be determined by the adaptive fit of that component with the individual’s ‘environmental press’? Although such a conjecture rekindles the tened of ontogeny recapitulates phylogney long since abandoned in physiological development (e.g., Dixon and Lerner, 1985), we suggest that it may nonetheless provide an overarching framework within which to cast life-span research and theory on the development of selective attention.
One of my hopes was that there is a canonical and well-established (and therefore, ah, tested) test for attention span (or just attention) à la the IQ test for g: If so, I would be able to laboriously go through the literature on attention, extract the individual measurements (and maybe even acquire some datasets) and perform a meta-analysis.
Continuous Performance Tests
For measuring sustained and selective attention, I found the family of continuous performance tests, including the Visual and Auditory CPT (IVA-2), the Test of Variables of Attention (T.O.V.A.), Conners’ CPT-III, the gradCPT and the QbTest, some of which are described here. These tests usually contain two parts: a part with low stimulation and rare changes of stimuli, which tests for lack of attention, and a part with high stimulation and numerous changes of stimuli, which tests for impulsivity/self control.
Correct detection: This indicates the number of times the client responded to the target stimulus. Higher rates of correct detections indicate better attentional capacity.
Reaction times: This measures the amount of time between the presentation of the stimulus and the client’s response.
Omission errors: This indicates the number of times the target was presented, but the client did not respond/click the mouse. High omission rates indicate that the subject is either not paying attention (distractibility) to stimuli or has a sluggish response.
Commission errors: This score indicates the number of times the client responded but no target was presented. A fast reaction time and high commission error rate points to difficulties with impulsivity. A slow reaction time with high commission and omission errors, indicates inattention in general.
I’m currently unsure about two crucial points:
How much does any CPT measure the concept we naively call attention span? The papers I’ve read don’t refer to attention span per se, but a general capability of sustained and selective attention.
Are there any time-series analyses or longitudinal studies using a CPT, or alternatively meta-analyses using data collected from existing studies? I have not been able to find any.
Other Heterogenous Metrics
I also attempted to find a survey or review paper on attention span, but was unsuccessful in my quest, so I fell back to collecting metrics for attention span from different papers:
Three online tests (probably devised by the authors (?), since no source is given) (n≈2000 Canadians). Very little information about the exact nature of the tests.
Sustained attention span: “Counting the number of times responds correctly identified an X occurring after an A.”
Selective attention span: “Counting the number of times respondents correctly identified a change in the orientation of the rectangles”
Alternating attention span: “Calculating the difference in the time lapsed to perform a series of consecutive number or letter classification, compared to a mixture of number and letter classifications.”
Neurological research: The same games/tests as above with the participants being measured with an EEG (“Results were reported as ACE (Attention, Connectivity, Encoding) scores, as well as the number of attention bursts”) (n=112 Canadians)
Questionnaire developed by the authors based on Conners 2004 (reliability: α=0.786)
Wilson & Korn 2007 report several different measures of attention span during lectures: the amount of notes taken over time, observation of the students by an author of one study or two independent observers in another study, retention of material after the lecture, self-report in 5-minute intervals during the lecture, and heart rate. They also note that “Researchers use behaviors such as fidgeting, doodling, yawning, and looking around as indicators of inattentiveness (e.g., Frost, 1965; Johnstone & Percival, 1976).”
Plude et al. 1994 review how selective attention develops during a human life. For measuring attention, they mainly focus on studies using reaction time as a metric—the speed at which an action occurs as a result of a changing stimulus: eye movement patterns of infants, simple tests such as pressing a button on a changing (often visual) stimulus, the influence of irrelevant visual stimuli at the periphery on a task performed at the centre of the visual field, judging similarity of stimuli at various distances in the visual field, responding to a target stimulus surrounded by interfering distractor stimuli, and determining whether a visual target item is present or absent. They also mention skin conductance (measuring arousal).
They also mention studies investigating the time required for attentional switching in acoustic contexts: “Pearson and Lane (1991a) studied the time course of the attention-shifting process between lists and also found large age-related improvements between 8 and 11 years. Whereas 8-year-olds required more than 3.5 s to completely switch from monitoring one list to another, 11-year-olds and adults appeared to complete the switch in less than 2.5 seconds.”
This is not an adequate metric, I believe: It would also decline if people would become better at prioritising on which websites are more worthy of their attention.
Time that specific pieces of information (hashtags/n-grams/Reddit submissions &c) were popular
As it stands, I think there’s a decent chance60% that one or several tests from the CPT family can be used as tests for attention span without much of a problem.
I don’t think a separate dedicated test for attention span exists45%: The set of listed measures I found (apart from the CPT) appears to be too heterogenous, idiosyncratic, mostly not quantitative enough and measuring slightly different things to be robustly useful for a meta-analysis.
What Are the Existing Investigations?
A lack of long-term studies means we can’t tell whether attention spans have actually declined.
Do younger people perform worse on the sustained attention span test?, Yes (31% high sustained attention for group aged 18-34, 34% for group aged 35-54, and 35% group aged 55+) (the methodology is wholly unclear here, though: how do we determine the group that has “high sustained attention span”? Did they perform any statisitical tests? If yes, which?).
Do people who report more technology usage (web browsing/multi-screen usage while online/social media usage/tech adoption) perform worse on the sustained attention span test?, Yes. Light:medium:heavy usage for web browsing has 39%:33%:27% users with high sustained attention span, 36%:33%:27% for light:medium:heavy multi-screen usage, 36%:29%:23% for light:medium:heavy social media usage and 35%:31%:25% for light:medium:heavy tech adoption (though these numbers are basically not elaborated on).
Selective attention:
Do younger people perform worse on the selective attention span test? No (34% high selective attention for group aged 18-34, 30% for group aged 35-54, and 35% group aged 55+).
Do people with high selective attention use fewer devices at the same time? Yes (details p. 31).
Alternating attention:
Do younger people perform worse on the alternating attention span test? No (36% high selective attention for group aged 18-34, 28% for group aged 35-54, and 36% group aged 55+).
Do people who report more technology usage (tech adoption/web browsing/multi-screen usage while online) perform worse on the alternating attention span test? No, they seem to perform better: Light:medium:heavy tech adoption corresponds to 31%:39%:40% having high alternating attention spans, light:medium:heavy web browsing to 29%:34%:37% and multi-screening while online to 27%:32%:37%.
Do people who use social media more have higher Attention/Connection/Encoding scores on EEG measurements?, Not quite: “Moderate users of social media are better at multi-tasking than lower users. But, when crossing into the top quartile of social media usage, scores plummet.”
This is a marketing statement wearing the skinsuit of a previously great paper, it would be awesome if they released their exact methodology (tests performed, data collected, exact calculations & code written). I can smell that they actually put effort into the research: Creating an actual test instead of just asking respondents about their attention spans, doing EEG measurements of over 100 people, for 3 different types of attention…come on! Just put out there what you did!
Is self-reported attention span related to the number of social media accounts?, No, not statistically significant (F(2, 206)=0.1223, p>0.05) (via a one-way ANOVA)
Is self-reported attention span related to whether a respondent mainly uses a mobile phone or a computer?, No, not statistically significant (P(2,713)=0.923, p>0.05) (via a one-way ANOVA)
I do not trust this paper: Calling (what I think is) Generation Z “Generation D” (without source for the term), being clearly written in Word, and confusing grammar (I think the authors are all Americans, so no excuse here):
Users that are older such as late adolescents and emerging adults average approximately 30-minutes daily for just Facebook that does not calculate the time spent on all social media networks
Bakardjieva and Gaden (2012) examined the field of social interaction in general to the everyday chatter of unstructured and spontaneous interactions among individuals to highly structured and regulated interaction consisting of the military or the stock exchange.
Question answered: How much time do people spend on a website, on average?, “if you look at the trend for mobile browsing between the years 2017 and 2019 you would see that there is a drop of about 11 seconds in the average time spent on a website.” and “The data suggests that the average amount of time spent on websites before navigating away for all devices has gone down by 49 seconds which is a pretty huge reduction all things considered.”
The data is from the right timeframe (up to but not including 2020), but the linked SimilarWeb report is behind a paywall, so I can’t confirm the numbers. Furthermore, the time spent on websites is a weak proxy: Perhaps people simply have become better at prioritising information sources.
How long does any particular hashtag stay in the group of the top 50 most used hashtags? Specifically, how has that number developed from 2013 to 2016?, “in 2013 a hashtag stayed within the top 50 for 17.5 hours on average, a number which gradually decreases to 11.9 hours in 2016”, and “The average maximum popularity ⟨L(tpeak)⟩
on one day tpeak stays relatively constant, while the average gradients ⟨ΔL⟩
in positive and negative direction become steeper over the years.”
Do things become more popular faster over time? That is, when e.g. a movie is gaining popularity, did it take longer to become popular in 1985 than it did in 2018?, Broadly yes (the trends holds for popularity of hashtags in tweets (2013-2016)/n-grams in books (1900-2004)/number of theaters that movies were screened in (1985-2018)/topics for search queries on Google (2010-2017)/Reddit comments on posts (2010-2015)/citations of publications (1990-2015)/daily traffic for Wikipedia articles (2012-2017)). Again the length of the time at the peak mostly didn’t change (except in the case of Wikipedia articles, where the time at the peak shrunk)
While it investigates a question different from the one I have, this paper seems good and trustworthy to me, while supporting a suspicion I’ve had (observing that the lifecycle of e.g. memes has apparently sped up significantly). I’d be interested in seeing whether the same process holds for internet communities I’m part of (for example on votes LessWrong and the EA Forum or forecasts on Metaculus).
Mark 2023 is a recent book about attention spans, which I was excited to read and find the important studies I’d missed. Unfortunately, it is quite thin on talking about the development of attention span over time. It states that
My own research, as well as those of others, has shown that over the last fifteen years, our attention spans have declined in duration when we use our devices. Our attention spans while on our computers and smartphones have become short—crazily short—as we now spend about forty-seven seconds on any screen on average.
—Gloria Mark, “Attention Span” p. 13/14, 2023
which is not quite strong enough a measurement for me.
In 2004, in our earliest study, we found that people averaged about one hundred fifty seconds (two and a half minutes) on a computer screen before switching their attention to another screen; in 2012, the average went down to seventy-five seconds before switching. In later years, from 2016 to 2021, the average amount of time on any screen before switching was found to be relatively consistent between forty-four and fifty seconds. Others replicated our results, also with computer logging. seconds. Others replicated our results, also with computer logging. André Meyer and colleagues at Microsoft Research found the average attention span of twenty software developers over eleven workdays to be fifty seconds.⁹ For her dissertation, my student Fatema Akbar found the average attention span of fifty office workers in various jobs over a period of three to four weeks to be a mere forty-four seconds.¹⁰ In other words, in the last several years, every day and all day in the workplace, people switch their attention on computer screens about every forty-seven seconds on average. In fact, in 2016 we found the median (i.e., midpoint) for length of attention duration to be forty seconds.¹¹ This means that half the observations of attention length on any screen were shorter than forty seconds.
—Gloria Mark, “Attention Span” p. 74/75, 2023
She doesn’t mention the hypothesis that this could be the symptom of a higher ability to prioritize tasks, although she is adamant that multi-tasking is bad.
Furthermore, this behavior displays only a decrease in the propensity of attention, but not necessarily one of capacity: Perhaps people could concentrate more, if they wanted to/were incentivized to, but they don’t, because there is no strong intent to or reward for doing so. Admittedly, this is less of an argument in the workplace where these studies were conducted, but perhaps people just care not as much about their jobs (or so I’ve heard).
when email was cut off, people’s attention spans were significantly longer while working on their computers—in other words, they switched their attention less frequently.
—Gloria Mark, “Attention Span” p. 97, 2023
She gives some useful statistics about time spent on screens:
Nielsen reports that Americans spend on average five hours and thirty minutes daily of screen time on their computers, tablets and phones8. […] But what is really astonishing is that when we add in the time watching other media like TV and films to this, then we see that our attention is fixated on some form of screen, in some type of mediated environment, nearly ten hours a day8.
—Gloria Mark, “Attention Span” p. 180, 2023
She connects attention span to shot-length in movies:
The type of motion within shots has been changing. According to film scholar James Cutting and his colleagues at Cornell, shots containing the onset of motion (like a standing person who then runs) have increased because filmmakers believe that it will better attract viewers’ attention. […] The average film shot length in 1930 was twelve seconds, but then began to shorten, reaching an average of less than four seconds after the year 2010, as measured by James Cutting and colleagues.12 Interestingly, the shot length for film sequels also decreased. For example, the shot length of the first Iron Man film averaged about 3.7 seconds; for Iron Man 2, 3.0 seconds; and for Iron Man 3, about 2.4 seconds.13
—Gloria Mark, “Attention Span” p. 180/181, 2023
Like in TV and film, shot lengths in television commercials also shortened over time. The average shot length of commercials in 1978 was 3.8 seconds, dropping down to an average of 2.3 seconds in 1991. […] It’s not just the shot lengths, though, that are short—the overall length of advertisements on TV has also decreased. The majority of ads started out as sixty seconds in length in the 1950s,26 but that length comprised only 5 percent of ads shown in 2017. In the 1980s, advertisers started experimenting with showing fifteen-second ads instead of thirty-second ads. They discovered that fifteen seconds was even more persuasive than thirty seconds, especially when the ads used elements expressing cuteness and humor.27 In 2014, 61 percent of ads were thirty seconds in length, but three years later, that percentage decreased to 49 percent.28
—Gloria Mark, “Attention Span” p. 189, 2023
Do People Believe Attention Spans Have Declined?
Half of the public feel their attention span is shorter than it used to be, compared with around a quarter (23%) who believe they are just attentive [sic] as they’ve always been.
Again, the feeling of is not just reported by the young — it’s also the dominant feeling among the middle aged too, with 56% of 35- to 54-year-olds thinking their attention spans have worsened.
Even more widespread is the belief that young people’s attention spans in particular are worse than they were in the past—two-thirds of people think this is the case (66%).
Perhaps unsurprisingly, this belief is most common among the oldest age group surveyed, of those aged 55 or over — however, young people themselves also feel this way, with a majority of 18- 34-year-olds holding this view.
Note that selective attention mostly improves with age, so the older age-groups might be comparing themselves now to the younger age groups now (as opposed to remembering back at their own attention spans).
The absence of long-term research means it remains unknown whether technology has caused a deterioration in the country’s ability to concentrate — but comparisons with survey data from previous decades indicate that, on some measures the public feel more pressured than they did in the past.
In response to the questions (n=2093 UK adults aged 18+ in 2021):
“To what extent do you agree or disagree with the following statement? The pace of life is too much for me these days” (1983: 30% agree, 2021: 41% agree)
“To what extent do you agree or disagree with the following statement? I wish I could slow down the pace of my life” (1997: 47% agree, 1999: 51% agree, 2008: 45% agree, 2021: 54% agree)
What About Rates of ADHD?
Data from the CDC shows a clear increase in the percentage of children with a parent-reported ADHD diagnosis:
There has been a similar increase in the diagnosis of ADHD among adults, “from 0.43 to 0.96 percent” between 2007 and 2016.
However, this does not necessarily mean that the rate of ADHD has increased, if e.g. awareness of ADHD has increased and therefore leads to more diagnoses.
What Could A Study Look Like?
Compared to other feats that psychology is accomplishing, finding out whether individual attention spans are declining appears to be of medium difficulty, so I’ll try to outline how this could be accomplished in three different ways:
Develop a good instrument for measuring attention span (optionally just use a continuous performance test). Once one has a suitable instrument for measuring attention span, one can every year (or every second year) for a couple of years pick a random sample from the population (not of the same set of people, though, since attention span increases with age), e.g. via the internet if the test can be done online. One could then apply a linear trend estimation or a fancier statistical technique I don’t know to find out whether attention spans have declined between the measurements.
This could be done relatively cheaply: Let’s say we collect 50 datapoints a year, from Mechanical Turk workers at $10/hr. A conservative estimate is that the test takes ~30 minutes to complete, so for three years the cost of the data would be 50⋅3⋅10$/h⋅0.5h=$750. It looks like there areopen-source implementationsof the test available (Conners’ CPT 3 costs $1.5k), so the additional cost is for the researcher setting up the test and recruiting the participants, which could take ~30 hours, and another ~30 hours for analysing the data. So the total cost of the experiment would be, at an hourly wage of $15 for the researcher (come on, we can let a grad student do it), $750+60h⋅15$/h=$1650
. Fudging upwards by taking the planning fallacy into account gives $2k for the experiment.
Use the control groups from studies testing the effect of interventions on attention as data and then perform a meta-analysis. A lot of studies use some variant of the CPT, I started collecting such studies in Appendix B.
Conclusion
Given the amount of interest the question about shrinking attention spans has received, I was surprised to not find a knockdown study of the type I was looking for, and instead many different investigations that were either not quite answering the question I was asking or too shoddy (or murky) to be trusted. It seems likely to me that individual attention spans have declined (I’d give it ~70%), but I wouldn’t be surprised if the decline was relatively small, noisy & dependent on specific tests.
So—why hasn’t anyone investigated this question to satisfaction yet? After all, it doesn’t seem to me to be extremely difficult to do (compared to other things science has accomplished), there is pretty clearly a lot of media attention on the question (so much so that a likely incorrect number proliferates far & wide), it appears economically and strategically relevant to me (especially sustained attention is probably an important factor in knowledge work, I’d guess?) and it slots more or less into cognitive psychology.
I’m not sure why this hasn’t happened yet (and consider this text evidence for a partial violation of Cowen’s 2nd law—although, to be fair, the law doesn’t specify there needs to be a good literature on everything…). The reasons I can think of is that one would need to first develop a good test for determining attention span, which is some work in itself (or use the CPT); be relatively patient (since the test would need to be re-run at least twice with a >1 year pause, for which the best grant structure might not exist); there are many partial investigations into the topic, making it appear like it’s solved; and perhaps there just aren’t enough cognitive psychologists around to investigate all the interesting questions that come up.
So I want to end with a call to action: If you have the capacity to study this problem, there is room for improvement in the existing literature! Attention spans could be important, it’s probably not hard to measure them, and many people claim that they’re declining, but are way too confident about it given the state of the evidence. False numbers are widely circulated, meaning that correct numbers might be cited even more widely. And it’s probably not even (that) hard!
Consider your incentives :-).
Appendix A: Claims That Attention Spans Have Been Declining
Most of these are either unsourced or cite Gausby 2015 fallaciously (which Bradbury 2016 conjectures to be the number of seconds spent on websites on average).
Today, individuals are constantly on an information overload from both the quantity of information available and the speed of which information gets into the hands of individuals through advertising and multimedia. Attention deficits tend to be increasing as it is challenging to attract individuals and hold their attention long enough for people to read or watch messages such as work memos, advertisements, etc.
Big data plays an important role in the development of microlearning. In the age of big data, human’s attention span is decreasing. As per Hebert (1971), “what information consumes is rather obvious: it consumes the attention of its recipients. Hence a wealth of information creates a poverty of attention and a need to allocate that attention efficiently among the overabundance of information sources that might consume it” (p. 41). An example of short attention span in the age of big data can be found in the music industry, as per (Gauvin, 2017), the average time that passed before the audience would hear the vocals on any radio song was 23 s, today the average intro is just 5 s long. Wertz (2017) also suggested that 40% of users are likely to abandon a website if it does not load within three seconds or less. Furthermore, a survey (Gausby, 2015) conducted by Microsoft indicated that the average attention span of a human dropped from 12 to eight seconds, which means shorter than a goldfish. Given the average human attention span is decreasing, microlearning becomes more and more important because it emphasises short learning duration.
Unfortunately, all too many of us are having “squirrel” days, according to Dr. Gloria Mark, a professor of informatics at the University of California, Irvine, who studies how digital media impacts our lives. In her new book, “Attention Span: A Groundbreaking Way to Restore Balance, Happiness and Productivity,” Mark explained how decades of research has tracked the decline of the ability to focus.
“In 2004, we measured the average attention on a screen to be 2½ minutes,” Mark said. “Some years later, we found attention spans to be about 75 seconds. Now we find people can only pay attention to one screen for an average of 47 seconds.”
Not only do people concentrate for less than a minute on any one screen, Mark said, but when attention is diverted from an active work project, it also takes about 25 minutes to refocus on that task.
Tech-savvy users often say that the way the modern internet works has made it so that people’s attention spans are getting shorter every single day but the truth behind this story is rather tough to ascertain. However, recent data from SimilarWeb indicates that people definitely are suffering from shorter attention spans, and what’s more is that these attention spans are shortening at a pretty rapid pace when you take into account the numerous factors that are coming into play, all of which serve some kind of purpose in this trend.
If you look at the data for how long users spend on websites before navigating away, for the most part the trend has been that these times are remaining more or less stable on web based browsing, but if you look at the trend for mobile browsing between the years 2017 and 2019 you would see that there is a drop of about 11 seconds in the average time spent on a website. When you take into account the fact that mobile browsing is starting to become a lot more popular and in many ways has become the preferred form of browsing for people on the internet, the change is a lot more drastic.
However, as much as technology can be used as an effective learning tool inside and outside the classroom, there’s no denying that one of the biggest challenges faced by educators today is the distraction posed by social media. Students are distracted by their phones during class, and even without that distraction, the time they spend on social media outside the classroom has an impact on their attention spans.
In 2015, a study commissioned by Microsoft and discussed in Time magazine found that the average attention span was in fact only 8 s. If indeed this is the case, then even participating in a 15-min lecture would be positively heroic. To place this in perspective, it was reported in the same Time article, that goldfish, of the piscine rather than snack variety, have an attention span of 9 s, one whole second greater than humans! It is perhaps rather premature to opt for an 8-s lecture format, as there are many caveats to the Time article, not the least of which is that no one knows how to actually measure a goldfish’s attention span. What has been measured is goldfish memory, which, according to researchers in the School of Psychology at the University of Plymouth, is actually quite good (7). Similarly the 8-s attention span for humans actually reflects the average time a person will spend on a web page before looking somewhere else.
I’ll list the closest thing those studies have to a control group, list sorted by year.
Studies Using the CPT
What Determines Continuous Performance Task Performance? (Maria J. O. van Asma/René P. Rombouts/Robert J. van den Bosch, 1996): Patient controls n=19 (mean age 29.5±8.1 years), normal controls n=20 (mean age 32.2±8.3 years). Tested with an unspecified digit related CPT.
Measuring Several Aspects of Attention in One Test (Iwona Kovalik-Gran/Jens Egeland, 2010): n=376 Norwegian patients, aged 14-77 (mean 32.9 years, standard deviation 13.8), either referred to Egeland or to the Vestfold Mental Health Care Trust, with various psychological disorders (57 without such disorders). Tested using Conners’ CPT. Scores for normal group not reported independently.
Reliability and validity of the Conners’ Continuous Performance Test (Danielle Shaked/Lauren M. D. Faulkner/Kathryn Tolle/Carrington R. Wendell/Shari R. Waldstein/Robert J. Spencer, 2019): n=91 undergraduate psychology students (20.01±1.68 years), tested using Conners’ CPT II. Page 3 lists more retest studies for Conners’ CPT II.
Do concerns about COVID-19 impair sustained attention? (Jihyang Jun/Yi Ni Toh/Caitlin A. Sisk/Roger W. Remington/Vanessa G. Lee, 2021): n=161 participants (23±5.2 years), recruited online. No control group, but instead a correlational study. Tested using the scene CPT.
Appendix C: How I Believed One Might Measure Attention Span Before I Found Out About The CPT
Before I found out about the Continuous Performance Test, I speculated about how to measure attention span:
(Note that I’m not a psychometrician, but I like speculating about things, so the ideas below might contain subtle and glaring mistakes. Noting them down here anyway because I might want to implement them at some point.)
It seems relatively easy to measure attention span with a power- or speed-test, via one of three methods:
Present a stimulus, change the stimulus and let the test subject report the change in stimulus; this results in two numbers: the time between the stimulus being initially being presented and the time it was changed (let’s call this value t_change), and the time between the change of the stimulus and the reporting of the change (calling this value t_report). Performing this test with different value of t_change should result in different values of t_report. There is a t_change for which t_report falls over a threshold value, that t_change can be called the attention span.
This method has some disadvantages:
It needs a change in stimulus that requires selective attention to notice, but changing e.g. visual stimuli involves motion, which direct attention. (Idea: have a colored stimulus continuously changing color, and a reference color, once the stimulus has the reference color, the subject is supposed to report; avoiding sudden changes in visual stimuli.)
The method would require many samples to find the t_change for which t_report falls over the threshold value.
Performing the test multiple times in a row might induce mental fatigue, decreasing attention span
Let the test subject engage in a mentally draining exercise like the Stroop test with some performance measure. I would the performance to decline over time, and one could define a threshold value at which the subject “is no longer paying attention”.
Let the subject observe a neutral stimulus while measuring some indicator of attention (such as arousal via skin conductivity or the default mode network being inactive), when the measured value falls under/over a threshold the subject has “lost attention”.
This method has the major disadvantage that it requires special equipment to perform.
Such an instrument would of course need to have different forms of reliability and validity, and I think it would probably work best as a power test or a speed test.
I’m not sure how such a test would relate to standard IQ tests: would it simply measure a subpart of g, completely independent or just partially related to it?
[This is one of the finalists in the SMTM Mysteries Contest, by a reader writing under the pseudonym TripleTaco. We’ll be posting about one of these a week until we have gotten through all the finalists. At the end, we’ll ask you to vote for a favorite, so remember which ones you liked.]
I’ve always thought it was really weird that Amish kids mostly don’t get autism, and I have found other people’s explanations unsatisfying. For most of America, the autism rate is about 1 in 44, but among the Amish, it’s more like 1 in 271. Like obesity, autism is extremely common in developed countries, and largely did not exist (or was not recognized) before modernity (diagnoses have been rising dramatically). So, what if some environmental factor of our modern lives is poisoning our children’s very minds?
It’s not because of vaccines
It appears that the correlation is mostly used by anti-vaxxers as evidence that children shouldn’t be vaccinated, which makes it data-non-grata; (we can’t talk about this data lest we reinforce their conspiracy theories).
It’s not because of vaccines, but what if there’s some other aspect of modern living that contributes heavily to autism? There are many other things that the Amish abstain from that could be contributing.
It’s not (just) kids watching TV
An easy culprit would be babies watching too much TV. Indeed, there’s strong evidence that needs to be considered here. An increase in cable TV subscriptions in a neighborhood is directly correlated to an increase in autism. The same article also points out that places with more precipitation get more autism (perhaps because people stay inside to watch tv?)
But wait, while research on this topic does support the theory that screen time is bad for babies’ social development, the effect size is not nearly large enough to explain the severe autism that many children get. No study I’ve seen directly correlates baby screen time with actual autism diagnosis (although few would argue that heavy screen usage is good for babies). The TV explanation is missing something.
It’s not (just) genetics
Some have pointed out that the Amish are genetically isolated, and perhaps they have good genes for avoiding autism. However, no genetic explanation can explain the dramatic rise in autism rates in other populations.
Are there any clues from treatment?
Often diseases are treated with therapies that reflect their causes. Bacterial infections are treated with antibiotics, for instance; kill the bacteria, and you solve the problem. Autism is generally treated with language and behavioral therapy. The child has extended one-on-one time with a therapist where they practice building the social skills they are lacking. If this is truly the best treatment (at least for now), does that tell us anything about possible causes?
Teaser
There’s more I’d like to say and link to, but I can’t really go much further without getting into my own theories on this topic. I do have a pet theory that I think better fits the observed data than other explanations. I have in fact found one other individual (a therapist who treats autism) espousing my exact theory, although it doesn’t appear to have gotten much traction. For now, I’ll just leave this mystery without suggesting a theory as requested.
Krinn is a reader who participated in our Low-Dose Potassium Community Trial. She lost 6 lbs taking low doses of potassium, and liked it enough that she decided to keep going, along with a new exercise habit to help support the weight loss. And she started trying higher doses of potassium, eventually ramping up to around 10,000 mg of potassium a day.
This is a lot — way more than the average person gets from their diet, and a lot more than people added in the original potassium trial.
Krinn writes, “I decided to stabilize at about 10,000mg per day … because that’s about how much potassium people were getting during the SMTM potato diet community trial. … Aiming for that amount also meant that it would be easier to compare my results to something that worked decently well and to ask questions like ‘is there something special about whole potatoes, or is it mostly the potassium?’ If it’s mostly the potassium, you’d expect my results to be closer to the full-potato-diet results than to the low-dose-potassium results — which is what happened.”
Indeed, she lost quite a bit of weight. Here’s the chart of her weight change so far:
Krinn’s report is excellent — nuanced, detailed, and clearly written. She covers almost every aspect of her self-experiment better than we could, so we won’t try to restate her points. We recommend that you read the report for yourself. Instead we will focus on the few small areas where we can add some speculation or additional context.
(Krinn’s full report is also reproduced in an appendix below, because tumblr posts do not always have the best longevity and we figured it might be good for the report to exist in two places.)
Again, you may want to read what Krinn wrote before you take a look at our comments. But we will restate this part: while this seems to be working for Krinn, it’s not clear that high doses of potassium are safe for everyone, and they almost certainly are not safe if you have kidney disease or related diseases like diabetes. Do not try supplementing doses this high without consulting your physician, and absolutely do not try it if you have kidney problems or any conditions that might compromise your kidney function.
Ok, here are our thoughts:
Optimal Weight Loss Brine
Originally, we argued that high doses of potassium alone might be responsible for weight loss on the potato diet. After all, eating nothing but potatoes does give you heroic doses of potassium.
In retrospect, that seems a little naïve. Sure, it could be just the potassium. But biology tends to be a bit more complicated than that.
At risk of oversimplifying (read the original piece), people knew that cholera patients needed electrolytes, but feeding them an electrolyte solution didn’t seem to help.
Through a series of coincidences, people eventually discovered that adding glucose to the electrolyte solution sometimes made the treatment work. But this didn’t immediately lead to a cure, because if you put too much glucose and salt in the solution, it made patients worse instead.
After more confusion, they discovered that sugar and sodium ions are absorbed together in the gut through a sodium-glucose cotransport protein, but you need the right concentration or it will dehydrate the patient instead, which often kills them. The solution was simple, but getting there was hard.
Getting sodium into someone’s body isn’t simple — you need to include glucose in your rehydration formula, and even then, you need to get the right ratios. This makes us suspicious that something similar might be the case for potassium.
Even if high doses of potassium are required for curing obesity, it seems pretty likely that potassium by itself isn’t the whole engine. So now we are looking for some other set of factors, probably other switches that are triggered when you eat ~100% potatoes, that might also be needed to make the gears of weight loss mesh.
Magnesium
The most likely candidate at this point seems to be magnesium.
Potassium and magnesium serve many complementary roles in our biology, and the two minerals are often prescribed together. We spoke to a physician about this, and he pointed out that for patients with low potassium (hypoK), if you don’t have enough magnesium (hypoMg), you’ll be hypoK forever unless you fix the hypoMg first, because of “some renal excretion thing I think” (his words). See also this paper, which says: “magnesium replacement is often necessary before hypokalemia and potassium depletion can be satisfactorily corrected with potassium supplements.”
Electrolyte mixes, like LMNT and Snake Juice, are sometimes used for weight loss, and these mixes usually contain some amount of magnesium. Assuming that, by a process of natural selection (it’s a electrolyte-powder-eat-electrolyte-powder world out there), they have run across something like the right electrolyte ratios to cause weight loss, this also suggests that magnesium might be involved.
Like potassium, most people are not getting enough magnesium, at least per the official recommendations. According to the NIH, you’re supposed to get 300-400 mg of magnesium per day. And potatoes are not only high in potassium, they are also pretty high in magnesium. Each potato contains about 40-50 mg of magnesium. Someone on the potato diet would be getting about 800-1,000 mg of magnesium per day.
If potassium and magnesium together are the cause of this weight-loss effect we’ve discovered, this would explain why potatoes are such a reliable way to cause weight loss, and why they’re often more effective than supplementing straight potassium.
All the above are just reasons for the hunch, but we also want to note that this hunch is supported by Krinn’s case study.
In her report, Krinn writes:
My go-to snacks are cashews, pistachios, cherries, and granola bars.
Cashews and pistachios are particularly high in magnesium, providing 260 mg per 100 g and 110 mg per 100 g, respectively. We don’t know exactly how much cashew Krinn is consuming, but it’s likely that it’s giving her a respectable amount of extra magnesium.
We discussed this with Krinn over twitter DMs, where she said, “cashews are one of my go-to snack foods, so whatever amount I’m getting, I would be extremely surprised if I was getting something less than a healthy amount of magnesium in my diet.”
She also notes that she tends to consume the magnesium and the potassium relatively close to one another.
…another way to look at it is that “eating cashews” and “drinking gatorade+potassium” are never _that_ far apart
since the potassium stuff is spaced out 4–6 hours apart across the day, and I eat cashews at least once a day, that puts a ceiling on how far apart those two things could possibly be
We have no idea if it’s helpful or necessary to take potassium and magnesium at the same time, but it’s worth noting that Krinn tends to leave little daylight between them.
In short, we’ve suspected for a while that potassium might only cause weight loss, or might cause it more effectively, when combined with reasonable doses of magnesium. The potato diet would get this “for free”, since potatoes contain high amounts of both. This hunch predicts that people who lose weight by supplementing straight potassium will also likely be getting high doses of magnesium from some source, either from supplements or their diet. Krinn’s case matches that prediction.
We’ve also recently been corresponding with a participant from the half-tato diet, who told us that straight potassium seems to have some kind of effect for her, and who mentioned that she has been supplementing 266 mg magnesium a day as magnesium glycinate capsules. Not conclusive, but another hit for the prediction.
Stearic Acid
Another possible connection is with stearic acid, a fatty acid found in “meat, poultry, fish, eggs, dairy products, and foods prepared with fats; beef tallow, lard, butterfat, cocoa butter, and shea butter are rich fat sources of stearic acid.”
If you’ve heard of this before, it’s likely from Fire in a Bottle (FIAB), a website/program/theory which argues that a diet high in stearic acid can cause weight loss. This is sometimes called The Croissant Diet (TCD), presumably in the hopes of confusing readers — you do not actually eat nothing but croissants. In fact, you don’t have to eat any croissants at all. But you do ideally eat lots of foods high in stearic acid, sometimes supplementing with additional stearic acid, and some people seem to lose weight when they do this.
We find the evidence for stearic acid to be pretty thin (though see FIAB for the other side). And there’s no theoretical reason to suspect that stearic acid influences potassium uptake or anything. But there are just a few hints, so we figured we might mention them here.
One is from our half-tato diet. Most people lost only a small amount of weight on the half-tato diet, but one person (participant 25348806) lost 17 lbs in four weeks. This person gave us detailed notes about the rest of her diet, and this part jumped out in particular:
…I also have dairy – at least one glass of milk a day (either raw whole milk or 2% or whole conventional) – and a small amount of juice or lemonade. Some mornings I may have full fat yogurt with collagen and stearic acid (see fireinabottle.net) but not all mornings. I have some extra potassium as well as other supplements.
For comparison, here’s what Krinn has to say about her fat intake:
I use a generous hand when measuring out olive oil. I believe that if you need either milk taste or milk fat, you shouldn’t half-ass it, so when I need milk taste or milk fat, I rely on whole milk and heavy cream. Fats, generally, taste good. … Once in a while, dark chocolate, usually with the nuts and fruit.
Milk fat and chocolate (via cocoa butter) are both high in stearic acid. You’ll also notice that Krinn usually takes the chocolate with her cashews.
And remember that participant from the half-tato diet we mentioned at the end of last section? Based on these hints, we asked her if she also consumes a lot of stearic acid, and she told us she eats a whole lot of dairy fat, and chocolate “EVERY DAY” (her emphasis).
This is certainly suggestive, but what doesn’t fit is the potato diet. Potatoes are high in both potassium and magnesium, so it would make sense if high doses of potassium and magnesium conspire to create the potato weight loss effect. But potatoes contain very little fat and approximately 0% stearic acid. The idea that you might need stearic acid to cause the potato effect is rather inconsistent with the potato diet, since you do not get appreciable amounts of stearic acid from potatoes.
That said, there are hints that people who are on half-tato, or who are supplementing potassium directly, do benefit from stearic acid. At the very least, we’ve noticed that some of the biggest success stories are people who have been getting decent amounts of stearic acid in their diet. Maybe stearic acid helps when you are getting less enormous doses of these minerals?
It’s always possible there’s some unknown connection — maybe the potato diet only works for people who already have sufficient reserves of stearic acid in their body. Or maybe some people need stearic acid for the effect to kick in and others don’t, for genetic reasons. Or maybe other fatty acids can substitute in a pinch, but stearic acid happens to be slightly better than average. But at this point, it definitely fits less well than magnesium.
This Age Needs Heroes
We love everything that Krinn did here. She participated in a community trial, decided to keep going, and spun it into a self-experiment. She came up with her own design and attacked her questions in her own unique way. She did something interesting and she wrote it down so that all ingenious people could be informed thereof and consider the results for themselves.
More people should do what Krinn did, and get involved in the business of conducting science. Anyone else who wants to do anything else remotely like this should feel free to reach out, we’d be happy to help. We’re in the process of writing a whole series about how to conduct self-experiments, which may be a good starting point. Consider this your invitation.
Here’s a reproduction of Krinn’s full report as it appears in her tumblr post:
An Ad-Hoc, Informally-Specified, Bug-Ridden, Single-Subject Study Of Weight Loss Via Potassium Supplementation And Exercise Without Dieting
Here’s the short version: I lost 30 pounds in 6 months by chugging a bunch of potassium salt and exercising a lot. My subjective experience is that cranking my potassium intake way up made it possible to do a lot more exercise than I had been doing without also eating a lot more. Exercising more without also eating more led to weight loss (as one would hope!). I did not diet: I ate as I had been doing and as it pleased me to do. Do with the raw data as you please.
Losing weight this way is unusual and worth paying attention to because many things about increases and decreases in weight and obesity are very poorly understood. Many people would like their personal weight and obesity levels to be different, so anything that improves our collective understanding of how to make that happen is valuable. However, losing weight this way is an experiment: it’s not necessarily safe to do what I did! Part of why I did it was to find out what would happen, and if you have any kind of existing kidney problems then you definitely should not do what I did. Note to other transfemmes: if you’re taking spiro, that counts as a kidney problem.
I also don’t want to overstate the significance of this experiment: what I’ve been up to in the last 6 months amounts to a single data point that happens to also be 1,100 spreadsheet cells. It’s a data point that is highly suggestive, sure, but it would be extremely ambitious to say that it proves anything beyond “this worked for me” and perhaps “it’s not impossible for this to work”. I am writing about it because as far as I know, this particular experiment is something that nobody else has tried, and, again, anything that improves our collective understanding here is valuable.
The long version comes next: how I came to be doing this experiment, what I did in the experiment, what I plan to do next, and finally what I think about it all. The really long version is the ongoing conversation that this post is part of, starting with A Chemical Hunger, which is a book-length literature review about the 1980s–present global increase in obesity prevalence, also the posts about single-subject research where the same authors discuss the limits of what can be learned from experiences like mine, also the Experimental Fat Loss guy and his wide variety of diet-only experiments, also somecritics who disagree.
How I came to be doing this
At the tail end of 2022, I noticed both that my BMI had hit 30 and that I had become very unhappy about my weight. There’s a specific photo where I didn’t realize until I saw the photo that my belly was hanging out over my waistband and it’s vividly unpleasant in my memory. Around the same time, I happened to find the potassium-supplementation community trial that the Slime Mold Time Mold folks were running. The value proposition was “this will be easy, cheap, and safe, but also it might not actually work,” and that sounded good to me, so I signed up for it and took a modest amount of potassium all through December and January. It kinda-sorta worked: I lost 6 pounds. Not nothing, but “it kinda-sorta worked” is the most one can really say about losing 6 pounds in 60 days.
The low-dose potassium delivered on all of what the SMTM folks promised, though. It was easy, cheap, and safe. So I kept doing it and, since I was already doing the potassium, decided that I should get an exercise habit going. I am a big believer in the idea that it’s a tremendous amount easier to go from doing Something to doing More Something than to go from doing Nothing to doing Something. The low-dose potassium got me through the first step: once I was doing Something about my weight, it was relatively easy to do More Something. When the community trial ended in early February, I didn’t have to worry about messing up its results by departing from the trial’s instructions, so I started taking more potassium and building my own experiment. I also kept in touch with the SMTM authors, who were very encouraging. 🐯💕
By late March I had brought myself up to daily amounts of potassium and exertion that seemed good to me, and I stuck with those. This is the first time in my life I’ve focused on trying to lose weight, and I was not fully prepared for how demoralizing it is that the weight change from day N to day N+1 sometimes seems perversely unrelated to what you were doing on day N. Fortunately I have experience with spreadsheets, so I put together a tracker for myself that focused on the trailing-week average of my daily weight and exercise measurements as well as long-term graphs. Three months of data were enough to put together a chart whose trendline said very, very clearly, “what you are doing is working — keep it up!” With any kind of long-term project it’s very important to create and sustain sources of feedback. All else being equal, the longer it takes before you can get a read on “is this going well or poorly?”, the worse it will go.
I decided that my goal would be to get my BMI from 30 (the lower limit of “obese”) to under 25 (the upper limit of “normal”). Happily, the math is very easy there: for my height, a BMI of 25 rounds off to 200lbs. I further decided that I was willing to spend all of 2023 working on this. That decision is why I’m writing this post now: halfway through a project is a natural time to pause and take stock.
What I did
By the end of March, my regimen was firmly settled and I kept at it through the end of June without further tinkering. The daily goals I settled on were 10,000mg of potassium and 1,200kcal of exertion. That amount of exercise worked out to be 90 to 100 minutes per day. For contrast, in 2022 my average amount of exercise per day was 15 minutes and my average exertion was 500kcal.
I used my smartwatch’s exertion number (“how many calories are you using above the amount you need to burn to be alive at all?”) and gradually walked up my daily goal, settling at 1,200kcal/day partially because it was working and partially because one hour of watch face equaling 100kcal was helpful for being able to read “how close to my goal am I?” without thinking hard about it. Most of the exercise was treadmill time, usually a brisk walk or light jog. Over the months I also did some running, some bicycling, and some hiking, but treadmill time was the reliable, unremarkable, do-this-every-day core of my exercise regimen. It took a while to ramp up to that amount of exertion and there were definitely days when I stumbled, for good reasons and bad. However, in general I hit the exertion goal and in particular had it absolutely dialed from early March to mid-April.
It was easier to be totally rigorous about the potassium-intake goal — it helped that that part only took a few minutes per day, instead of 90+ minutes! I used potassium chloride powder (whatever came up first on an Amazon search since all KCl should be alike) mixed with regular Gatorade (i.e. not the sugar-free kind) to make it taste okay (I recommend blue Gatorade, it’s the closest to appealing when kaliated — the yellow lemon-lime was meh and the fruit punch red was awful). I added two heaping teaspoons of KCl powder to a 20oz. bottle of Gatorade and drank that. KCl is about 52% potassium and a heaping teaspoon of it is about 6500mg, so I rounded up a smidge and called that 6600-and-a-bit milligrams of potassium per bottle. On Thursdays and Sundays I drank 2 full bottles and on other days 1.5 bottles. I recorded this as 10,000mg of potassium on regular days and 13,500mg on Thursdays and Sundays.
Is 10,000mg of potassium a lot? It’s a lot more than average! The SMTM potassium trial post contextualizes it helpfully:
For a long time, the recommended daily value for adults (technically, the “Adequate Intake”) was 4,700 mg of potassium per day. But most people don’t get anywhere near this amount.
In every CDC NHANES dataset from 1999 to 2018, median potassium intake hovers around 2,400 mg/day, and mean intake around 2,600 mg/day. In this report from 2004, the National Academy of Medicine found that “most American women … consume no more than half of the recommended amount of potassium, and men’s intake is only moderately higher.” Per this paper, only 0.3% of American women were getting the recommended amount. Similarly low levels of intake are also observed in Europe, Mexico, China, etc.
But in 2019, the National Academies of Sciences, Engineering, and Medicine changed the recommended / adequate intake to 2,600 mg/day for women and 3,400 mg/day for men. They say that the change is “due, in part, to the expansion of the DRI model in which consideration of chronic disease risk reduction was separate from consideration of adequacy,” but we can’t help but wonder if they changed it because it was embarrassing to have less than 5% of the population getting the recommended amount.
In any case, recommended potassium intake is something like 2,500 to 5,000 mg per day for adults, and many people don’t get enough.
Potatoes are exceptionally high in potassium. A single potato contains somewhere between 600 and 1000 mg of potassium, depending on which source you look at. They are the 6th highest in potassium on this list of high-potassium foods from the NIH, and 9th on this old list from the USDA. If you do the math, this means that someone on the potato diet, eating 2,000 kcal of potatoes a day, gets at least 11,000 mg of potassium per day, more than twice the old recommended intake.
This explanation is most of why I decided to stabilize at about 10,000mg per day: because that’s about how much potassium people were getting during the SMTM potato diet community trial. Because that community trial involved around 200 people, it was unlikely that there would be any truly heinous health effects from knocking back that much potassium, especially together with the anecdotal evidence that inspired the trial. Aiming for that amount also meant that it would be easier to compare my results to something that worked decently well and to ask questions like “is there something special about whole potatoes, or is it mostly the potassium?” If it’s mostly the potassium, you’d expect my results to be closer to the full-potato-diet results than to the low-dose-potassium results — which is what happened.
I measured those results in a very basic way: ordinary bathroom scale, first thing in the morning, every day. Considering how much noise there is in weight measurement, there’s just no advantage to measuring it more often. I kept the circumstances of the weigh-in simple and stable, trusting that that was good enough. I also measured exertion in two other forms — step count and exercise minutes — but that was mostly for my personal curiosity because both are basically downstream of exertion as such. Similarly, I tracked my sleep but didn’t expect that to matter a whole lot.
While I was affirmatively not dieting, I want to make sure to talk about my food habits because I could be missing something that’s easy for others to see as unusual but seems totally ordinary to me. My meals are heavy on pasta, rice, bread, and granola. I work diligently to get enough dietary fiber. I eat some meat but not a lot (eating a pound of meat in a week would be above average for me), and I enjoy coffee but not a tremendous amount of it since usually I make Chemex-style coffee and having a bunch of that in a day would be too time-intensive. My go-to snacks are cashews, pistachios, cherries, and granola bars. Like most people, I should eat more dark leafy greens than I do. I use a generous hand when measuring out olive oil. I believe that if you need either milk taste or milk fat, you shouldn’t half-ass it, so when I need milk taste or milk fat, I rely on whole milk and heavy cream. Fats, generally, taste good. I eat more whole food and food I personally cook than I eat packaged and processed food, and I only infrequently eat restaurant food (weekly pizza night, maybe twice a month other than that). I really like sour candies but basically stopped eating them last autumn after some very patient coaxing from my dentist. Once in a while, dark chocolate, usually with the nuts and fruit.
I ate as I had been doing: I ate the food I felt like eating and ate as much of it as I felt like eating. If I felt like eating more or less, I did that. Since I wanted to keep the exercise habit going regardless of whether or not I lost weight, it was very important to me to not make the exercise any more difficult than it had to be. Going hungry would definitely make it more difficult, so I avoided doing that. One way in which I’m very sure my experience generalizes is, it’s much easier to persuade people to try “add this supplement to what you’re already eating” than to get them to try “replace all of your current food with potatoes,” especially when talking about long-term or indefinite-duration changes.
What I plan to do next
I’ll be thrilled if I can recapture something like the 7-week March/April streak I had going. Most days in this period (44 out of 49) were PB days (i.e. a day where my trailing-week-average weight was the lowest it had been since the start of the year) and no two consecutive days in this period were non-PB days (i.e. if a day wasn’t a PB day, both the day before and the day after were PB days). I was losing almost 2lbs per week and exercising a lot and I felt great. However, my intuition is that that was the honeymoon period of going from mostly-sedentary to exercising regularly, and that I should expect further progress to be more difficult, to be like the less impressive results I got in May and June.
Still, the thing as a whole has definitely been successful enough that I’m going to keep at it until the end of the year, re-evaluating again in December (and maybe when I hit my weight-loss goal, which should happen around halfway between now and then). Since I’m using January 1st as my anchor date for the start of the experiment, it lines up nicely with the calendar if I just keep going all year and see what happens. Besides, I only need 6 months more to generate a year of data, while someone going from a cold start would need a whole year.
Given that I have a setup that is working pretty well, I’m reluctant to tinker with it. I might add one more high-potassium day in addition to Thursdays and Sundays, and I might start tracking some extra data — even though I’m not trying to change them, recording my food habits seems like the most helpful additional thing I could record.
If I develop health problems I’m gonna pull the ripcord (and post about it). There are already too many shitty fake weight loss regimens in the world that fuck up the health of people who try them, we do not need more.
What I think about it
Since I’m the one doing this experiment, I get to be excited about how it’s working out for me personally, which is to say, very well indeed. Right now it seems pretty certain that I’ll be able to reach my goal of losing ~50lbs in a long-term-sustainable way and just as importantly, getting myself to a much better baseline state of physical fitness. I feel pretty great about that part!
The experiment is not just for me, though: the reason it’s an experiment rather than just “I’m trying to lose weight” is that I am keeping track of things carefully such that other people could carry out the same steps I did and get results similar to or different from mine and ideally everyone eventually comes to pretty firm conclusions about whether this — losing weight via potassium and exercise without dieting — works or not. My chugging potassium and Gatorade for six months to a year is the very beginning of that process, and I expect that the difficult parts of the process will be carried out by people with more expertise and resources than me.
I also expect that I have not tumbled to the One Weird Trick for weight loss that everyone else just overlooked. As someone with plenty of programming experience, I have a hearty suspicion towards “well, it worked on MY setup” stories. One obvious alternate explanation for my successful weight loss is “well yeah, you doubled your exertion and kept your food intake the same, of course you lost weight” — but I don’t find that explanation satisfying. To start with, if it were that easy, people would do it more often. There are a tremendous number of people who would like to lose weight and a tremendous marketplace of devices, services, and professionals to help them use exercise for that purpose, and yet in a 20-year NCHS study, average exercise rose without obesity falling. It’s also very, very easy to find fat people who exercise plenty — you will find them more or less anywhere you find lots of people exercising, as well as in places like sumo stables. A member of my family has taken up powerlifting in the last year, making him both fitter and heavier by quite a bit.
Additionally, there’s studies like Keating 2017 concluding that short-term exercise intervention doesn’t do enough to matter, or like the Wu 2009 work concluding that exerise-and-dieting isn’t meaningfully better than just dieting over periods of 6+ months, and then there’s the STRRIDE study, Slentz 2004, concluding that jogging 20 miles a week can get people to lose about 7 pounds over 8 months. The STRRIDE study caught my eye because it’s pretty similar to what I did: they took obese mostly-sedentary folks, had them exercise more, and forbade them from eating less. However, once you do the math the results are much less similar: the average STRRIDE participant did around half the exercise I’ve done for at most a fifth of the weight loss (i.e. around 1lb/month vs. around 5lbs/month and around 3mi/day vs. 7mi/day). If someone else told me “Krinn, your naïve just-hit-the-treadmill exercise regimen is 2.5x as effective as an exercise regimen supervised & measured by professionals,” I would want them to provide some compelling evidence for that.
If you tell someone you want to lose weight and would like their advice, it is overwhelmingly likely that the advice will involve exercising more. Everyone has heard this advice. And yet, as Michael Hobbes observes in a searing piece for Highline, “many ‘failed’ obesity interventions are successful eat-healthier-and-exercise-more interventions” that simply didn’t result in weight loss. Even if we as a society choose to believe “more exercise always leads to weight loss, most people just fuck up at it,” that immediately confronts us with the important question, why do they fuck up at it? and its equally urgent sibling, what can we learn from those who succeed at it to give a hand up to those who have not yet succeeded?
[exercising more and eating less] is not an explanation any more than “the bullet” is a good explanation for “who killed the mayor?” Something about the potato diet lowered people’s lipostat set point, which reduced their appetite, which yes made them eat fewer calories, which was part of what led them to lose weight. Yes, “fewer kcal/day” is somewhere in the causal chain. No, it is not an explanation.
Since I’ve been doing this for six months, I feel pretty certain that the potassium is doing something positive for me and I’m entirely willing to put in another six months to find out what happens for me. Finding out whether that generalizes is beyond my power: all I can do is explain what worked for me, one middle-aged Seattle housewife, and hope that it’s useful to people who are in a position to do serious work about it.
One kind of serious work that’s available is the very cool analytic techniques that other people in this conversation have used while looking at their data. If you are the kind of person to get elbows-deep in R or Matlab, feel free to grab my day-by-day measurements for that (I release this data under Creative Commons’ CC0 if that’s relevant to you). I’m not going to do that, though, partially because it’s been a long time since I last used R but mostly because of the thing I said earlier about my whole experiment basically being one data point. If you have a data series, then yeah, get in there with some numeric interrogation, but if you only have one data point, that data point is what it is and statistical analysis can’t really add to it. All I can claim here is that this is a new data point: people going about their everyday lives do not spontaneously increase their potassium intake severalfold and the background work from the SMTM potato diet and potassium community trials tell me that no-one’s run a study looking directly at what happens if you do increase your potassium intake that much.
Do you want to increase your potassium intake that much? If you do, I have to re-emphasize the potassium community trial‘s safety warning: if you have existing kidney problems, do not try this. Also I’m gonna deploy the boldface again to make sure I get this across to other trans women: on this topic, taking spiro counts as a kidney problem! I am not a doctor and I’m extremely not your doctor, you should talk to your actual doctor if you have any kind of potential kidney issues and even if you’re in good health and want to try chugging a bunch of potassium, you should titrate up gradually the way the SMTM writeup suggests (which is also the way I did).
In addition to a general spirit of responsibility, those warnings are important because otherwise just telling you that this is easy would sound like a recommendation. Did I mention that the experiment was easy? Easy easy. Piss easy. Lemon squeezy, etc. Of course building an exercise habit wasn’t easy, but the potassium part didn’t make it easier or harder, and the potassium part itself was pretty trivial. Mix this powder into Gatorade a couple times per day, drink it, done.
That said, if you do want to try this, godspeed and please write down how it goes for you. I recommend building positive reinforcement into whatever you use to track it; my personal spreadsheet for this is adorned with color-coding and happy emoji. I also recommend at least thinking about the following questions, whether you’re going to do this, evaluate the results of this, or both.
How safe is it, in general rather than for me particularly, to chug this much potassium? This is the big one: “just mix potassium salt into Gatorade and drink it a few times a day” is so incredibly easy that even if the effect size is small, it could benefit a huge number of people, but of course it doesn’t benefit them if it’s not actually safe to do that.
Does this replicate? If it’s not safe it matters a lot less whether it replicates, so the safety question comes first, but if it is safe, then one would immediately want to find out whether it works for 1% of people, 10% of people, or 50% of people.
How much do other mineral nutrients, particularly sodium and magnesium, matter for this? Maybe they need to be combined in some specific way, as this Twitter thread suggests.
Do sex hormone levels matter? I’m a trans woman and I’ve been having problems with access to HRT in this timeframe. Given how many things in one’s body testosterone and estrogen affect, and given that previous obesity research has shown differences based on hormone profiles, that’s definitely something to keep an eye on. Also because spironolactone in particular messes with renal function and potassium metabolism, I expect that it affects this. Digression: spironolactone is total bullshit as an anti-androgen of first resort. It sucks and I hate it and I should have switched to other anti-androgens even sooner than I did. If you’re using spironolactone as an anti-androgen because it was the first thing your doctor tried for that, you really should try something else and see if that works.
I steadfastly avoided dieting. I like my existing diet just fine, and that’s why I preferred the “what if I just chug a bunch of potassium” plan. All else being equal, I’d rather try things that let me eat what I like than things that require throwing my relationship with food into upheaval. But of course you wonder, what would happen if you did combine dieting and exercise and potassium? The ExFatLoss guy has been busy trying a lot of diet-only interventions and he’s got a lot of interesting results. I am not the person to try it, but it’s one of the obvious things to try, so I hope someone does try it.
How does this interact with the munchies? If you decide to try what I tried and you, like me, enjoy living somewhere where marijuana is legal, I think you should look at whether the potassium changes how you experience marijuana-induced hunger/overeating. One of the things I found very striking about the matter is that it was possible for me to chug enough potassium that the marijuana-induced hunger was drastically reduced. I expected the opposite since the potassium was causing me to eat less (relative to exertion) at other times. However, I have very strong habits about marijuana (exactly twice a week, edibles only, same amount every time) and I’m not willing to change them for this, so who knows how this aspect will work out for others. Definitely something to keep an eye on, though. Even if I wasn’t losing weight, the potassium reduces marijuana-induced overeating enough that I’d probably keep going with it just for that effect.
Conclusion
I spent 6 months trying to lose weight with lots of potassium and exercise but without dieting. So far I have succeeded. Unless something disastrous comes up, I’m going to keep trying it for at least another 6 months and going to keep recording what I’m doing. I’m particularly curious to see where I’ll plateau, since I assume at some point I’ll start getting really hungry and/or tired instead of accidentally starving. I hope that my experience and the data I’ve recorded from it, are useful to people who are looking into questions about obesity and weight. Please feel free to use my data and my writeup (this post) for that. If you want to try doing as I’ve done, good luck and stay safe: this has worked for me but it is still experimental, it might be unsafe and/or fail to work for you.