Sunday, May 6, 2007
Beets, Al Gore, and other Drama (sort of)
I think (perhaps somewhat pessimistically) that the problem of the public understanding science, and maybe more importantly, the philosophy of science, isn't something which can ever fully be resolved. It seems to be human nature to want to compartmentalize ideas into definite categories - this is right, this is wrong, this is true, this is false - according to what we've been brought up with, what our political party thinks, or what we want to believe. The public doesn't want to hear that something is under investigation and there are a number of different theories, all supported by different kinds of evidence, and that the true answer is a complex integration of them that we haven't quite figured out yet. People are uncomfortable with uncertainty and it's often easier to ignore other points of view and just focus on what makes yours "right." And if you and yours are right, that means someone else is wrong, which is good for self-esteem, group cohesion, etc.. I think people like to define an Other, so they have someone to elevate themselves and their group above or to blame things on, creating an us vs. them mentality that only serves to alienate differing viewpoints and make them more extreme. It's easy to maintain your viewpoint when you're fighting against "godless liberals" or "closed-minded Republicans", not so easy to hover somewhere in the middle, belonging to no group and unable to sum up your standpoint in a 10-second sound bite.
The other day while out to dinner I mentioned something about how I'd gone to see Al Gore's presentation last week. Now, I don't particularly like the guy or agree with all of his politics, but I appreciate what he's doing in terms of raising the public consciousness about climate change. Anyway, a friend of mine immediately said "oh I don't believe in that global warming stuff" - spoken in the same tone that one might espouse distaste for beets, or country music, or Tuesdays. I was baffled, considering I don't think she's had a science class since high school and freely admits that she knows virtually nothing about it. Unfortunately I think this is the way a lot of people come to their beliefs about important issues: they base it on what their parents/church/friends are telling them, or on their opinion of the messenger, or on their own speculations about how the world works. And while we could definitely be doing better with science education, I don't see those fundamental aspects of humanity going away. Evolution didn't prepare our minds for scientific inquiry, it equipped us to learn from our families and social groups. Our tendency to furiously guard our pride and reluctance to admit being wrong doesn't help either!
So, I don't really have a solution to propose. Obviously we should be doing something about the way science is taught in this country since there are countries where people are more scientifically literate. So it is possible to make substantial improvements - it just will never be 100% of the population that has the necessary understanding. But hey, in science you never really get perfect results anyway, right?
Thursday, April 26, 2007
Analyzing training data
Anyway, the level of detail (and the consistency) of these training logs varies drastically over the past two years or so; the more recent they are, the more data I have tended to record, and it’s only in the past semester that I really consider my spreadsheets to contain “enough” data, just for my personal use to look back on. What degree of statistical analyses I’ll be able to perform on the rest remains to be seen.
Here is what I currently keep track of--
For all cardiovascular work:
- sport (swim, bike, run, other)
- duration
- distance (yards, miles)
- pace (min/100 yards, mph, min/mile)
- RPE (rate of perceived exertion)
- average heart rate
- max heart rate
- skill trained (endurance, force, speed, muscular endurance, anaerobic endurance, power)
- workout done (e.g. 3x8/4r = 3 reps of 8 minutes high intensity followed by 4 min recovery)
- notes on the workout: can be details about lap times (to get an idea of changing pace & HR throughout), any nagging pains or developing injuries, general comments/thoughts
Since I focus on triathlon, I have more detailed information about the following:
- SWIM: stroke used, pool or open water
- BIKE: bike ridden, route, hills (1-9), wind (from weather.com)
- RUN: shoes worn, surface (dirt, pavement, track, etc), hills (1-9)
- LIFT: sets, reps, poundages for each exercise done
I also summarize the major information for each sport for each week and make a few graphs: total weekly volume (with breakdown for each sport), total weekly distance per sport, and the overall breakdown of how much time is spent in each discipline.
If you’re not completely convinced I’m a lunatic by this point, I present you with the following screenshots!


Craziness. But it's useful.
You may notice the uh, mild decrease in activity on the most recent week, where all I did was one weight session the entire week. Unfortunately I started having major iliotibial band problems during runs and have had to take some time off so that it doesn’t turn into a recurring injury. Neglecting exercise is stressful enough but it’s coupled with the likely prospect that the marathon I was training for all semester is not going to happen for me. A thought that’s more than a little devastating – but, better to lay off now and miss the race than to ignore the pain and injure myself permanently for the rest of the summer. With that in mind, I’ve been trying to think of some questions that I could ask about these data in order to use for an analysis. Here are a few:
- What factors predict incidence of joint/tendon pain or injury?
- How does weekly training volume affect paces?
- Does running surface/shoe type affect pace?
- What kind of relationship exists between heart rate, RPE, and pace?
- How does sleep impact speed, distance, RPE?
- How does daily volume relate to stress?
- Which weightlifting exercises lead to greatest strength increases?
- How much lifting volume leads to the fastest gains?
I don’t know if I have the consistency of data for a long enough period of time (e.g. a year) to fully answer these questions but if I can figure out a way to model some of these things, then in the future I can always go back and enter the increased data sets to see if the answers change. It would be really interesting to use this to find out information that would be extremely relevant to me. Combining personal life and intellectual interests so closely is definitely a unique opportunity!
Tuesday, March 27, 2007
On diligence
So, I decided to clear up some of the information that's clogging up my computer right now. I have a folder on my desktop devoted to 2.5 yrs worth of notepad files, ranging all the way from from "$ spent updat" (which not only has records of money spent, but some weightlifting workouts and part of a to-do list??) to "YOU ABSOLUTELY MUST BY 11-05" (in reference to some magazine subscriptions I had to cancel to avoid getting charged). However, there are 318 files. So... I decided on an easier task to tackle first.
I have a program that lets me put virtual post-it notes on my desktop. These notes tend to accumulate and the problem is exacerbated because whenever I have too much into cluttering up the space, I just stack all the notes on top of each other and forget about the bottom ones. So I decided to clean this up and sort all the important data. After separating all of them, here's the diversity of information I found:
- Distances to a bunch of landmarks on the levy bike path that I measured with Google Earth.
- Training paces for various kinds of running (easy, tempo, long, speedwork, etc) calculated based on my mile time.
- Data for some stuff I sold on ebay (listing fees, final value fee, paypal fee, shipping, price charged, profit).
- No less than 48 (!) songs to download. This list has been in progress for a long time.
- Information about a job on craigslist.
- A list of features I was looking for in a heart rate monitor (despite having already bought one that satisfied these requirements nearly two months ago).
- A to-do list (only one! a good step for me!)
- Some motivational training quotes.
- Six identifying phrases/sentences from different song lyrics so that I could google them to find out what the songs are, for eventual download.
- A library call number. To what? Don't know.
- Two phone numbers. Whose? Don't know.
- Some order numbers. For what? Don't know.
- Random workout information from ages ago, identified in time only by the day of the week since I suppose I expected to have it all entered in my spreadsheet well before I forgot what week it referred to.
- A few websites I heard/read about in some context and planned to visit. Almost a year ago.
- The titles of a few studies I wanted to look up.
- A list of all the categories of information I planned to include in my excel workout log.
- The numbers "12" and "6166537". Ok...
If you stay on top of things - entering data as soon as you get it - you're more likely to maintain that, whereas if you make a habit of jotting down quick notes in different places without timely follow-up, that's something you're going to stick to as well. The conclusion I've come to is, if the information is important to you, and you're taking the time to monitor it - whether it be for research or just personal stuff - then it should be worth the extra 2 seconds (or even, god forbid, 2 minutes!) to make sure that it's in a format that will be useful to you. Now hopefully I'll be able to abide by this philosophy!
Monday, March 26, 2007
virtue and vice
http://addictedtor.free.fr/graphiques/
It's a collection of graphs made using R, and the diversity and complexity of some (or most) of them is astonishing. I never would have thought it possible to make figures like those with R. Odd how a program that appears so "simple" (as compared to even something like Excel) is so powerful given the right knowledge about how to use it.
Monday, March 5, 2007
improbability
For simplicity's sake I assumed that every day of the year is equally likely to be someone's birthday (which is not true, but close enough for this estimate). There were 97 delegates, plus 29 staph (not a typo - their enthusiasm is "infectious." ha ha.), for a total of 126 people. The program lasted almost four weeks, or about 26 days.
So, the probability of 0 occurrences of an event when there are 126 trials and the odds are 26/365 is:
pbinom(0,size=126,prob=26/365)
Which turns out to be:
9.041928e-05
Or, less than 1 in 10,000!