Big Data and Public Health: An interview with Dr. Willem van Panhuis about Project Tycho, digitizing disease records, and new ways of doing research in public health

All opinions of the interviewer are my own and do not necessarily reflect those of Novo Nordisk.

One of the huge and perhaps still underappreciated aspects of the internet age is the digitization of information. While the invention of the printing press made the copying of information easy, quick and accurate, print still relied on books and other printed materials that were moved from place to place to spread information. Today digitization of information, cheap (almost free) storage, and the pervasiveness of the internet have vastly reduced barriers to use, transmission and analysis of information.

In an earlier post I described the project by researchers at the University of Pittsburgh that digitized US disease reports over the past 120+ years, creating a computable and freely available database of disease incidence in the US (Project Tycho, http://www.tycho.pitt.edu/) This incredible resource is there for anyone to download and use for research ranging from studies of vaccine efficacy to the building of epidemiological models to making regional public health analyses and comparisons.

Their work fascinates me both for what it said about vaccines and also for its connection to larger issues like Big Data in Public Health. I contacted the lead researcher on the project, Dr. Willem G. van Panhuis and he very kindly consented to an interview. What follows is our conversation about his work and the implications of this approach for Public Health research.

vanPanhuis,Wilbert[brianCohen20131113] (12)_resized

Dr. Willem van Panhuis. Image credit: Brian Cohen, 2013

Kyle Serikawa: Making this effort to digitize the disease records over the past ~120 years sounds like a pretty colossal undertaking. What inspired you and your colleagues to undertake this work?

Dr. Willem van Panhuis: One of the main goals of our center is to make computational models of how diseases spread and are transmitted. We’re inspired by the idea that by making computational models we can help decision makers with their policy choices. For example, in pandemics, we believe computational models will help decision makers to test their assumptions, to see how making different decisions will have different impacts.

So this led us to the thinking behind the current work. We believe that having better and more complete data will lead to better models and better decisions. Therefore, we needed better data.

On top of this, each model needs to be disease specific because each disease acts differently in how it spreads and what effects it has. In contrast, however, the basic data collection process that goes into creating the model for each disease is actually pretty similar across diseases. There is contacting those with the records of disease prevalence and its spread over time, collecting the data and then making the data ready for analysis. There’s considerable effort in that last part, especially as Health Departments often do not have the capacity to spend a lot of time and effort on responding to data requests by scientists.

The challenges are similar–we go through the same process every time we want to model a disease–so when we learned that a great source of much of the disease data in the public domain is in the form of these weekly surveillance reports published in MMWR and precursor journals, we had the idea: if we digitize the data once for all the diseases that would provide a useful resource for everybody.

We can make models for ourselves, but we can also allow others to do the same without duplication of effort. Continue reading

Can studies of bosses help us figure out how good sports managers are?

All opinions are my own and do not necessarily reflect those of Novo Nordisk.

The world would be a simpler place, although maybe a much more boring and predictable one, if every aspect of performance could be measured directly. My completely unoriginal thought here is that one of the reasons sports appeal to so many people is because they provide clarity. In a confusing, complex world where the NSA is sucking up our information like a Dyson vacuum sucks feathers in a henhouse, and we’re told this is for our own good, clarity can be refreshing.

The simple view of an athlete’s performance is that all the accolades (or jeers), all the milestones (or flops), all the accumulated statistical totals (or lack thereof) are because of that athlete’s ability: his or her drive, passion, training, and natural ability. And that performance is measured via the statistics each sport collects and chooses to honor and promote. Performance is right there, what more do you need? What more could you want? Continue reading

What do labrador retrievers and NFL wide receivers have in common?

All opinions are my own and do not necessarily reflect those of Novo Nordisk.

A.  They’re both being studied via mobile tech to create their ethograms.

What’s an ethogram?  I had no idea until I saw this PLOSone paper on using inertial sensors such as accelerometers and gyroscopes to measure the movements and behaviors of dogs, so as to create an ethogram, or collection of behaviors and actions, characteristic of labrador retrievers and Belgian Malinois.  For scientists studying the behavior and action patterns of different species, building an ethogram is essential to studies of animal behavior.  Without a standardized, objective catalog of behaviors, it can be easy for the perspectives of the observer to get in the way.  And it can make comparisons of data among different researchers (or coaches, as we’ll discuss in a bit) difficult.

And just as I was mulling over how that study shows the power of technology for behavioral research, the latest issue of Sports Illustrated came in the mail and I read a short, fascinating article by Tim Newcomb about how eight NFL teams have signed up with the company Catapult to integrate small GPS sensors into practice and game uniforms. This data allows a more accurate, granular and comprehensive view of how different receivers, for example, play the game.  Basically, building the receiver ethogram (using the term rather loosely). Sadly, this article is currently only in the print issue and not online that I can find.  But it’s at newsstands now.  You can go pick one up.  I’ll wait.

So let me delve into each of these articles a little more.

Continue reading

Mobile tech and the challenge and opportunity for design

All opinions are my own and do not necessarily reflect those of Novo Nordisk.

Lots of things bother me when I’m driving my car.  But recently I’ve found that the number one thing making me bang my head on the steering wheel is when I’m behind a car at a stoplight, the light changes…and nothing happens. Many times I can see the back of the driver’s head, which is almost always tilted down, and I’m going to go out on a limb and suggest that this is because the person in front of me is texting or surfing the web.

A recent report from the Seattle Times that one in twelve active drivers in Washington State was observed using a cellphone while driving confirms how widespread this problem is. The thing is, the problem is, the opportunity is, this is only one small symptom of how our world is changing and becoming full of distraction. I may be irritated when the person in front of me isn’t paying attention but at the same time I’m continually impressed by the immediacy and mobility of technology.  In some ways, much as I might rant about people who are texting while driving, I also understand why. They do it because it’s easy, simple, and feeds our hardwired desire for rapid positive feedback.

So what can be done? Continue reading

Biopharma should choose targets using a baseball-style draft

All opinions my own and do not necessarily reflect those of Novo Nordisk

I was sitting around last evening checking out how the end of my fantasy baseball season is working out (for the record, first out of ten in one league and fourth in the league I wrote about here) and I starting thinking again about the parallels between baseball and drug development (which I previously wrote about here and here for example, and also Stewart Lyman has a nice piece on a similar theme here). And it hit me that there’s another way in which biopharma could take a  page from baseball: fantasy and Major League Baseball both.

Biopharma could institute a draft for drug targets.  And to explore this I’m going to employ the time-honored, not to mention trite and artificial, format of a series of questions and answers.

Continue reading