Protein Data Bank Deposits Are Life’s Building Blocks

• February 13, 2012 • 4:00 AM

A four-decade project to catalog the basic structures used to build life pays dividends for everything from new drugs to Bjork’s performances.

Biology’s newest knowledge, fused with the special effects of The Hobbit or Harry Potter films: that’s what’s in store from a stunning new cinematic field of biomedical animation. Catch a glimpse in this video, The Inner Life of a Cell, which might have made biologists of us all had we seen it earlier in our lives. It offers an unprecedented, scientifically accurate dramatization of how cells function, sense their surroundings and respond to external stimuli in mind-blowing moving imagery. It is part of a continuing animation series created by Xvivo, a Connecticut scientific animation firm, for future biologists now studying at Harvard University. Expect more inspired animations as teaching tools, in video games and on Hollywood screens, the fruit of ever-improving software and our “golden age of biology.”

And although Helen M. Berman had no direct hand in Inner Life, this kind of biological animation is only possible because of a critical scientific accomplishment, the Protein Data Bank, she helped create 41 years ago. Currently the director of the Research Collaboratory for Structural Bioinformatics at Rutgers University in New Jersey — which serves as a central custodian of the PDB — she is the sole still-active professional among the founders.

Far more than just providing the source material for illuminating entertainment, the Protein Data Bank is the single most important global repository of virtually everything science has so far discovered about nucleic acids and proteins — the “tiny molecular machines” that carry out critical functions for cells, including just keeping them alive.

Another way to think of it is as a kind of global root cellar where all the basic biological building blocks sit neatly on shelves. These are the three-dimensional atomic structures of biologically important molecules that range from bits of DNA to complex machines like the ribosome, which makes proteins from amino acids.
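For readers curious what one of those "shelved" structures looks like in practice: in the classic PDB text format, each atom is one fixed-width line carrying its name, its residue and its x, y, z position. The short Python sketch below reads such a line. The column ranges follow the published wwPDB file format guide, but the function name, the `Atom` type and the sample line are illustrative assumptions, and the coordinates are made up rather than copied from any real deposit.

```python
from typing import NamedTuple


class Atom(NamedTuple):
    """A hypothetical container for the fields this sketch extracts."""
    serial: int
    name: str
    residue: str
    chain: str
    residue_number: int
    x: float
    y: float
    z: float


def parse_atom_record(line: str) -> Atom:
    """Parse one fixed-width ATOM line using wwPDB column positions."""
    if not line.startswith("ATOM"):
        raise ValueError("not an ATOM record")
    return Atom(
        serial=int(line[6:11]),           # columns 7-11: atom serial number
        name=line[12:16].strip(),         # columns 13-16: atom name
        residue=line[17:20].strip(),      # columns 18-20: residue name
        chain=line[21],                   # column 22: chain identifier
        residue_number=int(line[22:26]),  # columns 23-26: residue sequence number
        x=float(line[30:38]),             # columns 31-38: x coordinate
        y=float(line[38:46]),             # columns 39-46: y coordinate
        z=float(line[46:54]),             # columns 47-54: z coordinate
    )


# Illustrative line with made-up values, not taken from a real PDB entry.
SAMPLE_LINE = "ATOM      1  N   MET A   1      27.340  24.430   2.614  1.00  9.67           N"

atom = parse_atom_record(SAMPLE_LINE)
print(atom.name, atom.residue, atom.chain, (atom.x, atom.y, atom.z))
```

A full structure is simply thousands of such lines, which is why the same deposit can feed a drug-design program and an animation studio alike. Coordinates in this format are given in angstroms.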

“If we know the structure, we will understand the function — that’s the paradigm. The sequence [of amino acids] goes to structure, goes to function — so, if you know [for example] the structure of hemoglobin, you can understand better how hemoglobin carries oxygen,” explains Berman. “These are the molecules of life that are found in all organisms including bacteria, yeast, plants, flies, other animals, and humans. Understanding the shape of a molecule helps to understand how it works.”

An indispensable source of raw data for furthering our understanding of biology, from cells to our bodies, the data bank has been critical for aiding practical advances. Think drug discovery (pharmaceutical firms periodically download the entire data bank) or the launching of whole new fields of study, such as computational structural biology, also known as structural bioinformatics.

Berman is excited over what might be called “second generation use” of the bank’s data, especially unexpected roles in the arts, such as the cinema or live performances. One example is the Icelandic chanteuse Bjork’s live Biophilia performance (and accompanying apps), currently in a 10-day North American debut run in New York City.

Writing in the Journal of Biocommunication, molecular biologist David S. Goodsell, who hosts the “Molecule of the Month” feature on the bank’s website, called the bank “an amazing resource that is waiting to be tapped for all manner of educational and artistic applications.”

The two most common methods used in determining molecular structures, which means visualizing at a level too small even for the most powerful microscope, are X-ray crystallography (also called X-ray diffraction) and nuclear magnetic resonance spectroscopy. The cost of figuring out some atomic structures (say, getting a 3-D rendering of a protein structure, which is like a necklace of different-color beads) has decreased as scientific techniques have improved. But while an average cost may lie between $50,000 and $250,000 per structure, complicated structures can still cost $1 million to $2 million each.

As for the value of the data bank’s “deposits,” current estimates range up to $8 billion. Meanwhile, the U.S. government contributes $6 million a year, the lion’s share of the data bank’s annual budget — which is slated to remain flat for the foreseeable future. That imposes inventiveness on the part of the bank. “We have to figure out ways of improving our infrastructure so that we can actually keep up with the data,” Berman says matter-of-factly.

When crystallographers (scientists who study how atoms are arranged in solids) started the PDB at Brookhaven National Laboratory and Cold Spring Harbor Laboratory (both on Long Island) in the 1970s, there were just seven atomic structures in the bank. Today, it catalogs more than 78,000 structures, still just a fraction (roughly 0.4 percent) of the 20 million unique sequences estimated to exist. But it’s constantly growing. “In 1977, which is kind of when I started in this, there were 77 structures in total,” said University of California, San Diego’s Philip E. Bourne, the associate director of the PDB. “Now we get almost that many or twice that many in a month.”

Anyone in the world can browse the bank’s vaults, and download whatever they want to work with on their own. Every month some 150,000 visitors come to the site; last year, those visitors hit “download” 250 million times. Based on an early commitment to free and unfettered access to all, use of the PDB is free of charge. “People put data into the PDB, then they expect to have the data available for free,” says Berman. “After all, if it weren’t for the people who put the data in, there wouldn’t be any PDB.”

Open access is the prevailing scientific model today, but it was not the obvious choice in the 1970s. The chemistry community chose it anyway. “It’s a cultural-sociologic thing, the evolution of those rules, it’s completely community based,” says Berman, adding with understated pride: “We were ahead of our time.”

A critical phase in this evolution was the onset of the AIDS epidemic 30 years ago, when respected figures in the scientific community argued it was morally indefensible for the findings of any publicly funded research related to the disease not to be released.

Today all major scientific journals in certain fields require, almost as a condition of being published, that the basic data underlying the papers be deposited with the Protein Data Bank. This makes sense not only because most of the data is still largely publicly funded, but because that basic data may prove useful to others.

“[The PDB] really set a precedent for data sharing and now there are depositories for all sorts of different information,” says Heather Carlson, a medicinal chemistry professor at the University of Michigan, Ann Arbor, who makes frequent use of the PDB. “No individual lab can have enough information on sequences or on proteins needed to solve the problems we face — a worldwide repository of information, like the PDB, is definitely needed.” Even the most well-intentioned sharing — requesting and then waiting for others to send you the data you need — among colleagues spread out all over the world is too impractical.

The United States, which for a long time generated the lion’s share of new knowledge in the data bank, accounts for about 50 percent of new research added today. The data bank has gone global, with key collaborators at the European Bioinformatics Institute (UK) and the Protein Data Bank Japan, which, together with the Biological Magnetic Resonance Data Bank at the University of Wisconsin-Madison, form the Worldwide PDB, the overseeing organization.

Swelling data demands efficient curation. This means not only ensuring that the deposited structures are high quality, but also that there is an ever-more-handy set of tools to make the data useable by that widening base of users. “Our responsibility is to make sure the data are in good shape so that when people take [it] they can count on it,” says Berman. “People call this the gold standard of structures.”

Ken Stier
Ken Stier got started as a reporter at community newspapers, independent film and television industry publications and in public affairs TV in New York in the 1980s. After attending Columbia's School of International Affairs, he moved to Southeast Asia in time for the final Vietnamese troop withdrawal from Cambodia. From bases in Bangkok, Hanoi and Kuala Lumpur, he worked for wire services, newspapers and magazines, including Time and Newsweek. Until recently, he was a features writer at CNBC.com, covering energy and the financial crisis that got him laid off. He now freelances from New York, where he has covered and worked inside the United Nations, written policy papers for think tanks, conducted proprietary research for boutique consultancies, and taught at university.

Copyright © 2014 by Pacific Standard and The Miller-McCune Center for Research, Media, and Public Policy. All Rights Reserved.