Protein Bioinformatics

Vikram Alva-Kullanja

It is estimated that there are more than a trillion differing proteins existing today. Although this may seem a vast number, the actual diversity of proteins in nature is somewhat limited. Many proteins share detectable similarities in sequence and structure, as they arose by amplification, recombination, and divergence from a basic complement of autonomously folding modules, referred to as domains.

On the diversity of modern proteins

Sequence-based comparison of modern proteins shows that they fall into only about ten thousand domain families, which, based on structural similarity, can be grouped further into one of a thousand folds. Many of these folds were already established at the time of the Last Universal Common Ancestor, a theoretical primordial organism from which all life on earth descended. We are broadly interested in understanding the events that led to the emergence of these first folds and their diversification into the many functional protein families we recognize today.

The MPI Bioinformatics Toolkit

To track evolutionary relationships between proteins, we use bioinformatics tools that establish correlations between sequence and structure similarity. Many of the tools we use, such as the state-of-the-art sequence comparison methods HHblits and HHpred, are integrated into the MPI Bioinformatics Toolkit (http://toolkit.tuebingen.mpg.de), a one-stop, integrative resource for protein bioinformatic analysis, which we develop and maintain. The Toolkit currently includes 36 interconnected in-house and external tools, whose functionality covers the detection of remote homologs, the calculation of multiple alignments, and the annotation of sequence features.

Prokaryotic cell surface proteins

All cells use sophisticated molecular machinery to interact with their environment, but the molecular basis of such interactions remains poorly understood in prokaryotes. To this end, in an HFSP-funded project, we collaborate with Tanmay Bharat (MRC LMB) and Alex Bisson (Brandeis University) to explore the evolution and molecular basis of mechanosensing in archaea. Additionally, we collaborate with Tanmay Bharat on deciphering the evolution and structure of prokaryotic S-layers (two-dimensional paracrystalline sheets that encapsulate many prokaryotic cells).

Go to Editor View