Target selection for structural genomics: A single genome approach

Igor V. Grigoriev, In-Geol Choi

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

We describe our strategy for selecting targets for protein structure determination in context of structural genomics of a single genome. In the course of target selection, we have studied two the smallest microbial genomes, Mycoplasma genitalium and Mycoplasma pneumoniae. To our surprise, we found that only 71 Mycoplasma genes or their orthologues can be considered as easy targets for high-throughput structural studies - far fewer than expected. We discuss the methods and criteria used for target selection and the reasons explaining rarity of easy targets. First, despite the common opinion that protein folds can be predicted for only 30-50% of genes, the number of "truly unknown" structures is less than one-third. Second, due to the different codon usage, two thirds of Mycoplasma proteins cannot be directly expressed in E.coli in high-throughput manner and require substitution by their homologues from other organisms. Third, membrane or large multi-domain proteins are difficult targets because of solubility and size issues and often require identification and structure determination of protein domains. Finally, we propose different approaches to address the difficult targets.

Original languageEnglish
Pages (from-to)349-362
Number of pages14
JournalOMICS A Journal of Integrative Biology
Volume6
Issue number4
Publication statusPublished - 2002 Dec 1
Externally publishedYes

Fingerprint

Genomics
Mycoplasma
Genes
Genome
Mycoplasma genitalium
Microbial Genome
Mycoplasma pneumoniae
Proteins
Codon
Solubility
Throughput
Escherichia coli
Membranes
Substitution reactions
Protein Domains

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

Target selection for structural genomics : A single genome approach. / Grigoriev, Igor V.; Choi, In-Geol.

In: OMICS A Journal of Integrative Biology, Vol. 6, No. 4, 01.12.2002, p. 349-362.

Research output: Contribution to journalArticle

@article{204d847ab826426e9508e3ef96d055c1,
title = "Target selection for structural genomics: A single genome approach",
abstract = "We describe our strategy for selecting targets for protein structure determination in context of structural genomics of a single genome. In the course of target selection, we have studied two the smallest microbial genomes, Mycoplasma genitalium and Mycoplasma pneumoniae. To our surprise, we found that only 71 Mycoplasma genes or their orthologues can be considered as easy targets for high-throughput structural studies - far fewer than expected. We discuss the methods and criteria used for target selection and the reasons explaining rarity of easy targets. First, despite the common opinion that protein folds can be predicted for only 30-50{\%} of genes, the number of {"}truly unknown{"} structures is less than one-third. Second, due to the different codon usage, two thirds of Mycoplasma proteins cannot be directly expressed in E.coli in high-throughput manner and require substitution by their homologues from other organisms. Third, membrane or large multi-domain proteins are difficult targets because of solubility and size issues and often require identification and structure determination of protein domains. Finally, we propose different approaches to address the difficult targets.",
author = "Grigoriev, {Igor V.} and In-Geol Choi",
year = "2002",
month = "12",
day = "1",
language = "English",
volume = "6",
pages = "349--362",
journal = "OMICS A Journal of Integrative Biology",
issn = "1536-2310",
publisher = "Mary Ann Liebert Inc.",
number = "4",

}

TY - JOUR

T1 - Target selection for structural genomics

T2 - A single genome approach

AU - Grigoriev, Igor V.

AU - Choi, In-Geol

PY - 2002/12/1

Y1 - 2002/12/1

N2 - We describe our strategy for selecting targets for protein structure determination in context of structural genomics of a single genome. In the course of target selection, we have studied two the smallest microbial genomes, Mycoplasma genitalium and Mycoplasma pneumoniae. To our surprise, we found that only 71 Mycoplasma genes or their orthologues can be considered as easy targets for high-throughput structural studies - far fewer than expected. We discuss the methods and criteria used for target selection and the reasons explaining rarity of easy targets. First, despite the common opinion that protein folds can be predicted for only 30-50% of genes, the number of "truly unknown" structures is less than one-third. Second, due to the different codon usage, two thirds of Mycoplasma proteins cannot be directly expressed in E.coli in high-throughput manner and require substitution by their homologues from other organisms. Third, membrane or large multi-domain proteins are difficult targets because of solubility and size issues and often require identification and structure determination of protein domains. Finally, we propose different approaches to address the difficult targets.

AB - We describe our strategy for selecting targets for protein structure determination in context of structural genomics of a single genome. In the course of target selection, we have studied two the smallest microbial genomes, Mycoplasma genitalium and Mycoplasma pneumoniae. To our surprise, we found that only 71 Mycoplasma genes or their orthologues can be considered as easy targets for high-throughput structural studies - far fewer than expected. We discuss the methods and criteria used for target selection and the reasons explaining rarity of easy targets. First, despite the common opinion that protein folds can be predicted for only 30-50% of genes, the number of "truly unknown" structures is less than one-third. Second, due to the different codon usage, two thirds of Mycoplasma proteins cannot be directly expressed in E.coli in high-throughput manner and require substitution by their homologues from other organisms. Third, membrane or large multi-domain proteins are difficult targets because of solubility and size issues and often require identification and structure determination of protein domains. Finally, we propose different approaches to address the difficult targets.

UR - http://www.scopus.com/inward/record.url?scp=0038402360&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0038402360&partnerID=8YFLogxK

M3 - Article

C2 - 12626094

AN - SCOPUS:0038402360

VL - 6

SP - 349

EP - 362

JO - OMICS A Journal of Integrative Biology

JF - OMICS A Journal of Integrative Biology

SN - 1536-2310

IS - 4

ER -