Genome Research 2011-Apr

Discovery and annotation of small proteins using genomics, proteomics, and computational approaches.

Xiaohan Yang
Timothy J Tschaplinski
Gregory B Hurst
Sara Jawdy
Paul E Abraham
Patricia K Lankford
Rachel M Adams
Manesh B Shah
Robert L Hettich
Erika Lindquist

Abstract

Small proteins (10-200 amino acids [aa] in length) encoded by short open reading frames (sORFs) play important regulatory roles in various biological processes, including tumor progression, stress response, flowering, and hormone signaling. However, ab initio discovery of small proteins has been relatively overlooked. Recent advances in deep transcriptome sequencing make it possible to efficiently identify sORFs at the genome level. In this study, we obtained ~2.6 million expressed sequence tag (EST) reads from the Populus deltoides leaf transcriptome and reconstructed full-length transcripts from the EST sequences. We identified an initial set of 12,852 sORFs encoding proteins of 10-200 aa in length. Three computational approaches were then used to enrich for bona fide protein-coding sORFs from the initial sORF set: (1) coding-potential prediction, (2) evolutionary conservation between P. deltoides and other plant species, and (3) gene family clustering within P. deltoides. As a result, a high-confidence sORF candidate set containing 1469 genes was obtained. Analysis of the protein domains, non-protein-coding RNA motifs, sequence length distribution, and protein mass spectrometry data supported this high-confidence sORF set. In the high-confidence sORF candidate set, known protein domains were identified in 1282 genes (higher-confidence sORF candidate set), out of which 611 genes, designated as highest-confidence candidate sORF set, were supported by proteomics data. Of the 611 highest-confidence candidate sORF genes, 56 were new to the current Populus genome annotation. This study not only demonstrates that there are potential sORF candidates to be annotated in sequenced genomes, but also presents an efficient strategy for discovery of sORFs in species with no genome annotation yet available.
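
The abstract does not say how the initial sORF set was called from the reconstructed transcripts, so the following is only an illustrative sketch of the general idea: scan a transcript in all six reading frames and keep open reading frames whose protein product falls in the 10-200 aa window. The function names (`find_sorfs`, `reverse_complement`), the requirement of an ATG start codon, and the use of the three standard stop codons are assumptions made for this example, not details taken from the study.

```python
# Hypothetical sketch: enumerate ATG..stop ORFs encoding 10-200 aa.
# Assumptions (not from the paper): ATG start, standard stop codons,
# both strands scanned, first ATG in each open stretch is used.

STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sorfs(seq, min_aa=10, max_aa=200):
    """Return (strand, start, end, aa_length) tuples for candidate sORFs.

    Coordinates are 0-based, end-exclusive, and include the stop codon.
    For the "-" strand they refer to the reverse-complemented sequence.
    aa_length counts the start methionine but not the stop codon.
    """
    seq = seq.upper()
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None  # position of the first ATG in the current open stretch
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    aa_len = (i - start) // 3
                    if min_aa <= aa_len <= max_aa:
                        hits.append((strand, start, i + 3, aa_len))
                    start = None
    return hits


if __name__ == "__main__":
    toy_transcript = "ATG" + "GCT" * 11 + "TAA"  # Met + 11 Ala + stop
    for hit in find_sorfs(toy_transcript):
        print(hit)
```

A length filter of this kind only yields candidates; the enrichment steps named in the abstract (coding-potential prediction, cross-species conservation, and gene family clustering) would still be needed to separate likely protein-coding sORFs from spurious ones.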
