Skip to Content

Archived discussions are read-only. Learn more about SAP Q&A

2nd Bioinformatic Coding Problem: genes into protein primary structures

On this wiki page:

https://wiki.sdn.sap.com/wiki/display/EmTech/Bio-InformaticBasicsInRelationtoScriptingLanguages

Post 3 (Protein Primary Structures from a Scripting Language Point of Vew ) presents the background needed to understand how to translate a protein gene like this:

atgaacaaacagatcgatctacccattgctgatgtacaaggctcgttggacacaagacat

attgccatcgacagagtaggaatcaaagcgatccggcatcctgtcgtggtggcagataaa

ggcggtggctcccagcataccgtggcgcaattcaatatgtacgtcaatctgccccacaac

ttcaagggaacccacatgtctcgctttgtcgagatactgaacagtcacgagcgcgagatt

tcggtcgaatcgttcgaggaaatcctgcgttccatggtcagcagactggaatcggattcc

ggacatatcgaaatggccttcccttacttcatcaataaatctgcacctgtctcgggtgta

aaaagcctgctggactacgaagtgacatttatcggtgagatcaaacacggcaatcaatat

agttttaccatgaaggtaatcgtccctgttaccagcctgtgcccctgctccaaaaaaata

tccgactacggtgcacacaaccagcgttcacatgtcacgatttcggtgcgtaccaatagt

ttcatctggatcgaggacatcatcagaatcgcggaagagcaggcctcatgcgaactgtac

ggcctgctgaaacgcccggatgaaaaatatgttacggaaagagcttacaacaatccgaaa

tttgtcgaagatatcgtccgcgatgtggccgaagtactcaaccacgatgaccgtatagac

gcctatatcgttgaatcagaaaatttcgaatccatacacaaccactctgcctacgcattg

atcgaacgagacaaaagaatacgataa

into a protein primary structure like this:

MNKQIDLPIADVQGSLDTRHIAIDRVGIKAIRHPVVVADKGGGSQHTVAQFNMYVNLPHNFKGTHMSRFV

EILNSHEREISVESFEEILRSMVSRLESDSGHIEMAFPYFINKSAPVSGVKSLLDYEVTFIGEIKHGNQY

SFTMKVIVPVTSLCPCSKKISDYGAHNQRSHVTISVRTNSFIWIEDIIRIAEEQASCELYGLLKRPDEKY

VTERAYNNPKFVEDIVRDVAEVLNHDDRIDAYIVESENFESIHNHSAYALIERDKRIR

using the "standard genetic code":

F: ttt S: tct Y: tat C: tgt

F: ttc S: tcc Y: tac C: tgc

L: tta S: tca *: taa *: tga

L: ttg S: tcg: *: tag W: tgg

L: ctt P: cct H: cat R: cgt

L: ctc P: ccc H: cac R: cgc

L: cta P: cca Q: caa R: cga

L: ctg P: ccg Q: cag R: cgg

I: att T: act N: aat S: agt

I: atc T: acc N: aac S: agc

I: ata T: aca K: aaa R: aga

M: atg T: acg K: aag R: agg

V: gtt A: gct D: gat G: ggtr

V: gtc A: gcc D: gac G: ggc

V: gta A: gca E: gaa G: gga

V: gtg A: gcg E: gag G: ggg

I'd love to have a copy of the necessary translation routine in each of the usual scripting languages - any routines posted in this thread will be added to the above wiki page.

Not what you were looking for? Ask a question