EG11067

From BioE80 Boot

Author Information

Luísa Galhardo

Basic Information

  • ID: EG11067
  • Name: ValS
  • Organism: E. coli
  • Description:

This protein is an enzyme, valyl-tRNA synthetase (ValRS), that catalyzes the reaction a tRNAval + L-valine + ATP → an L-valyl-[tRNAval] + AMP + diphosphate, covalently linking L-valine to its specific tRNA molecule at the cost of ATP hydrolysis (EcoCyc). Accordingly, the enzyme binds ATP in addition to valine. ValRS can inadvertently activate amino acids similar to valine, such as threonine, so it has a "posttransfer" editing activity that hydrolyzes mischarged Thr-tRNA(Val) in a tRNA-dependent manner (UniProt). ValRS has two distinct active sites: one for aminoacylation and one for editing. The misactivated threonine is translocated from the aminoacylation site to the editing site (UniProt). The charged tRNA produced by this enzyme is then used by the ribosome to translate mRNA into an amino acid sequence.

  • DNA Length: 2856 base pairs.
  • DNA sequence:

ATG GAG AAA ACT TAT AAC CCT CAG GAT ATC GAA CAA CCA TTG TAC GAA CAC TGG GAG AAA CAA GGA TAT TTT AAG CCA AAT GGT GAC GAG AGT CAG GAAAGT TTT TGT ATT ATG ATT CCC CCT CCT AAC GTC ACG GGG TCC TTG CAT ATG GGT CAC GCT TTC CAA CAG ACT ATC ATG GAC ACA ATG ATC CGT TAC CAGCGC ATG CAA GGG AAG AAC ACG CTG TGG CAG GTG GGA ACA GAC CAC GCG GGG ATC GCC ACT CAG ATG GTA GTC GAA CGT AAG ATT GCT GCC GAG GAG GGAAAA ACG CGT CAC GAC TAT GGC CGT GAG GCG TTT ATT GAT AAG ATC TGG GAG TGG AAA GCG GAA TCG GGG GGC ACA ATC ACC CGC CAG ATG CGC CGT TTAGGG AAT AGT GTA GAT TGG GAA CGC GAA CGC TTT ACG ATG GAC GAG GGG CTG TCG AAC GCA GTC AAA GAG GTT TTC GTG CGT CTG TAC AAA GAG GAT TTGATC TAT CGC GGT AAA CGT CTT GTC AAT TGG GAT CCC AAG TTA CGT ACG GCA ATT TCC GAC CTG GAG GTA GAG AAC CGC GAG TCA AAA GGT TCT ATG TGGCAC ATC CGT TAC CCA TTG GCA GAC GGG GCA AAG ACA GCA GAC GGG AAG GAC TAT CTG GTA GTG GCC ACC ACG CGC CCG GAA ACT TTA TTG GGT GAT ACTGGA GTA GCA GTG AAT CCC GAA GAT CCT CGC TAT AAG GAT TTG ATT GGG AAG TAT GTT ATC TTA CCC TTG GTC AAC CGT CGC ATC CCC ATT GTT GGC GATGAA CAC GCT GAT ATG GAA AAA GGA ACT GGG TGC GTA AAG ATC ACT CCA GCA CAC GAT TTT AAT GAT TAC GAA GTG GGG AAG CGT CAC GCC CTG CCC ATGATT AAC ATC TTG ACT TTC GAC GGC GAT ATC CGC GAA TCT GCA CAA GTA TTT GAT ACG AAG GGC AAT GAG AGC GAC GTG TAC AGT AGT GAG ATT CCA GCCGAG TTT CAA AAG CTT GAG CGT TTC GCG GCA CGT AAG GCT GTT GTG GCT GCT GTC GAC GCT CTT GGG TTG TTA GAG GAA ATC AAA CCG CAT GAT TTA ACTGTG CCC TAT GGT GAC CGC GGT GGA GTG GTA ATT GAA CCA ATG CTT ACG GAC CAA TGG TAT GTA CGC GCG GAC GTG TTA GCT AAA CCA GCA GTA GAA GCCGTG GAG AAC GGG GAC ATT CAG TTC GTG CCA AAA CAG TAT GAA AAC ATG TAC TTT AGT TGG ATG CGT GAC ATC CAA GAT TGG TGC ATC AGC CGT CAA TTGTGG TGG GGG CAT CGT ATT CCC GCT TGG TAC GAC GAG GCC GGG AAC GTC TAC GTA GGC CGC AAT GAA GAT GAA GTC CGC AAG GAG AAC AAT CTG GGC GCGGAC GTT GTC TTG CGC CAG GAT GAG GAT GTG CTG GAC ACA TGG TTT AGC AGT GCG CTG TGG ACT TTT AGT ACA TTG GGA TGG CCT GAA AAT ACC GAT GCACTT CGT CAA TTC CAT CCA ACT TCT 
GTT ATG GTT TCG GGC TTC GAC ATT ATT TTC TTT TGG ATC GCC CGT ATG ATC ATG ATG ACG ATG CAC TTT ATC AAGGAT GAG AAT GGA AAA CCA CAA GTT CCA TTT CAC ACC GTC TAT ATG ACC GGC CTT ATC CGT GAC GAC GAG GGC CAA AAA ATG AGC AAA TCT AAG GGC AATGTC ATC GAT CCA TTG GAT ATG GTC GAT GGT ATT TCT TTG CCC GAA CTT CTG GAG AAG CGC ACC GGC AAC ATG ATG CAA CCG CAG TTG GCG GAC AAA ATCCGT AAG CGT ACT GAG AAA CAA TTC CCG AAC GGA ATC GAA CCG CAT GGA ACC GAT GCT CTG CGT TTT ACT TTA GCT GCA CTT GCT AGT ACC GGT CGT GACATT AAT TGG GAC ATG AAG CGT CTG GAG GGC TAT CGT AAT TTC TGT AAC AAG TTG TGG AAT GCC AGC CGC TTT GTG CTT ATG AAC ACT GAA GGG CAA GATTGT GGC TTT AAC GGA GGG GAA ATG ACA CTT TCG TTG GCA GAT CGC TGG ATC CTT GCA GAA TTC AAC CAA ACT ATC AAG GCA TAC CGC GAA GCA CTG GATTCC TTC CGC TTC GAT ATT GCA GCA GGT ATT CTG TAC GAG TTC ACT TGG AAC CAA TTT TGC GAC TGG TAC TTA GAA TTA ACA AAA CCA GTT ATG AAC GGCGGT ACT GAA GCA GAG TTG CGT GGG ACT CGC CAC ACA CTT GTA ACG GTA CTG GAG GGA CTG TTG CGT TTG GCG CAT CCG ATC ATT CCG TTT ATT ACC GAGACA ATC TGG CAA CGC GTC AAA GTC TTA TGT GGC ATC ACG GCA GAC ACG ATC ATG TTA CAA CCT TTT CCA CAA TAC GAC GCA AGC CAG GTA GAC GAA GCAGCG CTT GCG GAT ACC GAG TGG TTA AAA CAG GCT ATC GTT GCA GTT CGC AAC ATC CGC GCA GAA ATG AAC ATT GCT CCA GGA AAA CCG TTG GAG TTA CTTCTG CGC GGG TGC TCC GCC GAC GCC GAG CGT CGC GTA AAC GAG AAT CGC GGG TTT TTA CAG ACG TTG GCA CGC CTG GAG TCT ATT ACC GTG CTT CCA GCCGAC GAT AAA GGA CCA GTG AGC GTC ACG AAG ATC ATT GAT GGT GCT GAG CTG CTG ATT CCT ATG GCC GGA CTT ATT AAC AAA GAG GAT GAA TTA GCT CGCTTA GCA AAG GAG GTT GCC AAA ATT GAG GGC GAG ATC AGC CGT ATT GAG AAT AAA CTT GCA AAC GAA GGC TTT GTG GCC CGT GCT CCT GAA GCG GTG ATCGCT AAG GAG CGC GAG AAG CTT GAA GGC TAC GCC GAG GCG AAA GCG AAA CTT ATC GAA CAG CAG GCG GTA ATC GCG GCA CTT TAG

  • Amino Acid length: 951 amino acids.
  • Amino Acid sequence:

MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EDEVRKENNL GADVVLRQDE DVLDTWFSSA LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKIE GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L

Function and Homologs

  • Product: Valine-tRNA ligase
  • Closest homologous proteins: The top (max three) homologous proteins to this protein, as identified by BLAST searches. Each entry appears to list BLAST score/query coverage/E-value/percent identity, followed by the UniProt accession.
    • Valine-tRNA ligase [Klebsiella oxytoca], 5,011/100%/0.0/100.0%, A0A181WXI9
    • Valine-tRNA ligase [Shigella sp. FC2928], 5,011/100%/0.0/99.9%, A0A1E2VN84
    • Valine-tRNA ligase [Shigella sonnei], 5,011/100%/0.0/99.9%, A0A1H0QK90

Expression

  • Expression Level: medium
  • Expression Level Hypothesis:

This protein is required for translation, so it probably needs to be reasonably abundant in the cell. However, a single enzyme molecule can charge many tRNAs with valine in succession, and valine is only one of the 20 amino acids, so the expression level does not need to be the highest.

  • Expression Level References and Description:

Similar proteins in File:EcoliProteomicExpressionData.xlsx, such as cysS (cysteinyl-tRNA synthetase), also have a medium expression level, which suggests that medium is appropriate for this protein as well.

  • Expression Time (when in the lifecycle of our organism the gene should be expressed): early
  • Expression Time References and Description:

According to the Wikipedia article on aminoacyl-tRNA synthetases, these proteins are key to translation and can also play a role in editing incorrectly charged tRNA molecules. This leads me to believe that the protein is required early, but not necessarily at the very beginning: it is not needed for transcription, but it is needed before any protein can be translated.

Gene Context

  • Other Components:

Valine-tRNA ligase does not require any other proteins to perform its function, so it has no other components in the strict sense. However, other tRNA ligases, such as LysS, are also needed to make translation possible.

  • Possible Dependencies:

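This ligase depends on the presence of valine, because valine is the amino acid it attaches to the tRNA. It also needs the other substrates of the reaction, tRNAval and ATP, for the charged tRNA to be formed.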

  • Process: charging tRNA
    • Inputs: tRNAval, L-valine, ATP
    • Outputs: L-valyl-[tRNAval], AMP, diphosphate
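The process and its dependencies can also be written down as a small data structure. This is only an illustrative sketch: the `Process` class, its `missing_inputs` method, and the `CHARGING` constant are hypothetical names introduced here, and the inputs and outputs are copied from the list above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Process:
    """A cellular process described by what it consumes and what it produces."""
    name: str
    inputs: tuple
    outputs: tuple

    def missing_inputs(self, available):
        """Return the inputs that are not in the set of available molecules."""
        return [molecule for molecule in self.inputs if molecule not in available]


# Inputs and outputs as listed for this protein (hypothetical constant name).
CHARGING = Process(
    name="charging tRNA",
    inputs=("tRNAval", "L-valine", "ATP"),
    outputs=("L-valyl-[tRNAval]", "AMP", "diphosphate"),
)

# Without valine the reaction cannot proceed.
print(CHARGING.missing_inputs({"tRNAval", "ATP"}))  # ['L-valine']
```

An empty list from `missing_inputs` means every listed dependency is present.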

Construct

We will handle this - not part of your assignment

  • Synthesis Score: The synthesis score: 1, 2, 3
  • Predicted Translation Rate: Prediction of construct translation rate from the RBS calculator
  • Design Notes and Details:
  • GenBank File: A link to the GenBank file. file