EG10746 - yuchenj

From BioE80 Boot
Jump to: navigation, search

Author Information

starrj | Starr Jiang

Basic Information

  • ID: EG10746
  • Name: polA: DNA polymerase I
  • Organism: E. coli
  • Description: The three main functions of DNA polymerase 1 are DNA binding (InterPro), DNA-directed DNA polymerase activity (InterPro), and nuclease activity (UniProtKB). It duplicates existing DNA by using the parental DNA as a template to synthesize new strands (QuickGO).
  • DNA Length: 2784 base pairs.
  • DNA sequence:

ATG AAA ACT AAA ATC CTG GTA GTC GAC GGT AAT TCG CTG ATT TTC AGA GCC TTC TAC GCG ACT GCT TAC TCT CCG AAT ACG AGT CTG CTT AAG ACG AAA TCA GGC GTT TTA ACA AAT GCA GTA TAC AGT TTC ATA AAT ATG TTA TTG TCT GTC ATC CAT CAA AGA GGC CCT TAT GAT CAC ATT TTA ATC GCA TTT GAT AAG GGA AAG AAG ACT TTT CGT CAC GAC CTG CTG TCG GAT TAC AAA GCG AAC AGA ATA AAA ACA CCC AAT GAA CTT GTT GAG CAG TTC AGT GTT GTG CGT GAG TTT CTG ACT AAA GCT AAC ATT CAA TGG TTC GAA CAA GAG AAT ATC GAG GCG GAC GAT ATA GTA GGG TCT ATA TGC AAA TAT GCG GAG AAG CAA TTC GAC AAC TTA CAG GCT GAA ATT TTA AGC TCG GAC AAG GAC ATG TAT CAG TTG ATC ACC AAC AAA GTG ATA TGC TTG AAC CCA GTG CAG GGG GTC AAC GAA TTA GAG GAA GTA GAT ACT AAT AAA TTG TTC GAG AAA TGG CAA ATA TTA CCA AAC CAA GTA CCG GAT TAT AAG GCT ATT GTT GGG GAC TCG TCA GAT AAC CTT AAA GGC GTA AAC GGA ATA GGG CAG AAA GGA GCT ATC AAA TTG ATA CAG CAA TAT CAA AAT TTG GAG AAT ATT TAT AAT AAC TTA GAG CAA ATC AAG GGT GCA ATC AAA ACA AAA CTG GAG CAG GAC AAG AAG ATG GCC TTT CTG TGT AAG GAT CTG GCC ACG ATA AAG ACG GAC GTG ATC CTG GAA AAC TTC TCT TTT AAA AAG CTT GAT TTC AAC GTT GAT AAC ATA TAC GAG TTT TTG AAC AAG TAT GAG ATG TAT TCT TTA AAG AAG CGG TTC ACG AAC ATA CTT AAT TTA GAT TTC AAC CCG TAT CAA AAC AAG AAA CAG AAC TTG GAC GTT AAG ATC ATC AAC AGC TGG AGC AAA GAT TAT GAA GAC TCC ATT AAC TAC TTA TAT GTC GAG TCT TTA GAA GAG GAT TAC CAC AAG GAC AAA ATC ATA GGC ATC GGA ATT TCG AAT AAT AAG GGT AAT TTC TAC TTG GAC TTC AAG AAC AAG GCG CAG CAG TTG TCC TTC TTT GAG GAC ACT ACT CTG TCT AGC ACG GAC TCA TTA TTT GAA GAG TTT TTG AAC AAC TCG AAC TTA AAG AAA TAC ACC TAT GAT ATT AAG AAA ACA ACT TAT TTG CTG AAG AAC CAT AAG TAT AAC GTA CTG GCC AGT AAT TTT GAC TTT GAC TTT ATG GTA GCT TGC TAC TCT CTT AAT GCA AAC GTC GTA TCT GAT CTT TCT AAC CAG ATA AAA TTG GTC GAC AAC CTT ATA GAG CTG GAG ACT ATC GAT CAG ATT TTT GGG AAA GGG GTG AAA AAA AAC CCC GAT ATA GAC TTG GAT ATA AAA TCC AAG TAT ATC TCC AAG AAG GCA TAT TTG TTA AAA AAA TAT TCA GAC CAA CTG ATA GAG CAG CTT AAA CAG ACG AAT ACT TAC GAC CTT TAC TTA AAA ATA GAC CAC CCA TTG ATC GAA GTA CTG TAC GAT ATA GAG GTT CAA GGG ATT TTA ATA GAT AAA GAA CAG CTT AAG CTT CAG ACC CAG CAG ATA CTG AAA AAA ATC AAT CAC ATT GAG GGG CAA ATG AAG ATA CTT GTC GCA GAA GAA ATC GAC AAT AAC TTC AAT TTC AGT TCT CCA AAA CAA ATT CAG GAG TTG CTG TTC GAC AAG CTG AAA CTG CCA AAT CTT GAA AAA GGC ACG ACA AGT AAA GAG GTA TTG GAA AAA CTG ATA ACT TAC CAT CCA ATC ATA AAC CTT CTG TTG GAA CAC CGG AAA TAT ACA AAA CTT TAC ACA ACA TAC CTG AAA GGT TTT GAA AAA TTC ATA TTT GAT GAC TCC AAA GTG CAC ACT ATT TTT AAT CAT ACG TTA ACT AAC ACC GGT AGA CTT TCA TCG TCT TAC CCT AAC ATC CAG AAT ATT AGT ATT CGT GAT AAT GAA CAG AAG GAA GTA CGC AAG ATC TTC ATT ACG AAC AAC AAC AAG ACA TTT CTT TCG TAC GAC TAC TCG CAG ATC GAG TTG CGT GTA TTA GCT CAG ATG AGC AAA GAG ACA AAC TTG ATT AAC GCC TTT AAC CAA AAC GCT GAT ATA CAC TTA CAA GCA GCG AAG CTT ATT TTT AAC CTG TCA GAC GAC CAG ATT ACA AGC GAG CAG CGT AGA ATT GCG AAG GTA TTT AAT TTT GGA ATA TTG TAC GGC TTG ACC GAT TTT GGC CTG GCG TCA GAT TTA AAC ATA TCT GTC AAC CAA GCA AAG CAG ATG ATA AAA GAT TAC TAT TCC GCT TTT CCC AGT TTG TTA GAG TTT AAG GAA AAG CAG GTC GAG ATA GCG ACT AGC CAA GGT TAC ATC ACT ACT CTT AGC AAC CGG CGG AGA TAT ATC AAT GAG TTA AAC AGT ACT AAT CAC AAC ATT AGA CAG TTC GGT AAG CGG ATT GCA GTG AAC ACC CCG ATT CAG GGC ACA GCA TCG GAC ATC TTA AAA GTG GCC ATG ATT TCT ATT TAC AAA AAG TTA AAA GAA CAA AAT CTT GAT GCA CGT ATC GTG TGT CAG ATT CAT GAT GAA ATT ATT CTT GAA GTA GAT GAC AAC CAG CTT GAA CAA ACA AAA CGC ATC GTT GTA TCC GAG TTG GAA AAC GCT TTG GAA AAG TTG TTC CTG GAT TTG AAC ATT AAG GAG CAA GTT GTG GTT AAG CTT AAA GTA GGT GAA TCG GTG GGT AAG ACG TGG TTT GAC TTA AAA TAG

  • Amino Acid length: 943 amino acids.
  • Amino Acid sequence:

MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLIMQYKPTHAAV VFDAKGKTFRDELFEHYKSHRPPMPDDLRAQIEPLHAMVKAMGLPLLAVSGVEADDVIGT LAREAEKAGRPVLISTGDKDMAQLVTPNITLINTMTNTILGPEEVVNKYGVPPELIIDFL ALMGDSSDNIPGVPGVGEKTAQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNK EVAYLSYQLATIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEAGKWLQAK GAKPAAKPQETSVADEAPEVTATVISYDNYVTILDEETLKAWIAKLEKAPVFAFDTETDS LDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEKALKVGQN LKYDRGILANYGIELRGIAFDTMLESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKG KNQLTFNQIALEEAGRYAAEDADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRI ERNGVKIDPKVLHNHSEELTLRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLK KTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTYTDKLPLMINPKTGRVHTSYH QAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRD KGLLTAFAEGKDIHRATAAEVFGLPLETVTSEQRRSAKAINFGLIYGMSAFGLARQLNIP RKEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAE RAAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDVDAVAKQI HQLMENCTRLDVPLLVEVGSGENWDQAH

Function and Homologs

  • Product: DNA polymerase I, 5' --> 3' polymerase, 5' --> 3' and 3' --> 5' exonuclease
  • Closest homologous proteins: The top (max three) homologous proteins to this protein, as identified by BLAST searches.
    • DNA polymerase I [Shigella sp. SF-2015]/1902/100%/0.0/99%, [1]
    • DNA polymerase I [Shigella boydii]/1901/100%/0.0/99%, [2]
    • DNA polymerase I [Shigella sonnei]/1900/100%/0.0/99%, [3]
  • Equivalent E. coli / JCVI functional protein: [4]

Expression

  • Expression Level: medium
  • Expression Level Hypothesis: This gene is necessary to the creation of newly created DNA, and DNA duplication is a major component of organism growth.
  • Expression Level References and Description: Escherichia coli proteome dataset, this is a reliable source provided by the specification instructions
  • Expression Time: late
  • Expression Level Hypothesis: Since the gene plays an integral role in DNA replication, it is not a part of the central dogma. DNA must first exist before the gene comes into play.
  • Expression Time References and Description: Did not use a source, but researched the role of DNA polymerase on Wikipedia and EcoSyc to determine that is a part of cell division, which is not a part of the central dogma.

Gene Context

  • Other Components:
  • Possible Dependencies: nicked circular duplex DNA [5]
  • Process: 3'-5' exonuclease activity [6]
    • Inputs: Deoxynucleoside triphosphate, DNA(n) [7]
    • Outputs: diphosphate, DNA(n+1) [8]