Tan0016002 (gene) Snake gourd v1

Overview
NameTan0016002
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCytochrome oxidase assembly protein
LocationLG04: 338143 .. 349131 (+)
RNA-Seq ExpressionTan0016002
SyntenyTan0016002
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTTGAAGGGCTCGAATTTTAGCCCCCATCCTCTTTTCCCCTCTCAATTTCGTTTCCCTTGCCCAAACAAAACAAAACCCAGAAAACTGATTTGTTATTGCTGAAGCTCGGTCCCTTTTTTTTTTCCGATCAGCTGGCGTCGACCACACCAGATTCGCGAAGCTCCGTCCCAGCCACCGGAAGATCTGTCTCAGACACACGTCGTTCAGCCGCCGGACGAAGCTATCGCTGTCGCAGCCACAGCCACGCGAGACACCTGCGATTTGCTCAAAGCTCCAGCCGCCGTGAGCAGTCGCATAGAGCTGCACGTCGATCTGTCGGCCTACCGTTCGCGACCGCAGTCACACTCGCAGGTGTAAGTTTTATTCTGTAAGTTAATTTTAAATTTTATCTTAATTCATTTTAGGAGCCCCACTGAGTTTGGAGTCAATCGAATTCGTTAGATCCCACAGAATTGGTTAAGATTGTGAGTGGATTGAAATTTTGAATTGATTGGTGAACTGAAGCCCAAAACTGTGCATTCCCTTAAACGATATTCACGGTTAGGCGTGATTTCCTCTGGAGCGAGCTCATCCTTGAATGAAGCCGTGAGGTTGGGGTATATATACAGCTTCTTGCAAAGGTCCATGAAGGAAGGCATTGCTTACATCAAGTTGTTTAATCTCGCATTGGGAGGTTACCACCAAAGTGAGGACTATGCGGATTGTAGAAGGCTTAATGGACCAGGTGAGTAGGGATGCAGGTCCATGGACTGCTCGTGTCGGGCCTGATAGGTGGTGGTATCACACCCTTCTCAAAGGAACCGTACGTGACACTCTTGCCTCATACGGCTCCGTCCCGGCATCAGGACCTCCCCTTTCCTTTGACCAACGGGTGGGTTAAAAGTCTCACCATAGTCGATGCCGGGTTGTTGAAGGTATCCTTTGGCCACTAAGTGGGCCTTAAACCTTTGAACGTTCCCATCAACATCACGTTTAATCTTGTATATCCATTTGGTGCCAACAAGATGGTGATGAGGGGCAGCAGGTACAAGTTCCCAAGTGTGGTTGCGTTGGAGAGCTTCAAATTCTCCAGACATAGCCTTTTGCCAAACGGGGACAGAGACAACAGTGTGAAATGAAAGAGGCTCAATTTGTGTCCAGTCATGGTGAGATTGAGATTGGGTGAGACAGGTGAAGGGTATTTTGGGCTTGGATATGCCGGATTTAGCTCGACAAGTCACCATGGGATGTTGGTTGGAGATATCAGCCACGGGAAGGTCTGGAGAGAAGGAGGGAGTGGGTGATGAGCTGGGACTGGAAGGGGATGGATTTATAGGGTGTGTACTGGGGGTAGCTGGAGAATTAATGTTGTGAGTTGGGCTGTCTGGTGTGGTAGGCAGTGAAGGCAGATGGGGAGAATAAATGGGGGGTGAATTTAATGTGGTGGTAGGTTGGCTGGAATGAATTGGAAGGGTTGGGAACCAGGAATTAATTGGTTGATCATGCTCCTTGATGCTGTCTCTGTTGCTAAATAAGGCTGCATAAGGAAATTCTTGTTCGTTAAAAATCACATGTCTAGAAATATAAATTTTACCTGAGGATGTCATGCACTTGTACTCTTTGTGAAGCTCACTGTACCCCATAAATATGCACTTCTCACTGTGAAAATCAAATTTATGCAGTTGGTATGGTCTTAGGCATGGAAAGCAAGCTGAACCAAAAGGTCCTTAAGCTGCTGTAATCTGGTTGTTGGCCTTTTATCAGAAAGTATGGGCAGTTTCCTTCAATAATTGGTGTGGGCATGCGGTTTATAAGATTGACAGCAGTTTGAAAAGCTTCCCACCAATAATGCAACGGTAATTTCCCTTGAGCAAGGAGAGAGAGGCCTGTTTTTTCCACTATATGTCGGTGTTTTCTTTCAATTCTCCCATTTTGAGCTGAGGTATAAGGACATGTCAGTTGAACTTCAATCCCTTGATTCTTGCAGTAGGTGATGAATGGTTTAAATTCACCTCCATTATCACATCTTAAAGTCTCGATAGTACTTTCATATTGCTTTTCTACTTGGACTTTGAATTGTTGAAAGGTTAAGAAGGCTTCCCCCTTGTTCTTTAATGGGTAAATCCAAGTGTACCTTGAAAAATCATCTAGGAAGGCAATATAATAACTGAAACTGTTAGAGGATTGAATAGGAGCAGGCCCCCATAGGTCAGTATGTATAAGTTCTAGTACTTTTGTAGCTCTTGATTTAGATAAGTTAAAGGGAAGTCTATGGGATTTGCCAAACTTACATGCCTCACAAAAAGATATATTTTCATGGCCTTGAATAGGAGAATTACAAGCATTTAGGACTTGACTCAAAACATTAAATGATGGACAACCCAATCTTTGGTGCCATGTATCCATTGTACAGGTTGTTAACACAAAATTCCTAGAAAATGACAAGTCCTTTGATCTTGAATTGAAATGGTTCTTCACATTTCTTGTAGCTTGAGTGAAGTTGCTTCTTGAATCTCCCTTATTTACTTGTGACTTAACGGAGGCTCGATTATCTTCAAGGACCTTGTAGAGACCGTTCTCAAGCTTCCCTACCAAGCAAGTTTCGGACTTCTTTTCCTTTACAAAAACAGAACATTTTATCAAACTCAACTGTCACATCACATTCTTGGGTTAGTTTGGAAACACTAATTAGATTCTTCCTCATTTTAGGTACAAATAGAACATCTTTAATGACTAGATTTCTAAATGATCCTTTAATGCAAGCATCACCTATGCCAAGTATATAGACTTTAGACCCATCACCTACTATAACATTCTTAAGTTTAGTAGTTTTACCTTTCATGGACAACTTTCCAAGCTCAGCAGTCACGTGGTTACTTGCTCCACTATCTAGGTACCATGAAGGATCTTGTAGAGTCTCAGGGTAGGCCATCATTGCTGTAGGATTGTGAGGACTAATTGCTGTCTATGAGGAAGTATTTATCTGAGTGCTTGAAGGTCCTTGATTTGGTTTTTCATATCTGTGGTAGCATATATTTGTTGTGTGCCCCACCTTGCCACATATTTGACATGTTGGTTTGTTTGGTGATGGAAAGGGAGAATAACGATTTCGTCCCCTTGATCTTCCTCCTCTTTGGGAGGTATGGTAGCTTCCACGGCTTGGATATTGAGAGGTATTGGAGAGGTGATGAGGTGGTTTGGTAGCTTCATTGTTGGCTACATTGACGGAGGCTTGTTGAACTGAGATGGCTCCTTTAACAGATTGGAGATGTTCAAGTCTCTATTCAAAGGAAAGTAGTTCGTTTTGAACTCCAAACCAAGTTTTGTTAGCAGTCCTGAGAAAACAAACTATGGTGGTATATTCTTCATCCAAACCAGTTGTAACATGTGAAATGAAGCCCTCATGTCCAATGGGAACCCAACAACCAGCAACCTGCAAGTTATCAAAATATTTCTTCATTGTTTCAAGGTATTCATACATTTTCATGGTACCTTTTCTTGTTTGTTGCAACATCAGCCTGTTGTAATCCTCTTGTGATCGAGATTGAATGCCATAATAGTCTTGAACAACATCCCACAGTCCTTTGGCTTCGTCATGGCCCATGACTTGTCCTGCTACTTTCGGAGTCATGGAGTTGTACATCCAACCGACTAGCAATTGATCTGCAGCAAGGCAAATGTCGTATTCAAGATTGGGAATCTTCAAACCAGTCGGGTCTTCTTCAAATGGTGGAAGTTCTATCGTCATCTCAGGTGGCGAAGAAGTGCCGGTTAAATGCCCTTCAAGTTTATAACTTTTGAGAATGGGTATGGCGATGTTTTGCTAGAGGAGAAAGTTGTTACGATTGAGTTTGATCGAGGTGGACTGGTTTAGAAGGTTTACAAATGGTGGGTTTCCGACGGGTATGTCTTTCGTTCGCACGAGGTATGGACTTTTTCTTGCTGATTTTTCAGAGTTCACATCAGTGGCCGTTGCACTTGCGGCAGTAGGTTGTGTAGCTCGGGACGATTCGCTGATGGGTGTTTGCGAAGGAGAGTCAGCCATGGTTGCGTGATTTGTTGATTGTCGACGCTCTGATACCAAGTTTGAAAAACAACAGAGAGAAGAACCTGATAACTTCGTAATTTCTTATTCACCTTAATCGAATTGATACAAAGGGTAGAAGGTGAGTTAATACAAGAAATGTAATCATTAAGGAACTCTTCTAAAGGATAAGAGGAAGTCAAATCAAATAAATAAACCAATAGTTACAAGGAAGAAAGTGCAAGTAATAAGTGAGATGTCAAGAGATATGATAGGTTGACTTTAACTTCTAACAAAGTCAGATGCAAGTGCTGTTTTCTTATCCCTATCTGTCGATGCTTCGTCTTGTAGTGAAATAATTGGAGTATAGGTTGAAAGCTAAGTTTGAATTTTGTACCCTCTCCTTCTTATGTGCAAATTAGCTCAGATTTTATTTTCTAGTCTAGCTTAAGAACACACAATTCTTCTTCTTGTTCTTATTATTATGAATTTTGGTTGACATAGAATGAGTGTGTTTTGAAGTTAACAGGATGATTTTGACCGTTGATGAAAACTCAGAATTCTCACAATCCTTAGCATTCTGTGTAGTATTTCCCTTGGACTTCCATTGTATTTAATAGCTCTGATGCTGATGATAGGATATTTACACCACAGACTCAAGTAAATGCGATCACAACTTAAGCTATATATGCTTATTTCAGTTAGCATTCATTCATTTTGTGTTTTTGTGATAAATTATATATAAAGATGTGAGATGCATATTTAACACATGTTGATTAATTTGACACTCATGACATACAAATTACAAAAGAAATAGACATATTCTTATCTTGTTCCTGCTATTTTAGGAGTTGTTTAGCGCACAGGTTTGAAATCAAAACCCGAATTCTGGAACCTTGGAGTTTCTTGGCTAGGTTTGTTGTAGATTGTTTTTTTGTTTTTTTTTTTAATATAACTTTTCAGATTGTTTATTATTTTAAAGAGTTTTTCTACACTGTTTTTTCACTCTCTTTTTAACACAACTCAACAAATTTTATAGCTTCATTGATTAAGAGTATGTAGGAGTCCACTATTGTTTGAGTATAGGTTGATGCTTATAGATTCATTTGAAGTTAGGGGGTGGGTGAGAAATAATTTTGAGAAGTCTTAGCAAAGTTCTAAGTAATTTTATTGTGTTCTAAAACTTTTGGTTTTTAGTTATTATACTTATTATGCTTGTGTAACTTTTGTGAGTTAGTAAGTTATATTTAGTTTGTTGTCTCCAAACTTATTTGGTTTGTTCGAGTGTTCTTAGTAGTTGAGTTTATAGGTGTTATGTTAATAAATTTTGAGAAGTGTTAGCTAAGTCGTAAGTAAATCTATTGTCTTATAACTTTGGTTTTTTAGATGTTATACTTATTATGTTTGTGTAACTTTTGTGAGTTAGTAAAATTATTTTTAGTTCGTTGTCACCAAAATCATTTGGTTTGTTCGAGTTTTCTTAATAGTTTAGCTTATAGGTGCTATGTTAATAAATGTTTGAAGGTGATTTGTTTATTTGAGGGTTTGTAAAACTAATTGGTTGAAATTTTGAAGGTGATTTGAAGGGTTTAGGCTTAACAAGCACAGTTCAAAGCTTTCCAAAACTAATGTCACGATGAGTTGGTCCTCTTAAATATCTATAAACGTCATTCATTGACGTTTTTCACGTTTCAAATAACAAATTTGAGTTATATTTCTGGTATATGTAGCTTTGTTGATTGTATGACTTTCATAGAAACCTCAAGCCTTCTCACCAACCAAACACTTTCCTTCTTTTTCACCTGTACAAAAAAGCACAAATTTTGCCACATACCCACAAGAAGAAACAGATCAAACCCAGAAGATAGAAATAAAAAGAGAAGCCAAAAATTGCACATTGCATTGCATGTGTGTATTGGGTTAGGAAGACAGTGTAGGAAAGCATGAGTTAGTTTCATTTCATGTCTCAATAAAAGCAACACAAGGTTATACAATCTTCAGCCCAACAAAACCCCCATTATTTCTCTTCTTCAGAATTCATTTCAGGCCAAAGATGAATCAGCCCAGCTTCAAACCTCTTCAATTTCTTATCATCCTCAAACGGACGGTCAAACCGAGGTAGTTAACAAGTGTGTGGATACCTATTTGAGGTGCTTTTGCAATGAGAAGCCAAAGCAATGGTATCAATGGTTAAGTTGGGTTGAATTTTGGTACAACACCACTTTTCATACGTCCATAGGTATGACTCCCTCCCAGGCGGTTTATGGTCGTTTACCACCACCCCTAGTTTCCTACGGGGAGCAAAGAACATCCAACGAGACAGTGGAGGGGCAGTTAGTGGAATGGGATGTGATCATTGACCGATAGAGACTCATTTGAGGATAGCTCGAGATAGAATGAAGAAGTTTGCGGATAAAAAGAAGAGAGAGGTCGAGTTTGAAGTGGGAGATTGGGTGTACTTGAAAATTAAGCCATATCGGCAGGCTACCATAGCCAAAAGAAGGAATGAGAAGCTCTCGGCTAAGTATTTGCGTTCTTTTTGCATTGAAGGGAAGTTGGGACCAGTAGCCTACAAGCTGAAGTTGCCAAAGGATGCCTCCATTCATCCGGTTTTCCATGTTTCTCAACTGGGTTGTGGGAAATTTTTGCAGACCACCGCAGAAATGTTCCTTGATATACGAAGAACATGAGTGGATAACAATACTTGAAGAAGTAATAGCTTATAGAGAAAATACACAAAATGGACAAGATGAAGTGTTGATGCGTTTGGAGGGACTACCGGAGTATGATTCCTCGTTGGAAGAGATTGAAGATTTCAAGAGGCAATTCCCACACTTCAACCTTGAGGACAAGGTTTTGTGGAAGTCGGAAGGTACTATTAGGCCACCTATCACAAAAGTTAATCAAAGAAAGGGTAAAAAGGGAAATGCATTGGGGACAATGGGGAACGTGGGAGTAAGTGCCTTAGGGGGGACCAGTGAGAGATTTGTATCCGAAGGGGGGGGGGGGTATCCTTCTTCCTGTTTTTCAAGTTAGTTATCCTTTTTGTATAAATTGCGAGTAGCAATTTAGGACAAGGAGAGCCACCAACCCTCTCGAATTGGTTGGCTTGATTGTAGATGTTTGGCAAGTTAATTTCGGTGTAATTCCAAAAGGAAGAATACTTCCACAACTCTCTTACATTAACCTCACATTATACTTTTTCCCTTAAAATCCATGACACTCCGAGCTATTTTCTAATCCTATCAAAACCTTTTCGGTGTGTAGTTCCCATTGCTTGCAGCAAAAGTCAATGGTCAATTCTTGGATGGTAGGTTCATTATACACTCACCACTTGATCATCAATGGGAAGCCGTAAAACAAATTTTAGGTTATCTCAATAGTACTATTGATTGTGGTTCTTTCTAATCCGACCTCGTTGACTGGGGCCTAATCTAAATGATTGAAAGGTCTCTCATAAGCAACAAAGTCTCCCAATCTAGCAGTGAAGTAGAAATTTGGAGGCTTGCTTGTGCCTCTGCTGAGTCGAGTTTGATGGCAATTCTTGATGTGTGATCTTTATATTTCTGTATCAGTGTCCTGTAATGCAGAATCGTGCAGCTTAGTGCCAATTTAATGGTGGTGGTTTGAACAGTCAAAACATCATACCAACTAAAAATTTACTAATATACTGCCCAAGTCCATGTCAACCCAAAATTCCATTCTAATTTGGTACAAGCGTCCTTCTCTTTTCCTCCTAAAGAGACATTTCCTGTCTTTTCTTTGCCATCAACAACAATGATTCAGACAATGATGAAAGCAAACTAAGCACTTTTCGTTGGTGAAAACAGGTCGGATCCGGTGTGAGGATTTCTGCTAAGCCCACCAATAGAAATGAAAAGGAAAAAAAAAAGACTCAATGCTTTGAATCGGAAGAAAAGGAGGAAAAAATGTGAGACCTAAACATGGTTGGGGATTTGGGGATGGGTTCGCGGGGGTTGGATGAGAGGGATTGGCGGTTGGAGAAGAGGTCGCGGTATATGGTTGGGTATTATTGGGGGTTCTGGTGCTGATGGCCGTATTAGAAGTATAGTTTGGAGAAATTGAACTCTTGAAATTTTATCTTTGCGGTTGTGGTGTTGGGGTTTCAGGAACATTCTTCTTTGAAGAAATAGGTTGGTTCATGAAATCACATAGTTTGTCCACTCAACTTGAGAAGAATCTGCTCAGGTTGAGAATCGCCTAACATTCGAAATCGTAACAGATGAATTAAGAGAAAGGGTGTAACCTGTGGTCGTAGCAGTCGTCGTTGCAAGTCTCTCAGTGAAGAAAACCTTGCCTAAGTTGTTGTCGTCCGTGCCTTGACGTTGCCGGAGAGCCAACGAACCATAGGAACTGATGCGTCGATTGATAGACAGAGCGTGAGAACTGAGAAGCGAAAGCTAATGGAACCCTAACTTAAATTACGAGTCGGTTCTGGTCGAATAATGGTAACTAATCAAGCAAACCATAAATACCTAATAAGGGGCCGTTTGGATTGAGGAGTTGGGGGAAGTAGGAGTTGCAGGGAGTAGAAGTAAAGAAGTCTGGGGAGTTCTGAAGTCTGTGTTTGGGGTGCAGAGTTGTTTACTCTGGAGTTCTGAGAAACTGTGCTTGGGGCAAGGAGTTCTGATGTCTGAGTTTAGGGTGCAAATTTATTAATTTGTGATTTTAGCTATTTCATGTACAGATGGAGAAGGGGAGAGCGAAGATACCGAGGAGTCGCGTGGAGGGTTGTGCAGACTTCAACAGTAAACACTGTTGGAGTCCGAGAGTTTAATGCTTTCTAAACCGTGTGATAAAAAATCGTGGGCCAAACGAGGGTTTTCTAAACCCTCCCAACTCCTTCAATTAATTACCCTCGGCCCAAACGCCCCCTAAAAGTTATGATCGACCCCGTGATTAGCAGAGTATAGTCATCTTCTGCGTTGAGAAATTATAAAGCAAAAAAGAACTTACAAATTGCTTAACGCATCATTAACAAGTTATCGTCAACCCATTCGAATTTCAAACAAAGTAACTACTTTGATTTTGTATACGGTAGAGGAAAGCAAACACAGAATTGTGAATGAACCAAACATCTGCGAAATGGAGGAGTTGAAGAAATGAACCTTGAAGATTAGCACAAAAAACAAAACCCTAATGAGACCCGCTCTGACTGGTTTGTTTCGCTCAGCTGTTGCCCAGATTGAGAGTCCCGAATCTTGAGCAGATTGGATGACCTCAATACGTCAGAGCCGCAAGGTTTTGAAATTTGCAGCTTGAAGCTATTGGGCCGCGTTAAATGATGGCCCCGTTAAATATTTAGGTAGATGTTGCTTGGTATTCTGATGGCACTTAGTCTGGTTTCATACCGAGTTGGACCGACCCACTTGGCCTTTGGAAACTGCACCCATACCGTTGACAAATGTTCTGTTACAACAATATATCATCAGGTAGAAGAATGAAACAACAGTTCAAATCTGTGAACCTCTGTTCTCAACTATGAGGGGTCGACTTTTGAATTCATTAGTCTGGTTTCATACCGAGTTGGACCAACCCACTTGGCCCTTGGAAACTGCACCCATACCGTTGACAAATGTTCTGTTACAACAATATATCATCAGGTAGAAGAATGAAACAACAGTTCAAATCTGTGAACCTCTGTTCTCAACTATGAGGGGTCGACTTTTGAATTCAATAATCCATAGTAAGAAGAAAAATGCATGGTATGTCAAACCAAGTCTTAGATGCTGCCTTACGAAGTTTATGCGCATGAGTTTGTTCATTAGTAGAACCCAAATGTGGAGGCTGGTTCATGTGGACATCAGCAGTACCCAATTTTTGTAATGCCCAACCTGTAGGCAAGAGCAATAGTAAGAACCATCTGATTAGGTTTGACCATAATCAACATCCAGTGATTGGAAATCTCTTATCCATTAACTTGACCTTTGAGCGCTACTGTCCATCTGGATTTCTTTTGATTTTAAATGCCCATTTACGCCTTGTAACCTTAGAATCACTTTGTGGGGAACTTGGTTGCCAATGCTTTGGATGACGATAAATTTCTTAGGAAGTTGTATATAAACATTGATATACCTTTCAAAGTAAAGTTCATTTTTTCTGCTAGGGTTTAACCATTCTGGAACTTTAGGTCGCACAGATTTACTTCACTCGAACCCTCCTGTAGCTGAGCTTTTAATGCTTTGCTACTATCATAAGAAAGCAAATATATTATTGGAAAGATCACTGTGTAGCTCATAATACGCTGTTTTTCTCCCCTTTTGTATGACTTGATGAAGTGTGGAGTGAATTTTCTTTTGTATAACAATGATTTTCAGTATTATCAGTTCATTATTCGTTTTTCTAAGTCTTGTTCCTTATGTGAAAACCATATATTGCCATTGAATAATAGTCCTAATTGGATTGATATTTACCAGTTTGCAGCATTCAAAGTATAACAAAACAGTGTCTTTGTCTGCAAAATGGCTGGACTTTTGGGATATAGGGGCCTTGCACCCAAGACAAAGAATATTGTTGTTGCTGGAGGTTTAACTGCTTTTGTGTTTGGGGTATATTTCTACACAATGAGAGCTGTTGGAGGTACAGATGAACTGCAAGTAGCGATAGACCAGTTTGAAGCGCATAAAAGCAAGGATCCGAACGTGTAGCCTTGCAATAAACCTCTTTCTGGAGATCGCAACAATGAAGACGCTAATGTCAGAGGAAAAGGACCATTGAGTTTGACCCTTTTATACTGACTTATAGCTCTGTAAAAACAACAAATTCCTCGGCATTTAGACAATAATGCGTTTTCCATGAACTCTAAGAAGTAATGTCATGTGGTTTGTTGGGCTGCGCTTTCCATTTTGGGGAATTCAGAATGATTTGCGTTGCATCTACTTGACAATTCAATTACTTCCATTGATTTCAATACATGAAAAGAAATGAATTCTTAACAACACATGGTATGAAAAGTGATTCAA

mRNA sequence

GTTTTGAAGGGCTCGAATTTTAGCCCCCATCCTCTTTTCCCCTCTCAATTTCGTTTCCCTTGCCCAAACAAAACAAAACCCAGAAAACTGATTTGTTATTGCTGAAGCTCGGTCCCTTTTTTTTTTCCGATCAGCTGGCGTCGACCACACCAGATTCGCGAAGCTCCGTCCCAGCCACCGGAAGATCTGTCTCAGACACACGTCGTTCAGCCGCCGGACGAAGCTATCGCTGTCGCAGCCACAGCCACGCGAGACACCTGCGATTTGCTCAAAGCTCCAGCCGCCGTGAGCAGTCGCATAGAGCTGCACGTCGATCTGTCGGCCTACCGTTCGCGACCGCAGTCACACTCGCAGGTTTTGCAGCATTCAAAGTATAACAAAACAGTGTCTTTGTCTGCAAAATGGCTGGACTTTTGGGATATAGGGGCCTTGCACCCAAGACAAAGAATATTGTTGTTGCTGGAGGTTTAACTGCTTTTGTGTTTGGGGTATATTTCTACACAATGAGAGCTGTTGGAGGTACAGATGAACTGCAAGTAGCGATAGACCAGTTTGAAGCGCATAAAAGCAAGGATCCGAACGTGTAGCCTTGCAATAAACCTCTTTCTGGAGATCGCAACAATGAAGACGCTAATGTCAGAGGAAAAGGACCATTGAGTTTGACCCTTTTATACTGACTTATAGCTCTGTAAAAACAACAAATTCCTCGGCATTTAGACAATAATGCGTTTTCCATGAACTCTAAGAAGTAATGTCATGTGGTTTGTTGGGCTGCGCTTTCCATTTTGGGGAATTCAGAATGATTTGCGTTGCATCTACTTGACAATTCAATTACTTCCATTGATTTCAATACATGAAAAGAAATGAATTCTTAACAACACATGGTATGAAAAGTGATTCAA

Coding sequence (CDS)

ATGGCTGGACTTTTGGGATATAGGGGCCTTGCACCCAAGACAAAGAATATTGTTGTTGCTGGAGGTTTAACTGCTTTTGTGTTTGGGGTATATTTCTACACAATGAGAGCTGTTGGAGGTACAGATGAACTGCAAGTAGCGATAGACCAGTTTGAAGCGCATAAAAGCAAGGATCCGAACGTGTAG

Protein sequence

MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKSKDPNV
Homology
BLAST of Tan0016002 vs. NCBI nr
Match: XP_038891345.1 (uncharacterized protein LOC120080790 [Benincasa hispida] >XP_038891346.1 uncharacterized protein LOC120080790 [Benincasa hispida])

HSP 1 Score: 109.0 bits (271), Expect = 1.5e-20
Identity = 56/61 (91.80%), Postives = 57/61 (93.44%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKS-KDP 60
          MAGLLGYRGL PKTKNIVVAGGLTAFVFGVYFYTMRAVGG+DELQVAIDQFEA KS KD 
Sbjct: 1  MAGLLGYRGLGPKTKNIVVAGGLTAFVFGVYFYTMRAVGGSDELQVAIDQFEAQKSDKDS 60

BLAST of Tan0016002 vs. NCBI nr
Match: XP_022956375.1 (uncharacterized protein LOC111458133 [Cucurbita moschata] >XP_022998843.1 uncharacterized protein LOC111493381 [Cucurbita maxima] >XP_023522138.1 uncharacterized protein LOC111785987 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 108.2 bits (269), Expect = 2.5e-20
Identity = 56/62 (90.32%), Postives = 58/62 (93.55%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHK-SKDP 60
          MAGLLG RGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGG+DELQVAIDQFEA K +KD 
Sbjct: 1  MAGLLGLRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGSDELQVAIDQFEAQKDNKDS 60

Query: 61 NV 62
          NV
Sbjct: 61 NV 62

BLAST of Tan0016002 vs. NCBI nr
Match: KAG6601607.1 (hypothetical protein SDJN03_06840, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 108.2 bits (269), Expect = 2.5e-20
Identity = 56/62 (90.32%), Postives = 58/62 (93.55%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHK-SKDP 60
          MAGLLG RGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGG+DELQVAIDQFEA K +KD 
Sbjct: 1  MAGLLGLRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGSDELQVAIDQFEAQKDNKDS 60

Query: 61 NV 62
          NV
Sbjct: 61 NV 62

BLAST of Tan0016002 vs. NCBI nr
Match: XP_022151963.1 (uncharacterized protein LOC111019802 [Momordica charantia] >XP_022151965.1 uncharacterized protein LOC111019802 [Momordica charantia] >XP_022151966.1 uncharacterized protein LOC111019802 [Momordica charantia])

HSP 1 Score: 107.1 bits (266), Expect = 5.6e-20
Identity = 52/56 (92.86%), Postives = 54/56 (96.43%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKS 57
          MAGLLGY+ LAPKTKN+VVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEA KS
Sbjct: 1  MAGLLGYKSLAPKTKNVVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAQKS 56

BLAST of Tan0016002 vs. NCBI nr
Match: XP_007033727.2 (PREDICTED: uncharacterized protein LOC18602341 [Theobroma cacao])

HSP 1 Score: 106.3 bits (264), Expect = 9.5e-20
Identity = 51/58 (87.93%), Postives = 54/58 (93.10%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKSKD 59
          MAGLLGY  LAPKTKN+VVAGGLTAFVFGVYFYTMRAVGGTDELQVAID+FE HK +D
Sbjct: 1  MAGLLGYNSLAPKTKNLVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDKFEEHKKQD 58

BLAST of Tan0016002 vs. ExPASy TrEMBL
Match: A0A6J1KHY0 (uncharacterized protein LOC111493381 OS=Cucurbita maxima OX=3661 GN=LOC111493381 PE=4 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 1.2e-20
Identity = 56/62 (90.32%), Postives = 58/62 (93.55%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHK-SKDP 60
          MAGLLG RGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGG+DELQVAIDQFEA K +KD 
Sbjct: 1  MAGLLGLRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGSDELQVAIDQFEAQKDNKDS 60

Query: 61 NV 62
          NV
Sbjct: 61 NV 62

BLAST of Tan0016002 vs. ExPASy TrEMBL
Match: A0A6J1GXM1 (uncharacterized protein LOC111458133 OS=Cucurbita moschata OX=3662 GN=LOC111458133 PE=4 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 1.2e-20
Identity = 56/62 (90.32%), Postives = 58/62 (93.55%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHK-SKDP 60
          MAGLLG RGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGG+DELQVAIDQFEA K +KD 
Sbjct: 1  MAGLLGLRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGSDELQVAIDQFEAQKDNKDS 60

Query: 61 NV 62
          NV
Sbjct: 61 NV 62

BLAST of Tan0016002 vs. ExPASy TrEMBL
Match: A0A6J1DG74 (uncharacterized protein LOC111019802 OS=Momordica charantia OX=3673 GN=LOC111019802 PE=4 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 2.7e-20
Identity = 52/56 (92.86%), Postives = 54/56 (96.43%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKS 57
          MAGLLGY+ LAPKTKN+VVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEA KS
Sbjct: 1  MAGLLGYKSLAPKTKNVVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAQKS 56

BLAST of Tan0016002 vs. ExPASy TrEMBL
Match: A0A5A7SUP4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G002370 PE=4 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 7.8e-20
Identity = 54/62 (87.10%), Postives = 57/62 (91.94%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKS-KDP 60
          MAGLLGY GL PKTKNIVVAGGLTAFVFGVYFYTMRAVGG+DELQVAID+FEA KS K+ 
Sbjct: 1  MAGLLGYSGLGPKTKNIVVAGGLTAFVFGVYFYTMRAVGGSDELQVAIDKFEAQKSNKES 60

Query: 61 NV 62
          NV
Sbjct: 61 NV 62

BLAST of Tan0016002 vs. ExPASy TrEMBL
Match: A0A1S4DW05 (uncharacterized protein LOC103488878 OS=Cucumis melo OX=3656 GN=LOC103488878 PE=4 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 7.8e-20
Identity = 54/62 (87.10%), Postives = 57/62 (91.94%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKS-KDP 60
          MAGLLGY GL PKTKNIVVAGGLTAFVFGVYFYTMRAVGG+DELQVAID+FEA KS K+ 
Sbjct: 1  MAGLLGYSGLGPKTKNIVVAGGLTAFVFGVYFYTMRAVGGSDELQVAIDKFEAQKSNKES 60

Query: 61 NV 62
          NV
Sbjct: 61 NV 62

BLAST of Tan0016002 vs. TAIR 10
Match: AT2G43780.1 (unknown protein; Has 30 Blast hits to 30 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 30; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 89.7 bits (221), Expect = 8.6e-19
Identity = 45/58 (77.59%), Postives = 47/58 (81.03%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKSKD 59
          MAG  G+  L PK KN VVAGGLTAFVFGVYFYTMRAVGGTDELQVAID+FE  K  D
Sbjct: 1  MAGFPGFSYLGPKGKNTVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDKFEGQKQVD 58

BLAST of Tan0016002 vs. TAIR 10
Match: AT2G43780.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; Has 30 Blast hits to 30 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 30; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 89.7 bits (221), Expect = 8.6e-19
Identity = 45/58 (77.59%), Postives = 47/58 (81.03%), Query Frame = 0

Query: 1  MAGLLGYRGLAPKTKNIVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDQFEAHKSKD 59
          MAG  G+  L PK KN VVAGGLTAFVFGVYFYTMRAVGGTDELQVAID+FE  K  D
Sbjct: 1  MAGFPGFSYLGPKGKNTVVAGGLTAFVFGVYFYTMRAVGGTDELQVAIDKFEGQKQVD 58

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_038891345.11.5e-2091.80uncharacterized protein LOC120080790 [Benincasa hispida] >XP_038891346.1 unchara... [more]
XP_022956375.12.5e-2090.32uncharacterized protein LOC111458133 [Cucurbita moschata] >XP_022998843.1 unchar... [more]
KAG6601607.12.5e-2090.32hypothetical protein SDJN03_06840, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022151963.15.6e-2092.86uncharacterized protein LOC111019802 [Momordica charantia] >XP_022151965.1 uncha... [more]
XP_007033727.29.5e-2087.93PREDICTED: uncharacterized protein LOC18602341 [Theobroma cacao][more]
Match NameE-valueIdentityDescription
A0A6J1KHY01.2e-2090.32uncharacterized protein LOC111493381 OS=Cucurbita maxima OX=3661 GN=LOC111493381... [more]
A0A6J1GXM11.2e-2090.32uncharacterized protein LOC111458133 OS=Cucurbita moschata OX=3662 GN=LOC1114581... [more]
A0A6J1DG742.7e-2092.86uncharacterized protein LOC111019802 OS=Momordica charantia OX=3673 GN=LOC111019... [more]
A0A5A7SUP47.8e-2087.10Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S4DW057.8e-2087.10uncharacterized protein LOC103488878 OS=Cucumis melo OX=3656 GN=LOC103488878 PE=... [more]
Match NameE-valueIdentityDescription
AT2G43780.18.6e-1977.59unknown protein; Has 30 Blast hits to 30 proteins in 11 species: Archae - 0; Bac... [more]
AT2G43780.28.6e-1977.59unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR36744:SF2CYTOCHROME OXIDASE ASSEMBLY PROTEINcoord: 1..58
NoneNo IPR availablePANTHERPTHR36744CYTOCHROME OXIDASE ASSEMBLY PROTEINcoord: 1..58

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0016002.1Tan0016002.1mRNA
Tan0016002.2Tan0016002.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane