Tan0008308 (gene) Snake gourd v1

Overview
Name: Tan0008308
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Vacuolar sorting-associated protein 62
Location: LG06: 18126826 .. 18134838 (-)
RNA-Seq Expression: Tan0008308
Synteny: Tan0008308
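
The Location field packs the linkage group, start, end, and strand into a single string. A minimal Python sketch of splitting it apart is given here. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'LG06: 18126826 .. 18134838 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),      # linkage group / chromosome name
        "start": start,
        "end": end,
        "strand": m.group(4),     # '+' or '-'
        "length": end - start + 1,
    }

loc = parse_location("LG06: 18126826 .. 18134838 (-)")
print(loc)
```

Under that assumption the gene span on LG06 works out to 8013 bp on the minus strand.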
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAATAACCAAAACTCACACATAGAAAAATATCGATCTCAAAAAATGAGAAAATTTTAATAAAACCATCCAAATAATTACGAATTTTCTATTACCTAATAAATTCTAGAAAAGGAAGACGAAGAGAGGAAGAAGAAAGAAGAAAGAAGAAAGAATAAAGAATGGGCAAACCTTGGAAAATGAACCAAAAAAAATGGTAAACCCTATGTAATATTAATCCGTAAAAGGATGAGTGCCCTTTCGTTATTGGCTCACTGTGTCAAAAGTAGGATCCTCATTCAATTAGCAGAAACGGTAAGCAGAATTAAACGTAACGCAATGAAATCACTTTTTCGAATTTCGTTTGATTTTCAGTAAAAACACCCTAGATTCAAACTATAGTAGTCTAGTCACTCATTTAAAATTTTACAAATACTCGAGAATATATATAATTGAGATGAAATCAGACAAATGACTTTACTAGAAATAAAAAAAAAGCATAGATTTTATTTATTCAAAAAGCGCCGCAGTGTGATTTTGGCCTGCGGAAGCGCTGAGCGAAGGAAGGAGCAAAGAATCCCGAAGCTGAAAATTATGGCGAAAAGGTTCATTTCCATATTCAAGCGCTCTTCAACTCCACACGCTTCCAGGTGCGGTTTTTCTTGTAATCCCGATTTCTCATTTCCCTTTCGCTTTTTTCTGCTCTTTATATACGCTTCCGTTTCTTGGATTTTCATACGATCAGCTACTAACTCACCCACGAACTCAGTTCAACATGGAATTGATCGCCCGCATTTTTAGAAACAGATATATCTGTATCTGGGTTCTTCAATTTTTTTTTTTGTTTTGATCGTATAAGAGCCATAATTTCCACCTTATGCTCGTTTAGAAGTGGGAACTGTTTTATTTGGTTTTCTATGGAAAAGATAACGTTGGATTTGATTCTTAGAAGATTTTAATTGACATCAATTACCGGTTTTTTTTAATGGATAGAGTGGAACTTGATGATCTCTGATTAAGGGTTGCTTGCAGCAGTTCCATAAAGCCAGCGGAGGAAGGGGTGAATAAATCCTGGGGTCGTAAAGCAGCCTCCTTTGTGCTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTCGCCATTTATCATAGCTGTAGCAGGTATGTCTGACTTGTTAGGGCTTTTCAAATTTTGGATGAATTATATCGTTTAAGCAAATGAAAGTTTGAATTTGGCGAAATTAAACTGGTGTATCTTGGGATTGCTTCTTATGCACTTGTTTGTTGTCATGTTTCACGTATTGGTGCTTTGAAGTTCAGGGAAGCTCTCCCCATTGCTGTGTTGTTTTCTTCGTAGCCATCTAACTTGAAGACTAGATGGCAATGAAGTTGCCAAATCTTGGTGGTTCTCAACCTTTAAAATTGGCTGTGGTTTTACGTTACTTAGGTTATTGTATTTTTGAAATCGAGCCCTATCACTTCATATGAACTAATGAATTGGATTGGATAAGATCTCATATTACTATTCAATGATGTTCACATTATTGTTACCGAAAGAGCTCAGTTGTCAACTTTTTTATGACGAGGGGATAAGAGAATGCACCAATCCTTTTTCTTTGGGTTTCTTTTAGCTACACGTAAGCTGCTTGCTCCGTTTCTGTTGGGGGGCTTTAGATCTTTGTTATTGGCCCAGTTACTTAGGTTTAGTACTTTTCTGGATGAAATTGTGCAAACAGATGATAGATCTTCAAGGGCTTAAGTGAATGCATTTTTCTTGTTTTTAGACGCCCAACCTGACTTCTCCTTACCATGAATACTCACCAGCACCTCTATTCGACTTGGCGTATCTCCAACCGAGATCACCATTTTCAATAAAGCCACCTTGTTACAATTAGGACGAGATCAGGAATTAGAGTAACCGAGCCTACCTCAACATTTTTATCCAAAGATTTAGTCTACAACCTACAACTCTTATACAAGCTACTCTTTCTAGTTTGCCACTTCATTACTTGT
CCGGTTGTCCCTATTTAGAGCTCCTGTTAGAGTGATAAAGATGTTGGACAAGTTTGTTCGTGATTTCTTGTGGGAGGTGCGCATGGTGATGGGGTGTCCACAATATGAATTGGACAACCACACAACTTCCTAAATTGATGTTGGCTTTGGTATAGGCAATTTTCAGCATCATAATTTGGCACTTTTGGCTAAGTGGATTTGGAGGTTTGTTCATGAGTAAAACTCTCTTTGGCGAAATCTTATTGTGGCAAAATATTATGGTGGTATATATGGGGATGACTGGCCTCAATTTACTGTGAATGGTTCTCATAAATCTCCTTGGAAATTTATATGTAATGTTCGAGTTTTGATTACTACCCGTGCTTGTCGACGTATTGGGGATGGTGTTTCAACTTTTTTTTTGGAGTAATTCTTGGTTAAGTTGTGGTCCTATTGCTACGACTTTTTCTAGACTATATCGACTCACTTGGGCTCCAACTTTTATTGTAGTTGAAGCTTGGGTTATTTCTAGTTATGCTTGGAATTTGAGGTACGTCGTAATTTGAATGATTTGGAAATGATTGAGTGCTGCTCTTTCAAATCTTCTACCTCATATCACTTTGACGATTTCTCATGATACATGGTTATGGCCTTTAGAACCATTTGGGGGTTTCTCTGTGAAGTCTCTTACTGCCAGTTTGTTGGGTGTTGATAATCCTATACTAAAAGAATTATATTCGGTGATTTGGTCAGATGCTTATTCAAAAAAGATAAAATCTTTGTTTGGGAGCTTAGTCTAGGAGCAGTGAACACTTATGATCACCTACAGAGAAGAATACCATTTATGGTCCTTTCTCCTTCTTGATGTGTAATGTGTGGACTAAATTCAGAATCTACGAGTCATCTTTTTGTGCATTGTTGTTTTGCTTCACGTTTTTTTATGCGCATTTTAGATGCTTTTGGTTGGTCAATACCTCTTTTCAACGACATTTTTGTTCATTTATCTATGATTCTGGTGGGTCATTCGTTCAAAGGCACAAAGAAGATTGTTTGGTTGGCAATTGTTTGATCTTTTCTTTGTCACCTTTGGCGGTAGAGGAATAGTTGAATTTTCAGGGATGTATCTTCTATTTTTAATAGGTTTTTTTAATTGATTCTTTCAAAGGAATTTTTATGGTGCAAACCTGTACGACCCTTTAGTCTTTATAGACTATCTTTTTTTATTTACAATTAGAGAATTTTATTGTAATTTTATCAATAAGGTGTTTGGAGTTTTTTCTCCTTTATTTCATTCACCAGTGAAATTGTTTCTTTATCAAAAAAAAAAAAAACTTAGATAGTATACTTCTAAATTCTAACAAAAATGAAAAAGAGAAGTCCTCTTCTTGAAGTAGAGGATTCGGTGGTAAGGGGAAGAGTGACTGGGAAGTGGCTTTTGATGGACGATCTGTATTTAAAATTAGAAGCTTGGGACGACAATGCCCATGGTTTAGCAGAGTATATGCTCAAATATGGTGGTTAGATTGCGGTAAAAAATTTACCTATTAGTACTAGTCAAGAGTTGTCTTTGAAGCTGTAGGAAAGCATTTCAGGGGATTAGAAGAAATCGATTTGGAGACTCTAAATTTCACTGATTTGTCATGTGCAAAAATCAAAGTGAAAAGGAACTTGTGCGGTTTTATGCCTCAGTCAACTGAAGTCAAAGTGGCGCCTTTAAGTATGTTTTGTCTTTCCTTCGTTAGTTTGGATAAGAATGAACCCCCAATTTGGGCCAACGACCCTTTATTTGCAGAAGACTTCTCAAATATGATAGGCTTAGCTTAGATTAATATGGTCCTTGAGGATGTCTCTGGCTCGGACTCCTAGGAAGAAGTGGGCTGCTTAAAGGATCAAGAGATGCCAAATGGACAATCTGAAGGAAGTGCAGGGGACTCCATTGCTAGAAGGAATTCAGGAGGATTAATCAGAGGGGATTATGGGAATCTGAGGGGTTGTGGGAATTTAAATATTTGTGATTGGAG
GGAGATGACTGGGGGTGTTGCTCAGAAGAATATAAATAGTGAGATTATTGAAGATGCCTCGTCAATTAGGTTTGCGGGAACTAAGGAGCTGATCAGTGGGGAATAAGAGATTTTGAGGAATTGTGGAATTTTGAATAGTAGTGAAGAGAGCGAGGCGATTGGGTGTTGCTCATGAGGATGCTTTTATTTTATCTGGTAATAATGAAATTGTTTTGAATGGTGCCCATCCATATAACCTTCTAATCGACAAATCGCAACCCTTGGCGGCTTGTGGTGTTAAATTACTAAACAAGAAGGAGAGTGAAGATCTGTTATTTTCTTCCACTATTCCAACGAGTGTTAAAGAAATAAAAGATGCCTTTTGTGGAAAATATTATACTAGACAGAGTGGAAGATTCCACTCTGGGTGTAAAAATTCAGCTGGAAGGAACTTTTCCAGATTTCCTCGAACAATCTGATGGTTTATCTCATCATCATAGCCCTTGGGTGCAACATTTGCAAGCAAGCTCTAATAAACCTCTCAATGCTTCGGGTTTTGCCATTGAAGAATGTCATCTTGGGTCAAAGGGGATCTCGTTTTTTAAAACTTCGTTTCAACCAAATTTAGATGTTGCCCGAAAAGAATCCTCTGCATCTCTAGTGGTTGGTCAGAATTGCGCATTTCAATATGACTCTGAGGTCAACCTAAGCAGTCTGGATATAGGCTCGGTAAGAAGTTCGGGAAATAAGGTGTTGGAGGACCTTTCTCCAGTAATTAATCTGGTTGATTTATTTAATTCTGCAACTCATCTACCTTCTTTGGGGATAAGAATGAGAAGAAGAGTTTTGATTGTCCTACTGATTTGTCTCGGTTTAGTGCATTAATTAAAGCAAGTGGGTTCCAATTTAAAGAAATTGTAGCAGGAGGTACTTCTATTCCTGTGCAATGAAAATATTATCATGGAATATGAGAGGATTGGGTGTCGGTTCAAAGCGAGTTAGCCTTAAACAGTTTCTTCAAAAAAATATGTTGGGATGTGGTGTTTATTCAAGAAACAAAGTTGAGATTAATAGATGATGAGGTGTTCAAATCCTTATGGAGTCCAAGGGGTATAGGTTGGGTGCATGTTGAAGCTTATGGTCGATCAGGAGGGAAGCTAATGGTGCGAGGGTTAATCTTTTTGGATTTCTAATGGTTATGGGCTTGATCATTATCAAGAGAGGAAATTTTGAGGGGTAAATGGTGTTCTCTATCTTATTACGGTGAATGGCCTTGGTGTATGGGAGGAGTTGTTAATTTATTCAGATGGGTCCATGTACGGTTGCCTCAAAGAAGAATGACCAAAGAATCAGGGTCAGGGGTGGCTTTGGAGAGGGGTCTTATTTGTGCCCCGTGGTGCTCTTTATGGCTTAGGCGTGTCAATGTTTTTGTTATGGATTATGGTCAGGGGGAAGTTGATTGCGGCTAAGATTTTGCAAAGAAAGTTGTCTTCTTCGGTGTTGCAACCCTCTTTTTGTTGTCTATGTTTAAAAGCAGCCAAAACTTCTTATCATTTGTTCTTTGGGTGTGACTATGTGCATTTTGATTTGGCTCGACAAAGGCTTCTTCTTGGTGTTTTCTTTCTAAAGATTTTGTAGGTTACTCTGTGTCTGATTTGTATTTTTCTTGGGCTCCGTTTTTGTCTTATCTGTAGAGTGTTTACTTGAGGTCTCGTGGTCTTTGTATTGCTTGTGCTTATTGGCTTCTCTTTATGTGCTCTTTCTTGTTTGTGCTTTTGATTTAGCTTCTTATTTTGTACTATGAACTTCTCGCTCTTTCATTATTTCAATATAATTTTCTTGTTGCCTTGTGAAAAAAAAAAAATCCTCTTCTTAATGAAGTTGCCTGTTTGCTTATCATTCAAAAGAATTGATAAGTTTAGAGAAGAGAGACACTCTTTGACCCACACTCTCTATTTCCTACCAAATATTTTCTTATCAGTAACTTCATCAAGAAAGTCCCAATCCACAAGATTATTGAT
TCTGTTTTTAGTGGATAATTGAGCTTTTATAATGAAAGAATACACGATAAAAAAGGGCATTCAAAAAGATTGCCCAACAAAGGGAGACAAACTATAACCAGACTAACTATACAACAAAATAAATATAATATCTTTTCTCACGCTATGCTTGCATCTTGCATTGTACCTTAAATAAATTGGTTTAGAACTGTTAGTGCTCTGTAATACTTCTGTAAAAGAAATTTACTCTAAGAATTCTCTTTAATAAATGAAGGAATCAAACTAGGGTTTCATTGAGCTTTTTGTCACATGTATGAGTGTTCTTAGGAGTGGAAAGCTTTGGAATGTACATTTGTGGTGATTTTCCTTTGCAGTTGGTTGCTGGCATAAGACTTATTTTATTTTATTGTTTTTCCAGCAAAGCCATAGAGAAAGCCAGAAACAATCAAGCAATTATAGATGCTATTGGAGAACCCATTGTTAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAAAGACATTCTCTATCCTGCACATTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAATTGAAGGCAGTTCGTAATGGAGGTTGAGATACTTTTCCTGTCAGCTGGTTTACATATTCTTTTGTGAAATCAAGTTTTAGTTTCAATAGAAAATTAGAACTGAATATACAAGTTTTCCTCTTTTCTATCATTTTATGTTAGATCTCAGGTGAAATTTTCTACTTGATTTATCTCATTGCATTATATAAAAAATGATATTATATTTATACATGAAATTTTCATTATAAAAGAATTATACATAGAAATCCTCTAGCTGAATTGGGCCTAAATCTTGTTCCTAAAAAAATCTGATTCTTTAAAGTTCTTTTCATCGGTAATTTAGTGAATCAATCATCATTTCTAGGACTTAGTTATGCTATCTGTTTTGTTATATGTTTATTTACTGCAAATTTTTGGAGTTGAAGGATGGAATGACAATTGTGAAAGTTGTTTTGACCTTTTCTTTTCCGAAGCAATCCAACTTAGCTGGCGTAACTTCAATCTGCAATATTTGGAAAAGGGTTTTCTGGATTAGTGAATTTTTTGAGGTAAAATAGGAAGATTCTGAAGCCATTTGGTGTATAATTTATTCTATATGTAAGCTTCTCTAAATCTTCAAATCGTGGTGGCCACGTACTTAAGATTTAATTTCTTATGCTTTTTCTTGGTAACCAAATGTAGTAAAGTTGTTTTGTTAGATTAGTTGAGTTGCATGTAAGCGGATCCTTACACACACAGATATCAATCGATTCTTTTGATTTTACTTTTGCATCAAGATATTCAAAATGGACAAATCTTGTTGTATTTTGTTTATTTTTTTTACCTGAATTATCATCTTCTTTCAACTCTTATTATCTTGTTATATATTACTTTCATTCCTTTAGAAGGGAGAAAACTTGACCTTGTCTTTTTTTCTTTTTATAATTTTTTTTTTAAAATTGTAATACTAAGTTGGAATATCTCAACAGAGGACTCCTGGATTTCTTTTCTCCGGCCACGTGACTGGGACATTCTAATCATGGACGCTCTCCTTCATGTTCCTGCAAACGAAGGTAAGCAGCAAACATTGCGCATCAATCTAACTGAGAAGTTTACCCCTGCTGCTTGTGTTGCATGCACTGATTTTCAGCCTCCGGAGTCAGAGAAGAGATGAACTCAGCTCGTAAGTTTATCGAGAAAATATTGTCGAAAGTTAGCAGTTTTTTTTTTCTTAATTAGTTTTCTGCCTCGACAGGATTTCGAACTAACCGAACGATCAAAATATTCGTTATTCCCATCAAAATAATATTTTTATCTGGACTGCTCTGTCCAAATGCCTTTGAGGTTGAGTTCAAACCATAGTGGAACCAACTTAGAAGATTTATTATCCTACAATTTTGCCACATCAAATGTAGCAAAGTTAAATGGTTGTTCATGTAACCCAAGTTATAAAAGCAATATTAGTAACTAATTCACTTT
ATTTTTGTCTTCA

mRNA sequence

AAAATAACCAAAACTCACACATAGAAAAATATCGATCTCAAAAAATGAGAAAATTTTAATAAAACCATCCAAATAATTACGAATTTTCTATTACCTAATAAATTCTAGAAAAGGAAGACGAAGAGAGGAAGAAGAAAGAAGAAAGAAGAAAGAATAAAGAATGGGCAAACCTTGGAAAATGAACCAAAAAAAATGGTAAACCCTATGTAATATTAATCCGTAAAAGGATGAGTGCCCTTTCGTTATTGGCTCACTGTGTCAAAAGTAGGATCCTCATTCAATTAGCAGAAACGGTAAGCAGAATTAAACGTAACGCAATGAAATCACTTTTTCGAATTTCGTTTGATTTTCAGTAAAAACACCCTAGATTCAAACTATAGTAGTCTAGTCACTCATTTAAAATTTTACAAATACTCGAGAATATATATAATTGAGATGAAATCAGACAAATGACTTTACTAGAAATAAAAAAAAAGCATAGATTTTATTTATTCAAAAAGCGCCGCAGTGTGATTTTGGCCTGCGGAAGCGCTGAGCGAAGGAAGGAGCAAAGAATCCCGAAGCTGAAAATTATGGCGAAAAGGTTCATTTCCATATTCAAGCGCTCTTCAACTCCACACGCTTCCAGTTCCATAAAGCCAGCGGAGGAAGGGGTGAATAAATCCTGGGGTCGTAAAGCAGCCTCCTTTGTGCTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTCGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAATCAAGCAATTATAGATGCTATTGGAGAACCCATTGTTAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAAAGACATTCTCTATCCTGCACATTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAATTGAAGGCAGTTCGTAATGGAGAGGACTCCTGGATTTCTTTTCTCCGGCCACGTGACTGGGACATTCTAATCATGGACGCTCTCCTTCATGTTCCTGCAAACGAAGGTAAGCAGCAAACATTGCGCATCAATCTAACTGAGAAGTTTACCCCTGCTGCTTGTGTTGCATGCACTGATTTTCAGCCTCCGGAGTCAGAGAAGAGATGAACTCAGCTCGATTTCGAACTAACCGAACGATCAAAATATTCGTTATTCCCATCAAAATAATATTTTTATCTGGACTGCTCTGTCCAAATGCCTTTGAGGTTGAGTTCAAACCATAGTGGAACCAACTTAGAAGATTTATTATCCTACAATTTTGCCACATCAAATGTAGCAAAGTTAAATGGTTGTTCATGTAACCCAAGTTATAAAAGCAATATTAGTAACTAATTCACTTTATTTTTGTCTTCA

Coding sequence (CDS)

ATGACTTTACTAGAAATAAAAAAAAAGCATAGATTTTATTTATTCAAAAAGCGCCGCAGTGTGATTTTGGCCTGCGGAAGCGCTGAGCGAAGGAAGGAGCAAAGAATCCCGAAGCTGAAAATTATGGCGAAAAGGTTCATTTCCATATTCAAGCGCTCTTCAACTCCACACGCTTCCAGTTCCATAAAGCCAGCGGAGGAAGGGGTGAATAAATCCTGGGGTCGTAAAGCAGCCTCCTTTGTGCTTATTACTGTTACTGGTGGTGTAGCTTTGAGTGCTTTAGATGACCTCGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAATCAAGCAATTATAGATGCTATTGGAGAACCCATTGTTAAAGGTCCATGGTACAATGCATCACTTGCAGTAGCTCATAAAAGACATTCTCTATCCTGCACATTTCCAGTATCAGGACCACAAGGCACAGGGATCCTCCAATTGAAGGCAGTTCGTAATGGAGAGGACTCCTGGATTTCTTTTCTCCGGCCACGTGACTGGGACATTCTAATCATGGACGCTCTCCTTCATGTTCCTGCAAACGAAGGTAAGCAGCAAACATTGCGCATCAATCTAACTGAGAAGTTTACCCCTGCTGCTTGTGTTGCATGCACTGATTTTCAGCCTCCGGAGTCAGAGAAGAGATGA

Protein sequence

MTLLEIKKKHRFYLFKKRRSVILACGSAERRKEQRIPKLKIMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR
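
The protein sequence above corresponds to the CDS read codon by codon. A small sketch of that translation under the standard genetic code follows; the names `CODON_TABLE` and `translate` are illustrative, and the two inputs are only the first 21 and last 12 nucleotides of the CDS listed in this record, not the full sequence.

```python
from itertools import product

# Standard genetic code, laid out in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    Trailing bases that do not fill a codon are ignored.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGACTTTACTAGAAATAAAA"))  # start of the CDS
print(translate("GAGAAGAGATGA"))           # end of the CDS, including the stop
```

The two fragments translate to MTLLEIK and EKR, matching the first and last residues of the listed protein.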
Homology
BLAST of Tan0008308 vs. NCBI nr
Match: XP_023547893.1 (uncharacterized protein LOC111806704 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 362.5 bits (929), Expect = 2.7e-96
Identity = 178/205 (86.83%), Positives = 192/205 (93.66%), Query Frame = 0

Query: 27  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVT 86
           S ER +E+R  KLK++AKRF+SIFKRS TP+ASSS++P+EEGVNKSWGRKA SFVL+TVT
Sbjct: 21  SGERLRERRTHKLKMLAKRFLSIFKRSPTPNASSSVRPSEEGVNKSWGRKAVSFVLVTVT 80

Query: 87  GGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSC 146
           GGVALSALDDLAIYHSCSSKAIEKA+NN+AIIDAIGEPIVKGPWYNASLAVAHKRHSLSC
Sbjct: 81  GGVALSALDDLAIYHSCSSKAIEKAKNNKAIIDAIGEPIVKGPWYNASLAVAHKRHSLSC 140

Query: 147 TFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLT 206
           TFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDILIMDALLHVP NEGKQQTLRINLT
Sbjct: 141 TFPVSGPQGTGIVQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPENEGKQQTLRINLT 200

Query: 207 EKFTP---AACVACTDFQPPESEKR 229
           EKF P   AACVACTD Q PE+EKR
Sbjct: 201 EKFAPAAAAACVACTDCQSPEAEKR 225

BLAST of Tan0008308 vs. NCBI nr
Match: XP_038876219.1 (uncharacterized protein LOC120068500 isoform X2 [Benincasa hispida])

HSP 1 Score: 357.8 bits (917), Expect = 6.8e-95
Identity = 171/188 (90.96%), Positives = 181/188 (96.28%), Query Frame = 0

Query: 41  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIY 100
           ++AKRF+SIFKRS  PHASSSIKP+E+GVNKSWGRKA SFVLITVTGGVALSALDDLAIY
Sbjct: 1   MLAKRFVSIFKRSPIPHASSSIKPSEDGVNKSWGRKAVSFVLITVTGGVALSALDDLAIY 60

Query: 101 HSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQ 160
           HSCSSKAIEKARNNQA+IDAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQ
Sbjct: 61  HSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQ 120

Query: 161 LKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDF 220
           LKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQ+T+RINLTEKF PAACV+CTD 
Sbjct: 121 LKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQKTMRINLTEKFAPAACVSCTDC 180

Query: 221 QPPESEKR 229
           QPPE+E R
Sbjct: 181 QPPETETR 188

BLAST of Tan0008308 vs. NCBI nr
Match: XP_023547892.1 (uncharacterized protein LOC111806704 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 357.8 bits (917), Expect = 6.8e-95
Identity = 178/206 (86.41%), Positives = 192/206 (93.20%), Query Frame = 0

Query: 27  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHA-SSSIKPAEEGVNKSWGRKAASFVLITV 86
           S ER +E+R  KLK++AKRF+SIFKRS TP+A SSS++P+EEGVNKSWGRKA SFVL+TV
Sbjct: 21  SGERLRERRTHKLKMLAKRFLSIFKRSPTPNASSSSVRPSEEGVNKSWGRKAVSFVLVTV 80

Query: 87  TGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLS 146
           TGGVALSALDDLAIYHSCSSKAIEKA+NN+AIIDAIGEPIVKGPWYNASLAVAHKRHSLS
Sbjct: 81  TGGVALSALDDLAIYHSCSSKAIEKAKNNKAIIDAIGEPIVKGPWYNASLAVAHKRHSLS 140

Query: 147 CTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINL 206
           CTFPVSGPQGTGI+QLKAVRNGEDSWISFLRPRDWDILIMDALLHVP NEGKQQTLRINL
Sbjct: 141 CTFPVSGPQGTGIVQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPENEGKQQTLRINL 200

Query: 207 TEKFTP---AACVACTDFQPPESEKR 229
           TEKF P   AACVACTD Q PE+EKR
Sbjct: 201 TEKFAPAAAAACVACTDCQSPEAEKR 226

BLAST of Tan0008308 vs. NCBI nr
Match: XP_008459968.1 (PREDICTED: uncharacterized protein LOC103498925 isoform X1 [Cucumis melo])

HSP 1 Score: 354.0 bits (907), Expect = 9.8e-94
Identity = 175/203 (86.21%), Positives = 188/203 (92.61%), Query Frame = 0

Query: 27  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITV 86
           +++R KEQ  PKLK++AKRF SIFKRSSTP+ASS SIKP+E  VNKSWGRKA SFVLITV
Sbjct: 6   NSQRLKEQGSPKLKMLAKRFASIFKRSSTPYASSNSIKPSENEVNKSWGRKAVSFVLITV 65

Query: 87  TGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLS 146
           TGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI KGPWYNASLAVAHKRHSLS
Sbjct: 66  TGGVALSALDDLAIYHSCSSKAIEKARNNQAVKDAIGEPIAKGPWYNASLAVAHKRHSLS 125

Query: 147 CTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINL 206
           CTFPVSGPQG GILQLKAVRNGEDSWISFLRPRDWDIL+MDALL+VP NEGKQ+TLRINL
Sbjct: 126 CTFPVSGPQGAGILQLKAVRNGEDSWISFLRPRDWDILMMDALLYVPENEGKQKTLRINL 185

Query: 207 TEKFTPAACVACTDFQPPESEKR 229
           TEKF PAACV+CT  QPPE+EKR
Sbjct: 186 TEKFAPAACVSCTGCQPPETEKR 208

BLAST of Tan0008308 vs. NCBI nr
Match: XP_038876218.1 (uncharacterized protein LOC120068500 isoform X1 [Benincasa hispida])

HSP 1 Score: 353.2 bits (905), Expect = 1.7e-93
Identity = 171/189 (90.48%), Positives = 181/189 (95.77%), Query Frame = 0

Query: 41  IMAKRFISIFKRSSTPHA-SSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAI 100
           ++AKRF+SIFKRS  PHA SSSIKP+E+GVNKSWGRKA SFVLITVTGGVALSALDDLAI
Sbjct: 1   MLAKRFVSIFKRSPIPHASSSSIKPSEDGVNKSWGRKAVSFVLITVTGGVALSALDDLAI 60

Query: 101 YHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIL 160
           YHSCSSKAIEKARNNQA+IDAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIL
Sbjct: 61  YHSCSSKAIEKARNNQAVIDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIL 120

Query: 161 QLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTD 220
           QLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQ+T+RINLTEKF PAACV+CTD
Sbjct: 121 QLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQKTMRINLTEKFAPAACVSCTD 180

Query: 221 FQPPESEKR 229
            QPPE+E R
Sbjct: 181 CQPPETETR 189

BLAST of Tan0008308 vs. ExPASy TrEMBL
Match: A0A1S3CBF0 (uncharacterized protein LOC103498925 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498925 PE=4 SV=1)

HSP 1 Score: 354.0 bits (907), Expect = 4.7e-94
Identity = 175/203 (86.21%), Positives = 188/203 (92.61%), Query Frame = 0

Query: 27  SAERRKEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITV 86
           +++R KEQ  PKLK++AKRF SIFKRSSTP+ASS SIKP+E  VNKSWGRKA SFVLITV
Sbjct: 6   NSQRLKEQGSPKLKMLAKRFASIFKRSSTPYASSNSIKPSENEVNKSWGRKAVSFVLITV 65

Query: 87  TGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLS 146
           TGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI KGPWYNASLAVAHKRHSLS
Sbjct: 66  TGGVALSALDDLAIYHSCSSKAIEKARNNQAVKDAIGEPIAKGPWYNASLAVAHKRHSLS 125

Query: 147 CTFPVSGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINL 206
           CTFPVSGPQG GILQLKAVRNGEDSWISFLRPRDWDIL+MDALL+VP NEGKQ+TLRINL
Sbjct: 126 CTFPVSGPQGAGILQLKAVRNGEDSWISFLRPRDWDILMMDALLYVPENEGKQKTLRINL 185

Query: 207 TEKFTPAACVACTDFQPPESEKR 229
           TEKF PAACV+CT  QPPE+EKR
Sbjct: 186 TEKFAPAACVSCTGCQPPETEKR 208

BLAST of Tan0008308 vs. ExPASy TrEMBL
Match: A0A5D3DMW4 (Vacuolar sorting-associated protein 62 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G001570 PE=4 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 6.8e-93
Identity = 173/198 (87.37%), Positives = 184/198 (92.93%), Query Frame = 0

Query: 32  KEQRIPKLKIMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVA 91
           +EQ  PKLK++AKRF SIFKRSSTP+ASS SIKP+E  VNKSWGRKA SFVLITVTGGVA
Sbjct: 522 EEQGSPKLKMLAKRFASIFKRSSTPYASSNSIKPSENEVNKSWGRKAVSFVLITVTGGVA 581

Query: 92  LSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPV 151
           LSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI KGPWYNASLAVAHKRHSLSCTFPV
Sbjct: 582 LSALDDLAIYHSCSSKAIEKARNNQAVKDAIGEPIAKGPWYNASLAVAHKRHSLSCTFPV 641

Query: 152 SGPQGTGILQLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFT 211
           SGPQG GILQLKAVRNGEDSWISFLRPRDWDIL+MDALL+VP NEGKQ+TLRINLTEKF 
Sbjct: 642 SGPQGAGILQLKAVRNGEDSWISFLRPRDWDILMMDALLYVPENEGKQKTLRINLTEKFA 701

Query: 212 PAACVACTDFQPPESEKR 229
           PAACV+CT  QPPE+EKR
Sbjct: 702 PAACVSCTGCQPPETEKR 719

BLAST of Tan0008308 vs. ExPASy TrEMBL
Match: A0A6J1JLR1 (uncharacterized protein LOC111488050 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111488050 PE=4 SV=1)

HSP 1 Score: 347.8 bits (891), Expect = 3.4e-92
Identity = 169/190 (88.95%), Positives = 181/190 (95.26%), Query Frame = 0

Query: 41  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIY 100
           ++AKRF+SIFKRS TP+ASSS++P+EEGVNKSWGRKA SFVL+TVTGGVALSALDDLAIY
Sbjct: 1   MLAKRFLSIFKRSPTPNASSSVRPSEEGVNKSWGRKAVSFVLVTVTGGVALSALDDLAIY 60

Query: 101 HSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQ 160
           HSCSSKAIEKA+NN+AIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI+Q
Sbjct: 61  HSCSSKAIEKAKNNKAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIVQ 120

Query: 161 LKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAA--CVACT 220
           LKAVRNGEDSWISFLRPRDWDILIMDALLHVP NEGKQQTLRINLTEKF PAA  CVACT
Sbjct: 121 LKAVRNGEDSWISFLRPRDWDILIMDALLHVPENEGKQQTLRINLTEKFAPAAAPCVACT 180

Query: 221 DFQPPESEKR 229
           D Q P +EKR
Sbjct: 181 DCQSPTAEKR 190

BLAST of Tan0008308 vs. ExPASy TrEMBL
Match: A0A0A0KF26 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G087990 PE=4 SV=1)

HSP 1 Score: 345.1 bits (884), Expect = 2.2e-91
Identity = 169/189 (89.42%), Positives = 178/189 (94.18%), Query Frame = 0

Query: 41  IMAKRFISIFKRSSTPHASS-SIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAI 100
           ++AKRF SIFKRSSTPHASS SIKP E  VNKSWGRKA SFVLITVTGGVALSALDDLAI
Sbjct: 1   MLAKRFASIFKRSSTPHASSNSIKPTENEVNKSWGRKAVSFVLITVTGGVALSALDDLAI 60

Query: 101 YHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIL 160
           YHSCSSKAIEK RNNQA+IDAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIL
Sbjct: 61  YHSCSSKAIEKVRNNQAVIDAIGEPIDKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIL 120

Query: 161 QLKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTD 220
           QLKAVRNGEDSWISFLRPRDWDIL+MDALL+VP NEGKQ+TLRINL+EKF PAACV+CTD
Sbjct: 121 QLKAVRNGEDSWISFLRPRDWDILMMDALLYVPENEGKQKTLRINLSEKFAPAACVSCTD 180

Query: 221 FQPPESEKR 229
            QPPE+EKR
Sbjct: 181 CQPPETEKR 189

BLAST of Tan0008308 vs. ExPASy TrEMBL
Match: A0A6J1GQQ6 (uncharacterized protein LOC111456591 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111456591 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 6.4e-91
Identity = 166/190 (87.37%), Positives = 180/190 (94.74%), Query Frame = 0

Query: 41  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIY 100
           ++AKR +SIFKRS TP+ASSS++P+EEGVNKSWGR A SFVL+TVTGGVALSALDDLAIY
Sbjct: 1   MLAKRLLSIFKRSPTPNASSSVRPSEEGVNKSWGRNAVSFVLVTVTGGVALSALDDLAIY 60

Query: 101 HSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQ 160
           HSCSSKAIEKA+NN+AIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI+Q
Sbjct: 61  HSCSSKAIEKAKNNKAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIVQ 120

Query: 161 LKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTP--AACVACT 220
           LKAVRNGE+SWISFLRPRDWDILIMDALLHVP NEGKQQT+RINLTEKF P  AACVACT
Sbjct: 121 LKAVRNGEESWISFLRPRDWDILIMDALLHVPENEGKQQTMRINLTEKFAPAAAACVACT 180

Query: 221 DFQPPESEKR 229
           D Q PE+EKR
Sbjct: 181 DCQSPEAEKR 190

BLAST of Tan0008308 vs. TAIR 10
Match: AT2G20390.1 (unknown protein; Has 50 Blast hits to 50 proteins in 18 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 48; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 239.2 bits (609), Expect = 3.3e-63
Identity = 125/187 (66.84%), Positives = 145/187 (77.54%), Query Frame = 0

Query: 41  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIY 100
           + A+RF S FK SST   SS  K A  G   S+GRKA SFVLITVTGGVALSALDDL+IY
Sbjct: 1   MFARRFTSFFKGSST---SSPDKTA--GTLGSFGRKAVSFVLITVTGGVALSALDDLSIY 60

Query: 101 HSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQ 160
             CSSKA+EK  N++ +I+AIGEPI KGPWYNASLAV+H+RHS+SC+FPV GPQGTGIL 
Sbjct: 61  RGCSSKAMEKVMNSKVMIEAIGEPIEKGPWYNASLAVSHQRHSVSCSFPVIGPQGTGILH 120

Query: 161 LKAVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDF 220
           LKAVRNGEDS   FL+ RDWDILIMDAL+HVP+NEG QQTLRIN+T+   P+        
Sbjct: 121 LKAVRNGEDSMFGFLQQRDWDILIMDALVHVPSNEGPQQTLRINVTDIVDPSPGTHDKPL 180

Query: 221 QPPESEK 228
           +P E EK
Sbjct: 181 EPLEPEK 182

BLAST of Tan0008308 vs. TAIR 10
Match: AT2G20390.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages. )

HSP 1 Score: 221.1 bits (562), Expect = 9.2e-58
Identity = 125/223 (56.05%), Positives = 145/223 (65.02%), Query Frame = 0

Query: 41  IMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFVLITVTGGVALSALDDLAIY 100
           + A+RF S FK SST   SS  K A  G   S+GRKA SFVLITVTGGVALSALDDL+IY
Sbjct: 1   MFARRFTSFFKGSST---SSPDKTA--GTLGSFGRKAVSFVLITVTGGVALSALDDLSIY 60

Query: 101 HSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQ 160
             CSSKA+EK  N++ +I+AIGEPI KGPWYNASLAV+H+RHS+SC+FPV GPQGTGIL 
Sbjct: 61  RGCSSKAMEKVMNSKVMIEAIGEPIEKGPWYNASLAVSHQRHSVSCSFPVIGPQGTGILH 120

Query: 161 LKAVRNG------------------------------------EDSWISFLRPRDWDILI 220
           LKAVRNG                                    EDS   FL+ RDWDILI
Sbjct: 121 LKAVRNGGKHSQTRTVTQRQNICVRYLHLPFFLLIGSPFLLGIEDSMFGFLQQRDWDILI 180

Query: 221 MDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEK 228
           MDAL+HVP+NEG QQTLRIN+T+   P+        +P E EK
Sbjct: 181 MDALVHVPSNEGPQQTLRINVTDIVDPSPGTHDKPLEPLEPEK 218

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023547893.1 | 2.7e-96 | 86.83 | uncharacterized protein LOC111806704 isoform X2 [Cucurbita pepo subsp. pepo]
XP_038876219.1 | 6.8e-95 | 90.96 | uncharacterized protein LOC120068500 isoform X2 [Benincasa hispida]
XP_023547892.1 | 6.8e-95 | 86.41 | uncharacterized protein LOC111806704 isoform X1 [Cucurbita pepo subsp. pepo]
XP_008459968.1 | 9.8e-94 | 86.21 | PREDICTED: uncharacterized protein LOC103498925 isoform X1 [Cucumis melo]
XP_038876218.1 | 1.7e-93 | 90.48 | uncharacterized protein LOC120068500 isoform X1 [Benincasa hispida]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A1S3CBF0 | 4.7e-94 | 86.21 | uncharacterized protein LOC103498925 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3DMW4 | 6.8e-93 | 87.37 | Vacuolar sorting-associated protein 62 OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A6J1JLR1 | 3.4e-92 | 88.95 | uncharacterized protein LOC111488050 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0KF26 | 2.2e-91 | 89.42 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G087990 PE=4 SV=1
A0A6J1GQQ6 | 6.4e-91 | 87.37 | uncharacterized protein LOC111456591 isoform X2 OS=Cucurbita moschata OX=3662 GN...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G20390.1 | 3.3e-63 | 66.84 | unknown protein; Has 50 Blast hits to 50 proteins in 18 species: Archae - 0; Bac...
AT2G20390.2 | 9.2e-58 | 56.05 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR014807 | Cytochrome c oxidase assembly factor 1 | PFAM | PF08695 | Coa1 | coord: 91..169; e-value: 5.6E-6; score: 26.1
None | No IPR available | PANTHER | PTHR35114:SF1 | CYTOCHROME OXIDASE COMPLEX ASSEMBLY PROTEIN | coord: 42..227
None | No IPR available | PANTHER | PTHR35114 | CYTOCHROME OXIDASE COMPLEX ASSEMBLY PROTEIN | coord: 42..227
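
The coord values in the InterPro table are positions on the protein sequence. A rough sketch of extracting the residues covered by a hit is given here. It assumes the coordinates are 1-based and inclusive, which this record does not state explicitly, and the helper name `slice_feature` is hypothetical. The protein string is the one listed under "Protein sequence" in this record.

```python
PROTEIN = (
    "MTLLEIKKKHRFYLFKKRRSVILACGSAERRKEQRIPKLKIMAKRFISIFKRSSTPHASSSIKPAEEGVNKSWGRKAASFV"
    "LITVTGGVALSALDDLAIYHSCSSKAIEKARNNQAIIDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLK"
    "AVRNGEDSWISFLRPRDWDILIMDALLHVPANEGKQQTLRINLTEKFTPAACVACTDFQPPESEKR"
)

def slice_feature(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return sequence[start - 1:end]

# PFAM PF08695 (Coa1) hit reported at coord: 91..169
coa1_region = slice_feature(PROTEIN, 91, 169)
print(len(coa1_region), coa1_region)
```

With inclusive coordinates the PF08695 hit spans 79 residues, and the PANTHER hits at 42..227 would span 186.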

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0008308.1 | Tan0008308.1 | mRNA
Tan0008308.2 | Tan0008308.2 | mRNA