Tan0012892.1 (mRNA) Snake gourd v1

Overview
Name: Tan0012892.1
Type: mRNA
Organism: Trichosanthes anguina (Snake gourd v1)
Description: NADH-ubiquinone oxidoreductase chain 5
Location: LG02: 67263623 .. 67266118 (+)
Sequence length: 827
RNA-Seq Expression: Tan0012892.1
Synteny: Tan0012892.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAAGGGAGGGGGGTGGTAGTAAGAGAAGGTCCACGCAACGCAACAAACCCCCTCAACCTCAACCCCAACCCCAACCTGAACGACTGCGAATTGAACGCTCATGCTCTCTCTCAATTCCATTTCTCAATTCCTCCTTCAATTCCTCTGTCGCTGCTGTGATAAATCCTCCATTCTCCACCATCGCCCTTCTTCCCTTGAGGCAAAGAAAAGAAGGTCTCGGGGCGCAGCCATGATTACTCGATCGAATCTGGTTGAGCAGTTGAGAGAATACCAGATTCGATCTAAACACGAGTGGGCTTCCGTCTCCTTCTTCTCATCTACCTCCAATATTTCTTCTTCCAGGTATCTCCACTCCAATCAATCCATTCATTTCTGCTTCCCATTTCCTTATCTTCCAAATTCATTCTTTCATACCTATGTTTCTGATTGTCAGATTTTGATTCTTCGGATCCTTAATTTAAGGCGTTGGTTTTGTTTTGCTTTCAATTGGGTCGGTTCTGATCGATTAAACTAGGGTTTTGTGTTCTTTAAACGCTGAATTAGAGGATTTGGATTTGGGATCATTGTAGATGTTGACAGACTCTGTCCCTAATTTGTTTTATTTTATTGTCTGGTATTTTTTATCACGGAAATTCTCGTTTCTTTCGTCAATAAGCTTTTGATTTTTGTAAGATCGTATTGGAAACGTTGGTTTCTTTTGTCCTAGGCAGTACTGTTCAATGGACACGGGGGTTTTTATTAACTCATAGGGGAATAATAGTGTTCATGTGGAATTTGGTACCCGAAACGGGATTCTCATAAATAAATGTCATGGCATGATGCATCATATCATCCTCTGGGTGTCAAGATAATTGCTTTGTCCGACAATGTGCTATGGTAATTGTTCAATTGAAGTCTTTTATGCGAGTAATTACTTTTTCGAGAGCGTGGTCACGAGTGCCTTTAGCCGTAGCTTGTAAATTTCCTTTACTCCTAGCTTAGCATCAAATCCAATGCTTATGTTATGTAACTGGATATTGACAGACACTCATGACGCCTTGATTGCTAATTGCATTGTAGTTGATGAAGAAATGTTGAAAATTGGGCCAAAGTTCTCAGATTATTCAATCAATGTTTCCTATAAGAAACTCAATTGTATTTTTGTTTTAGCATCCAATTACGTATCTCTTGTGGGTCAAGTTCTATGTTTCAAATTACATGATCTTGGTTATACATTTTCTTGGTTGTTATTGTTAATTTGTCTAATCTAAATTGGGAGAGGAACTCTTGCCCCCAAGGGGCGGACCCGTTGGTAAGGACTTGGAGTCTCTTGGTCATACTGGCTTAGAGATCTTAGGTTCGAGCTCTTGGGCGAGCATAATAAATAAAAACCTCTGATGTCTCTCGGGTCCGAGCCTTAGAGCAGGCATGTATACCCGGGTATAAGGGAGCAAAGCTTTGACTCTTGGTCATCCCTAAAAAAAAAAATTGGAGAACACTTGTATTTTCTTACGACATTGTATAAGAGGACATTGTATAAGAGGGCAAAGCTCTGACTCTTTGGTTATCCCTAAAAAAAAAGGGCAGAACTCTTGTATTTTCTTAGGACGTTGTTAATACTCAATGTCCTAAAAATCTTGATTTGCCTGTTTGGGTTCGAAAAAAATGTTACGTCAGTGCTTATAATTGGAAATTGGAAGTGGGGAATTGTGCATGATAGGAATTTAATATGCAGTTTTTCTTTGAGTTGGTGATTGATTAAGAAGAATTTTCACTTTTGCCAACTGATGTATCTTTTCATCTTATGTACAATCTATTCAAATAGATATCAGCAGGCTGGTCCGGGACTCTGTGTTACTGTCCATGACAGTTTTTTTATTGTTTGGTTTATGTATGTTGGGATTCACTCTCTTATTACAATGAACATGAATTAGATCCCATTGGTGATTTATTTATGCACTATTTGTTTGCAAAGGTATTTCATATAATATCAATTCTGATATGGGTTTTCGTTCTTCTAT
TTTGGATGCAGGGTGGATGTTGTCATTTTTGTAATATGGGAACTCATTATTCTAGCGTTCTTGGTCTTTTCAGCAGTTTCTTTATATTTTCGGCATATGCAGTTGGCTTTTATTTTAGCATGCATCACGATGTTGTTGCTTCTATGCATGAAAGTTACAAAGCAAGTGAGATTGGCTAGGAAGAAGAAAAGAAGGATGCTTCTTCCATTGTCCATGTGAAAAACTGATAGTAGAAGGAGAGATATACTAAACTTTTTTATGTTCTAAATGCCCCGAATTATATTTACAAAAATTAGATTTGGTCCTTTCTCCTCCCACTCTTTGTTATATTGGGATCTGTAATCACCAGGAAAGCATGTTTATCAAAAATGAGAAAAAAAAATTTGAAAAAAGGAATGAGATTGATGCGAGTAGTTTAGAAATTTGTGAATGCATTCTTATCTGAGAGTTTGGACCATCTTCAATGATTTTCCCTTTCTTAATCCTTTTTAGTTGT

mRNA sequence

GAAGGGAGGGGGGTGGTAGTAAGAGAAGGTCCACGCAACGCAACAAACCCCCTCAACCTCAACCCCAACCCCAACCTGAACGACTGCGAATTGAACGCTCATGCTCTCTCTCAATTCCATTTCTCAATTCCTCCTTCAATTCCTCTGTCGCTGCTGTGATAAATCCTCCATTCTCCACCATCGCCCTTCTTCCCTTGAGGCAAAGAAAAGAAGGTCTCGGGGCGCAGCCATGATTACTCGATCGAATCTGGTTGAGCAGTTGAGAGAATACCAGATTCGATCTAAACACGAGTGGGCTTCCGTCTCCTTCTTCTCATCTACCTCCAATATTTCTTCTTCCAGGGTGGATGTTGTCATTTTTGTAATATGGGAACTCATTATTCTAGCGTTCTTGGTCTTTTCAGCAGTTTCTTTATATTTTCGGCATATGCAGTTGGCTTTTATTTTAGCATGCATCACGATGTTGTTGCTTCTATGCATGAAAGTTACAAAGCAAGTGAGATTGGCTAGGAAGAAGAAAAGAAGGATGCTTCTTCCATTGTCCATGTGAAAAACTGATAGTAGAAGGAGAGATATACTAAACTTTTTTATGTTCTAAATGCCCCGAATTATATTTACAAAAATTAGATTTGGTCCTTTCTCCTCCCACTCTTTGTTATATTGGGATCTGTAATCACCAGGAAAGCATGTTTATCAAAAATGAGAAAAAAAAATTTGAAAAAAGGAATGAGATTGATGCGAGTAGTTTAGAAATTTGTGAATGCATTCTTATCTGAGAGTTTGGACCATCTTCAATGATTTTCCCTTTCTTAATCCTTTTTAGTTGT

Coding sequence (CDS)

ATGCTCTCTCTCAATTCCATTTCTCAATTCCTCCTTCAATTCCTCTGTCGCTGCTGTGATAAATCCTCCATTCTCCACCATCGCCCTTCTTCCCTTGAGGCAAAGAAAAGAAGGTCTCGGGGCGCAGCCATGATTACTCGATCGAATCTGGTTGAGCAGTTGAGAGAATACCAGATTCGATCTAAACACGAGTGGGCTTCCGTCTCCTTCTTCTCATCTACCTCCAATATTTCTTCTTCCAGGGTGGATGTTGTCATTTTTGTAATATGGGAACTCATTATTCTAGCGTTCTTGGTCTTTTCAGCAGTTTCTTTATATTTTCGGCATATGCAGTTGGCTTTTATTTTAGCATGCATCACGATGTTGTTGCTTCTATGCATGAAAGTTACAAAGCAAGTGAGATTGGCTAGGAAGAAGAAAAGAAGGATGCTTCTTCCATTGTCCATGTGA
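The relationship between the coding sequence above and the protein sequence can be checked with a small translation routine. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear transcript; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS read in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six and last five codons of the CDS listed above.
cds_start = "ATGCTCTCTCTCAATTCC"
cds_end = "CCATTGTCCATGTGA"
print(translate(cds_start))  # MLSLNS
print(translate(cds_end))    # PLSM*
```

Passing the full 450-nucleotide CDS to `translate` should give the protein sequence followed by a terminal `*` for the TGA stop codon.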

Protein sequence

MLSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRGAAMITRSNLVEQLREYQIRSKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILACITMLLLLCMKVTKQVRLARKKKRRMLLPLSM
Homology
BLAST of Tan0012892.1 vs. NCBI nr
Match: XP_038884232.1 (uncharacterized protein LOC120075129 [Benincasa hispida])

HSP 1 Score: 244.2 bits (622), Expect = 7.1e-61
Identity = 138/149 (92.62%), Positives = 145/149 (97.32%), Query Frame = 0

Query: 1   MLSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRGAAMITRSNLVEQLREYQIR 60
           MLSLNSISQFL++F+CRC  KSSILH RPSSLEAKKR+SRG+AMITRSNLVEQLREYQIR
Sbjct: 1   MLSLNSISQFLVRFVCRCV-KSSILHPRPSSLEAKKRKSRGSAMITRSNLVEQLREYQIR 60

Query: 61  SKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILACIT 120
           SKHEWASVSFFSSTSNI+SSRVDVVIFVIWELIIL+FLVFSAVSLYFRHMQLAFIL CIT
Sbjct: 61  SKHEWASVSFFSSTSNITSSRVDVVIFVIWELIILSFLVFSAVSLYFRHMQLAFILVCIT 120

Query: 121 MLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           MLLLLCMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 121 MLLLLCMKVTKQVRLARKKKRRMLLPLSM 148
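The percentages on each HSP summary line are simply the reported count divided by the alignment length. The sketch below parses one such line and recomputes them; the pattern and the function name `parse_hsp_stats` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 138/149 (92.62%)";
# the label is captured as-is, whatever its spelling.
STAT_PATTERN = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Return {label: {count, total, reported, computed}} for one HSP line."""
    stats = {}
    for label, count, total, reported in STAT_PATTERN.findall(line):
        count, total = int(count), int(total)
        stats[label] = {
            "count": count,
            "total": total,
            "reported": float(reported),
            "computed": round(100 * count / total, 2),
        }
    return stats


line = "Identity = 138/149 (92.62%), Positives = 145/149 (97.32%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["Identity"]["computed"])   # 92.62
print(stats["Positives"]["computed"])  # 97.32
```

The "Query Frame = 0" field carries no fraction, so the pattern skips it.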

BLAST of Tan0012892.1 vs. NCBI nr
Match: XP_008464874.1 (PREDICTED: uncharacterized protein LOC103502637 isoform X1 [Cucumis melo])

HSP 1 Score: 233.8 bits (595), Expect = 9.6e-58
Identity = 137/151 (90.73%), Positives = 143/151 (94.70%), Query Frame = 0

Query: 1   MLSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRG--AAMITRSNLVEQLREYQ 60
           MLSLNSISQFL++FLCRC  +SSILH   SSLEAKKRRSRG  +AMITRSNLVEQLREYQ
Sbjct: 1   MLSLNSISQFLVRFLCRCF-RSSILHLPSSSLEAKKRRSRGSASAMITRSNLVEQLREYQ 60

Query: 61  IRSKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILAC 120
           IRSKHEWASVSFFSSTSNI+SSRVDVVIFVIWELIIL+FLVFSAVSLYFRHMQLAFIL C
Sbjct: 61  IRSKHEWASVSFFSSTSNITSSRVDVVIFVIWELIILSFLVFSAVSLYFRHMQLAFILVC 120

Query: 121 ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 121 ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150

BLAST of Tan0012892.1 vs. NCBI nr
Match: XP_004144225.1 (uncharacterized protein LOC101213289 [Cucumis sativus] >KGN47556.1 hypothetical protein Csa_018918 [Cucumis sativus])

HSP 1 Score: 232.3 bits (591), Expect = 2.8e-57
Identity = 137/151 (90.73%), Positives = 141/151 (93.38%), Query Frame = 0

Query: 1   MLSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRG--AAMITRSNLVEQLREYQ 60
           MLSLNSISQFL+ FLCRC   SSILH   SSLEAKKRRSRG  +AMITRSNLVEQLREYQ
Sbjct: 1   MLSLNSISQFLVSFLCRCF-HSSILHLPSSSLEAKKRRSRGSASAMITRSNLVEQLREYQ 60

Query: 61  IRSKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILAC 120
           IRSKHEWASVSFFSSTSNI+SSRVDVVIFVIWELIIL+FLVFSAVSLYFRHMQLAFIL C
Sbjct: 61  IRSKHEWASVSFFSSTSNITSSRVDVVIFVIWELIILSFLVFSAVSLYFRHMQLAFILVC 120

Query: 121 ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 121 ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150

BLAST of Tan0012892.1 vs. NCBI nr
Match: XP_022951473.1 (uncharacterized protein LOC111454282 [Cucurbita moschata])

HSP 1 Score: 222.6 bits (566), Expect = 2.2e-54
Identity = 130/148 (87.84%), Positives = 135/148 (91.22%), Query Frame = 0

Query: 2   LSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRGAAMITRSNLVEQLREYQIRS 61
           LSLNSISQFLL F+  C  KSS LH RP+S EAK+ RSRGAAMITRSNLVEQLREYQIRS
Sbjct: 20  LSLNSISQFLLHFVSHCF-KSSDLHRRPTSPEAKETRSRGAAMITRSNLVEQLREYQIRS 79

Query: 62  KHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILACITM 121
           KHEWASVS FSS SNI+SSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLA IL CITM
Sbjct: 80  KHEWASVSLFSSASNITSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLASILVCITM 139

Query: 122 LLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           LLL+CMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 140 LLLICMKVTKQVRLARKKKRRMLLPLSM 166

BLAST of Tan0012892.1 vs. NCBI nr
Match: KAG7020712.1 (hypothetical protein SDJN02_17399 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 220.3 bits (560), Expect = 1.1e-53
Identity = 129/148 (87.16%), Positives = 134/148 (90.54%), Query Frame = 0

Query: 2   LSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRGAAMITRSNLVEQLREYQIRS 61
           LSLNSISQFLL  +  C  KSS LH RP+S EAK+ RSRGAAMITRSNLVEQLREYQIRS
Sbjct: 24  LSLNSISQFLLHIVSHCF-KSSDLHRRPTSPEAKETRSRGAAMITRSNLVEQLREYQIRS 83

Query: 62  KHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILACITM 121
           KHEWASVS FSS SNI+SSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLA IL CITM
Sbjct: 84  KHEWASVSLFSSASNITSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLASILVCITM 143

Query: 122 LLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           LLL+CMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 144 LLLICMKVTKQVRLARKKKRRMLLPLSM 170

BLAST of Tan0012892.1 vs. ExPASy TrEMBL
Match: A0A1S3CMK7 (uncharacterized protein LOC103502637 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103502637 PE=4 SV=1)

HSP 1 Score: 233.8 bits (595), Expect = 4.7e-58
Identity = 137/151 (90.73%), Positives = 143/151 (94.70%), Query Frame = 0

Query: 1   MLSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRG--AAMITRSNLVEQLREYQ 60
           MLSLNSISQFL++FLCRC  +SSILH   SSLEAKKRRSRG  +AMITRSNLVEQLREYQ
Sbjct: 1   MLSLNSISQFLVRFLCRCF-RSSILHLPSSSLEAKKRRSRGSASAMITRSNLVEQLREYQ 60

Query: 61  IRSKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILAC 120
           IRSKHEWASVSFFSSTSNI+SSRVDVVIFVIWELIIL+FLVFSAVSLYFRHMQLAFIL C
Sbjct: 61  IRSKHEWASVSFFSSTSNITSSRVDVVIFVIWELIILSFLVFSAVSLYFRHMQLAFILVC 120

Query: 121 ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 121 ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150

BLAST of Tan0012892.1 vs. ExPASy TrEMBL
Match: A0A0A0KIH9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G358690 PE=4 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 1.4e-57
Identity = 137/151 (90.73%), Positives = 141/151 (93.38%), Query Frame = 0

Query: 1   MLSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRG--AAMITRSNLVEQLREYQ 60
           MLSLNSISQFL+ FLCRC   SSILH   SSLEAKKRRSRG  +AMITRSNLVEQLREYQ
Sbjct: 1   MLSLNSISQFLVSFLCRCF-HSSILHLPSSSLEAKKRRSRGSASAMITRSNLVEQLREYQ 60

Query: 61  IRSKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILAC 120
           IRSKHEWASVSFFSSTSNI+SSRVDVVIFVIWELIIL+FLVFSAVSLYFRHMQLAFIL C
Sbjct: 61  IRSKHEWASVSFFSSTSNITSSRVDVVIFVIWELIILSFLVFSAVSLYFRHMQLAFILVC 120

Query: 121 ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 121 ITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150

BLAST of Tan0012892.1 vs. ExPASy TrEMBL
Match: A0A6J1GHS6 (uncharacterized protein LOC111454282 OS=Cucurbita moschata OX=3662 GN=LOC111454282 PE=4 SV=1)

HSP 1 Score: 222.6 bits (566), Expect = 1.1e-54
Identity = 130/148 (87.84%), Positives = 135/148 (91.22%), Query Frame = 0

Query: 2   LSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRGAAMITRSNLVEQLREYQIRS 61
           LSLNSISQFLL F+  C  KSS LH RP+S EAK+ RSRGAAMITRSNLVEQLREYQIRS
Sbjct: 20  LSLNSISQFLLHFVSHCF-KSSDLHRRPTSPEAKETRSRGAAMITRSNLVEQLREYQIRS 79

Query: 62  KHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILACITM 121
           KHEWASVS FSS SNI+SSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLA IL CITM
Sbjct: 80  KHEWASVSLFSSASNITSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLASILVCITM 139

Query: 122 LLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           LLL+CMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 140 LLLICMKVTKQVRLARKKKRRMLLPLSM 166

BLAST of Tan0012892.1 vs. ExPASy TrEMBL
Match: A0A6J1G568 (uncharacterized protein LOC111450786 OS=Cucurbita moschata OX=3662 GN=LOC111450786 PE=4 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 6.5e-52
Identity = 123/149 (82.55%), Positives = 133/149 (89.26%), Query Frame = 0

Query: 1   MLSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRGAAMITRSNLVEQLREYQIR 60
           MLSLNSISQFL+ F+C C  K  ILH RPS    +K  SRGAAMITRSNLVEQLREYQIR
Sbjct: 1   MLSLNSISQFLVPFICGCV-KFPILHRRPS----EKPTSRGAAMITRSNLVEQLREYQIR 60

Query: 61  SKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILACIT 120
           SKH+WAS SFFSSTSNI+SSRVDVVIFVIWELIIL+FLVFSAVSLYFRHMQL F+L CIT
Sbjct: 61  SKHDWASASFFSSTSNITSSRVDVVIFVIWELIILSFLVFSAVSLYFRHMQLTFVLVCIT 120

Query: 121 MLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           MLL++CMKVTKQ+RLARKKKRRMLLPLSM
Sbjct: 121 MLLIVCMKVTKQMRLARKKKRRMLLPLSM 144

BLAST of Tan0012892.1 vs. ExPASy TrEMBL
Match: A0A6J1KBR2 (uncharacterized protein LOC111494024 OS=Cucurbita maxima OX=3661 GN=LOC111494024 PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 2.5e-51
Identity = 122/149 (81.88%), Positives = 132/149 (88.59%), Query Frame = 0

Query: 1   MLSLNSISQFLLQFLCRCCDKSSILHHRPSSLEAKKRRSRGAAMITRSNLVEQLREYQIR 60
           MLSLNSISQFL+ F+C C  K  I H RPS    +K  SRGAAMITRSNLVEQLREYQIR
Sbjct: 1   MLSLNSISQFLVPFICGCV-KFPIFHRRPS----EKPTSRGAAMITRSNLVEQLREYQIR 60

Query: 61  SKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAVSLYFRHMQLAFILACIT 120
           SKH+WAS SFFSSTSNI+SSRVDVVIFVIWELIIL+FLVFSAVSLYFRHMQL F+L CIT
Sbjct: 61  SKHDWASASFFSSTSNITSSRVDVVIFVIWELIILSFLVFSAVSLYFRHMQLTFVLICIT 120

Query: 121 MLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           MLL++CMKVTKQ+RLARKKKRRMLLPLSM
Sbjct: 121 MLLIVCMKVTKQMRLARKKKRRMLLPLSM 144

BLAST of Tan0012892.1 vs. TAIR 10
Match: AT1G20460.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G76185.1); Has 37 Blast hits to 37 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 171.4 bits (433), Expect = 5.5e-43
Identity = 93/106 (87.74%), Positives = 101/106 (95.28%), Query Frame = 0

Query: 44  MITRSNLVEQLREYQIRSKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAV 103
           MITRSNL EQLREYQIRSKH+WASVSFFSSTSN SSSRVDVV+FVIWEL+ILAF VFSAV
Sbjct: 1   MITRSNLAEQLREYQIRSKHDWASVSFFSSTSNFSSSRVDVVVFVIWELVILAFFVFSAV 60

Query: 104 SLYFRHMQLAFILACITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           SLYF+ +QLAFIL C+T+LLL+CMKVTKQVRLARKKKRRMLLPLSM
Sbjct: 61  SLYFKRLQLAFILVCVTLLLLICMKVTKQVRLARKKKRRMLLPLSM 106

BLAST of Tan0012892.1 vs. TAIR 10
Match: AT1G76185.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G20460.1); Has 37 Blast hits to 37 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 166.8 bits (421), Expect = 1.3e-41
Identity = 91/106 (85.85%), Positives = 100/106 (94.34%), Query Frame = 0

Query: 44  MITRSNLVEQLREYQIRSKHEWASVSFFSSTSNISSSRVDVVIFVIWELIILAFLVFSAV 103
           MITRSNL EQLREYQIRSKH+WASVSFFSSTSN SSSRVDVV+FVIWEL++LA +VFSAV
Sbjct: 1   MITRSNLAEQLREYQIRSKHDWASVSFFSSTSNFSSSRVDVVVFVIWELVMLALVVFSAV 60

Query: 104 SLYFRHMQLAFILACITMLLLLCMKVTKQVRLARKKKRRMLLPLSM 150
           SLYFR +QLAFIL C+T+LLLLCMK+TKQVR ARKKKRRMLLPLSM
Sbjct: 61  SLYFRRLQLAFILLCVTLLLLLCMKITKQVRHARKKKRRMLLPLSM 106

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity  Description
XP_038884232.1  7.1e-61  92.62     uncharacterized protein LOC120075129 [Benincasa hispida]
XP_008464874.1  9.6e-58  90.73     PREDICTED: uncharacterized protein LOC103502637 isoform X1 [Cucumis melo]
XP_004144225.1  2.8e-57  90.73     uncharacterized protein LOC101213289 [Cucumis sativus] >KGN47556.1 hypothetical ...
XP_022951473.1  2.2e-54  87.84     uncharacterized protein LOC111454282 [Cucurbita moschata]
KAG7020712.1    1.1e-53  87.16     hypothetical protein SDJN02_17399 [Cucurbita argyrosperma subsp. argyrosperma]

vs. ExPASy TrEMBL
Match Name      E-value  Identity  Description
A0A1S3CMK7      4.7e-58  90.73     uncharacterized protein LOC103502637 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0KIH9      1.4e-57  90.73     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G358690 PE=4 SV=1
A0A6J1GHS6      1.1e-54  87.84     uncharacterized protein LOC111454282 OS=Cucurbita moschata OX=3662 GN=LOC1114542...
A0A6J1G568      6.5e-52  82.55     uncharacterized protein LOC111450786 OS=Cucurbita moschata OX=3662 GN=LOC1114507...
A0A6J1KBR2      2.5e-51  81.88     uncharacterized protein LOC111494024 OS=Cucurbita maxima OX=3661 GN=LOC111494024...

vs. TAIR 10
Match Name      E-value  Identity  Description
AT1G20460.1     5.5e-43  87.74     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G76185.1     1.3e-41  85.85     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term     Source Description   Alignment
None      No IPR available  PANTHER  PTHR34936:SF10  SUBFAMILY NOT NAMED  coord: 44..149
None      No IPR available  PANTHER  PTHR34936       EXPRESSED PROTEIN    coord: 44..149

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Tan0012892    Tan0012892   gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                 Unique Name                                         Type
Tan0012892.1-five_prime_utr  Tan0012892.1-five_prime_utr-LG02:67263623..67263722  five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name       Unique Name                                Type
Tan0012892.1-exon  Tan0012892.1-exon-LG02:67263623..67263964  exon
Tan0012892.1-exon  Tan0012892.1-exon-LG02:67265634..67266118  exon
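The exon coordinates embedded in the unique names can be used to cross-check the sequence length reported in the overview. The following is a minimal sketch, assuming 1-based inclusive coordinates (so a span's length is end - start + 1); `parse_span` and `span_length` are hypothetical helper names.

```python
import re

# Trailing "<sequence id>:<start>..<end>" portion of a unique name.
COORD_PATTERN = re.compile(r"(\w+):(\d+)\.\.(\d+)$")


def parse_span(unique_name: str) -> tuple:
    """Extract (sequence id, start, end) from a feature's unique name."""
    match = COORD_PATTERN.search(unique_name)
    if match is None:
        raise ValueError(f"no coordinates found in {unique_name!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


exon_names = [
    "Tan0012892.1-exon-LG02:67263623..67263964",
    "Tan0012892.1-exon-LG02:67265634..67266118",
]
spans = [parse_span(name) for name in exon_names]
exon_lengths = [span_length(start, end) for _, start, end in spans]
mrna_length = sum(exon_lengths)
# The intron lies strictly between the end of exon 1 and the start of exon 2.
intron_length = spans[1][1] - spans[0][2] - 1

print(exon_lengths)   # [342, 485]
print(mrna_length)    # 827
print(intron_length)  # 1669
```

The exon total of 827 agrees with the sequence length in the overview, and exons plus intron add up to the 2496 bases spanned by the gene location.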


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name                               Type
Tan0012892.1-cds  Tan0012892.1-cds-LG02:67263723..67263964  CDS
Tan0012892.1-cds  Tan0012892.1-cds-LG02:67265634..67265841  CDS
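Because the coding sequence is split across two segments, the second segment does not start on a codon boundary. The sketch below derives the segment lengths, the codon count, and a GFF3-style phase for each segment (the number of bases to skip to reach the next full codon). It assumes 1-based inclusive coordinates and plus-strand ordering, as given in the location line; `cds_phases` is an illustrative name.

```python
# CDS segments on LG02 (plus strand), taken from the table above.
cds_segments = [(67263723, 67263964), (67265634, 67265841)]


def cds_phases(segments):
    """GFF3-style phase of each segment, in transcript order."""
    phases = []
    consumed = 0  # coding bases seen before the current segment
    for start, end in segments:
        phases.append((3 - consumed % 3) % 3)
        consumed += end - start + 1
    return phases


segment_lengths = [end - start + 1 for start, end in cds_segments]
cds_length = sum(segment_lengths)
codon_count = cds_length // 3

print(segment_lengths)           # [242, 208]
print(cds_length, codon_count)   # 450 150
print(cds_phases(cds_segments))  # [0, 1]
```

The 150 codons correspond to 149 amino acids plus the stop codon, which matches the 44..149 extent of the PANTHER alignment on the protein.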


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                  Unique Name                                           Type
Tan0012892.1-three_prime_utr  Tan0012892.1-three_prime_utr-LG02:67265842..67266118  three_prime_UTR


The following polypeptide feature(s) derive from this mRNA:

Feature Name  Unique Name           Type
Tan0012892.1  Tan0012892.1-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane