Tan0014384 (gene) Snake gourd v1

Overview
NameTan0014384
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUPF0565 protein C2orf69 homolog
LocationLG10: 985998 .. 988195 (-)
RNA-Seq ExpressionTan0014384
SyntenyTan0014384
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGAATTGAAAGATGGAGCAAAACAAAACAAGCAAAGGCCGACTTCTTTTAAACTCCTTGGACATTTGAAAGAAATTCTATCAGATTCTACGGTATCTGCTCACTCTCTGACTTGAGATTGATTTTCTGATGATCTTTTTAGATTCCCCATCTGTTCTTTTACTCGTAATTCTTCGTTTCTTTCGTATGCCAATTGAAGTACTTAGCTGCAGGAATTTCACGTGCGACGATATTAATTCTATGCATTTGGAAACGAGAGATTTTGACATGCTTGTAGCAATTTTCCCATGGAGTCTAATGGTAATCGAATGCTTATTGACGAAGAATTTGCTTCTTTGTTTTTCTTTTGCGAAGTTTCCGATGCTGTGGAGTTAAATACACAAAAGCTACATTTTAATCCACACTAGCCGATTCAATTCAGACAAGTTCTTGATGATGGAGCGTTGGAGTGGAGTTTTGAAGGTTCCATTGCATCTCAACAGCCACAAGTTTTATCGAGTTGGAGCATCACTCTGCCTTTCCCCAACTTCCAAAACCTTGTCTGTAAGTTCTCGATGCTCGAAATGCAACTGTATTGTGGAATTTCATGCCTAGCATCACTATAAAAAAGATGGAATTCTTCCGGATATTGCCTTTATGCAGGAGAAATTATTTGAGGAAACCGCCATAACATGACTGATAGCTCTACACCATTGAATGTACGAAAGAGAAATTACAAAAGTTTTTCAAATATGTCTAGTGATTATTAGTGGATCTGTACCTAATGATTATTAGAGACCAACCAACCACATTGTAACGACACTTTCATCTTTCTCACTGATCTAAAATGGTTTTCAATGAAGAGAGATTGAAGAGATTCGAACTTCTAACTTCTTGGTCGAGTGTATATGCCCTTAACCAGTTGTCGAGCTGGGAATGTCTTATAAATATTTTCATGGTAACAGGTTCCTTCCGCAAATGCCATCTTTTTCAATGGGGATCGGGTTGAAGGAACAGGCAATCCAGTGATTGAGAGATTGTCTGATTTGCAGAACATAGCAGAAATTTTAGTTTCAAAATTTGGAGACTCCACTAATACATGGGCTATCGAGACTTCTGATTTCAATGGAGCTTTTGCTATATATAAGGATTTTATCCCTTCTCTGAATCGATGGGGAGAACCAAAATCATATACTCCAAATGGGTTTCCTGCTTCGATGTCAACCGTTTCACTTTTGGGAAGTTGCTACACTGAGGTATGAATATGATCAAACAGCAAATGCAATCAATATCCATTCATAATCATATGATCCAAATGGGTTTTCTGCTTCTGATTTTGATATGAACTAATGCACTCAGTTGTGGTTTTGGCAGGTAAAGAAGATAATTTCTAGGGGAAAACAAGAATCACAGCCAACCACCATATCCACGCTGCATTGCAGTCCACCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGAACTGTAGTTAACCAGCTAGTTGCTGAACTTGGCTCGAAAGACTTTATAGCTGCTGAGAATCTGCCTCATTCTAATCAAGCATCAGGTGTTGAATGTTCTAACCTCAATGAGGTTCAGTTCACATCTACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTCGATGTTGGATTGAACAGCCATGGTGCATATCTTACAGACCCTGAGGTGATTAAAAGAATCTCTAGTAGCCTTGTTCGGGAGTCGAGACGAGTCCATTTCGTTCTTCATGGTACTCCAAGACAATGGTGTGACAACAGAAGAGTTTGGATACGAGATGAGAAGGAAAAAATGTTGAGTTTTCTTGAATCTGAAGCACGGGAAAGTGGAGGAAAGTTGCAGGTTTTTGAGAAATTTTATTTTTCTGATAGGCCTGCAGACTTGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGAGTTCTTTTGTCAGGTCGCTGGAAAATCAAGAGTTCTGAAGTCAACCAATCAGGTAGTTTAACAAGATTTTGATGAGAGAGGAAAGTCAGTAACCTTAGAATTCGAGGATAAAGTTATTGTACAACCTATAGAACGAATTTTGTTCAAGTAGGGTCGTGAGAGAAGTCAGTAGCCGTATTATATCCCTTGTTAAAATGGTGAATTGTTGTCCTTGTTAGAAAGAAGAATTGTTGCGAGAACATTTCTTCGTACCATCGATG

mRNA sequence

TGGAATTGAAAGATGGAGCAAAACAAAACAAGCAAAGGCCGACTTCTTTTAAACTCCTTGGACATTTGAAAGAAATTCTATCAGATTCTACGTTTCCGATGCTGTGGAGTTAAATACACAAAAGCTACATTTTAATCCACACTAGCCGATTCAATTCAGACAAGTTCTTGATGATGGAGCGTTGGAGTGGAGTTTTGAAGGTTCCATTGCATCTCAACAGCCACAAGTTTTATCGAGTTGGAGCATCACTCTGCCTTTCCCCAACTTCCAAAACCTTGTCTGTTCCTTCCGCAAATGCCATCTTTTTCAATGGGGATCGGGTTGAAGGAACAGGCAATCCAGTGATTGAGAGATTGTCTGATTTGCAGAACATAGCAGAAATTTTAGTTTCAAAATTTGGAGACTCCACTAATACATGGGCTATCGAGACTTCTGATTTCAATGGAGCTTTTGCTATATATAAGGATTTTATCCCTTCTCTGAATCGATGGGGAGAACCAAAATCATATACTCCAAATGGGTTTCCTGCTTCGATGTCAACCGTTTCACTTTTGGGAAGTTGCTACACTGAGGTAAAGAAGATAATTTCTAGGGGAAAACAAGAATCACAGCCAACCACCATATCCACGCTGCATTGCAGTCCACCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGAACTGTAGTTAACCAGCTAGTTGCTGAACTTGGCTCGAAAGACTTTATAGCTGCTGAGAATCTGCCTCATTCTAATCAAGCATCAGGTGTTGAATGTTCTAACCTCAATGAGGTTCAGTTCACATCTACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTCGATGTTGGATTGAACAGCCATGGTGCATATCTTACAGACCCTGAGGTGATTAAAAGAATCTCTAGTAGCCTTGTTCGGGAGTCGAGACGAGTCCATTTCGTTCTTCATGGTACTCCAAGACAATGGTGTGACAACAGAAGAGTTTGGATACGAGATGAGAAGGAAAAAATGTTGAGTTTTCTTGAATCTGAAGCACGGGAAAGTGGAGGAAAGTTGCAGGTTTTTGAGAAATTTTATTTTTCTGATAGGCCTGCAGACTTGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGAGTTCTTTTGTCAGGTCGCTGGAAAATCAAGAGTTCTGAAGTCAACCAATCAGGTAGTTTAACAAGATTTTGATGAGAGAGGAAAGTCAGTAACCTTAGAATTCGAGGATAAAGTTATTGTACAACCTATAGAACGAATTTTGTTCAAGTAGGGTCGTGAGAGAAGTCAGTAGCCGTATTATATCCCTTGTTAAAATGGTGAATTGTTGTCCTTGTTAGAAAGAAGAATTGTTGCGAGAACATTTCTTCGTACCATCGATG

Coding sequence (CDS)

ATGATGGAGCGTTGGAGTGGAGTTTTGAAGGTTCCATTGCATCTCAACAGCCACAAGTTTTATCGAGTTGGAGCATCACTCTGCCTTTCCCCAACTTCCAAAACCTTGTCTGTTCCTTCCGCAAATGCCATCTTTTTCAATGGGGATCGGGTTGAAGGAACAGGCAATCCAGTGATTGAGAGATTGTCTGATTTGCAGAACATAGCAGAAATTTTAGTTTCAAAATTTGGAGACTCCACTAATACATGGGCTATCGAGACTTCTGATTTCAATGGAGCTTTTGCTATATATAAGGATTTTATCCCTTCTCTGAATCGATGGGGAGAACCAAAATCATATACTCCAAATGGGTTTCCTGCTTCGATGTCAACCGTTTCACTTTTGGGAAGTTGCTACACTGAGGTAAAGAAGATAATTTCTAGGGGAAAACAAGAATCACAGCCAACCACCATATCCACGCTGCATTGCAGTCCACCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGAACTGTAGTTAACCAGCTAGTTGCTGAACTTGGCTCGAAAGACTTTATAGCTGCTGAGAATCTGCCTCATTCTAATCAAGCATCAGGTGTTGAATGTTCTAACCTCAATGAGGTTCAGTTCACATCTACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTCGATGTTGGATTGAACAGCCATGGTGCATATCTTACAGACCCTGAGGTGATTAAAAGAATCTCTAGTAGCCTTGTTCGGGAGTCGAGACGAGTCCATTTCGTTCTTCATGGTACTCCAAGACAATGGTGTGACAACAGAAGAGTTTGGATACGAGATGAGAAGGAAAAAATGTTGAGTTTTCTTGAATCTGAAGCACGGGAAAGTGGAGGAAAGTTGCAGGTTTTTGAGAAATTTTATTTTTCTGATAGGCCTGCAGACTTGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGA

Protein sequence

MMERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIERLSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPASMSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELGSKDFIAAENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLTDPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKLQVFEKFYFSDRPADLQMHFEIIEKLDVC
Homology
BLAST of Tan0014384 vs. NCBI nr
Match: XP_011650653.1 (UPF0565 protein C2orf69 homolog [Cucumis sativus] >KGN56378.1 hypothetical protein Csa_011056 [Cucumis sativus])

HSP 1 Score: 558.5 bits (1438), Expect = 3.8e-155
Identity = 273/328 (83.23%), Postives = 293/328 (89.33%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           M+RW+G+LKVPL+ NS KFYRV  SLCLSPTSKTL+VP  NAIFFNGDRVEGTGNPVIER
Sbjct: 1   MDRWNGILKVPLNSNSRKFYRVAVSLCLSPTSKTLTVPRGNAIFFNGDRVEGTGNPVIER 60

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LS+LQNIAEILVSKFGDSTN W +E SDFNGAFAIY+DFIPSLNRWGEPKSYTPNGFPAS
Sbjct: 61  LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +STVSLLGSCY EVKKI+SRGK  SQ T ISTL C  P+TIILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPRSQETAISTLSCCTPETIILGFSKGGTVVNQLVTELG 180

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD +AA ENLP S Q SGVECS L+E+QF  TT QSFLKSITEIHYVDVGLNSHGAYLT
Sbjct: 181 SKDLMAADENLPLSKQESGVECSKLDEIQFVPTTGQSFLKSITEIHYVDVGLNSHGAYLT 240

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPEVIKRISSSL++ESR + FVLHGTPRQWCD RRVWIRDEKEKM SFLESEA  SGG L
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKEKMRSFLESEALRSGGNL 300

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           +V EKFYF+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 KVNEKFYFADRPADMQMHFEIIEKLDVC 328

BLAST of Tan0014384 vs. NCBI nr
Match: KAG7020705.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 556.2 bits (1432), Expect = 1.9e-154
Identity = 274/328 (83.54%), Postives = 293/328 (89.33%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           MERW G+LKVPLH  SHKFYRV ASLCLSP+SKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 655 MERWCGILKVPLHPQSHKFYRVAASLCLSPSSKTLTMPHANAILFNGDRVEGTGNPVIER 714

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DFIPSLNRWGEPKSYT NGFPAS
Sbjct: 715 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 774

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +ST+SLLGSCY+EVKKIISR  Q S   TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 775 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 834

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD IAA EN PHS QA GVECS L+EVQF   TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 835 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 894

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPEV+KRIS+SLV+ESR + FVLHGTPRQWCD+RRVWIRDEKEKMLS LESEAR SGG+L
Sbjct: 895 DPEVMKRISNSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKEKMLSLLESEARRSGGRL 954

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 955 QVCERFHFADRPADMQMHFEIIEKLDVC 982

BLAST of Tan0014384 vs. NCBI nr
Match: KAG6582701.1 (hypothetical protein SDJN03_22703, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 556.2 bits (1432), Expect = 1.9e-154
Identity = 274/328 (83.54%), Postives = 293/328 (89.33%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           MERW G+LKVPLH  SHKFYRV ASLCLSP+SKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 46  MERWCGILKVPLHPQSHKFYRVAASLCLSPSSKTLTMPHANAILFNGDRVEGTGNPVIER 105

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DFIPSLNRWGEPKSYT NGFPAS
Sbjct: 106 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 165

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +ST+SLLGSCY+EVKKIISR  Q S   TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 166 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 225

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD IAA EN PHS QA GVECS L+EVQF   TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 226 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 285

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPEV+KRIS+SLV+ESR + FVLHGTPRQWCD+RRVWIRDEKEKMLS LESEAR SGG+L
Sbjct: 286 DPEVMKRISNSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKEKMLSLLESEARRSGGRL 345

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 346 QVCERFHFADRPADMQMHFEIIEKLDVC 373

BLAST of Tan0014384 vs. NCBI nr
Match: XP_023527404.1 (uncharacterized protein LOC111790644 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 553.9 bits (1426), Expect = 9.3e-154
Identity = 274/328 (83.54%), Postives = 291/328 (88.72%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           MERW G+LKVPLH  SHKFYRV ASLCLSPTSKTL+ P ANAI FNGDRVEGTGNPVIER
Sbjct: 46  MERWCGILKVPLHPQSHKFYRVAASLCLSPTSKTLTTPHANAILFNGDRVEGTGNPVIER 105

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LSD QNIA+ILVSKFGDSTN W IE SDFNG FAIY DFIPSLNRWGEPKSYT NGFPAS
Sbjct: 106 LSDPQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 165

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +ST+SLLGSCY+EVKKIISR  Q S   TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 166 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 225

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD IAA EN PHS QA GVECS L+EVQF   TEQSFLKSITEIHYVDVGLNS GAY T
Sbjct: 226 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSQGAYFT 285

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPEVIKRIS+SLV+ESR + FVLHGTPRQWCD+RRVWIRDEK+KMLSFLESEAR SGG+L
Sbjct: 286 DPEVIKRISNSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKDKMLSFLESEARRSGGRL 345

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 346 QVCERFHFADRPADMQMHFEIIEKLDVC 373

BLAST of Tan0014384 vs. NCBI nr
Match: XP_022924396.1 (uncharacterized protein LOC111431905 [Cucurbita moschata])

HSP 1 Score: 551.6 bits (1420), Expect = 4.6e-153
Identity = 272/328 (82.93%), Postives = 291/328 (88.72%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           MERW G+LKVPLH  SHKFYRV ASLCLSPTSKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 1   MERWCGILKVPLHPQSHKFYRVAASLCLSPTSKTLTMPHANAILFNGDRVEGTGNPVIER 60

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DF+ SLNRWGEPKSYT NGFPAS
Sbjct: 61  LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFMHSLNRWGEPKSYTANGFPAS 120

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +ST+SLLGSCY+EVKKIISR  Q S   TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 121 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 180

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD IAA EN PHS QA GVECS L+EVQF   TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 181 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 240

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPEVIKRIS+SLV+ESR + F+LHGTPRQWCD+RRVWIRDEKEKM S LESEAR SGG+L
Sbjct: 241 DPEVIKRISNSLVQESRGIRFILHGTPRQWCDSRRVWIRDEKEKMSSLLESEARRSGGRL 300

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 QVCERFHFADRPADMQMHFEIIEKLDVC 328

BLAST of Tan0014384 vs. ExPASy TrEMBL
Match: A0A0A0L3P6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G118130 PE=4 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 1.8e-155
Identity = 273/328 (83.23%), Postives = 293/328 (89.33%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           M+RW+G+LKVPL+ NS KFYRV  SLCLSPTSKTL+VP  NAIFFNGDRVEGTGNPVIER
Sbjct: 1   MDRWNGILKVPLNSNSRKFYRVAVSLCLSPTSKTLTVPRGNAIFFNGDRVEGTGNPVIER 60

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LS+LQNIAEILVSKFGDSTN W +E SDFNGAFAIY+DFIPSLNRWGEPKSYTPNGFPAS
Sbjct: 61  LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +STVSLLGSCY EVKKI+SRGK  SQ T ISTL C  P+TIILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPRSQETAISTLSCCTPETIILGFSKGGTVVNQLVTELG 180

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD +AA ENLP S Q SGVECS L+E+QF  TT QSFLKSITEIHYVDVGLNSHGAYLT
Sbjct: 181 SKDLMAADENLPLSKQESGVECSKLDEIQFVPTTGQSFLKSITEIHYVDVGLNSHGAYLT 240

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPEVIKRISSSL++ESR + FVLHGTPRQWCD RRVWIRDEKEKM SFLESEA  SGG L
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKEKMRSFLESEALRSGGNL 300

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           +V EKFYF+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 KVNEKFYFADRPADMQMHFEIIEKLDVC 328

BLAST of Tan0014384 vs. ExPASy TrEMBL
Match: A0A6J1E9C8 (uncharacterized protein LOC111431905 OS=Cucurbita moschata OX=3662 GN=LOC111431905 PE=4 SV=1)

HSP 1 Score: 551.6 bits (1420), Expect = 2.2e-153
Identity = 272/328 (82.93%), Postives = 291/328 (88.72%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           MERW G+LKVPLH  SHKFYRV ASLCLSPTSKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 1   MERWCGILKVPLHPQSHKFYRVAASLCLSPTSKTLTMPHANAILFNGDRVEGTGNPVIER 60

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DF+ SLNRWGEPKSYT NGFPAS
Sbjct: 61  LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFMHSLNRWGEPKSYTANGFPAS 120

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +ST+SLLGSCY+EVKKIISR  Q S   TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 121 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 180

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD IAA EN PHS QA GVECS L+EVQF   TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 181 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 240

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPEVIKRIS+SLV+ESR + F+LHGTPRQWCD+RRVWIRDEKEKM S LESEAR SGG+L
Sbjct: 241 DPEVIKRISNSLVQESRGIRFILHGTPRQWCDSRRVWIRDEKEKMSSLLESEARRSGGRL 300

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 QVCERFHFADRPADMQMHFEIIEKLDVC 328

BLAST of Tan0014384 vs. ExPASy TrEMBL
Match: A0A1S3AV17 (UPF0565 protein C2orf69 homolog OS=Cucumis melo OX=3656 GN=LOC103483140 PE=4 SV=1)

HSP 1 Score: 551.2 bits (1419), Expect = 2.9e-153
Identity = 271/328 (82.62%), Postives = 290/328 (88.41%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           M+RW+G+LKVPL  NS KFYRV  SLCLSPTSKTL+VP ANAIFFNGDRVEGTGNPVIE 
Sbjct: 1   MDRWNGILKVPLRSNSRKFYRVAVSLCLSPTSKTLTVPRANAIFFNGDRVEGTGNPVIEG 60

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LS+LQNIAEILVSKFGDSTN W +E SDFNGAFAIY+DFIPSLNRWGEPKSYTPNGFPAS
Sbjct: 61  LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +STVSLLGSCY EVKKI+SRGK  SQ TTI TL    P+T+ILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPGSQETTIPTLSGCTPETVILGFSKGGTVVNQLVTELG 180

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD IAA ENLP S Q SGVECS L+E QF  TTE SFLKSITEIHYVDVGLN+HGAYLT
Sbjct: 181 SKDLIAADENLPLSKQESGVECSKLDENQFIPTTEHSFLKSITEIHYVDVGLNTHGAYLT 240

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPEVIKRISSSL++ESR + FVLHGTPRQWCD RRVWIRDEKE M SFLESEA  SGG L
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKETMTSFLESEALRSGGNL 300

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           QV+EKFYF+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 QVYEKFYFADRPADMQMHFEIIEKLDVC 328

BLAST of Tan0014384 vs. ExPASy TrEMBL
Match: A0A6J1CWE4 (uncharacterized protein LOC111014908 OS=Momordica charantia OX=3673 GN=LOC111014908 PE=4 SV=1)

HSP 1 Score: 545.8 bits (1405), Expect = 1.2e-151
Identity = 267/329 (81.16%), Postives = 291/329 (88.45%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           M RWSG+LKVPLH  S KFYRVGAS+CLSP SKTL+VPSAN IFFNGDRVEGTGNPVIER
Sbjct: 1   MGRWSGILKVPLHPKSQKFYRVGASICLSPNSKTLTVPSANVIFFNGDRVEGTGNPVIER 60

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LSDLQ IAEILVSKFGDSTN W +E SDFNG FA+YKDFIP LNRWGEPKSY PNGFPAS
Sbjct: 61  LSDLQKIAEILVSKFGDSTNAWVVEASDFNGTFAVYKDFIPYLNRWGEPKSYIPNGFPAS 120

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +STVSLLGSCY EVKK+ISRGKQ SQ +TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 121 VSTVSLLGSCYNEVKKMISRGKQGSQSSTISTLCCSLPKTIILGFSKGGTVVNQLVAELG 180

Query: 182 SKDFIAAENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLTD 241
           S +F+A +NLPHS Q++GVECSNL  +QF  TTE+ FLKS+TEIHYVDVGLN+HGAYLTD
Sbjct: 181 SSEFMAVDNLPHSKQSAGVECSNLEGIQFIPTTERGFLKSMTEIHYVDVGLNTHGAYLTD 240

Query: 242 PEVIKRISSSLV--RESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGK 301
            EVIKRISSSLV  +ESR V F+LHGTPRQWCD+RRVWIR EKEKMLS LESEAR SGGK
Sbjct: 241 HEVIKRISSSLVQDQESRGVRFILHGTPRQWCDSRRVWIRHEKEKMLSLLESEARRSGGK 300

Query: 302 LQVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           LQV EKF F+D P ++QMHFEIIEKL+VC
Sbjct: 301 LQVCEKFCFADSPPNMQMHFEIIEKLEVC 329

BLAST of Tan0014384 vs. ExPASy TrEMBL
Match: A0A6J1IN96 (uncharacterized protein LOC111479029 OS=Cucurbita maxima OX=3661 GN=LOC111479029 PE=4 SV=1)

HSP 1 Score: 541.2 bits (1393), Expect = 3.0e-150
Identity = 268/328 (81.71%), Postives = 287/328 (87.50%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           MERW G+LKVPLH   H+FYRV ASLCLSPTSKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 61  MERWCGILKVPLHPQCHRFYRVAASLCLSPTSKTLTMPHANAILFNGDRVEGTGNPVIER 120

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DFIPSLNRWGEPKSYT NGFPAS
Sbjct: 121 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 180

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
           +ST+SLLGSCY+EVKKIISR  Q S   TISTL CS PKTIILGFSKGGTV NQLVAELG
Sbjct: 181 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVANQLVAELG 240

Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           SKD IAA EN PHS QA GVECS L+E QF   TEQSFLKSITEIHYVDVGLNS GAY T
Sbjct: 241 SKDLIAADENPPHSKQAPGVECSKLDEDQFIPNTEQSFLKSITEIHYVDVGLNSQGAYFT 300

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           DPE IKRIS+SLV+ESR + FVLHGTPRQW D+RRVWIRDEK+KMLS LESEAR SGG+L
Sbjct: 301 DPEAIKRISNSLVQESRGIRFVLHGTPRQWGDSRRVWIRDEKDKMLSLLESEARRSGGRL 360

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
           QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 361 QVCERFHFADRPADMQMHFEIIEKLDVC 388

BLAST of Tan0014384 vs. TAIR 10
Match: AT2G44850.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: M germinated pollen stage, LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0565 (InterPro:IPR018881); Has 106 Blast hits to 106 proteins in 50 species: Archae - 0; Bacteria - 0; Metazoa - 73; Fungi - 0; Plants - 31; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 368.2 bits (944), Expect = 6.7e-102
Identity = 189/327 (57.80%), Postives = 239/327 (73.09%), Query Frame = 0

Query: 2   MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
           MERWSGVLK+PL   +  +YRV ASLCLS +SKTL+VPSANAIFF+GD+V+ TGN VIER
Sbjct: 1   MERWSGVLKIPLDATTSNYYRVAASLCLS-SSKTLTVPSANAIFFHGDKVQDTGNHVIER 60

Query: 62  LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
           L DLQ +AEI+VSKFG+S N W +E S FNG FAIYKDF+PS+N  G PKSY+P GFPAS
Sbjct: 61  LYDLQKVAEIIVSKFGNSVNAWVVEASVFNGPFAIYKDFVPSVNHMGAPKSYSPVGFPAS 120

Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
            S VSLL SC  EV K    G        I+++H   PKTI+LGFSKGG V+NQL++E+ 
Sbjct: 121 SSIVSLLSSCLHEVLK---EGTDVCLIDQIASVH-HCPKTIVLGFSKGGVVMNQLMSEIS 180

Query: 182 SKDFIAAENLPHSNQASGVECSNLNE-VQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
           S D     N   ++ A   E ++ +E +Q    +++SFL SI+E+HY+DVGLNS GAY+T
Sbjct: 181 SLD----TNFAKTSSAMVEESTSQHEKIQIIPASKESFLNSISEVHYIDVGLNSSGAYIT 240

Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
           D  V++RIS  L R +  +  V+HGTPRQWCD  R WIR EK++++  L++E   SGGKL
Sbjct: 241 DHNVVQRISQRLARGADSLRIVIHGTPRQWCDELRGWIRKEKDELVRLLKAETENSGGKL 300

Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDV 328
           QV E+FYFSDR ADLQMHFEII+ +DV
Sbjct: 301 QVCERFYFSDRLADLQMHFEIIDAMDV 318

BLAST of Tan0014384 vs. TAIR 10
Match: AT2G44850.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0565 (InterPro:IPR018881); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G45380.1); Has 138 Blast hits to 138 proteins in 53 species: Archae - 0; Bacteria - 0; Metazoa - 73; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )

HSP 1 Score: 322.4 bits (825), Expect = 4.2e-88
Identity = 165/291 (56.70%), Postives = 210/291 (72.16%), Query Frame = 0

Query: 38  VPSANAIFFNGDRVEGTGNPVIERLSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIY 97
           VPSANAIFF+GD+V+ TGN VIERL DLQ +AEI+VSKFG+S N W +E S FNG FAIY
Sbjct: 117 VPSANAIFFHGDKVQDTGNHVIERLYDLQKVAEIIVSKFGNSVNAWVVEASVFNGPFAIY 176

Query: 98  KDFIPSLNRWGEPKSYTPNGFPASMSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCS 157
           KDF+PS+N  G PKSY+P GFPAS S VSLL SC  EV K    G        I+++H  
Sbjct: 177 KDFVPSVNHMGAPKSYSPVGFPASSSIVSLLSSCLHEVLK---EGTDVCLIDQIASVH-H 236

Query: 158 PPKTIILGFSKGGTVVNQLVAELGSKDFIAAENLPHSNQASGVECSNLNE-VQFTSTTEQ 217
            PKTI+LGFSKGG V+NQL++E+ S D     N   ++ A   E ++ +E +Q    +++
Sbjct: 237 CPKTIVLGFSKGGVVMNQLMSEISSLD----TNFAKTSSAMVEESTSQHEKIQIIPASKE 296

Query: 218 SFLKSITEIHYVDVGLNSHGAYLTDPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRV 277
           SFL SI+E+HY+DVGLNS GAY+TD  V++RIS  L R +  +  V+HGTPRQWCD  R 
Sbjct: 297 SFLNSISEVHYIDVGLNSSGAYITDHNVVQRISQRLARGADSLRIVIHGTPRQWCDELRG 356

Query: 278 WIRDEKEKMLSFLESEARESGGKLQVFEKFYFSDRPADLQMHFEIIEKLDV 328
           WIR EK++++  L++E   SGGKLQV E+FYFSDR ADLQMHFEII+ +DV
Sbjct: 357 WIRKEKDELVRLLKAETENSGGKLQVCERFYFSDRLADLQMHFEIIDAMDV 399

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_011650653.13.8e-15583.23UPF0565 protein C2orf69 homolog [Cucumis sativus] >KGN56378.1 hypothetical prote... [more]
KAG7020705.11.9e-15483.54Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
KAG6582701.11.9e-15483.54hypothetical protein SDJN03_22703, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023527404.19.3e-15483.54uncharacterized protein LOC111790644 [Cucurbita pepo subsp. pepo][more]
XP_022924396.14.6e-15382.93uncharacterized protein LOC111431905 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A0A0L3P61.8e-15583.23Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G118130 PE=4 SV=1[more]
A0A6J1E9C82.2e-15382.93uncharacterized protein LOC111431905 OS=Cucurbita moschata OX=3662 GN=LOC1114319... [more]
A0A1S3AV172.9e-15382.62UPF0565 protein C2orf69 homolog OS=Cucumis melo OX=3656 GN=LOC103483140 PE=4 SV=... [more]
A0A6J1CWE41.2e-15181.16uncharacterized protein LOC111014908 OS=Momordica charantia OX=3673 GN=LOC111014... [more]
A0A6J1IN963.0e-15081.71uncharacterized protein LOC111479029 OS=Cucurbita maxima OX=3661 GN=LOC111479029... [more]
Match NameE-valueIdentityDescription
AT2G44850.26.7e-10257.80unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G44850.14.2e-8856.70unknown protein; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR018881C2orf69PFAMPF10561UPF0565coord: 149..290
e-value: 5.9E-11
score: 42.1
IPR018881C2orf69PANTHERPTHR31296UPF0565 PROTEIN C2ORF69coord: 2..326

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0014384.1Tan0014384.1mRNA