Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGAATTGAAAGATGGAGCAAAACAAAACAAGCAAAGGCCGACTTCTTTTAAACTCCTTGGACATTTGAAAGAAATTCTATCAGATTCTACGGTATCTGCTCACTCTCTGACTTGAGATTGATTTTCTGATGATCTTTTTAGATTCCCCATCTGTTCTTTTACTCGTAATTCTTCGTTTCTTTCGTATGCCAATTGAAGTACTTAGCTGCAGGAATTTCACGTGCGACGATATTAATTCTATGCATTTGGAAACGAGAGATTTTGACATGCTTGTAGCAATTTTCCCATGGAGTCTAATGGTAATCGAATGCTTATTGACGAAGAATTTGCTTCTTTGTTTTTCTTTTGCGAAGTTTCCGATGCTGTGGAGTTAAATACACAAAAGCTACATTTTAATCCACACTAGCCGATTCAATTCAGACAAGTTCTTGATGATGGAGCGTTGGAGTGGAGTTTTGAAGGTTCCATTGCATCTCAACAGCCACAAGTTTTATCGAGTTGGAGCATCACTCTGCCTTTCCCCAACTTCCAAAACCTTGTCTGTAAGTTCTCGATGCTCGAAATGCAACTGTATTGTGGAATTTCATGCCTAGCATCACTATAAAAAAGATGGAATTCTTCCGGATATTGCCTTTATGCAGGAGAAATTATTTGAGGAAACCGCCATAACATGACTGATAGCTCTACACCATTGAATGTACGAAAGAGAAATTACAAAAGTTTTTCAAATATGTCTAGTGATTATTAGTGGATCTGTACCTAATGATTATTAGAGACCAACCAACCACATTGTAACGACACTTTCATCTTTCTCACTGATCTAAAATGGTTTTCAATGAAGAGAGATTGAAGAGATTCGAACTTCTAACTTCTTGGTCGAGTGTATATGCCCTTAACCAGTTGTCGAGCTGGGAATGTCTTATAAATATTTTCATGGTAACAGGTTCCTTCCGCAAATGCCATCTTTTTCAATGGGGATCGGGTTGAAGGAACAGGCAATCCAGTGATTGAGAGATTGTCTGATTTGCAGAACATAGCAGAAATTTTAGTTTCAAAATTTGGAGACTCCACTAATACATGGGCTATCGAGACTTCTGATTTCAATGGAGCTTTTGCTATATATAAGGATTTTATCCCTTCTCTGAATCGATGGGGAGAACCAAAATCATATACTCCAAATGGGTTTCCTGCTTCGATGTCAACCGTTTCACTTTTGGGAAGTTGCTACACTGAGGTATGAATATGATCAAACAGCAAATGCAATCAATATCCATTCATAATCATATGATCCAAATGGGTTTTCTGCTTCTGATTTTGATATGAACTAATGCACTCAGTTGTGGTTTTGGCAGGTAAAGAAGATAATTTCTAGGGGAAAACAAGAATCACAGCCAACCACCATATCCACGCTGCATTGCAGTCCACCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGAACTGTAGTTAACCAGCTAGTTGCTGAACTTGGCTCGAAAGACTTTATAGCTGCTGAGAATCTGCCTCATTCTAATCAAGCATCAGGTGTTGAATGTTCTAACCTCAATGAGGTTCAGTTCACATCTACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTCGATGTTGGATTGAACAGCCATGGTGCATATCTTACAGACCCTGAGGTGATTAAAAGAATCTCTAGTAGCCTTGTTCGGGAGTCGAGACGAGTCCATTTCGTTCTTCATGGTACTCCAAGACAATGGTGTGACAACAGAAGAGTTTGGATACGAGATGAGAAGGAAAAAATGTTGAGTTTTCTTGAATCTGAAGCACGGGAAAGTGGAGGAAAGTTGCAGGTTTTTGAGAAATTTTATTTTTCTGATAGGCCTGCAGACTTGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGAGTTCTTTTGTCAGGTCGCTGGAAAATCAAGAGTTCTGAAGTCAACCAATCAGGTAGTTTAACAAGATTTTGATGAGAGAGGAAAGTCAGTAACCTTAGAATTCGAGGATAAAGTTATTGTACAACCTATAGAACGAATTTTGTTCAAGTAGGGTCGTGAGAGAAGTCAGTAGCCGTATTATATCCCTTGTTAAAATGGTGAATTGTTGTCCTTGTTAGAAAGAAGAATTGTTGCGAGAACATTTCTTCGTACCATCGATG
mRNA sequence
TGGAATTGAAAGATGGAGCAAAACAAAACAAGCAAAGGCCGACTTCTTTTAAACTCCTTGGACATTTGAAAGAAATTCTATCAGATTCTACGTTTCCGATGCTGTGGAGTTAAATACACAAAAGCTACATTTTAATCCACACTAGCCGATTCAATTCAGACAAGTTCTTGATGATGGAGCGTTGGAGTGGAGTTTTGAAGGTTCCATTGCATCTCAACAGCCACAAGTTTTATCGAGTTGGAGCATCACTCTGCCTTTCCCCAACTTCCAAAACCTTGTCTGTTCCTTCCGCAAATGCCATCTTTTTCAATGGGGATCGGGTTGAAGGAACAGGCAATCCAGTGATTGAGAGATTGTCTGATTTGCAGAACATAGCAGAAATTTTAGTTTCAAAATTTGGAGACTCCACTAATACATGGGCTATCGAGACTTCTGATTTCAATGGAGCTTTTGCTATATATAAGGATTTTATCCCTTCTCTGAATCGATGGGGAGAACCAAAATCATATACTCCAAATGGGTTTCCTGCTTCGATGTCAACCGTTTCACTTTTGGGAAGTTGCTACACTGAGGTAAAGAAGATAATTTCTAGGGGAAAACAAGAATCACAGCCAACCACCATATCCACGCTGCATTGCAGTCCACCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGAACTGTAGTTAACCAGCTAGTTGCTGAACTTGGCTCGAAAGACTTTATAGCTGCTGAGAATCTGCCTCATTCTAATCAAGCATCAGGTGTTGAATGTTCTAACCTCAATGAGGTTCAGTTCACATCTACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTCGATGTTGGATTGAACAGCCATGGTGCATATCTTACAGACCCTGAGGTGATTAAAAGAATCTCTAGTAGCCTTGTTCGGGAGTCGAGACGAGTCCATTTCGTTCTTCATGGTACTCCAAGACAATGGTGTGACAACAGAAGAGTTTGGATACGAGATGAGAAGGAAAAAATGTTGAGTTTTCTTGAATCTGAAGCACGGGAAAGTGGAGGAAAGTTGCAGGTTTTTGAGAAATTTTATTTTTCTGATAGGCCTGCAGACTTGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGAGTTCTTTTGTCAGGTCGCTGGAAAATCAAGAGTTCTGAAGTCAACCAATCAGGTAGTTTAACAAGATTTTGATGAGAGAGGAAAGTCAGTAACCTTAGAATTCGAGGATAAAGTTATTGTACAACCTATAGAACGAATTTTGTTCAAGTAGGGTCGTGAGAGAAGTCAGTAGCCGTATTATATCCCTTGTTAAAATGGTGAATTGTTGTCCTTGTTAGAAAGAAGAATTGTTGCGAGAACATTTCTTCGTACCATCGATG
Coding sequence (CDS)
ATGATGGAGCGTTGGAGTGGAGTTTTGAAGGTTCCATTGCATCTCAACAGCCACAAGTTTTATCGAGTTGGAGCATCACTCTGCCTTTCCCCAACTTCCAAAACCTTGTCTGTTCCTTCCGCAAATGCCATCTTTTTCAATGGGGATCGGGTTGAAGGAACAGGCAATCCAGTGATTGAGAGATTGTCTGATTTGCAGAACATAGCAGAAATTTTAGTTTCAAAATTTGGAGACTCCACTAATACATGGGCTATCGAGACTTCTGATTTCAATGGAGCTTTTGCTATATATAAGGATTTTATCCCTTCTCTGAATCGATGGGGAGAACCAAAATCATATACTCCAAATGGGTTTCCTGCTTCGATGTCAACCGTTTCACTTTTGGGAAGTTGCTACACTGAGGTAAAGAAGATAATTTCTAGGGGAAAACAAGAATCACAGCCAACCACCATATCCACGCTGCATTGCAGTCCACCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGAACTGTAGTTAACCAGCTAGTTGCTGAACTTGGCTCGAAAGACTTTATAGCTGCTGAGAATCTGCCTCATTCTAATCAAGCATCAGGTGTTGAATGTTCTAACCTCAATGAGGTTCAGTTCACATCTACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTCGATGTTGGATTGAACAGCCATGGTGCATATCTTACAGACCCTGAGGTGATTAAAAGAATCTCTAGTAGCCTTGTTCGGGAGTCGAGACGAGTCCATTTCGTTCTTCATGGTACTCCAAGACAATGGTGTGACAACAGAAGAGTTTGGATACGAGATGAGAAGGAAAAAATGTTGAGTTTTCTTGAATCTGAAGCACGGGAAAGTGGAGGAAAGTTGCAGGTTTTTGAGAAATTTTATTTTTCTGATAGGCCTGCAGACTTGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGA
Protein sequence
MMERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIERLSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPASMSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELGSKDFIAAENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLTDPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKLQVFEKFYFSDRPADLQMHFEIIEKLDVC
Homology
BLAST of Tan0014384 vs. NCBI nr
Match:
XP_011650653.1 (UPF0565 protein C2orf69 homolog [Cucumis sativus] >KGN56378.1 hypothetical protein Csa_011056 [Cucumis sativus])
HSP 1 Score: 558.5 bits (1438), Expect = 3.8e-155
Identity = 273/328 (83.23%), Postives = 293/328 (89.33%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
M+RW+G+LKVPL+ NS KFYRV SLCLSPTSKTL+VP NAIFFNGDRVEGTGNPVIER
Sbjct: 1 MDRWNGILKVPLNSNSRKFYRVAVSLCLSPTSKTLTVPRGNAIFFNGDRVEGTGNPVIER 60
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LS+LQNIAEILVSKFGDSTN W +E SDFNGAFAIY+DFIPSLNRWGEPKSYTPNGFPAS
Sbjct: 61 LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+STVSLLGSCY EVKKI+SRGK SQ T ISTL C P+TIILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPRSQETAISTLSCCTPETIILGFSKGGTVVNQLVTELG 180
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD +AA ENLP S Q SGVECS L+E+QF TT QSFLKSITEIHYVDVGLNSHGAYLT
Sbjct: 181 SKDLMAADENLPLSKQESGVECSKLDEIQFVPTTGQSFLKSITEIHYVDVGLNSHGAYLT 240
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPEVIKRISSSL++ESR + FVLHGTPRQWCD RRVWIRDEKEKM SFLESEA SGG L
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKEKMRSFLESEALRSGGNL 300
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
+V EKFYF+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 KVNEKFYFADRPADMQMHFEIIEKLDVC 328
BLAST of Tan0014384 vs. NCBI nr
Match:
KAG7020705.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 556.2 bits (1432), Expect = 1.9e-154
Identity = 274/328 (83.54%), Postives = 293/328 (89.33%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
MERW G+LKVPLH SHKFYRV ASLCLSP+SKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 655 MERWCGILKVPLHPQSHKFYRVAASLCLSPSSKTLTMPHANAILFNGDRVEGTGNPVIER 714
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DFIPSLNRWGEPKSYT NGFPAS
Sbjct: 715 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 774
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+ST+SLLGSCY+EVKKIISR Q S TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 775 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 834
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD IAA EN PHS QA GVECS L+EVQF TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 835 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 894
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPEV+KRIS+SLV+ESR + FVLHGTPRQWCD+RRVWIRDEKEKMLS LESEAR SGG+L
Sbjct: 895 DPEVMKRISNSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKEKMLSLLESEARRSGGRL 954
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 955 QVCERFHFADRPADMQMHFEIIEKLDVC 982
BLAST of Tan0014384 vs. NCBI nr
Match:
KAG6582701.1 (hypothetical protein SDJN03_22703, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 556.2 bits (1432), Expect = 1.9e-154
Identity = 274/328 (83.54%), Postives = 293/328 (89.33%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
MERW G+LKVPLH SHKFYRV ASLCLSP+SKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 46 MERWCGILKVPLHPQSHKFYRVAASLCLSPSSKTLTMPHANAILFNGDRVEGTGNPVIER 105
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DFIPSLNRWGEPKSYT NGFPAS
Sbjct: 106 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 165
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+ST+SLLGSCY+EVKKIISR Q S TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 166 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 225
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD IAA EN PHS QA GVECS L+EVQF TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 226 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 285
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPEV+KRIS+SLV+ESR + FVLHGTPRQWCD+RRVWIRDEKEKMLS LESEAR SGG+L
Sbjct: 286 DPEVMKRISNSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKEKMLSLLESEARRSGGRL 345
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 346 QVCERFHFADRPADMQMHFEIIEKLDVC 373
BLAST of Tan0014384 vs. NCBI nr
Match:
XP_023527404.1 (uncharacterized protein LOC111790644 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 553.9 bits (1426), Expect = 9.3e-154
Identity = 274/328 (83.54%), Postives = 291/328 (88.72%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
MERW G+LKVPLH SHKFYRV ASLCLSPTSKTL+ P ANAI FNGDRVEGTGNPVIER
Sbjct: 46 MERWCGILKVPLHPQSHKFYRVAASLCLSPTSKTLTTPHANAILFNGDRVEGTGNPVIER 105
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LSD QNIA+ILVSKFGDSTN W IE SDFNG FAIY DFIPSLNRWGEPKSYT NGFPAS
Sbjct: 106 LSDPQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 165
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+ST+SLLGSCY+EVKKIISR Q S TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 166 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 225
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD IAA EN PHS QA GVECS L+EVQF TEQSFLKSITEIHYVDVGLNS GAY T
Sbjct: 226 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSQGAYFT 285
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPEVIKRIS+SLV+ESR + FVLHGTPRQWCD+RRVWIRDEK+KMLSFLESEAR SGG+L
Sbjct: 286 DPEVIKRISNSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKDKMLSFLESEARRSGGRL 345
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 346 QVCERFHFADRPADMQMHFEIIEKLDVC 373
BLAST of Tan0014384 vs. NCBI nr
Match:
XP_022924396.1 (uncharacterized protein LOC111431905 [Cucurbita moschata])
HSP 1 Score: 551.6 bits (1420), Expect = 4.6e-153
Identity = 272/328 (82.93%), Postives = 291/328 (88.72%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
MERW G+LKVPLH SHKFYRV ASLCLSPTSKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 1 MERWCGILKVPLHPQSHKFYRVAASLCLSPTSKTLTMPHANAILFNGDRVEGTGNPVIER 60
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DF+ SLNRWGEPKSYT NGFPAS
Sbjct: 61 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFMHSLNRWGEPKSYTANGFPAS 120
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+ST+SLLGSCY+EVKKIISR Q S TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 121 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 180
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD IAA EN PHS QA GVECS L+EVQF TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 181 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 240
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPEVIKRIS+SLV+ESR + F+LHGTPRQWCD+RRVWIRDEKEKM S LESEAR SGG+L
Sbjct: 241 DPEVIKRISNSLVQESRGIRFILHGTPRQWCDSRRVWIRDEKEKMSSLLESEARRSGGRL 300
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 QVCERFHFADRPADMQMHFEIIEKLDVC 328
BLAST of Tan0014384 vs. ExPASy TrEMBL
Match:
A0A0A0L3P6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G118130 PE=4 SV=1)
HSP 1 Score: 558.5 bits (1438), Expect = 1.8e-155
Identity = 273/328 (83.23%), Postives = 293/328 (89.33%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
M+RW+G+LKVPL+ NS KFYRV SLCLSPTSKTL+VP NAIFFNGDRVEGTGNPVIER
Sbjct: 1 MDRWNGILKVPLNSNSRKFYRVAVSLCLSPTSKTLTVPRGNAIFFNGDRVEGTGNPVIER 60
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LS+LQNIAEILVSKFGDSTN W +E SDFNGAFAIY+DFIPSLNRWGEPKSYTPNGFPAS
Sbjct: 61 LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+STVSLLGSCY EVKKI+SRGK SQ T ISTL C P+TIILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPRSQETAISTLSCCTPETIILGFSKGGTVVNQLVTELG 180
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD +AA ENLP S Q SGVECS L+E+QF TT QSFLKSITEIHYVDVGLNSHGAYLT
Sbjct: 181 SKDLMAADENLPLSKQESGVECSKLDEIQFVPTTGQSFLKSITEIHYVDVGLNSHGAYLT 240
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPEVIKRISSSL++ESR + FVLHGTPRQWCD RRVWIRDEKEKM SFLESEA SGG L
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKEKMRSFLESEALRSGGNL 300
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
+V EKFYF+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 KVNEKFYFADRPADMQMHFEIIEKLDVC 328
BLAST of Tan0014384 vs. ExPASy TrEMBL
Match:
A0A6J1E9C8 (uncharacterized protein LOC111431905 OS=Cucurbita moschata OX=3662 GN=LOC111431905 PE=4 SV=1)
HSP 1 Score: 551.6 bits (1420), Expect = 2.2e-153
Identity = 272/328 (82.93%), Postives = 291/328 (88.72%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
MERW G+LKVPLH SHKFYRV ASLCLSPTSKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 1 MERWCGILKVPLHPQSHKFYRVAASLCLSPTSKTLTMPHANAILFNGDRVEGTGNPVIER 60
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DF+ SLNRWGEPKSYT NGFPAS
Sbjct: 61 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFMHSLNRWGEPKSYTANGFPAS 120
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+ST+SLLGSCY+EVKKIISR Q S TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 121 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 180
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD IAA EN PHS QA GVECS L+EVQF TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 181 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 240
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPEVIKRIS+SLV+ESR + F+LHGTPRQWCD+RRVWIRDEKEKM S LESEAR SGG+L
Sbjct: 241 DPEVIKRISNSLVQESRGIRFILHGTPRQWCDSRRVWIRDEKEKMSSLLESEARRSGGRL 300
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 QVCERFHFADRPADMQMHFEIIEKLDVC 328
BLAST of Tan0014384 vs. ExPASy TrEMBL
Match:
A0A1S3AV17 (UPF0565 protein C2orf69 homolog OS=Cucumis melo OX=3656 GN=LOC103483140 PE=4 SV=1)
HSP 1 Score: 551.2 bits (1419), Expect = 2.9e-153
Identity = 271/328 (82.62%), Postives = 290/328 (88.41%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
M+RW+G+LKVPL NS KFYRV SLCLSPTSKTL+VP ANAIFFNGDRVEGTGNPVIE
Sbjct: 1 MDRWNGILKVPLRSNSRKFYRVAVSLCLSPTSKTLTVPRANAIFFNGDRVEGTGNPVIEG 60
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LS+LQNIAEILVSKFGDSTN W +E SDFNGAFAIY+DFIPSLNRWGEPKSYTPNGFPAS
Sbjct: 61 LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+STVSLLGSCY EVKKI+SRGK SQ TTI TL P+T+ILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPGSQETTIPTLSGCTPETVILGFSKGGTVVNQLVTELG 180
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD IAA ENLP S Q SGVECS L+E QF TTE SFLKSITEIHYVDVGLN+HGAYLT
Sbjct: 181 SKDLIAADENLPLSKQESGVECSKLDENQFIPTTEHSFLKSITEIHYVDVGLNTHGAYLT 240
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPEVIKRISSSL++ESR + FVLHGTPRQWCD RRVWIRDEKE M SFLESEA SGG L
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKETMTSFLESEALRSGGNL 300
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
QV+EKFYF+DRPAD+QMHFEIIEKLDVC
Sbjct: 301 QVYEKFYFADRPADMQMHFEIIEKLDVC 328
BLAST of Tan0014384 vs. ExPASy TrEMBL
Match:
A0A6J1CWE4 (uncharacterized protein LOC111014908 OS=Momordica charantia OX=3673 GN=LOC111014908 PE=4 SV=1)
HSP 1 Score: 545.8 bits (1405), Expect = 1.2e-151
Identity = 267/329 (81.16%), Postives = 291/329 (88.45%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
M RWSG+LKVPLH S KFYRVGAS+CLSP SKTL+VPSAN IFFNGDRVEGTGNPVIER
Sbjct: 1 MGRWSGILKVPLHPKSQKFYRVGASICLSPNSKTLTVPSANVIFFNGDRVEGTGNPVIER 60
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LSDLQ IAEILVSKFGDSTN W +E SDFNG FA+YKDFIP LNRWGEPKSY PNGFPAS
Sbjct: 61 LSDLQKIAEILVSKFGDSTNAWVVEASDFNGTFAVYKDFIPYLNRWGEPKSYIPNGFPAS 120
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+STVSLLGSCY EVKK+ISRGKQ SQ +TISTL CS PKTIILGFSKGGTVVNQLVAELG
Sbjct: 121 VSTVSLLGSCYNEVKKMISRGKQGSQSSTISTLCCSLPKTIILGFSKGGTVVNQLVAELG 180
Query: 182 SKDFIAAENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLTD 241
S +F+A +NLPHS Q++GVECSNL +QF TTE+ FLKS+TEIHYVDVGLN+HGAYLTD
Sbjct: 181 SSEFMAVDNLPHSKQSAGVECSNLEGIQFIPTTERGFLKSMTEIHYVDVGLNTHGAYLTD 240
Query: 242 PEVIKRISSSLV--RESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGK 301
EVIKRISSSLV +ESR V F+LHGTPRQWCD+RRVWIR EKEKMLS LESEAR SGGK
Sbjct: 241 HEVIKRISSSLVQDQESRGVRFILHGTPRQWCDSRRVWIRHEKEKMLSLLESEARRSGGK 300
Query: 302 LQVFEKFYFSDRPADLQMHFEIIEKLDVC 329
LQV EKF F+D P ++QMHFEIIEKL+VC
Sbjct: 301 LQVCEKFCFADSPPNMQMHFEIIEKLEVC 329
BLAST of Tan0014384 vs. ExPASy TrEMBL
Match:
A0A6J1IN96 (uncharacterized protein LOC111479029 OS=Cucurbita maxima OX=3661 GN=LOC111479029 PE=4 SV=1)
HSP 1 Score: 541.2 bits (1393), Expect = 3.0e-150
Identity = 268/328 (81.71%), Postives = 287/328 (87.50%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
MERW G+LKVPLH H+FYRV ASLCLSPTSKTL++P ANAI FNGDRVEGTGNPVIER
Sbjct: 61 MERWCGILKVPLHPQCHRFYRVAASLCLSPTSKTLTMPHANAILFNGDRVEGTGNPVIER 120
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
LSDLQNIA+ILVSKFGDSTN W IE SDFNG FAIY DFIPSLNRWGEPKSYT NGFPAS
Sbjct: 121 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 180
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
+ST+SLLGSCY+EVKKIISR Q S TISTL CS PKTIILGFSKGGTV NQLVAELG
Sbjct: 181 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVANQLVAELG 240
Query: 182 SKDFIAA-ENLPHSNQASGVECSNLNEVQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
SKD IAA EN PHS QA GVECS L+E QF TEQSFLKSITEIHYVDVGLNS GAY T
Sbjct: 241 SKDLIAADENPPHSKQAPGVECSKLDEDQFIPNTEQSFLKSITEIHYVDVGLNSQGAYFT 300
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
DPE IKRIS+SLV+ESR + FVLHGTPRQW D+RRVWIRDEK+KMLS LESEAR SGG+L
Sbjct: 301 DPEAIKRISNSLVQESRGIRFVLHGTPRQWGDSRRVWIRDEKDKMLSLLESEARRSGGRL 360
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDVC 329
QV E+F+F+DRPAD+QMHFEIIEKLDVC
Sbjct: 361 QVCERFHFADRPADMQMHFEIIEKLDVC 388
BLAST of Tan0014384 vs. TAIR 10
Match:
AT2G44850.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: M germinated pollen stage, LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0565 (InterPro:IPR018881); Has 106 Blast hits to 106 proteins in 50 species: Archae - 0; Bacteria - 0; Metazoa - 73; Fungi - 0; Plants - 31; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )
HSP 1 Score: 368.2 bits (944), Expect = 6.7e-102
Identity = 189/327 (57.80%), Postives = 239/327 (73.09%), Query Frame = 0
Query: 2 MERWSGVLKVPLHLNSHKFYRVGASLCLSPTSKTLSVPSANAIFFNGDRVEGTGNPVIER 61
MERWSGVLK+PL + +YRV ASLCLS +SKTL+VPSANAIFF+GD+V+ TGN VIER
Sbjct: 1 MERWSGVLKIPLDATTSNYYRVAASLCLS-SSKTLTVPSANAIFFHGDKVQDTGNHVIER 60
Query: 62 LSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIYKDFIPSLNRWGEPKSYTPNGFPAS 121
L DLQ +AEI+VSKFG+S N W +E S FNG FAIYKDF+PS+N G PKSY+P GFPAS
Sbjct: 61 LYDLQKVAEIIVSKFGNSVNAWVVEASVFNGPFAIYKDFVPSVNHMGAPKSYSPVGFPAS 120
Query: 122 MSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCSPPKTIILGFSKGGTVVNQLVAELG 181
S VSLL SC EV K G I+++H PKTI+LGFSKGG V+NQL++E+
Sbjct: 121 SSIVSLLSSCLHEVLK---EGTDVCLIDQIASVH-HCPKTIVLGFSKGGVVMNQLMSEIS 180
Query: 182 SKDFIAAENLPHSNQASGVECSNLNE-VQFTSTTEQSFLKSITEIHYVDVGLNSHGAYLT 241
S D N ++ A E ++ +E +Q +++SFL SI+E+HY+DVGLNS GAY+T
Sbjct: 181 SLD----TNFAKTSSAMVEESTSQHEKIQIIPASKESFLNSISEVHYIDVGLNSSGAYIT 240
Query: 242 DPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRVWIRDEKEKMLSFLESEARESGGKL 301
D V++RIS L R + + V+HGTPRQWCD R WIR EK++++ L++E SGGKL
Sbjct: 241 DHNVVQRISQRLARGADSLRIVIHGTPRQWCDELRGWIRKEKDELVRLLKAETENSGGKL 300
Query: 302 QVFEKFYFSDRPADLQMHFEIIEKLDV 328
QV E+FYFSDR ADLQMHFEII+ +DV
Sbjct: 301 QVCERFYFSDRLADLQMHFEIIDAMDV 318
BLAST of Tan0014384 vs. TAIR 10
Match:
AT2G44850.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0565 (InterPro:IPR018881); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G45380.1); Has 138 Blast hits to 138 proteins in 53 species: Archae - 0; Bacteria - 0; Metazoa - 73; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )
HSP 1 Score: 322.4 bits (825), Expect = 4.2e-88
Identity = 165/291 (56.70%), Postives = 210/291 (72.16%), Query Frame = 0
Query: 38 VPSANAIFFNGDRVEGTGNPVIERLSDLQNIAEILVSKFGDSTNTWAIETSDFNGAFAIY 97
VPSANAIFF+GD+V+ TGN VIERL DLQ +AEI+VSKFG+S N W +E S FNG FAIY
Sbjct: 117 VPSANAIFFHGDKVQDTGNHVIERLYDLQKVAEIIVSKFGNSVNAWVVEASVFNGPFAIY 176
Query: 98 KDFIPSLNRWGEPKSYTPNGFPASMSTVSLLGSCYTEVKKIISRGKQESQPTTISTLHCS 157
KDF+PS+N G PKSY+P GFPAS S VSLL SC EV K G I+++H
Sbjct: 177 KDFVPSVNHMGAPKSYSPVGFPASSSIVSLLSSCLHEVLK---EGTDVCLIDQIASVH-H 236
Query: 158 PPKTIILGFSKGGTVVNQLVAELGSKDFIAAENLPHSNQASGVECSNLNE-VQFTSTTEQ 217
PKTI+LGFSKGG V+NQL++E+ S D N ++ A E ++ +E +Q +++
Sbjct: 237 CPKTIVLGFSKGGVVMNQLMSEISSLD----TNFAKTSSAMVEESTSQHEKIQIIPASKE 296
Query: 218 SFLKSITEIHYVDVGLNSHGAYLTDPEVIKRISSSLVRESRRVHFVLHGTPRQWCDNRRV 277
SFL SI+E+HY+DVGLNS GAY+TD V++RIS L R + + V+HGTPRQWCD R
Sbjct: 297 SFLNSISEVHYIDVGLNSSGAYITDHNVVQRISQRLARGADSLRIVIHGTPRQWCDELRG 356
Query: 278 WIRDEKEKMLSFLESEARESGGKLQVFEKFYFSDRPADLQMHFEIIEKLDV 328
WIR EK++++ L++E SGGKLQV E+FYFSDR ADLQMHFEII+ +DV
Sbjct: 357 WIRKEKDELVRLLKAETENSGGKLQVCERFYFSDRLADLQMHFEIIDAMDV 399
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_011650653.1 | 3.8e-155 | 83.23 | UPF0565 protein C2orf69 homolog [Cucumis sativus] >KGN56378.1 hypothetical prote... | [more] |
KAG7020705.1 | 1.9e-154 | 83.54 | Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... | [more] |
KAG6582701.1 | 1.9e-154 | 83.54 | hypothetical protein SDJN03_22703, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023527404.1 | 9.3e-154 | 83.54 | uncharacterized protein LOC111790644 [Cucurbita pepo subsp. pepo] | [more] |
XP_022924396.1 | 4.6e-153 | 82.93 | uncharacterized protein LOC111431905 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0L3P6 | 1.8e-155 | 83.23 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G118130 PE=4 SV=1 | [more] |
A0A6J1E9C8 | 2.2e-153 | 82.93 | uncharacterized protein LOC111431905 OS=Cucurbita moschata OX=3662 GN=LOC1114319... | [more] |
A0A1S3AV17 | 2.9e-153 | 82.62 | UPF0565 protein C2orf69 homolog OS=Cucumis melo OX=3656 GN=LOC103483140 PE=4 SV=... | [more] |
A0A6J1CWE4 | 1.2e-151 | 81.16 | uncharacterized protein LOC111014908 OS=Momordica charantia OX=3673 GN=LOC111014... | [more] |
A0A6J1IN96 | 3.0e-150 | 81.71 | uncharacterized protein LOC111479029 OS=Cucurbita maxima OX=3661 GN=LOC111479029... | [more] |
Match Name | E-value | Identity | Description | |
AT2G44850.2 | 6.7e-102 | 57.80 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT2G44850.1 | 4.2e-88 | 56.70 | unknown protein; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0... | [more] |