Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGGCAATGGCAGCCTCCAACCGTCCCGTACACATTTCCCCAAGAACCTGCTTCTTCCCATTTTCCCGCCAAATCAACACCTTCTACCGACCGCGGCGGCGTCTCTGCTTCACGAACTCCAAAGGTCAGTTCATCTCCTTCTCCTTCGTCGCATAAACATTATCGCTTGATCCCATCACTTCCTTAGCCCTCGTATTCTCCGTCAATCAAACCGGAAAATTCATTGGAATGAACTCAGAGTTTTTCTTCCAATTAACATTTATCTCTCTCGACTATGTTTCAGAAATTTACGTGTTCAATTTCTTAATTTCTGCCTCTTCTCGTCTCGTAGGGCTGGGATGGTACGGATGTGGAGTTTGTGTACGGTTACCTGGTTTTGAGGTCGCTGCCGTAGCCGGAGGAAGGGAGAGAGCGCAAGTATCTGCTGCGTGGGATGAGGAGCCTTATGAATTGCTTCCGAACGGGAAAATACAGTACCTGGACGAACAAGATGTAGTAACCTTTCTAGATCCGCCCAAGGAGCTCATACCATTAGACCCTGCTTCTTATAATCCAGCTGCGTATCTTTGGTACGTAGTAATTTATTTCTGAGCTTGTAGCCTTCTTAATCGACTTCCGCTAATCTTATTCAATCTCATGTGCTTGTTTCATATCATCATAAATTAACTACTACCATTACAAAATTGTAACGGTCAGTGGTTTAACATGCTTTATATTTTCCAATTTGACATAGTTTCTAACATTACCAACTGGTTGGCAAAATATTTTTGGTTTTTTCTACTGTATTGTTGCTGATTCTCATGTAGTTTACTCGCAATTCAGTTTGAAATTTTGTCTTTTGATTGACAGGAAGAAGATTGAAGATATTCCTGAGGAACGACGTCATAGACTTTTGCACCTCCTAAGCCCTAGGTTACATATAGGAAGCCTTGTAATACCTATATAGTTGTTTATGCTCTTTCTTATTCTTCATTATGCAATTTCCATTCGCTCGAACACTCTTCTTAATTTTGTCGGAACTATTTTAGCAACAGAATTCTACATTTTAAGTTTTGTTAAGAATTCGGAAATTCTGACACCAGCAATATAATATAGGTGTATATCAAGGGCTTGGGGAATAGCTGGCACACGATATGAGGATCCAAAGTTGGTTAAGAAAATGGCATCTAGTTTGCTGCAAAATGAAGACAGTACGGTGCTTGAATATTATAACTGCCTAAAGAGTGGAGGTAAGCAGTTCACTACATTATTTTCCCTATCGTATGGAAGAATTCAAATTTTGTTTCCTAATTTCTAAAATTCCTTCACTGTATAAGGAAAATGATTCGATAGAGGACAAGAAAGGGTGAACTTTGTTATTGGGTGGTGATTTTTTGGAACATTTAAAGTTTAAACATTCAAAGTTTTATTTAATATTTTCCGCAAAATTATAATAATGTTATGATAGCCTAACAACCTTGATTTTTAATATCATGAATAGGACAAATACCTATTGCTTGGATTAATCTTTTCAAGAAGGTGAAAATTATCAAGTCCTTGTCCCCACTTTCTTCTTTCAAGTGTACAGTACAATATAATCTATCTATTTACGTTTCCTTGTTTATTAGGCATTATTTTCTTGCAAAGATGGAAAGACTTATGGGCGGTTTATTGGTAAGAGCTCTTAACATTTTTCAACTATACATTCTTCTCCTCTTATGACGTTAGTAACTTTGTAAAACTACAGATTTTATGGCACTAAAACAGTACAGAAGCTAGCTGATCCATATCCCATTGACCTATAAGTTTACTACATCCTCATTTTCTTAATTCATGGATATGGATTGGACACCAACACGACTATTTCGACGTATCACTATGTTGATAGCACTTTAAATTATTTGCAGTTGCACAATCACAAATGATGATCAAAATCCAATTGTCACAACCTCATTCCTCTGTCTGTTTGATGGAGTTTTTGTAAAATGAAAATGGATCATTTGATGATCACCAATCCACTAAAATTTCCTTCTGCTAACATTATCAGGCATGTCCTTACTGGCTGGATTTGCAAATTGCTTTAGTCCATTGTACTTTGAGGTGACACAACTTAAGGAGGTAATGTCAACCGAACATCCTTGTGACTTGGCATATGAATTTGGAGATGGGCTTTTTGATATTCATGAATACCCTGAAGGCTTTCCGGCTCCAGGTATGATATTGAAATCATTTTTTTGTCTTTCTTCACATTGATTAAATCTCATCAAATGTGTTGGTTCTTCCATGATACAATTTCAACTCAAATCTCATGTTATTTATGCAAGTTAATTGCGACCAGGATATTTTTCATGAAAGAAATGACAACGAACAAGAACTGTTTGCTCTGCATGCACCTATTTTCTCAGGGGTAGGATGTTGCAAGGTCTGGTCCCCTCCTTAGCTTTGCAACATAAGCTCCACTATGTATATGAAGTCTTTAGTGTCTTTCTTGAAGGAGACTTCTAAGTAACTATGTACCATTAGGAACAAAATTAGAGACGTGAGAGATGGTTTGCATATTTGGAGATACACTCATGAATTTGCGATGGTGAAGAACATCCTTTACATGAGTTTTTAAATAACCTTGTTAAGTGCATTTCATATTCAATTGAAATGATAGCTCTCATGCAGCATGCTTCTCCTTGGTTTTCCAAGTTTTCAATACTTTATTTTAGCCCCCTCGCTTTCAAATCTACCAAAGTATTATATAATAAGTACGTACTTTAGAGGTAACTTCACCTCCATTTTCCCTCTTTTTAGCCTCTTGATCTATACCCGTTAGAAAGAACCTTCAACCTCATATATTTCTTCATCAGAAACTGATGCAAGTTTCACTCGATTAAATATTAATTATTTGTATAGCTTCCAAATTTATAATATTTAAGAATTTGTGTGTTATTCTTTTCATTTTGCAGTCAAGCATCGATATCCTTTCAACGATCAGCTTGTAGTATATGTTCGATATCTAGGACCTGGAGTGTTAGTTGGCCAGGCATGGCAAGAAGGAAAAGCTTTGGAGCAGGTGCCACGTAAATTATGTTCTGAAATCTTGATGATCAAAGACTACAGTCCATGTCCACTCCAGGAAAAGCAATAG
mRNA sequence
ATGAAGGCAATGGCAGCCTCCAACCGTCCCGTACACATTTCCCCAAGAACCTGCTTCTTCCCATTTTCCCGCCAAATCAACACCTTCTACCGACCGCGGCGGCGTCTCTGCTTCACGAACTCCAAAGGGCTGGGATGGTACGGATGTGGAGTTTGTGTACGGTTACCTGGTTTTGAGGTCGCTGCCGTAGCCGGAGGAAGGGAGAGAGCGCAAGTATCTGCTGCGTGGGATGAGGAGCCTTATGAATTGCTTCCGAACGGGAAAATACAGTACCTGGACGAACAAGATGTAGTAACCTTTCTAGATCCGCCCAAGGAGCTCATACCATTAGACCCTGCTTCTTATAATCCAGCTGCGTATCTTTGGAAGAAGATTGAAGATATTCCTGAGGAACGACGTCATAGACTTTTGCACCTCCTAAGCCCTAGGTGTATATCAAGGGCTTGGGGAATAGCTGGCACACGATATGAGGATCCAAAGTTGGTTAAGAAAATGGCATCTAGTTTGCTGCAAAATGAAGACAGTACGGTGCTTGAATATTATAACTGCCTAAAGAGTGGAGGACAAATACCTATTGCTTGGATTAATCTTTTCAAGAAGGCATTATTTTCTTGCAAAGATGGAAAGACTTATGGGCGGTTTATTGGCATGTCCTTACTGGCTGGATTTGCAAATTGCTTTAGTCCATTGTACTTTGAGGTGACACAACTTAAGGAGGTAATGTCAACCGAACATCCTTGTGACTTGGCATATGAATTTGGAGATGGGCTTTTTGATATTCATGAATACCCTGAAGGCTTTCCGGCTCCAGTCAAGCATCGATATCCTTTCAACGATCAGCTTGTAGTATATGTTCGATATCTAGGACCTGGAGTGTTAGTTGGCCAGGCATGGCAAGAAGGAAAAGCTTTGGAGCAGGTGCCACGTAAATTATGTTCTGAAATCTTGATGATCAAAGACTACAGTCCATGTCCACTCCAGGAAAAGCAATAG
Coding sequence (CDS)
ATGAAGGCAATGGCAGCCTCCAACCGTCCCGTACACATTTCCCCAAGAACCTGCTTCTTCCCATTTTCCCGCCAAATCAACACCTTCTACCGACCGCGGCGGCGTCTCTGCTTCACGAACTCCAAAGGGCTGGGATGGTACGGATGTGGAGTTTGTGTACGGTTACCTGGTTTTGAGGTCGCTGCCGTAGCCGGAGGAAGGGAGAGAGCGCAAGTATCTGCTGCGTGGGATGAGGAGCCTTATGAATTGCTTCCGAACGGGAAAATACAGTACCTGGACGAACAAGATGTAGTAACCTTTCTAGATCCGCCCAAGGAGCTCATACCATTAGACCCTGCTTCTTATAATCCAGCTGCGTATCTTTGGAAGAAGATTGAAGATATTCCTGAGGAACGACGTCATAGACTTTTGCACCTCCTAAGCCCTAGGTGTATATCAAGGGCTTGGGGAATAGCTGGCACACGATATGAGGATCCAAAGTTGGTTAAGAAAATGGCATCTAGTTTGCTGCAAAATGAAGACAGTACGGTGCTTGAATATTATAACTGCCTAAAGAGTGGAGGACAAATACCTATTGCTTGGATTAATCTTTTCAAGAAGGCATTATTTTCTTGCAAAGATGGAAAGACTTATGGGCGGTTTATTGGCATGTCCTTACTGGCTGGATTTGCAAATTGCTTTAGTCCATTGTACTTTGAGGTGACACAACTTAAGGAGGTAATGTCAACCGAACATCCTTGTGACTTGGCATATGAATTTGGAGATGGGCTTTTTGATATTCATGAATACCCTGAAGGCTTTCCGGCTCCAGTCAAGCATCGATATCCTTTCAACGATCAGCTTGTAGTATATGTTCGATATCTAGGACCTGGAGTGTTAGTTGGCCAGGCATGGCAAGAAGGAAAAGCTTTGGAGCAGGTGCCACGTAAATTATGTTCTGAAATCTTGATGATCAAAGACTACAGTCCATGTCCACTCCAGGAAAAGCAATAG
Protein sequence
MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQEGKALEQVPRKLCSEILMIKDYSPCPLQEKQ
Homology
BLAST of Tan0021039 vs. NCBI nr
Match:
XP_023000285.1 (uncharacterized protein LOC111494560 [Cucurbita maxima])
HSP 1 Score: 618.2 bits (1593), Expect = 4.0e-173
Identity = 294/330 (89.09%), Postives = 306/330 (92.73%), Query Frame = 0
Query: 1 MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
MKAMAAS+RPV IS TCFFPFSRQINTF RPRRRLC+TNSKGL W+G GVCV P FEV
Sbjct: 1 MKAMAASSRPVQISHGTCFFPFSRQINTFCRPRRRLCYTNSKGLKWFGYGVCVSPPSFEV 60
Query: 61 AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
AAVAGGRERAQVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61 AAVAGGRERAQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120
Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D + LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGSTLEY 180
Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
YNC+KSGGQIPIAWIN FKKALFSCKDGKTYGRFIGM LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSCKDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240
Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
MSTEHPCDLAYEFGDGLFD HEYPEGFPA VKHRYPFNDQLVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDFHEYPEGFPASVKHRYPFNDQLVVYVRFVGPGVLVGQAWQE 300
Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
GK+LEQVPRKLCSEILMIKDYSP PL EKQ
Sbjct: 301 GKSLEQVPRKLCSEILMIKDYSPRPLPEKQ 329
BLAST of Tan0021039 vs. NCBI nr
Match:
XP_023515334.1 (uncharacterized protein LOC111779395 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 617.1 bits (1590), Expect = 9.0e-173
Identity = 294/330 (89.09%), Postives = 306/330 (92.73%), Query Frame = 0
Query: 1 MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
MKAMAAS+RPV IS TCFFPFSRQINTF RPRRRLC+TNSKGL W+G GVCV P FEV
Sbjct: 1 MKAMAASSRPVQISHGTCFFPFSRQINTFCRPRRRLCYTNSKGLKWFGYGVCVSPPSFEV 60
Query: 61 AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
AAVAGGRER QVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61 AAVAGGRERPQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120
Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D T+LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGTMLEY 180
Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
YNC+KSGGQIPIAWIN FKKALFS DGKTYGRFIGM LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSSNDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240
Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
MSTEHPCDLAYEFGDGLFD+HEYPEGFPA VKHRYPFNDQLVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDVHEYPEGFPASVKHRYPFNDQLVVYVRFVGPGVLVGQAWQE 300
Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
GKALEQVPRKLCSEILMIKDYSP PLQEKQ
Sbjct: 301 GKALEQVPRKLCSEILMIKDYSPRPLQEKQ 329
BLAST of Tan0021039 vs. NCBI nr
Match:
XP_022964125.1 (uncharacterized protein LOC111464245 [Cucurbita moschata])
HSP 1 Score: 611.3 bits (1575), Expect = 4.9e-171
Identity = 292/330 (88.48%), Postives = 302/330 (91.52%), Query Frame = 0
Query: 1 MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
MKAMAAS+RPV IS TCF PFSRQINTF RP RRLC+TNSKGL W+G GVCV P FEV
Sbjct: 1 MKAMAASSRPVQISHGTCFLPFSRQINTFCRPPRRLCYTNSKGLKWFGYGVCVSPPSFEV 60
Query: 61 AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
AAVAG RERAQVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61 AAVAGRRERAQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120
Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D T LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGTTLEY 180
Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
YNC+KSGGQIPIAWIN FKKALFSC DGKTYGRFIGM LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSCNDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240
Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
MSTEHPCDLAYEFGDGLFD HEYPEGFPA VKHRYPFND LVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDFHEYPEGFPASVKHRYPFNDHLVVYVRFVGPGVLVGQAWQE 300
Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
GKALEQVPRKLCSEILMIKDYSP PLQEKQ
Sbjct: 301 GKALEQVPRKLCSEILMIKDYSPRPLQEKQ 329
BLAST of Tan0021039 vs. NCBI nr
Match:
XP_038899771.1 (uncharacterized protein LOC120087001 [Benincasa hispida])
HSP 1 Score: 598.2 bits (1541), Expect = 4.3e-167
Identity = 287/332 (86.45%), Postives = 298/332 (89.76%), Query Frame = 0
Query: 1 MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPG- 60
MKAMAAS+ V ISP TC PF QINTF PRRRLCFTNSKGL GWYGCG+CVR PG
Sbjct: 4 MKAMAASSSSVQISPGTCCIPFPHQINTFNTPRRRLCFTNSKGLGIGWYGCGICVRSPGC 63
Query: 61 FEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNP 120
VAA+AGGRER Q S+AWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLD ASYNP
Sbjct: 64 VVVAAIAGGREREQASSAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDLASYNP 123
Query: 121 AAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV 180
AAYLWKKIEDIPEERRHRLL LL+PRCISRAWGIAGTRYEDPKL+KKMASSLLQNED V
Sbjct: 124 AAYLWKKIEDIPEERRHRLLQLLTPRCISRAWGIAGTRYEDPKLLKKMASSLLQNEDGMV 183
Query: 181 LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQL 240
LEYY CLKSGGQIPI WIN FKKALFSCKDGKTYGR IGMSLLAGFAN SPLYFEV QL
Sbjct: 184 LEYYYCLKSGGQIPIGWINRFKKALFSCKDGKTYGRIIGMSLLAGFANSVSPLYFEVKQL 243
Query: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQA 300
KEVMSTEHPCDLAYEFGDGLFDIHEYP GFPAP KH YPFNDQ+VVYVRYLGPGVLVGQA
Sbjct: 244 KEVMSTEHPCDLAYEFGDGLFDIHEYPAGFPAPAKHLYPFNDQVVVYVRYLGPGVLVGQA 303
Query: 301 WQEGKALEQVPRKLCSEILMIKDYSPCPLQEK 330
WQEGKALEQVPRKLC+EILMIKDYS P+Q++
Sbjct: 304 WQEGKALEQVPRKLCAEILMIKDYSESPVQKQ 335
BLAST of Tan0021039 vs. NCBI nr
Match:
XP_004141740.2 (uncharacterized protein LOC101213828 [Cucumis sativus] >KAE8646483.1 hypothetical protein Csa_016678 [Cucumis sativus])
HSP 1 Score: 574.3 bits (1479), Expect = 6.7e-160
Identity = 276/332 (83.13%), Postives = 289/332 (87.05%), Query Frame = 0
Query: 1 MKAMAA-SNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPG 60
MKAMAA SNRP+ ISP TC F RQINTF RRL FTN KGL GWY CGVCVR PG
Sbjct: 1 MKAMAATSNRPLQISPWTCCSSFPRQINTFNTQHRRLSFTNFKGLGIGWYSCGVCVRSPG 60
Query: 61 FEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNP 120
VAA AGGRER QVS+ WDEEPYELLPNG+IQY+DEQDV +FLDPPKELIP DP SYNP
Sbjct: 61 CVVAAAAGGREREQVSSVWDEEPYELLPNGRIQYIDEQDVASFLDPPKELIPFDPDSYNP 120
Query: 121 AAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV 180
AAYLWKKIE+IPEERRHRLLHLL+PRCISRAWGIAGTRYEDPKLVKK ASSLLQNED V
Sbjct: 121 AAYLWKKIEEIPEERRHRLLHLLTPRCISRAWGIAGTRYEDPKLVKKTASSLLQNEDGMV 180
Query: 181 LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQL 240
LEYYNCLKSGGQIPI WIN FKKA+FS KDGK YGR I M LLAGFAN SPLYFE+ QL
Sbjct: 181 LEYYNCLKSGGQIPIGWINRFKKAIFSSKDGKIYGRIINMPLLAGFANSVSPLYFEMKQL 240
Query: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQA 300
KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAP KH YPFNDQ+VVYVRYLGPGVLVGQA
Sbjct: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPAKHLYPFNDQVVVYVRYLGPGVLVGQA 300
Query: 301 WQEGKALEQVPRKLCSEILMIKDYSPCPLQEK 330
WQEGKALEQVP+KLC EILMIKDYS PLQ++
Sbjct: 301 WQEGKALEQVPQKLCGEILMIKDYSQQPLQKQ 332
BLAST of Tan0021039 vs. ExPASy TrEMBL
Match:
A0A6J1KHW6 (uncharacterized protein LOC111494560 OS=Cucurbita maxima OX=3661 GN=LOC111494560 PE=4 SV=1)
HSP 1 Score: 618.2 bits (1593), Expect = 1.9e-173
Identity = 294/330 (89.09%), Postives = 306/330 (92.73%), Query Frame = 0
Query: 1 MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
MKAMAAS+RPV IS TCFFPFSRQINTF RPRRRLC+TNSKGL W+G GVCV P FEV
Sbjct: 1 MKAMAASSRPVQISHGTCFFPFSRQINTFCRPRRRLCYTNSKGLKWFGYGVCVSPPSFEV 60
Query: 61 AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
AAVAGGRERAQVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61 AAVAGGRERAQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120
Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D + LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGSTLEY 180
Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
YNC+KSGGQIPIAWIN FKKALFSCKDGKTYGRFIGM LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSCKDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240
Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
MSTEHPCDLAYEFGDGLFD HEYPEGFPA VKHRYPFNDQLVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDFHEYPEGFPASVKHRYPFNDQLVVYVRFVGPGVLVGQAWQE 300
Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
GK+LEQVPRKLCSEILMIKDYSP PL EKQ
Sbjct: 301 GKSLEQVPRKLCSEILMIKDYSPRPLPEKQ 329
BLAST of Tan0021039 vs. ExPASy TrEMBL
Match:
A0A6J1HJY4 (uncharacterized protein LOC111464245 OS=Cucurbita moschata OX=3662 GN=LOC111464245 PE=4 SV=1)
HSP 1 Score: 611.3 bits (1575), Expect = 2.4e-171
Identity = 292/330 (88.48%), Postives = 302/330 (91.52%), Query Frame = 0
Query: 1 MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
MKAMAAS+RPV IS TCF PFSRQINTF RP RRLC+TNSKGL W+G GVCV P FEV
Sbjct: 1 MKAMAASSRPVQISHGTCFLPFSRQINTFCRPPRRLCYTNSKGLKWFGYGVCVSPPSFEV 60
Query: 61 AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
AAVAG RERAQVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61 AAVAGRRERAQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120
Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D T LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGTTLEY 180
Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
YNC+KSGGQIPIAWIN FKKALFSC DGKTYGRFIGM LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSCNDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240
Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
MSTEHPCDLAYEFGDGLFD HEYPEGFPA VKHRYPFND LVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDFHEYPEGFPASVKHRYPFNDHLVVYVRFVGPGVLVGQAWQE 300
Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
GKALEQVPRKLCSEILMIKDYSP PLQEKQ
Sbjct: 301 GKALEQVPRKLCSEILMIKDYSPRPLQEKQ 329
BLAST of Tan0021039 vs. ExPASy TrEMBL
Match:
A0A0A0KCA7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447880 PE=4 SV=1)
HSP 1 Score: 569.7 bits (1467), Expect = 7.9e-159
Identity = 271/327 (82.87%), Postives = 284/327 (86.85%), Query Frame = 0
Query: 5 AASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPGFEVAA 64
A SNRP+ ISP TC F RQINTF RRL FTN KGL GWY CGVCVR PG VAA
Sbjct: 3 ATSNRPLQISPWTCCSSFPRQINTFNTQHRRLSFTNFKGLGIGWYSCGVCVRSPGCVVAA 62
Query: 65 VAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAYLW 124
AGGRER QVS+ WDEEPYELLPNG+IQY+DEQDV +FLDPPKELIP DP SYNPAAYLW
Sbjct: 63 AAGGREREQVSSVWDEEPYELLPNGRIQYIDEQDVASFLDPPKELIPFDPDSYNPAAYLW 122
Query: 125 KKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEYYN 184
KKIE+IPEERRHRLLHLL+PRCISRAWGIAGTRYEDPKLVKK ASSLLQNED VLEYYN
Sbjct: 123 KKIEEIPEERRHRLLHLLTPRCISRAWGIAGTRYEDPKLVKKTASSLLQNEDGMVLEYYN 182
Query: 185 CLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEVMS 244
CLKSGGQIPI WIN FKKA+FS KDGK YGR I M LLAGFAN SPLYFE+ QLKEVMS
Sbjct: 183 CLKSGGQIPIGWINRFKKAIFSSKDGKIYGRIINMPLLAGFANSVSPLYFEMKQLKEVMS 242
Query: 245 TEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQEGK 304
TEHPCDLAYEFGDGLFDIHEYPEGFPAP KH YPFNDQ+VVYVRYLGPGVLVGQAWQEGK
Sbjct: 243 TEHPCDLAYEFGDGLFDIHEYPEGFPAPAKHLYPFNDQVVVYVRYLGPGVLVGQAWQEGK 302
Query: 305 ALEQVPRKLCSEILMIKDYSPCPLQEK 330
ALEQVP+KLC EILMIKDYS PLQ++
Sbjct: 303 ALEQVPQKLCGEILMIKDYSQQPLQKQ 329
BLAST of Tan0021039 vs. ExPASy TrEMBL
Match:
A0A1S3CGG5 (uncharacterized protein LOC103500630 OS=Cucumis melo OX=3656 GN=LOC103500630 PE=4 SV=1)
HSP 1 Score: 564.3 bits (1453), Expect = 3.3e-157
Identity = 269/325 (82.77%), Postives = 285/325 (87.69%), Query Frame = 0
Query: 1 MKAMAA-SNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPG 60
MKAMAA S+RPV ISP TC F RQINTF RRL FTN KGL GWY CGVCVR PG
Sbjct: 1 MKAMAATSSRPVQISPWTCCSSFPRQINTFNTQHRRLSFTNFKGLGIGWYSCGVCVRSPG 60
Query: 61 FEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNP 120
VAAVAG +ER + S+ WDE+PYELLPNG+IQY+DE DV +FLDPPKELIPLDPASYNP
Sbjct: 61 CVVAAVAGRKEREEASSVWDEKPYELLPNGRIQYIDELDVASFLDPPKELIPLDPASYNP 120
Query: 121 AAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV 180
AAYLWKKIE IPEERRHRLLHLL+PRCISRAWGIAG+RYEDPKLVKKMASSLLQNED V
Sbjct: 121 AAYLWKKIEAIPEERRHRLLHLLTPRCISRAWGIAGSRYEDPKLVKKMASSLLQNEDGMV 180
Query: 181 LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQL 240
LEYYNCLKSGGQIPI WIN FKKA+FSCKDGK YGR I M LLAGFAN FSPLYFEV QL
Sbjct: 181 LEYYNCLKSGGQIPIGWINRFKKAIFSCKDGKIYGRIINMPLLAGFANSFSPLYFEVKQL 240
Query: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQA 300
KEVMSTEHPCDLA++FGDGLFDIHEYPEGFP P KH YPFNDQ+VVYVRYLGPGVLVGQA
Sbjct: 241 KEVMSTEHPCDLAFDFGDGLFDIHEYPEGFPVPAKHLYPFNDQVVVYVRYLGPGVLVGQA 300
Query: 301 WQEGKALEQVPRKLCSEILMIKDYS 323
WQEGKALEQVP+KLC EILMIKDY+
Sbjct: 301 WQEGKALEQVPQKLCGEILMIKDYN 325
BLAST of Tan0021039 vs. ExPASy TrEMBL
Match:
A0A5D3C0T8 (WD repeat-containing protein 49 isoform 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G001770 PE=4 SV=1)
HSP 1 Score: 560.1 bits (1442), Expect = 6.3e-156
Identity = 270/332 (81.33%), Postives = 287/332 (86.45%), Query Frame = 0
Query: 1 MKAMAA-SNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPG 60
MKAMAA S+RPV ISP TC F RQINTF RRL FTN KGL GWY CGVCVR PG
Sbjct: 1 MKAMAATSSRPVQISPWTCCSSFPRQINTFNTQHRRLSFTNFKGLGIGWYSCGVCVRSPG 60
Query: 61 FEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNP 120
V AVAG +ER + S+ WDE+PYELLPNG+IQY+DE DV +FLDPPKELIPLDPASYNP
Sbjct: 61 CVVTAVAGRKEREEASSVWDEKPYELLPNGRIQYIDELDVASFLDPPKELIPLDPASYNP 120
Query: 121 AAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV 180
AAYLWKKIE IPEERRHRLLHLL+PRCISRAWGIAG+RYEDPKLVKKMASSLLQNED V
Sbjct: 121 AAYLWKKIEAIPEERRHRLLHLLTPRCISRAWGIAGSRYEDPKLVKKMASSLLQNEDGMV 180
Query: 181 LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQL 240
LEYYNCLKSGGQIPI WIN FKKA+FSCKDGK YGR I M LLAGFAN SPLYFEV QL
Sbjct: 181 LEYYNCLKSGGQIPIGWINRFKKAIFSCKDGKIYGRIINMPLLAGFANSVSPLYFEVKQL 240
Query: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQA 300
KEVMSTEHPCDLA++FGDGLFDIHEYPEGFP P KH YPFNDQ+VVYVRYLGPGVLVGQA
Sbjct: 241 KEVMSTEHPCDLAFDFGDGLFDIHEYPEGFPVPAKHLYPFNDQVVVYVRYLGPGVLVGQA 300
Query: 301 WQEGKALEQVPRKLCSEILMIKDYS-PCPLQE 329
WQEGKALEQVP+KLC EILMIKDY+ PLQ+
Sbjct: 301 WQEGKALEQVPQKLCGEILMIKDYNQQHPLQK 332
BLAST of Tan0021039 vs. TAIR 10
Match:
AT1G73090.1 (unknown protein; Has 28 Blast hits to 28 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 28; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 360.5 bits (924), Expect = 1.4e-99
Identity = 169/269 (62.83%), Postives = 203/269 (75.46%), Query Frame = 0
Query: 54 RLPGFEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPA 113
RL V AV GGR + WDE+PYE LP GK Y+DE DVVTFLDPPKELIPLDPA
Sbjct: 40 RLSRCVVMAVQGGR---GYESPWDEKPYETLPTGKRVYVDESDVVTFLDPPKELIPLDPA 99
Query: 114 SYNPAAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNE 173
SYNPAAYLWKKIEDIPEERRH LL LL PR ISRAW IA TRYEDPKL K AS +
Sbjct: 100 SYNPAAYLWKKIEDIPEERRHHLLQLLEPRLISRAWEIASTRYEDPKLAKMTASKIFSAG 159
Query: 174 DSTV-LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYF 233
++ + +EY++C S G + I+WIN FK ALF +G+ YGR G +++ AN FSPLYF
Sbjct: 160 NAEIPVEYFSCRTSQGPLIISWINFFKMALFRSYNGQIYGRVCGGPVVSTLANAFSPLYF 219
Query: 234 EVTQLKEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGV 293
EVT+ EVM+TE PCD+A +FGDGL I +YP+GFP P KH YPFND +V+Y+R++GPGV
Sbjct: 220 EVTEAMEVMATEEPCDVACKFGDGLLAIEDYPQGFPRPAKHPYPFNDSVVIYIRHIGPGV 279
Query: 294 LVGQAWQEGKALEQVPRKLCSEILMIKDY 322
VGQAWQEG+ L+QVP++LCS+ILM+K Y
Sbjct: 280 CVGQAWQEGRELQQVPQRLCSDILMVKQY 305
BLAST of Tan0021039 vs. TAIR 10
Match:
AT1G73090.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages. )
HSP 1 Score: 345.5 bits (885), Expect = 4.7e-95
Identity = 169/297 (56.90%), Postives = 203/297 (68.35%), Query Frame = 0
Query: 54 RLPGFEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPA 113
RL V AV GGR + WDE+PYE LP GK Y+DE DVVTFLDPPKELIPLDPA
Sbjct: 40 RLSRCVVMAVQGGR---GYESPWDEKPYETLPTGKRVYVDESDVVTFLDPPKELIPLDPA 99
Query: 114 SYNPAAYLWKKIEDIPEERRHRLLHLLSP----------------------------RCI 173
SYNPAAYLWKKIEDIPEERRH LL LL P R I
Sbjct: 100 SYNPAAYLWKKIEDIPEERRHHLLQLLEPRLDSYAKEVDMEKELRACTVFFMRVYVCRLI 159
Query: 174 SRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV-LEYYNCLKSGGQIPIAWINLFKKALFS 233
SRAW IA TRYEDPKL K AS + ++ + +EY++C S G + I+WIN FK ALF
Sbjct: 160 SRAWEIASTRYEDPKLAKMTASKIFSAGNAEIPVEYFSCRTSQGPLIISWINFFKMALFR 219
Query: 234 CKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEVMSTEHPCDLAYEFGDGLFDIHEYP 293
+G+ YGR G +++ AN FSPLYFEVT+ EVM+TE PCD+A +FGDGL I +YP
Sbjct: 220 SYNGQIYGRVCGGPVVSTLANAFSPLYFEVTEAMEVMATEEPCDVACKFGDGLLAIEDYP 279
Query: 294 EGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQEGKALEQVPRKLCSEILMIKDY 322
+GFP P KH YPFND +V+Y+R++GPGV VGQAWQEG+ L+QVP++LCS+ILM+K Y
Sbjct: 280 QGFPRPAKHPYPFNDSVVIYIRHIGPGVCVGQAWQEGRELQQVPQRLCSDILMVKQY 333
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023000285.1 | 4.0e-173 | 89.09 | uncharacterized protein LOC111494560 [Cucurbita maxima] | [more] |
XP_023515334.1 | 9.0e-173 | 89.09 | uncharacterized protein LOC111779395 [Cucurbita pepo subsp. pepo] | [more] |
XP_022964125.1 | 4.9e-171 | 88.48 | uncharacterized protein LOC111464245 [Cucurbita moschata] | [more] |
XP_038899771.1 | 4.3e-167 | 86.45 | uncharacterized protein LOC120087001 [Benincasa hispida] | [more] |
XP_004141740.2 | 6.7e-160 | 83.13 | uncharacterized protein LOC101213828 [Cucumis sativus] >KAE8646483.1 hypothetica... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1KHW6 | 1.9e-173 | 89.09 | uncharacterized protein LOC111494560 OS=Cucurbita maxima OX=3661 GN=LOC111494560... | [more] |
A0A6J1HJY4 | 2.4e-171 | 88.48 | uncharacterized protein LOC111464245 OS=Cucurbita moschata OX=3662 GN=LOC1114642... | [more] |
A0A0A0KCA7 | 7.9e-159 | 82.87 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447880 PE=4 SV=1 | [more] |
A0A1S3CGG5 | 3.3e-157 | 82.77 | uncharacterized protein LOC103500630 OS=Cucumis melo OX=3656 GN=LOC103500630 PE=... | [more] |
A0A5D3C0T8 | 6.3e-156 | 81.33 | WD repeat-containing protein 49 isoform 2 OS=Cucumis melo var. makuwa OX=1194695... | [more] |
Match Name | E-value | Identity | Description | |
AT1G73090.1 | 1.4e-99 | 62.83 | unknown protein; Has 28 Blast hits to 28 proteins in 12 species: Archae - 0; Bac... | [more] |
AT1G73090.2 | 4.7e-95 | 56.90 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |