Tan0021039 (gene) Snake gourd v1

Overview
Name: Tan0021039
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: WD repeat-containing protein 49 isoform 2
Location: LG06: 626966 .. 630083 (+)
RNA-Seq Expression: Tan0021039
Synteny: Tan0021039
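
The Location field can be read programmatically to obtain the genomic span of the feature. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Pattern for strings such as "LG06: 626966 .. 630083 (+)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (sequence id, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("LG06: 626966 .. 630083 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 3118 bp on the forward strand of LG06.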
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGGCAATGGCAGCCTCCAACCGTCCCGTACACATTTCCCCAAGAACCTGCTTCTTCCCATTTTCCCGCCAAATCAACACCTTCTACCGACCGCGGCGGCGTCTCTGCTTCACGAACTCCAAAGGTCAGTTCATCTCCTTCTCCTTCGTCGCATAAACATTATCGCTTGATCCCATCACTTCCTTAGCCCTCGTATTCTCCGTCAATCAAACCGGAAAATTCATTGGAATGAACTCAGAGTTTTTCTTCCAATTAACATTTATCTCTCTCGACTATGTTTCAGAAATTTACGTGTTCAATTTCTTAATTTCTGCCTCTTCTCGTCTCGTAGGGCTGGGATGGTACGGATGTGGAGTTTGTGTACGGTTACCTGGTTTTGAGGTCGCTGCCGTAGCCGGAGGAAGGGAGAGAGCGCAAGTATCTGCTGCGTGGGATGAGGAGCCTTATGAATTGCTTCCGAACGGGAAAATACAGTACCTGGACGAACAAGATGTAGTAACCTTTCTAGATCCGCCCAAGGAGCTCATACCATTAGACCCTGCTTCTTATAATCCAGCTGCGTATCTTTGGTACGTAGTAATTTATTTCTGAGCTTGTAGCCTTCTTAATCGACTTCCGCTAATCTTATTCAATCTCATGTGCTTGTTTCATATCATCATAAATTAACTACTACCATTACAAAATTGTAACGGTCAGTGGTTTAACATGCTTTATATTTTCCAATTTGACATAGTTTCTAACATTACCAACTGGTTGGCAAAATATTTTTGGTTTTTTCTACTGTATTGTTGCTGATTCTCATGTAGTTTACTCGCAATTCAGTTTGAAATTTTGTCTTTTGATTGACAGGAAGAAGATTGAAGATATTCCTGAGGAACGACGTCATAGACTTTTGCACCTCCTAAGCCCTAGGTTACATATAGGAAGCCTTGTAATACCTATATAGTTGTTTATGCTCTTTCTTATTCTTCATTATGCAATTTCCATTCGCTCGAACACTCTTCTTAATTTTGTCGGAACTATTTTAGCAACAGAATTCTACATTTTAAGTTTTGTTAAGAATTCGGAAATTCTGACACCAGCAATATAATATAGGTGTATATCAAGGGCTTGGGGAATAGCTGGCACACGATATGAGGATCCAAAGTTGGTTAAGAAAATGGCATCTAGTTTGCTGCAAAATGAAGACAGTACGGTGCTTGAATATTATAACTGCCTAAAGAGTGGAGGTAAGCAGTTCACTACATTATTTTCCCTATCGTATGGAAGAATTCAAATTTTGTTTCCTAATTTCTAAAATTCCTTCACTGTATAAGGAAAATGATTCGATAGAGGACAAGAAAGGGTGAACTTTGTTATTGGGTGGTGATTTTTTGGAACATTTAAAGTTTAAACATTCAAAGTTTTATTTAATATTTTCCGCAAAATTATAATAATGTTATGATAGCCTAACAACCTTGATTTTTAATATCATGAATAGGACAAATACCTATTGCTTGGATTAATCTTTTCAAGAAGGTGAAAATTATCAAGTCCTTGTCCCCACTTTCTTCTTTCAAGTGTACAGTACAATATAATCTATCTATTTACGTTTCCTTGTTTATTAGGCATTATTTTCTTGCAAAGATGGAAAGACTTATGGGCGGTTTATTGGTAAGAGCTCTTAACATTTTTCAACTATACATTCTTCTCCTCTTATGACGTTAGTAACTTTGTAAAACTACAGATTTTATGGCACTAAAACAGTACAGAAGCTAGCTGATCCATATCCCATTGACCTATAAGTTTACTACATCCTCATTTTCTTAATTCATGGATATGGATTGGACACCAACACGACTATTTCGACGTATCACTATGTTGATAGCACTTTAAATTATTTGCAGTTGCACAATCACAAATGATGATCAAAATCCAATTGTCACAACCTCATTCCTCTGTCTGTTTGATGGAGTTTTTGTAAAATGAAAATGGATCATTTGATGATCACCAATCCA
CTAAAATTTCCTTCTGCTAACATTATCAGGCATGTCCTTACTGGCTGGATTTGCAAATTGCTTTAGTCCATTGTACTTTGAGGTGACACAACTTAAGGAGGTAATGTCAACCGAACATCCTTGTGACTTGGCATATGAATTTGGAGATGGGCTTTTTGATATTCATGAATACCCTGAAGGCTTTCCGGCTCCAGGTATGATATTGAAATCATTTTTTTGTCTTTCTTCACATTGATTAAATCTCATCAAATGTGTTGGTTCTTCCATGATACAATTTCAACTCAAATCTCATGTTATTTATGCAAGTTAATTGCGACCAGGATATTTTTCATGAAAGAAATGACAACGAACAAGAACTGTTTGCTCTGCATGCACCTATTTTCTCAGGGGTAGGATGTTGCAAGGTCTGGTCCCCTCCTTAGCTTTGCAACATAAGCTCCACTATGTATATGAAGTCTTTAGTGTCTTTCTTGAAGGAGACTTCTAAGTAACTATGTACCATTAGGAACAAAATTAGAGACGTGAGAGATGGTTTGCATATTTGGAGATACACTCATGAATTTGCGATGGTGAAGAACATCCTTTACATGAGTTTTTAAATAACCTTGTTAAGTGCATTTCATATTCAATTGAAATGATAGCTCTCATGCAGCATGCTTCTCCTTGGTTTTCCAAGTTTTCAATACTTTATTTTAGCCCCCTCGCTTTCAAATCTACCAAAGTATTATATAATAAGTACGTACTTTAGAGGTAACTTCACCTCCATTTTCCCTCTTTTTAGCCTCTTGATCTATACCCGTTAGAAAGAACCTTCAACCTCATATATTTCTTCATCAGAAACTGATGCAAGTTTCACTCGATTAAATATTAATTATTTGTATAGCTTCCAAATTTATAATATTTAAGAATTTGTGTGTTATTCTTTTCATTTTGCAGTCAAGCATCGATATCCTTTCAACGATCAGCTTGTAGTATATGTTCGATATCTAGGACCTGGAGTGTTAGTTGGCCAGGCATGGCAAGAAGGAAAAGCTTTGGAGCAGGTGCCACGTAAATTATGTTCTGAAATCTTGATGATCAAAGACTACAGTCCATGTCCACTCCAGGAAAAGCAATAG

mRNA sequence

ATGAAGGCAATGGCAGCCTCCAACCGTCCCGTACACATTTCCCCAAGAACCTGCTTCTTCCCATTTTCCCGCCAAATCAACACCTTCTACCGACCGCGGCGGCGTCTCTGCTTCACGAACTCCAAAGGGCTGGGATGGTACGGATGTGGAGTTTGTGTACGGTTACCTGGTTTTGAGGTCGCTGCCGTAGCCGGAGGAAGGGAGAGAGCGCAAGTATCTGCTGCGTGGGATGAGGAGCCTTATGAATTGCTTCCGAACGGGAAAATACAGTACCTGGACGAACAAGATGTAGTAACCTTTCTAGATCCGCCCAAGGAGCTCATACCATTAGACCCTGCTTCTTATAATCCAGCTGCGTATCTTTGGAAGAAGATTGAAGATATTCCTGAGGAACGACGTCATAGACTTTTGCACCTCCTAAGCCCTAGGTGTATATCAAGGGCTTGGGGAATAGCTGGCACACGATATGAGGATCCAAAGTTGGTTAAGAAAATGGCATCTAGTTTGCTGCAAAATGAAGACAGTACGGTGCTTGAATATTATAACTGCCTAAAGAGTGGAGGACAAATACCTATTGCTTGGATTAATCTTTTCAAGAAGGCATTATTTTCTTGCAAAGATGGAAAGACTTATGGGCGGTTTATTGGCATGTCCTTACTGGCTGGATTTGCAAATTGCTTTAGTCCATTGTACTTTGAGGTGACACAACTTAAGGAGGTAATGTCAACCGAACATCCTTGTGACTTGGCATATGAATTTGGAGATGGGCTTTTTGATATTCATGAATACCCTGAAGGCTTTCCGGCTCCAGTCAAGCATCGATATCCTTTCAACGATCAGCTTGTAGTATATGTTCGATATCTAGGACCTGGAGTGTTAGTTGGCCAGGCATGGCAAGAAGGAAAAGCTTTGGAGCAGGTGCCACGTAAATTATGTTCTGAAATCTTGATGATCAAAGACTACAGTCCATGTCCACTCCAGGAAAAGCAATAG

Coding sequence (CDS)

ATGAAGGCAATGGCAGCCTCCAACCGTCCCGTACACATTTCCCCAAGAACCTGCTTCTTCCCATTTTCCCGCCAAATCAACACCTTCTACCGACCGCGGCGGCGTCTCTGCTTCACGAACTCCAAAGGGCTGGGATGGTACGGATGTGGAGTTTGTGTACGGTTACCTGGTTTTGAGGTCGCTGCCGTAGCCGGAGGAAGGGAGAGAGCGCAAGTATCTGCTGCGTGGGATGAGGAGCCTTATGAATTGCTTCCGAACGGGAAAATACAGTACCTGGACGAACAAGATGTAGTAACCTTTCTAGATCCGCCCAAGGAGCTCATACCATTAGACCCTGCTTCTTATAATCCAGCTGCGTATCTTTGGAAGAAGATTGAAGATATTCCTGAGGAACGACGTCATAGACTTTTGCACCTCCTAAGCCCTAGGTGTATATCAAGGGCTTGGGGAATAGCTGGCACACGATATGAGGATCCAAAGTTGGTTAAGAAAATGGCATCTAGTTTGCTGCAAAATGAAGACAGTACGGTGCTTGAATATTATAACTGCCTAAAGAGTGGAGGACAAATACCTATTGCTTGGATTAATCTTTTCAAGAAGGCATTATTTTCTTGCAAAGATGGAAAGACTTATGGGCGGTTTATTGGCATGTCCTTACTGGCTGGATTTGCAAATTGCTTTAGTCCATTGTACTTTGAGGTGACACAACTTAAGGAGGTAATGTCAACCGAACATCCTTGTGACTTGGCATATGAATTTGGAGATGGGCTTTTTGATATTCATGAATACCCTGAAGGCTTTCCGGCTCCAGTCAAGCATCGATATCCTTTCAACGATCAGCTTGTAGTATATGTTCGATATCTAGGACCTGGAGTGTTAGTTGGCCAGGCATGGCAAGAAGGAAAAGCTTTGGAGCAGGTGCCACGTAAATTATGTTCTGAAATCTTGATGATCAAAGACTACAGTCCATGTCCACTCCAGGAAAAGCAATAG

Protein sequence

MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQEGKALEQVPRKLCSEILMIKDYSPCPLQEKQ
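
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGAAGGCAATGGCAGCCTCCAACCGTCCC"))
print(translate("GAAAAGCAATAG"))
```

The two fragments translate to the start (MKAMAASNRP) and the end (EKQ, followed by the TAG stop codon) of the protein sequence shown above.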
Homology
BLAST of Tan0021039 vs. NCBI nr
Match: XP_023000285.1 (uncharacterized protein LOC111494560 [Cucurbita maxima])

HSP 1 Score: 618.2 bits (1593), Expect = 4.0e-173
Identity = 294/330 (89.09%), Positives = 306/330 (92.73%), Query Frame = 0

Query: 1   MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
           MKAMAAS+RPV IS  TCFFPFSRQINTF RPRRRLC+TNSKGL W+G GVCV  P FEV
Sbjct: 1   MKAMAASSRPVQISHGTCFFPFSRQINTFCRPRRRLCYTNSKGLKWFGYGVCVSPPSFEV 60

Query: 61  AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
           AAVAGGRERAQVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61  AAVAGGRERAQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120

Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
           LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D + LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGSTLEY 180

Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
           YNC+KSGGQIPIAWIN FKKALFSCKDGKTYGRFIGM  LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSCKDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240

Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
           MSTEHPCDLAYEFGDGLFD HEYPEGFPA VKHRYPFNDQLVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDFHEYPEGFPASVKHRYPFNDQLVVYVRFVGPGVLVGQAWQE 300

Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
           GK+LEQVPRKLCSEILMIKDYSP PL EKQ
Sbjct: 301 GKSLEQVPRKLCSEILMIKDYSPRPLPEKQ 329
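
The Identity and Positives percentages in each HSP header are simple ratios of match counts to alignment length. The following is a minimal sketch of that arithmetic; `percent` and `count_identities` are hypothetical helper names, and the example strings are the last Query/Sbjct row of the alignment above.

```python
def percent(matches, aligned_length):
    """Percentage as reported in the HSP header, rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)


def count_identities(query, sbjct):
    """Count alignment columns where both rows carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))


# Header values of the HSP above: 294 identities and 306 positives over 330 columns.
print(percent(294, 330), percent(306, 330))

# Last alignment row (query positions 301 onward).
query_row = "GKALEQVPRKLCSEILMIKDYSPCPLQEKQ"
sbjct_row = "GKSLEQVPRKLCSEILMIKDYSPRPLPEKQ"
print(count_identities(query_row, sbjct_row), "of", len(query_row))
```

Positives additionally count columns marked `+` on the middle line, which are conservative substitutions under the scoring matrix; reproducing that count would require the matrix itself, which this page does not name.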

BLAST of Tan0021039 vs. NCBI nr
Match: XP_023515334.1 (uncharacterized protein LOC111779395 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 617.1 bits (1590), Expect = 9.0e-173
Identity = 294/330 (89.09%), Positives = 306/330 (92.73%), Query Frame = 0

Query: 1   MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
           MKAMAAS+RPV IS  TCFFPFSRQINTF RPRRRLC+TNSKGL W+G GVCV  P FEV
Sbjct: 1   MKAMAASSRPVQISHGTCFFPFSRQINTFCRPRRRLCYTNSKGLKWFGYGVCVSPPSFEV 60

Query: 61  AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
           AAVAGGRER QVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61  AAVAGGRERPQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120

Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
           LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D T+LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGTMLEY 180

Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
           YNC+KSGGQIPIAWIN FKKALFS  DGKTYGRFIGM  LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSSNDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240

Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
           MSTEHPCDLAYEFGDGLFD+HEYPEGFPA VKHRYPFNDQLVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDVHEYPEGFPASVKHRYPFNDQLVVYVRFVGPGVLVGQAWQE 300

Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
           GKALEQVPRKLCSEILMIKDYSP PLQEKQ
Sbjct: 301 GKALEQVPRKLCSEILMIKDYSPRPLQEKQ 329

BLAST of Tan0021039 vs. NCBI nr
Match: XP_022964125.1 (uncharacterized protein LOC111464245 [Cucurbita moschata])

HSP 1 Score: 611.3 bits (1575), Expect = 4.9e-171
Identity = 292/330 (88.48%), Positives = 302/330 (91.52%), Query Frame = 0

Query: 1   MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
           MKAMAAS+RPV IS  TCF PFSRQINTF RP RRLC+TNSKGL W+G GVCV  P FEV
Sbjct: 1   MKAMAASSRPVQISHGTCFLPFSRQINTFCRPPRRLCYTNSKGLKWFGYGVCVSPPSFEV 60

Query: 61  AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
           AAVAG RERAQVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61  AAVAGRRERAQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120

Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
           LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D T LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGTTLEY 180

Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
           YNC+KSGGQIPIAWIN FKKALFSC DGKTYGRFIGM  LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSCNDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240

Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
           MSTEHPCDLAYEFGDGLFD HEYPEGFPA VKHRYPFND LVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDFHEYPEGFPASVKHRYPFNDHLVVYVRFVGPGVLVGQAWQE 300

Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
           GKALEQVPRKLCSEILMIKDYSP PLQEKQ
Sbjct: 301 GKALEQVPRKLCSEILMIKDYSPRPLQEKQ 329

BLAST of Tan0021039 vs. NCBI nr
Match: XP_038899771.1 (uncharacterized protein LOC120087001 [Benincasa hispida])

HSP 1 Score: 598.2 bits (1541), Expect = 4.3e-167
Identity = 287/332 (86.45%), Positives = 298/332 (89.76%), Query Frame = 0

Query: 1   MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPG- 60
           MKAMAAS+  V ISP TC  PF  QINTF  PRRRLCFTNSKGL  GWYGCG+CVR PG 
Sbjct: 4   MKAMAASSSSVQISPGTCCIPFPHQINTFNTPRRRLCFTNSKGLGIGWYGCGICVRSPGC 63

Query: 61  FEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNP 120
             VAA+AGGRER Q S+AWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLD ASYNP
Sbjct: 64  VVVAAIAGGREREQASSAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDLASYNP 123

Query: 121 AAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV 180
           AAYLWKKIEDIPEERRHRLL LL+PRCISRAWGIAGTRYEDPKL+KKMASSLLQNED  V
Sbjct: 124 AAYLWKKIEDIPEERRHRLLQLLTPRCISRAWGIAGTRYEDPKLLKKMASSLLQNEDGMV 183

Query: 181 LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQL 240
           LEYY CLKSGGQIPI WIN FKKALFSCKDGKTYGR IGMSLLAGFAN  SPLYFEV QL
Sbjct: 184 LEYYYCLKSGGQIPIGWINRFKKALFSCKDGKTYGRIIGMSLLAGFANSVSPLYFEVKQL 243

Query: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQA 300
           KEVMSTEHPCDLAYEFGDGLFDIHEYP GFPAP KH YPFNDQ+VVYVRYLGPGVLVGQA
Sbjct: 244 KEVMSTEHPCDLAYEFGDGLFDIHEYPAGFPAPAKHLYPFNDQVVVYVRYLGPGVLVGQA 303

Query: 301 WQEGKALEQVPRKLCSEILMIKDYSPCPLQEK 330
           WQEGKALEQVPRKLC+EILMIKDYS  P+Q++
Sbjct: 304 WQEGKALEQVPRKLCAEILMIKDYSESPVQKQ 335

BLAST of Tan0021039 vs. NCBI nr
Match: XP_004141740.2 (uncharacterized protein LOC101213828 [Cucumis sativus] >KAE8646483.1 hypothetical protein Csa_016678 [Cucumis sativus])

HSP 1 Score: 574.3 bits (1479), Expect = 6.7e-160
Identity = 276/332 (83.13%), Positives = 289/332 (87.05%), Query Frame = 0

Query: 1   MKAMAA-SNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPG 60
           MKAMAA SNRP+ ISP TC   F RQINTF    RRL FTN KGL  GWY CGVCVR PG
Sbjct: 1   MKAMAATSNRPLQISPWTCCSSFPRQINTFNTQHRRLSFTNFKGLGIGWYSCGVCVRSPG 60

Query: 61  FEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNP 120
             VAA AGGRER QVS+ WDEEPYELLPNG+IQY+DEQDV +FLDPPKELIP DP SYNP
Sbjct: 61  CVVAAAAGGREREQVSSVWDEEPYELLPNGRIQYIDEQDVASFLDPPKELIPFDPDSYNP 120

Query: 121 AAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV 180
           AAYLWKKIE+IPEERRHRLLHLL+PRCISRAWGIAGTRYEDPKLVKK ASSLLQNED  V
Sbjct: 121 AAYLWKKIEEIPEERRHRLLHLLTPRCISRAWGIAGTRYEDPKLVKKTASSLLQNEDGMV 180

Query: 181 LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQL 240
           LEYYNCLKSGGQIPI WIN FKKA+FS KDGK YGR I M LLAGFAN  SPLYFE+ QL
Sbjct: 181 LEYYNCLKSGGQIPIGWINRFKKAIFSSKDGKIYGRIINMPLLAGFANSVSPLYFEMKQL 240

Query: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQA 300
           KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAP KH YPFNDQ+VVYVRYLGPGVLVGQA
Sbjct: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPAKHLYPFNDQVVVYVRYLGPGVLVGQA 300

Query: 301 WQEGKALEQVPRKLCSEILMIKDYSPCPLQEK 330
           WQEGKALEQVP+KLC EILMIKDYS  PLQ++
Sbjct: 301 WQEGKALEQVPQKLCGEILMIKDYSQQPLQKQ 332

BLAST of Tan0021039 vs. ExPASy TrEMBL
Match: A0A6J1KHW6 (uncharacterized protein LOC111494560 OS=Cucurbita maxima OX=3661 GN=LOC111494560 PE=4 SV=1)

HSP 1 Score: 618.2 bits (1593), Expect = 1.9e-173
Identity = 294/330 (89.09%), Positives = 306/330 (92.73%), Query Frame = 0

Query: 1   MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
           MKAMAAS+RPV IS  TCFFPFSRQINTF RPRRRLC+TNSKGL W+G GVCV  P FEV
Sbjct: 1   MKAMAASSRPVQISHGTCFFPFSRQINTFCRPRRRLCYTNSKGLKWFGYGVCVSPPSFEV 60

Query: 61  AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
           AAVAGGRERAQVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61  AAVAGGRERAQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120

Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
           LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D + LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGSTLEY 180

Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
           YNC+KSGGQIPIAWIN FKKALFSCKDGKTYGRFIGM  LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSCKDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240

Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
           MSTEHPCDLAYEFGDGLFD HEYPEGFPA VKHRYPFNDQLVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDFHEYPEGFPASVKHRYPFNDQLVVYVRFVGPGVLVGQAWQE 300

Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
           GK+LEQVPRKLCSEILMIKDYSP PL EKQ
Sbjct: 301 GKSLEQVPRKLCSEILMIKDYSPRPLPEKQ 329

BLAST of Tan0021039 vs. ExPASy TrEMBL
Match: A0A6J1HJY4 (uncharacterized protein LOC111464245 OS=Cucurbita moschata OX=3662 GN=LOC111464245 PE=4 SV=1)

HSP 1 Score: 611.3 bits (1575), Expect = 2.4e-171
Identity = 292/330 (88.48%), Positives = 302/330 (91.52%), Query Frame = 0

Query: 1   MKAMAASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGLGWYGCGVCVRLPGFEV 60
           MKAMAAS+RPV IS  TCF PFSRQINTF RP RRLC+TNSKGL W+G GVCV  P FEV
Sbjct: 1   MKAMAASSRPVQISHGTCFLPFSRQINTFCRPPRRLCYTNSKGLKWFGYGVCVSPPSFEV 60

Query: 61  AAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAY 120
           AAVAG RERAQVSAAWDE PYELLPNGKIQYLDEQDVVTFLDPPKELIPLDP +YNPAAY
Sbjct: 61  AAVAGRRERAQVSAAWDEGPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPTTYNPAAY 120

Query: 121 LWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEY 180
           LWKKIE IPEERRH LLHLLSPRCISRAWGIAGTRY+DPKLVKKMASSLLQN+D T LEY
Sbjct: 121 LWKKIEVIPEERRHNLLHLLSPRCISRAWGIAGTRYDDPKLVKKMASSLLQNKDGTTLEY 180

Query: 181 YNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEV 240
           YNC+KSGGQIPIAWIN FKKALFSC DGKTYGRFIGM  LAGFAN F+PLYFEV QLKEV
Sbjct: 181 YNCIKSGGQIPIAWINHFKKALFSCNDGKTYGRFIGMG-LAGFANSFNPLYFEVKQLKEV 240

Query: 241 MSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQE 300
           MSTEHPCDLAYEFGDGLFD HEYPEGFPA VKHRYPFND LVVYVR++GPGVLVGQAWQE
Sbjct: 241 MSTEHPCDLAYEFGDGLFDFHEYPEGFPASVKHRYPFNDHLVVYVRFVGPGVLVGQAWQE 300

Query: 301 GKALEQVPRKLCSEILMIKDYSPCPLQEKQ 331
           GKALEQVPRKLCSEILMIKDYSP PLQEKQ
Sbjct: 301 GKALEQVPRKLCSEILMIKDYSPRPLQEKQ 329

BLAST of Tan0021039 vs. ExPASy TrEMBL
Match: A0A0A0KCA7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447880 PE=4 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 7.9e-159
Identity = 271/327 (82.87%), Positives = 284/327 (86.85%), Query Frame = 0

Query: 5   AASNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPGFEVAA 64
           A SNRP+ ISP TC   F RQINTF    RRL FTN KGL  GWY CGVCVR PG  VAA
Sbjct: 3   ATSNRPLQISPWTCCSSFPRQINTFNTQHRRLSFTNFKGLGIGWYSCGVCVRSPGCVVAA 62

Query: 65  VAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNPAAYLW 124
            AGGRER QVS+ WDEEPYELLPNG+IQY+DEQDV +FLDPPKELIP DP SYNPAAYLW
Sbjct: 63  AAGGREREQVSSVWDEEPYELLPNGRIQYIDEQDVASFLDPPKELIPFDPDSYNPAAYLW 122

Query: 125 KKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTVLEYYN 184
           KKIE+IPEERRHRLLHLL+PRCISRAWGIAGTRYEDPKLVKK ASSLLQNED  VLEYYN
Sbjct: 123 KKIEEIPEERRHRLLHLLTPRCISRAWGIAGTRYEDPKLVKKTASSLLQNEDGMVLEYYN 182

Query: 185 CLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEVMS 244
           CLKSGGQIPI WIN FKKA+FS KDGK YGR I M LLAGFAN  SPLYFE+ QLKEVMS
Sbjct: 183 CLKSGGQIPIGWINRFKKAIFSSKDGKIYGRIINMPLLAGFANSVSPLYFEMKQLKEVMS 242

Query: 245 TEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQEGK 304
           TEHPCDLAYEFGDGLFDIHEYPEGFPAP KH YPFNDQ+VVYVRYLGPGVLVGQAWQEGK
Sbjct: 243 TEHPCDLAYEFGDGLFDIHEYPEGFPAPAKHLYPFNDQVVVYVRYLGPGVLVGQAWQEGK 302

Query: 305 ALEQVPRKLCSEILMIKDYSPCPLQEK 330
           ALEQVP+KLC EILMIKDYS  PLQ++
Sbjct: 303 ALEQVPQKLCGEILMIKDYSQQPLQKQ 329

BLAST of Tan0021039 vs. ExPASy TrEMBL
Match: A0A1S3CGG5 (uncharacterized protein LOC103500630 OS=Cucumis melo OX=3656 GN=LOC103500630 PE=4 SV=1)

HSP 1 Score: 564.3 bits (1453), Expect = 3.3e-157
Identity = 269/325 (82.77%), Positives = 285/325 (87.69%), Query Frame = 0

Query: 1   MKAMAA-SNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPG 60
           MKAMAA S+RPV ISP TC   F RQINTF    RRL FTN KGL  GWY CGVCVR PG
Sbjct: 1   MKAMAATSSRPVQISPWTCCSSFPRQINTFNTQHRRLSFTNFKGLGIGWYSCGVCVRSPG 60

Query: 61  FEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNP 120
             VAAVAG +ER + S+ WDE+PYELLPNG+IQY+DE DV +FLDPPKELIPLDPASYNP
Sbjct: 61  CVVAAVAGRKEREEASSVWDEKPYELLPNGRIQYIDELDVASFLDPPKELIPLDPASYNP 120

Query: 121 AAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV 180
           AAYLWKKIE IPEERRHRLLHLL+PRCISRAWGIAG+RYEDPKLVKKMASSLLQNED  V
Sbjct: 121 AAYLWKKIEAIPEERRHRLLHLLTPRCISRAWGIAGSRYEDPKLVKKMASSLLQNEDGMV 180

Query: 181 LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQL 240
           LEYYNCLKSGGQIPI WIN FKKA+FSCKDGK YGR I M LLAGFAN FSPLYFEV QL
Sbjct: 181 LEYYNCLKSGGQIPIGWINRFKKAIFSCKDGKIYGRIINMPLLAGFANSFSPLYFEVKQL 240

Query: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQA 300
           KEVMSTEHPCDLA++FGDGLFDIHEYPEGFP P KH YPFNDQ+VVYVRYLGPGVLVGQA
Sbjct: 241 KEVMSTEHPCDLAFDFGDGLFDIHEYPEGFPVPAKHLYPFNDQVVVYVRYLGPGVLVGQA 300

Query: 301 WQEGKALEQVPRKLCSEILMIKDYS 323
           WQEGKALEQVP+KLC EILMIKDY+
Sbjct: 301 WQEGKALEQVPQKLCGEILMIKDYN 325

BLAST of Tan0021039 vs. ExPASy TrEMBL
Match: A0A5D3C0T8 (WD repeat-containing protein 49 isoform 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G001770 PE=4 SV=1)

HSP 1 Score: 560.1 bits (1442), Expect = 6.3e-156
Identity = 270/332 (81.33%), Positives = 287/332 (86.45%), Query Frame = 0

Query: 1   MKAMAA-SNRPVHISPRTCFFPFSRQINTFYRPRRRLCFTNSKGL--GWYGCGVCVRLPG 60
           MKAMAA S+RPV ISP TC   F RQINTF    RRL FTN KGL  GWY CGVCVR PG
Sbjct: 1   MKAMAATSSRPVQISPWTCCSSFPRQINTFNTQHRRLSFTNFKGLGIGWYSCGVCVRSPG 60

Query: 61  FEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPASYNP 120
             V AVAG +ER + S+ WDE+PYELLPNG+IQY+DE DV +FLDPPKELIPLDPASYNP
Sbjct: 61  CVVTAVAGRKEREEASSVWDEKPYELLPNGRIQYIDELDVASFLDPPKELIPLDPASYNP 120

Query: 121 AAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV 180
           AAYLWKKIE IPEERRHRLLHLL+PRCISRAWGIAG+RYEDPKLVKKMASSLLQNED  V
Sbjct: 121 AAYLWKKIEAIPEERRHRLLHLLTPRCISRAWGIAGSRYEDPKLVKKMASSLLQNEDGMV 180

Query: 181 LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQL 240
           LEYYNCLKSGGQIPI WIN FKKA+FSCKDGK YGR I M LLAGFAN  SPLYFEV QL
Sbjct: 181 LEYYNCLKSGGQIPIGWINRFKKAIFSCKDGKIYGRIINMPLLAGFANSVSPLYFEVKQL 240

Query: 241 KEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQA 300
           KEVMSTEHPCDLA++FGDGLFDIHEYPEGFP P KH YPFNDQ+VVYVRYLGPGVLVGQA
Sbjct: 241 KEVMSTEHPCDLAFDFGDGLFDIHEYPEGFPVPAKHLYPFNDQVVVYVRYLGPGVLVGQA 300

Query: 301 WQEGKALEQVPRKLCSEILMIKDYS-PCPLQE 329
           WQEGKALEQVP+KLC EILMIKDY+   PLQ+
Sbjct: 301 WQEGKALEQVPQKLCGEILMIKDYNQQHPLQK 332

BLAST of Tan0021039 vs. TAIR 10
Match: AT1G73090.1 (unknown protein; Has 28 Blast hits to 28 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 28; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 360.5 bits (924), Expect = 1.4e-99
Identity = 169/269 (62.83%), Positives = 203/269 (75.46%), Query Frame = 0

Query: 54  RLPGFEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPA 113
           RL    V AV GGR      + WDE+PYE LP GK  Y+DE DVVTFLDPPKELIPLDPA
Sbjct: 40  RLSRCVVMAVQGGR---GYESPWDEKPYETLPTGKRVYVDESDVVTFLDPPKELIPLDPA 99

Query: 114 SYNPAAYLWKKIEDIPEERRHRLLHLLSPRCISRAWGIAGTRYEDPKLVKKMASSLLQNE 173
           SYNPAAYLWKKIEDIPEERRH LL LL PR ISRAW IA TRYEDPKL K  AS +    
Sbjct: 100 SYNPAAYLWKKIEDIPEERRHHLLQLLEPRLISRAWEIASTRYEDPKLAKMTASKIFSAG 159

Query: 174 DSTV-LEYYNCLKSGGQIPIAWINLFKKALFSCKDGKTYGRFIGMSLLAGFANCFSPLYF 233
           ++ + +EY++C  S G + I+WIN FK ALF   +G+ YGR  G  +++  AN FSPLYF
Sbjct: 160 NAEIPVEYFSCRTSQGPLIISWINFFKMALFRSYNGQIYGRVCGGPVVSTLANAFSPLYF 219

Query: 234 EVTQLKEVMSTEHPCDLAYEFGDGLFDIHEYPEGFPAPVKHRYPFNDQLVVYVRYLGPGV 293
           EVT+  EVM+TE PCD+A +FGDGL  I +YP+GFP P KH YPFND +V+Y+R++GPGV
Sbjct: 220 EVTEAMEVMATEEPCDVACKFGDGLLAIEDYPQGFPRPAKHPYPFNDSVVIYIRHIGPGV 279

Query: 294 LVGQAWQEGKALEQVPRKLCSEILMIKDY 322
            VGQAWQEG+ L+QVP++LCS+ILM+K Y
Sbjct: 280 CVGQAWQEGRELQQVPQRLCSDILMVKQY 305

BLAST of Tan0021039 vs. TAIR 10
Match: AT1G73090.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 12 growth stages. )

HSP 1 Score: 345.5 bits (885), Expect = 4.7e-95
Identity = 169/297 (56.90%), Positives = 203/297 (68.35%), Query Frame = 0

Query: 54  RLPGFEVAAVAGGRERAQVSAAWDEEPYELLPNGKIQYLDEQDVVTFLDPPKELIPLDPA 113
           RL    V AV GGR      + WDE+PYE LP GK  Y+DE DVVTFLDPPKELIPLDPA
Sbjct: 40  RLSRCVVMAVQGGR---GYESPWDEKPYETLPTGKRVYVDESDVVTFLDPPKELIPLDPA 99

Query: 114 SYNPAAYLWKKIEDIPEERRHRLLHLLSP----------------------------RCI 173
           SYNPAAYLWKKIEDIPEERRH LL LL P                            R I
Sbjct: 100 SYNPAAYLWKKIEDIPEERRHHLLQLLEPRLDSYAKEVDMEKELRACTVFFMRVYVCRLI 159

Query: 174 SRAWGIAGTRYEDPKLVKKMASSLLQNEDSTV-LEYYNCLKSGGQIPIAWINLFKKALFS 233
           SRAW IA TRYEDPKL K  AS +    ++ + +EY++C  S G + I+WIN FK ALF 
Sbjct: 160 SRAWEIASTRYEDPKLAKMTASKIFSAGNAEIPVEYFSCRTSQGPLIISWINFFKMALFR 219

Query: 234 CKDGKTYGRFIGMSLLAGFANCFSPLYFEVTQLKEVMSTEHPCDLAYEFGDGLFDIHEYP 293
             +G+ YGR  G  +++  AN FSPLYFEVT+  EVM+TE PCD+A +FGDGL  I +YP
Sbjct: 220 SYNGQIYGRVCGGPVVSTLANAFSPLYFEVTEAMEVMATEEPCDVACKFGDGLLAIEDYP 279

Query: 294 EGFPAPVKHRYPFNDQLVVYVRYLGPGVLVGQAWQEGKALEQVPRKLCSEILMIKDY 322
           +GFP P KH YPFND +V+Y+R++GPGV VGQAWQEG+ L+QVP++LCS+ILM+K Y
Sbjct: 280 QGFPRPAKHPYPFNDSVVIYIRHIGPGVCVGQAWQEGRELQQVPQRLCSDILMVKQY 333

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_023000285.1  4.0e-173  89.09         uncharacterized protein LOC111494560 [Cucurbita maxima]
XP_023515334.1  9.0e-173  89.09         uncharacterized protein LOC111779395 [Cucurbita pepo subsp. pepo]
XP_022964125.1  4.9e-171  88.48         uncharacterized protein LOC111464245 [Cucurbita moschata]
XP_038899771.1  4.3e-167  86.45         uncharacterized protein LOC120087001 [Benincasa hispida]
XP_004141740.2  6.7e-160  83.13         uncharacterized protein LOC101213828 [Cucumis sativus] >KAE8646483.1 hypothetica...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1KHW6      1.9e-173  89.09         uncharacterized protein LOC111494560 OS=Cucurbita maxima OX=3661 GN=LOC111494560...
A0A6J1HJY4      2.4e-171  88.48         uncharacterized protein LOC111464245 OS=Cucurbita moschata OX=3662 GN=LOC1114642...
A0A0A0KCA7      7.9e-159  82.87         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G447880 PE=4 SV=1
A0A1S3CGG5      3.3e-157  82.77         uncharacterized protein LOC103500630 OS=Cucumis melo OX=3656 GN=LOC103500630 PE=...
A0A5D3C0T8      6.3e-156  81.33         WD repeat-containing protein 49 isoform 2 OS=Cucumis melo var. makuwa OX=1194695...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT1G73090.1     1.4e-99   62.83         unknown protein; Has 28 Blast hits to 28 proteins in 12 species: Archae - 0; Bac...
AT1G73090.2     4.7e-95   56.90         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  PANTHER  PTHR37201    WD REPEAT PROTEIN   coord: 60..322

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0021039.1  Tan0021039.1  mRNA