Tan0007868 (gene) Snake gourd v1

Overview
NameTan0007868
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPWWP domain-containing protein
LocationLG11: 6742255 .. 6744756 (-)
RNA-Seq ExpressionTan0007868
SyntenyTan0007868
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCCTCTTCCTCGTCCCCACGCTCAGATATTTACACGGTAACACAAGAGGAATTCAACATTTTTCACACCATCGACCGAACTCTCTTCAGTCGCATGGTGTTCGCACTCGGTCGCGACCCGGACGAGTCGGTTCGAGTCATGGGACTGTGGCTTTGGCTCGAACAGAACGGCGAAGAGTTCAATTTGGTCTACAAAATGCTGTCGTTACCCGACGCGTTGGTTGACGCTCTATCGGACGAGGCCGTCATGTCATTGGCGTGTATAGAAAACGACAAATTCCCCTTCGAACCGGAGTCATCGGTCGACATTCCCCTCATTCAACACGTTTCTAAAACCCCGGTTTCACTCCGGTTCTTCCACGAGAACCGGCTCAGAATCCTCCGTGCCGTTACCATCATTTGTTCCGATATTTGCCACCGGGCCTTTAAGGACATTCTCCACGCGCTTGAAACTCAGCGAGTTCTGTCACGCGCCGCTGTGGCCGTACCCGCGGTCCACGGCGGCGCCCGTGGAAGCTACTTTGCCACGGCTCCGGCGAGTCAGTTTGTTGTTCCGGCTTTTGGGTTCGTGAATCTCAGGGGAGAGAGTCCGCCGGTGTCGGCGACGGCGACGGCGACGGCGAGTCAGAGTGTCAGAGGGGAAAACCGGCAAGTGATTTCGAAGAGTGAAATGAGTGATTTGTTCAGTCGTTTACAGTTGAAGAGCGGTAAGGAAGAAGAAGGGGAGGCGGTTCCGCCAGACGAGAGGACGATTTTTCTGACGTTCTCGAAAGGATACCCAATTTCTGAAGAGGAAGTTAGAGACTATTTTGGCAGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTACTCTTCCCAATATTTCTCTATATTCCTCTGCATGTCGAAATTTTTTTCAATATTTCTAGATTCTAGTGCGGGGTCAAAATTTTCTTTATGAGGGGAATATTTTAAAATCTTAAAATTTTTTCTATATATAATATATATATTTCAATTTAAAACGAGAAATTGAGGCGATTCTTCCTTTTTTACTTCCCTTCAATTTAAAAGGAGATAATCTACGGTGGAGCAAATAAAGGAGGAAGACAATTGCCTTATATATAAAAAAAAAAGAAAGATGAGGTTTTAAAATTTCTCTCTAATTTGACTTTTAAATTTTCAATTGAAAATTTTCATATATAGAGAATTGTTTTTTTTAGTGTACCATTCTATAGATAGACAGTAGATTTTTTGCTTTTCTAGAAATATCTCTAACTCATTTTGGTTTGAAAATTAATGATCTATTTCTTTTGTTATCCCATTTTTTTATCTATATTTTCAAAATCATGCCTAAATCCTAAAAAAAATATTTTTATTTTTAGAATTTGATTAAAAATTTAAATATGGTCTTAGAAAAAATCTCCCGTCATTGAAAGCATGGAAACAAGCTCAATTTTCAAAAATTAAAAATAAAAATTCTAGTCATTATCAAATAAGACCTCATCTTTCCATTCATATTTTTACCTTTGAGGAAGGAAATGGTAATAGCTTTTTAATTTGTTTAAAATTAAAATTTTAGGTGTTTAAACTTTTAAAGTTGTGTGTTTAATAAAACTTTGTTAAATAGATATCCGTTTGATAATCATTTAGTTTTTTATTTTTAGTGTTTATGCTTGTTTTTTCACAACTATTTGCTTTTAATAAAAAAATACTTTTTTTTTAGTTTTCATAACTTTACAACAAAGTAGATAGCAAAATATATATATCTTGAATTATTGAAAGTTTAGGTCTCCTTCGATAACTATTTTATTTTTGGTTTTTAGTTTTTGAAAGTTGATTATCTTGAGTAAGCATTTGAACTTTTAGCCAAATTCTAAAAACAAAGTGAAGTTTTTAAAAAACTACTTTTTTTAGTTTTCAAAAATTGGCTTGATTTTTTAAACTATAGGTACAAAATAGATAAAAACATATAAACTCATGAGTAGAAATAGTATTTTTAGGTTTAAATTTTAAAAACAAAAAACAAAATGGTTATCAAATCGATATTAGAGATTTAATACAACAACACTTTTAGATTTCAGAATTTAAAAGAAAATACTTTATAATCTATTTTCAAACATATGAACAAACTTGTAATTCATGAACTGATAATTGATATGAAAAAAAAGTATGGTACGGTGTTGTGTTATTAATAATGTGGATTTTGGAGCAGAAGATATGGAGATTTCATAGAGAGCATTCACATGCAAGAGGCGCAGCCGCCGGAGCAGCCGCTGTTCGCCCGGCTGGTCGTCAAAACGGAATCCTCCATCGACCTCGTCCTTGAATCAAGAACCAAAGCTAAATTTTCCATCAACGGCAAACACGTTTGGGCTCGAAAATATGTCCGTAAAGCTCCCCGGTCGCCGCAGCTCCGACCTTCTCCGCCGCCGACACGGCCCCCTTCTCCGCCACTCATCGATGGAATTAGAGCTTTTAACTTCCTCCCACCGCCGCCGCCGCCCGCACCCTGA

mRNA sequence

ATGGCTTCCTCTTCCTCGTCCCCACGCTCAGATATTTACACGGTAACACAAGAGGAATTCAACATTTTTCACACCATCGACCGAACTCTCTTCAGTCGCATGGTGTTCGCACTCGGTCGCGACCCGGACGAGTCGGTTCGAGTCATGGGACTGTGGCTTTGGCTCGAACAGAACGGCGAAGAGTTCAATTTGGTCTACAAAATGCTGTCGTTACCCGACGCGTTGGTTGACGCTCTATCGGACGAGGCCGTCATGTCATTGGCGTGTATAGAAAACGACAAATTCCCCTTCGAACCGGAGTCATCGGTCGACATTCCCCTCATTCAACACGTTTCTAAAACCCCGGTTTCACTCCGGTTCTTCCACGAGAACCGGCTCAGAATCCTCCGTGCCGTTACCATCATTTGTTCCGATATTTGCCACCGGGCCTTTAAGGACATTCTCCACGCGCTTGAAACTCAGCGAGTTCTGTCACGCGCCGCTGTGGCCGTACCCGCGGTCCACGGCGGCGCCCGTGGAAGCTACTTTGCCACGGCTCCGGCGAGTCAGTTTGTTGTTCCGGCTTTTGGGTTCGTGAATCTCAGGGGAGAGAGTCCGCCGGTGTCGGCGACGGCGACGGCGACGGCGAGTCAGAGTGTCAGAGGGGAAAACCGGCAAGTGATTTCGAAGAGTGAAATGAGTGATTTGTTCAGTCGTTTACAGTTGAAGAGCGGTAAGGAAGAAGAAGGGGAGGCGGTTCCGCCAGACGAGAGGACGATTTTTCTGACGTTCTCGAAAGGATACCCAATTTCTGAAGAGGAAGTTAGAGACTATTTTGGCAGAAGATATGGAGATTTCATAGAGAGCATTCACATGCAAGAGGCGCAGCCGCCGGAGCAGCCGCTGTTCGCCCGGCTGGTCGTCAAAACGGAATCCTCCATCGACCTCGTCCTTGAATCAAGAACCAAAGCTAAATTTTCCATCAACGGCAAACACGTTTGGGCTCGAAAATATGTCCGTAAAGCTCCCCGGTCGCCGCAGCTCCGACCTTCTCCGCCGCCGACACGGCCCCCTTCTCCGCCACTCATCGATGGAATTAGAGCTTTTAACTTCCTCCCACCGCCGCCGCCGCCCGCACCCTGA

Coding sequence (CDS)

ATGGCTTCCTCTTCCTCGTCCCCACGCTCAGATATTTACACGGTAACACAAGAGGAATTCAACATTTTTCACACCATCGACCGAACTCTCTTCAGTCGCATGGTGTTCGCACTCGGTCGCGACCCGGACGAGTCGGTTCGAGTCATGGGACTGTGGCTTTGGCTCGAACAGAACGGCGAAGAGTTCAATTTGGTCTACAAAATGCTGTCGTTACCCGACGCGTTGGTTGACGCTCTATCGGACGAGGCCGTCATGTCATTGGCGTGTATAGAAAACGACAAATTCCCCTTCGAACCGGAGTCATCGGTCGACATTCCCCTCATTCAACACGTTTCTAAAACCCCGGTTTCACTCCGGTTCTTCCACGAGAACCGGCTCAGAATCCTCCGTGCCGTTACCATCATTTGTTCCGATATTTGCCACCGGGCCTTTAAGGACATTCTCCACGCGCTTGAAACTCAGCGAGTTCTGTCACGCGCCGCTGTGGCCGTACCCGCGGTCCACGGCGGCGCCCGTGGAAGCTACTTTGCCACGGCTCCGGCGAGTCAGTTTGTTGTTCCGGCTTTTGGGTTCGTGAATCTCAGGGGAGAGAGTCCGCCGGTGTCGGCGACGGCGACGGCGACGGCGAGTCAGAGTGTCAGAGGGGAAAACCGGCAAGTGATTTCGAAGAGTGAAATGAGTGATTTGTTCAGTCGTTTACAGTTGAAGAGCGGTAAGGAAGAAGAAGGGGAGGCGGTTCCGCCAGACGAGAGGACGATTTTTCTGACGTTCTCGAAAGGATACCCAATTTCTGAAGAGGAAGTTAGAGACTATTTTGGCAGAAGATATGGAGATTTCATAGAGAGCATTCACATGCAAGAGGCGCAGCCGCCGGAGCAGCCGCTGTTCGCCCGGCTGGTCGTCAAAACGGAATCCTCCATCGACCTCGTCCTTGAATCAAGAACCAAAGCTAAATTTTCCATCAACGGCAAACACGTTTGGGCTCGAAAATATGTCCGTAAAGCTCCCCGGTCGCCGCAGCTCCGACCTTCTCCGCCGCCGACACGGCCCCCTTCTCCGCCACTCATCGATGGAATTAGAGCTTTTAACTTCCTCCCACCGCCGCCGCCGCCCGCACCCTGA

Protein sequence

MASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEEFNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFFHENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPASQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEEEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARLVVKTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPPTRPPSPPLIDGIRAFNFLPPPPPPAP
Homology
BLAST of Tan0007868 vs. NCBI nr
Match: KAG7033019.1 (hypothetical protein SDJN02_07071, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 625.9 bits (1613), Expect = 2.2e-175
Identity = 325/372 (87.37%), Postives = 343/372 (92.20%), Query Frame = 0

Query: 2   ASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEE 61
           +SSSSSPR+DIYTVTQEEFN+FHTIDRTLFSRMVF LGRDPDESVRVMGLWLWLEQNGEE
Sbjct: 4   SSSSSSPRADIYTVTQEEFNVFHTIDRTLFSRMVFTLGRDPDESVRVMGLWLWLEQNGEE 63

Query: 62  FNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFF 121
           FNLVYKMLSLPDALVDAL DEAV+SLACIENDKFPFEP++SVDIPLIQHVSKTPVSLRFF
Sbjct: 64  FNLVYKMLSLPDALVDALFDEAVLSLACIENDKFPFEPDTSVDIPLIQHVSKTPVSLRFF 123

Query: 122 HENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPA 181
           HENRLRILR VT IC+DIC+RAFKDIL+ALETQRVLSRA+V VPA+HGG RGSYF  APA
Sbjct: 124 HENRLRILRGVTKICTDICYRAFKDILYALETQRVLSRASVPVPAIHGG-RGSYFPAAPA 183

Query: 182 SQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEE 241
           S   VPAFGFVN+RGES P+     ATASQS R  NR VIS+SEMSDLF RL+L+SGK E
Sbjct: 184 SPLAVPAFGFVNVRGESSPLPPPTAATASQSARVGNRNVISRSEMSDLFGRLKLRSGK-E 243

Query: 242 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARLVV 301
           EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPL+ARLVV
Sbjct: 244 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLYARLVV 303

Query: 302 KTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPPTRPPSPPLIDGIRA 361
           KTESSIDLVLE RTKAKFSINGKHVWARKYVRKAPRSP  RPSPP TRPPSPPLIDG+RA
Sbjct: 304 KTESSIDLVLEGRTKAKFSINGKHVWARKYVRKAPRSPH-RPSPPSTRPPSPPLIDGVRA 363

Query: 362 FNFLPPPPPPAP 374
           FNF  PPPPPAP
Sbjct: 364 FNF--PPPPPAP 370

BLAST of Tan0007868 vs. NCBI nr
Match: XP_022957165.1 (uncharacterized protein LOC111458632 [Cucurbita moschata])

HSP 1 Score: 623.6 bits (1607), Expect = 1.1e-174
Identity = 324/372 (87.10%), Postives = 342/372 (91.94%), Query Frame = 0

Query: 2   ASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEE 61
           +SSSSSPR+DIYTVTQEEFN+FHTIDRTLFSRMVF LGRDPDESVRVMGLWLWLEQNGEE
Sbjct: 4   SSSSSSPRADIYTVTQEEFNVFHTIDRTLFSRMVFTLGRDPDESVRVMGLWLWLEQNGEE 63

Query: 62  FNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFF 121
           FNLVYKMLSLPDALVDAL DEAV+SLACIENDKFPFEP++SVDIPLIQHVSKTPVSLRFF
Sbjct: 64  FNLVYKMLSLPDALVDALFDEAVLSLACIENDKFPFEPDTSVDIPLIQHVSKTPVSLRFF 123

Query: 122 HENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPA 181
           HENRLRILR VT IC+DIC+RAFKDIL+ALETQRVLSRA+V VPA+HGG RGSYF  APA
Sbjct: 124 HENRLRILRGVTKICTDICYRAFKDILYALETQRVLSRASVPVPAIHGG-RGSYFPAAPA 183

Query: 182 SQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEE 241
           S   VPAFGFVN+RGES P+     ATASQS R  NR VIS+SEMSDLF RL+L+SGK E
Sbjct: 184 SPLAVPAFGFVNVRGESSPLPPPTAATASQSARVGNRNVISRSEMSDLFGRLKLRSGK-E 243

Query: 242 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARLVV 301
           EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPL+ARLVV
Sbjct: 244 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLYARLVV 303

Query: 302 KTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPPTRPPSPPLIDGIRA 361
           KTESSIDLVLE RTKAKFSINGKHVWARKYVRKAPR P  RPSPP TRPPSPPLIDG+RA
Sbjct: 304 KTESSIDLVLEGRTKAKFSINGKHVWARKYVRKAPRLPH-RPSPPSTRPPSPPLIDGVRA 363

Query: 362 FNFLPPPPPPAP 374
           FNF  PPPPPAP
Sbjct: 364 FNF--PPPPPAP 370

BLAST of Tan0007868 vs. NCBI nr
Match: KAG6602336.1 (hypothetical protein SDJN03_07569, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 623.2 bits (1606), Expect = 1.4e-174
Identity = 323/372 (86.83%), Postives = 342/372 (91.94%), Query Frame = 0

Query: 2   ASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEE 61
           +SSSSSPR+DIYTVTQEEFN+FHTIDRTLFSRMVF LGRDPDESVRVMGLWLWLEQNGEE
Sbjct: 4   SSSSSSPRADIYTVTQEEFNVFHTIDRTLFSRMVFTLGRDPDESVRVMGLWLWLEQNGEE 63

Query: 62  FNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFF 121
           FNLVYKMLSLPDALVDAL DEAV+SLACIENDKFPFEP++SVDIPL+QHVSKTPVSLRFF
Sbjct: 64  FNLVYKMLSLPDALVDALFDEAVLSLACIENDKFPFEPDTSVDIPLVQHVSKTPVSLRFF 123

Query: 122 HENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPA 181
           HENRLRILR VT IC+DIC+RAFKDIL+ALETQRVLSRA+V VPA+HGG RGSYF  APA
Sbjct: 124 HENRLRILRGVTKICTDICYRAFKDILYALETQRVLSRASVPVPAIHGG-RGSYFPAAPA 183

Query: 182 SQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEE 241
           S   VPAFGFVN+RGES P+     ATASQS R  NR VIS+SEMSDLF RL+L+SGK E
Sbjct: 184 SPLAVPAFGFVNVRGESSPLPPPTAATASQSARVGNRNVISRSEMSDLFGRLKLRSGK-E 243

Query: 242 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARLVV 301
           EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPL+ARLVV
Sbjct: 244 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLYARLVV 303

Query: 302 KTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPPTRPPSPPLIDGIRA 361
           KTESSIDLVLE RTKAKFSINGKHVWARKYVRKAPR P  RPSPP TRPPSPPLIDG+RA
Sbjct: 304 KTESSIDLVLEGRTKAKFSINGKHVWARKYVRKAPRLPH-RPSPPSTRPPSPPLIDGVRA 363

Query: 362 FNFLPPPPPPAP 374
           FNF  PPPPPAP
Sbjct: 364 FNF--PPPPPAP 370

BLAST of Tan0007868 vs. NCBI nr
Match: XP_023529203.1 (uncharacterized protein LOC111792024 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 621.7 bits (1602), Expect = 4.1e-174
Identity = 328/374 (87.70%), Postives = 346/374 (92.51%), Query Frame = 0

Query: 2   ASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEE 61
           +SSSSSPR+DIYTVTQEEFN+FHTIDRTLFSRMVF LGRDP+ESVRVMGLWLWLEQNGEE
Sbjct: 4   SSSSSSPRADIYTVTQEEFNVFHTIDRTLFSRMVFTLGRDPEESVRVMGLWLWLEQNGEE 63

Query: 62  FNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFF 121
           FNLVYKMLSLPDALVDAL DEAV+SLACIENDKFPFEP++S+DIPLIQHVSKTPVSLRFF
Sbjct: 64  FNLVYKMLSLPDALVDALFDEAVLSLACIENDKFPFEPDTSIDIPLIQHVSKTPVSLRFF 123

Query: 122 HENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPA 181
           HENRL ILR VT IC+DIC+RAFKDIL+ALETQRVLSRA+V VPAVHGG RGSYFA APA
Sbjct: 124 HENRLGILRGVTKICTDICYRAFKDILYALETQRVLSRASVPVPAVHGG-RGSYFAAAPA 183

Query: 182 SQFVVPAFGFVNLRGE-SPPVSATAT-ATASQSVRGENRQVISKSEMSDLFSRLQLKSGK 241
           S  VVPAFGFVN+RGE SPP    +T ATASQS R  NR VISKSEMSDLF RL+L+SGK
Sbjct: 184 SPLVVPAFGFVNVRGESSPPFPPPSTAATASQSARVGNRNVISKSEMSDLFGRLKLRSGK 243

Query: 242 EEEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARL 301
            EEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPL+ARL
Sbjct: 244 -EEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLYARL 303

Query: 302 VVKTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPPTRPPSPPLIDGI 361
           VVKTESSIDLVLE RTKAKFSINGKHVWARKYVRKAPRSP  RPSPP TRPPSPPLIDG+
Sbjct: 304 VVKTESSIDLVLEGRTKAKFSINGKHVWARKYVRKAPRSPH-RPSPPSTRPPSPPLIDGV 363

Query: 362 RAFNFLPPPPPPAP 374
           RAFNF  PPPPPAP
Sbjct: 364 RAFNF--PPPPPAP 372

BLAST of Tan0007868 vs. NCBI nr
Match: XP_022990387.1 (uncharacterized protein LOC111487260 [Cucurbita maxima])

HSP 1 Score: 619.8 bits (1597), Expect = 1.6e-173
Identity = 323/372 (86.83%), Postives = 342/372 (91.94%), Query Frame = 0

Query: 2   ASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEE 61
           +SSSSSPR+DIYTVTQEEFN+FHTIDRTLF+RMVF LGRDPDESVRVMGLWLWLEQNGEE
Sbjct: 8   SSSSSSPRADIYTVTQEEFNVFHTIDRTLFTRMVFTLGRDPDESVRVMGLWLWLEQNGEE 67

Query: 62  FNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFF 121
           FNLVYKMLSLPDALVDAL DEAV+SLACIENDKFPFEP++SVDIPLIQHVSKTPVSLRFF
Sbjct: 68  FNLVYKMLSLPDALVDALFDEAVLSLACIENDKFPFEPDTSVDIPLIQHVSKTPVSLRFF 127

Query: 122 HENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPA 181
           H+NRLRILR VT IC+DIC+RAFKDIL+ALET+RVLSRA+V VPA+HGG RGSYFA APA
Sbjct: 128 HQNRLRILRGVTKICTDICYRAFKDILYALETRRVLSRASVPVPAIHGG-RGSYFAAAPA 187

Query: 182 SQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEE 241
           S  VVPAFGFVN+RGES P      ATASQS R   R VISKSEMSDLF RL+L+SGK E
Sbjct: 188 SPLVVPAFGFVNVRGESSPPPPPEAATASQSARVGKRNVISKSEMSDLFGRLKLRSGK-E 247

Query: 242 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARLVV 301
           EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPL+ARLVV
Sbjct: 248 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLYARLVV 307

Query: 302 KTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPPTRPPSPPLIDGIRA 361
           KTESSIDLVLE RTKAKFSINGKHVWARKYVRKAPRSP  RPSPP TRPPSPPLIDG+RA
Sbjct: 308 KTESSIDLVLEGRTKAKFSINGKHVWARKYVRKAPRSPH-RPSPPSTRPPSPPLIDGVRA 367

Query: 362 FNFLPPPPPPAP 374
           FNF  P PPPAP
Sbjct: 368 FNF--PTPPPAP 374

BLAST of Tan0007868 vs. ExPASy TrEMBL
Match: A0A6J1GZS4 (uncharacterized protein LOC111458632 OS=Cucurbita moschata OX=3662 GN=LOC111458632 PE=4 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 5.2e-175
Identity = 324/372 (87.10%), Postives = 342/372 (91.94%), Query Frame = 0

Query: 2   ASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEE 61
           +SSSSSPR+DIYTVTQEEFN+FHTIDRTLFSRMVF LGRDPDESVRVMGLWLWLEQNGEE
Sbjct: 4   SSSSSSPRADIYTVTQEEFNVFHTIDRTLFSRMVFTLGRDPDESVRVMGLWLWLEQNGEE 63

Query: 62  FNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFF 121
           FNLVYKMLSLPDALVDAL DEAV+SLACIENDKFPFEP++SVDIPLIQHVSKTPVSLRFF
Sbjct: 64  FNLVYKMLSLPDALVDALFDEAVLSLACIENDKFPFEPDTSVDIPLIQHVSKTPVSLRFF 123

Query: 122 HENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPA 181
           HENRLRILR VT IC+DIC+RAFKDIL+ALETQRVLSRA+V VPA+HGG RGSYF  APA
Sbjct: 124 HENRLRILRGVTKICTDICYRAFKDILYALETQRVLSRASVPVPAIHGG-RGSYFPAAPA 183

Query: 182 SQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEE 241
           S   VPAFGFVN+RGES P+     ATASQS R  NR VIS+SEMSDLF RL+L+SGK E
Sbjct: 184 SPLAVPAFGFVNVRGESSPLPPPTAATASQSARVGNRNVISRSEMSDLFGRLKLRSGK-E 243

Query: 242 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARLVV 301
           EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPL+ARLVV
Sbjct: 244 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLYARLVV 303

Query: 302 KTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPPTRPPSPPLIDGIRA 361
           KTESSIDLVLE RTKAKFSINGKHVWARKYVRKAPR P  RPSPP TRPPSPPLIDG+RA
Sbjct: 304 KTESSIDLVLEGRTKAKFSINGKHVWARKYVRKAPRLPH-RPSPPSTRPPSPPLIDGVRA 363

Query: 362 FNFLPPPPPPAP 374
           FNF  PPPPPAP
Sbjct: 364 FNF--PPPPPAP 370

BLAST of Tan0007868 vs. ExPASy TrEMBL
Match: A0A6J1JIJ6 (uncharacterized protein LOC111487260 OS=Cucurbita maxima OX=3661 GN=LOC111487260 PE=4 SV=1)

HSP 1 Score: 619.8 bits (1597), Expect = 7.6e-174
Identity = 323/372 (86.83%), Postives = 342/372 (91.94%), Query Frame = 0

Query: 2   ASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEE 61
           +SSSSSPR+DIYTVTQEEFN+FHTIDRTLF+RMVF LGRDPDESVRVMGLWLWLEQNGEE
Sbjct: 8   SSSSSSPRADIYTVTQEEFNVFHTIDRTLFTRMVFTLGRDPDESVRVMGLWLWLEQNGEE 67

Query: 62  FNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFF 121
           FNLVYKMLSLPDALVDAL DEAV+SLACIENDKFPFEP++SVDIPLIQHVSKTPVSLRFF
Sbjct: 68  FNLVYKMLSLPDALVDALFDEAVLSLACIENDKFPFEPDTSVDIPLIQHVSKTPVSLRFF 127

Query: 122 HENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPA 181
           H+NRLRILR VT IC+DIC+RAFKDIL+ALET+RVLSRA+V VPA+HGG RGSYFA APA
Sbjct: 128 HQNRLRILRGVTKICTDICYRAFKDILYALETRRVLSRASVPVPAIHGG-RGSYFAAAPA 187

Query: 182 SQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEE 241
           S  VVPAFGFVN+RGES P      ATASQS R   R VISKSEMSDLF RL+L+SGK E
Sbjct: 188 SPLVVPAFGFVNVRGESSPPPPPEAATASQSARVGKRNVISKSEMSDLFGRLKLRSGK-E 247

Query: 242 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARLVV 301
           EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPL+ARLVV
Sbjct: 248 EGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLYARLVV 307

Query: 302 KTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPPTRPPSPPLIDGIRA 361
           KTESSIDLVLE RTKAKFSINGKHVWARKYVRKAPRSP  RPSPP TRPPSPPLIDG+RA
Sbjct: 308 KTESSIDLVLEGRTKAKFSINGKHVWARKYVRKAPRSPH-RPSPPSTRPPSPPLIDGVRA 367

Query: 362 FNFLPPPPPPAP 374
           FNF  P PPPAP
Sbjct: 368 FNF--PTPPPAP 374

BLAST of Tan0007868 vs. ExPASy TrEMBL
Match: A0A6J1BYR7 (uncharacterized protein LOC111006470 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111006470 PE=4 SV=1)

HSP 1 Score: 488.4 bits (1256), Expect = 2.6e-134
Identity = 280/378 (74.07%), Postives = 301/378 (79.63%), Query Frame = 0

Query: 3   SSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEEF 62
           SSSS   S+ YTVTQEEF  FHTIDRTLFSRMV+ LGRD DESVRVMGLWLWLEQNGEEF
Sbjct: 9   SSSSFSSSEFYTVTQEEFITFHTIDRTLFSRMVYTLGRDSDESVRVMGLWLWLEQNGEEF 68

Query: 63  NLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPES--SVDIPLIQHVSKTPVSLRF 122
           NLV+KMLSLPDALVDALSDEAVMSLACIENDKFPFEPES  SVDIPLIQHVSKTPVSLRF
Sbjct: 69  NLVHKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSPSVDIPLIQHVSKTPVSLRF 128

Query: 123 FHENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAP 182
           FHENRLRILR V+ I +DIC RAF DIL  LETQRVL RA +            YF  AP
Sbjct: 129 FHENRLRILRGVSKIINDICLRAFHDILQNLETQRVLPRAPL------------YFGAAP 188

Query: 183 AS-QFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGK 242
           A+ QFVVP+FGFV              + A +SV GE+R+VI   E+SD+F RLQLKSGK
Sbjct: 189 AAGQFVVPSFGFV------------GPSAAIRSVGGEHRRVILNGEVSDVFRRLQLKSGK 248

Query: 243 --EEEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFA 302
             E+E E+VP +ERTIFLTFSKGYPISE+EVRDYF RRYG+FIESIHMQE QPPEQPL+A
Sbjct: 249 EGEKEEESVPAEERTIFLTFSKGYPISEDEVRDYFARRYGNFIESIHMQEVQPPEQPLYA 308

Query: 303 RLVVKTESSIDLVLESRTKAKFSINGKHVWARKYVRK--APRSPQLRPSPPPTRPPSPPL 362
           RLVVKTE SID+VLESRTKAKFSINGKHVWARKYVRK    RS  LRPSP    PPSPPL
Sbjct: 309 RLVVKTEPSIDIVLESRTKAKFSINGKHVWARKYVRKNSNSRSSPLRPSP----PPSPPL 356

Query: 363 IDGIRAFNFLPPPPPPAP 374
            DGIR FNF PP  PPAP
Sbjct: 369 -DGIRPFNF-PPQTPPAP 356

BLAST of Tan0007868 vs. ExPASy TrEMBL
Match: A0A6J1C145 (uncharacterized protein LOC111006470 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111006470 PE=4 SV=1)

HSP 1 Score: 481.5 bits (1238), Expect = 3.2e-132
Identity = 279/378 (73.81%), Postives = 300/378 (79.37%), Query Frame = 0

Query: 3   SSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEEF 62
           SSSS   S+ YTVTQEEF  FHTIDRTLFSRMV+ LGRD DESVRVMGLWLWLEQNGEEF
Sbjct: 9   SSSSFSSSEFYTVTQEEFITFHTIDRTLFSRMVYTLGRDSDESVRVMGLWLWLEQNGEEF 68

Query: 63  NLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPES--SVDIPLIQHVSKTPVSLRF 122
           NLV+KMLSLPDALVDALSDEAVMSLACIENDKFPFEPES  SVDIPLIQHVSKTPVSLRF
Sbjct: 69  NLVHKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSPSVDIPLIQHVSKTPVSLRF 128

Query: 123 FHENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAP 182
           FHENRLRILR V+ I +DIC RAF DIL  LETQRVL RA +            YF  AP
Sbjct: 129 FHENRLRILRGVSKIINDICLRAFHDILQNLETQRVLPRAPL------------YFGAAP 188

Query: 183 AS-QFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGK 242
           A+ QFVVP+FGFV              + A +SV GE+R+VI   E+SD+F RLQLKSGK
Sbjct: 189 AAGQFVVPSFGFV------------GPSAAIRSVGGEHRRVILNGEVSDVFRRLQLKSGK 248

Query: 243 --EEEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFA 302
             E+E E+VP +ERTIFLTFSKGYPISE+EVRDYF  RYG+FIESIHMQE QPPEQPL+A
Sbjct: 249 EGEKEEESVPAEERTIFLTFSKGYPISEDEVRDYFA-RYGNFIESIHMQEVQPPEQPLYA 308

Query: 303 RLVVKTESSIDLVLESRTKAKFSINGKHVWARKYVRK--APRSPQLRPSPPPTRPPSPPL 362
           RLVVKTE SID+VLESRTKAKFSINGKHVWARKYVRK    RS  LRPSP    PPSPPL
Sbjct: 309 RLVVKTEPSIDIVLESRTKAKFSINGKHVWARKYVRKNSNSRSSPLRPSP----PPSPPL 355

Query: 363 IDGIRAFNFLPPPPPPAP 374
            DGIR FNF PP  PPAP
Sbjct: 369 -DGIRPFNF-PPQTPPAP 355

BLAST of Tan0007868 vs. ExPASy TrEMBL
Match: A0A1S3CJK2 (uncharacterized protein LOC103501679 OS=Cucumis melo OX=3656 GN=LOC103501679 PE=4 SV=1)

HSP 1 Score: 452.6 bits (1163), Expect = 1.6e-123
Identity = 258/395 (65.32%), Postives = 293/395 (74.18%), Query Frame = 0

Query: 1   MASSSSSPRSDI-YTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNG 60
           MASSSSS +  I YT+TQEEFN+FHTIDR LFSRMVF+LGR+PDESVRVMG WLWLE+NG
Sbjct: 1   MASSSSSLQDTINYTITQEEFNMFHTIDRKLFSRMVFSLGREPDESVRVMGFWLWLEKNG 60

Query: 61  EEFNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLR 120
           EE +LVYK+L LPD LVDAL DEAVMSLACIENDKFPFEP+S+VD+PLIQHVSKTPVSLR
Sbjct: 61  EESSLVYKILGLPDVLVDALCDEAVMSLACIENDKFPFEPDSTVDVPLIQHVSKTPVSLR 120

Query: 121 FFHENRLRILRAVTIICSDICHRAFKDILHALETQRVLSR--AAVAVPAVHG-GARGSYF 180
           FFH NRL ILR VT +C+DIC+RAF DIL AL T+R +SR  AAV++PAV G G RG  F
Sbjct: 121 FFHLNRLGILRGVTKMCNDICNRAFLDILQALHTRRAISRAPAAVSIPAVQGEGGRGRVF 180

Query: 181 --ATAPASQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQ 240
                P S+F VP+ GF+ LRGE                        S + +    S L+
Sbjct: 181 EGRAPPVSKFFVPSIGFLGLRGEG-----------------------STAAIRSGMSSLE 240

Query: 241 LKSGK-------EEEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQE 300
           LKSGK       EEEGEA+P D+RTIFLTFSKGYPISE+EVRD+FGRRYGDFIESIHMQE
Sbjct: 241 LKSGKEEEEEEGEEEGEAIPADQRTIFLTFSKGYPISEDEVRDFFGRRYGDFIESIHMQE 300

Query: 301 AQPPEQPLFARLVVKTESSIDLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPSPPP 360
           A PPEQPL+ARLVVKTES IDLVLE+RTKAKFSINGKHVWARKYVRK P    +R SP P
Sbjct: 301 AHPPEQPLYARLVVKTESYIDLVLEARTKAKFSINGKHVWARKYVRKTP----IRSSPRP 360

Query: 361 TRPPSPPLIDGIRAFNFLPPPP---------PPAP 374
           + PPSP L    R   ++  PP         PP+P
Sbjct: 361 SPPPSPSL----RRAQYVRRPPIRSSLIPSSPPSP 364

BLAST of Tan0007868 vs. TAIR 10
Match: AT1G49290.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G13620.1); Has 99 Blast hits to 93 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 97; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 188.0 bits (476), Expect = 1.4e-47
Identity = 121/338 (35.80%), Postives = 189/338 (55.92%), Query Frame = 0

Query: 6   SSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEEFNLV 65
           ++P S I  VT++EFN FHTIDRTLFSR+VF L RD D+S   M   L+LEQ+    +++
Sbjct: 11  NNPLSSI-VVTRDEFNAFHTIDRTLFSRLVFNLNRDVDQSFLAMCFLLFLEQSSYARDII 70

Query: 66  YKMLSLPDALVDALSDEAVMSLACIENDK-----FPFEPESSVDIPLIQHVSKTPVSLRF 125
             ++SLP+A VDA+++E  + +  + N +     F  + + +  IPL+  ++    +LR 
Sbjct: 71  AYLVSLPNAFVDAVANEIGVCINLLYNVEFASTFFAADNDDNSMIPLLLRITGGKFTLRL 130

Query: 126 FHENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAP 185
            ++ R      VT   +D+C RAF D+       R+     +A+       R  +     
Sbjct: 131 INQQRKNFCAGVTKSWTDVCTRAFSDLCET--AHRINREKQLAL------EREKFIEDMK 190

Query: 186 ASQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKE 245
             +  +       L  +   +++         V  E  + + + E  ++         KE
Sbjct: 191 KLRLSLQQEKSNRLSVQQVKIASPPPPRPHPPVEDETEKALREKETMEV---------KE 250

Query: 246 EEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFARLV 305
           +EG  +  D+RT+FLTFSKGYPISE EVR YF RR+G+ IE++ MQE +  EQPLFA++V
Sbjct: 251 KEG-VLAADDRTVFLTFSKGYPISEAEVRVYFTRRFGEVIEAVEMQEVEANEQPLFAKMV 310

Query: 306 VKTE--SSIDLVLESRTKAKFSINGKHVWARKYVRKAP 337
           +K +  S +D ++ +R + KF+I+GKHVWARKYVRK P
Sbjct: 311 MKLQCASMMDEIVSARFRNKFTIDGKHVWARKYVRKNP 329

BLAST of Tan0007868 vs. TAIR 10
Match: AT5G13620.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G49290.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 170.2 bits (430), Expect = 3.0e-42
Identity = 116/347 (33.43%), Postives = 179/347 (51.59%), Query Frame = 0

Query: 1   MASSSSSPRSDIYTVTQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGE 60
           MASSSS+       VT++EFN FH  DR LF R V  L RD ++S++VM   L+LE++G 
Sbjct: 1   MASSSSA-----VAVTRDEFNAFHKCDRALFRRFVVRLRRDINQSLQVMSFLLYLEKSGL 60

Query: 61  EFNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPES--SVDIPLIQHVSKTPVSL 120
             NL+    SLPD  ++ ++DE VM L+C+  + F     +     IPLI  ++   ++L
Sbjct: 61  VSNLIVNFNSLPDFFINTVADEVVMCLSCLSYENFSMFVANFGKKIIPLITRMTGEYLTL 120

Query: 121 RFFHENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFAT 180
              H+NR  IL  +    + IC+ AF+DI    E ++V+          H G   +    
Sbjct: 121 AVIHQNRESILLDMKKHLTSICYPAFEDICVQAEKEKVIE------DMKHLGFSKAVHKA 180

Query: 181 APASQFVVPAFGFVNLRGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSG 240
             +SQF+                                +Q  +++    +FS       
Sbjct: 181 GSSSQFL------------------------------SEQQATTRTSKVGVFS------- 240

Query: 241 KEEEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGDFIESIHMQEAQPPEQPLFAR 300
              E E    D+RT+FLTFS+GYP+SE EV  YF RR+G+ IE+I M   +  EQ L+A+
Sbjct: 241 ---EDEQAREDDRTVFLTFSRGYPLSEAEVHAYFTRRFGEIIEAIIMPGGEGNEQALYAK 296

Query: 301 LVVKTESSI-DLVLESRTKAKFSINGKHVWARKYVRKAPRSPQLRPS 345
           +V+ + + I ++V +   + K++INGKHVWARKY+ ++  +  L PS
Sbjct: 301 MVLHSAAMIPEIVSDGIERNKYTINGKHVWARKYIPRSSINNNLAPS 296

BLAST of Tan0007868 vs. TAIR 10
Match: AT1G64870.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G45200.1); Has 99 Blast hits to 91 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 99; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 136.3 bits (342), Expect = 4.9e-32
Identity = 100/331 (30.21%), Postives = 158/331 (47.73%), Query Frame = 0

Query: 16  TQEEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEEFNLVYKMLSL-PDA 75
           T E+ + FH  +R +FS++V  L R P ES+ VM  WLW E  G  F  ++ ++++  D 
Sbjct: 5   TVEQLHAFHAQEREIFSKLVQKLRRPPAESLLVMATWLWFEDFG--FGNIFSIITVFSDL 64

Query: 76  LVDALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFFHENRLRILRAVTI 135
           L+  L++EAV+   C+E+D+    P     IPL +   K  +SL+  H +R   +  +  
Sbjct: 65  LIVDLANEAVLCFRCLESDQ---PPNDVSQIPLTERFMKNDISLQIIHNHRYTAITGIKN 124

Query: 136 ICSDICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPASQFVVPAFGFVNL 195
             + IC R F DIL     QRVL  ++            S F T      ++P F     
Sbjct: 125 FLTTICSRIFSDIL-----QRVLPSSS------------SSFITNLRHPLIIPGF----- 184

Query: 196 RGESPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEEEGEAVPPD-ERTI 255
                P   +     +         ++++  + +  S L            V  D ERT+
Sbjct: 185 ---PHPTFGSINVLPN---------IVARDNLPNANSFLFPHGLWGWNANHVATDKERTV 244

Query: 256 FLTFSKGYPISEEEVRDYFGRRYG-DFIESIHMQE------------AQPPEQPLFARLV 315
           FLTFS+G+P+S  EV   F   YG D +ES++M E                +QPLFA++V
Sbjct: 245 FLTFSRGFPVSHAEVIHLFTEIYGEDCVESVYMPEDGGNSSNDNTNCNGHQQQPLFAKMV 296

Query: 316 VKTESSIDLVLESRTKAKFSINGKHVWARKY 332
           + +  ++D +L  + K K+ INGKH+WARK+
Sbjct: 305 LDSVVTVDRILSGQEKQKYKINGKHIWARKF 296

BLAST of Tan0007868 vs. TAIR 10
Match: AT3G45200.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G64870.1); Has 95 Blast hits to 91 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 95; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 122.5 bits (306), Expect = 7.3e-28
Identity = 100/329 (30.40%), Postives = 156/329 (47.42%), Query Frame = 0

Query: 18  EEFNIFHTIDRTLFSRMVFALGRDPDESVRVMGLWLWLEQNGEEFNLVYKMLSLPDALVD 77
           +E ++FH  DR +FS++V    R P ES+ VM  WLWLE  G E N+   +L+L D L+ 
Sbjct: 7   QELHVFHAQDREIFSKLVLKFSRPPAESLLVMATWLWLEDFGFE-NIFSIILTLTDPLIA 66

Query: 78  ALSDEAVMSLACIENDKFPFEPESSVDIPLIQHVSKTPVSLRFFHENRLRILRAVTIICS 137
            L+ EAV    C+  +  P        IPL     K  +SL+  ++NR   +  +    +
Sbjct: 67  GLAYEAVSCFQCLSLNNPPIG-----RIPLTTKYLKKNISLQMIYKNRYSAITGIKNFLT 126

Query: 138 DICHRAFKDILHALETQRVLSRAAVAVPAVHGGARGSYFATAPASQFVVPAFGFVNLRGE 197
            +C R F DIL      RVL  ++++       AR       P   F  P FG +N+   
Sbjct: 127 TVCTRIFTDIL-----LRVLPPSSMS----SFDARLRQPLQIPG--FPHPIFGSINV--- 186

Query: 198 SPPVSATATATASQSVRGENRQVISKSEMSDLFSRLQLKSGKEEEGEAVPPDERTIFLTF 257
                          V  +N      ++ ++LF       G      A   D RT+FLTF
Sbjct: 187 -----------MPNEVDRDN----FSNKNNNLFFIPNGLWGWNANCIATEND-RTLFLTF 246

Query: 258 SKGYPISEEEVRDYFGRRYGD-FIESIHMQEAQP-----------PEQPLFARLVVKTES 317
           S+GYP++  E+ + F + YG+  +E ++MQ                +Q LFARLV+ + +
Sbjct: 247 SRGYPVTHAEIIELFTKEYGENCVEGVYMQHDNKRSFNANANRSCEQQSLFARLVLDSVT 299

Query: 318 SIDLVLESRTKAKFSINGKHVWARKYVRK 335
           ++D VL+   K +  I GK++WARKY ++
Sbjct: 307 TVDRVLDDEQKKELMIYGKNIWARKYDKR 299

BLAST of Tan0007868 vs. TAIR 10
Match: AT5G11220.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G64870.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 107.1 bits (266), Expect = 3.2e-23
Identity = 82/298 (27.52%), Postives = 142/298 (47.65%), Query Frame = 0

Query: 49  MGLWLWLEQNGEEFNLVYKMLSLPDALVDALSDEAVMSLACIENDKFPFEPESSVDIPLI 108
           M  W WLE    + N++  +L+L D ++ AL++EAV+   C+++ +   +P     IPL 
Sbjct: 1   MATWFWLEDFFSQ-NILSTILALSDPVIMALANEAVLCFQCLDSAE---QPNDFNQIPLT 60

Query: 109 QHVSKTPVSLRFFHENRLRILRAVTIICSDICHRAFKDILHALETQRVLSRAAVAVPAVH 168
             +    +SL+ FH++R   +  +    + +C R F DIL     QR L  ++ + P V 
Sbjct: 61  AELLAKDISLQIFHKHRYSAIAGIRNFLTTVCSRIFSDIL-----QRALPPSS-SYPFV- 120

Query: 169 GGARGSYFATAPASQFVVPAFGFVNLRGE---SPPVSATATATASQSVRGENRQVISKSE 228
              R  +    P   F  P FG +N+  +      +        S  + G N   I+   
Sbjct: 121 --TRLRHPLIIPG--FPHPTFGSINVMHDVVVGDNLYNNNLFPCSHGLWGWNASCIATD- 180

Query: 229 MSDLFSRLQLKSGKEEEGEAVPPDERTIFLTFSKGYPISEEEVRDYFGRRYGD-FIESIH 288
                                  +ERT+F+TFS+G+P+S+ EV+ +F + YG+  +E ++
Sbjct: 181 -----------------------NERTMFITFSRGFPVSQAEVKRFFTKNYGENCVEGVY 240

Query: 289 MQEAQP-----------PEQPLFARLVVKTESSIDLVLESRTKAKFSINGKHVWARKY 332
           M+E               +Q LFA+LV+ + +++D +L+     +F  NGKH+WARKY
Sbjct: 241 MKEDNKNFLNANGNDNGQQQSLFAKLVLNSVATVDRILDGEKIKRFKSNGKHIWARKY 259

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG7033019.12.2e-17587.37hypothetical protein SDJN02_07071, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022957165.11.1e-17487.10uncharacterized protein LOC111458632 [Cucurbita moschata][more]
KAG6602336.11.4e-17486.83hypothetical protein SDJN03_07569, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023529203.14.1e-17487.70uncharacterized protein LOC111792024 [Cucurbita pepo subsp. pepo][more]
XP_022990387.11.6e-17386.83uncharacterized protein LOC111487260 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1GZS45.2e-17587.10uncharacterized protein LOC111458632 OS=Cucurbita moschata OX=3662 GN=LOC1114586... [more]
A0A6J1JIJ67.6e-17486.83uncharacterized protein LOC111487260 OS=Cucurbita maxima OX=3661 GN=LOC111487260... [more]
A0A6J1BYR72.6e-13474.07uncharacterized protein LOC111006470 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1C1453.2e-13273.81uncharacterized protein LOC111006470 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A1S3CJK21.6e-12365.32uncharacterized protein LOC103501679 OS=Cucumis melo OX=3656 GN=LOC103501679 PE=... [more]
Match NameE-valueIdentityDescription
AT1G49290.11.4e-4735.80unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G13620.13.0e-4233.43unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G64870.14.9e-3230.21unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G45200.17.3e-2830.40unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G11220.13.2e-2327.52unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 333..373
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 339..354
NoneNo IPR availablePANTHERPTHR33527:SF28GB|AAD43168.1coord: 3..344
NoneNo IPR availablePANTHERPTHR33527OS07G0274300 PROTEINcoord: 3..344

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0007868.1Tan0007868.1mRNA