Tan0008516 (gene) Snake gourd v1

Overview
NameTan0008516
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSerine/arginine repetitive matrix-like protein
LocationLG01: 9344092 .. 9346125 (-)
RNA-Seq ExpressionTan0008516
SyntenyTan0008516
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGCATCTGCAAAAAAAAAAAAAGGAAAAAAAAGAGATTTTTCTTTCCAATATAAATAGTGAGTTTTCTGTACGGTGGCACCACGGTCCTCTTTCTAAAAGGCCAACTCATCATCATCTCTCCCAAACCCTAGCTCTCTCTCTCTCTCTCTCCTCCAGATCTCGCCGGATATTCCGCCGATGTGATTTGCGCCATTAACGTTGACCTTCTCCGGCGTCACTTTCTTCCCCATTACTGGACTTCCTCTTCTTCACCTTCGCATGATCGCCGCCGCCATCCATGGCTTCCGCATGCGTTAACAACATCGGAATGTCGCCGGAGAACTTCCTTGACTGTTCTTCTGCTCCTTGCCATTCCTACAGTTGGCTCAGCCCTCGCATCTCCTTCAGCCGTGACGACTCCCCACCTTCTACTAACCTCCCCGGACCTATATCTAAGCCTGCAGCTGACGTGGCCGGAGAGTCCGAGATTCGAGATCCGGATCCTGAACTGGTGCCGGTCAGCGAGTTCGAGTTCCGCCTTCAAGATCCCGTCGCCTTGATGCTCCCTGCGGATGAGCTGTTTCTGGATGGAAAACTCGTGCCGCTTCAGGTCTCGTCCGTTAAACAACCCTCGGTCAACGGTTTGAAGTCGACGAGGTGCGTTTCTTCGCCGGAGACGGCGGCTCAATCGCGCCGGAGAGTTGAGGAAGAATGCAGTACGGATCCGTATCTGTTCTCTCCCAAGGCGCCGAGGTGTTCCAGTCGATGGAGAGAGCTTCTAGGGCTCAAGAAGCTGTACCAGAGCAGCAGCAATGGCAATGGCAATGGCAGCGCCAAAACCGAAAATCACAAAACGACGTCATCGTCATCGTCGTCTTACTTCTCGGAAGCGAATTCGAAGGCGCTGAGGTATTTCCTCCACCGGAGTTCGAAATCATCGTTGTCGTCTTCGTTCGATTCGTCTCTGAGCCTTCCATTGCTGAAGGATTCCGACAGTGAGTCTGTTTCGCTGTCTTCTTCCCGTGTATCTCTTTCCTCTTCTTCCTCAGGTCACGAACACGAAGATCTTCACAGACTCTCGCTCGATTACGAAAACAAGCCAAACACGAATCCGATTTCCCTCCATCGGAACCCTAATAATAACAATCCACCCCGGATGAGACAGGTAAAACCTCGGCCTAAATCGGAGTCCAATCCAAGATCACCAATGGATCATCATCCGACGGCAACAAGAGTAGGTAGAAGCCCGATGCGGCGTACGCCGGGAGAATCCACCACTAGATTAGCGATCCGAGGAGTATCCGTAGACAGTCCACGAATGAACTCCTCTGGTAAAATCGTATTCCACAACTTGGAGAGAAGCTCGAGCAGTCCAAGCAGCTTCAATGGCGGACCAAAATTCAAGAACAGAGGAATGGAACGGTCATACTCAGCGAACGTGAGAGTAACTCCAGTTCTAAATGTTCCAGTCTGTTCTTCCCTGAGAGGATCCTCAAAATCCGTCTCTGTATTCGGATTCGGTCAGTTATTCTCCAGCACCGGCACCGGCACCGGCGGAAGCAGCAGAAGCCACCAAAGTAGTAGTAGTAATAGTACTAACCGGACGACGTCTAGGCGTATCATCGAAACAGACGGCGGAGGAAAACTCCATTGATGAAGATTAGAAGAAGAAAATGGATTTTGGGTAACGCTAACAACCCCTATTTCATCTCTCTCGTTTTTTTCTTTGGGATTATACTGTATTCGATTGATTCGCCCTTCATTAGAAGACAGATTCAATCGCTAATTCTTAACACCAAAAAGAAGATTCAAAATCATCCGTGTAAATATGGGGAGGAATTTTGTTATTTCTGTTAATTTTTTTTCACCAATTTGAAAGCTCTGTTTGTATGGTCATCAATGGCGTCCACCTGAGAAATTTGTTTTTTCGCTGTTCCATCCAAAAGGAGGCGTTTGAAGGGCACGCGCTGAGAGGGGAGCGTGAGAGTAGAGAAGTCTCTGGGTTTTTGTTTGAAGCTTTTTTCGTAATGATGTGTGACATACTTGTTATCTAA

mRNA sequence

AGCATCTGCAAAAAAAAAAAAAGGAAAAAAAAGAGATTTTTCTTTCCAATATAAATAGTGAGTTTTCTGTACGGTGGCACCACGGTCCTCTTTCTAAAAGGCCAACTCATCATCATCTCTCCCAAACCCTAGCTCTCTCTCTCTCTCTCTCCTCCAGATCTCGCCGGATATTCCGCCGATGTGATTTGCGCCATTAACGTTGACCTTCTCCGGCGTCACTTTCTTCCCCATTACTGGACTTCCTCTTCTTCACCTTCGCATGATCGCCGCCGCCATCCATGGCTTCCGCATGCGTTAACAACATCGGAATGTCGCCGGAGAACTTCCTTGACTGTTCTTCTGCTCCTTGCCATTCCTACAGTTGGCTCAGCCCTCGCATCTCCTTCAGCCGTGACGACTCCCCACCTTCTACTAACCTCCCCGGACCTATATCTAAGCCTGCAGCTGACGTGGCCGGAGAGTCCGAGATTCGAGATCCGGATCCTGAACTGGTGCCGGTCAGCGAGTTCGAGTTCCGCCTTCAAGATCCCGTCGCCTTGATGCTCCCTGCGGATGAGCTGTTTCTGGATGGAAAACTCGTGCCGCTTCAGGTCTCGTCCGTTAAACAACCCTCGGTCAACGGTTTGAAGTCGACGAGGTGCGTTTCTTCGCCGGAGACGGCGGCTCAATCGCGCCGGAGAGTTGAGGAAGAATGCAGTACGGATCCGTATCTGTTCTCTCCCAAGGCGCCGAGGTGTTCCAGTCGATGGAGAGAGCTTCTAGGGCTCAAGAAGCTGTACCAGAGCAGCAGCAATGGCAATGGCAATGGCAGCGCCAAAACCGAAAATCACAAAACGACGTCATCGTCATCGTCGTCTTACTTCTCGGAAGCGAATTCGAAGGCGCTGAGGTATTTCCTCCACCGGAGTTCGAAATCATCGTTGTCGTCTTCGTTCGATTCGTCTCTGAGCCTTCCATTGCTGAAGGATTCCGACAGTGAGTCTGTTTCGCTGTCTTCTTCCCGTGTATCTCTTTCCTCTTCTTCCTCAGGTCACGAACACGAAGATCTTCACAGACTCTCGCTCGATTACGAAAACAAGCCAAACACGAATCCGATTTCCCTCCATCGGAACCCTAATAATAACAATCCACCCCGGATGAGACAGGTAAAACCTCGGCCTAAATCGGAGTCCAATCCAAGATCACCAATGGATCATCATCCGACGGCAACAAGAGTAGGTAGAAGCCCGATGCGGCGTACGCCGGGAGAATCCACCACTAGATTAGCGATCCGAGGAGTATCCGTAGACAGTCCACGAATGAACTCCTCTGGTAAAATCGTATTCCACAACTTGGAGAGAAGCTCGAGCAGTCCAAGCAGCTTCAATGGCGGACCAAAATTCAAGAACAGAGGAATGGAACGGTCATACTCAGCGAACGTGAGAGTAACTCCAGTTCTAAATGTTCCAGTCTGTTCTTCCCTGAGAGGATCCTCAAAATCCGTCTCTGTATTCGGATTCGGTCAGTTATTCTCCAGCACCGGCACCGGCACCGGCGGAAGCAGCAGAAGCCACCAAAGTAGTAGTAGTAATAGTACTAACCGGACGACGTCTAGGCGTATCATCGAAACAGACGGCGGAGGAAAACTCCATTGATGAAGATTAGAAGAAGAAAATGGATTTTGGGTAACGCTAACAACCCCTATTTCATCTCTCTCGTTTTTTTCTTTGGGATTATACTGTATTCGATTGATTCGCCCTTCATTAGAAGACAGATTCAATCGCTAATTCTTAACACCAAAAAGAAGATTCAAAATCATCCGTGTAAATATGGGGAGGAATTTTGTTATTTCTGTTAATTTTTTTTCACCAATTTGAAAGCTCTGTTTGTATGGTCATCAATGGCGTCCACCTGAGAAATTTGTTTTTTCGCTGTTCCATCCAAAAGGAGGCGTTTGAAGGGCACGCGCTGAGAGGGGAGCGTGAGAGTAGAGAAGTCTCTGGGTTTTTGTTTGAAGCTTTTTTCGTAATGATGTGTGACATACTTGTTATCTAA

Coding sequence (CDS)

ATGGCTTCCGCATGCGTTAACAACATCGGAATGTCGCCGGAGAACTTCCTTGACTGTTCTTCTGCTCCTTGCCATTCCTACAGTTGGCTCAGCCCTCGCATCTCCTTCAGCCGTGACGACTCCCCACCTTCTACTAACCTCCCCGGACCTATATCTAAGCCTGCAGCTGACGTGGCCGGAGAGTCCGAGATTCGAGATCCGGATCCTGAACTGGTGCCGGTCAGCGAGTTCGAGTTCCGCCTTCAAGATCCCGTCGCCTTGATGCTCCCTGCGGATGAGCTGTTTCTGGATGGAAAACTCGTGCCGCTTCAGGTCTCGTCCGTTAAACAACCCTCGGTCAACGGTTTGAAGTCGACGAGGTGCGTTTCTTCGCCGGAGACGGCGGCTCAATCGCGCCGGAGAGTTGAGGAAGAATGCAGTACGGATCCGTATCTGTTCTCTCCCAAGGCGCCGAGGTGTTCCAGTCGATGGAGAGAGCTTCTAGGGCTCAAGAAGCTGTACCAGAGCAGCAGCAATGGCAATGGCAATGGCAGCGCCAAAACCGAAAATCACAAAACGACGTCATCGTCATCGTCGTCTTACTTCTCGGAAGCGAATTCGAAGGCGCTGAGGTATTTCCTCCACCGGAGTTCGAAATCATCGTTGTCGTCTTCGTTCGATTCGTCTCTGAGCCTTCCATTGCTGAAGGATTCCGACAGTGAGTCTGTTTCGCTGTCTTCTTCCCGTGTATCTCTTTCCTCTTCTTCCTCAGGTCACGAACACGAAGATCTTCACAGACTCTCGCTCGATTACGAAAACAAGCCAAACACGAATCCGATTTCCCTCCATCGGAACCCTAATAATAACAATCCACCCCGGATGAGACAGGTAAAACCTCGGCCTAAATCGGAGTCCAATCCAAGATCACCAATGGATCATCATCCGACGGCAACAAGAGTAGGTAGAAGCCCGATGCGGCGTACGCCGGGAGAATCCACCACTAGATTAGCGATCCGAGGAGTATCCGTAGACAGTCCACGAATGAACTCCTCTGGTAAAATCGTATTCCACAACTTGGAGAGAAGCTCGAGCAGTCCAAGCAGCTTCAATGGCGGACCAAAATTCAAGAACAGAGGAATGGAACGGTCATACTCAGCGAACGTGAGAGTAACTCCAGTTCTAAATGTTCCAGTCTGTTCTTCCCTGAGAGGATCCTCAAAATCCGTCTCTGTATTCGGATTCGGTCAGTTATTCTCCAGCACCGGCACCGGCACCGGCGGAAGCAGCAGAAGCCACCAAAGTAGTAGTAGTAATAGTACTAACCGGACGACGTCTAGGCGTATCATCGAAACAGACGGCGGAGGAAAACTCCATTGA

Protein sequence

MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAGESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTRCVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSAKTENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPRPKSESNPRSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPSSFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQLFSSTGTGTGGSSRSHQSSSSNSTNRTTSRRIIETDGGGKLH
Homology
BLAST of Tan0008516 vs. NCBI nr
Match: XP_022980895.1 (uncharacterized protein LOC111480206 [Cucurbita maxima])

HSP 1 Score: 723.8 bits (1867), Expect = 9.3e-205
Identity = 403/451 (89.36%), Postives = 414/451 (91.80%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MASACVNNIGMSPENFLDCSSAPCHSY WLSPRISFSRDDSPPSTNL G I+KPAAD AG
Sbjct: 1   MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLAGLITKPAADPAG 60

Query: 61  ESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTR 120
           ESEIRD DPE+VPVSEFEFRLQDPVALMLPADELFL+GKLVP QVSSVK PSVN LKS R
Sbjct: 61  ESEIRDSDPEVVPVSEFEFRLQDPVALMLPADELFLNGKLVPFQVSSVK-PSVNVLKSMR 120

Query: 121 CVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSAK 180
           CVS PETAAQ RR+VE ECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS    NGSAK
Sbjct: 121 CVSLPETAAQPRRKVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS----NGSAK 180

Query: 181 TENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDSESVSLSS 240
           TENHKTT   S SY SEANSKALRY LHR SKSSL SSFDSSL+LPLLKDSDSESVSLSS
Sbjct: 181 TENHKTT---SPSYASEANSKALRYLLHRRSKSSLLSSFDSSLNLPLLKDSDSESVSLSS 240

Query: 241 SRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPRPKSESNP 300
           SRVSLSSSSSGHE EDLHRL LD+ENKPNTNPISLHRNPNN+NPPRMRQVKPRPKSE NP
Sbjct: 241 SRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNNSNPPRMRQVKPRPKSEMNP 300

Query: 301 RSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS 360
           RS MDHHPTATRVGRSPMR  PGES TRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS
Sbjct: 301 RSTMDHHPTATRVGRSPMRPAPGES-TRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS 360

Query: 361 SFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQLFSSTGTGTGG 420
           SFNGGPK KNRG ERSYSANVR+TPVLNVPVCSSLRGSSKSVSVFGFGQLFS  GTGT G
Sbjct: 361 SFNGGPKLKNRGTERSYSANVRITPVLNVPVCSSLRGSSKSVSVFGFGQLFS--GTGTSG 420

Query: 421 SSRSHQSSSSNSTNRTTSRRIIETDGGGKLH 452
           SSRS+QSS S+STNRTT+RRIIETDGGGKLH
Sbjct: 421 SSRSYQSSGSSSTNRTTTRRIIETDGGGKLH 440

BLAST of Tan0008516 vs. NCBI nr
Match: KAG6600467.1 (hypothetical protein SDJN03_05700, partial [Cucurbita argyrosperma subsp. sororia] >KAG7031114.1 hypothetical protein SDJN02_05153, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 719.9 bits (1857), Expect = 1.3e-203
Identity = 403/452 (89.16%), Postives = 416/452 (92.04%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MASACVNNIGMSPENFLDCSSAPCHSY WLSPRISFSRDDSPPSTNL G I+KPAAD AG
Sbjct: 1   MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLAGLITKPAADPAG 60

Query: 61  ESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTR 120
           ESEIRD DPELVPVSEFEFRLQDPVALMLPADELFL+GKLVP QVSSVK PSVN LKS R
Sbjct: 61  ESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKLVPFQVSSVK-PSVNVLKSMR 120

Query: 121 CVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSAK 180
           CVSSPETAAQSRR VE ECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG+G    K
Sbjct: 121 CVSSPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSG----K 180

Query: 181 TENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDSESVSLSS 240
           TENHKTT   S SY SEANSKALRY LHR SKSSLSSSF+SSL+LPLLKDSDSESVSLSS
Sbjct: 181 TENHKTT---SPSYASEANSKALRYLLHRRSKSSLSSSFESSLNLPLLKDSDSESVSLSS 240

Query: 241 SRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPRPKSESNP 300
           SRVSLSSSSSGHE EDLHRL LD+ENKPNTNPISLHRNPNN+NPPRMRQVKPRPKSE NP
Sbjct: 241 SRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNNSNPPRMRQVKPRPKSEMNP 300

Query: 301 RSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS 360
           RS MDHHPTATRVGRSPMR  PGES TRLAIRGVSVDSPR+NS GKIVFHNLERSSSSPS
Sbjct: 301 RSTMDHHPTATRVGRSPMRPAPGES-TRLAIRGVSVDSPRINSCGKIVFHNLERSSSSPS 360

Query: 361 SFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQLFSSTGTGTGG 420
           SFNGGPK KNRG ERSYSANVR+TPVLNVPVCSSLRGSSKSVSVFGFGQLFSS  TGT G
Sbjct: 361 SFNGGPKLKNRGTERSYSANVRITPVLNVPVCSSLRGSSKSVSVFGFGQLFSS--TGTSG 420

Query: 421 SSRSHQ-SSSSNSTNRTTSRRIIETDGGGKLH 452
           SSRS+Q SSSS+STNRTT+RRI+ETDGGGKLH
Sbjct: 421 SSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH 441

BLAST of Tan0008516 vs. NCBI nr
Match: XP_022942648.1 (uncharacterized protein LOC111447620 [Cucurbita moschata])

HSP 1 Score: 716.8 bits (1849), Expect = 1.1e-202
Identity = 402/452 (88.94%), Postives = 414/452 (91.59%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MASACVNNIGMSPENFLDCSSAPCHSY WLSPRISFSRDDSPPSTNL G I+KPAAD AG
Sbjct: 1   MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAG 60

Query: 61  ESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTR 120
           ESEIRD DPELVPVSEFEFRLQDPVALMLPADELFL+GKLVP QVSSVK PSVN LKS R
Sbjct: 61  ESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKLVPFQVSSVK-PSVNVLKSMR 120

Query: 121 CVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSAK 180
           CVS PETAAQSRR VE ECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS    NGSAK
Sbjct: 121 CVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS----NGSAK 180

Query: 181 TENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDSESVSLSS 240
           TENHKTT   S SY SEANSKALRY LHR SKSSLSSSFDSSLSLPLLKDSDSESVSLSS
Sbjct: 181 TENHKTT---SPSYASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSS 240

Query: 241 SRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPRPKSESNP 300
           SRVSLSSSSSGHE EDLHRL LD+ENKPNTNPISLHRNPN +NPPRMRQVKPRPKSE NP
Sbjct: 241 SRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNP 300

Query: 301 RSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS 360
           RS MDHHPTATRVGRSPMR  PGES TRLA+RGVSVDSPR+NS GKIVFHNLERSSSSPS
Sbjct: 301 RSTMDHHPTATRVGRSPMRPAPGES-TRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPS 360

Query: 361 SFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQLFSSTGTGTGG 420
           SFNGGPK KNRG ERSYSANVR+TPVL+VPVCSSLRGSSKSVSVFGFGQLFSS  TGT G
Sbjct: 361 SFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQLFSS--TGTSG 420

Query: 421 SSRSHQ-SSSSNSTNRTTSRRIIETDGGGKLH 452
           SSRS+Q SSSS+STNRTT+RRI+ETDGGGKLH
Sbjct: 421 SSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH 441

BLAST of Tan0008516 vs. NCBI nr
Match: XP_023549710.1 (uncharacterized protein LOC111808127 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 716.5 bits (1848), Expect = 1.5e-202
Identity = 402/452 (88.94%), Postives = 415/452 (91.81%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MASACVNNIGMSPENFLDCSSAPCHSY WLSPRISFSRDDSPPSTNL G ++KPAAD AG
Sbjct: 1   MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLAGLMTKPAADPAG 60

Query: 61  ESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTR 120
           ESEIRD DPELVPVSEFEFRLQDPVALMLPADELFL+GKLVP +VSSVK PSVN LKS R
Sbjct: 61  ESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKLVPFRVSSVK-PSVNVLKSMR 120

Query: 121 CVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSAK 180
           CVSSPETAAQSRR VE ECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS    NGSAK
Sbjct: 121 CVSSPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS----NGSAK 180

Query: 181 TENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDSESVSLSS 240
           TENHKTT   S SY SEANSKALRY LHR SKSSLSSSFDSSL+LPLLKDSDSESVSLSS
Sbjct: 181 TENHKTT---SPSYASEANSKALRYLLHRRSKSSLSSSFDSSLNLPLLKDSDSESVSLSS 240

Query: 241 SRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPRPKSESNP 300
           SRVSLSSSSSGHE EDLHRL LD+ENKPNTNPISLHRNPNN+NPPRMRQVKPRPKSE NP
Sbjct: 241 SRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNNSNPPRMRQVKPRPKSEMNP 300

Query: 301 RSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS 360
           RS MDHH TATRVGRSPMR  PGES TRLAIRGVSVDSPR+NS GKIVFHNLERSSSSPS
Sbjct: 301 RSTMDHHTTATRVGRSPMRPAPGES-TRLAIRGVSVDSPRINSCGKIVFHNLERSSSSPS 360

Query: 361 SFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQLFSSTGTGTGG 420
           SFNGGPK KNRG ERSYSANVR+TPVLNVPVCSSLRGSSKSVSVFGFGQLFS+  TGT G
Sbjct: 361 SFNGGPKLKNRGTERSYSANVRITPVLNVPVCSSLRGSSKSVSVFGFGQLFSN--TGTSG 420

Query: 421 SSRSHQ-SSSSNSTNRTTSRRIIETDGGGKLH 452
           SSRS+Q SSSS+STNRTT+RRIIETDGGGKLH
Sbjct: 421 SSRSYQSSSSSSSTNRTTTRRIIETDGGGKLH 441

BLAST of Tan0008516 vs. NCBI nr
Match: XP_023551901.1 (uncharacterized protein LOC111809733 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 712.2 bits (1837), Expect = 2.8e-201
Identity = 402/463 (86.83%), Postives = 424/463 (91.58%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSR---DDSPPSTNLPGPIS--KPA 60
           MASACVN++GMSPENFLDCSSAPCHSY WLSPR+SFSR   DDS PS+NL  PIS  KP 
Sbjct: 1   MASACVNSVGMSPENFLDCSSAPCHSYGWLSPRVSFSRDFSDDSSPSSNLARPISKPKPG 60

Query: 61  ADVAGESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNG 120
           AD AG+SEIRDPDPELVPVSEFEF L+DPVALMLPADELFLDGKLVPLQVSSVK PSVNG
Sbjct: 61  ADPAGKSEIRDPDPELVPVSEFEFCLKDPVALMLPADELFLDGKLVPLQVSSVK-PSVNG 120

Query: 121 LKSTRCVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG-- 180
           LKSTRCVSSPET  Q+RRRVE+EC+TDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG  
Sbjct: 121 LKSTRCVSSPETVVQARRRVEDECNTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNN 180

Query: 181 NGNGSAKTENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDS 240
           N NG AK ENHKTT++SSSSYFSEANSKAL+YFLHR+SKSSL+SSFDSSLSLPLLKDSDS
Sbjct: 181 NSNGGAKNENHKTTTTSSSSYFSEANSKALKYFLHRNSKSSLASSFDSSLSLPLLKDSDS 240

Query: 241 ESVSLSSSRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPR 300
           ESVSLSSSRVSLSSSSSGHEHEDLHRLSLD ENKPN NPISLHRNPN+NNPPRMR VKPR
Sbjct: 241 ESVSLSSSRVSLSSSSSGHEHEDLHRLSLDCENKPNKNPISLHRNPNHNNPPRMRLVKPR 300

Query: 301 PKSESNPR--SPMDHHPTATRVGRSPMRRTPGE-STTRL-AIRGVSVDSPRMNSSGKIVF 360
           PKSE+NPR  S +DHHPTATRVGRSPMRRTPGE S++RL  IRGVSVDSPRMNSSGKIVF
Sbjct: 301 PKSETNPRSTSTVDHHPTATRVGRSPMRRTPGESSSSRLGGIRGVSVDSPRMNSSGKIVF 360

Query: 361 HNLERSSSSPSSFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQ 420
           HNLERSSSSPS+FNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKS SVFGFG 
Sbjct: 361 HNLERSSSSPSTFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSASVFGFGP 420

Query: 421 LFSSTGTGTGGSSRSHQ-SSSSNSTNRTTSRRIIETDGGGKLH 452
           LF    TGTGG  RSHQ SSSS+STNRTT+RRI ETDGGGKLH
Sbjct: 421 LF----TGTGG--RSHQNSSSSSSTNRTTTRRINETDGGGKLH 456

BLAST of Tan0008516 vs. ExPASy TrEMBL
Match: A0A6J1IXV4 (uncharacterized protein LOC111480206 OS=Cucurbita maxima OX=3661 GN=LOC111480206 PE=4 SV=1)

HSP 1 Score: 723.8 bits (1867), Expect = 4.5e-205
Identity = 403/451 (89.36%), Postives = 414/451 (91.80%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MASACVNNIGMSPENFLDCSSAPCHSY WLSPRISFSRDDSPPSTNL G I+KPAAD AG
Sbjct: 1   MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLAGLITKPAADPAG 60

Query: 61  ESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTR 120
           ESEIRD DPE+VPVSEFEFRLQDPVALMLPADELFL+GKLVP QVSSVK PSVN LKS R
Sbjct: 61  ESEIRDSDPEVVPVSEFEFRLQDPVALMLPADELFLNGKLVPFQVSSVK-PSVNVLKSMR 120

Query: 121 CVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSAK 180
           CVS PETAAQ RR+VE ECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS    NGSAK
Sbjct: 121 CVSLPETAAQPRRKVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS----NGSAK 180

Query: 181 TENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDSESVSLSS 240
           TENHKTT   S SY SEANSKALRY LHR SKSSL SSFDSSL+LPLLKDSDSESVSLSS
Sbjct: 181 TENHKTT---SPSYASEANSKALRYLLHRRSKSSLLSSFDSSLNLPLLKDSDSESVSLSS 240

Query: 241 SRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPRPKSESNP 300
           SRVSLSSSSSGHE EDLHRL LD+ENKPNTNPISLHRNPNN+NPPRMRQVKPRPKSE NP
Sbjct: 241 SRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNNSNPPRMRQVKPRPKSEMNP 300

Query: 301 RSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS 360
           RS MDHHPTATRVGRSPMR  PGES TRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS
Sbjct: 301 RSTMDHHPTATRVGRSPMRPAPGES-TRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS 360

Query: 361 SFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQLFSSTGTGTGG 420
           SFNGGPK KNRG ERSYSANVR+TPVLNVPVCSSLRGSSKSVSVFGFGQLFS  GTGT G
Sbjct: 361 SFNGGPKLKNRGTERSYSANVRITPVLNVPVCSSLRGSSKSVSVFGFGQLFS--GTGTSG 420

Query: 421 SSRSHQSSSSNSTNRTTSRRIIETDGGGKLH 452
           SSRS+QSS S+STNRTT+RRIIETDGGGKLH
Sbjct: 421 SSRSYQSSGSSSTNRTTTRRIIETDGGGKLH 440

BLAST of Tan0008516 vs. ExPASy TrEMBL
Match: A0A6J1FVC0 (uncharacterized protein LOC111447620 OS=Cucurbita moschata OX=3662 GN=LOC111447620 PE=4 SV=1)

HSP 1 Score: 716.8 bits (1849), Expect = 5.5e-203
Identity = 402/452 (88.94%), Postives = 414/452 (91.59%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MASACVNNIGMSPENFLDCSSAPCHSY WLSPRISFSRDDSPPSTNL G I+KPAAD AG
Sbjct: 1   MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAG 60

Query: 61  ESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTR 120
           ESEIRD DPELVPVSEFEFRLQDPVALMLPADELFL+GKLVP QVSSVK PSVN LKS R
Sbjct: 61  ESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKLVPFQVSSVK-PSVNVLKSMR 120

Query: 121 CVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSAK 180
           CVS PETAAQSRR VE ECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS    NGSAK
Sbjct: 121 CVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS----NGSAK 180

Query: 181 TENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDSESVSLSS 240
           TENHKTT   S SY SEANSKALRY LHR SKSSLSSSFDSSLSLPLLKDSDSESVSLSS
Sbjct: 181 TENHKTT---SPSYASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSS 240

Query: 241 SRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPRPKSESNP 300
           SRVSLSSSSSGHE EDLHRL LD+ENKPNTNPISLHRNPN +NPPRMRQVKPRPKSE NP
Sbjct: 241 SRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNP 300

Query: 301 RSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFHNLERSSSSPS 360
           RS MDHHPTATRVGRSPMR  PGES TRLA+RGVSVDSPR+NS GKIVFHNLERSSSSPS
Sbjct: 301 RSTMDHHPTATRVGRSPMRPAPGES-TRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPS 360

Query: 361 SFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQLFSSTGTGTGG 420
           SFNGGPK KNRG ERSYSANVR+TPVL+VPVCSSLRGSSKSVSVFGFGQLFSS  TGT G
Sbjct: 361 SFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQLFSS--TGTSG 420

Query: 421 SSRSHQ-SSSSNSTNRTTSRRIIETDGGGKLH 452
           SSRS+Q SSSS+STNRTT+RRI+ETDGGGKLH
Sbjct: 421 SSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH 441

BLAST of Tan0008516 vs. ExPASy TrEMBL
Match: A0A6J1ETX7 (homeobox protein prospero-like OS=Cucurbita moschata OX=3662 GN=LOC111437683 PE=4 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 3.0e-201
Identity = 402/463 (86.83%), Postives = 424/463 (91.58%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSR---DDSPPSTNLPGPIS--KPA 60
           MASACVN++GMSPENFLDCSSAPCHSY WLSPR+SFSR   DDS PS+NL  PIS  KP 
Sbjct: 1   MASACVNSVGMSPENFLDCSSAPCHSYGWLSPRVSFSRDFSDDSSPSSNLARPISKPKPG 60

Query: 61  ADVAGESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNG 120
           AD A +SEIRDPDPELVPVSEFEF LQDPVALMLPADELFLDGKLVPLQVSSVK PSVNG
Sbjct: 61  ADPARKSEIRDPDPELVPVSEFEFCLQDPVALMLPADELFLDGKLVPLQVSSVK-PSVNG 120

Query: 121 LKSTRCVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG-- 180
           LKSTRCVSSPET  Q+RRRVE+EC+TDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG  
Sbjct: 121 LKSTRCVSSPETVVQARRRVEDECNTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNS 180

Query: 181 NGNGSAKTENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDS 240
           N NGSAK ENHKTT++SSSSYFSEANSKAL+YFLHR+SKSSL+SSFDSSLSLPLLKDSDS
Sbjct: 181 NSNGSAKNENHKTTTTSSSSYFSEANSKALKYFLHRNSKSSLASSFDSSLSLPLLKDSDS 240

Query: 241 ESVSLSSSRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPR 300
           ESVSLSSSRVSLSSSSSGHEHEDLHRLSLD ENKPN NPISLHRNPN+NNPPRMR VKPR
Sbjct: 241 ESVSLSSSRVSLSSSSSGHEHEDLHRLSLDCENKPNKNPISLHRNPNHNNPPRMRLVKPR 300

Query: 301 PKSESNPR--SPMDHHPTATRVGRSPMRRTPGE-STTRL-AIRGVSVDSPRMNSSGKIVF 360
           PKSE+NPR  S +DHHPTATRVGRSPMRRTPG+ S++RL  IRGVSVDSPRMNSSGKIVF
Sbjct: 301 PKSETNPRSTSTVDHHPTATRVGRSPMRRTPGDSSSSRLGGIRGVSVDSPRMNSSGKIVF 360

Query: 361 HNLERSSSSPSSFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQ 420
           HNLERSSSSPS+FNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKS SVFGFG 
Sbjct: 361 HNLERSSSSPSTFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSASVFGFGP 420

Query: 421 LFSSTGTGTGGSSRSHQ-SSSSNSTNRTTSRRIIETDGGGKLH 452
           LF    TGTGG  RSHQ SSSS+STNRTT+RRI ETDGGGKLH
Sbjct: 421 LF----TGTGG--RSHQSSSSSSSTNRTTTRRINETDGGGKLH 456

BLAST of Tan0008516 vs. ExPASy TrEMBL
Match: A0A6J1JAD5 (uncharacterized serine-rich protein C215.13 OS=Cucurbita maxima OX=3661 GN=LOC111483163 PE=4 SV=1)

HSP 1 Score: 707.6 bits (1825), Expect = 3.3e-200
Identity = 400/463 (86.39%), Postives = 423/463 (91.36%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSR---DDSPPSTNLPGPIS--KPA 60
           MASACVN++GMSPENFLDCSSAPCHSY WLSPR+SFSR   DDS PS+NL  PIS  KP 
Sbjct: 1   MASACVNSVGMSPENFLDCSSAPCHSYGWLSPRVSFSRDFSDDSSPSSNLARPISKPKPG 60

Query: 61  ADVAGESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNG 120
           AD AG+SEIRDPDPELVPVSEFEF LQDPVALMLPADELFLDGKLVPLQVSSVK PSVNG
Sbjct: 61  ADPAGKSEIRDPDPELVPVSEFEFCLQDPVALMLPADELFLDGKLVPLQVSSVK-PSVNG 120

Query: 121 LKSTRCVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG-- 180
           LKSTRCVSSPE+A Q+RRRVE+EC+TDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG  
Sbjct: 121 LKSTRCVSSPESAVQARRRVEDECNTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNS 180

Query: 181 NGNGSAKTENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSDS 240
           N NG AK ENHKTT++SSSSYFSEANSKAL+YFLHR+SKSSL+SSFDSSLSLPLLKDSDS
Sbjct: 181 NSNGVAKNENHKTTTTSSSSYFSEANSKALKYFLHRNSKSSLTSSFDSSLSLPLLKDSDS 240

Query: 241 ESVSLSSSRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPR 300
           ESVSLSSSRVSLSSSSSGHEHEDLHRLSLD EN PN NPISLHRNPN+NNPPRMR VKPR
Sbjct: 241 ESVSLSSSRVSLSSSSSGHEHEDLHRLSLDCENMPNKNPISLHRNPNHNNPPRMRLVKPR 300

Query: 301 PKSESNPR--SPMDHHPTATRVGRSPMRRTPGE-STTRL-AIRGVSVDSPRMNSSGKIVF 360
           PKSE+NPR  S +DHHPTA RVGRSPMRRTPGE S++RL  IRGVSVDSPRMNSSGKIVF
Sbjct: 301 PKSETNPRSTSTVDHHPTAKRVGRSPMRRTPGESSSSRLGGIRGVSVDSPRMNSSGKIVF 360

Query: 361 HNLERSSSSPSSFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQ 420
           HNLERSSSSPS+FNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKS SVFGFG 
Sbjct: 361 HNLERSSSSPSTFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSASVFGFGP 420

Query: 421 LFSSTGTGTGGSSRSHQ-SSSSNSTNRTTSRRIIETDGGGKLH 452
           LF    TGTGG  RSHQ SSSS+STNRTT+RRI ETDGGGK+H
Sbjct: 421 LF----TGTGG--RSHQSSSSSSSTNRTTTRRINETDGGGKIH 456

BLAST of Tan0008516 vs. ExPASy TrEMBL
Match: A0A5D3CZJ6 (Putative serine-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G001780 PE=4 SV=1)

HSP 1 Score: 704.9 bits (1818), Expect = 2.2e-199
Identity = 399/469 (85.07%), Postives = 420/469 (89.55%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSA-PCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVA 60
           MASACVNN+G+S ENFLDCSS+ PCHSY WL PR+SFSRDDSPPS NL GP+SK     A
Sbjct: 1   MASACVNNVGISSENFLDCSSSVPCHSYGWLGPRLSFSRDDSPPS-NLVGPLSK-TKPAA 60

Query: 61  GESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKST 120
           GESE RDPDPELVPVSEFEFRLQDPV+LMLPADELF DGKLVPLQVSS K PSVNGLKST
Sbjct: 61  GESETRDPDPELVPVSEFEFRLQDPVSLMLPADELFFDGKLVPLQVSSAK-PSVNGLKST 120

Query: 121 RCVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSN------- 180
           RCVSSPET  QSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSN       
Sbjct: 121 RCVSSPETTVQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSG 180

Query: 181 -GNGNGSAKTENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDS 240
            GNGNGSAK ENHKTT++SSSSYFSEANSKAL+YFLHRSSKSSLSSS DSSLSLPLLKDS
Sbjct: 181 SGNGNGSAKNENHKTTTTSSSSYFSEANSKALKYFLHRSSKSSLSSSLDSSLSLPLLKDS 240

Query: 241 DSESVSLSSSRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVK 300
           DSESVSLSSSRVSLSSSSSGHEHEDLHRLSLD ENKPNTNPISLHRNPN+NNPPRMR VK
Sbjct: 241 DSESVSLSSSRVSLSSSSSGHEHEDLHRLSLDCENKPNTNPISLHRNPNHNNPPRMRLVK 300

Query: 301 PRPKSESNPR--SPMDH-HPTATRVGRSPMRRTPGE---STTRLAIRGVSVDSPRMNSSG 360
           PRPKSESNPR  S  DH HP+ATRVGRSP+RRTPGE   S++RL IRGVSVDSPRMNSSG
Sbjct: 301 PRPKSESNPRSTSTADHPHPSATRVGRSPIRRTPGESSSSSSRLGIRGVSVDSPRMNSSG 360

Query: 361 KIVFHNLERSSSSPSSFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVF 420
           KIVFHNLERSSSSPSSFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVF
Sbjct: 361 KIVFHNLERSSSSPSSFNGGPKFKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVF 420

Query: 421 GFGQLFSSTGTGTGGSSRSHQSSSSN---STNRTTSRRIIETDGGGKLH 452
           GFG L     TG GGSSR+HQ+SSSN   S+NRTT+RRI +T+GGGK+H
Sbjct: 421 GFGPLLF---TGNGGSSRNHQNSSSNSSSSSNRTTTRRITDTEGGGKIH 463

BLAST of Tan0008516 vs. TAIR 10
Match: AT1G79060.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G56020.1); Has 3399 Blast hits to 980 proteins in 195 species: Archae - 0; Bacteria - 839; Metazoa - 390; Fungi - 256; Plants - 154; Viruses - 9; Other Eukaryotes - 1751 (source: NCBI BLink). )

HSP 1 Score: 268.5 bits (685), Expect = 9.9e-72
Identity = 213/448 (47.54%), Postives = 264/448 (58.93%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MAS CVNN+ +S +           +Y   +PR SFSRDD            + +  VA 
Sbjct: 1   MASVCVNNVTVSQD---------FPTYGCFNPRASFSRDDG----------GRSSGSVAS 60

Query: 61  ESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTR 120
           E      +   V   +FEFRL++    MLPADELF DGKLV  Q                
Sbjct: 61  EI---PKEETAVGAGDFEFRLEEDPVGMLPADELFSDGKLVTKQQQQ------------- 120

Query: 121 CVSSPETAAQSRR----RVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGN 180
              + E   + RR     +E     D   FSPKAPRCSSRWR+LLGLK+  Q+SS     
Sbjct: 121 --QTTEIGGKCRRMEVVEIEISGGGDNCSFSPKAPRCSSRWRDLLGLKRFSQNSSKSAST 180

Query: 181 GSAKTENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSL--SLPLLKDSDSE 240
            +  T N ++++SS            L+ FLHRSS+SS SSS D+SL  SLPLLKDSDSE
Sbjct: 181 ATTTTTNPRSSTSS------------LKQFLHRSSRSS-SSSSDASLLMSLPLLKDSDSE 240

Query: 241 SVSLSSSRVSLSSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNNNPPRMRQVKPRP 300
           SVS+SSSR+SLSSSSSGH+HEDL RLSLD E     + I+L+ N   N     R + P P
Sbjct: 241 SVSISSSRMSLSSSSSGHDHEDLPRLSLDAERPNQNHIINLNHNLTANPFAPARSLNPNP 300

Query: 301 KSESNPRSPMDHHPTA----TRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFH 360
                PR  + +H T+     RVGRSPMRR+ GE T+ +  RGVSVDSPR+NSSGKIVF 
Sbjct: 301 -----PRMRLVNHSTSGTGGGRVGRSPMRRSGGE-TSAIMNRGVSVDSPRLNSSGKIVFQ 360

Query: 361 NLERSSSSPSSFNGGPK-FKNRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQ 420
           NLERSSSSPSSFNGG   +++RGMERSYS+NVRVTPVLNVPVC S+RG S       FGQ
Sbjct: 361 NLERSSSSPSSFNGGTSGYRHRGMERSYSSNVRVTPVLNVPVC-SIRGGS-----VVFGQ 383

Query: 421 LFSSTGTGTGGSSRSHQSSSSNSTNRTT 438
            FSS+   +  SS  +  + +N+ NR +
Sbjct: 421 FFSSS---SSSSSSQNNRTGNNNNNRAS 383

BLAST of Tan0008516 vs. TAIR 10
Match: AT1G56020.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G12970.1); Has 3011 Blast hits to 958 proteins in 192 species: Archae - 0; Bacteria - 193; Metazoa - 479; Fungi - 286; Plants - 158; Viruses - 8; Other Eukaryotes - 1887 (source: NCBI BLink). )

HSP 1 Score: 247.7 bits (631), Expect = 1.8e-65
Identity = 207/454 (45.59%), Postives = 264/454 (58.15%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MASACV + G+SPE F         SY W SPR+S +RDD+  S+++             
Sbjct: 1   MASACVKSAGVSPEKF--------SSYGWTSPRMSLTRDDNRRSSSV------------- 60

Query: 61  ESEIRDPDPELV-PVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVN----- 120
           + +  DP PE+  PV +FEF L+DPV  ML ADELF DGKLVPL+ S  K  +       
Sbjct: 61  DKQQSDPLPEIQDPVVDFEFCLEDPVT-MLSADELFSDGKLVPLKFSGPKTTTTTTSTTV 120

Query: 121 GLKSTRCVSSPETAAQSRRRVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGN 180
              +T    SPE   +S RR+E E S    LFSPKAPRC++RWRELLGLK+L        
Sbjct: 121 NTTTTEPRGSPE-VLKSCRRLEMEISD---LFSPKAPRCTTRWRELLGLKRLV------- 180

Query: 181 GNGSAKTENHKTTSSSSSSYFSEANSKALRYFLHRSSKSSLSSSFDSSLSLPLLKDSD-S 240
            N   + E+ K +SSSSS   +   + + + FLHR SKSS      ++ S PL K+SD S
Sbjct: 181 -NAKEQEESIKASSSSSS---TNPKTSSFKQFLHRGSKSS------TAASSPLQKESDIS 240

Query: 241 ESVSLSSSRVSL-SSSSSGHEHEDLHRLSLDYENKPNTNPISLHRNPNNN-NPPRMRQVK 300
           ES+S++SSR+SL SSSSS HE +DL RLSLD + KP+ NP +  R  + N N PR+R  K
Sbjct: 241 ESISVASSRLSLSSSSSSSHEIDDLPRLSLDLD-KPSANPFAPSRTHSRNLNQPRIRLAK 300

Query: 301 PRPKSESNPRSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSV--DSPRMNSSGKIVF 360
           PR            +HP +T     P       S+  +  RG++V  DSPR+N+SGKIVF
Sbjct: 301 PR-----------RNHPPST-----PSVDGSSSSSACIESRGLTVTADSPRLNASGKIVF 360

Query: 361 HNLERSSSSPSSFNGGPKFK-NRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFG 420
           H LERSSSSP SF GGP+ K + GM RSYSANVR+TPVLNVPVCS   G         FG
Sbjct: 361 HGLERSSSSPGSFTGGPRMKQHHGMPRSYSANVRITPVLNVPVCSLKSG-------LFFG 387

Query: 421 QLFSSTGTGT------GGSSRSHQSSSSNSTNRT 437
           QLFSS+ + +      G  S+   + S N  NRT
Sbjct: 421 QLFSSSSSSSSSSPSPGNKSQLQSNGSKNRINRT 387

BLAST of Tan0008516 vs. TAIR 10
Match: AT3G12970.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G56020.1); Has 2408 Blast hits to 418 proteins in 91 species: Archae - 0; Bacteria - 41; Metazoa - 198; Fungi - 63; Plants - 125; Viruses - 13; Other Eukaryotes - 1968 (source: NCBI BLink). )

HSP 1 Score: 234.6 bits (597), Expect = 1.6e-61
Identity = 203/456 (44.52%), Postives = 257/456 (56.36%), Query Frame = 0

Query: 1   MASACVNNIGMSPENFLDCSSAPCHSYSWLSPRISFSRDDSPPSTNLPGPISKPAADVAG 60
           MAS CV N+G SP            S+SW S ++S +R+  P               +A 
Sbjct: 1   MASGCVKNVGTSP------------SHSWTSSKMSLTRESQP---------------LAP 60

Query: 61  ESEIRDPDPELVPVSEFEFRLQDPVALMLPADELFLDGKLVPLQVSSVKQPSVNGLKSTR 120
             E  D      PV +FEF L+DPV  ML ADELF DGKLVPL+ S V  P    + S  
Sbjct: 61  ALENED------PVDDFEFLLEDPVT-MLSADELFSDGKLVPLKFSGVTYPEEKPITSV- 120

Query: 121 CVSSPETAAQSRRRVEEECS--TDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGS 180
                 TA +  RR+E E S   DPYLFSP+APRC+ RWRELLGLK+L            
Sbjct: 121 ----VHTAVKPCRRLEMEISGVVDPYLFSPRAPRCTVRWRELLGLKRL------------ 180

Query: 181 AKTENHKTTSSSSSSYFSEANSK--ALRYFLHRSSKSSLSSSFDSSLSLPLLKDSD---S 240
           AKT+   + SSSS    S  N K  + R+FL+RSSKS+           P  KDSD   S
Sbjct: 181 AKTQQEASASSSSRLSSSSPNPKTASFRHFLNRSSKSTAQQPSHP----PPGKDSDILES 240

Query: 241 ESVSLSSSRVSL-SSSSSGHEHEDLHRLSLDYENKPNT-NPISLHRNPNNNNPPRMRQVK 300
            S S+SSSR+SL SSSSSGHE +DL RLSLD +NKP T NP +  R  ++++        
Sbjct: 241 SSTSISSSRLSLSSSSSSGHELDDLPRLSLDLDNKPGTPNPFARSRAHHHHH-------- 300

Query: 301 PRPKSESNPRSPMDHHPTATRVGRSPMRRTPGESTTRLAIRGVSVDSPRMNSSGKIVFHN 360
              ++++ PR P  H    T+V  S       ES+    +  V+ DSPR+N+SGKIVFH 
Sbjct: 301 --LRNQNQPRKPRRH----TQVDEST------ESSIESRVMTVTADSPRLNASGKIVFHG 360

Query: 361 LERSSSSPSSFNGGPKFK-NRGMERSYSANVRVTPVLNVPVCSSLRGSSKSVSVFGFGQL 420
           LERSSSSP +F GGP+ K + GM RS+SANVR+TPVLNVPV SSLR   KS  +F FGQL
Sbjct: 361 LERSSSSPGNFTGGPRMKLHHGMPRSHSANVRITPVLNVPV-SSLRSGPKS-GLF-FGQL 378

Query: 421 FSSTGTGTGGSSRSH--QSSSSNSTNRTTSRRIIET 445
           F+S+ + +  SS  +  Q  S+N  NRT   R+  T
Sbjct: 421 FASSSSASSSSSSGNRAQLQSNNIKNRTNRSRLEPT 378

BLAST of Tan0008516 vs. TAIR 10
Match: AT3G05980.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G19340.1); Has 202 Blast hits to 202 proteins in 28 species: Archae - 0; Bacteria - 0; Metazoa - 39; Fungi - 4; Plants - 148; Viruses - 0; Other Eukaryotes - 11 (source: NCBI BLink). )

HSP 1 Score: 43.5 bits (101), Expect = 5.2e-04
Identity = 61/182 (33.52%), Postives = 84/182 (46.15%), Query Frame = 0

Query: 30  LSPRISFSRDDSPPSTNLPGPISKPAADVAGESEIRDPDPELVPVSEFEFRLQDPVA--L 89
           L PRISFS D S     +         DV   S         V VS+FEF   + V+   
Sbjct: 15  LGPRISFSSDLSDGGDFICITPVMCKEDVVKGS---------VKVSDFEFLSSENVSPQR 74

Query: 90  MLPADELFLDGKLVPL-QVSSVKQPSVNGLKSTRCVSSPETAAQSRR------------- 149
           ML ADELF +GKL+P  QV   ++     LK+    ++ E  A+ R+             
Sbjct: 75  MLTADELFSEGKLLPFWQVKHSEK-----LKNITLKTNEEEEAEKRKVEVKKKDQEINNR 134

Query: 150 --RVEEECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGNGNGSAKTENHKTTSSSS 194
             RV      DP   SP+ P+C+  W+ELL LKK    SS+     +A+T +  + SSS+
Sbjct: 135 DNRVTWFIDEDP---SPRPPKCTVLWKELLRLKKQRNPSSS---PVTARTVSSLSPSSST 176

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022980895.19.3e-20589.36uncharacterized protein LOC111480206 [Cucurbita maxima][more]
KAG6600467.11.3e-20389.16hypothetical protein SDJN03_05700, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022942648.11.1e-20288.94uncharacterized protein LOC111447620 [Cucurbita moschata][more]
XP_023549710.11.5e-20288.94uncharacterized protein LOC111808127 [Cucurbita pepo subsp. pepo][more]
XP_023551901.12.8e-20186.83uncharacterized protein LOC111809733 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1IXV44.5e-20589.36uncharacterized protein LOC111480206 OS=Cucurbita maxima OX=3661 GN=LOC111480206... [more]
A0A6J1FVC05.5e-20388.94uncharacterized protein LOC111447620 OS=Cucurbita moschata OX=3662 GN=LOC1114476... [more]
A0A6J1ETX73.0e-20186.83homeobox protein prospero-like OS=Cucurbita moschata OX=3662 GN=LOC111437683 PE=... [more]
A0A6J1JAD53.3e-20086.39uncharacterized serine-rich protein C215.13 OS=Cucurbita maxima OX=3661 GN=LOC11... [more]
A0A5D3CZJ62.2e-19985.07Putative serine-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca... [more]
Match NameE-valueIdentityDescription
AT1G79060.19.9e-7247.54unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G56020.11.8e-6545.59unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G12970.11.6e-6144.52unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G05980.15.2e-0433.52unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 115..129
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 414..451
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 38..69
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 264..330
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 264..287
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 115..140
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 214..234
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 168..194
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 414..441
NoneNo IPR availablePANTHERPTHR31722OS06G0675200 PROTEINcoord: 1..422
NoneNo IPR availablePANTHERPTHR31722:SF0OS06G0675200 PROTEINcoord: 1..422

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0008516.1Tan0008516.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003677 DNA binding