Tan0010831 (gene) Snake gourd v1

Overview
NameTan0010831
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC103496210
LocationLG01: 8107935 .. 8110102 (+)
RNA-Seq ExpressionTan0010831
SyntenyTan0010831
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATCCCATTTTCATTTTCAGAAGAAACCATGATGGCCTGGCCAATTCCTGTTTTCCCTTTTTATCATTTTCAGTTTTGGTCAAACTTTTCAGTGCCCATTATCCATGGACCCAATCAATTCCTCTCTTCAATCTTCTCCTTCAAGCAATGCCAATGACCCTCTGATTCATCAAGCTTTGGATCTTTTAGAGCTCTCTTGGTTCTTTGATAATCTGCTTCTTAGAAGGAACCCCAGGATGTCTACTTCCCACTCTCATCCTTGTCTTTCTAATGTTGCCCACCAAGTGTTTGTTGAAAGTCCTGTTCCTAATGTCTGCTCATCTGCTTTGGATGGAGATGTTTCCTTCGGGAATGGTGGCGGTAAACGTCGTAATCTGCTCCGAACACCATCGTTACCGTCCCGTATGGATCGAGGGGAAGGAGTTCAAGAGAAGGGAAATGGTTCTAGGCCATTGTTAGAGCATGGTGTTGTGGTTGAAAGTTCTGTTGATGATCAGGTTTGCTCATCTACTTTGGATATGGATGTTTCCATTGGAAATGGTGGTCATAAACATAGGAACTTGGTCCAAACGCCATTGTTACCGTCCCCCGTCGACCGAGAGGAAGGAAAGGCTTCTATGCCATTATCAGAGAATGCTGTGTTTGTTGAAAGTCATGCTGATAATGCCTGCTCTTCTGCTTTGGATATGGATGTTCCACTGGGAAATGGTTGCAGCAAACGTCGGAGTCTGCTCCGAATGCCATCGTTACCGTCCCGTGTGGAACGAGAAGAAGGAATTCGAGAGAAAGGGAATAGTTCCAGGCCATTATTAGAGCATGGTTTGCTTCAAACGCCAGCCAAGCCACCTTATGTAGAGAGGAAAGAAGAGGGAACTCGCAGCAAAGAAAGCAGCAGCACACGGAGGAGCAAATCAGCAAGGAAACCACGACATGGTAATCTGCTAAGAACACCATCTTTGCCACCATGTATTGGAAGAGAAAAAGAATTTGGTGAAAAGGAAGCTGCTGCTAGAATCAGAAACTCTATTCAACCAAACCTTTCTGAATTCTTTCCCTCAAGACGAGAGGTATTACAAAAAAAATTTGTTCTGTTTCAACTGTCTCATTTTCATGTGCGCTCGAATGATATTCGAGCTCACTTATGGTTCAGAGTAAATATAACTAGTTTTTATCTTAATAGATTCAATGTCGAGAATATATTGAAAAGATTTGCATAACAAAGAAGTCTTGCATTCAAACGTTTGCATTGCGTTTTAATCCATTTGTTGAGCTGTATGCTAATGTTTGAGCCTCGATGTGATTGTGGAAAGATTACTTTGGTTTTACCTCAGATGCATAGAGTGACCCTCCAAAGTCGTAGATTCAAGCCATAAATTGAAAATTTTGTAACTTACAATCTTTGTTATCACCAGAATGGTATTAGATAACCGAAGTTTTCTTGGAAATTATTAGGGTCGACCGTGAGCATAATACAGTGACCGAAACCTTCGACATTCTACTTATCAAAAGAATTGACTCTACCCAAATAGCTTAAACTTTTTGGTTTCAAAAGTGACTACCATTAGCAACTTAGTAAACTCAATTGGATTCTTGTGTGAAGAAAGTATATAATCTTGAATCTCTTAAATGAGTTAAACACAATACACTCTTTCAGGATAGAAAGACAAGAATAAACATCATTAGTTCTATTCTTCTTTTCATAATATTTGCTTCCTATTCACAGATTCTTGAAAAGAACTTCAGCCTCCCGATGTGTCGAATCCCGACAAGCAACGACGAAATGTGGCACCAATTTCTCATCCAAATGAGGAGGAGAAGAAGCCAAAGTGAACTTGAATCAGAAGAATTGCAAGGTTTCAAGGACTTGGGATTCACATTTGACAAAAAAGATATAAACCCAACAGTGGTAGACATAATTCCAGGCTTGAGAGAGAAAAAAGAGGAAGAGTTGGAGAGTGAAAGAGCTAGAAGGCCATATCTTTCTGAGGCTTGGATGCTCCAAACTCATCTCCTTCCTCCAATTCCAAAATGGGACACTAGAAATTCTGCAGAAGACATGAAACAACAAATCAAGTTTTGGGCTAGAGCTGTTGCTTCTAATGTGCACTAAAAATGGTGAGAGCTTAAACTTGCATAGATTCCCCCATTTTCTAAAGTCAAATATAGCAC

mRNA sequence

ATCCCATTTTCATTTTCAGAAGAAACCATGATGGCCTGGCCAATTCCTGTTTTCCCTTTTTATCATTTTCAGTTTTGGTCAAACTTTTCAGTGCCCATTATCCATGGACCCAATCAATTCCTCTCTTCAATCTTCTCCTTCAAGCAATGCCAATGACCCTCTGATTCATCAAGCTTTGGATCTTTTAGAGCTCTCTTGGTTCTTTGATAATCTGCTTCTTAGAAGGAACCCCAGGATGTCTACTTCCCACTCTCATCCTTGTCTTTCTAATGTTGCCCACCAAGTGTTTGTTGAAAGTCCTGTTCCTAATGTCTGCTCATCTGCTTTGGATGGAGATGTTTCCTTCGGGAATGGTGGCGGTAAACGTCGTAATCTGCTCCGAACACCATCGTTACCGTCCCGTATGGATCGAGGGGAAGGAGTTCAAGAGAAGGGAAATGGTTCTAGGCCATTGTTAGAGCATGGTGTTGTGGTTGAAAGTTCTGTTGATGATCAGGTTTGCTCATCTACTTTGGATATGGATGTTTCCATTGGAAATGGTGGTCATAAACATAGGAACTTGGTCCAAACGCCATTGTTACCGTCCCCCGTCGACCGAGAGGAAGGAAAGGCTTCTATGCCATTATCAGAGAATGCTGTGTTTGTTGAAAGTCATGCTGATAATGCCTGCTCTTCTGCTTTGGATATGGATGTTCCACTGGGAAATGGTTGCAGCAAACGTCGGAGTCTGCTCCGAATGCCATCGTTACCGTCCCGTGTGGAACGAGAAGAAGGAATTCGAGAGAAAGGGAATAGTTCCAGGCCATTATTAGAGCATGGTTTGCTTCAAACGCCAGCCAAGCCACCTTATGTAGAGAGGAAAGAAGAGGGAACTCGCAGCAAAGAAAGCAGCAGCACACGGAGGAGCAAATCAGCAAGGAAACCACGACATGGTAATCTGCTAAGAACACCATCTTTGCCACCATGTATTGGAAGAGAAAAAGAATTTGGTGAAAAGGAAGCTGCTGCTAGAATCAGAAACTCTATTCAACCAAACCTTTCTGAATTCTTTCCCTCAAGACGAGAGATTCTTGAAAAGAACTTCAGCCTCCCGATGTGTCGAATCCCGACAAGCAACGACGAAATGTGGCACCAATTTCTCATCCAAATGAGGAGGAGAAGAAGCCAAAGTGAACTTGAATCAGAAGAATTGCAAGGTTTCAAGGACTTGGGATTCACATTTGACAAAAAAGATATAAACCCAACAGTGGTAGACATAATTCCAGGCTTGAGAGAGAAAAAAGAGGAAGAGTTGGAGAGTGAAAGAGCTAGAAGGCCATATCTTTCTGAGGCTTGGATGCTCCAAACTCATCTCCTTCCTCCAATTCCAAAATGGGACACTAGAAATTCTGCAGAAGACATGAAACAACAAATCAAGTTTTGGGCTAGAGCTGTTGCTTCTAATGTGCACTAAAAATGGTGAGAGCTTAAACTTGCATAGATTCCCCCATTTTCTAAAGTCAAATATAGCAC

Coding sequence (CDS)

ATGGACCCAATCAATTCCTCTCTTCAATCTTCTCCTTCAAGCAATGCCAATGACCCTCTGATTCATCAAGCTTTGGATCTTTTAGAGCTCTCTTGGTTCTTTGATAATCTGCTTCTTAGAAGGAACCCCAGGATGTCTACTTCCCACTCTCATCCTTGTCTTTCTAATGTTGCCCACCAAGTGTTTGTTGAAAGTCCTGTTCCTAATGTCTGCTCATCTGCTTTGGATGGAGATGTTTCCTTCGGGAATGGTGGCGGTAAACGTCGTAATCTGCTCCGAACACCATCGTTACCGTCCCGTATGGATCGAGGGGAAGGAGTTCAAGAGAAGGGAAATGGTTCTAGGCCATTGTTAGAGCATGGTGTTGTGGTTGAAAGTTCTGTTGATGATCAGGTTTGCTCATCTACTTTGGATATGGATGTTTCCATTGGAAATGGTGGTCATAAACATAGGAACTTGGTCCAAACGCCATTGTTACCGTCCCCCGTCGACCGAGAGGAAGGAAAGGCTTCTATGCCATTATCAGAGAATGCTGTGTTTGTTGAAAGTCATGCTGATAATGCCTGCTCTTCTGCTTTGGATATGGATGTTCCACTGGGAAATGGTTGCAGCAAACGTCGGAGTCTGCTCCGAATGCCATCGTTACCGTCCCGTGTGGAACGAGAAGAAGGAATTCGAGAGAAAGGGAATAGTTCCAGGCCATTATTAGAGCATGGTTTGCTTCAAACGCCAGCCAAGCCACCTTATGTAGAGAGGAAAGAAGAGGGAACTCGCAGCAAAGAAAGCAGCAGCACACGGAGGAGCAAATCAGCAAGGAAACCACGACATGGTAATCTGCTAAGAACACCATCTTTGCCACCATGTATTGGAAGAGAAAAAGAATTTGGTGAAAAGGAAGCTGCTGCTAGAATCAGAAACTCTATTCAACCAAACCTTTCTGAATTCTTTCCCTCAAGACGAGAGATTCTTGAAAAGAACTTCAGCCTCCCGATGTGTCGAATCCCGACAAGCAACGACGAAATGTGGCACCAATTTCTCATCCAAATGAGGAGGAGAAGAAGCCAAAGTGAACTTGAATCAGAAGAATTGCAAGGTTTCAAGGACTTGGGATTCACATTTGACAAAAAAGATATAAACCCAACAGTGGTAGACATAATTCCAGGCTTGAGAGAGAAAAAAGAGGAAGAGTTGGAGAGTGAAAGAGCTAGAAGGCCATATCTTTCTGAGGCTTGGATGCTCCAAACTCATCTCCTTCCTCCAATTCCAAAATGGGACACTAGAAATTCTGCAGAAGACATGAAACAACAAATCAAGTTTTGGGCTAGAGCTGTTGCTTCTAATGTGCACTAA

Protein sequence

MDPINSSLQSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAHQVFVESPVPNVCSSALDGDVSFGNGGGKRRNLLRTPSLPSRMDRGEGVQEKGNGSRPLLEHGVVVESSVDDQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREEGKASMPLSENAVFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNSSRPLLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGREKEFGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRRSQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWMLQTHLLPPIPKWDTRNSAEDMKQQIKFWARAVASNVH
Homology
BLAST of Tan0010831 vs. NCBI nr
Match: XP_038874899.1 (uncharacterized protein LOC120067383 isoform X1 [Benincasa hispida] >XP_038874933.1 uncharacterized protein LOC120067383 isoform X1 [Benincasa hispida] >XP_038874965.1 uncharacterized protein LOC120067383 isoform X1 [Benincasa hispida])

HSP 1 Score: 713.0 bits (1839), Expect = 1.6e-201
Identity = 372/455 (81.76%), Postives = 397/455 (87.25%), Query Frame = 0

Query: 1   MDPINSSL-QSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAH 60
           MDPIN+SL QSSPS   ++PLI +A DLLEL WFFDNLL+RR+PRM  S S PCLS VAH
Sbjct: 1   MDPINASLHQSSPS--RDEPLIDEAFDLLELFWFFDNLLVRRSPRMLISRSDPCLSKVAH 60

Query: 61  QVFVESPVPNVCSSALDGDVSFGNGGGKRRNLLRTPSLPSRMDRGEGVQEKGNGSRPLLE 120
           QVFVESP  N+CSS LDG VS GNGGG RRNLLRTPSLPSRMDRGEG++EK N SRPL+E
Sbjct: 61  QVFVESPPANLCSSQLDGCVSLGNGGGIRRNLLRTPSLPSRMDRGEGIREKENCSRPLVE 120

Query: 121 HGVVVESSVDDQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREE-----GKASMPL 180
           HGV+V   VD+ V SS LDMDVSIGNGG K RNL++TP LPS VDREE     G  + PL
Sbjct: 121 HGVLVGIPVDN-VSSSALDMDVSIGNGGGKCRNLLRTPSLPSRVDREEGIQEKGNDARPL 180

Query: 181 SENAVFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNSSRP 240
           SE+ VF E  ADN C S LDMDV  GNG  KRRSLLRMPSLPS VERE+ IREKGN S+P
Sbjct: 181 SEHGVFAERPADNVCLSTLDMDVSPGNGSGKRRSLLRMPSLPSPVEREQVIREKGNGSKP 240

Query: 241 LLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGREKE 300
           L+EHGLLQ PAKPPYVERKE+GTRSKES STRRSKSARKPR+GNLLRTPSLPPCIGREKE
Sbjct: 241 LIEHGLLQKPAKPPYVERKEDGTRSKESGSTRRSKSARKPRNGNLLRTPSLPPCIGREKE 300

Query: 301 FGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRRS 360
           FGEKEAAARIRNSIQPN SEFFP+R+EILEKNFSLPMCRIPTSNDE+WHQFLIQMRRRRS
Sbjct: 301 FGEKEAAARIRNSIQPNPSEFFPTRQEILEKNFSLPMCRIPTSNDEIWHQFLIQMRRRRS 360

Query: 361 QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWMLQ 420
           QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESER RRPYLSEAWMLQ
Sbjct: 361 QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERTRRPYLSEAWMLQ 420

Query: 421 THLLPPIPKWDTRNSAEDMKQQIKFWARAVASNVH 450
           THLLPPIPKWDTR  AEDMKQQI+FWARAVASNVH
Sbjct: 421 THLLPPIPKWDTRKPAEDMKQQIRFWARAVASNVH 452

BLAST of Tan0010831 vs. NCBI nr
Match: KAG7015030.1 (hypothetical protein SDJN02_22661, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 701.0 bits (1808), Expect = 6.4e-198
Identity = 376/516 (72.87%), Postives = 404/516 (78.29%), Query Frame = 0

Query: 1   MDPINSSLQSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAHQ 60
           MDPIN+SLQSSPSS+   PLI +ALD LEL WFFDNLL  RNPRMSTS S PCLSNVAHQ
Sbjct: 49  MDPINASLQSSPSSDV--PLIDEALDRLELCWFFDNLLTWRNPRMSTSRSDPCLSNVAHQ 108

Query: 61  VFVESPVPNVCSSALDGDVSFGNGGGKRRNLLRTPSLPSRMD-RGEGVQEKGNGSRPLLE 120
           VF ESP  N+CSS LDGDVS  NGGG RRNLLRTPSLPSRMD  GEG++EKG+GSRPLLE
Sbjct: 109 VFDESPAANLCSSDLDGDVSLPNGGGVRRNLLRTPSLPSRMDLGGEGIREKGSGSRPLLE 168

Query: 121 HGVVVESSVDDQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREE-----GKASMPL 180
           H V+VE+   D VCSS LDMDVSIGNGG KHR+L++TP L S VDREE     G  S PL
Sbjct: 169 HDVLVETP-GDNVCSSALDMDVSIGNGGGKHRSLLRTPSLQSRVDREEGIRDKGSGSRPL 228

Query: 181 SENAVFVESHADNACSSALDMDVPLGNGCSKR---------------------------- 240
           SE+ V VE+ ADN CSS+LDMDV +GNGC K                             
Sbjct: 229 SEHDVLVETPADNVCSSSLDMDVSIGNGCCKHRSLLRTPSLPSRVDREEGIQEKGSGSRP 288

Query: 241 ---------------------------------RSLLRMPSLPSRVEREEGIREKGNSSR 300
                                            RSLLRMPSLPSRVERE+GIREKGN S+
Sbjct: 289 LAEHDVLVESPADNVCLSAMDINVSPGYGGGKCRSLLRMPSLPSRVEREQGIREKGNDSK 348

Query: 301 PLLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGREK 360
           PL+EHGLLQ PAKPPYVERKEEGTRSKESSSTR+SKSARKPR+GNLLRTPSLPP IGREK
Sbjct: 349 PLIEHGLLQKPAKPPYVERKEEGTRSKESSSTRKSKSARKPRNGNLLRTPSLPPFIGREK 408

Query: 361 EFGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRR 420
           EFGEKEAAARIRNSIQPNLSEFFP+R+EILEKNFSLP CRIPT++D MWHQFLIQMR+RR
Sbjct: 409 EFGEKEAAARIRNSIQPNLSEFFPTRQEILEKNFSLPTCRIPTNSDRMWHQFLIQMRKRR 468

Query: 421 SQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWML 450
           SQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEEL+SERARRPYLSEAWML
Sbjct: 469 SQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELDSERARRPYLSEAWML 528

BLAST of Tan0010831 vs. NCBI nr
Match: KAG6577008.1 (hypothetical protein SDJN03_24582, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 695.3 bits (1793), Expect = 3.5e-196
Identity = 373/515 (72.43%), Postives = 402/515 (78.06%), Query Frame = 0

Query: 1   MDPINSSLQSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAHQ 60
           MDPIN+SLQSSPSS+   PLI +ALD LEL WFFDNLL  RNPRMSTS S PCLSNVAHQ
Sbjct: 1   MDPINASLQSSPSSDV--PLIDEALDRLELCWFFDNLLTWRNPRMSTSRSDPCLSNVAHQ 60

Query: 61  VFVESPVPNVCSSALDGDVSFGNGGGKRRNLLRTPSLPSRMD-RGEGVQEKGNGSRPLLE 120
           VF ESP  N+CSS LDGDVS  NGGG RRNLLRTPSLPSRMD  GEG+QEKG+GSRPLLE
Sbjct: 61  VFDESPAANLCSSDLDGDVSLPNGGGVRRNLLRTPSLPSRMDLGGEGIQEKGSGSRPLLE 120

Query: 121 HGVVVESSVDDQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDRE-----EGKASMPL 180
           H V+VE+   D VCSS LDMDVSIGNGG KHR+L++TP L S VDRE     +G  S PL
Sbjct: 121 HDVLVETP-GDNVCSSALDMDVSIGNGGGKHRSLLRTPSLQSRVDREGGIRDKGSGSRPL 180

Query: 181 SENAVFVESHADNACSSALDMDVPLGNGCSKR---------------------------- 240
           SE+ V VE+ ADN CSS+LDMDV +GNGC K                             
Sbjct: 181 SEHDVLVETPADNVCSSSLDMDVSIGNGCCKHRSLLRTPSLPSRVDREEGIQEKGSGSRP 240

Query: 241 ---------------------------------RSLLRMPSLPSRVEREEGIREKGNSSR 300
                                            RSLLRMPSLPSRVERE+GIREKGN S+
Sbjct: 241 LAEHDVLVESPADNVCLSAMDINVSPGYGGGKCRSLLRMPSLPSRVEREQGIREKGNDSK 300

Query: 301 PLLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGREK 360
           PL+EHGLLQ PAKPPYVERKEEGTRSKES STR+SKSARKPR+GNLLRTPSLPP IGREK
Sbjct: 301 PLIEHGLLQKPAKPPYVERKEEGTRSKESGSTRKSKSARKPRNGNLLRTPSLPPFIGREK 360

Query: 361 EFGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRR 420
           EFGEKEAAARIRNSIQPNLSEFFP+R+EILEKNFSLP CRIPT++D MWHQFLIQMR+RR
Sbjct: 361 EFGEKEAAARIRNSIQPNLSEFFPTRQEILEKNFSLPTCRIPTNSDRMWHQFLIQMRKRR 420

Query: 421 SQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWML 449
           SQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEEL+SERARRPYLSEAWML
Sbjct: 421 SQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELDSERARRPYLSEAWML 480

BLAST of Tan0010831 vs. NCBI nr
Match: XP_023551849.1 (uncharacterized protein LOC111809699 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023551850.1 uncharacterized protein LOC111809699 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 692.6 bits (1786), Expect = 2.3e-195
Identity = 370/511 (72.41%), Postives = 402/511 (78.67%), Query Frame = 0

Query: 1   MDPINSSLQSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAHQ 60
           MDPIN+SLQSSPSS+   PLI +ALD LEL WFFDNLL  RNPRMSTS S PCLSNVAHQ
Sbjct: 1   MDPINASLQSSPSSDV--PLIDEALDRLELCWFFDNLLTWRNPRMSTSRSDPCLSNVAHQ 60

Query: 61  VFVESPVPNVCSSALDGDVSFGNGGGKRRNLLRTPSLPSRMD-RGEGVQEKGNGSRPLLE 120
           VF ESP  N+CSS LDGDVS  NGGG RRNLLRTPSLPSRMD  GEG++EKG+GSRPLLE
Sbjct: 61  VFDESPAANLCSSDLDGDVSLPNGGGVRRNLLRTPSLPSRMDLGGEGIREKGSGSRPLLE 120

Query: 121 HGVVVESSVD-------------------------------------------------- 180
           H V+VE+  D                                                  
Sbjct: 121 HDVLVETPADNVCSSALDMDVSIGNGGGKHRSLLRTPSLQSEEGIRDKGSGSRPLSEHDV 180

Query: 181 ------DQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREE-----GKASMPLSENA 240
                 D VCSS+LDM+VSIGNGG KHR+L++TP LPS VDREE     G  S  L+E+ 
Sbjct: 181 LVETPADNVCSSSLDMNVSIGNGGRKHRSLLRTPSLPSRVDREEGIQEKGSGSRSLAEHD 240

Query: 241 VFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNSSRPLLEH 300
           V VES ADNAC SA+D++V  G G  K RSLLRMPSLPSRVERE+GI+EKGN S+PL+EH
Sbjct: 241 VLVESPADNACLSAMDINVSPGYGGGKCRSLLRMPSLPSRVEREQGIQEKGNDSKPLIEH 300

Query: 301 GLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGREKEFGEK 360
           GLLQ PAKPPYVERKEEGTRSKES STR+SKSARKPR+GNLLRTPSLPP IGREKEFGEK
Sbjct: 301 GLLQKPAKPPYVERKEEGTRSKESGSTRKSKSARKPRNGNLLRTPSLPPFIGREKEFGEK 360

Query: 361 EAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRRSQSEL 420
           EAAARIRNSIQPNLSEFFP+R+EILEKNFSLP CRIPT++D MWHQFLIQMR+RRSQSEL
Sbjct: 361 EAAARIRNSIQPNLSEFFPTRQEILEKNFSLPTCRIPTNSDRMWHQFLIQMRKRRSQSEL 420

Query: 421 ESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWMLQTHLL 450
           ESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEEL+SER RRPYLSEAWMLQTHLL
Sbjct: 421 ESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELDSERTRRPYLSEAWMLQTHLL 480

BLAST of Tan0010831 vs. NCBI nr
Match: XP_022922903.1 (uncharacterized protein LOC111430741 [Cucurbita moschata] >XP_022922904.1 uncharacterized protein LOC111430741 [Cucurbita moschata])

HSP 1 Score: 692.6 bits (1786), Expect = 2.3e-195
Identity = 371/515 (72.04%), Postives = 402/515 (78.06%), Query Frame = 0

Query: 1   MDPINSSLQSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAHQ 60
           MDPIN+SLQSSPSS+   PLI +ALD LEL WFFDNLL  RNPRMSTS S PCLSNVAHQ
Sbjct: 1   MDPINASLQSSPSSDV--PLIDEALDRLELCWFFDNLLTWRNPRMSTSRSDPCLSNVAHQ 60

Query: 61  VFVESPVPNVCSSALDGDVSFGNGGGKRRNLLRTPSLPSRMD-RGEGVQEKGNGSRPLLE 120
           VF ESP  N+CSS LDGDVS  NGGG RRNLLRTPSLPSR+D  GEG++EKG+GSRPLLE
Sbjct: 61  VFDESPAANLCSSDLDGDVSLPNGGGVRRNLLRTPSLPSRIDLGGEGIREKGSGSRPLLE 120

Query: 121 HGVVVESSVD-------------------------------------------------- 180
           H V+VE+  D                                                  
Sbjct: 121 HDVLVETPGDNVCSSALDMDVSIGNGGGKHRSLLRTPSLQSRVDREEGIRDKGSGSRPLS 180

Query: 181 ----------DQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREE-----GKASMPL 240
                     D VCSS+LDMDVSIGNGG KHR+L++TP LPS VDREE     G  S PL
Sbjct: 181 EHDVLVATPADNVCSSSLDMDVSIGNGGCKHRSLLRTPSLPSRVDREEGIQEKGSGSRPL 240

Query: 241 SENAVFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNSSRP 300
           +E+ V VES ADN C SA+D++V  G G  K RSLLRMPSLPSRVERE+GIREKGN S+P
Sbjct: 241 AEHDVLVESPADNVCLSAMDINVSPGYGGGKCRSLLRMPSLPSRVEREQGIREKGNDSKP 300

Query: 301 LLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGREKE 360
           L+EHGLLQ PAKPPYVERKEEGTRSKES STR+SKSARKPR+GNLLRTPSLPP IGREKE
Sbjct: 301 LIEHGLLQKPAKPPYVERKEEGTRSKESGSTRKSKSARKPRNGNLLRTPSLPPFIGREKE 360

Query: 361 FGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRRS 420
           FGEKEAAARIRNSIQPNLSEFFP+R+EILEKNFSLPMCRIPT++D MWHQFLIQMR+RRS
Sbjct: 361 FGEKEAAARIRNSIQPNLSEFFPTRQEILEKNFSLPMCRIPTNSDRMWHQFLIQMRKRRS 420

Query: 421 QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWMLQ 450
           QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEEL+SER RRPYLSEAWMLQ
Sbjct: 421 QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELDSERTRRPYLSEAWMLQ 480

BLAST of Tan0010831 vs. ExPASy TrEMBL
Match: A0A6J1E4T2 (uncharacterized protein LOC111430741 OS=Cucurbita moschata OX=3662 GN=LOC111430741 PE=4 SV=1)

HSP 1 Score: 692.6 bits (1786), Expect = 1.1e-195
Identity = 371/515 (72.04%), Postives = 402/515 (78.06%), Query Frame = 0

Query: 1   MDPINSSLQSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAHQ 60
           MDPIN+SLQSSPSS+   PLI +ALD LEL WFFDNLL  RNPRMSTS S PCLSNVAHQ
Sbjct: 1   MDPINASLQSSPSSDV--PLIDEALDRLELCWFFDNLLTWRNPRMSTSRSDPCLSNVAHQ 60

Query: 61  VFVESPVPNVCSSALDGDVSFGNGGGKRRNLLRTPSLPSRMD-RGEGVQEKGNGSRPLLE 120
           VF ESP  N+CSS LDGDVS  NGGG RRNLLRTPSLPSR+D  GEG++EKG+GSRPLLE
Sbjct: 61  VFDESPAANLCSSDLDGDVSLPNGGGVRRNLLRTPSLPSRIDLGGEGIREKGSGSRPLLE 120

Query: 121 HGVVVESSVD-------------------------------------------------- 180
           H V+VE+  D                                                  
Sbjct: 121 HDVLVETPGDNVCSSALDMDVSIGNGGGKHRSLLRTPSLQSRVDREEGIRDKGSGSRPLS 180

Query: 181 ----------DQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREE-----GKASMPL 240
                     D VCSS+LDMDVSIGNGG KHR+L++TP LPS VDREE     G  S PL
Sbjct: 181 EHDVLVATPADNVCSSSLDMDVSIGNGGCKHRSLLRTPSLPSRVDREEGIQEKGSGSRPL 240

Query: 241 SENAVFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNSSRP 300
           +E+ V VES ADN C SA+D++V  G G  K RSLLRMPSLPSRVERE+GIREKGN S+P
Sbjct: 241 AEHDVLVESPADNVCLSAMDINVSPGYGGGKCRSLLRMPSLPSRVEREQGIREKGNDSKP 300

Query: 301 LLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGREKE 360
           L+EHGLLQ PAKPPYVERKEEGTRSKES STR+SKSARKPR+GNLLRTPSLPP IGREKE
Sbjct: 301 LIEHGLLQKPAKPPYVERKEEGTRSKESGSTRKSKSARKPRNGNLLRTPSLPPFIGREKE 360

Query: 361 FGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRRS 420
           FGEKEAAARIRNSIQPNLSEFFP+R+EILEKNFSLPMCRIPT++D MWHQFLIQMR+RRS
Sbjct: 361 FGEKEAAARIRNSIQPNLSEFFPTRQEILEKNFSLPMCRIPTNSDRMWHQFLIQMRKRRS 420

Query: 421 QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWMLQ 450
           QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEEL+SER RRPYLSEAWMLQ
Sbjct: 421 QSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELDSERTRRPYLSEAWMLQ 480

BLAST of Tan0010831 vs. ExPASy TrEMBL
Match: A0A6J1J9E4 (uncharacterized protein LOC111482920 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482920 PE=4 SV=1)

HSP 1 Score: 684.9 bits (1766), Expect = 2.3e-193
Identity = 367/518 (70.85%), Postives = 403/518 (77.80%), Query Frame = 0

Query: 1   MDPINSSLQSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAHQ 60
           MDPIN+SLQSSPSS+   PLI +ALD LEL WFFDNLL  RNPRMSTS S PCLSNVAHQ
Sbjct: 1   MDPINASLQSSPSSDV--PLIDEALDRLELCWFFDNLLTWRNPRMSTSRSDPCLSNVAHQ 60

Query: 61  VFVESPVP---------------------------------------------------- 120
           VF ESP P                                                    
Sbjct: 61  VFDESPSPAANLCSSDLDGDVSLPNGGRVRRNLLRTPSLPSRMDLGGEGIREKGSCSRPL 120

Query: 121 ------------NVCSSALDGDVSFGNGGGKRRNLLRTPSLPSRMDRGEGVQEKGNGSRP 180
                       NVCSS+LD DVS GNGGGK+R+LLRTPSL SR+DR EG+++KG+GS+P
Sbjct: 121 LEHDVLVETPGDNVCSSSLDMDVSIGNGGGKQRSLLRTPSLQSRVDREEGIRDKGSGSKP 180

Query: 181 LLEHGVVVESSVDDQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREE-----GKAS 240
           L EH V+VE+  D+ VCSS+LDMDVSIGNGG KHR+L++TP LPS VD+EE     G  S
Sbjct: 181 LSEHDVLVETPADN-VCSSSLDMDVSIGNGGRKHRSLLRTPSLPSHVDQEEGIQEKGSGS 240

Query: 241 MPLSENAVFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNS 300
            PLSE+ V VES ADNAC SA+D++V LG G  K RSLLRMPSLPSRVERE+GIREKGN 
Sbjct: 241 RPLSEHDVLVESPADNACLSAMDINVSLGYGGGKCRSLLRMPSLPSRVEREQGIREKGND 300

Query: 301 SRPLLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGR 360
           S+PL+EH LLQ PAKPPYVERKEEGTRSKES STR+SKSARKPR+GNLLRTPSLPP IGR
Sbjct: 301 SKPLIEHVLLQKPAKPPYVERKEEGTRSKESGSTRKSKSARKPRNGNLLRTPSLPPFIGR 360

Query: 361 EKEFGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRR 420
           EKEFGEKEAAARIRNSIQPNLSEFFP+R+EILEKNFSLP CRIPT++D MWHQFLIQMR+
Sbjct: 361 EKEFGEKEAAARIRNSIQPNLSEFFPTRQEILEKNFSLPTCRIPTNSDRMWHQFLIQMRK 420

Query: 421 RRSQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAW 450
           RRSQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEEL+SER RRPYLSEAW
Sbjct: 421 RRSQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELDSERTRRPYLSEAW 480

BLAST of Tan0010831 vs. ExPASy TrEMBL
Match: A0A5A7UXD2 (DUF3082 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold803G00340 PE=4 SV=1)

HSP 1 Score: 671.8 bits (1732), Expect = 2.0e-189
Identity = 354/457 (77.46%), Postives = 388/457 (84.90%), Query Frame = 0

Query: 1   MDPINSSL-QSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAH 60
           MDPIN SL QSSPSS+A  PLI +ALDLLEL WFFDNLLLRRNPRM  S S PCLS + H
Sbjct: 1   MDPINPSLHQSSPSSDA--PLIDEALDLLELFWFFDNLLLRRNPRMLISRSDPCLSKLPH 60

Query: 61  QVFVESPVPNVCSSALDGDVSFGNGGG--KRRNLLRTPSLPSRMDRGEGVQEKGNGSRPL 120
           QVFVE+P  N+ S ALD  VS  N G    RRNLLRTPSLPSRM RG+G++E+GNGSRPL
Sbjct: 61  QVFVETPPTNLRSPALDAGVSLQNNGDGVVRRNLLRTPSLPSRMYRGQGIREEGNGSRPL 120

Query: 121 LEHGVVVESSVDDQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREEG-----KASM 180
           +EH V++E+ VD+ VCSS+LDMDVS GN   K RNL++TP LP  V++ EG       + 
Sbjct: 121 VEHCVLLETPVDN-VCSSSLDMDVSSGNPAGKCRNLLRTPSLPPRVEQGEGIKEKVNDAG 180

Query: 181 PLSENAVFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNSS 240
           PLSE+ VF E  ADNAC S LDM    GN   KRRSL R+PSLPSRVERE+GI+EKGN S
Sbjct: 181 PLSEHGVFAERPADNACLSTLDMGFSPGNSGDKRRSLRRIPSLPSRVEREQGIQEKGNGS 240

Query: 241 RPLLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGRE 300
           +PL+EHGLLQ PAKPPYVERKEEGTR KES STRRSKSARKP   NLLRTPSLPPCIGRE
Sbjct: 241 KPLIEHGLLQKPAKPPYVERKEEGTRCKESGSTRRSKSARKPPQSNLLRTPSLPPCIGRE 300

Query: 301 KEFGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRR 360
           KEFGE+EAAARIRNSIQPNLSEFFP+R+EILEKNFSLPMCRIPTS+DE+WHQFLIQMR+R
Sbjct: 301 KEFGEREAAARIRNSIQPNLSEFFPTRQEILEKNFSLPMCRIPTSSDEVWHQFLIQMRKR 360

Query: 361 RSQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWM 420
           RSQSELESEE+QGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESER RRPYLSEAWM
Sbjct: 361 RSQSELESEEVQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERTRRPYLSEAWM 420

Query: 421 LQTHLLPPIPKWDTRNSAEDMKQQIKFWARAVASNVH 450
           LQTHLLPPIPKWDTR SAEDMKQQIKFWARAVASN+H
Sbjct: 421 LQTHLLPPIPKWDTRKSAEDMKQQIKFWARAVASNLH 454

BLAST of Tan0010831 vs. ExPASy TrEMBL
Match: A0A5D3CIY0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold227G001080 PE=4 SV=1)

HSP 1 Score: 671.8 bits (1732), Expect = 2.0e-189
Identity = 354/457 (77.46%), Postives = 388/457 (84.90%), Query Frame = 0

Query: 1   MDPINSSL-QSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAH 60
           MDPIN SL QSSPSS+A  PLI +ALDLLEL WFFDNLLLRRNPRM  S S PCLS + H
Sbjct: 1   MDPINPSLHQSSPSSDA--PLIDEALDLLELFWFFDNLLLRRNPRMLISRSDPCLSKLPH 60

Query: 61  QVFVESPVPNVCSSALDGDVSFGNGGG--KRRNLLRTPSLPSRMDRGEGVQEKGNGSRPL 120
           QVFVE+P  N+ S ALD  VS  N G    RRNLLRTPSLPSRM RG+G++E+GNGSRPL
Sbjct: 61  QVFVETPPTNLRSPALDAGVSLQNNGDGVVRRNLLRTPSLPSRMYRGQGIREEGNGSRPL 120

Query: 121 LEHGVVVESSVDDQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREEG-----KASM 180
           +EH V++E+ VD+ VCSS+LDMDVS GN   K RNL++TP LP  V++ EG       + 
Sbjct: 121 VEHCVLLETPVDN-VCSSSLDMDVSSGNPAGKCRNLLRTPSLPPRVEQGEGIKEKVNDAG 180

Query: 181 PLSENAVFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNSS 240
           PLSE+ VF E  ADNAC S LDM    GN   +RRSL R+PSLPSRVERE+GI+EKGN S
Sbjct: 181 PLSEHGVFAERPADNACLSTLDMGFSPGNSGDRRRSLRRIPSLPSRVEREQGIQEKGNGS 240

Query: 241 RPLLEHGLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGRE 300
           +PL+EHGLLQ PAKPPYVERKEEGTR KES STRRSKSARKP   NLLRTPSLPPCIGRE
Sbjct: 241 KPLIEHGLLQKPAKPPYVERKEEGTRCKESGSTRRSKSARKPPQSNLLRTPSLPPCIGRE 300

Query: 301 KEFGEKEAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRR 360
           KEFGE+EAAARIRNSIQPNLSEFFP+R+EILEKNFSLPMCRIPTS+DE+WHQFLIQMR+R
Sbjct: 301 KEFGEREAAARIRNSIQPNLSEFFPTRQEILEKNFSLPMCRIPTSSDEVWHQFLIQMRKR 360

Query: 361 RSQSELESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWM 420
           RSQSELESEE+QGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESER RRPYLSEAWM
Sbjct: 361 RSQSELESEEVQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERTRRPYLSEAWM 420

Query: 421 LQTHLLPPIPKWDTRNSAEDMKQQIKFWARAVASNVH 450
           LQTHLLPPIPKWDTR SAEDMKQQIKFWARAVASNVH
Sbjct: 421 LQTHLLPPIPKWDTRKSAEDMKQQIKFWARAVASNVH 454

BLAST of Tan0010831 vs. ExPASy TrEMBL
Match: A0A0A0LC64 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G236040 PE=4 SV=1)

HSP 1 Score: 650.6 bits (1677), Expect = 4.8e-183
Identity = 344/451 (76.27%), Postives = 374/451 (82.93%), Query Frame = 0

Query: 1   MDPINSSL-QSSPSSNANDPLIHQALDLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAH 60
           MDPIN SL QSSPS++       +ALDLLEL WFFDNLLLRRNP+M  S S PCLS + H
Sbjct: 49  MDPINPSLHQSSPSTD-------EALDLLELFWFFDNLLLRRNPKMLISRSDPCLSKLPH 108

Query: 61  QVFVESPVPNVCSSALDGDVSF-GNGGGKRRNLLRTPSLPSRMDRGEGVQEKGNGSRPLL 120
           QVFVE+P  N+CSS LD  +S   NGG  RRNLLRTPSLPSRM RG+G+ E+ N SRPLL
Sbjct: 109 QVFVETPPTNLCSSPLDAALSLHNNGGAVRRNLLRTPSLPSRMYRGQGIPEERNDSRPLL 168

Query: 121 EHGVVVESSVDDQVCSSTLDMDVSIGNGGHKHRNLVQTPLLPSPVDREEGKASMPLSENA 180
           EH V+VE+ V + VCSS+LDMDVS         NL++TP LP  VD+EEG  S PLSE+ 
Sbjct: 169 EHCVLVETPVHN-VCSSSLDMDVSTA-------NLLRTPSLPPRVDQEEGNGSGPLSEHG 228

Query: 181 VFVESHADNACSSALDMDVPLGNGCSKRRSLLRMPSLPSRVEREEGIREKGNSSRPLLEH 240
           VF E  AD+AC S LDM    GN   KRRSL RMPSLPSRVERE+GI+EKGN S+PL+EH
Sbjct: 229 VFAEPPADHACLSTLDMPFSPGNSGDKRRSLRRMPSLPSRVEREQGIQEKGNGSKPLIEH 288

Query: 241 GLLQTPAKPPYVERKEEGTRSKESSSTRRSKSARKPRHGNLLRTPSLPPCIGREKEFGEK 300
            LLQ PAKPP VERKEEG RSKES STRRSKSARKP   NLLRTPSLPPCIGRE+EFGE+
Sbjct: 289 ALLQKPAKPPSVERKEEGIRSKESGSTRRSKSARKPPQSNLLRTPSLPPCIGREREFGER 348

Query: 301 EAAARIRNSIQPNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRRSQSEL 360
           EAAARIRNSIQPNLSEFFP+R+E LEK FSLPMCRIPTS+DEMWHQFLIQMR+RRSQSEL
Sbjct: 349 EAAARIRNSIQPNLSEFFPTRQEFLEKKFSLPMCRIPTSSDEMWHQFLIQMRKRRSQSEL 408

Query: 361 ESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWMLQTHLL 420
           ESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESER RRPYLSEAWMLQTHLL
Sbjct: 409 ESEELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERTRRPYLSEAWMLQTHLL 468

Query: 421 PPIPKWDTRNSAEDMKQQIKFWARAVASNVH 450
           PPIPKWD R SAEDMKQQIKFWARAVASNVH
Sbjct: 469 PPIPKWDNRKSAEDMKQQIKFWARAVASNVH 484

BLAST of Tan0010831 vs. TAIR 10
Match: AT2G42760.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881); Has 170 Blast hits to 164 proteins in 34 species: Archae - 0; Bacteria - 1; Metazoa - 26; Fungi - 10; Plants - 107; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 70.1 bits (170), Expect = 5.2e-12
Identity = 49/127 (38.58%), Postives = 66/127 (51.97%), Query Frame = 0

Query: 347 IQMRRRRSQSELESEELQGFKDLGFTFDKKDINPT-VVDIIPGLRE---------KKEEE 406
           ++ R+ +S S+LE EEL+GF DLGF F + D   + +V I+PGL+          K+EEE
Sbjct: 139 VRTRKGKSMSDLEYEELKGFMDLGFVFSEDDHKDSDLVSILPGLQRLVKKDDGVTKEEEE 198

Query: 407 LESE------RARRPYLSEAW-----MLQTHLLPPIPKW----DTRNSAEDMKQQIKFWA 449
            E E      RA RPYLSEAW           + P  KW        S  D+K  ++ WA
Sbjct: 199 EEEEDKIGGNRAARPYLSEAWDHCGGRKGKKQITPEIKWRVPAPAAASEVDLKDNLRLWA 258

BLAST of Tan0010831 vs. TAIR 10
Match: AT3G15115.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G53180.1); Has 47 Blast hits to 47 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 13; Fungi - 0; Plants - 30; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 62.4 bits (150), Expect = 1.1e-09
Identity = 71/269 (26.39%), Postives = 120/269 (44.61%), Query Frame = 0

Query: 216 PSRVEREEG---------IREKGNSSRPLLEHGLLQTPAKPPYVERK--EEGTRSKES-- 275
           P  +E++EG         +  +  S +  ++        K P V  K  +EG+R K    
Sbjct: 85  PPCIEKKEGGGEPEKINKVMRRQFSEKTRVQERRTYLQKKEPVVREKGIKEGSRKKNRTR 144

Query: 276 -SSTRRSKSARKPRHGNLLRTPSLPPCIGRE---KEFGEKE---------AAARIRNSIQ 335
            S +  +        G+L RT +LP  +GRE    EF ++E             I NS  
Sbjct: 145 ISCSNNNSVQSCSMGGSLQRTQTLPSYLGREDDVNEFQDQEIDDSRMGFLIREAIANSSS 204

Query: 336 PNLSEFFPSRREILEKNFSLPMCRIP--TSNDEMWHQFLIQMRR-------RRSQSELES 395
            + S F P+++ I  K   +P  R P  + +++   + +++ ++       R++ S +E+
Sbjct: 205 SSSSGFTPTKQNI-PKVSCIPRHRPPRNSRSEDAIQELVVKSQKSPNRKTLRKTLSSIET 264

Query: 396 EELQGFKDLGFTFDKKDINPTVVDIIPGLREKKEEELESERARRPYLSEAWMLQTHLL-P 449
           +++Q  KD                      EKK+EE E ++ + P  +      T ++  
Sbjct: 265 KDIQMLKDFHIE-----------------TEKKQEEDEEKQRKVPCTTTGKNRSTAVVGQ 324

BLAST of Tan0010831 vs. TAIR 10
Match: AT1G53180.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 6 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G15115.1); Has 58 Blast hits to 56 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 6; Fungi - 4; Plants - 29; Viruses - 0; Other Eukaryotes - 19 (source: NCBI BLink). )

HSP 1 Score: 55.8 bits (133), Expect = 1.0e-07
Identity = 113/441 (25.62%), Postives = 180/441 (40.82%), Query Frame = 0

Query: 26  DLLELSWFFDNLLLRRNPRMSTSHSHPCLSNVAHQVFVESPVPNVCSSALDGDVSFGNGG 85
           DLLE  WFF+NL  RR+  +   HS P  S+ +      S  P     +  G V   + G
Sbjct: 13  DLLEDYWFFENLFTRRSRGLRYCHSDPYPSSSS-----TSTSPEKMGDSDIGKVLEASTG 72

Query: 86  GKRRNLLRTPSLPSRMDRGEGVQEKGNGSRPLLEHGVVVESSVDDQVCSSTLDMDVSIGN 145
              R+L+R  S+ SR    EG      GS+  L      +  V +Q              
Sbjct: 73  ---RSLIRASSIDSR----EG------GSQTKLTGRFSEKIRVQEQ-------------- 132

Query: 146 GGHKHRNLVQTPLLPSPVDREEGKASMPLSENAVFVESHADNACSSALDMDVPLGNGCSK 205
                              R+ G +S+   E+ V  +S + +A     +         S 
Sbjct: 133 -------------------RQVG-SSLQKKEHVVLPKSGSRSAPGKIQE--------AST 192

Query: 206 RRSLLRMPSLPSRVEREEGIREK----GNSSRPLLEHGLLQTPAKP--PYVERKEEGTRS 265
           +R L+R PSLP ++E+ E  RE        +R   E   +  P +P   ++++KE   R 
Sbjct: 193 KRGLIRAPSLPPQIEKREMDREAKKMINKLTRQFSEKIRVLEPTRPGEHFLQKKETIARD 252

Query: 266 KE-SSSTRRSKSARKPRHG----NLLRTPSLPPCIGREK-----EFGEKEAAARIRNSIQ 325
           K  + S+R +K+     +     +L RT ++P  +GRE+     EF ++E+ +R+   I 
Sbjct: 253 KGITESSRSNKTGSSSSYSSVKISLQRTQTMPNNMGREEDNEEDEFEDQESDSRMGFLI- 312

Query: 326 PNLSEFFPSRREILEKNFSLPMCRIPTSNDEMWHQFLIQMRRRRSQSELESEELQGFKDL 385
                     RE L  +  +P      SN++          R+R    L  E+    K  
Sbjct: 313 ----------REALASSHYVP----KVSNNQ----------RQRPPRSLRLEDTAMVKQG 354

Query: 386 GFTFDKKDINPTVVDIIPGLREKKEEELESERAR-RPYLSEAWMLQTHLLPP-IPKWDTR 445
           G +   K +  T+  +        E   E +R +    L E  +      PP +PK    
Sbjct: 373 GSS--PKTLRKTLSSV--------ETTKEIQRLKGYDQLVEPRVASGLATPPRVPK---- 354

Query: 446 NSAEDMKQQIKFWARAVASNV 449
           +S+++MK QIKFWARAVA+NV
Sbjct: 433 DSSKEMKDQIKFWARAVATNV 354

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_038874899.11.6e-20181.76uncharacterized protein LOC120067383 isoform X1 [Benincasa hispida] >XP_03887493... [more]
KAG7015030.16.4e-19872.87hypothetical protein SDJN02_22661, partial [Cucurbita argyrosperma subsp. argyro... [more]
KAG6577008.13.5e-19672.43hypothetical protein SDJN03_24582, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023551849.12.3e-19572.41uncharacterized protein LOC111809699 isoform X1 [Cucurbita pepo subsp. pepo] >XP... [more]
XP_022922903.12.3e-19572.04uncharacterized protein LOC111430741 [Cucurbita moschata] >XP_022922904.1 unchar... [more]
Match NameE-valueIdentityDescription
A0A6J1E4T21.1e-19572.04uncharacterized protein LOC111430741 OS=Cucurbita moschata OX=3662 GN=LOC1114307... [more]
A0A6J1J9E42.3e-19370.85uncharacterized protein LOC111482920 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A5A7UXD22.0e-18977.46DUF3082 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... [more]
A0A5D3CIY02.0e-18977.46Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A0A0LC644.8e-18376.27Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G236040 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G42760.15.2e-1238.58unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685... [more]
AT3G15115.11.1e-0926.39unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G53180.11.0e-0725.62unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 216..233
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 203..290
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 86..115
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 148..169
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 248..268
NoneNo IPR availablePANTHERPTHR33785:SF5SERINE/ARGININE REPETITIVE MATRIX PROTEINcoord: 196..449
NoneNo IPR availablePANTHERPTHR33785:SF5SERINE/ARGININE REPETITIVE MATRIX PROTEINcoord: 14..171
NoneNo IPR availablePANTHERPTHR33785FAMILY NOT NAMEDcoord: 14..171
NoneNo IPR availablePANTHERPTHR33785FAMILY NOT NAMEDcoord: 196..449

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0010831.1Tan0010831.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane