Sed0001684 (gene) Chayote v1

Overview
Name: Sed0001684
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Dirigent protein
Location: LG04: 44627414 .. 44630559 (-)
RNA-Seq Expression: Sed0001684
Synteny: Sed0001684
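
The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re


def parse_location(text):
    """Split a string such as 'LG04: 44627414 .. 44630559 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("LG04: 44627414 .. 44630559 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 3146 bp on the minus strand of LG04.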
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAGCGAATGGTGGTTCAGCTCCGTCCGCTCTTGGGTATCCGCCGTTGTCTGGGGATGTTTGGATGCGGATTATATGAAACTTGGTACACCCTTTTACCGTTTCTATCATTTGAATTTATTTGATTCATATTTAAGTTTTTTTTTTTTTTAAATTTTATTTATTTTTAATGTGGAATCAAATTGGGAAAGTTAAGTTTACCTTTGAAGTTAGGTCCCATCCGTTGTTGCTTTTAGCTGGTGCAAATGTACAGAATCCTTCAAGAACTATAGTTTCACTAATCTTGTAGCTGTTTGGAAGAAATTTTTGCCCCCATTTAACTTGATGATCTATGTTTGTTGAGTTGCCTTGAGACTTGAAATATGATGTATTGATGTTAATAAGCCATCAACAACGAGGGGCTGCCAAATTTATTATTGTTCAACTTTATCCACTAGTTTTACACTCATTGGAGCATTGTTCATTTCATACTGATGAACTGACCATACGATATGTGTTTTTTTTGTAAAATTAAGAAACAGAAAAGGAAACAAACAACAAGAAATCAAAAAGTTGTTATCAAACAAGTTTTGTTCCTGTTTTTTAATCTTTGTAAGAAACAATAATATTGAATCATGTAAAACTTTGTGGTCATCTCTTCCAATTGTTCATATGATATCTACTTGGAATGCTAGTTCATGCAATTATGTAGTTCTAGCACCACAACTGGATGTTGATGGAAGTTTCCTGTACCTCACTATAACTTACTTCGTGGATTGCTCAATTCATGGAGTATTTTGGATCCAATTGTTCCTATGCAAATGGAGACACAAGGGCTGGTAAATTATCTCTTCCCCATAGATTTCGAACTTGATGATAACAAAAATGTTTTGAATATGAGATTTTCATGTGGGTTATGCATCAAGCAAGTGGCTTAATCTGCTTTGTGGGAGAAGGGACTTTTGTTGCGGCATGCGAGAGAAACAATAGAATCTTTTGAGGGTTGGAGAGATTTCCAGACGATATTAGATTCTATGTTTCTCTATGAGGTTCGATGATAATACTTTTTTTATAATTACATTTTAGGTCTTGTTTTGTTGGATTAGAAGCCCTTTTCATAGGTCTCTCTTTTTGTTTGCCTTATATTATTTCTTTTAATCTTAATGAATGCTCGGTTGTTAATTAAAAGAAAAATGATAATTTTTTTTTTCAGTTAGATAAACTCCACAGGACATTGTAGCTGTGTGTCAATGTGATAAAGAATCGATAAAACAATAGAGTTACAATTCTGACAGTCTGTGCATAATTTGGATTGATATATGTGAAAACTTTGTTTTTGTAGTCTTTTCATTCTGAATTTAATAGACAGGTACCCAGTTTTGGGATGAGCATTTTAATGCTTCAAAAAGGTTAGATCTACAAAATATGGAAAGATATGAGGCAATTCTTAGCAATGCTTTCTGATTTTATCGAATATAGAGTGGCTCAATTTTATGATTGGAACATCAATCATGGGTTGACCCAACAGTGAAAGAGGGTCAGGAAGATGTAAGAGTGTCATGAATTCAATCTATATTGATTACTTACCTAGTAATTTTCTATGTGTTTCTTTCGCATCCAATTTTAGAATAAATGAGTGATATTTTACTTTTTTACGAGCCTCTGATTTGCCTGGCTGTGTGTTTCTGAAATGACATCTCACTTGGACTCTTGCTGTGTACATGTGTGCCTTGTTTGTTTCAGAATGGACATTTCTTTGCCTGTGAACACAAGGTCATCCACTCAATTATTTATCCAATGATGATCTTAAGGCAGGATATTTGTTCACTTTCGTCCTCAAGAGGTGGCTTTGGGCTTGGAGTCTTGGAGGTCTCAGGTTCGAAATCCACAAATGAGCTTAATTATTAAATCCCTTGTTATCTTCCAGGTCTGAGCTTTGAGATGGCTGCAGGTACCTTGGGTACGTAGTGTAGTAAATCTCCAATTCTCGGTCGTAAAAAAAGAGAGTATATTTCTGTCAGCC
ATTGGCTTAGATCATTACTAAGATCTATCAAACTGATGAAGCCCCTTTTCTCATATGGATCTAAACCATTGTCGAAATGCTCCAAGAGCAGAAGGAACCCTTACTAAGTTTCAACCTCTTGTTGCAGCATGAGTGGGTTTGACTAGAAGTCACTGGATTTAATAGTGGATTTGTTATTTAGGGTACACCAAATCCTTTTAGATGGTTGACTTTTAAAACAATAGTTTTTTCTTGAAATACTGACCCTCCATGGATTTGGTGGGCGATGGATTAGGTACAAAACTTTGCATTAGTTGACCTAAAATATAAACATGTTGAATGAGCTATATGTATCTGAGAATGCAGCTAAAATGGCTACTACATATTAAACTTTCTTTAAAGAACTGGGTCTTTCAATGCATGATGAAGAATAGAATTATATTCTGCTCTGCAATCAGCTTGGCCCTCTTAGCTGTTATTCTCTTGGCCTTGCTTTCGCCAGTATCCCACCGAAAGCAGTCCAACCAAGCGAAAAAACCGCCGTGGGCGGACCTGTCCCTCTACATTCAACAGCCACATTCTAAAGCAAATGCCAGATCCAACAATTTGCAGCCTGTACAACCAGATTCTGGAGTTTTTGTCTTTCGACGAGCGCTCACGGAGGGGCCCGAGAACACGTCTCAGATCGTCGGAAAGGCTCAAGGTTTCATCATTCCTAACGAGCAGTTCGCTCGTTCGTCGTTCAATATCATCTATCTCAGTTTCGACACGCCAGAGTATTCGGGCAGCTTGAGTGTCCATGCCAAACATATTGGACATGAGAAAAAAGAAGAAATGGCAGTGGTTGGGGGGACAGGTTCTTTTGCTTTTGCTCAAGGGATAGCTGTTTTTTTACAGACAGAGAAGCAAGCATCTGTTGCTGATACATCTTATCATTTAAAGCTTCAACTTCAATTCCCCAAATGATAATTATAATGCAAAGTAGTCCCACATGCTTCTTTGTTTGATTATATTTAATAAAGCACAATTTTGTGTGCAACTGAGGAAGCATCCTCCTCATTCCAGTTGTATAAATTATTTGAGTCAAGTTATGAATTGGTGTACAAATAAGGAGTACTATGGAACTTGAGATGACATTGATCTTCCAACTTTTTCTTGTTACAATCT

mRNA sequence

CAGCGAATGGTGGTTCAGCTCCGTCCGCTCTTGGGTATCCGCCGTTGTCTGGGGATGTTTGGATGCGGATTATATGAAACTTGAATGGACATTTCTTTGCCTGTGAACACAAGGTCATCCACTCAATTATTTATCCAATGATGATCTTAAGGCAGGATATTTGTTCACTTTCGTCCTCAAGAGGTGGCTTTGGGCTTGGAGTCTTGGAGGTCTCAGGTACCTTGGGTACGTAGTGTAGTAAATCTCCAATTCTCGGTCGTAAAAAAAGAGAGTATATTTCTGTCAGCCATTGGCTTAGATCATTACTAAGATCTATCAAACTGATGAAGCCCCTTTTCTCATATGGATCTAAACCATTGTCGAAATGCTCCAAGAGCAGAAGGAACCCTTACTAAGTTTCAACCTCTTGTTGCAGCATGAGTGGGTTTGACTAGAAGTCACTGGATTTAATAGTGGATTTGTTATTTAGGGTACACCAAATCCTTTTAGATGGTTGACTTTTAAAACAATAGTTTTTTCTTGAAATACTGACCCTCCATGGATTTGGTGGGCGATGGATTAGGTACAAAACTTTGCATTAGTTGACCTAAAATATAAACATGTTGAATGAGCTATATGTATCTGAGAATGCAGCTAAAATGGCTACTACATATTAAACTTTCTTTAAAGAACTGGGTCTTTCAATGCATGATGAAGAATAGAATTATATTCTGCTCTGCAATCAGCTTGGCCCTCTTAGCTGTTATTCTCTTGGCCTTGCTTTCGCCAGTATCCCACCGAAAGCAGTCCAACCAAGCGAAAAAACCGCCGTGGGCGGACCTGTCCCTCTACATTCAACAGCCACATTCTAAAGCAAATGCCAGATCCAACAATTTGCAGCCTGTACAACCAGATTCTGGAGTTTTTGTCTTTCGACGAGCGCTCACGGAGGGGCCCGAGAACACGTCTCAGATCGTCGGAAAGGCTCAAGGTTTCATCATTCCTAACGAGCAGTTCGCTCGTTCGTCGTTCAATATCATCTATCTCAGTTTCGACACGCCAGAGTATTCGGGCAGCTTGAGTGTCCATGCCAAACATATTGGACATGAGAAAAAAGAAGAAATGGCAGTGGTTGGGGGGACAGGTTCTTTTGCTTTTGCTCAAGGGATAGCTGTTTTTTTACAGACAGAGAAGCAAGCATCTGTTGCTGATACATCTTATCATTTAAAGCTTCAACTTCAATTCCCCAAATGATAATTATAATGCAAAGTAGTCCCACATGCTTCTTTGTTTGATTATATTTAATAAAGCACAATTTTGTGTGCAACTGAGGAAGCATCCTCCTCATTCCAGTTGTATAAATTATTTGAGTCAAGTTATGAATTGGTGTACAAATAAGGAGTACTATGGAACTTGAGATGACATTGATCTTCCAACTTTTTCTTGTTACAATCT

Coding sequence (CDS)

ATGAGCTATATGTATCTGAGAATGCAGCTAAAATGGCTACTACATATTAAACTTTCTTTAAAGAACTGGGTCTTTCAATGCATGATGAAGAATAGAATTATATTCTGCTCTGCAATCAGCTTGGCCCTCTTAGCTGTTATTCTCTTGGCCTTGCTTTCGCCAGTATCCCACCGAAAGCAGTCCAACCAAGCGAAAAAACCGCCGTGGGCGGACCTGTCCCTCTACATTCAACAGCCACATTCTAAAGCAAATGCCAGATCCAACAATTTGCAGCCTGTACAACCAGATTCTGGAGTTTTTGTCTTTCGACGAGCGCTCACGGAGGGGCCCGAGAACACGTCTCAGATCGTCGGAAAGGCTCAAGGTTTCATCATTCCTAACGAGCAGTTCGCTCGTTCGTCGTTCAATATCATCTATCTCAGTTTCGACACGCCAGAGTATTCGGGCAGCTTGAGTGTCCATGCCAAACATATTGGACATGAGAAAAAAGAAGAAATGGCAGTGGTTGGGGGGACAGGTTCTTTTGCTTTTGCTCAAGGGATAGCTGTTTTTTTACAGACAGAGAAGCAAGCATCTGTTGCTGATACATCTTATCATTTAAAGCTTCAACTTCAATTCCCCAAATGA

Protein sequence

MSYMYLRMQLKWLLHIKLSLKNWVFQCMMKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSNNLQPVQPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEYSGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFPK
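
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code and uses only the first 27 and last 15 nucleotides of the CDS as copied from this page; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAGCTATATGTATCTGAGAATGCAG"  # first 27 nt of the CDS
cds_end = "CAATTCCCCAAATGA"                # last 15 nt, ending in the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first nine and last four residues of the protein sequence listed above.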
Homology
BLAST of Sed0001684 vs. NCBI nr
Match: XP_038888051.1 (dirigent protein 8 [Benincasa hispida])

HSP 1 Score: 326.6 bits (836), Expect = 1.5e-85
Identity = 167/206 (81.07%), Positives = 180/206 (87.38%), Query Frame = 0

Query: 8   MQLKWLLHIK----LSLKNWVFQCMMKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQ 67
           M + WLLH K    L LK  VFQCMMKNRIIFC+AI LA LAVILLALLSPVSHRKQ+  
Sbjct: 1   MLINWLLHYKYINVLFLKKLVFQCMMKNRIIFCAAICLAFLAVILLALLSPVSHRKQAKH 60

Query: 68  AKKPPWADLSLYIQQPHSKANARSNNLQPV-QPDSGVFVFRRALTEGPENTSQIVGKAQG 127
            +KPPWADLSLYIQ+PHSK NARSN ++PV +PDSG+FVFRR LT+GPENTSQIVG AQG
Sbjct: 61  DRKPPWADLSLYIQRPHSKENARSNKMKPVTEPDSGIFVFRRTLTKGPENTSQIVGNAQG 120

Query: 128 FIIPNEQFARSSFNIIYLSFDTPEYSGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIA 187
           FIIPNEQFARSSFNIIYLSFDTPEYSGSL VHAK IGHE +EEM VVGGTGSFAFAQGIA
Sbjct: 121 FIIPNEQFARSSFNIIYLSFDTPEYSGSLGVHAKRIGHENREEMTVVGGTGSFAFAQGIA 180

Query: 188 VFLQTEKQASVADTSYHLKLQLQFPK 209
           +FLQTEKQ S+ DTSYHLKLQLQFPK
Sbjct: 181 IFLQTEKQTSITDTSYHLKLQLQFPK 206
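
The percentages in each HSP header are plain ratios of the two counts shown. As a small sketch, assuming the second number in each fraction is the alignment length reported by BLAST, the figures for this first hit can be reproduced as follows (`percent` is a hypothetical helper):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


print(percent(167, 206))  # identical residues
print(percent(180, 206))  # identical or conservatively substituted residues
```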

BLAST of Sed0001684 vs. NCBI nr
Match: XP_022952599.1 (uncharacterized protein LOC111455239 [Cucurbita moschata])

HSP 1 Score: 308.1 bits (788), Expect = 5.6e-80
Identity = 156/181 (86.19%), Positives = 166/181 (91.71%), Query Frame = 0

Query: 29  MKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSN 88
           MKNRIIFC+AI LA+LAVILLALLSPVSHRKQ+    KPPWADLSLYIQQPHSKAN+RSN
Sbjct: 1   MKNRIIFCAAICLAILAVILLALLSPVSHRKQAKHDSKPPWADLSLYIQQPHSKANSRSN 60

Query: 89  NLQPV-QPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEY 148
           N+Q V  PDSG+FVFRR LT+G ENTSQIVG AQGFIIPNEQF RSSFNIIYLSFDTPEY
Sbjct: 61  NMQLVPSPDSGIFVFRRMLTKGLENTSQIVGNAQGFIIPNEQFGRSSFNIIYLSFDTPEY 120

Query: 149 SGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFP 208
           SGSLSVHAKHIGHE +EEM VVGGTGSFAFAQGIA+FLQTE+QASV DTSYHLKLQLQFP
Sbjct: 121 SGSLSVHAKHIGHENREEMTVVGGTGSFAFAQGIAIFLQTERQASVTDTSYHLKLQLQFP 180

BLAST of Sed0001684 vs. NCBI nr
Match: XP_022969428.1 (dirigent protein 8 [Cucurbita maxima])

HSP 1 Score: 307.0 bits (785), Expect = 1.2e-79
Identity = 156/181 (86.19%), Positives = 167/181 (92.27%), Query Frame = 0

Query: 29  MKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSN 88
           MKNRIIFC+AI LA+LAVILLALLSPVSHRKQ+    KPPWADLSLYIQQPHSKAN+RS+
Sbjct: 1   MKNRIIFCAAICLAILAVILLALLSPVSHRKQAKHDGKPPWADLSLYIQQPHSKANSRSS 60

Query: 89  NLQPV-QPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEY 148
           N+Q V  PDSG+FVFRR LT+G ENTSQIVG AQGFIIPNEQFARSSFNIIYLSFDTPEY
Sbjct: 61  NMQLVPSPDSGIFVFRRMLTKGLENTSQIVGNAQGFIIPNEQFARSSFNIIYLSFDTPEY 120

Query: 149 SGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFP 208
           SGSLSVHAKHIGHE +EEM VVGGTGSFAFAQGIA+FLQTE+QASV DTSYHLKLQLQFP
Sbjct: 121 SGSLSVHAKHIGHENREEMTVVGGTGSFAFAQGIAIFLQTERQASVTDTSYHLKLQLQFP 180

BLAST of Sed0001684 vs. NCBI nr
Match: XP_023511732.1 (uncharacterized protein LOC111776503 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 306.6 bits (784), Expect = 1.6e-79
Identity = 156/181 (86.19%), Positives = 167/181 (92.27%), Query Frame = 0

Query: 29  MKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSN 88
           MKNRIIFC+AI LA+LAVILLALLSPVSHRKQ+   +KPPWADLSLYIQQPHSKAN+RSN
Sbjct: 1   MKNRIIFCAAICLAILAVILLALLSPVSHRKQAKHDRKPPWADLSLYIQQPHSKANSRSN 60

Query: 89  NLQPV-QPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEY 148
           N+Q V  PDSG+FVFRR LT+G ENTSQIVG AQGFIIPNEQFARSSFNIIYLSFDT EY
Sbjct: 61  NMQLVPSPDSGIFVFRRMLTKGLENTSQIVGNAQGFIIPNEQFARSSFNIIYLSFDTLEY 120

Query: 149 SGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFP 208
           SGSLSVHAKHIGHE +EEM VVGGTGSFAFAQGIA+FLQTE+QASV DTSYHLKLQLQFP
Sbjct: 121 SGSLSVHAKHIGHENREEMTVVGGTGSFAFAQGIAIFLQTERQASVTDTSYHLKLQLQFP 180

BLAST of Sed0001684 vs. NCBI nr
Match: KAG6572477.1 (Dirigent protein 6, partial [Cucurbita argyrosperma subsp. sororia] >KAG7012071.1 Dirigent protein 6, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 305.8 bits (782), Expect = 2.8e-79
Identity = 155/181 (85.64%), Positives = 167/181 (92.27%), Query Frame = 0

Query: 29  MKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSN 88
           MKNRIIFC+AI LA+LAVILLALLSPVSHRKQ+   +KPPWADLSLYIQQP+SKAN+RSN
Sbjct: 1   MKNRIIFCAAICLAILAVILLALLSPVSHRKQAKHDRKPPWADLSLYIQQPNSKANSRSN 60

Query: 89  NLQPV-QPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEY 148
           N+Q V  PDSG+FVFRR LT+G ENTSQIVG AQGFIIPNEQFARSSFNIIYLSFDTPEY
Sbjct: 61  NMQLVPSPDSGIFVFRRMLTKGLENTSQIVGNAQGFIIPNEQFARSSFNIIYLSFDTPEY 120

Query: 149 SGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFP 208
           SGSLSVHAKHIGHE +EEM VVGGTGSFAFAQGIA+FLQTE+QASV DT YHLKLQLQFP
Sbjct: 121 SGSLSVHAKHIGHENREEMTVVGGTGSFAFAQGIAIFLQTERQASVTDTCYHLKLQLQFP 180

BLAST of Sed0001684 vs. ExPASy Swiss-Prot
Match: A0A1V1FH01 (Pterocarpan synthase 1 OS=Glycyrrhiza echinata OX=46348 GN=PTS1 PE=1 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 2.7e-08
Identity = 42/109 (38.53%), Positives = 61/109 (55.96%), Query Frame = 0

Query: 76  IQQPHSKANARSNNLQPVQPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSF 135
           + +P+ KA    N+L    P   V      LT GPE+ S++VGKAQG      Q      
Sbjct: 62  VAEPNGKA---KNSL----PFGTVVAMDDPLTVGPESDSKLVGKAQGIYTSISQEEMGLM 121

Query: 136 NIIYLSFDTPEYSGS-LSVHAKH-IGHEKKEEMAVVGGTGSFAFAQGIA 183
            ++ ++F   E++GS LS+ A++ I  E   EMA+VGGTG+F FA+G A
Sbjct: 122 MVMTMAFSDGEFNGSTLSILARNMIMSEPVREMAIVGGTGAFRFARGYA 163

BLAST of Sed0001684 vs. ExPASy Swiss-Prot
Match: I1JNN8 (Pterocarpan synthase 1 OS=Glycine max OX=3847 GN=PTS1 PE=2 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 1.7e-07
Identity = 39/111 (35.14%), Positives = 59/111 (53.15%), Query Frame = 0

Query: 74  LYIQQPHSKANARSNNLQPVQPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARS 133
           ++I +P+ KA       +   P   V      LT GPE  S++VGKAQG      Q    
Sbjct: 58  VFIAEPNGKA-------KDALPFGTVVAMDDPLTVGPEQDSKLVGKAQGIYTSISQEEMG 117

Query: 134 SFNIIYLSFDTPEYSGS-LSVHAKH-IGHEKKEEMAVVGGTGSFAFAQGIA 183
              ++ ++F   +++GS +SV  ++ I  E   EMA+VGGTG+F FA+G A
Sbjct: 118 LMMVMTMAFTDGDFNGSTISVLGRNMIMSEPVREMAIVGGTGAFRFARGYA 161

BLAST of Sed0001684 vs. ExPASy Swiss-Prot
Match: Q9SKQ2 (Dirigent protein 4 OS=Arabidopsis thaliana OX=3702 GN=DIR4 PE=2 SV=1)

HSP 1 Score: 53.1 bits (126), Expect = 4.3e-06
Identity = 29/100 (29.00%), Positives = 54/100 (54.00%), Query Frame = 0

Query: 84  NARSNNLQPVQPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFD 143
           + R +N     P   +F     LT GP+  S+ +G A+G  + + +   +    +   F 
Sbjct: 62  HTRGDNDSSPSPFGSLFALDDPLTVGPDPKSEKIGNARGMYVSSGKHVPTLTMYVDFGFT 121

Query: 144 TPEYSG-SLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIA 183
           + +++G S++V +++   EK+ E+AVVGG G F  A+G+A
Sbjct: 122 SGKFNGSSIAVFSRNTITEKEREVAVVGGRGRFRMARGVA 161

BLAST of Sed0001684 vs. ExPASy Swiss-Prot
Match: Q9FIG7 (Dirigent protein 2 OS=Arabidopsis thaliana OX=3702 GN=DIR2 PE=2 SV=1)

HSP 1 Score: 52.4 bits (124), Expect = 7.3e-06
Identity = 29/78 (37.18%), Positives = 47/78 (60.26%), Query Frame = 0

Query: 106 LTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEYSGS-LSVHAKHIGHEKKE 165
           LTEGP+ +S+ VG+AQG          S   +  L+F   E++GS ++++ ++    K  
Sbjct: 86  LTEGPDPSSKEVGRAQGMYALTAMKNISFTMVFNLAFTAGEFNGSTVAMYGRNEIFSKVR 145

Query: 166 EMAVVGGTGSFAFAQGIA 183
           EM ++GGTG+F FA+G A
Sbjct: 146 EMPIIGGTGAFRFARGYA 163

BLAST of Sed0001684 vs. ExPASy Swiss-Prot
Match: Q9SUQ8 (Dirigent protein 6 OS=Arabidopsis thaliana OX=3702 GN=DIR6 PE=1 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 2.1e-05
Identity = 49/157 (31.21%), Positives = 77/157 (49.04%), Query Frame = 0

Query: 42  ALLAVILLALL---SPVSHRKQSNQAKKPPWADLSLY----IQQPHSKANARSNNLQPVQ 101
           AL +  LL LL   + +S RK  +Q  K P    S Y    +    + ANA S  +    
Sbjct: 12  ALFSFFLLVLLFSDTVLSFRKTIDQ--KKPCKHFSFYFHDILYDGDNVANATSAAIVS-P 71

Query: 102 PDSGVFVFRR-ALTEGP-----ENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEYS 161
           P  G F F +  + +GP        S+ V +AQGF   + +   +S+    L F++ E+ 
Sbjct: 72  PGLGNFKFGKFVIFDGPITMDKNYLSKPVARAQGFYFYDMKMDFNSWFSYTLVFNSTEHK 131

Query: 162 GSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFL 186
           G+L++    +  E   +++VVGGTG F  A+GIA F+
Sbjct: 132 GTLNIMGADLMMEPTRDLSVVGGTGDFFMARGIATFV 165

BLAST of Sed0001684 vs. ExPASy TrEMBL
Match: A0A6J1GM61 (Dirigent protein OS=Cucurbita moschata OX=3662 GN=LOC111455239 PE=3 SV=1)

HSP 1 Score: 308.1 bits (788), Expect = 2.7e-80
Identity = 156/181 (86.19%), Positives = 166/181 (91.71%), Query Frame = 0

Query: 29  MKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSN 88
           MKNRIIFC+AI LA+LAVILLALLSPVSHRKQ+    KPPWADLSLYIQQPHSKAN+RSN
Sbjct: 1   MKNRIIFCAAICLAILAVILLALLSPVSHRKQAKHDSKPPWADLSLYIQQPHSKANSRSN 60

Query: 89  NLQPV-QPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEY 148
           N+Q V  PDSG+FVFRR LT+G ENTSQIVG AQGFIIPNEQF RSSFNIIYLSFDTPEY
Sbjct: 61  NMQLVPSPDSGIFVFRRMLTKGLENTSQIVGNAQGFIIPNEQFGRSSFNIIYLSFDTPEY 120

Query: 149 SGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFP 208
           SGSLSVHAKHIGHE +EEM VVGGTGSFAFAQGIA+FLQTE+QASV DTSYHLKLQLQFP
Sbjct: 121 SGSLSVHAKHIGHENREEMTVVGGTGSFAFAQGIAIFLQTERQASVTDTSYHLKLQLQFP 180

BLAST of Sed0001684 vs. ExPASy TrEMBL
Match: A0A6J1I0Y2 (Dirigent protein OS=Cucurbita maxima OX=3661 GN=LOC111468438 PE=3 SV=1)

HSP 1 Score: 307.0 bits (785), Expect = 6.0e-80
Identity = 156/181 (86.19%), Positives = 167/181 (92.27%), Query Frame = 0

Query: 29  MKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSN 88
           MKNRIIFC+AI LA+LAVILLALLSPVSHRKQ+    KPPWADLSLYIQQPHSKAN+RS+
Sbjct: 1   MKNRIIFCAAICLAILAVILLALLSPVSHRKQAKHDGKPPWADLSLYIQQPHSKANSRSS 60

Query: 89  NLQPV-QPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEY 148
           N+Q V  PDSG+FVFRR LT+G ENTSQIVG AQGFIIPNEQFARSSFNIIYLSFDTPEY
Sbjct: 61  NMQLVPSPDSGIFVFRRMLTKGLENTSQIVGNAQGFIIPNEQFARSSFNIIYLSFDTPEY 120

Query: 149 SGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFP 208
           SGSLSVHAKHIGHE +EEM VVGGTGSFAFAQGIA+FLQTE+QASV DTSYHLKLQLQFP
Sbjct: 121 SGSLSVHAKHIGHENREEMTVVGGTGSFAFAQGIAIFLQTERQASVTDTSYHLKLQLQFP 180

BLAST of Sed0001684 vs. ExPASy TrEMBL
Match: A0A5A7U5G8 (Dirigent protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold986G00790 PE=3 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 5.9e-75
Identity = 146/181 (80.66%), Positives = 159/181 (87.85%), Query Frame = 0

Query: 29  MKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSN 88
           MKNRIIFC+AI LA LAVILLA+LSPVSH+KQ+   +KP W DLSLYIQ+P SKANAR N
Sbjct: 1   MKNRIIFCAAICLAFLAVILLAVLSPVSHKKQAKHDRKPQWTDLSLYIQRPRSKANARPN 60

Query: 89  NLQPVQ-PDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEY 148
           N+QPV  PDSGVF FRR LT+GPENTSQIVG AQG IIP+EQFARSSFNIIYLSFDTPEY
Sbjct: 61  NMQPVTVPDSGVFFFRRMLTKGPENTSQIVGNAQGIIIPSEQFARSSFNIIYLSFDTPEY 120

Query: 149 SGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFP 208
           SGSL VHAKHIGHE +EEM VVGGTGSFAFAQG+A+FLQTE+Q    DTSYHLKLQLQFP
Sbjct: 121 SGSLGVHAKHIGHENREEMTVVGGTGSFAFAQGVAIFLQTERQTLNTDTSYHLKLQLQFP 180

BLAST of Sed0001684 vs. ExPASy TrEMBL
Match: A0A1S3BGM4 (Dirigent protein OS=Cucumis melo OX=3656 GN=LOC103489634 PE=3 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 5.9e-75
Identity = 146/181 (80.66%), Positives = 159/181 (87.85%), Query Frame = 0

Query: 29  MKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARSN 88
           MKNRIIFC+AI LA LAVILLA+LSPVSH+KQ+   +KP W DLSLYIQ+P SKANAR N
Sbjct: 1   MKNRIIFCAAICLAFLAVILLAVLSPVSHKKQAKHDRKPQWTDLSLYIQRPRSKANARPN 60

Query: 89  NLQPVQ-PDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEY 148
           N+QPV  PDSGVF FRR LT+GPENTSQIVG AQG IIP+EQFARSSFNIIYLSFDTPEY
Sbjct: 61  NMQPVTVPDSGVFFFRRMLTKGPENTSQIVGNAQGIIIPSEQFARSSFNIIYLSFDTPEY 120

Query: 149 SGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQFP 208
           SGSL VHAKHIGHE +EEM VVGGTGSFAFAQG+A+FLQTE+Q    DTSYHLKLQLQFP
Sbjct: 121 SGSLGVHAKHIGHENREEMTVVGGTGSFAFAQGVAIFLQTERQTLNTDTSYHLKLQLQFP 180

BLAST of Sed0001684 vs. ExPASy TrEMBL
Match: A0A6J1D409 (Dirigent protein OS=Momordica charantia OX=3673 GN=LOC111016784 PE=3 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 1.9e-73
Identity = 145/182 (79.67%), Positives = 161/182 (88.46%), Query Frame = 0

Query: 28  MMKNRIIFCSAISLALLAVILLALLSPVSHRKQSNQAKKPPWADLSLYIQQPHSKANARS 87
           MMK RIIFC+A+ LA L VILLALLSPV ++KQS   +KP WADLSLYIQQPHS  NA+S
Sbjct: 1   MMKTRIIFCAAVCLAFLVVILLALLSPVPNKKQSKHGRKPSWADLSLYIQQPHSTGNAKS 60

Query: 88  NNLQPV-QPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPE 147
           NN+QPV + DSGVFVF R LTEGPENTS+IVG A+GFIIPNEQFA SSFN+IYLSFDTPE
Sbjct: 61  NNMQPVPRSDSGVFVFLRTLTEGPENTSRIVGNARGFIIPNEQFAHSSFNVIYLSFDTPE 120

Query: 148 YSGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTSYHLKLQLQF 207
           YSGSLSVHAKHIGHE + E+AVVGGTGSFAFAQGIA+FLQT+ QASV DT+YHLKLQLQF
Sbjct: 121 YSGSLSVHAKHIGHENR-ELAVVGGTGSFAFAQGIAIFLQTDGQASVTDTTYHLKLQLQF 180

Query: 208 PK 209
           PK
Sbjct: 181 PK 181

BLAST of Sed0001684 vs. TAIR 10
Match: AT5G42655.1 (Disease resistance-responsive (dirigent-like protein) family protein )

HSP 1 Score: 138.7 bits (348), Expect = 5.5e-33
Identity = 67/130 (51.54%), Positives = 91/130 (70.00%), Query Frame = 0

Query: 78  QPHSKANARSNNLQPVQPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNI 137
           QPH +   +           G  +FRR LTEGPEN S+IVGKA+GFIIP+E FA S FN+
Sbjct: 3   QPHGRGGGK-----------GALIFRRTLTEGPENNSRIVGKAEGFIIPHEDFANSDFNV 62

Query: 138 IYLSFDTPEYSGSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFLQTEKQASVADTS 197
           IYL+ +TPEY+GS+S+ ++ + H+ KE M VVGGTG+FAFA+GIA+F + +     A T+
Sbjct: 63  IYLTLETPEYTGSVSIRSRDMTHKLKEVMEVVGGTGAFAFARGIAMFNEIDDHEEEAVTT 121

Query: 198 YHLKLQLQFP 208
           Y +KL L+FP
Sbjct: 123 YRVKLLLRFP 121

BLAST of Sed0001684 vs. TAIR 10
Match: AT2G21110.1 (Disease resistance-responsive (dirigent-like protein) family protein )

HSP 1 Score: 53.1 bits (126), Expect = 3.0e-07
Identity = 29/100 (29.00%), Positives = 54/100 (54.00%), Query Frame = 0

Query: 84  NARSNNLQPVQPDSGVFVFRRALTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFD 143
           + R +N     P   +F     LT GP+  S+ +G A+G  + + +   +    +   F 
Sbjct: 62  HTRGDNDSSPSPFGSLFALDDPLTVGPDPKSEKIGNARGMYVSSGKHVPTLTMYVDFGFT 121

Query: 144 TPEYSG-SLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIA 183
           + +++G S++V +++   EK+ E+AVVGG G F  A+G+A
Sbjct: 122 SGKFNGSSIAVFSRNTITEKEREVAVVGGRGRFRMARGVA 161

BLAST of Sed0001684 vs. TAIR 10
Match: AT5G42500.1 (Disease resistance-responsive (dirigent-like protein) family protein )

HSP 1 Score: 52.4 bits (124), Expect = 5.2e-07
Identity = 29/78 (37.18%), Positives = 47/78 (60.26%), Query Frame = 0

Query: 106 LTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEYSGS-LSVHAKHIGHEKKE 165
           LTEGP+ +S+ VG+AQG          S   +  L+F   E++GS ++++ ++    K  
Sbjct: 86  LTEGPDPSSKEVGRAQGMYALTAMKNISFTMVFNLAFTAGEFNGSTVAMYGRNEIFSKVR 145

Query: 166 EMAVVGGTGSFAFAQGIA 183
           EM ++GGTG+F FA+G A
Sbjct: 146 EMPIIGGTGAFRFARGYA 163

BLAST of Sed0001684 vs. TAIR 10
Match: AT3G58090.1 (Disease resistance-responsive (dirigent-like protein) family protein )

HSP 1 Score: 51.2 bits (121), Expect = 1.2e-06
Identity = 30/78 (38.46%), Positives = 43/78 (55.13%), Query Frame = 0

Query: 106 LTEGPENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEYSGS-LSVHAKHIGHEKKE 165
           LT GPE TS+ VG+AQG     +Q            F   E+SGS +S++ ++    K  
Sbjct: 170 LTVGPEITSEEVGRAQGIFASADQNNFGLLMAFNFVFTKGEFSGSTVSMYGRNPIFSKVR 229

Query: 166 EMAVVGGTGSFAFAQGIA 183
           EM ++GGTG+F F +G A
Sbjct: 230 EMPIIGGTGAFRFGRGYA 247

BLAST of Sed0001684 vs. TAIR 10
Match: AT4G23690.1 (Disease resistance-responsive (dirigent-like protein) family protein )

HSP 1 Score: 50.8 bits (120), Expect = 1.5e-06
Identity = 49/157 (31.21%), Positives = 77/157 (49.04%), Query Frame = 0

Query: 42  ALLAVILLALL---SPVSHRKQSNQAKKPPWADLSLY----IQQPHSKANARSNNLQPVQ 101
           AL +  LL LL   + +S RK  +Q  K P    S Y    +    + ANA S  +    
Sbjct: 12  ALFSFFLLVLLFSDTVLSFRKTIDQ--KKPCKHFSFYFHDILYDGDNVANATSAAIVS-P 71

Query: 102 PDSGVFVFRR-ALTEGP-----ENTSQIVGKAQGFIIPNEQFARSSFNIIYLSFDTPEYS 161
           P  G F F +  + +GP        S+ V +AQGF   + +   +S+    L F++ E+ 
Sbjct: 72  PGLGNFKFGKFVIFDGPITMDKNYLSKPVARAQGFYFYDMKMDFNSWFSYTLVFNSTEHK 131

Query: 162 GSLSVHAKHIGHEKKEEMAVVGGTGSFAFAQGIAVFL 186
           G+L++    +  E   +++VVGGTG F  A+GIA F+
Sbjct: 132 GTLNIMGADLMMEPTRDLSVVGGTGDFFMARGIATFV 165

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value    Identity (%)   Description
XP_038888051.1    1.5e-85    81.07          dirigent protein 8 [Benincasa hispida]
XP_022952599.1    5.6e-80    86.19          uncharacterized protein LOC111455239 [Cucurbita moschata]
XP_022969428.1    1.2e-79    86.19          dirigent protein 8 [Cucurbita maxima]
XP_023511732.1    1.6e-79    86.19          uncharacterized protein LOC111776503 [Cucurbita pepo subsp. pepo]
KAG6572477.1      2.8e-79    85.64          Dirigent protein 6, partial [Cucurbita argyrosperma subsp. sororia] >KAG7012071....

ExPASy Swiss-Prot
Match Name        E-value    Identity (%)   Description
A0A1V1FH01        2.7e-08    38.53          Pterocarpan synthase 1 OS=Glycyrrhiza echinata OX=46348 GN=PTS1 PE=1 SV=1
I1JNN8            1.7e-07    35.14          Pterocarpan synthase 1 OS=Glycine max OX=3847 GN=PTS1 PE=2 SV=1
Q9SKQ2            4.3e-06    29.00          Dirigent protein 4 OS=Arabidopsis thaliana OX=3702 GN=DIR4 PE=2 SV=1
Q9FIG7            7.3e-06    37.18          Dirigent protein 2 OS=Arabidopsis thaliana OX=3702 GN=DIR2 PE=2 SV=1
Q9SUQ8            2.1e-05    31.21          Dirigent protein 6 OS=Arabidopsis thaliana OX=3702 GN=DIR6 PE=1 SV=1

ExPASy TrEMBL
Match Name        E-value    Identity (%)   Description
A0A6J1GM61        2.7e-80    86.19          Dirigent protein OS=Cucurbita moschata OX=3662 GN=LOC111455239 PE=3 SV=1
A0A6J1I0Y2        6.0e-80    86.19          Dirigent protein OS=Cucurbita maxima OX=3661 GN=LOC111468438 PE=3 SV=1
A0A5A7U5G8        5.9e-75    80.66          Dirigent protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold986G007...
A0A1S3BGM4        5.9e-75    80.66          Dirigent protein OS=Cucumis melo OX=3656 GN=LOC103489634 PE=3 SV=1
A0A6J1D409        1.9e-73    79.67          Dirigent protein OS=Momordica charantia OX=3673 GN=LOC111016784 PE=3 SV=1

TAIR 10
Match Name        E-value    Identity (%)   Description
AT5G42655.1       5.5e-33    51.54          Disease resistance-responsive (dirigent-like protein) family protein
AT2G21110.1       3.0e-07    29.00          Disease resistance-responsive (dirigent-like protein) family protein
AT5G42500.1       5.2e-07    37.18          Disease resistance-responsive (dirigent-like protein) family protein
AT3G58090.1       1.2e-06    38.46          Disease resistance-responsive (dirigent-like protein) family protein
AT4G23690.1       1.5e-06    31.21          Disease resistance-responsive (dirigent-like protein) family protein

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term    IPR Description                         Source    Source Term       Source Description    Alignment
IPR004265   Dirigent protein                        PFAM      PF03018           Dirigent              coord: 96..195; e-value: 9.1E-21; score: 74.2
IPR044859   Allene oxide cyclase/Dirigent protein   GENE3D    2.40.480.10       -                     coord: 70..208; e-value: 7.8E-9; score: 37.3
None        No IPR available                        PANTHER   PTHR21495:SF171   DIRIGENT PROTEIN      coord: 31..207
None        No IPR available                        PANTHER   PTHR21495         NUCLEOPORIN-RELATED   coord: 31..207

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Sed0001684.1    Sed0001684.1    mRNA
Sed0001684.2    Sed0001684.2    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
cellular_component    GO:0048046        apoplast
cellular_component    GO:0016021        integral component of membrane