Cla020112 (gene) Watermelon (97103) v1

Name: Cla020112
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Disease resistance response/ dirigent-like protein (AHRD V1 ***- Q0WPQ6_ARATH); contains Interpro domain(s) IPR004265 Plant disease resistance response protein
Location: Chr2 : 23543337 .. 23545830 (-)
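The location string above can be split into chromosome, start, end, and strand, from which the genomic span follows. The snippet below is a minimal sketch: `parse_location` is a hypothetical helper, not part of the database, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr2 : 23543337 .. 23545830 (-)' into its parts.

    Hypothetical helper for illustration only.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr2 : 23543337 .. 23545830 (-)")
# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
length = end - start + 1
print(chrom, strand, length)
```

Under that assumption the feature spans 2,494 bp on the minus strand of Chr2.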
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTAAAACTCCTCTTTATTTCTTCTTCTTCTTTTTGTTGGTGGGTTTCAGCGGTCTGAGATCTGCCATTTCTGCTAGAGTATTAATGGATGATGACGTCGACGCCGAGTCTCAGCCACAAGCGGCGGCGGTTACCCCTCCTCTGCCCACCATCCCCTCGCCGGCCACCACTTTTCCGGCAACTCAGGTTCCAATGAACACCATTTACTTTTACCCTTTTGTTCTATTTAGTTACTTATTATTATTAAAAACATATTGTCATTTTTTTAAAGATTCTCTGCCTAGTGGATTTTACAAACTAATTAATATAGGATTTTATTATAAAAGGAAGAATTTATTAGGAAAAGAAATTTTTTATTTTTATTTTTGAATTTAAGATATAAATTTGATTTTTCACCTACATTTTATATATAAAAGAATTACTATGGATAATACTTTCGGTTCAATTACATTTAAAATACTATTATGGTCTCTATACTTTAAAGTTTTATTCAATTTTAGTTCATTTATTTTTAATAAATTTTAAATTTAATCTCCTATATTAGGTTTTAGTCCCTAATACTAGTTTAAAAATATTTTTTTTTTATCTATAACATTTTTACTATAAATTTTAAAAACATATTCACATATTTTATTTTTTTCATGAAAATTATTATTATTATTCAATCAATCTCAATAAAAATTAATTTTGAGAGACTAAATTTAAGATTTATTGAGAATAAATGACTAAATTTGAATAATTAAAAGTATAAGGATTAAAATTCAACCAACTTCGTAGTAGAAAGACTATTTTAACCTTTTTATTTCAATAATATAACTTTTTTTCTTTTCATATGTGTATATATATTTTTTATGTGATTATGTTTGTTCAAATAATGGTATAACCACGTTTAACAATTGTGATAATTTTTTTTTTCATTAAATTAAAAGTTTTTTTTTACATAAATAATATAAATAATAAAGTGTATTTTGACCAATTTAACTTGTATAGTTAAATTGAGGAGCGATAGTTTGGTGTATTCCAAAATTCACTTATATTTTTATACATCTCACATTACCATCACACATAAAAAATAAATGAAAATATTTTAATAAAAAAAGTAAAAAGAATTTCTCGTGTGACTGCCCATTTTGAATTCCAGGGTTTCTCTGAATTTGAAATTATGATCGAATAATTTGGTGGAACACAAGTGTTGGAGTGCACTAAGTATTTTTTCAAATAAAATATATCCACATCAAAATGTTAAAAATAACAAAAAATAAAAAAATAAAAAATTAAAAACCCAATACTTTTTAGAGGTTTCTTTTTTTTTTTTTTTTTTAAGTTTGTAAACATATCCAATTTCATCACATTTCAAACCATCCTTTCATGTTTTTCAAAAACCAAAAAAGAATGTTAAATATTACCAAATAAGTTCCTAAGTCTGATTTCCAAACCCAATGAAATCTAGAGAAGTTACAAATTGTTTCATTCATTTATTCTTTACCAGGCCGGAACAACTACGCTACCGTCGACCACGAGCCAGATTCCAGCAACCACTCCATCTCCGGTCACCAACGACGAGGAGGAAGACGCCGCCGCAATACCTCAAACCAACCCACCAGTAGCAGCAGCCAACAATCCAGGGCCAACACAAGATCAAGAAGATGACAATACAACCACCACTCCAGCCGCCGTATCACCGGCTGCCGTTGTATCACCAGCTGCCGCCGTGCCCACATCAACACCAACGCTAACTCAGCCTCTTCCGGCTGCGGTAAAGGGCCCAGAACCAATTTCATTTTACATGCATGACATCCTCGGAGGATCCCATCCATCCGCCAGAGTAGTCACCGGAATCGTTGCTAACTCCGACAGCAGTGGCATCGCATTTTCAAAGCCCAACGACAACTTCTTCCCAATCCAAGGAACACTCCCTTTACTCAATAATGACAACCTCAAGAACATCATCAACAACAACAACAACCTCCCTTTCCTTGTTGGCTTCAACGGCGCCG
CTCAAGGCAACAACTTGCTCCTCCAAAACAGCGCCAACAATGGCGTCCTCAACGGTGACGAAGACAATAACCAGCCCTTTGTCACCGCTGGCCAGCTCCCTTCTAGAGTCACCCTTCAGCAGCTCATGTTCGGCTCTGTCACGGTGGTTGACGACGAGCTGACCGAAGGCCACGAGCTCGGGTCGGCAGTGGTGGGTCGGGCACAAGGTTTTTACTTGGCAAGCTCATTGGATGGGACCAGCCAGACAGTGGCTTTAACAGCGTTGTTTCACGGTGGTGGCCATGAGCACGTGGTTGAGGATACCATAAGCTTCTTTGGGGTTCATCGGACGGCGACGACGGAATCGCAGATTGCGGTAGTAGGAGGGACGGGGAAGTATGAAAATGCAAGAGGGTATGCGACGGTGGAGATGCTTCATCATCAAGAGGATCAACACACAACAGATGGTGTGGATACAATTATTCATTTTAGTGTTTATCTTACAGAGGAGTGA

mRNA sequence

ATGGCTAAAACTCCTCTTTATTTCTTCTTCTTCTTTTTGTTGGTGGGTTTCAGCGGTCTGAGATCTGCCATTTCTGCTAGAGTATTAATGGATGATGACGTCGACGCCGAGTCTCAGCCACAAGCGGCGGCGGTTACCCCTCCTCTGCCCACCATCCCCTCGCCGGCCACCACTTTTCCGGCAACTCAGGCCGGAACAACTACGCTACCGTCGACCACGAGCCAGATTCCAGCAACCACTCCATCTCCGGTCACCAACGACGAGGAGGAAGACGCCGCCGCAATACCTCAAACCAACCCACCAGTAGCAGCAGCCAACAATCCAGGGCCAACACAAGATCAAGAAGATGACAATACAACCACCACTCCAGCCGCCGTATCACCGGCTGCCGTTGTATCACCAGCTGCCGCCGTGCCCACATCAACACCAACGCTAACTCAGCCTCTTCCGGCTGCGGTAAAGGGCCCAGAACCAATTTCATTTTACATGCATGACATCCTCGGAGGATCCCATCCATCCGCCAGAGTAGTCACCGGAATCGTTGCTAACTCCGACAGCAGTGGCATCGCATTTTCAAAGCCCAACGACAACTTCTTCCCAATCCAAGGAACACTCCCTTTACTCAATAATGACAACCTCAAGAACATCATCAACAACAACAACAACCTCCCTTTCCTTGTTGGCTTCAACGGCGCCGCTCAAGGCAACAACTTGCTCCTCCAAAACAGCGCCAACAATGGCGTCCTCAACGGTGACGAAGACAATAACCAGCCCTTTGTCACCGCTGGCCAGCTCCCTTCTAGAGTCACCCTTCAGCAGCTCATGTTCGGCTCTGTCACGGTGGTTGACGACGAGCTGACCGAAGGCCACGAGCTCGGGTCGGCAGTGGTGGGTCGGGCACAAGGTTTTTACTTGGCAAGCTCATTGGATGGGACCAGCCAGACAGTGGCTTTAACAGCGTTGTTTCACGGTGGTGGCCATGAGCACGTGGTTGAGGATACCATAAGCTTCTTTGGGGTTCATCGGACGGCGACGACGGAATCGCAGATTGCGGTAGTAGGAGGGACGGGGAAGTATGAAAATGCAAGAGGGTATGCGACGGTGGAGATGCTTCATCATCAAGAGGATCAACACACAACAGATGGTGTGGATACAATTATTCATTTTAGTGTTTATCTTACAGAGGAGTGA

Coding sequence (CDS)

ATGGCTAAAACTCCTCTTTATTTCTTCTTCTTCTTTTTGTTGGTGGGTTTCAGCGGTCTGAGATCTGCCATTTCTGCTAGAGTATTAATGGATGATGACGTCGACGCCGAGTCTCAGCCACAAGCGGCGGCGGTTACCCCTCCTCTGCCCACCATCCCCTCGCCGGCCACCACTTTTCCGGCAACTCAGGCCGGAACAACTACGCTACCGTCGACCACGAGCCAGATTCCAGCAACCACTCCATCTCCGGTCACCAACGACGAGGAGGAAGACGCCGCCGCAATACCTCAAACCAACCCACCAGTAGCAGCAGCCAACAATCCAGGGCCAACACAAGATCAAGAAGATGACAATACAACCACCACTCCAGCCGCCGTATCACCGGCTGCCGTTGTATCACCAGCTGCCGCCGTGCCCACATCAACACCAACGCTAACTCAGCCTCTTCCGGCTGCGGTAAAGGGCCCAGAACCAATTTCATTTTACATGCATGACATCCTCGGAGGATCCCATCCATCCGCCAGAGTAGTCACCGGAATCGTTGCTAACTCCGACAGCAGTGGCATCGCATTTTCAAAGCCCAACGACAACTTCTTCCCAATCCAAGGAACACTCCCTTTACTCAATAATGACAACCTCAAGAACATCATCAACAACAACAACAACCTCCCTTTCCTTGTTGGCTTCAACGGCGCCGCTCAAGGCAACAACTTGCTCCTCCAAAACAGCGCCAACAATGGCGTCCTCAACGGTGACGAAGACAATAACCAGCCCTTTGTCACCGCTGGCCAGCTCCCTTCTAGAGTCACCCTTCAGCAGCTCATGTTCGGCTCTGTCACGGTGGTTGACGACGAGCTGACCGAAGGCCACGAGCTCGGGTCGGCAGTGGTGGGTCGGGCACAAGGTTTTTACTTGGCAAGCTCATTGGATGGGACCAGCCAGACAGTGGCTTTAACAGCGTTGTTTCACGGTGGTGGCCATGAGCACGTGGTTGAGGATACCATAAGCTTCTTTGGGGTTCATCGGACGGCGACGACGGAATCGCAGATTGCGGTAGTAGGAGGGACGGGGAAGTATGAAAATGCAAGAGGGTATGCGACGGTGGAGATGCTTCATCATCAAGAGGATCAACACACAACAGATGGTGTGGATACAATTATTCATTTTAGTGTTTATCTTACAGAGGAGTGA

Protein sequence

MAKTPLYFFFFFLLVGFSGLRSAISARVLMDDDVDAESQPQAAAVTPPLPTIPSPATTFPATQAGTTTLPSTTSQIPATTPSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQDQEDDNTTTTPAAVSPAAVVSPAAAVPTSTPTLTQPLPAAVKGPEPISFYMHDILGGSHPSARVVTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGNNLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAVVGRAQGFYLASSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHRTATTESQIAVVGGTGKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTEE
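The protein sequence is the conceptual translation of the coding sequence. As a rough sketch, assuming the standard genetic code (the page does not say which translation table was used), the first and last codons of the CDS can be checked against the protein like this; `translate` is an illustrative helper, not a database function.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases and last 9 bases of the CDS listed above.
print(translate("ATGGCTAAAACTCCTCTTTATTTC"))
print(translate("GAGGAGTGA"))
```

The two fragments translate to the opening residues `MAKTPLYF` and the closing residues `EE` of the protein shown above, with `TGA` as the stop codon.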
BLAST of Cla020112 vs. Swiss-Prot
Match: DIR24_ARATH (Dirigent protein 24 OS=Arabidopsis thaliana GN=DIR24 PE=2 SV=1)

HSP 1 Score: 307.8 bits (787), Expect = 1.8e-82
Identity = 165/266 (62.03%), Positives = 195/266 (73.31%), Query Frame = 1

Query: 133 SPAAAVPTSTPTLTQPLPA-AVKGPEPI-SFYMHDILGGSHPSARVVTGIVANSDSSGIA 192
           SP A   T TP    PLP  A  GPEPI  F+MHD+LGGSHPSARVVTGIVA ++ +GI 
Sbjct: 45  SPQAVTTTPTPI---PLPGPATGGPEPILEFFMHDVLGGSHPSARVVTGIVAQTEVNGIP 104

Query: 193 FSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGNNLLLQNSANNGVLN 252
           FSK ++N FP+   +PL+N +++ N+IN N   P L G +G+    N ++QNS  NG   
Sbjct: 105 FSKSSNNIFPVDNAVPLVNANSINNLINPNT-APLLTGLSGSQA--NTVIQNS--NGNSQ 164

Query: 253 GD-EDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAVVGRAQGFYLASSL 312
           G    NN PFVT GQLP    LQQLMFGS+TVVDDELTEGHELGSA++GRAQGFYLASSL
Sbjct: 165 GSLSSNNLPFVTTGQLPPIAALQQLMFGSITVVDDELTEGHELGSAIIGRAQGFYLASSL 224

Query: 313 DGTSQTVALTALFH-GGGHEHVVEDTISFFGVHRTATTESQIAVVGGTGKYENARGYATV 372
           DGTSQT++LT L H    H   ++D ISFFGVHRTA+  S IAVVGGTG++E+A+GYA V
Sbjct: 225 DGTSQTLSLTVLLHEDHDHHDTLDDAISFFGVHRTASHASHIAVVGGTGRFEHAKGYAVV 284

Query: 373 EMLHHQEDQHTTDGVDTIIHFSVYLT 395
           E LH+QEDQH TDG DTI+HFSVYLT
Sbjct: 285 ETLHNQEDQHVTDGHDTILHFSVYLT 302
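The percentages in each HSP header are ratios of matching columns to alignment length. The sketch below recomputes them from a header line; `hsp_percentages` and the regular expression are illustrative, not part of any BLAST tool, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches the counts in an HSP header line; tolerant of "Postives"/"Positives".
HSP_RE = re.compile(r"Identity = (\d+)/(\d+) .*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP header line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

header = "Identity = 165/266 (62.03%), Positives = 195/266 (73.31%), Query Frame = 1"
print(hsp_percentages(header))
```

For the DIR24_ARATH hit this reproduces the reported 62.03% identity and 73.31% positives over the 266 aligned columns.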

BLAST of Cla020112 vs. Swiss-Prot
Match: DIR9_ARATH (Dirigent protein 9 OS=Arabidopsis thaliana GN=DIR9 PE=2 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 1.3e-80
Identity = 157/295 (53.22%), Positives = 198/295 (67.12%), Query Frame = 1

Query: 101 PVAAANNPGPTQDQEDDNTTTTPAAVSPAAVVSPAAAVPTSTPTLTQPLPAAVKGPEPIS 160
           P        PT+ +E+D T   P   +     S A  VP      T+PL         + 
Sbjct: 39  PTGQIPTVAPTEAEEEDGTDDNPGLATTTTTAS-AVTVPAGPAEATEPL---------LE 98

Query: 161 FYMHDILGGSHPSARVVTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNN 220
           F+MHD+LGGSHPSARVVTGIVA ++ +GI FSK +++ FP+   +PL+N++N+ ++IN N
Sbjct: 99  FFMHDVLGGSHPSARVVTGIVAQTEVNGIPFSKASNSIFPVDNGVPLVNSNNINSVINPN 158

Query: 221 NNLPFLVGFNGAAQGNNLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVT 280
              P L G  GA     +   N  +N  L+    N+ PFVTAG LP    LQ LMFG++T
Sbjct: 159 T-APLLTGLGGAQTSTVIQNTNGNSNDALSA---NSLPFVTAGNLPPGAALQHLMFGTIT 218

Query: 281 VVDDELTEGHELGSAVVGRAQGFYLASSLDGTSQTVALTALFHGGGHEH-VVEDTISFFG 340
           VVDDELTE HELGSAV+GRAQGFYLASSLDGTSQT++LT L HG   +H  ++D ISFFG
Sbjct: 219 VVDDELTESHELGSAVIGRAQGFYLASSLDGTSQTLSLTVLLHGEHDQHDTLDDAISFFG 278

Query: 341 VHRTATTESQIAVVGGTGKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLT 395
           VHRTA+  SQIAV+GGTGK+E+A+GYA VE LH+Q++QH TDG DTI+HFSVYLT
Sbjct: 279 VHRTASHASQIAVIGGTGKFEHAKGYAIVETLHNQDNQHITDGQDTILHFSVYLT 319

BLAST of Cla020112 vs. Swiss-Prot
Match: DIR25_ARATH (Dirigent protein 25 OS=Arabidopsis thaliana GN=DIR25 PE=2 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 5.7e-65
Identity = 167/411 (40.63%), Positives = 229/411 (55.72%), Query Frame = 1

Query: 6   LYFFFFFLLVGFSGLRSAISARVLMDDDVDAESQPQAAAVTPPLPTI---PSP-ATTFPA 65
           L+F    L + F      +SA  L+D++ D    P       PLPT+   P P A + PA
Sbjct: 7   LFFLILALAITF------VSAARLLDEEEDIGLVPLPTTSPGPLPTVGLGPFPTANSGPA 66

Query: 66  T--QAGTTTLPSTTSQIPATT-PSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQDQEDDN 125
           T   +GT +       +   T P P++      ++ +P  +        PGP       +
Sbjct: 67  TGIASGTGSASGGLGSLGTNTGPGPLSTT---GSSLLPVASSGTLPVTGPGPLPT----S 126

Query: 126 TTTTPAAVSPAAVVSPAAAVPT-----STPTLTQPLPAAVKGPEP---ISFYMHDILGGS 185
           +   P A S     S +  +PT     +   L     + + G  P   + F+MHDILGGS
Sbjct: 127 SGLLPGASSGNLPGSGSGPLPTVGSGAAATGLGAGAGSVIGGSVPDNTLVFFMHDILGGS 186

Query: 186 HPSARVVTGIVANSDSSG-IAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGF 245
           +P+AR VTG+VAN+  SG I F+KPN    P+   +P   +DN  N I NNNN+P LVG 
Sbjct: 187 NPTARAVTGVVANAALSGQIPFAKPNGANLPVSNGVP---SDNNNNGILNNNNVPLLVGL 246

Query: 246 NGAAQGNNLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEG 305
            G       +LQN+ NN +LNG      P    GQLPS  +LQ LMFG++TV+D+ELTEG
Sbjct: 247 GGTTSN---ILQNNGNN-MLNG-----LPVANGGQLPSGSSLQMLMFGTLTVMDNELTEG 306

Query: 306 HELGSAVVGRAQGFYLASSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHRTATTESQ 365
           HELGS ++G+AQGFY+AS+LDGTSQT+A TA+F  GG+    ED+ISFFGVHRTA +ES 
Sbjct: 307 HELGSGLLGKAQGFYVASALDGTSQTMAFTAMFESGGY----EDSISFFGVHRTAASESH 366

Query: 366 IAVVGGTGKYENARGYATVEML------HHQEDQHTTDGVDTIIHFSVYLT 395
           + V+GGTGKY NARG+A V+          Q+    TDG++T++  +VYL+
Sbjct: 367 LGVMGGTGKYVNARGFAIVKTFTGSSGTQQQQPHQFTDGLETVLECTVYLS 388

BLAST of Cla020112 vs. Swiss-Prot
Match: DIR10_ARATH (Dirigent protein 10 OS=Arabidopsis thaliana GN=DIR10 PE=2 SV=1)

HSP 1 Score: 233.0 bits (593), Expect = 5.5e-60
Identity = 128/251 (51.00%), Positives = 168/251 (66.93%), Query Frame = 1

Query: 152 AVKGPE-PISFYMHDILGGSHPSARVVTGIVANSDSSG-IAFSKPNDNFFPIQGTLPLLN 211
           A  GP+  + F+MHDILGGS+P+AR VTG+VAN   SG + F+KPN    P+   +P  N
Sbjct: 210 ASAGPDNTLVFFMHDILGGSNPTARAVTGVVANPALSGQLPFAKPNGANLPVSNGVPSNN 269

Query: 212 NDNLKNIINNNNNLPFLVGFNGAAQGNNLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRV 271
           N+N    I NNNN+PFLVG  G     N+L  N+  N +LNG      P  + GQLPS  
Sbjct: 270 NNNG---IVNNNNVPFLVGLGGTTA--NILQNNNNGNNILNGF-----PVASGGQLPSGS 329

Query: 272 TLQQLMFGSVTVVDDELTEGHELGSAVVGRAQGFYLASSLDGTSQTVALTALFHGGGHEH 331
            LQ LMFG++TV+DDELTEGHELGS ++G+AQG+Y+AS++DGTSQT+A TA+F  GG+  
Sbjct: 330 ALQMLMFGTMTVIDDELTEGHELGSGLLGKAQGYYVASAIDGTSQTMAFTAMFESGGY-- 389

Query: 332 VVEDTISFFGVHRTATTESQIAVVGGTGKYENARGYATVEML------HHQEDQHTTDGV 391
             ED+ISFFGV RTA +ES I V+GGTGKY NARG+A ++           +    TDG+
Sbjct: 390 --EDSISFFGVLRTAVSESHIGVMGGTGKYVNARGFAILKTFTGSSGTQQNQPHQFTDGL 446

Query: 392 DTIIHFSVYLT 395
           +T++  +VYL+
Sbjct: 450 ETVVECTVYLS 446

BLAST of Cla020112 vs. Swiss-Prot
Match: DIR18_ARATH (Dirigent protein 18 OS=Arabidopsis thaliana GN=DIR18 PE=2 SV=1)

HSP 1 Score: 137.5 bits (345), Expect = 3.2e-31
Identity = 94/261 (36.02%), Positives = 140/261 (53.64%), Query Frame = 1

Query: 136 AAVPTSTPTLTQPLPAAVKGPEPI-SFYMHDILGGSHPSARVVTGIVANSDSSGIAFSKP 195
           AA+ T+T  L  P P      +PI   YMHDILGGS P+AR +TG++ N  +  + F+K 
Sbjct: 17  AALFTATTAL-DPAPE-----DPIFELYMHDILGGSSPTARPITGLLGNIYNGQVPFAK- 76

Query: 196 NDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAA-QGNNLLLQNSANNGVLNGDE 255
              F P Q  + + N +     +N  N +P   G +G A  G NL       NG+     
Sbjct: 77  QIGFVPPQNGVAIPNANGAMPTVNGINGIPLGTGLSGTAFSGQNL-------NGIQT--- 136

Query: 256 DNNQPFVTAGQL-PSRVTLQQLMFGSVTVVDDELTEGHELGSAVVGRAQGFYLASSLDGT 315
                     QL P  ++L    FG++TV+DD +T G +LGS  +G+AQG Y+ASS DG+
Sbjct: 137 ----------QLGPDGLSLG---FGTITVIDDIITSGPDLGSQPLGKAQGVYVASSADGS 196

Query: 316 SQTVALTALFHGGGHEHVVEDTISFFGVHRTATTESQIAVVGGTGKYENARGYATVEMLH 375
           +Q +A TA+  GG +     D ++F+G++R  +  S ++V GGTG+++NA G+A V  L 
Sbjct: 197 TQMMAFTAMLEGGEY----NDNLNFYGIYRIGSAMSHLSVTGGTGRFKNACGFAEVRPL- 242

Query: 376 HQEDQHTTDGVDTIIHFSVYL 394
               QH  DG + ++   V+L
Sbjct: 257 IPAGQHFVDGAEMLLRIIVHL 242

BLAST of Cla020112 vs. TrEMBL
Match: A0A0A0KLL0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G179750 PE=4 SV=1)

HSP 1 Score: 676.0 bits (1743), Expect = 2.8e-191
Identity = 350/400 (87.50%), Positives = 361/400 (90.25%), Query Frame = 1

Query: 1   MAKTPLYFFFFFLLVGFSGLRSAISARVLMDDD-VDAESQPQAAAVTPPLPTIPSPATTF 60
           MA+T L+FFFFFLLV F GLRSAI+AR+LMDDD  DAESQPQ AAVTPPL TI SPATTF
Sbjct: 1   MARTSLHFFFFFLLVSFYGLRSAIAARILMDDDDADAESQPQTAAVTPPLATISSPATTF 60

Query: 61  PATQAGTTTLPSTT-SQIPATTPSPV-TNDEEEDAAAIPQTNPPVAAANNPGPTQD-QED 120
           PATQ GTTTLPS T S IPATTPSP  TND+EED + IPQTN PVAA NN G TQD QED
Sbjct: 61  PATQGGTTTLPSITGSSIPATTPSPTATNDDEEDDSVIPQTNQPVAATNNLGQTQDDQED 120

Query: 121 DNTTTTPAAVSPAAVVSPAAAVPTSTPTLTQPLPAAVKGPEPISFYMHDILGGSHPSARV 180
           D+ TTTPAAVSPAA      AVPT     T+PLPAAVKGPEPISFYMHDILGGSHPSARV
Sbjct: 121 DSATTTPAAVSPAA------AVPTLPSPPTEPLPAAVKGPEPISFYMHDILGGSHPSARV 180

Query: 181 VTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGN 240
           VTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFL GFNG AQGN
Sbjct: 181 VTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLAGFNGVAQGN 240

Query: 241 NLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAV 300
           NLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAV
Sbjct: 241 NLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAV 300

Query: 301 VGRAQGFYLASSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHRTATTESQIAVVGGT 360
           VGRAQGFY+ASSLDGTSQTVALTALFH GGHEHVVED+ISFFGVHRTA   SQIAVVGGT
Sbjct: 301 VGRAQGFYMASSLDGTSQTVALTALFHSGGHEHVVEDSISFFGVHRTAMAGSQIAVVGGT 360

Query: 361 GKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTEE 397
           GKYENARGYATVEMLHHQEDQHTTDG+DTIIHFSVYLTEE
Sbjct: 361 GKYENARGYATVEMLHHQEDQHTTDGMDTIIHFSVYLTEE 394

BLAST of Cla020112 vs. TrEMBL
Match: A0A061FN99_THECC (Disease resistance-responsive family protein, putative OS=Theobroma cacao GN=TCM_042975 PE=4 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 1.4e-99
Identity = 232/413 (56.17%), Positives = 275/413 (66.59%), Query Frame = 1

Query: 1   MAKTPLYF--------FFFFLLVGFSGLRSAISARVLMDDDVDAESQPQAAAVTPPLPTI 60
           MAKTP+          +F FL + F+    A SARVL  D+V+A  QPQ       +  I
Sbjct: 1   MAKTPILLCKTLKATIYFLFLAIIFT---CANSARVL--DEVEA--QPQV------VDDI 60

Query: 61  PSPATTFPATQAGTTTLPSTTSQIPATTPSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQ 120
           P P+    AT     TLPS   Q+ ATTPS  + D+++    +P+   P AAA    P +
Sbjct: 61  PQPSNPV-ATTVPPNTLPS--GQVRATTPSG-SEDDDDTGPQLPEA--PAAAA---APAE 120

Query: 121 DQEDDNTTTTPAAVSPAAVVSPAAAVPTSTPTLTQPLPAAVKGP---EPI-SFYMHDILG 180
           D+    T   PAA   A +  PAA   T+         A V  P   +P+ SF+MHDILG
Sbjct: 121 DEAPVATPAAPAAGGVAPIAVPAATSATTGAGAAAAASATVATPGSHDPVLSFFMHDILG 180

Query: 181 GSHPSARVVTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDN---LKNIIN--NNNNL 240
           GSHPSARVVTGI+ANS+ SGI FSK N++ FP+QG  PLLN +N   LKNI N  N NN+
Sbjct: 181 GSHPSARVVTGIIANSEVSGIPFSKTNNDLFPVQGAAPLLNGNNINDLKNINNLINPNNV 240

Query: 241 PFLVGFNGAAQGNNLLLQNSANNG-VLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVV 300
           PFL G  GA    N +LQNS NN  VLNGD   +QPFVTAGQLP   +LQ+LMFGS+TV+
Sbjct: 241 PFLTGLTGAQ--TNAILQNSGNNNNVLNGD---SQPFVTAGQLPPG-SLQRLMFGSITVI 300

Query: 301 DDELTEGHELGSAVVGRAQGFYLASSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHR 360
           DDELTE HELGSAV+GRAQGFYLASSLDG+SQT+ALT L HGG H H +ED ISFFGVHR
Sbjct: 301 DDELTEAHELGSAVLGRAQGFYLASSLDGSSQTIALTVLLHGGEHGHELEDAISFFGVHR 360

Query: 361 TATTESQIAVVGGTGKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTE 396
           T +  SQIAVVGGTGKYENARGYATVE L HQEDQH TDGVDTI+HF+VYL +
Sbjct: 361 TVSPASQIAVVGGTGKYENARGYATVETL-HQEDQHITDGVDTILHFNVYLID 384

BLAST of Cla020112 vs. TrEMBL
Match: V4TI14_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10033309mg PE=4 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 5.5e-99
Identity = 217/389 (55.78%), Positives = 265/389 (68.12%), Query Frame = 1

Query: 20  LRSAISARVLMDDDVDAESQPQAAAVTP-PLPTIPSPATTFPATQAGTTTLPSTTSQIPA 79
           LR A SAR+L      AE  PQ   +   P PT P  ATT P T    TTLPS   Q PA
Sbjct: 25  LRCANSARIL------AEVDPQPPVIVDSPEPTNPV-ATTVPPT----TTLPS--GQFPA 84

Query: 80  TTPSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQDQEDDNTTTTPAAVSPAAVVSPAAAV 139
           T     T D++ DA       P   A     P  D +DD       A +P   V+P  AV
Sbjct: 85  TAAPDATPDDDSDA-------PGAEAPTKVIPPIDNDDD-------AAAPVDDVAPLPAV 144

Query: 140 PTST-------PTLTQPLPAAVKGPE--PISFYMHDILGGSHPSARVVTGIVANSDSSGI 199
           PT++       P +      A  GP+  P+ F+MHDILGGSHPSARVVTGI+A+++ +GI
Sbjct: 145 PTTSSPSGPASPAVAASATVASPGPQTPPLCFFMHDILGGSHPSARVVTGIIADTEINGI 204

Query: 200 AFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGNNLLLQNSANNGVL 259
            FSK NDNFFP+QG  PLL  +NL +IIN +N +PFL G NGA      +LQ++  + ++
Sbjct: 205 PFSKSNDNFFPVQGGTPLLTQNNLNDIINPDN-VPFLTGLNGAQPST--VLQDTGTSNIV 264

Query: 260 NGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAVVGRAQGFYLASSL 319
           NGD+  NQPFV+AGQLPS VTLQ+LMFG++TV+DDELTEGHELGS V+G+AQGFYLASSL
Sbjct: 265 NGDD--NQPFVSAGQLPSGVTLQKLMFGTMTVIDDELTEGHELGSGVIGKAQGFYLASSL 324

Query: 320 DGTSQTVALTALFHGGGHE---HVVEDTISFFGVHRTATTESQIAVVGGTGKYENARGYA 379
           DGTSQT+ALTAL HG  H+   H V DTISFFGV+RTA+ ESQ+AV+GG+GKYENA+GYA
Sbjct: 325 DGTSQTIALTALLHGEEHDHDHHDVLDTISFFGVYRTASHESQVAVIGGSGKYENAKGYA 380

Query: 380 TVEMLHHQEDQHTTDGVDTIIHFSVYLTE 396
           TVE + HQEDQHTTDGVDTI+ FSVYLT+
Sbjct: 385 TVETI-HQEDQHTTDGVDTIVKFSVYLTD 380

BLAST of Cla020112 vs. TrEMBL
Match: A0A067LCJ1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06633 PE=4 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 5.2e-97
Identity = 210/389 (53.98%), Positives = 258/389 (66.32%), Query Frame = 1

Query: 11  FFLLVGFSGLRSAISARVLMDDDVDAES----QPQAAAVTPPLPTIPSPATTFPATQAGT 70
           F L + FS   +A SARVL  DDVD ++     P A    PP  T+PS     PA     
Sbjct: 13  FLLAITFS---AANSARVL--DDVDPQNPVFVDPPATLTIPPSTTLPSGQV--PAIAPDE 72

Query: 71  TTLPSTTSQIPATTPSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQDQEDDNTTTTPAAV 130
              P    Q+P T  +P  + E  DA   P T P   AA  P PT       T+T+P+  
Sbjct: 73  ADSPLPVPQVPTTVEAPDADVEAPDADVAPITPPITVAAPIPLPTT-----TTSTSPSG- 132

Query: 131 SPAAVVSPAAAVPTSTPTLTQPLPAAVKGPEPISFYMHDILGGSHPSARVVTGIVANSDS 190
                  PA AVPTS  T+  P  +A +    +SF+MHDILGGS PS RVVTGI+A +D 
Sbjct: 133 -------PATAVPTSA-TVANPATSAPQ----LSFFMHDILGGSTPSVRVVTGIIARTDI 192

Query: 191 SGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGNNLLLQNSANN 250
           +GI FS+PN+NFFP+QG +PL N DN+ N+IN N   P + G N    G + +LQNS NN
Sbjct: 193 NGIPFSEPNNNFFPVQGGIPLTNIDNINNLINPNT-APLITGLNPGQTGTSTVLQNSGNN 252

Query: 251 GVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAVVGRAQGFYLA 310
              N    NNQPFV+AGQ+P+  TLQ++MFGS+TV+DDELTEGHELGSAV+G+AQGFYLA
Sbjct: 253 N--NVVNSNNQPFVSAGQIPASSTLQRIMFGSITVIDDELTEGHELGSAVMGKAQGFYLA 312

Query: 311 SSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHRTATTESQIAVVGGTGKYENARGYA 370
           SSLDGTS T+ALT + HG G+ H VEDTISFFGVHRTA+  SQIAV+GGTGKYEN +GYA
Sbjct: 313 SSLDGTSHTMALTIMLHGEGNGH-VEDTISFFGVHRTASHTSQIAVIGGTGKYENGKGYA 371

Query: 371 TVEMLHHQEDQHTTDGVDTIIHFSVYLTE 396
           TVE L  QE+QH TDGVDT++HF+VYL+E
Sbjct: 373 TVETL-PQENQHVTDGVDTVMHFNVYLSE 371

BLAST of Cla020112 vs. TrEMBL
Match: C6TDR1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_18G282600 PE=2 SV=1)

HSP 1 Score: 349.4 bits (895), Expect = 5.9e-93
Identity = 212/396 (53.54%), Positives = 263/396 (66.41%), Query Frame = 1

Query: 23  AISARVLMDDDVDAESQPQA-------AAVTPPLPTIP-------SPATTFPATQAGTTT 82
           A SAR+L  D+V  ESQPQA        A  P L T P        PATT P+   G T+
Sbjct: 29  ANSARIL--DEV--ESQPQAIGNLPAPGATNPSLTTAPVTTTPQIDPATTLPS---GQTS 88

Query: 83  LPSTTSQI-PATT-PSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQD--QEDDNTTTTPA 142
            P+T SQ+ PATT PS  T              P  AA  NP  T+D  +EDDN    P 
Sbjct: 89  APTTISQVGPATTLPSGQT--------------PATAATMNPATTEDSGEEDDNQVEPPV 148

Query: 143 --AVSPAAVVSPAAAVPTSTPTLTQPLPAAVKGPEPISFYMHDILGGSHPSARVVTGIVA 202
               +PAAV +PA     + P+     P ++      SF+MHDILGGS PSARVV GIVA
Sbjct: 149 PETEAPAAVTAPAEEETPAIPSAAPVTPNSLAKEPSFSFFMHDILGGSRPSARVVAGIVA 208

Query: 203 NSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGNNLLLQN 262
           N+D +G+ FSK N+N FPI G +PL+N   L  II NNN LP LVG +GA Q + +   +
Sbjct: 209 NTDVTGLPFSKLNNNLFPITGGIPLVN-PKLNGIITNNN-LPNLVGLSGA-QSSTVFKNS 268

Query: 263 SANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAVVGRAQG 322
             +N V  G   NNQPFV+AG LP+  T+Q+LMFGSVTV+DD+LTEGHEL SAV+G+AQG
Sbjct: 269 GTSNTVTGG---NNQPFVSAGNLPAGFTIQKLMFGSVTVIDDQLTEGHELDSAVIGKAQG 328

Query: 323 FYLASSLDGTSQTVALTALFHGGGHEH---VVEDTISFFGVHRTATTESQIAVVGGTGKY 382
           FYLASSLDG+SQT+ LT L HGG H+    VV+D+I+FFG+HRTA++ES++AV+GGTGKY
Sbjct: 329 FYLASSLDGSSQTILLTVLVHGGEHDQHHDVVDDSINFFGIHRTASSESEVAVIGGTGKY 388

Query: 383 ENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTE 396
           ENARGYA++E L  +EDQHTTDGVDTI+HF+VYLTE
Sbjct: 389 ENARGYASLETL-LKEDQHTTDGVDTILHFNVYLTE 396

BLAST of Cla020112 vs. NCBI nr
Match: gi|659123356|ref|XP_008461621.1| (PREDICTED: dirigent protein 24 [Cucumis melo])

HSP 1 Score: 688.0 bits (1774), Expect = 1.0e-194
Identity = 352/400 (88.00%), Positives = 363/400 (90.75%), Query Frame = 1

Query: 1   MAKTPLYFFFFF-LLVGFSGLRSAISARVLMDDDVDAESQPQAAAVTPPLPTIPSPATTF 60
           MAKTPLYFFFFF LLV F G+RSAI+ARVLMDDD DAES  Q  AVTPPL TIPSPATTF
Sbjct: 1   MAKTPLYFFFFFFLLVSFYGVRSAIAARVLMDDDADAESLSQTPAVTPPLATIPSPATTF 60

Query: 61  PATQAGTTTLPSTT-SQIPATTPSPV-TNDEEEDAAAIPQTNPPVAAANNPGPTQD-QED 120
           PATQ GTT LPSTT S +PATTPSP  TND+EED +AIPQTNPPVAA NNPGPTQD QED
Sbjct: 61  PATQGGTTALPSTTGSSVPATTPSPAATNDDEEDDSAIPQTNPPVAATNNPGPTQDDQED 120

Query: 121 DNTTTTPAAVSPAAVVSPAAAVPTSTPTLTQPLPAAVKGPEPISFYMHDILGGSHPSARV 180
           DN TTTPAAVSPAA      AVPT     T+PLPAAVKGPEPISFYMHDILGGSHPSARV
Sbjct: 121 DNATTTPAAVSPAA------AVPTLPSPPTEPLPAAVKGPEPISFYMHDILGGSHPSARV 180

Query: 181 VTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGN 240
           VTGIVANSDSSGIAFSKPNDNFFPIQGTLPL NNDNLKNIINNNNNLPFL GFNG AQGN
Sbjct: 181 VTGIVANSDSSGIAFSKPNDNFFPIQGTLPLFNNDNLKNIINNNNNLPFLAGFNGVAQGN 240

Query: 241 NLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAV 300
           NLLLQNSANNG+L+GDEDNNQPFVTAGQLPSRVTLQQLMFGSVTV+DDELTEGHELGSAV
Sbjct: 241 NLLLQNSANNGILSGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVIDDELTEGHELGSAV 300

Query: 301 VGRAQGFYLASSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHRTATTESQIAVVGGT 360
           VGRAQGFY+ASSLDGTSQTVALTALFH GGHEHV ED+ISFFGVHRTAT  SQIAVVGGT
Sbjct: 301 VGRAQGFYMASSLDGTSQTVALTALFHSGGHEHVAEDSISFFGVHRTATAGSQIAVVGGT 360

Query: 361 GKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTEE 397
           GKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTEE
Sbjct: 361 GKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTEE 394

BLAST of Cla020112 vs. NCBI nr
Match: gi|778700918|ref|XP_004147710.2| (PREDICTED: dirigent protein 24 [Cucumis sativus])

HSP 1 Score: 676.0 bits (1743), Expect = 3.9e-191
Identity = 350/400 (87.50%), Positives = 361/400 (90.25%), Query Frame = 1

Query: 1   MAKTPLYFFFFFLLVGFSGLRSAISARVLMDDD-VDAESQPQAAAVTPPLPTIPSPATTF 60
           MA+T L+FFFFFLLV F GLRSAI+AR+LMDDD  DAESQPQ AAVTPPL TI SPATTF
Sbjct: 1   MARTSLHFFFFFLLVSFYGLRSAIAARILMDDDDADAESQPQTAAVTPPLATISSPATTF 60

Query: 61  PATQAGTTTLPSTT-SQIPATTPSPV-TNDEEEDAAAIPQTNPPVAAANNPGPTQD-QED 120
           PATQ GTTTLPS T S IPATTPSP  TND+EED + IPQTN PVAA NN G TQD QED
Sbjct: 61  PATQGGTTTLPSITGSSIPATTPSPTATNDDEEDDSVIPQTNQPVAATNNLGQTQDDQED 120

Query: 121 DNTTTTPAAVSPAAVVSPAAAVPTSTPTLTQPLPAAVKGPEPISFYMHDILGGSHPSARV 180
           D+ TTTPAAVSPAA      AVPT     T+PLPAAVKGPEPISFYMHDILGGSHPSARV
Sbjct: 121 DSATTTPAAVSPAA------AVPTLPSPPTEPLPAAVKGPEPISFYMHDILGGSHPSARV 180

Query: 181 VTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGN 240
           VTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFL GFNG AQGN
Sbjct: 181 VTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLAGFNGVAQGN 240

Query: 241 NLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAV 300
           NLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAV
Sbjct: 241 NLLLQNSANNGVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAV 300

Query: 301 VGRAQGFYLASSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHRTATTESQIAVVGGT 360
           VGRAQGFY+ASSLDGTSQTVALTALFH GGHEHVVED+ISFFGVHRTA   SQIAVVGGT
Sbjct: 301 VGRAQGFYMASSLDGTSQTVALTALFHSGGHEHVVEDSISFFGVHRTAMAGSQIAVVGGT 360

Query: 361 GKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTEE 397
           GKYENARGYATVEMLHHQEDQHTTDG+DTIIHFSVYLTEE
Sbjct: 361 GKYENARGYATVEMLHHQEDQHTTDGMDTIIHFSVYLTEE 394

BLAST of Cla020112 vs. NCBI nr
Match: gi|590564096|ref|XP_007009563.1| (Disease resistance-responsive family protein, putative [Theobroma cacao])

HSP 1 Score: 371.3 bits (952), Expect = 2.1e-99
Identity = 232/413 (56.17%), Positives = 275/413 (66.59%), Query Frame = 1

Query: 1   MAKTPLYF--------FFFFLLVGFSGLRSAISARVLMDDDVDAESQPQAAAVTPPLPTI 60
           MAKTP+          +F FL + F+    A SARVL  D+V+A  QPQ       +  I
Sbjct: 1   MAKTPILLCKTLKATIYFLFLAIIFT---CANSARVL--DEVEA--QPQV------VDDI 60

Query: 61  PSPATTFPATQAGTTTLPSTTSQIPATTPSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQ 120
           P P+    AT     TLPS   Q+ ATTPS  + D+++    +P+   P AAA    P +
Sbjct: 61  PQPSNPV-ATTVPPNTLPS--GQVRATTPSG-SEDDDDTGPQLPEA--PAAAA---APAE 120

Query: 121 DQEDDNTTTTPAAVSPAAVVSPAAAVPTSTPTLTQPLPAAVKGP---EPI-SFYMHDILG 180
           D+    T   PAA   A +  PAA   T+         A V  P   +P+ SF+MHDILG
Sbjct: 121 DEAPVATPAAPAAGGVAPIAVPAATSATTGAGAAAAASATVATPGSHDPVLSFFMHDILG 180

Query: 181 GSHPSARVVTGIVANSDSSGIAFSKPNDNFFPIQGTLPLLNNDN---LKNIIN--NNNNL 240
           GSHPSARVVTGI+ANS+ SGI FSK N++ FP+QG  PLLN +N   LKNI N  N NN+
Sbjct: 181 GSHPSARVVTGIIANSEVSGIPFSKTNNDLFPVQGAAPLLNGNNINDLKNINNLINPNNV 240

Query: 241 PFLVGFNGAAQGNNLLLQNSANNG-VLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVV 300
           PFL G  GA    N +LQNS NN  VLNGD   +QPFVTAGQLP   +LQ+LMFGS+TV+
Sbjct: 241 PFLTGLTGAQ--TNAILQNSGNNNNVLNGD---SQPFVTAGQLPPG-SLQRLMFGSITVI 300

Query: 301 DDELTEGHELGSAVVGRAQGFYLASSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHR 360
           DDELTE HELGSAV+GRAQGFYLASSLDG+SQT+ALT L HGG H H +ED ISFFGVHR
Sbjct: 301 DDELTEAHELGSAVLGRAQGFYLASSLDGSSQTIALTVLLHGGEHGHELEDAISFFGVHR 360

Query: 361 TATTESQIAVVGGTGKYENARGYATVEMLHHQEDQHTTDGVDTIIHFSVYLTE 396
           T +  SQIAVVGGTGKYENARGYATVE L HQEDQH TDGVDTI+HF+VYL +
Sbjct: 361 TVSPASQIAVVGGTGKYENARGYATVETL-HQEDQHITDGVDTILHFNVYLID 384

BLAST of Cla020112 vs. NCBI nr
Match: gi|567887208|ref|XP_006436126.1| (hypothetical protein CICLE_v10033309mg [Citrus clementina])

HSP 1 Score: 369.4 bits (947), Expect = 7.9e-99
Identity = 217/389 (55.78%), Positives = 265/389 (68.12%), Query Frame = 1

Query: 20  LRSAISARVLMDDDVDAESQPQAAAVTP-PLPTIPSPATTFPATQAGTTTLPSTTSQIPA 79
           LR A SAR+L      AE  PQ   +   P PT P  ATT P T    TTLPS   Q PA
Sbjct: 25  LRCANSARIL------AEVDPQPPVIVDSPEPTNPV-ATTVPPT----TTLPS--GQFPA 84

Query: 80  TTPSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQDQEDDNTTTTPAAVSPAAVVSPAAAV 139
           T     T D++ DA       P   A     P  D +DD       A +P   V+P  AV
Sbjct: 85  TAAPDATPDDDSDA-------PGAEAPTKVIPPIDNDDD-------AAAPVDDVAPLPAV 144

Query: 140 PTST-------PTLTQPLPAAVKGPE--PISFYMHDILGGSHPSARVVTGIVANSDSSGI 199
           PT++       P +      A  GP+  P+ F+MHDILGGSHPSARVVTGI+A+++ +GI
Sbjct: 145 PTTSSPSGPASPAVAASATVASPGPQTPPLCFFMHDILGGSHPSARVVTGIIADTEINGI 204

Query: 200 AFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGNNLLLQNSANNGVL 259
            FSK NDNFFP+QG  PLL  +NL +IIN +N +PFL G NGA      +LQ++  + ++
Sbjct: 205 PFSKSNDNFFPVQGGTPLLTQNNLNDIINPDN-VPFLTGLNGAQPST--VLQDTGTSNIV 264

Query: 260 NGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAVVGRAQGFYLASSL 319
           NGD+  NQPFV+AGQLPS VTLQ+LMFG++TV+DDELTEGHELGS V+G+AQGFYLASSL
Sbjct: 265 NGDD--NQPFVSAGQLPSGVTLQKLMFGTMTVIDDELTEGHELGSGVIGKAQGFYLASSL 324

Query: 320 DGTSQTVALTALFHGGGHE---HVVEDTISFFGVHRTATTESQIAVVGGTGKYENARGYA 379
           DGTSQT+ALTAL HG  H+   H V DTISFFGV+RTA+ ESQ+AV+GG+GKYENA+GYA
Sbjct: 325 DGTSQTIALTALLHGEEHDHDHHDVLDTISFFGVYRTASHESQVAVIGGSGKYENAKGYA 380

Query: 380 TVEMLHHQEDQHTTDGVDTIIHFSVYLTE 396
           TVE + HQEDQHTTDGVDTI+ FSVYLT+
Sbjct: 385 TVETI-HQEDQHTTDGVDTIVKFSVYLTD 380

BLAST of Cla020112 vs. NCBI nr
Match: gi|802539904|ref|XP_012073380.1| (PREDICTED: dirigent protein 9 [Jatropha curcas])

HSP 1 Score: 362.8 bits (930), Expect = 7.4e-97
Identity = 210/389 (53.98%), Positives = 258/389 (66.32%), Query Frame = 1

Query: 11  FFLLVGFSGLRSAISARVLMDDDVDAES----QPQAAAVTPPLPTIPSPATTFPATQAGT 70
           F L + FS   +A SARVL  DDVD ++     P A    PP  T+PS     PA     
Sbjct: 13  FLLAITFS---AANSARVL--DDVDPQNPVFVDPPATLTIPPSTTLPSGQV--PAIAPDE 72

Query: 71  TTLPSTTSQIPATTPSPVTNDEEEDAAAIPQTNPPVAAANNPGPTQDQEDDNTTTTPAAV 130
              P    Q+P T  +P  + E  DA   P T P   AA  P PT       T+T+P+  
Sbjct: 73  ADSPLPVPQVPTTVEAPDADVEAPDADVAPITPPITVAAPIPLPTT-----TTSTSPSG- 132

Query: 131 SPAAVVSPAAAVPTSTPTLTQPLPAAVKGPEPISFYMHDILGGSHPSARVVTGIVANSDS 190
                  PA AVPTS  T+  P  +A +    +SF+MHDILGGS PS RVVTGI+A +D 
Sbjct: 133 -------PATAVPTSA-TVANPATSAPQ----LSFFMHDILGGSTPSVRVVTGIIARTDI 192

Query: 191 SGIAFSKPNDNFFPIQGTLPLLNNDNLKNIINNNNNLPFLVGFNGAAQGNNLLLQNSANN 250
           +GI FS+PN+NFFP+QG +PL N DN+ N+IN N   P + G N    G + +LQNS NN
Sbjct: 193 NGIPFSEPNNNFFPVQGGIPLTNIDNINNLINPNT-APLITGLNPGQTGTSTVLQNSGNN 252

Query: 251 GVLNGDEDNNQPFVTAGQLPSRVTLQQLMFGSVTVVDDELTEGHELGSAVVGRAQGFYLA 310
              N    NNQPFV+AGQ+P+  TLQ++MFGS+TV+DDELTEGHELGSAV+G+AQGFYLA
Sbjct: 253 N--NVVNSNNQPFVSAGQIPASSTLQRIMFGSITVIDDELTEGHELGSAVMGKAQGFYLA 312

Query: 311 SSLDGTSQTVALTALFHGGGHEHVVEDTISFFGVHRTATTESQIAVVGGTGKYENARGYA 370
           SSLDGTS T+ALT + HG G+ H VEDTISFFGVHRTA+  SQIAV+GGTGKYEN +GYA
Sbjct: 313 SSLDGTSHTMALTIMLHGEGNGH-VEDTISFFGVHRTASHTSQIAVIGGTGKYENGKGYA 371

Query: 371 TVEMLHHQEDQHTTDGVDTIIHFSVYLTE 396
           TVE L  QE+QH TDGVDT++HF+VYL+E
Sbjct: 373 TVETL-PQENQHVTDGVDTVMHFNVYLSE 371

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
DIR24_ARATH	1.8e-82	62.03	Dirigent protein 24 OS=Arabidopsis thaliana GN=DIR24 PE=2 SV=1
DIR9_ARATH	1.3e-80	53.22	Dirigent protein 9 OS=Arabidopsis thaliana GN=DIR9 PE=2 SV=1
DIR25_ARATH	5.7e-65	40.63	Dirigent protein 25 OS=Arabidopsis thaliana GN=DIR25 PE=2 SV=1
DIR10_ARATH	5.5e-60	51.00	Dirigent protein 10 OS=Arabidopsis thaliana GN=DIR10 PE=2 SV=1
DIR18_ARATH	3.2e-31	36.02	Dirigent protein 18 OS=Arabidopsis thaliana GN=DIR18 PE=2 SV=1

Match Name	E-value	Identity	Description
A0A0A0KLL0_CUCSA	2.8e-191	87.50	Uncharacterized protein OS=Cucumis sativus GN=Csa_5G179750 PE=4 SV=1
A0A061FN99_THECC	1.4e-99	56.17	Disease resistance-responsive family protein, putative OS=Theobroma cacao GN=TCM...
V4TI14_9ROSI	5.5e-99	55.78	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10033309mg PE=4 SV=1
A0A067LCJ1_JATCU	5.2e-97	53.98	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06633 PE=4 SV=1
C6TDR1_SOYBN	5.9e-93	53.54	Uncharacterized protein OS=Glycine max GN=GLYMA_18G282600 PE=2 SV=1

Match Name	E-value	Identity	Description
gi|659123356|ref|XP_008461621.1|	1.0e-194	88.00	PREDICTED: dirigent protein 24 [Cucumis melo]
gi|778700918|ref|XP_004147710.2|	3.9e-191	87.50	PREDICTED: dirigent protein 24 [Cucumis sativus]
gi|590564096|ref|XP_007009563.1|	2.1e-99	56.17	Disease resistance-responsive family protein, putative [Theobroma cacao]
gi|567887208|ref|XP_006436126.1|	7.9e-99	55.78	hypothetical protein CICLE_v10033309mg [Citrus clementina]
gi|802539904|ref|XP_012073380.1|	7.4e-97	53.98	PREDICTED: dirigent protein 9 [Jatropha curcas]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR004265	Dirigent
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0016021	integral component of membrane
cellular_component	GO:0048046	apoplast
cellular_component	GO:0005576	extracellular region
cellular_component	GO:0016020	membrane
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla020112	Cla020112.1	mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR004265	Plant disease resistance response protein	PFAM	PF03018	Dirigent	coord: 251..393; score: 4.9
None	No IPR available	PANTHER	PTHR21495	NUCLEOPORIN-RELATED	coord: 148..396; score: 8.2E
None	No IPR available	PANTHER	PTHR21495:SF51	DIRIGENT PROTEIN 24-RELATED	coord: 148..396; score: 8.2E