Cla97C01G007920 (gene) Watermelon (97103) v2

Name: Cla97C01G007920
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: bifunctional endo-1,4-beta-xylanase XylA-like
Location: Cla97Chr01 : 8116809 .. 8119708 (-)
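The location field packs the chromosome, start, end and strand into one string. As a minimal sketch, the snippet below splits that string into its parts and derives the span length. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid : start .. end (strand)' into a dict.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cla97Chr01 : 8116809 .. 8119708 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2900 bp on the minus strand of Cla97Chr01.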
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTAATTGGAGAGAACGGCCTAGGAGAAATTTTCGGAACCAGAAGCCTCCATGGTCGGCGATAAGAAATTACGATCACGATCCTTCATTAGGTGATTTTCTCCCTTTGTTATTTTTAGAATTAGGGTGCAGTAGGGATTCTTGGAATTTTTTTGTTCTTCATTGAATGAATGGAACTCGACTTCTTTGATTAGTGGATATTGGTAGTTATTATTCATTAGTTCAGCTGTTGAATTGAATGGACATCTGGGTTGTTTTAGTTGGTTCTGTTTTGAAGAATGGTTTTTAGGCAAAATGTTTAGGTAAAGTGAAGACTAGGTTGTAGTTAGCACGAATATTGATTGGACCCTTTGTTTTATTTCTTCCCATAAAGTTGTGAATCTTTGATGCCTGCCACTATTGTACGGGCAATAAGTTACTGTTTATAGTAGTTTCTGTCGATAAAAAATGCAATGCAACGTCTTATTACGTTCTTTTTTTCCTTCCCTTATTTGGAAAGATGAGGAAAGTGAATGTGATCAGTGTGCTGTGTGACGGGACATTGGGAGTTGTAGACTCCGTTTGATAACACTTGGCTTTTGATTCTAGAAAATTAAGCTTATGTATACTACTTCTACTACTAAGTTTCTACATTTTGTTATCCACTTTTGGTTAATGTTTTCAAAAACCAAGTGGAATTTTGAAAACTAAAAAGAAAAAAATAGTTTTCAAAAAGTAGTTTTTGTTTTTAGAATTTGAAGATGGAGAGGAATCAAACATAAATTTAAAAAATTTAAAACTAAAAATAAAATTATAATCAAACAAGTCGTAGTAATTATGGAAATTGGAAGTGATCCTTTCTAATTTTGTTAGAACTACTCCCAATTTGTAGAATAAGTCAGATATTTCTAGATGATTGGTGTCTTAGGGGTAAGTGTGTTGCTTATTTCTTCTATTCATTGAACTTTAGTTGAATAATAATGTTAATAGGTCATTTTCTCAGAGATATATGAACCATAGAAACTGCTGATGGTTACTGTTGTAAGATGATGTACCATCTTATTACATTGCAAGAAAAAATCTCATGAAAATTGGTGATTGCAGGAAACAGGGATAAAATTGATTGTACGACCTTAACATTTTTTTCCCTATTTCTATTTTTTTTAATCAAAATTTGCAATTCAGAGAAGGGCAAGTGGTGGATTACAGTGGTTTATGAGATCAGAAATGACAAGAGAGTTTCCACAAAACAAGTTTGATTAGTCATAAAATTAACACTCAATTAAACTTCAAACTAAATTGTATCTAATCACTTTCTAATTATTGAAACTGAATATGTAGACAGGGAACTTGAGGAGGATTCAATGCCCTTTTGTACATTATTCATGTCAATCATTCTAGAGGGAGAAGGTAACAGTGAATTGTTAAGGGTGATGTAGTGGAACAAAAGATATGGAAAGAGAAGGAAGAAGAAGATCCAAAAAGGGAGGAGACAGAGATTCTGAGAGAGAAAATAGGGAGTTGGAGAGAGAAATGGATAAAAAATATGGGAGGGAGAGGAGGCTGCAACATAAACATTAAATGTTCAATGAATAAAACTTAATTAGATCAAAATATTATTAAATATTAATAAACAATGGCACATCACCACTTAATTACCATGACTTGTTCGTCGAAGTTCCTAACTCCTTAGAGAAAGTTGATGGTAGGGGCTAATTTGGCTAATCTTCAAAACCTCAAGATAGAATTGTTTGTGTTACAGCTTCAGGATTTAAAATGAAACCTAAACTTATTATTAGGGACCGGAATAATAAATATTCCATCTTTTTAATCTCTTTACTGATTTTAAAGCATTTGGAAATATTTGATCGTAGATTTTTTATTTGTTTGGATAAACAGAAATGCAAATTACACCTTGTTTTGTAGGATCCTCATCCATATTATTAATCCTTCAATGAACTTAATGGTTCTGAATTAATTTCATCATTTCTTCTTCGTGTAGAGTATTGGCGAGATGGT
ATTCCTTTATGGGAAAAGAAATTTTGCAAAGAGATCGGTTGTGTGCCATGGGGTAAAATAGTTGATTCCAAGAACTTTATTTATTGTCATACTAATGTAGTCAATTGGGATGATTCCGCCTGTGAAGAGGCATTCCACAATGCCAAAAGACGTTACTGGGTAGAGATTAATGGTCATCAGAGTGACATCTATTTGCCTGATCCAGACAAATATATTGAACAAATCGATTGGAGTCCTGATATGGATTCTGAAATGATTGAGGAACTGGATTGGGCATATCACACTCCAAATATGAAACAACATGATGTTTGGTTGGAATGCAGAAATAAAAGGACAAGGAACTCCAGTTCTGTTTGGACAGAAGGCGACAATGAAGATCCAGGTCATGTGGGTAATCCCTGGGGAGATGATAATCAATTCACTGAAAATACAGGGCAAGGATGCAGCCAGTGGAATTTAAGTGATTCAGGGAAGGTGAATAATGATCGCAATCCTTGGGACAGTAGCATTGACGAGGGGAACAGGGGTATGGTGGACAGTGCCTGGAAGGTTAAAGCGAATCAAGTTGCTACCTCTTGGAAGAATAAAGAATTTGCTAGTGATGCAAGTGGTGTGGTAGACAATGCCTGGAGAGATAGATTGCATCAAGGTGGCACTGCCTCCTGGAAGACTAGAGGGTTTGCTAGTGATGCAAGGAATAACTCATGGAGCAGGCACCAGCAGGGTGCCAGTAATTTTGATCATTATAATAGGCCTGGGAATAACAGTTACAATCGTAATGTCAGACATTTAACTGATAGAACACGGCCGAATATTCACAGAAACAACCAAGATTGGAAATATCAGTATAGGTATGGCAAGAGGCCAAAAGATGCAGAATTTGATAACTTTGGGAGGTAG

mRNA sequence

ATGGGTAATTGGAGAGAACGGCCTAGGAGAAATTTTCGGAACCAGAAGCCTCCATGGTCGGCGATAAGAAATTACGATCACGATCCTTCATTAGAGTATTGGCGAGATGGTATTCCTTTATGGGAAAAGAAATTTTGCAAAGAGATCGGTTGTGTGCCATGGGGTAAAATAGTTGATTCCAAGAACTTTATTTATTGTCATACTAATGTAGTCAATTGGGATGATTCCGCCTGTGAAGAGGCATTCCACAATGCCAAAAGACGTTACTGGGTAGAGATTAATGGTCATCAGAGTGACATCTATTTGCCTGATCCAGACAAATATATTGAACAAATCGATTGGAGTCCTGATATGGATTCTGAAATGATTGAGGAACTGGATTGGGCATATCACACTCCAAATATGAAACAACATGATGTTTGGTTGGAATGCAGAAATAAAAGGACAAGGAACTCCAGTTCTGTTTGGACAGAAGGCGACAATGAAGATCCAGGTCATGTGGGTAATCCCTGGGGAGATGATAATCAATTCACTGAAAATACAGGGCAAGGATGCAGCCAGTGGAATTTAAGTGATTCAGGGAAGGTGAATAATGATCGCAATCCTTGGGACAGTAGCATTGACGAGGGGAACAGGGGTATGGTGGACAGTGCCTGGAAGGTTAAAGCGAATCAAGTTGCTACCTCTTGGAAGAATAAAGAATTTGCTAGTGATGCAAGTGGTGTGGTAGACAATGCCTGGAGAGATAGATTGCATCAAGGTGGCACTGCCTCCTGGAAGACTAGAGGGTTTGCTAGTGATGCAAGGAATAACTCATGGAGCAGGCACCAGCAGGGTGCCAGTAATTTTGATCATTATAATAGGCCTGGGAATAACAGTTACAATCGTAATGTCAGACATTTAACTGATAGAACACGGCCGAATATTCACAGAAACAACCAAGATTGGAAATATCAGTATAGGTATGGCAAGAGGCCAAAAGATGCAGAATTTGATAACTTTGGGAGGTAG

Coding sequence (CDS)

ATGGGTAATTGGAGAGAACGGCCTAGGAGAAATTTTCGGAACCAGAAGCCTCCATGGTCGGCGATAAGAAATTACGATCACGATCCTTCATTAGAGTATTGGCGAGATGGTATTCCTTTATGGGAAAAGAAATTTTGCAAAGAGATCGGTTGTGTGCCATGGGGTAAAATAGTTGATTCCAAGAACTTTATTTATTGTCATACTAATGTAGTCAATTGGGATGATTCCGCCTGTGAAGAGGCATTCCACAATGCCAAAAGACGTTACTGGGTAGAGATTAATGGTCATCAGAGTGACATCTATTTGCCTGATCCAGACAAATATATTGAACAAATCGATTGGAGTCCTGATATGGATTCTGAAATGATTGAGGAACTGGATTGGGCATATCACACTCCAAATATGAAACAACATGATGTTTGGTTGGAATGCAGAAATAAAAGGACAAGGAACTCCAGTTCTGTTTGGACAGAAGGCGACAATGAAGATCCAGGTCATGTGGGTAATCCCTGGGGAGATGATAATCAATTCACTGAAAATACAGGGCAAGGATGCAGCCAGTGGAATTTAAGTGATTCAGGGAAGGTGAATAATGATCGCAATCCTTGGGACAGTAGCATTGACGAGGGGAACAGGGGTATGGTGGACAGTGCCTGGAAGGTTAAAGCGAATCAAGTTGCTACCTCTTGGAAGAATAAAGAATTTGCTAGTGATGCAAGTGGTGTGGTAGACAATGCCTGGAGAGATAGATTGCATCAAGGTGGCACTGCCTCCTGGAAGACTAGAGGGTTTGCTAGTGATGCAAGGAATAACTCATGGAGCAGGCACCAGCAGGGTGCCAGTAATTTTGATCATTATAATAGGCCTGGGAATAACAGTTACAATCGTAATGTCAGACATTTAACTGATAGAACACGGCCGAATATTCACAGAAACAACCAAGATTGGAAATATCAGTATAGGTATGGCAAGAGGCCAAAAGATGCAGAATTTGATAACTTTGGGAGGTAG

Protein sequence

MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDSKNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMDSEMIEELDWAYHTPNMKQHDVWLECRNKRTRNSSSVWTEGDNEDPGHVGNPWGDDNQFTENTGQGCSQWNLSDSGKVNNDRNPWDSSIDEGNRGMVDSAWKVKANQVATSWKNKEFASDASGVVDNAWRDRLHQGGTASWKTRGFASDARNNSWSRHQQGASNFDHYNRPGNNSYNRNVRHLTDRTRPNIHRNNQDWKYQYRYGKRPKDAEFDNFGR
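The protein sequence above corresponds to the CDS read codon by codon. The sketch below illustrates that relationship on the first 30 and last 15 nucleotides of the CDS, using the standard genetic code; `translate` is a hypothetical helper written for this illustration, and the assumption that the standard code applies to this nuclear gene is ours.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGTAATTGGAGAGAACGGCCTAGGAGA"  # first 30 nt of the CDS
cds_end = "AACTTTGGGAGGTAG"                   # last 15 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MGNWRERPRR) and the last four residues (NFGR) of the protein sequence listed above.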
BLAST of Cla97C01G007920 vs. NCBI nr
Match: XP_011655255.1 (PREDICTED: uncharacterized protein LOC101203639 [Cucumis sativus] >KGN51098.1 hypothetical protein Csa_5G440120 [Cucumis sativus])

HSP 1 Score: 570.1 bits (1468), Expect = 5.0e-159
Identity = 278/338 (82.25%), Positives = 298/338 (88.17%), Query Frame = 0

Query: 1   MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60
           MGNWRERPRRNFR QKPPWSAIRNYDHDP LE+WRDGIPLWEKKFC EIGCVPWGKIVDS
Sbjct: 1   MGNWRERPRRNFRYQKPPWSAIRNYDHDPPLEHWRDGIPLWEKKFCAEIGCVPWGKIVDS 60

Query: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120
           KNFIYCH+NVV WDDSACEEAFHNAKRRYW EINGHQSDI+LPDPDK+IEQIDWSPDMDS
Sbjct: 61  KNFIYCHSNVVKWDDSACEEAFHNAKRRYWAEINGHQSDIHLPDPDKHIEQIDWSPDMDS 120

Query: 121 EMIEELDWAYHTPNMKQHDVWLECRNKRTRNSSSVWTEGDNEDPGHVGNPWGDDNQFTEN 180
           +MIEELDWAY+TPNMKQ D WLEC+NKRTRNS+SVWTEG  E PGH GNPWG DNQ T+ 
Sbjct: 121 KMIEELDWAYYTPNMKQRDDWLECKNKRTRNSNSVWTEGHIEGPGHEGNPWGHDNQLTDK 180

Query: 181 TGQGCSQWNLSDSGKVNNDRNPWDSSIDEGNRGMVDSAWKVKANQVATSWKNKEFASDAS 240
           TGQGC +WN SDSG VNND NPWDSSI++GN+GMVDS WKV+ NQVATSWKNKEFAS+A 
Sbjct: 181 TGQGCRRWNFSDSGNVNNDGNPWDSSINQGNKGMVDSIWKVEKNQVATSWKNKEFASNAR 240

Query: 241 GVVDNAWRDRLHQGGTAXXXXXXXXXXARNNSWSR--HQQGASNFDHYNRPGNNSYNRNV 300
           GVVDNA +D+ HQGGTAXXXXXXXXXX  NNS  R  H  G SNFDHYNRPGN+SY  NV
Sbjct: 241 GVVDNARKDKQHQGGTAXXXXXXXXXXXXNNSSRRQWHLLGNSNFDHYNRPGNSSYIHNV 300

Query: 301 RHLTDRTRPNIHRNNQDWKYQYRYGKRPKDAEFDNFGR 337
           RHL DRT+PNIHRNNQD KYQ  YGKRPKD EFD +GR
Sbjct: 301 RHLPDRTQPNIHRNNQDLKYQ--YGKRPKDTEFDYYGR 336
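
The identity and positives percentages in each HSP header are the match counts divided by the alignment length, which includes gap columns. A minimal sketch of that arithmetic, using the counts reported for the HSP above (`percent` is a hypothetical helper name):

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

identity = percent(278, 338)   # identical positions / alignment length
positives = percent(298, 338)  # identical or conservatively substituted positions
print(identity, positives)
```

The same calculation applies to every HSP on this page; only the counts and the alignment length change.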

BLAST of Cla97C01G007920 vs. NCBI nr
Match: XP_008465788.1 (PREDICTED: uncharacterized protein LOC103503387 [Cucumis melo])

HSP 1 Score: 565.5 bits (1456), Expect = 1.2e-157
Identity = 283/368 (76.90%), Positives = 293/368 (79.62%), Query Frame = 0

Query: 1   MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60
           MGNWRERPRRNFR QKPPWSAIRNYDHDP LE WRDGIPLWEKKFCKEIGCVPWGKI+DS
Sbjct: 1   MGNWRERPRRNFRYQKPPWSAIRNYDHDPPLERWRDGIPLWEKKFCKEIGCVPWGKIIDS 60

Query: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120
           KNFIYCHTNV NWDDSACEEAFHNAKRRYW EINGHQSDIYLPDPDKYIEQIDWSPDMDS
Sbjct: 61  KNFIYCHTNVANWDDSACEEAFHNAKRRYWAEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120

Query: 121 EMIEELDWAYHTPNM------------------------------KQHDVWLECRNKRTR 180
           +MIEELDWAYHTPN+                                   WLEC+NKRTR
Sbjct: 121 KMIEELDWAYHTPNIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWLECKNKRTR 180

Query: 181 NSSSVWTEGDNEDPGHVGNPWGDDNQFTENTGQGCSQWNLSDSGKVNNDRNPWDSSIDEG 240
           +SSSVW EG  E PGH  NPWG DNQ T+NTGQGCSQWN SDSGKVNND NPWDSSI +G
Sbjct: 181 DSSSVWKEGHIEGPGHGANPWGHDNQLTDNTGQGCSQWNFSDSGKVNNDGNPWDSSIYQG 240

Query: 241 NRGMVDSAWKVKANQVATSWKNKEFASDASGVVDNAWRDRLHQGGTAXXXXXXXXXXARN 300
           N+GMVDSAWKVK NQV TSWKNKEFASDA GVV    RD+LHQGGT XXXXXXXXXX   
Sbjct: 241 NKGMVDSAWKVKKNQVVTSWKNKEFASDARGVVXXXXRDKLHQGGTXXXXXXXXXXXXXX 300

Query: 301 NSWSRHQQ--GASNFDHYNRPGNNSYNRNVRHLTDRTRPNIHRNNQDWKYQYRYGKRPKD 337
           NS SRH    G SNFDHYN PGNNSYNRNV HL DRTRPNIH NNQDWKYQYRYGKRPKD
Sbjct: 301 NSSSRHWHLLGDSNFDHYNGPGNNSYNRNVWHLPDRTRPNIHGNNQDWKYQYRYGKRPKD 360

BLAST of Cla97C01G007920 vs. NCBI nr
Match: XP_022993174.1 (uncharacterized protein LOC111489271 [Cucurbita maxima])

HSP 1 Score: 554.7 bits (1428), Expect = 2.2e-154
Identity = 269/336 (80.06%), Positives = 287/336 (85.42%), Query Frame = 0

Query: 1   MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60
           MGNWRERPRRNFRNQKPPWSA+RNYD DP LEYWRDGIPLWEK FCKEIGCVPWGKIVDS
Sbjct: 1   MGNWRERPRRNFRNQKPPWSALRNYDQDPPLEYWRDGIPLWEKTFCKEIGCVPWGKIVDS 60

Query: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120
           KNFIYCHTNVVNWDDSACEEAFHNAKRRYW  INGHQ DI LPDPDKYIEQIDWSP+MD 
Sbjct: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWAAINGHQCDINLPDPDKYIEQIDWSPEMDP 120

Query: 121 EMIEELDWAYHTPNMKQHDVWLECRNKRTRNSSSVWTEGDNEDPGHVGNPWGDDNQFTEN 180
           EMIEELDWAY+ PNMKQHD WLEC+NKRTRNSSSVWTE   EDPGHVGNPW   NQFTE 
Sbjct: 121 EMIEELDWAYYNPNMKQHDDWLECKNKRTRNSSSVWTESHIEDPGHVGNPWEHGNQFTET 180

Query: 181 TGQGCSQWNLSDSGKVNNDRNPWDSSIDEGNRGMVDSAWKVKANQVATSWKNKEFASDAS 240
            GQG SQWNLS+S +VNND N WD+ ID  NRGMVD+AWK K NQV TSW+NK F+ DA 
Sbjct: 181 KGQGWSQWNLSESRQVNNDGNHWDNIIDPKNRGMVDTAWKDKGNQVVTSWQNKGFSRDAR 240

Query: 241 GVVDNAWRDRL-HQGGTAXXXXXXXXXXARNNSWSRHQQGASNFDHYNRPGNNSYNRNVR 300
            +VDNAWR  + HQ G AXXXXXXXXXX   NSWSRHQQ ASNFDHYNRPGN++YN+NVR
Sbjct: 241 SMVDNAWRANVQHQDGAAXXXXXXXXXXXXXNSWSRHQQCASNFDHYNRPGNSNYNQNVR 300

Query: 301 HLTDRTRPNIHRNNQDWKYQYRYGKRPKDAEFDNFG 336
           +L DR  PNIH N QDWK +YRYGKRPKDA+FD  G
Sbjct: 301 NLPDRMPPNIHGNKQDWKCKYRYGKRPKDAQFDYIG 336

BLAST of Cla97C01G007920 vs. NCBI nr
Match: XP_023551570.1 (uncharacterized protein LOC111809355 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 540.4 bits (1391), Expect = 4.3e-150
Identity = 263/336 (78.27%), Positives = 282/336 (83.93%), Query Frame = 0

Query: 1   MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60
           MGNWRERPRRNFRNQKPPWSA+RNYD DP LEYWRDGIPLWEKKFCKEIGCVPWGKIVDS
Sbjct: 1   MGNWRERPRRNFRNQKPPWSALRNYDQDPPLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60

Query: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120
           KNFIYCHTNVVNWDDSACEEAFHNAKRRYW  INGHQ DI LPDPDKYIEQIDWSP+MD 
Sbjct: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWAAINGHQCDINLPDPDKYIEQIDWSPEMDP 120

Query: 121 EMIEELDWAYHTPNMKQHDVWLECRNKRTRNSSSVWTEGDNEDPGHVGNPWGDDNQFTEN 180
           E+IEELDWAY+ PNMKQ D WLEC+NKRTRNSSSVWTE   EDPGHVGNPW   NQFTE 
Sbjct: 121 ELIEELDWAYYNPNMKQQDDWLECKNKRTRNSSSVWTESHIEDPGHVGNPWEHGNQFTET 180

Query: 181 TGQGCSQWNLSDSGKVNNDRNPWDSSIDEGNRGMVDSAWKVKANQVATSWKNKEFASDAS 240
             QG SQWNLS+S +VNND NPWD+ ID  NRGMVD+AWK K NQV T      F+ DA 
Sbjct: 181 KRQGWSQWNLSESRQVNNDGNPWDNIIDPRNRGMVDTAWKDKGNQVVTXXXXXGFSRDAR 240

Query: 241 GVVDNAWRDRL-HQGGTAXXXXXXXXXXARNNSWSRHQQGASNFDHYNRPGNNSYNRNVR 300
            +VDNAWR  + +Q G  XXXXXXXXXX   NSWSRHQQGASNFDHYNRPGN++YNRNVR
Sbjct: 241 SMVDNAWRANVQYQDGAXXXXXXXXXXXXXXNSWSRHQQGASNFDHYNRPGNSNYNRNVR 300

Query: 301 HLTDRTRPNIHRNNQDWKYQYRYGKRPKDAEFDNFG 336
           +L DR   NIH N Q+WKY+YRYGKRPKDA+FD  G
Sbjct: 301 NLPDRMPSNIHGNKQEWKYKYRYGKRPKDAQFDYIG 336

BLAST of Cla97C01G007920 vs. NCBI nr
Match: XP_022939224.1 (uncharacterized protein LOC111445202 [Cucurbita moschata])

HSP 1 Score: 535.0 bits (1377), Expect = 1.8e-148
Identity = 261/336 (77.68%), Positives = 279/336 (83.04%), Query Frame = 0

Query: 1   MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60
           MGNWRERPRRNFRNQKPPWSA RNYD DP LEYWRDGIPLWEK FCKEIGCVPWGKIVDS
Sbjct: 1   MGNWRERPRRNFRNQKPPWSASRNYDQDPPLEYWRDGIPLWEKTFCKEIGCVPWGKIVDS 60

Query: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120
           KNFIYCHTNVVNWDDSACE AFHNAKRRYW  INGHQ DI LPDPDKYIEQIDWSP+MD 
Sbjct: 61  KNFIYCHTNVVNWDDSACEAAFHNAKRRYWAAINGHQCDINLPDPDKYIEQIDWSPEMDP 120

Query: 121 EMIEELDWAYHTPNMKQHDVWLECRNKRTRNSSSVWTEGDNEDPGHVGNPWGDDNQFTEN 180
           E+IEELDWAY+ PNMKQ D WLEC+NKRTRNSSSVWTE   EDPGHVGNPW   NQFTE 
Sbjct: 121 ELIEELDWAYYNPNMKQQDDWLECKNKRTRNSSSVWTESHIEDPGHVGNPWEHGNQFTET 180

Query: 181 TGQGCSQWNLSDSGKVNNDRNPWDSSIDEGNRGMVDSAWKVKANQVATSWKNKEFASDAS 240
             QG SQWNLS+S +VN D NPWD+ ID  NRGMVD+AWK K NQV T      F+ DA 
Sbjct: 181 KRQGWSQWNLSESRQVNKDGNPWDNIIDPRNRGMVDTAWKDKGNQVVTXXXXXGFSRDAR 240

Query: 241 GVVDNAWRDRL-HQGGTAXXXXXXXXXXARNNSWSRHQQGASNFDHYNRPGNNSYNRNVR 300
            +VDNAWR  + +Q G  XXXXXXXXXX   NSWSRHQQGASNFDHYNRPGN++YNRNVR
Sbjct: 241 SMVDNAWRANVQYQDGAXXXXXXXXXXXXXXNSWSRHQQGASNFDHYNRPGNSNYNRNVR 300

Query: 301 HLTDRTRPNIHRNNQDWKYQYRYGKRPKDAEFDNFG 336
           +L DR  PNIH N Q+WKY+YRYGKRPKDA+FD  G
Sbjct: 301 NLPDRMPPNIHGNKQEWKYKYRYGKRPKDAQFDYIG 336

BLAST of Cla97C01G007920 vs. TrEMBL
Match: tr|A0A0A0KNE4|A0A0A0KNE4_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G440120 PE=4 SV=1)

HSP 1 Score: 570.1 bits (1468), Expect = 3.3e-159
Identity = 278/338 (82.25%), Positives = 298/338 (88.17%), Query Frame = 0

Query: 1   MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60
           MGNWRERPRRNFR QKPPWSAIRNYDHDP LE+WRDGIPLWEKKFC EIGCVPWGKIVDS
Sbjct: 1   MGNWRERPRRNFRYQKPPWSAIRNYDHDPPLEHWRDGIPLWEKKFCAEIGCVPWGKIVDS 60

Query: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120
           KNFIYCH+NVV WDDSACEEAFHNAKRRYW EINGHQSDI+LPDPDK+IEQIDWSPDMDS
Sbjct: 61  KNFIYCHSNVVKWDDSACEEAFHNAKRRYWAEINGHQSDIHLPDPDKHIEQIDWSPDMDS 120

Query: 121 EMIEELDWAYHTPNMKQHDVWLECRNKRTRNSSSVWTEGDNEDPGHVGNPWGDDNQFTEN 180
           +MIEELDWAY+TPNMKQ D WLEC+NKRTRNS+SVWTEG  E PGH GNPWG DNQ T+ 
Sbjct: 121 KMIEELDWAYYTPNMKQRDDWLECKNKRTRNSNSVWTEGHIEGPGHEGNPWGHDNQLTDK 180

Query: 181 TGQGCSQWNLSDSGKVNNDRNPWDSSIDEGNRGMVDSAWKVKANQVATSWKNKEFASDAS 240
           TGQGC +WN SDSG VNND NPWDSSI++GN+GMVDS WKV+ NQVATSWKNKEFAS+A 
Sbjct: 181 TGQGCRRWNFSDSGNVNNDGNPWDSSINQGNKGMVDSIWKVEKNQVATSWKNKEFASNAR 240

Query: 241 GVVDNAWRDRLHQGGTAXXXXXXXXXXARNNSWSR--HQQGASNFDHYNRPGNNSYNRNV 300
           GVVDNA +D+ HQGGTAXXXXXXXXXX  NNS  R  H  G SNFDHYNRPGN+SY  NV
Sbjct: 241 GVVDNARKDKQHQGGTAXXXXXXXXXXXXNNSSRRQWHLLGNSNFDHYNRPGNSSYIHNV 300

Query: 301 RHLTDRTRPNIHRNNQDWKYQYRYGKRPKDAEFDNFGR 337
           RHL DRT+PNIHRNNQD KYQ  YGKRPKD EFD +GR
Sbjct: 301 RHLPDRTQPNIHRNNQDLKYQ--YGKRPKDTEFDYYGR 336

BLAST of Cla97C01G007920 vs. TrEMBL
Match: tr|A0A1S3CQ16|A0A1S3CQ16_CUCME (uncharacterized protein LOC103503387 OS=Cucumis melo OX=3656 GN=LOC103503387 PE=4 SV=1)

HSP 1 Score: 565.5 bits (1456), Expect = 8.2e-158
Identity = 283/368 (76.90%), Positives = 293/368 (79.62%), Query Frame = 0

Query: 1   MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60
           MGNWRERPRRNFR QKPPWSAIRNYDHDP LE WRDGIPLWEKKFCKEIGCVPWGKI+DS
Sbjct: 1   MGNWRERPRRNFRYQKPPWSAIRNYDHDPPLERWRDGIPLWEKKFCKEIGCVPWGKIIDS 60

Query: 61  KNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120
           KNFIYCHTNV NWDDSACEEAFHNAKRRYW EINGHQSDIYLPDPDKYIEQIDWSPDMDS
Sbjct: 61  KNFIYCHTNVANWDDSACEEAFHNAKRRYWAEINGHQSDIYLPDPDKYIEQIDWSPDMDS 120

Query: 121 EMIEELDWAYHTPNM------------------------------KQHDVWLECRNKRTR 180
           +MIEELDWAYHTPN+                                   WLEC+NKRTR
Sbjct: 121 KMIEELDWAYHTPNIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWLECKNKRTR 180

Query: 181 NSSSVWTEGDNEDPGHVGNPWGDDNQFTENTGQGCSQWNLSDSGKVNNDRNPWDSSIDEG 240
           +SSSVW EG  E PGH  NPWG DNQ T+NTGQGCSQWN SDSGKVNND NPWDSSI +G
Sbjct: 181 DSSSVWKEGHIEGPGHGANPWGHDNQLTDNTGQGCSQWNFSDSGKVNNDGNPWDSSIYQG 240

Query: 241 NRGMVDSAWKVKANQVATSWKNKEFASDASGVVDNAWRDRLHQGGTAXXXXXXXXXXARN 300
           N+GMVDSAWKVK NQV TSWKNKEFASDA GVV    RD+LHQGGT XXXXXXXXXX   
Sbjct: 241 NKGMVDSAWKVKKNQVVTSWKNKEFASDARGVVXXXXRDKLHQGGTXXXXXXXXXXXXXX 300

Query: 301 NSWSRHQQ--GASNFDHYNRPGNNSYNRNVRHLTDRTRPNIHRNNQDWKYQYRYGKRPKD 337
           NS SRH    G SNFDHYN PGNNSYNRNV HL DRTRPNIH NNQDWKYQYRYGKRPKD
Sbjct: 301 NSSSRHWHLLGDSNFDHYNGPGNNSYNRNVWHLPDRTRPNIHGNNQDWKYQYRYGKRPKD 360

BLAST of Cla97C01G007920 vs. TrEMBL
Match: tr|A0A2I4EDK6|A0A2I4EDK6_9ROSI (uncharacterized protein LOC108988620 isoform X1 OS=Juglans regia OX=51240 GN=LOC108988620 PE=4 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 1.9e-50
Identity = 108/227 (47.58%), Positives = 143/227 (63.00%), Query Frame = 0

Query: 1   MGNWRER-PRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVD 60
           MGNWR R  RR FR+ + PWS  R YDH    E W+DG+PLWEKKFC  IG +PWGKI+D
Sbjct: 1   MGNWRNRSSRRIFRHHRSPWSPPREYDHYIPPE-WKDGVPLWEKKFCTLIGSIPWGKILD 60

Query: 61  SKNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMD 120
           +KNF+YCH NVV+WDDSA EEAF NAK+R+W +ING   DI LPDPD YIE+IDW P++D
Sbjct: 61  TKNFMYCHNNVVSWDDSAGEEAFQNAKKRFWAKINGLCCDISLPDPDIYIEEIDWKPNID 120

Query: 121 SEMIEELDWAYHTPNMKQHDVWLECRNKRTRNSSSVWTEGDNEDPGHVGNPWGDDNQ--- 180
            E+I+ELD     P+ ++ +  +   N+ ++ +  V +EG N +  +  NPW  DN    
Sbjct: 121 PELIKELDQYCFVPDEEELNTQVRHTNRNSKTAVPVPSEGHNTNQEY-DNPWESDNMQDS 180

Query: 181 -FTENTGQGCSQW--NLSDSGKVNNDRNPWDSSIDEGNRGMVDSAWK 221
              EN  QG +QW    +D   +NN              GM+D+AW+
Sbjct: 181 GVLENRAQGWNQWENKKNDRKTLNNXXXXXXXXXXXXXXGMMDNAWR 225

BLAST of Cla97C01G007920 vs. TrEMBL
Match: tr|A0A1Q3BNW7|A0A1Q3BNW7_CEPFO (PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-containing protein/PPR_3 domain-containing protein (Fragment) OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_12904 PE=4 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 2.7e-44
Identity = 94/197 (47.72%), Positives = 130/197 (65.99%), Query Frame = 0

Query: 29  PSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDSKNFIYCHTNVVNWDDSACEEAFHNAKRR 88
           PS  +  DG+P WEKKFC  IG  PW K+VD+KN++YC+ NVVNWDDSA EEAF NAK+R
Sbjct: 1   PSTGFSEDGVPSWEKKFCALIGSTPWRKVVDAKNYMYCYGNVVNWDDSAGEEAFQNAKKR 60

Query: 89  YWVEINGHQSDIYLPDPDKYIEQIDWSPDMDSEMIEELDWAYHTPNMKQHDVWLECRNKR 148
           +W +ING   DI LPDPD YI++I+W+PD+D ++I++++ AY  P+   ++V   C  K+
Sbjct: 61  FWAKINGRLFDISLPDPDLYIDEINWNPDIDDKLIKDIEQAYFAPDEGDNEVKAWCNYKK 120

Query: 149 TRNSSSVWTEGDNEDPGHVGNPWGDDNQFTEN------TGQGCSQWNLS--DSGKVNNDR 208
           TRN   V ++  N++PG+  NPW + N  TE       T QG +QW+ S  DS  +N   
Sbjct: 121 TRNLVPVPSDLYNKNPGNDDNPW-EYNSNTEGTEPWKITSQGWNQWDNSANDSINLNKSD 180

Query: 209 NPWDSSIDEGNRGMVDS 218
           NPW+ SI  GN  +  S
Sbjct: 181 NPWEHSITLGNEAVKKS 196

BLAST of Cla97C01G007920 vs. TrEMBL
Match: tr|K7KK89|K7KK89_SOYBN (Uncharacterized protein OS=Glycine max OX=3847 GN=100797066 PE=4 SV=1)

HSP 1 Score: 186.8 bits (473), Expect = 7.9e-44
Identity = 102/223 (45.74%), Positives = 138/223 (61.88%), Query Frame = 0

Query: 1   MGNWRERP-RRNFRNQKPPWSAIRNYD-HDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIV 60
           MG W  RP RR FR ++ P      YD + P  EYW+DGIPLWEKK+C  +G VPW KIV
Sbjct: 1   MGKWDHRPSRRFFRRRRSPVRPPTFYDINAPLPEYWQDGIPLWEKKYCTIVGLVPWQKIV 60

Query: 61  DSKNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDM 120
           DSK F+YCH+NV +W+DSA EEA  NAK  YW +IN    DI LPDPD Y +QIDW+P +
Sbjct: 61  DSKMFVYCHSNVFDWNDSAAEEALQNAKNHYWAKINSLPCDISLPDPDTYNDQIDWNPYI 120

Query: 121 DSEMIEELDWAYHT-PNMKQHDVWLECRNKRTRNSSSVWTEGDNEDPGHVGNPWGDDNQF 180
           D +MI+E+D A+ T P+ +Q       +NKRT+      T  ++E+P    +     ++ 
Sbjct: 121 DPDMIKEIDKAFFTVPDEEQETA---IKNKRTK------TSVNDENPLECSDT--PLSRA 180

Query: 181 TENTGQGCSQWNLSDSGKVNNDRNPWDSSIDEGNRGMVDSAWK 221
            EN      +WN  +SG V+N  NPW+ S+  GN  + D+AW+
Sbjct: 181 LEN--NEVQRWNQGNSGDVDNTDNPWECSVTHGNGRLTDNAWE 210

BLAST of Cla97C01G007920 vs. TAIR10
Match: AT3G51940.1 (unknown protein)

HSP 1 Score: 135.6 bits (340), Expect = 5.7e-32
Identity = 90/262 (34.35%), Positives = 130/262 (49.62%), Query Frame = 0

Query: 1   MGNWRERPRRNFRNQKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKIVDS 60
           MG W  R R + R     W     Y    S     DGIP+WEK+FC+ IG VPW K+V++
Sbjct: 1   MGKWNHRSRHHRRRSPERW-----YSGRQSSSSSDDGIPVWEKRFCEVIGSVPWQKVVEA 60

Query: 61  KNF-IYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPDMD 120
           K+F  + + NV+ WDDSACE+ FHN K+R+W ++NG   D+ +PDPD YI ++DW   +D
Sbjct: 61  KDFKSWYNGNVITWDDSACEDTFHNEKKRFWSQVNGLHCDVSIPDPDLYISEVDWDTFVD 120

Query: 121 SEMIEELDWAYHTPNMKQHDVWLECRNKRTRNSSSVWTEGDN-EDPGHVGNPWGDDNQFT 180
            E+I +L+ AY  P     DV      KR R   + W+  D   +   +  PW + +   
Sbjct: 121 PELIRDLEKAYFAP---PDDV--NIGFKRGRGDKN-WSGCDTVPEARMLETPWKNSDDII 180

Query: 181 ENTGQGCSQWNLSDSGK-------VNNDRNPWDSSIDEGNRGMVDSAWKVKANQVATSWK 240
           E TG+  S WNL++          VN   N   S          ++ W  K ++V  SW 
Sbjct: 181 E-TGKKSSGWNLTEGSSWEAKPCCVNEKANDTASGGCLTTEEWRENQWIAK-DRVNDSW- 240

Query: 241 NKEFASDASGVVDNAWRDRLHQ 254
             E++       D+ W    HQ
Sbjct: 241 --EYSGQGK---DDGWDKSGHQ 243

BLAST of Cla97C01G007920 vs. TAIR10
Match: AT5G03990.1 (unknown protein)

HSP 1 Score: 125.6 bits (314), Expect = 5.9e-29
Identity = 79/217 (36.41%), Positives = 108/217 (49.77%), Query Frame = 0

Query: 1   MGNW-RERPRRNFRN--QKPPWSAIRNYDHDPSLEYWRDGIPLWEKKFCKEIGCVPWGKI 60
           M NW R++PR N  N   +   + +      P L   +  +P WEK FC  IG VPW K+
Sbjct: 1   MSNWRRQKPRNNNSNNYSRQRGTTMTQSSSKPPLANCKISVPAWEKDFCAVIGSVPWWKV 60

Query: 61  VDSKNFIYCHTNVVNWDDSACEEAFHNAKRRYWVEINGHQSDIYLPDPDKYIEQIDWSPD 120
           V++K F++ +  VV WDDSA E+AF NAK R+W EING   D+ LPDPD YI+ +DW  +
Sbjct: 61  VEAKRFMHIYDRVVQWDDSAGEDAFKNAKSRFWAEINGLTCDLSLPDPDVYIDDVDWDAE 120

Query: 121 MDSEMIEELDWAYHTPNMKQ-HDVWLECRNKRTRNSSSVW------TEGDNED---PGHV 180
           +D+E+I +L+        +Q H V L+      +     W       EG NE+    G  
Sbjct: 121 VDNELILDLERGPDPLTEEQEHVVILDALVLSGQYGGLGWGTXXXDAEGINEENVGKGKP 180

Query: 181 GNPWGDDNQFTENTGQGCSQWNLSDSGKVNNDRNPWD 205
            N WGD         Q    WN  DS  +      WD
Sbjct: 181 ENSWGD---------QKYDGWN-EDSWGIKEKTETWD 207

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_011655255.1  5.0e-159  82.25         PREDICTED: uncharacterized protein LOC101203639 [Cucumis sativus] >KGN51098.1 hy...
XP_008465788.1  1.2e-157  76.90         PREDICTED: uncharacterized protein LOC103503387 [Cucumis melo]
XP_022993174.1  2.2e-154  80.06         uncharacterized protein LOC111489271 [Cucurbita maxima]
XP_023551570.1  4.3e-150  78.27         uncharacterized protein LOC111809355 [Cucurbita pepo subsp. pepo]
XP_022939224.1  1.8e-148  77.68         uncharacterized protein LOC111445202 [Cucurbita moschata]

Match Name                      E-value   Identity (%)  Description
tr|A0A0A0KNE4|A0A0A0KNE4_CUCSA  3.3e-159  82.25         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G440120 PE=4 SV=1
tr|A0A1S3CQ16|A0A1S3CQ16_CUCME  8.2e-158  76.90         uncharacterized protein LOC103503387 OS=Cucumis melo OX=3656 GN=LOC103503387 PE=...
tr|A0A2I4EDK6|A0A2I4EDK6_9ROSI  1.9e-50   47.58         uncharacterized protein LOC108988620 isoform X1 OS=Juglans regia OX=51240 GN=LOC...
tr|A0A1Q3BNW7|A0A1Q3BNW7_CEPFO  2.7e-44   47.72         PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-conta...
tr|K7KK89|K7KK89_SOYBN          7.9e-44   45.74         Uncharacterized protein OS=Glycine max OX=3847 GN=100797066 PE=4 SV=1

Match Name   E-value  Identity (%)  Description
AT3G51940.1  5.7e-32  34.35         unknown protein
AT5G03990.1  5.9e-29  36.41         unknown protein

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
biological_process  GO:0045493      xylan catabolic process
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0016798      hydrolase activity, acting on glycosyl bonds
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C01G007920.1  Cla97C01G007920.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 269..298
None      No IPR available  PANTHER      PTHR34567    FAMILY NOT NAMED     coord: 1..333