CsaV3_4G031590.1 (mRNA) Cucumber (Chinese Long) v3

NameCsaV3_4G031590.1
TypemRNA
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
Descriptionmicrofibrillar-associated protein 1
Locationchr4 : 22081298 .. 22084925 (-)
Sequence length1302
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATTTCAAAATAATTTTCTCTCGTTGTCAACACTACAGTTTAGGTCGATAAAAAAGTAAAAAAAAAAAGAAGAACGAAAATAGAAAAATACTTTCCTCTCCTCTACCCCTTTACCTGATTCTTAAAAAATCTCACAAATTATCGATTTCAGAGGATCCGGACTGCTTTCATTACTTGCCTCCGCCATTTCTCCCTCCCACCATTTTCTTCGTTGCTCGGATCGGATTTCTTCAACTCTGATACTCCCTTATACTCAACTTCAGACCAAAACTTTGAAGGTATTTCTAACTTTAGACTATTTCCGCCTGAAATTTCACCATACTTTTTATCCTTCGCCCTCTTTGTGGTTTCCAACGCGCGTTTAATTGAGAAATGCTATGGGCTTTCATCCCTGTTCCTGTTTCGAGTTTTCCTTTATCACAAAGGGAGGTTGAGTTTTCGTTGTGATTGTATAGGGTTTTGCTTAATTTGTGTTGAAGCTTATTTTAACTGTTTGGACAACTTTTCAATTTACATTTTGTACTCAAAAAAAGAGATAAAGAACCCTGTTGATCTTCCCCTTTCTTTTTCTTTCTTCCGAGAAGTCCTCACCATGAAGTTTTCATTAATTTTCATCTCCAATAGCACACCGGCAACCCCTAAACACCATGCTATGACGGTTTTTAATTTTTTTTCCTTCTGATTTATCGGTTCTTGAATCGCGAAGTTGCGATTCATGACGGTTCGTGCGATTAAATTTGAATCGTACGGTACGTCTAATTTCCGATCCAATTCTACGAGTGCTATCGTTTTCCCTTATTTTCCTTTCGAATCATACGATTTTAACGACATTGTCATAATATAGGGAGCTTGGAGACTCTATGTTTAACTCTGCACTTATTTCATTATTGGGTTTCAGGTTTAGCTTCATCGAATTGTGAATTTTTAAGTTGAAAAGGAGAAAATTCTTTTTTTCTTTTGGTGTATCTTTTGCTTCTGGAAATTTAGCCGATTAACAATGTCGGTCACGGCGGGGGTTAGTGATACTGTAATTGCTGTTAGAGACAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCCGAGTGGGCGGATGATGCCGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCTCTTGAGAAAGCATTTCCTAGGCAGGAAGATTCAGATATATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTGGCTGAGAGTAGGATAGATAATCGGGAGGAGATTAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACTCGGAGGCAGGAGGGATTAGATGCAGAGGAAGAGGATGAGGATGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTTCCTGAAGAAGAAGAGGAGGAGGAGCCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGACTCTGAAGATGAACCTACTGGTATAGCAATGGTGAAGCCAATCTTTGTTCCTAAATCAGAGAGAGAAACTATTGCTGAACGTGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAAGAATTGAGAAAAAGGAGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCCAATATTGCAGATGTAGATACTGATGATGAAATAAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGAGAGATTGCTAGGATCAAGAGGGATAGAGAACTTCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACCGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCGAAACCTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCATTCTTCCAGGAAGATGCTGATGATAATGCTGGAACTGCTGGATCTGATAATATTTTCCATCGTGATTTCTCTTCCCCAACTGGAGAAGATAAAATGGACAAGACAATATTGCCGAAGGTTATGCAGGTCAAGCATTTTGGGCGCAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCGTAAGTTGATTTAACTTCGTCTGCTTTCTGCTTCTCAGTTTCCTATAGTAAATTATTGCAATATCAATTTTGGTTTCTTAATTGAGTTTATATCTAGATTAAGTACAATACGATCTATTACAAAAGCTTCTGATCCTTTGTCTACTTATGTTTGAGTGCATGTTAAGTAGAGAATGATTGGAAATTTTTTTAGTATGAGCTTGAGGTATGGACGTGTTGAGCAGCCTGAGTTTAATATCAATATTGAGGCTTTTAGTTTGTTTATATTTCTAGAAGAGAATATAAAGATATGATTAGAACAATGCCCTATTTGCAATCTTCTCTATTTGTTTGTTTTTTTGCCTTCATTCTACTGTTTGTAGTGTGTTTTTCATGTCGAAGTTATTCTGCTGGTTGACTAACGTCAGTTGTTACTGTCATGCAGTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCTGGAATGAATGCACCAATTACAAAACCTAAAGGGAGCAAGAAGTTGAAGGATTGGGAATCTCGTTGAGTTTATAAAGCACGGTTAGAAAGCTTTTCTAGTTATCCATTTTCATTGTTCAAATTAGAAAAAAAATGCTTGCATAATGGGTTACCAAGATAGTTGGGCACACAGTTTTCCTTCTCTTTCAAACTAGAGTTGTAACTGGATTTTGTATGTTGATGTAATATAATTACTCATTAGTTATCATGTTCCTCGTTCCCTCTTACTGTAGCTTTTTGGTTGGTCTTTCTCGTTATTTACCAAATATGTTTTCTCCTGTGTTTGTAGAGTTCTTATGCCCGATTATACCTTGGTTCTTTTTTTCTTTTTCTTTTTTTTTTGAAAAAGAAACCATTTCATTAATTGATGAAATGAGGGAGTAAAAACCCCAACACCTATTGGTGATTTCAACAGTGCTCTTCAATTGGAGATTAAATATGATAAAGCTTTAATGGACAATAAGTTGTTTAAATTTGCGCCCCCCAGAAAAAAAGAAAAAAAAACACAGTAGAGAGTACCAATTCCATGAAGTAATGTAAGGAAAAGTTCTTGAAACAGCGACCATTTTGCTCACTCCAAGATTTCTACAAAAAAAAAAAGAAAAAACCTATTTGCCAAGACTGACGGACGACGCCACAGAAATGTTTCAATTACACTCTTCCTAGAGGGTTTGGCTTTTCTCTCACCTATCAGAAAGAACTGGTTTAAATCTTCAATCTTCATGGTTCATGGAATTAGGGAAATGGGGTCACATGGAAACTGCTAAATAAATTGATATGCATAGCCTCTCTATAGCGTTGTGGAGATTATGATCTATGTTTTTTGTTTACAACATCAAAGTGACAGACCTCAGTCTGATGGTGAGATGTATTCTCTTTTTTATCTCAGAATCAAAATCTTGTGTGCAATGTCTCTAAATGCCCAAGACGC

mRNA sequence

ATGTCGGTCACGGCGGGGGTTAGTGATACTGTAATTGCTGTTAGAGACAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCCGAGTGGGCGGATGATGCCGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCTCTTGAGAAAGCATTTCCTAGGCAGGAAGATTCAGATATATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTGGCTGAGAGTAGGATAGATAATCGGGAGGAGATTAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACTCGGAGGCAGGAGGGATTAGATGCAGAGGAAGAGGATGAGGATGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTTCCTGAAGAAGAAGAGGAGGAGGAGCCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGACTCTGAAGATGAACCTACTGGTATAGCAATGGTGAAGCCAATCTTTGTTCCTAAATCAGAGAGAGAAACTATTGCTGAACGTGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAAGAATTGAGAAAAAGGAGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCCAATATTGCAGATGTAGATACTGATGATGAAATAAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGAGAGATTGCTAGGATCAAGAGGGATAGAGAACTTCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACCGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCGAAACCTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCATTCTTCCAGGAAGATGCTGATGATAATGCTGGAACTGCTGGATCTGATAATATTTTCCATCGTGATTTCTCTTCCCCAACTGGAGAAGATAAAATGGACAAGACAATATTGCCGAAGGTTATGCAGGTCAAGCATTTTGGGCGCAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCTGGAATGAATGCACCAATTACAAAACCTAAAGGGAGCAAGAAGTTGAAGGATTGGGAATCTCGTTGA

Coding sequence (CDS)

ATGTCGGTCACGGCGGGGGTTAGTGATACTGTAATTGCTGTTAGAGACAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCCGAGTGGGCGGATGATGCCGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCTCTTGAGAAAGCATTTCCTAGGCAGGAAGATTCAGATATATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTGGCTGAGAGTAGGATAGATAATCGGGAGGAGATTAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACTCGGAGGCAGGAGGGATTAGATGCAGAGGAAGAGGATGAGGATGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTTCCTGAAGAAGAAGAGGAGGAGGAGCCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGACTCTGAAGATGAACCTACTGGTATAGCAATGGTGAAGCCAATCTTTGTTCCTAAATCAGAGAGAGAAACTATTGCTGAACGTGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAAGAATTGAGAAAAAGGAGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCCAATATTGCAGATGTAGATACTGATGATGAAATAAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGAGAGATTGCTAGGATCAAGAGGGATAGAGAACTTCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACCGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCGAAACCTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCATTCTTCCAGGAAGATGCTGATGATAATGCTGGAACTGCTGGATCTGATAATATTTTCCATCGTGATTTCTCTTCCCCAACTGGAGAAGATAAAATGGACAAGACAATATTGCCGAAGGTTATGCAGGTCAAGCATTTTGGGCGCAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCTGGAATGAATGCACCAATTACAAAACCTAAAGGGAGCAAGAAGTTGAAGGATTGGGAATCTCGTTGA

Protein sequence

MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAFPRQEDSDISRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRRQEGLDAEEEDEDALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGIAMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERREWERKNPKPAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITKPKGSKKLKDWESR
BLAST of CsaV3_4G031590.1 vs. NCBI nr
Match: XP_011653945.1 (PREDICTED: microfibrillar-associated protein 1 [Cucumis sativus] >KGN54899.1 hypothetical protein Csa_4G578870 [Cucumis sativus])

HSP 1 Score: 557.8 bits (1436), Expect = 3.3e-155
Identity = 433/433 (100.00%), Postives = 433/433 (100.00%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX
Sbjct: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX
Sbjct: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of CsaV3_4G031590.1 vs. NCBI nr
Match: XP_023543457.1 (microfibrillar-associated protein 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 540.4 bits (1391), Expect = 5.5e-150
Identity = 384/433 (88.68%), Postives = 390/433 (90.07%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRM+R AALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMSRVAALEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           P QEDSD+S                           IRQAEIV  IEEETRRQEG+D XX
Sbjct: 61  PSQEDSDLSKKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVLNIEEETRRQEGIDAXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KPIFVPKSERETI XXXXXXXXXXXXXXXXXXXXXX   ETKHIVVEEIRK XXXXXXXX
Sbjct: 181 KPIFVPKSERETIAXXXXXXXXXXXXXXXXXXXXXXRKAETKHIVVEEIRKDXXXXXXXX 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           X   IADVDTDDEINEAEEYEAWKVREI+RIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 XEANIADVDTDDEINEAEEYEAWKVREISRIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           EWERKNPK XXXX QKWKFMQKY+HKGAFFQEDADDNAGTAGSD+IFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKPXXXXKQKWKFMQKYFHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of CsaV3_4G031590.1 vs. NCBI nr
Match: XP_022950647.1 (microfibrillar-associated protein 1-like [Cucurbita moschata])

HSP 1 Score: 540.0 bits (1390), Expect = 7.2e-150
Identity = 384/433 (88.68%), Postives = 389/433 (89.84%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRM+R AALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMSRVAALEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           P QEDSD+S                           IRQAEIV  IEEETRRQEG+D XX
Sbjct: 61  PSQEDSDLSKKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVLNIEEETRRQEGIDAXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KPIFVPKSERETI XXXXXXXXXXXXXXXXXXXXXX   ETKHIVVEEIRK XXXXXXXX
Sbjct: 181 KPIFVPKSERETIAXXXXXXXXXXXXXXXXXXXXXXRKAETKHIVVEEIRKDXXXXXXXX 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           X   IADVDTDDEINEAEEYEAWKVREI+RIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 XEANIADVDTDDEINEAEEYEAWKVREISRIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           EWERKNPK XXXX QKWKFMQKY+HKGAFFQEDADDNAGTAGSD IFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKPXXXXKQKWKFMQKYFHKGAFFQEDADDNAGTAGSDTIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of CsaV3_4G031590.1 vs. NCBI nr
Match: XP_022978182.1 (microfibrillar-associated protein 1-like [Cucurbita maxima])

HSP 1 Score: 534.6 bits (1376), Expect = 3.0e-148
Identity = 407/433 (94.00%), Postives = 414/433 (95.61%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRM+R AALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMSRVAALEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           P +EDSD+S XXXXXXXXXXXXXXXXXXXXXXXXXX  QAEIV  IEEETRRQEG+D XX
Sbjct: 61  PSREDSDLSKXXXXXXXXXXXXXXXXXXXXXXXXXXXXQAEIVLNIEEETRRQEGIDAXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KPIFVPKSERETI XXXXXXXXXXXXXXXXXXXXXX   ETKHIVVEEIRK XXXXXXXX
Sbjct: 181 KPIFVPKSERETIAXXXXXXXXXXXXXXXXXXXXXXRKAETKHIVVEEIRKDXXXXXXXX 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           X   IADVDTDDEINEAEEYEAWKVREI+RIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 XEANIADVDTDDEINEAEEYEAWKVREISRIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           EWERKNPK XXXX QKWKFMQKY+HKGAFFQEDADDNAGTAGSD+IFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKPXXXXKQKWKFMQKYFHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of CsaV3_4G031590.1 vs. NCBI nr
Match: XP_008442034.1 (PREDICTED: microfibrillar-associated protein 1 [Cucumis melo] >XP_008442035.1 PREDICTED: microfibrillar-associated protein 1 [Cucumis melo])

HSP 1 Score: 531.6 bits (1368), Expect = 2.6e-147
Identity = 414/433 (95.61%), Postives = 416/433 (96.07%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           P QEDSD+S    XXXXXXXXXXXXXXXXXXXXXX IRQAEIVSTIEE         XXX
Sbjct: 61  PSQEDSDLSRKDDXXXXXXXXXXXXXXXXXXXXXXRIRQAEIVSTIEEXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX TGIAMV
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPTGIAMV 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRK XXXXXXXX
Sbjct: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKDXXXXXXXX 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSD+IFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of CsaV3_4G031590.1 vs. TAIR10
Match: AT5G17900.1 (microfibrillar-associated protein-related)

HSP 1 Score: 383.3 bits (983), Expect = 2.0e-106
Identity = 285/449 (63.47%), Postives = 329/449 (73.27%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVS++ IAVR+KL+G IGQTKV+RYWPGKAPEWA++A+ED D+RM + + L++AF
Sbjct: 1   MSVTAGVSESAIAVREKLKGGIGQTKVRRYWPGKAPEWAEEAEEDDDVRMQKVSVLDRAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIV-STIEEETRRQEGLDXX 120
           P+ +D  ++                           IRQAEI+               XX
Sbjct: 61  PKNDDLGVARKDDPRLRRLAKTKVENRDEVRADHRRIRQAEIIXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAM 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        GIAM
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTDSEDDMPGIAM 180

Query: 181 VKPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXE---------------TKHI 240
           +KP+FVPK+ER+TI      XXXXXXXXXXXXXXXXXXXX                 K+I
Sbjct: 181 IKPVFVPKAERDTIAERERLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIRKNI 240

Query: 241 VVEEIRKXXXXXXXXXXXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKE 300
           ++EE                I DV+TDDE+NEAEEYE WK REI RIKR+R+ R+AML+E
Sbjct: 241 LLEEAN--------------IGDVETDDELNEAEEYEVWKTREIGRIKRERDAREAMLRE 300

Query: 301 REEIEKVRNMTEEERREWERKNPK-XXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGS 360
           REEIEK+RNMTE+ERR+WERKNPK       +KW FMQKYYHKGAFFQ D DD AG+AG+
Sbjct: 301 REEIEKLRNMTEQERRDWERKNPKPLSAQPKKKWNFMQKYYHKGAFFQADPDDEAGSAGT 360

Query: 361 DNIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPL 420
           D IF RDFS+PTGED++DK+ILPKVMQVKHFGRSGRTKWTHLVNEDTTDW+NPWT NDPL
Sbjct: 361 DGIFQRDFSAPTGEDRLDKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWSNPWTSNDPL 420

Query: 421 RAKYNAKMAGMNAPITKPKGSKKLKDWES 433
           R KYN KMAGM+API KPKGSKK+KDWES
Sbjct: 421 REKYNKKMAGMDAPIAKPKGSKKMKDWES 435

BLAST of CsaV3_4G031590.1 vs. TAIR10
Match: AT4G08580.1 (microfibrillar-associated protein-related)

HSP 1 Score: 378.6 bits (971), Expect = 5.0e-105
Identity = 282/449 (62.81%), Postives = 327/449 (72.83%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVS++ IAVR+KL+G IGQTKV+RYWPGKAPEWA++A+ED D+RM + + L++AF
Sbjct: 1   MSVTAGVSESAIAVREKLKGGIGQTKVRRYWPGKAPEWAEEAEEDDDVRMQKFSVLDRAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEI-VSTIEEETRRQEGLDXX 120
           P+ +D  ++                           IRQAEI                XX
Sbjct: 61  PKNDDLGVARKDDPRLRRLAQTKVENRDEVRADHRRIRQAEIXXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAM 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX         GIA+
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXETDSEDDMPGIAL 180

Query: 181 VKPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXE---------------TKHI 240
           +KP+FVPK+ER+TI      XXXXXXXXXXXXXXXXXXXX                 K+I
Sbjct: 181 IKPVFVPKAERDTIAERERLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIRKNI 240

Query: 241 VVEEIRKXXXXXXXXXXXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKE 300
           ++EE                I DV+TDDE+NEAEEYE WK REI RIKR+R+ R+AML+E
Sbjct: 241 LLEEAN--------------IGDVETDDELNEAEEYEVWKTREIGRIKRERDAREAMLRE 300

Query: 301 REEIEKVRNMTEEERREWERKNPK-XXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGS 360
           REEIEK+RNMTE+ERR+WERKNPK       +KW FMQKYYHKGAFFQ D DD AG+AG+
Sbjct: 301 REEIEKLRNMTEQERRDWERKNPKPSSAQPKKKWNFMQKYYHKGAFFQADPDDEAGSAGT 360

Query: 361 DNIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPL 420
           D IF RDFS+PTGED++DK+ILPKVMQVKHFGRSGRTKWTHLVNEDTTDW+NPWT NDPL
Sbjct: 361 DGIFQRDFSAPTGEDRLDKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWSNPWTSNDPL 420

Query: 421 RAKYNAKMAGMNAPITKPKGSKKLKDWES 433
           R KYN KMAGM+API KPKGSKK+KDWE+
Sbjct: 421 REKYNKKMAGMDAPIAKPKGSKKMKDWET 435

BLAST of CsaV3_4G031590.1 vs. Swiss-Prot
Match: sp|Q93712|MFAP1_CAEEL (Microfibrillar-associated protein 1 OS=Caenorhabditis elegans OX=6239 GN=mfap-1 PE=1 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 1.2e-31
Identity = 73/172 (42.44%), Postives = 112/172 (65.12%), Query Frame = 0

Query: 260 YEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERREWERKNPKXXXXXXQ--KW 319
           YEAWK+RE+ R+KR+R+      +E+ E++K+  M+EEER ++ R NPK         K+
Sbjct: 304 YEAWKLREMKRLKRNRDEXXEAAREKAELDKIHAMSEEERLKYLRLNPKVITNKQDKGKY 363

Query: 320 KFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRS 379
           KF+QKY+H+GAFF ++ D+         +  R+F+  T +D+ DKTILPKVMQVK+FG++
Sbjct: 364 KFLQKYFHRGAFFLDEEDE---------VLKRNFAEATNDDQFDKTILPKVMQVKNFGKA 423

Query: 380 GRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITKPKGSKKLKD 430
            RTK+THL  EDTTD    W   + L ++++ K AG + P+ +   +KK K+
Sbjct: 424 SRTKYTHLTEEDTTDHQGVWASTNQLNSQFSTKRAGGSRPVFERPATKKRKN 466

BLAST of CsaV3_4G031590.1 vs. Swiss-Prot
Match: sp|Q9W062|MFAP1_DROME (Microfibrillar-associated protein 1 OS=Drosophila melanogaster OX=7227 GN=Mfap1 PE=1 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 4.0e-27
Identity = 69/142 (48.59%), Postives = 91/142 (64.08%), Query Frame = 0

Query: 290 KVRNMTEEERREWERKNPKXXXXXXQ--KWKFMQKYYHKGAFFQEDADDNAGTAGSDNIF 349
           ++RNMTEEERR+  R+NPK         K+KF+QKYYH+GAF+ ++ +D         + 
Sbjct: 344 RMRNMTEEERRQELRQNPKVVTNKATKGKYKFLQKYYHRGAFYLDEEND---------VL 403

Query: 350 HRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKY 409
            RDF+  T ED  DKTILPKVMQVK+FGR GRTK+THLV++DTT +++PW        K+
Sbjct: 404 KRDFAQATLEDHFDKTILPKVMQVKNFGRCGRTKYTHLVDQDTTKFDSPWYAESSSNIKF 463

Query: 410 -NAKMAGMNAPITKPKGSKKLK 429
            N    GM     KP GSK+ K
Sbjct: 464 HNEHAGGMRQQFDKPTGSKRKK 476

BLAST of CsaV3_4G031590.1 vs. Swiss-Prot
Match: sp|C0HKD8|MFA1A_MOUSE (Microfibrillar-associated protein 1A OS=Mus musculus OX=10090 GN=Mfap1a PE=1 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 2.9e-25
Identity = 66/142 (46.48%), Postives = 90/142 (63.38%), Query Frame = 0

Query: 288 IEKVRNMTEEERREWERKNPK--XXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDN 347
           IE++RN+TEEERR   R N K         K+KF+QKYYH+GAFF ++          + 
Sbjct: 304 IERMRNLTEEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEE 363

Query: 348 IFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRA 407
           ++ RDFS+PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++ W        
Sbjct: 364 VYKRDFSAPTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNT 423

Query: 408 K-YNAKMAGMNAPITKPKGSKK 427
           K +  K AG+     +P   K+
Sbjct: 424 KFFKQKAAGVRDVFERPSAKKR 436

BLAST of CsaV3_4G031590.1 vs. Swiss-Prot
Match: sp|C0HKD9|MFA1B_MOUSE (Microfibrillar-associated protein 1B OS=Mus musculus OX=10090 GN=Mfap1b PE=1 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 2.9e-25
Identity = 66/142 (46.48%), Postives = 90/142 (63.38%), Query Frame = 0

Query: 288 IEKVRNMTEEERREWERKNPK--XXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDN 347
           IE++RN+TEEERR   R N K         K+KF+QKYYH+GAFF ++          + 
Sbjct: 304 IERMRNLTEEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEE 363

Query: 348 IFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRA 407
           ++ RDFS+PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++ W        
Sbjct: 364 VYKRDFSAPTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNT 423

Query: 408 K-YNAKMAGMNAPITKPKGSKK 427
           K +  K AG+     +P   K+
Sbjct: 424 KFFKQKAAGVRDVFERPSAKKR 436

BLAST of CsaV3_4G031590.1 vs. Swiss-Prot
Match: sp|Q5EA98|MFAP1_BOVIN (Microfibrillar-associated protein 1 OS=Bos taurus OX=9913 GN=MFAP1 PE=2 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 2.9e-25
Identity = 66/142 (46.48%), Postives = 90/142 (63.38%), Query Frame = 0

Query: 288 IEKVRNMTEEERREWERKNPK--XXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDN 347
           IE++RN+TEEERR   R N K         K+KF+QKYYH+GAFF ++          + 
Sbjct: 304 IERMRNLTEEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEE 363

Query: 348 IFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRA 407
           ++ RDFS+PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++ W        
Sbjct: 364 VYKRDFSAPTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNT 423

Query: 408 K-YNAKMAGMNAPITKPKGSKK 427
           K +  K AG+     +P   K+
Sbjct: 424 KFFKQKAAGVRDVFERPSAKKR 436

BLAST of CsaV3_4G031590.1 vs. TrEMBL
Match: tr|A0A0A0L1H0|A0A0A0L1H0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G578870 PE=4 SV=1)

HSP 1 Score: 557.8 bits (1436), Expect = 2.2e-155
Identity = 433/433 (100.00%), Postives = 433/433 (100.00%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX
Sbjct: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX
Sbjct: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of CsaV3_4G031590.1 vs. TrEMBL
Match: tr|A0A1S3B4S7|A0A1S3B4S7_CUCME (microfibrillar-associated protein 1 OS=Cucumis melo OX=3656 GN=LOC103486019 PE=4 SV=1)

HSP 1 Score: 531.6 bits (1368), Expect = 1.7e-147
Identity = 414/433 (95.61%), Postives = 416/433 (96.07%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           P QEDSD+S    XXXXXXXXXXXXXXXXXXXXXX IRQAEIVSTIEE         XXX
Sbjct: 61  PSQEDSDLSRKDDXXXXXXXXXXXXXXXXXXXXXXRIRQAEIVSTIEEXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX TGIAMV
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPTGIAMV 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRK XXXXXXXX
Sbjct: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKDXXXXXXXX 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSD+IFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of CsaV3_4G031590.1 vs. TrEMBL
Match: tr|A0A200QM48|A0A200QM48_9MAGN (Micro-fibrillar-associated protein 1 OS=Macleaya cordata OX=56857 GN=BVC80_1017g14 PE=4 SV=1)

HSP 1 Score: 407.9 bits (1047), Expect = 2.8e-110
Identity = 289/433 (66.74%), Postives = 299/433 (69.05%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADDA+EDGDIR AR  ALEKAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDAEEDGDIRTARDVALEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           PR+EDSD+                            IRQAEIVSTIE          XXX
Sbjct: 61  PRREDSDVVGKDDPRLRRLAESKIDNREEIRADHRRIRQAEIVSTIEXXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  TGIAMV
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXELTGIAMV 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KP+FVPKSER+TI                          ETK IVV EIRK         
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEDLMKRRLEERKVETKQIVVSEIRKDEEIQKNLE 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
               IADVDTD                                       VRNMTEEERR
Sbjct: 241 VEANIADVDTDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           EWERKNPK      QKW+FMQKYYHKGAFFQ DADD   TAGSDNIF RDFS+PTGEDKM
Sbjct: 301 EWERKNPKPLAPSKQKWRFMQKYYHKGAFFQSDADDIVATAGSDNIFTRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWE+R
Sbjct: 421 PKGSKKLKDWETR 433

BLAST of CsaV3_4G031590.1 vs. TrEMBL
Match: tr|M5WI16|M5WI16_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_4G226700 PE=4 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 8.2e-110
Identity = 305/433 (70.44%), Postives = 324/433 (74.83%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRM+ AA+LEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMSAAASLEKAF 60

Query: 61  PRQEDSDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEIVSTIEEETRRQEGLDXXX 120
           P QE SD+                            IRQAEIVSTIEEE +RQEGL+XXX
Sbjct: 61  PTQEYSDVVRKDDPRLRRLAESRIDNREDVRADHRRIRQAEIVSTIEEEAKRQEGLEXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTGIAMV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX TG+ M+
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLTGMVML 180

Query: 181 KPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXXXXXXX 240
           KP+FVPKSER+TI      XXXXXXXXXXXXXXXXXXXX  K IVVEEIRK         
Sbjct: 181 KPVFVPKSERDTIAERERLXXXXXXXXXXXXXXXXXXXXXXKQIVVEEIRKDEEIQKGLE 240

Query: 241 XXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
               I D+DTD                                      +VRNMTEEERR
Sbjct: 241 QEGNIVDIDTDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRVRNMTEEERR 300

Query: 301 EWERKNPKXXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360
           +WERK+PK      QKW+FMQKYYHKGAFFQ + DD A T G+D I+ RDFS+PTGEDKM
Sbjct: 301 DWERKHPKAAPQPKQKWRFMQKYYHKGAFFQSEPDDYAATVGTDGIYTRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLR+KYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRSKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of CsaV3_4G031590.1 vs. TrEMBL
Match: tr|A0A2U1KAH5|A0A2U1KAH5_ARTAN (Micro-fibrillar-associated protein 1, C-terminal OS=Artemisia annua OX=35608 GN=CTI12_AA625440 PE=4 SV=1)

HSP 1 Score: 404.8 bits (1039), Expect = 2.4e-109
Identity = 298/439 (67.88%), Postives = 320/439 (72.89%), Query Frame = 0

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADD DEDGDIR ++ AALE AF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDGDEDGDIRTSK-AALESAF 60

Query: 61  PRQED----SDISXXXXXXXXXXXXXXXXXXXXXXXXXXXIRQAEI-VSTIEEETRRQEG 120
           P ++D    + I                            IRQAEI              
Sbjct: 61  PSRDDDGDRTHIVKGDDRRLRRLAENRLDNKEEVRADHRRIRQAEIXXXXXXXXXXXXXX 120

Query: 121 LDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXT 180
             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDQM 180

Query: 181 GIAMVKPIFVPKSERETIXXXXXXXXXXXXXXXXXXXXXXXXXXETKHIVVEEIRKXXXX 240
           G+AMVKP+FVPKSER+TI                    XXXXXX         IRK    
Sbjct: 181 GMAMVKPVFVPKSERDTIAEREKIEAEERAVEELVKRRXXXXXXXXXXXXXXXIRKDMEV 240

Query: 241 XXXXXXXXXIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMT 300
                    IADV+TDD++N A+EYEAWKVREIARIKRDRE  DAM KEREEIE+VRNMT
Sbjct: 241 QKNLEAEADIADVETDDDLNNADEYEAWKVREIARIKRDREXXDAMAKEREEIERVRNMT 300

Query: 301 EEERREWERKNPK-XXXXXXQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSP 360
           EEERREWERKNPK       QKW+FMQKYYHKGAFFQ+D DD AGTAG+D ++ RD+SSP
Sbjct: 301 EEERREWERKNPKQNNGAQKQKWRFMQKYYHKGAFFQDDPDDTAGTAGADGVYRRDYSSP 360

Query: 361 TGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGM 420
           TGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYND LRAKYN++MAGM
Sbjct: 361 TGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDTLRAKYNSQMAGM 420

Query: 421 NAPITKPKGSKKLKDWESR 434
             PI KPKG KK+KDWESR
Sbjct: 421 K-PIAKPKG-KKIKDWESR 436

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011653945.13.3e-155100.00PREDICTED: microfibrillar-associated protein 1 [Cucumis sativus] >KGN54899.1 hyp... [more]
XP_023543457.15.5e-15088.68microfibrillar-associated protein 1-like [Cucurbita pepo subsp. pepo][more]
XP_022950647.17.2e-15088.68microfibrillar-associated protein 1-like [Cucurbita moschata][more]
XP_022978182.13.0e-14894.00microfibrillar-associated protein 1-like [Cucurbita maxima][more]
XP_008442034.12.6e-14795.61PREDICTED: microfibrillar-associated protein 1 [Cucumis melo] >XP_008442035.1 PR... [more]
Match NameE-valueIdentityDescription
AT5G17900.12.0e-10663.47microfibrillar-associated protein-related[more]
AT4G08580.15.0e-10562.81microfibrillar-associated protein-related[more]
Match NameE-valueIdentityDescription
sp|Q93712|MFAP1_CAEEL1.2e-3142.44Microfibrillar-associated protein 1 OS=Caenorhabditis elegans OX=6239 GN=mfap-1 ... [more]
sp|Q9W062|MFAP1_DROME4.0e-2748.59Microfibrillar-associated protein 1 OS=Drosophila melanogaster OX=7227 GN=Mfap1 ... [more]
sp|C0HKD8|MFA1A_MOUSE2.9e-2546.48Microfibrillar-associated protein 1A OS=Mus musculus OX=10090 GN=Mfap1a PE=1 SV=... [more]
sp|C0HKD9|MFA1B_MOUSE2.9e-2546.48Microfibrillar-associated protein 1B OS=Mus musculus OX=10090 GN=Mfap1b PE=1 SV=... [more]
sp|Q5EA98|MFAP1_BOVIN2.9e-2546.48Microfibrillar-associated protein 1 OS=Bos taurus OX=9913 GN=MFAP1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
tr|A0A0A0L1H0|A0A0A0L1H0_CUCSA2.2e-155100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G578870 PE=4 SV=1[more]
tr|A0A1S3B4S7|A0A1S3B4S7_CUCME1.7e-14795.61microfibrillar-associated protein 1 OS=Cucumis melo OX=3656 GN=LOC103486019 PE=4... [more]
tr|A0A200QM48|A0A200QM48_9MAGN2.8e-11066.74Micro-fibrillar-associated protein 1 OS=Macleaya cordata OX=56857 GN=BVC80_1017g... [more]
tr|M5WI16|M5WI16_PRUPE8.2e-11070.44Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_4G226700 PE=4 SV=1[more]
tr|A0A2U1KAH5|A0A2U1KAH5_ARTAN2.4e-10967.88Micro-fibrillar-associated protein 1, C-terminal OS=Artemisia annua OX=35608 GN=... [more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR009730MFAP1_C
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CsaV3_4G031590CsaV3_4G031590gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CsaV3_4G031590.1CsaV3_4G031590.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsaV3_4G031590.1.exon4CsaV3_4G031590.1.exon4exon
CsaV3_4G031590.1.exon3CsaV3_4G031590.1.exon3exon
CsaV3_4G031590.1.exon2CsaV3_4G031590.1.exon2exon
CsaV3_4G031590.1.exon1CsaV3_4G031590.1.exon1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CsaV3_4G031590.1.cds3CsaV3_4G031590.1.cds3CDS
CsaV3_4G031590.1.cds2CsaV3_4G031590.1.cds2CDS


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 189..219
NoneNo IPR availableCOILSCoilCoilcoord: 116..146
NoneNo IPR availableCOILSCoilCoilcoord: 278..305
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 414..433
NoneNo IPR availablePANTHERPTHR15327MICROFIBRIL-ASSOCIATED PROTEINcoord: 1..432
NoneNo IPR availablePANTHERPTHR15327:SF2MICROFIBRIL-ASSOCIATED PROTEIN-LIKE-RELATEDcoord: 1..432
IPR009730Micro-fibrillar-associated protein 1, C-terminalPFAMPF06991MFAP1coord: 169..390
e-value: 8.6E-76
score: 254.5