Cla003085 (gene) Watermelon (97103) v1

NameCla003085
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionMicrofibrillar-associated protein-like protein (AHRD V1 *-*- O22281_ARATH); contains Interpro domain(s) IPR009730 Micro-fibrillar-associated 1, C-terminal
LocationChr4 : 15036865 .. 15038528 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGGTCACGGCAGGGGTTAGTGATACTGTAATTGCTGTTAGGGATAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCTGAGTGGGCGGATGATGCTGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCACTTGAGAAAGCATTTCCAAGTCAGGAAGATTCAAATCTATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTAGCTGAGAGTAGGATAGATAATCGGGAGGAGATCAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACGCGGAAGCAGGAGGGTTTAGATGCAGAGGAAGAGGATGAGGAGGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTCCCTGAAGAAGAAGAGGAGGAGGAACCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGATTCAGAAGATGAACCTACTGGGATAACAATGGTGAAGCCAATATTTGTTCCGAAATCAGAGAGAGAAACTATTGCCGAACGGGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAGGAATTGAGAAAACGGCGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCAAATATTGCAGATGTGGACACTGATGATGAAATTAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGGGAGATTGCTAGGATCAAGAGGGATAGAGAACTCCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACTGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCAAAAACTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCGTTCTTCCAGGAAGATGCTGATGATAATGCCGGAACTGCTGGATCTGATTCTATTTTCCATCGTGATTTCTCTTCTCCAACTGGAGAAGATAAGATGGACAAGACAATATTGCCGAAGGTTATGCAGGTGAAGCACTTTGGACGTAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCGTAAGTTAATTTAATTTTGATTGCCTTCTGCTTCTTTTTCCTATTGTAAATTATTGCAATGTCAATCTTGATTTCTTAATTGTGTAAGAACCTAGATTACATATAGTATGATCCATAGAAAAAGCTTCTATTGCATTGTCTACTTATGATTGAGAGTGTGTTAATCAGAGAATGAATGGAAAGAGTTTTAGTATGAGCATGAGATTTATGGACGTACTGAGCAGCCAGAGCTTGATATTGACATTGAGACTTTTAGTTTGTTTATATATCTACAGGAGAATACGAAGATATGACTAGAACAATGCCCTGTTTCCACTTCTCTGTTTGTTTGCTTTCTTTGCCCGTTTTTTCGTTTGTAGTGTGGTTGCCATGTGGAAGTTATTCTGCAGTGTGAGTGACGTCAATGGTTACTGTCATGCAGTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCCGGAATGA

mRNA sequence

ATGTCGGTCACGGCAGGGGTTAGTGATACTGTAATTGCTGTTAGGGATAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCTGAGTGGGCGGATGATGCTGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCACTTGAGAAAGCATTTCCAAGTCAGGAAGATTCAAATCTATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTAGCTGAGAGTAGGATAGATAATCGGGAGGAGATCAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACGCGGAAGCAGGAGGGTTTAGATGCAGAGGAAGAGGATGAGGAGGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTCCCTGAAGAAGAAGAGGAGGAGGAACCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGATTCAGAAGATGAACCTACTGGGATAACAATGGTGAAGCCAATATTTGTTCCGAAATCAGAGAGAGAAACTATTGCCGAACGGGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAGGAATTGAGAAAACGGCGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCAAATATTGCAGATGTGGACACTGATGATGAAATTAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGGGAGATTGCTAGGATCAAGAGGGATAGAGAACTCCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACTGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCAAAAACTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCGTTCTTCCAGGAAGATGCTGATGATAATGCCGGAACTGCTGGATCTGATTCTATTTTCCATCGTGATTTCTCTTCTCCAACTGGAGAAGATAAGATGGACAAGACAATATTGCCGAAGGTTATGCAGGTGAAGCACTTTGGACGTAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCTGTGGTTGCCATGTGGAAGTTATTCTGCAGTGTGAGTGACGTCAATGGTTACTGTCATGCAGTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCCGGAATGA

Coding sequence (CDS)

ATGTCGGTCACGGCAGGGGTTAGTGATACTGTAATTGCTGTTAGGGATAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCTGAGTGGGCGGATGATGCTGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCACTTGAGAAAGCATTTCCAAGTCAGGAAGATTCAAATCTATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTAGCTGAGAGTAGGATAGATAATCGGGAGGAGATCAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACGCGGAAGCAGGAGGGTTTAGATGCAGAGGAAGAGGATGAGGAGGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTCCCTGAAGAAGAAGAGGAGGAGGAACCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGATTCAGAAGATGAACCTACTGGGATAACAATGGTGAAGCCAATATTTGTTCCGAAATCAGAGAGAGAAACTATTGCCGAACGGGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAGGAATTGAGAAAACGGCGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCAAATATTGCAGATGTGGACACTGATGATGAAATTAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGGGAGATTGCTAGGATCAAGAGGGATAGAGAACTCCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACTGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCAAAAACTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCGTTCTTCCAGGAAGATGCTGATGATAATGCCGGAACTGCTGGATCTGATTCTATTTTCCATCGTGATTTCTCTTCTCCAACTGGAGAAGATAAGATGGACAAGACAATATTGCCGAAGGTTATGCAGGTGAAGCACTTTGGACGTAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCTGTGGTTGCCATGTGGAAGTTATTCTGCAGTGTGAGTGACGTCAATGGTTACTGTCATGCAGTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCCGGAATGA

Protein sequence

MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAFPSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEEEDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERREWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPVVAMWKLFCSVSDVNGYCHAVGPITILFGQNTMQKWPE
BLAST of Cla003085 vs. Swiss-Prot
Match: MFAP1_HUMAN (Microfibrillar-associated protein 1 OS=Homo sapiens GN=MFAP1 PE=1 SV=2)

HSP 1 Score: 290.8 bits (743), Expect = 2.5e-77
Identity = 178/402 (44.28%), Postives = 255/402 (63.43%), Query Frame = 1

Query: 12  IAVRDKLRGKIG--QTKVKRYWPGKAPEWA---DDADEDGDIRMARAAALEKAFPSQEDS 71
           + VR++ +G+I   + KVKRY  GK P++A      +ED + +  + A  ++A P +++ 
Sbjct: 20  VPVRNE-KGEISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEE 79

Query: 72  NLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIV----STIEEETRKQEGLDAEEE- 131
           + S   DPRLRRL     ++ EE  A HR+I + E+V    S +E +  + E  D+ EE 
Sbjct: 80  DSS--SDPRLRRLQNRISEDVEERLARHRKIVEPEVVGESDSEVEGDAWRMEREDSSEEE 139

Query: 132 ----DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYE--TDSEDEPT 191
               D+E +E RR  ++++ ++R+ EE    E E+E    EE E ESEYE  TDSEDE  
Sbjct: 140 EEEIDDEEIERRRGMMRQRAQERKNEEMEVMEVEDEGRSGEESESESEYEEYTDSEDEME 199

Query: 192 GITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEI 251
               +KP+F+ K +R T+ ERE    +++ LE+  KR  EER+  T  IV EE +K  E+
Sbjct: 200 --PRLKPVFIRKKDRVTVQEREAEALKQKELEQEAKRMAEERRKYTLKIVEEETKK--EL 259

Query: 252 QKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMT 311
           ++N    A +  ++TDDE N+ EEYEAWKVRE+ RIKRDRE R+A+ KE+ EIE++RN+T
Sbjct: 260 EENKRSLAALDALNTDDE-NDEEEYEAWKVRELKRIKRDREDREALEKEKAEIERMRNLT 319

Query: 312 EEERREWERKNPK--TAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSS 371
           EEERR   R N K  T    K K+KF+QKYYH+GAFF ++          + ++ RDFS+
Sbjct: 320 EEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEEVYKRDFSA 379

Query: 372 PTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNN 396
           PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++
Sbjct: 380 PTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDS 404

BLAST of Cla003085 vs. Swiss-Prot
Match: MFAP1_BOVIN (Microfibrillar-associated protein 1 OS=Bos taurus GN=MFAP1 PE=2 SV=1)

HSP 1 Score: 289.7 bits (740), Expect = 5.5e-77
Identity = 178/402 (44.28%), Postives = 254/402 (63.18%), Query Frame = 1

Query: 12  IAVRDKLRGKIG--QTKVKRYWPGKAPEWA---DDADEDGDIRMARAAALEKAFPSQEDS 71
           + VR++ +G+I   + KVKRY  GK P++A      +ED + +  + A  ++A P +++ 
Sbjct: 20  VPVRNE-KGEISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEE 79

Query: 72  NLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIV----STIEEETRKQEGLDAEEE- 131
           + S   DPRLRRL     ++ EE  A HR+I + E+V    S +E +  + E  D+ EE 
Sbjct: 80  DSS--SDPRLRRLQNRISEDVEERLARHRKIVEPEVVGESDSEVEGDPWRMEREDSSEEE 139

Query: 132 ----DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYE--TDSEDEPT 191
               DEE +E RR  ++++ ++R+ EE    E E+E    EE E ESEYE  TDSEDE  
Sbjct: 140 EEEIDEEEIERRRGMMRQRAQERKNEELEVMEVEDEGRSGEESESESEYEEYTDSEDEME 199

Query: 192 GITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEI 251
               +KP+F+ K +R T+ ERE    +++ LE+  K   EER+  T  IV EE +K  E+
Sbjct: 200 --PRLKPVFIRKKDRVTVQEREAEALKQKELEQEAKHMAEERRKYTLKIVEEETKK--EL 259

Query: 252 QKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMT 311
           ++N    A +  ++TDDE N+ EEYEAWKVRE+ RIKRDRE R+A+ KE+ EIE++RN+T
Sbjct: 260 EENKRSLAALDALNTDDE-NDEEEYEAWKVRELKRIKRDREDREALEKEKAEIERMRNLT 319

Query: 312 EEERREWERKNPK--TAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSS 371
           EEERR   R N K  T    K K+KF+QKYYH+GAFF ++          + ++ RDFS+
Sbjct: 320 EEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEEVYKRDFSA 379

Query: 372 PTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNN 396
           PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++
Sbjct: 380 PTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDS 404

BLAST of Cla003085 vs. Swiss-Prot
Match: MFAP1_MOUSE (Microfibrillar-associated protein 1 OS=Mus musculus GN=Mfap1 PE=1 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 1.2e-76
Identity = 177/402 (44.03%), Postives = 255/402 (63.43%), Query Frame = 1

Query: 12  IAVRDKLRGKIG--QTKVKRYWPGKAPEWA---DDADEDGDIRMARAAALEKAFPSQEDS 71
           + VR++ +G+I   + KVKRY  GK P++A      +ED + +  + A  ++A P +++ 
Sbjct: 20  VPVRNE-KGEISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEE 79

Query: 72  NLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIV----STIEEETRKQEGLDAEEE- 131
           + S   DPRLRRL     ++ EE  A HR+I + E+V    S +E +  + E  D+ EE 
Sbjct: 80  DSS--SDPRLRRLQNRISEDVEERLARHRKIVEPEVVGESDSEVEGDAWRLEREDSSEEE 139

Query: 132 ----DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYE--TDSEDEPT 191
               D+E +E RR  ++++ ++R+ EE    E E+E    EE E ESEYE  TDSEDE  
Sbjct: 140 EEEIDDEEIERRRGMMRQRAQERKNEEMEVMEVEDEGRSGEESESESEYEEYTDSEDEME 199

Query: 192 GITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEI 251
               +KP+F+ K +R T+ ERE    +++ LE+  KR  EER+  T  IV EE +K  E+
Sbjct: 200 --PRLKPVFIRKKDRVTVQEREAEALKQKELEQEAKRMAEERRKYTLKIVEEETKK--EL 259

Query: 252 QKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMT 311
           ++N    A +  ++TDDE N+ EEYEAWKVRE+ RIKR+RE R+A+ KE+ EIE++RN+T
Sbjct: 260 EENKRSLAALDALNTDDE-NDEEEYEAWKVRELKRIKREREDREALEKEKAEIERMRNLT 319

Query: 312 EEERREWERKNPK--TAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSS 371
           EEERR   R N K  T    K K+KF+QKYYH+GAFF ++          + ++ RDFS+
Sbjct: 320 EEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEEVYKRDFSA 379

Query: 372 PTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNN 396
           PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++
Sbjct: 380 PTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDS 404

BLAST of Cla003085 vs. Swiss-Prot
Match: MFAP1_CHICK (Microfibrillar-associated protein 1 OS=Gallus gallus GN=MFAP1 PE=2 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 2.1e-73
Identity = 166/368 (45.11%), Postives = 237/368 (64.40%), Query Frame = 1

Query: 43  DEDGDIRMARAAALEKAFPSQEDSNLSRKDDPRLRRLAESRI-DNREEIRADHRRIRQAE 102
           +ED + +  + A  ++  P +++  ++  +DPRLRRL ++RI ++ EE  A HR+I + E
Sbjct: 56  EEDEEFQFIKKAKEQEVEPEEQEEEVA--NDPRLRRLLQNRITEDVEERLARHRKIVEPE 115

Query: 103 IVS-----TIEEETRKQEGLDAEEEDEEALEER-----RRRIKEKLRQRELEEAAFPEEE 162
           +VS      +E E  + E  D  EE+EE +++      R  ++++ ++R+ EE    E E
Sbjct: 116 VVSGESDSEVEGEAWRVEREDTSEEEEEEIDDEEIERWRGMMRQRAQERKTEELEVMELE 175

Query: 163 EEEEPEEEEEEESEYE--TDSEDEPTGITMVKPIFVPKSERETIAERERIEEEERSLEEL 222
           +E    EE E ESEYE  TDSEDE      +KP+F+ K +R T+ ERE    +++ LE+ 
Sbjct: 176 DEGRSGEESELESEYEEYTDSEDEME--PRLKPVFIRKKDRITVQEREAEALKQKELEQE 235

Query: 223 RKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWKVREIA 282
            KR  EER+  T  IV EE +K  E+++N    A +  +DTDDE N+ EEYEAWKVRE+ 
Sbjct: 236 AKRLAEERRKYTLKIVEEEAKK--ELEENKRSLAALDALDTDDE-NDEEEYEAWKVRELK 295

Query: 283 RIKRDRELRDAMLKEREEIEKVRNMTEEERREWERKNPK--TAPPPKQKWKFMQKYYHKG 342
           RIKRDRE R+AM KE+ EIE++RN+TEEERR   R N K  T    K K+KF+QKYYH+G
Sbjct: 296 RIKRDREEREAMEKEKAEIERMRNLTEEERRAELRANGKVVTNKAVKGKYKFLQKYYHRG 355

Query: 343 AFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVN 396
           AFF ++          + ++ RDFS+PT ED  +KTILPKVMQVK+FGRSGRTK+THLV+
Sbjct: 356 AFFMDE---------DEEVYKRDFSAPTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVD 407

BLAST of Cla003085 vs. Swiss-Prot
Match: MFAP1_DICDI (Protein MFAP1 homolog OS=Dictyostelium discoideum GN=mfap1 PE=3 SV=1)

HSP 1 Score: 192.2 bits (487), Expect = 1.2e-47
Identity = 134/436 (30.73%), Postives = 220/436 (50.46%), Query Frame = 1

Query: 14  VRDKLRGKIGQ----------TKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAFPSQ 73
           +RDKL+  +            TKV RY  G+ P++A+ A++D D   +          +Q
Sbjct: 7   IRDKLKNNLESSGGHSVVTSGTKVIRYRAGQRPDYAE-AEDDQDSHFSN-------IQNQ 66

Query: 74  EDSNLSRKDDPRLRRLAE--------SRIDNREEIRADH------------------RRI 133
           +    +  +DPRL R             ++ R+  R  H                  RRI
Sbjct: 67  KSIKEAETNDPRLARFKNRSSNQDEPQSVEERKASRRRHHDDDNDNDTTTTTTTTTSRRI 126

Query: 134 RQAEIVSTIEEETRKQEG-LDAEEEDEEALEERRRRIKEK-LRQRELEEAAFPEEEEEEE 193
           ++ EI+   ++           +  D++  ++RRRR KE+ L+++E EE    E EE+++
Sbjct: 127 QKTEIIKEDDDNNNNNNNDTKKDHNDDDEDDDRRRRAKERYLKKKEEEEQKQKELEEKQQ 186

Query: 194 P------EEEEEEESEYETDSEDEPTGI-----TMVKPIFVPKSERETIAERERIEEEER 253
           P      E EEE  SEYETDSE++          + +P F+ K +R TI   E+ E+EE+
Sbjct: 187 PFKDIEGESEEEGSSEYETDSEEDDEDEYWDQPPIFRPTFIKKDDRGTIKTDEQWEKEEQ 246

Query: 254 SLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWK 313
             +   +R  E+RK E    + +E+ +D + Q+  E+E    +   DDE  +  +   W 
Sbjct: 247 EQQAQLEREKEQRKIEAHRKLKDELDRDRKEQEAKELEQKEEEEYDDDEDQDGSKKLLWI 306

Query: 314 VREIARIKRDRELRDAMLKEREEIEKVRNMTEEE--RREWERKNPKTAPPPKQKWKFMQK 373
            RE+ R++ +   R     E++E  + R MT+++  + +  R         K++ KF+Q+
Sbjct: 307 QRELERVRLEIHTRLLAEFEKKEFARRRAMTDDQILKEDPSRSRTNIDNSQKKQLKFLQR 366

Query: 374 YYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKW 395
            YH+GAFFQ+D          + I ++DFS+PTGEDK ++ +LPKVMQVK+FG++GRTK+
Sbjct: 367 DYHRGAFFQDD----------EYIKNKDFSAPTGEDKFNRELLPKVMQVKNFGKAGRTKY 424

BLAST of Cla003085 vs. TrEMBL
Match: A0A061EZF4_THECC (Microfibrillar-associated protein 1 OS=Theobroma cacao GN=TCM_025838 PE=4 SV=1)

HSP 1 Score: 686.8 bits (1771), Expect = 1.7e-194
Identity = 334/397 (84.13%), Postives = 368/397 (92.70%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADDADE+GDIRMARA ALEKAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDADEEGDIRMARAVALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           PS++DS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEE R+ EG++AEE
Sbjct: 61  PSRDDSDVVRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEENRRNEGVEAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAF-PEEEEEEEPEEEEEEESEYETDSEDEPTGITM 180
           EDE+ALEERRRRI+EKL QRE EE A   EEEEEEE EEEEEEESEYETDSE+E TGI M
Sbjct: 121 EDEDALEERRRRIREKLLQREQEETALLEEEEEEEEVEEEEEEESEYETDSEEEHTGIAM 180

Query: 181 VKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNL 240
           VKP+FVPKSER+TIAERER+E EER++EE  KR+LE RK ET+ IVVE+IR+DEEIQKN+
Sbjct: 181 VKPVFVPKSERDTIAERERLEAEERAIEEAEKRKLEHRKVETRQIVVEKIREDEEIQKNM 240

Query: 241 EMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEER 300
           E+EAN+ADVDTDDE+NEAEEYEAWK REIARIKRDRE R+AM+KE+EEIEKVRNMTEEER
Sbjct: 241 ELEANVADVDTDDEVNEAEEYEAWKAREIARIKRDREEREAMIKEKEEIEKVRNMTEEER 300

Query: 301 REWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDK 360
           REWERKNPK APPPKQKWKFMQKYYHKGAFFQ +ADD A   G+D+I+HRDFS PTGEDK
Sbjct: 301 REWERKNPKPAPPPKQKWKFMQKYYHKGAFFQAEADDPAAAVGADNIYHRDFSGPTGEDK 360

Query: 361 MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397

BLAST of Cla003085 vs. TrEMBL
Match: V4T8Z0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020229mg PE=4 SV=1)

HSP 1 Score: 686.8 bits (1771), Expect = 1.7e-194
Identity = 338/396 (85.35%), Postives = 366/396 (92.42%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADD +ED DIRM+RAAAL+KAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDIEEDNDIRMSRAAALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P +EDS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEETR QEGLD EE
Sbjct: 61  PRKEDSDIGRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEETR-QEGLDMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRI+EKL QRE EEAA   EEEEE  EEEEEEESEYETDSE+E  GI M+
Sbjct: 121 EDEEALEERRRRIREKLLQREQEEAALLPEEEEEAVEEEEEEESEYETDSEEEQMGIAML 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FVPKSER+TIAERER+E EE++LEEL KR+LEERK ETK I+VEE+RKDEEIQKNLE
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEELAKRKLEERKVETKKILVEEVRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEVNEAEEYEAWKVREIARIKRDREAREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWER+NPK APPPKQKW+FMQKYYHKGAFFQ DA D A T  +D I+HRDFS+PTGEDKM
Sbjct: 301 EWERRNPKPAPPPKQKWRFMQKYYHKGAFFQSDAADTAATVRTDEIYHRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 395

BLAST of Cla003085 vs. TrEMBL
Match: A0A067FS77_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013977mg PE=4 SV=1)

HSP 1 Score: 686.8 bits (1771), Expect = 1.7e-194
Identity = 338/396 (85.35%), Postives = 366/396 (92.42%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADD +ED DIRM+RAAAL+KAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDIEEDNDIRMSRAAALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P +EDS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEETR QEGLD EE
Sbjct: 61  PRKEDSDIGRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEETR-QEGLDMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRI+EKL QRE EEAA   EEEEE  EEEEEEESEYETDSE+E  GI M+
Sbjct: 121 EDEEALEERRRRIREKLLQREQEEAALLPEEEEEAVEEEEEEESEYETDSEEEQMGIAML 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FVPKSER+TIAERER+E EE++LEEL KR+LEERK ETK I+VEE+RKDEEIQKNLE
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEELAKRKLEERKVETKKILVEEVRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEVNEAEEYEAWKVREIARIKRDREAREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWER+NPK APPPKQKW+FMQKYYHKGAFFQ DA D A T  +D I+HRDFS+PTGEDKM
Sbjct: 301 EWERRNPKPAPPPKQKWRFMQKYYHKGAFFQSDAADTAATVRTDEIYHRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 395

BLAST of Cla003085 vs. TrEMBL
Match: A0A067KU23_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02667 PE=4 SV=1)

HSP 1 Score: 685.3 bits (1767), Expect = 5.0e-194
Identity = 334/396 (84.34%), Postives = 369/396 (93.18%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSD  +AVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDI+MARA ALEKAF
Sbjct: 1   MSVTAGVSDVALAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIKMARADALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P++EDS+++RKDDPRLRRLAES+IDNR+E+RADHRRIRQAEI++T EEET++QE  D EE
Sbjct: 61  PTKEDSDIARKDDPRLRRLAESKIDNRDEVRADHRRIRQAEIIATEEEETQRQEWADMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           E+EEALEERRRRIKEK R RE EEAA P EEEEEEPEEEEEEESEYETDS++E TG+ MV
Sbjct: 121 ENEEALEERRRRIKEKSRLREQEEAALPAEEEEEEPEEEEEEESEYETDSDEEMTGMAMV 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KPIFVPKSERETIAERER+E EE++LEE  KR+LEERK ETK I+VEEI+KDE IQKNLE
Sbjct: 181 KPIFVPKSERETIAERERLEAEEQALEEKAKRKLEERKVETKQILVEEIQKDELIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEA+IADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEASIADVDTDDEVNEAEEYEAWKVREIARIKRDREDREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWERKNPK APPPKQKW+FMQKYYHKGAFFQ ++DD A TAGSD I++RDFS+PTGEDKM
Sbjct: 301 EWERKNPKPAPPPKQKWRFMQKYYHKGAFFQNESDDRAATAGSDDIYNRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           DK+ILPKVMQVKHFGRSGRTKWTHLVNEDTTDWN P
Sbjct: 361 DKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNTP 396

BLAST of Cla003085 vs. TrEMBL
Match: A0A059AKH0_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_J03113 PE=4 SV=1)

HSP 1 Score: 681.8 bits (1758), Expect = 5.5e-193
Identity = 337/396 (85.10%), Postives = 365/396 (92.17%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGK PEWAD+ADEDGDIRMARA AL+KAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKVPEWADEADEDGDIRMARAVALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P+ E S++ +KDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVSTIEEE R+QEGL+AEE
Sbjct: 61  PTYEGSDIGKKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTIEEENRRQEGLEAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           ED EALEERRR+I+EKL  RE EEAA   EEEEEE EEEEEEESEYETDSE+E  GI MV
Sbjct: 121 EDAEALEERRRKIREKLLLREQEEAALLPEEEEEEEEEEEEEESEYETDSEEETKGIAMV 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FV KSER+TIAER+R+EEEER++EEL KRR EERKAETK IVVEEIRKDEEIQKNLE
Sbjct: 181 KPVFVVKSERDTIAERQRLEEEERAIEELMKRRQEERKAETKQIVVEEIRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWK REIARIKRDRE R+AMLK +EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDELNEAEEYEAWKAREIARIKRDREDREAMLKAKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWERKNPK +  PKQKW+FMQKYYHKGAFFQ + D++AGTAGSD I+ RDFS+PTGEDKM
Sbjct: 301 EWERKNPKPSSAPKQKWRFMQKYYHKGAFFQSEVDEHAGTAGSDYIYGRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 396

BLAST of Cla003085 vs. NCBI nr
Match: gi|659082793|ref|XP_008442034.1| (PREDICTED: microfibrillar-associated protein 1 [Cucumis melo])

HSP 1 Score: 785.0 bits (2026), Expect = 6.6e-224
Identity = 392/396 (98.99%), Postives = 394/396 (99.49%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           PSQEDS+LSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETR+QEGLDAEE
Sbjct: 61  PSQEDSDLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRRQEGLDAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGI MV
Sbjct: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGIAMV 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE
Sbjct: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWERKNPK APPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKPAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 396

BLAST of Cla003085 vs. NCBI nr
Match: gi|778695198|ref|XP_011653945.1| (PREDICTED: microfibrillar-associated protein 1 [Cucumis sativus])

HSP 1 Score: 780.0 bits (2013), Expect = 2.1e-222
Identity = 388/396 (97.98%), Postives = 393/396 (99.24%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P QEDS++SRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETR+QEGLDAEE
Sbjct: 61  PRQEDSDISRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRRQEGLDAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDE+ALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGI MV
Sbjct: 121 EDEDALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGIAMV 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE
Sbjct: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWERKNPK APPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSD+IFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKPAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 396

BLAST of Cla003085 vs. NCBI nr
Match: gi|641847174|gb|KDO66055.1| (hypothetical protein CISIN_1g013977mg [Citrus sinensis])

HSP 1 Score: 686.8 bits (1771), Expect = 2.5e-194
Identity = 338/396 (85.35%), Postives = 366/396 (92.42%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADD +ED DIRM+RAAAL+KAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDIEEDNDIRMSRAAALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P +EDS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEETR QEGLD EE
Sbjct: 61  PRKEDSDIGRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEETR-QEGLDMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRI+EKL QRE EEAA   EEEEE  EEEEEEESEYETDSE+E  GI M+
Sbjct: 121 EDEEALEERRRRIREKLLQREQEEAALLPEEEEEAVEEEEEEESEYETDSEEEQMGIAML 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FVPKSER+TIAERER+E EE++LEEL KR+LEERK ETK I+VEE+RKDEEIQKNLE
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEELAKRKLEERKVETKKILVEEVRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEVNEAEEYEAWKVREIARIKRDREAREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWER+NPK APPPKQKW+FMQKYYHKGAFFQ DA D A T  +D I+HRDFS+PTGEDKM
Sbjct: 301 EWERRNPKPAPPPKQKWRFMQKYYHKGAFFQSDAADTAATVRTDEIYHRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 395

BLAST of Cla003085 vs. NCBI nr
Match: gi|567902030|ref|XP_006443503.1| (hypothetical protein CICLE_v10020229mg [Citrus clementina])

HSP 1 Score: 686.8 bits (1771), Expect = 2.5e-194
Identity = 338/396 (85.35%), Postives = 366/396 (92.42%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADD +ED DIRM+RAAAL+KAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDIEEDNDIRMSRAAALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P +EDS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEETR QEGLD EE
Sbjct: 61  PRKEDSDIGRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEETR-QEGLDMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRI+EKL QRE EEAA   EEEEE  EEEEEEESEYETDSE+E  GI M+
Sbjct: 121 EDEEALEERRRRIREKLLQREQEEAALLPEEEEEAVEEEEEEESEYETDSEEEQMGIAML 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FVPKSER+TIAERER+E EE++LEEL KR+LEERK ETK I+VEE+RKDEEIQKNLE
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEELAKRKLEERKVETKKILVEEVRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEVNEAEEYEAWKVREIARIKRDREAREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWER+NPK APPPKQKW+FMQKYYHKGAFFQ DA D A T  +D I+HRDFS+PTGEDKM
Sbjct: 301 EWERRNPKPAPPPKQKWRFMQKYYHKGAFFQSDAADTAATVRTDEIYHRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 395

BLAST of Cla003085 vs. NCBI nr
Match: gi|590640551|ref|XP_007029983.1| (Microfibrillar-associated protein 1 [Theobroma cacao])

HSP 1 Score: 686.8 bits (1771), Expect = 2.5e-194
Identity = 334/397 (84.13%), Postives = 368/397 (92.70%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADDADE+GDIRMARA ALEKAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDADEEGDIRMARAVALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           PS++DS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEE R+ EG++AEE
Sbjct: 61  PSRDDSDVVRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEENRRNEGVEAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAF-PEEEEEEEPEEEEEEESEYETDSEDEPTGITM 180
           EDE+ALEERRRRI+EKL QRE EE A   EEEEEEE EEEEEEESEYETDSE+E TGI M
Sbjct: 121 EDEDALEERRRRIREKLLQREQEETALLEEEEEEEEVEEEEEEESEYETDSEEEHTGIAM 180

Query: 181 VKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNL 240
           VKP+FVPKSER+TIAERER+E EER++EE  KR+LE RK ET+ IVVE+IR+DEEIQKN+
Sbjct: 181 VKPVFVPKSERDTIAERERLEAEERAIEEAEKRKLEHRKVETRQIVVEKIREDEEIQKNM 240

Query: 241 EMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEER 300
           E+EAN+ADVDTDDE+NEAEEYEAWK REIARIKRDRE R+AM+KE+EEIEKVRNMTEEER
Sbjct: 241 ELEANVADVDTDDEVNEAEEYEAWKAREIARIKRDREEREAMIKEKEEIEKVRNMTEEER 300

Query: 301 REWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDK 360
           REWERKNPK APPPKQKWKFMQKYYHKGAFFQ +ADD A   G+D+I+HRDFS PTGEDK
Sbjct: 301 REWERKNPKPAPPPKQKWKFMQKYYHKGAFFQAEADDPAAAVGADNIYHRDFSGPTGEDK 360

Query: 361 MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397
           MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP
Sbjct: 361 MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNP 397

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
MFAP1_HUMAN2.5e-7744.28Microfibrillar-associated protein 1 OS=Homo sapiens GN=MFAP1 PE=1 SV=2[more]
MFAP1_BOVIN5.5e-7744.28Microfibrillar-associated protein 1 OS=Bos taurus GN=MFAP1 PE=2 SV=1[more]
MFAP1_MOUSE1.2e-7644.03Microfibrillar-associated protein 1 OS=Mus musculus GN=Mfap1 PE=1 SV=1[more]
MFAP1_CHICK2.1e-7345.11Microfibrillar-associated protein 1 OS=Gallus gallus GN=MFAP1 PE=2 SV=1[more]
MFAP1_DICDI1.2e-4730.73Protein MFAP1 homolog OS=Dictyostelium discoideum GN=mfap1 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
A0A061EZF4_THECC1.7e-19484.13Microfibrillar-associated protein 1 OS=Theobroma cacao GN=TCM_025838 PE=4 SV=1[more]
V4T8Z0_9ROSI1.7e-19485.35Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020229mg PE=4 SV=1[more]
A0A067FS77_CITSI1.7e-19485.35Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013977mg PE=4 SV=1[more]
A0A067KU23_JATCU5.0e-19484.34Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02667 PE=4 SV=1[more]
A0A059AKH0_EUCGR5.5e-19385.10Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_J03113 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|659082793|ref|XP_008442034.1|6.6e-22498.99PREDICTED: microfibrillar-associated protein 1 [Cucumis melo][more]
gi|778695198|ref|XP_011653945.1|2.1e-22297.98PREDICTED: microfibrillar-associated protein 1 [Cucumis sativus][more]
gi|641847174|gb|KDO66055.1|2.5e-19485.35hypothetical protein CISIN_1g013977mg [Citrus sinensis][more]
gi|567902030|ref|XP_006443503.1|2.5e-19485.35hypothetical protein CICLE_v10020229mg [Citrus clementina][more]
gi|590640551|ref|XP_007029983.1|2.5e-19484.13Microfibrillar-associated protein 1 [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR009730MFAP1_C
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0001527 microfibril
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU32672watermelon unigene v2 vs TrEMBLtranscribed_cluster
WMU40462watermelon EST collection version 2.0transcribed_cluster
WMU58009watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla003085Cla003085.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU58009WMU58009transcribed_cluster
WMU32672WMU32672transcribed_cluster
WMU40462WMU40462transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR009730Micro-fibrillar-associated protein 1, C-terminalPFAMPF06991MFAP1coord: 169..390
score: 1.1
NoneNo IPR availableunknownCoilCoilcoord: 189..219
score: -coord: 278..305
score: -coord: 116..146
scor
NoneNo IPR availablePANTHERPTHR15327MICROFIBRIL-ASSOCIATED PROTEINcoord: 1..396
score: 1.5E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla003085Cla97C04G071340Watermelon (97103) v2wmwmbB291
Cla003085ClCG04G004740Watermelon (Charleston Gray)wcgwmB255
Cla003085Bhi03G001874Wax gourdwgowmB443
The following gene(s) are paralogous to this gene:

None