ClCG04G004740 (gene) Watermelon (Charleston Gray)

NameClCG04G004740
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionMicrofibrillar-associated protein
LocationCG_Chr04 : 17825873 .. 17828489 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAATGATTAAATTTCAAGAATACTTTCCTCTCGTCTTCCCCTTCACCGATTCCTCAAAAATCTCACAGCTTATCGATTTCAGAGGATCCGGACTGCTTTCTCTACCTTCGCTCCGCCATTTCTCCCTCCCACCGTATTCTTCGTTGCTCGGATCAGATTTCTTCAACTCCATCTAGGGTTTCATATTCTTAAGTCTAGAACTCCATTATAGCCAAGTTCAGACGAAACTTTGAAGGTAATTCTATCTTTGCACTATTTCCGGCTGAAATATCCACCATTTCCAACGTGCGTTTAATTGAGAAATGCTCTGAATTTAAATGGGCTTTTATCCCTGTTCCTGTTTCGAGTTTTCGTGTATGAGAACGAAGGTTGAGATGTCGTTGTGATTGTATAGGGTTTTGTTTATTTTTTGTTGAATCTATTGCGTGATTATTAACTATTTTGATTGTTTTTTTTTTCTTCATATTTTAATTTATTTTTGGTTGGAGATGTTTGTTTGCTGGGATTTATTGATTCATAAAGTGTCTGTCTTAATTTGTGACGCTCGGAGACTTGTTTAACTGCACACTGCACTTATTTCATTATTGGGTTTCAGGTTTAGCGTCATCAAATTGGGCACTTTAAGTTGAAAAGGAGAAGGTTCTTATTTTCGGTGTATATTTTGCATCTGGAAATTTAGCTGATTAACAATGTCGGTCACGGCAGGGGTTAGTGATACTGTAATTGCTGTTAGGGATAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCTGAGTGGGCGGATGATGCTGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCACTTGAGAAAGCATTTCCAAGTCAGGAAGATTCAAATCTATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTAGCTGAGAGTAGGATAGATAATCGGGAGGAGATCAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACGCGGAAGCAGGAGGGTTTAGATGCAGAGGAAGAGGATGAGGAGGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTCCCTGAAGAAGAAGAGGAGGAGGAACCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGATTCAGAAGATGAACCTACTGGGATAACAATGGTGAAGCCAATATTTGTTCCGAAATCAGAGAGAGAAACTATTGCCGAACGGGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAGGAATTGAGAAAACGGCGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCAAATATTGCAGATGTGGACACTGATGATGAAATTAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGGGAGATTGCTAGGATCAAGAGGGATAGAGAACTCCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACTGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCAAAAACTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCGTTCTTCCAGGAAGATGCTGATGATAATGCCGGAACTGCTGGATCTGATTCTATTTTCCATCGTGATTTCTCTTCTCCAACTGGAGAAGATAAGATGGACAAGACAATATTGCCGAAGGTTATGCAGGTGAAGCACTTTGGACGTAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCGTAAGTTAATTTAATTTTGATTGCCTTCTGCTTCTTTTTCCTATTGTAAATTATTGCAATGTCAATCTTGATTTCTTAATTGTGTAAGAACCTAGATTACATATAGTATGATCCATAGAAAAAGCTTCTATTGCATTGTCTACTTATGATTGAGAGTGTGTTAATCAGAGAATGAATGGAAAGAGTTTTAGTATGAGCATGAGATTTATGGACGTACTGAGCAGCCAGAGCTTGATATTGACATTGAGACTTTTAGTTTGTTTATATATCTACAGGAGAATACGAAGATATGACTAGAACAATGCCCTGTTTCCACTTCTCTGTTTGTTTGCTTTCTTTGCCCGTTTTTTCGTTTGTAGTGTGGTTGCCATGTGGAAGTTATTCTGCAGTGTGAGTGACGTCAATGGTTACTGTCATGCAGTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCCGGAATGAATGCACCAATATTGAAACCCAAAGGAAGCAAGAAGTTAAAGGATTGGGAATCTCGTTGAGTTTTAATGGCATAAAGCACGGTTAGAAAGCTTTTCTAGTTATCCCTTTTCATCGTTCAATTAGAAAAAAAATGCTTGCATAATGGGTTACCAGGATAGTTGGGCACCCAGTTTTCCTTCTCTGTCAAACTAAAGATGTACCTGGATTTTGTATGTTGATGTAATATAATTACTCATTAGTCATCACGTTCCCCGTTCCTG

mRNA sequence

AAAAAAATGATTAAATTTCAAGAATACTTTCCTCTCGTCTTCCCCTTCACCGATTCCTCAAAAATCTCACAGCTTATCGATTTCAGAGGATCCGGACTGCTTTCTCTACCTTCGCTCCGCCATTTCTCCCTCCCACCGTATTCTTCGTTGCTCGGATCAGATTTCTTCAACTCCATCTAGGGTTTCATATTCTTAAGTCTAGAACTCCATTATAGCCAAGTTCAGACGAAACTTTGAAGGTTTAGCGTCATCAAATTGGGCACTTTAAGTTGAAAAGGAGAAGGTTCTTATTTTCGGTGTATATTTTGCATCTGGAAATTTAGCTGATTAACAATGTCGGTCACGGCAGGGGTTAGTGATACTGTAATTGCTGTTAGGGATAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCTGAGTGGGCGGATGATGCTGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCACTTGAGAAAGCATTTCCAAGTCAGGAAGATTCAAATCTATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTAGCTGAGAGTAGGATAGATAATCGGGAGGAGATCAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACGCGGAAGCAGGAGGGTTTAGATGCAGAGGAAGAGGATGAGGAGGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTCCCTGAAGAAGAAGAGGAGGAGGAACCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGATTCAGAAGATGAACCTACTGGGATAACAATGGTGAAGCCAATATTTGTTCCGAAATCAGAGAGAGAAACTATTGCCGAACGGGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAGGAATTGAGAAAACGGCGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCAAATATTGCAGATGTGGACACTGATGATGAAATTAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGGGAGATTGCTAGGATCAAGAGGGATAGAGAACTCCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACTGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCAAAAACTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCGTTCTTCCAGGAAGATGCTGATGATAATGCCGGAACTGCTGGATCTGATTCTATTTTCCATCGTGATTTCTCTTCTCCAACTGGAGAAGATAAGATGGACAAGACAATATTGCCGAAGGTTATGCAGGTGAAGCACTTTGGACGTAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCCGGAATGAATGCACCAATATTGAAACCCAAAGGAAGCAAGAAGTTAAAGGATTGGGAATCTCGTTGAGTTTTAATGGCATAAAGCACGGTTAGAAAGCTTTTCTAGTTATCCCTTTTCATCGTTCAATTAGAAAAAAAATGCTTGCATAATGGGTTACCAGGATAGTTGGGCACCCAGTTTTCCTTCTCTGTCAAACTAAAGATGTACCTGGATTTTGTATGTTGATGTAATATAATTACTCATTAGTCATCACGTTCCCCGTTCCTG

Coding sequence (CDS)

ATGTCGGTCACGGCAGGGGTTAGTGATACTGTAATTGCTGTTAGGGATAAACTTAGAGGTAAAATTGGACAAACAAAAGTTAAGAGGTACTGGCCTGGAAAGGCTCCTGAGTGGGCGGATGATGCTGATGAAGATGGCGATATTAGGATGGCCAGGGCAGCAGCACTTGAGAAAGCATTTCCAAGTCAGGAAGATTCAAATCTATCTAGGAAGGATGACCCTAGGTTGCGCCGTCTAGCTGAGAGTAGGATAGATAATCGGGAGGAGATCAGAGCTGATCATCGACGTATTCGCCAAGCTGAGATTGTTTCAACCATTGAAGAGGAAACGCGGAAGCAGGAGGGTTTAGATGCAGAGGAAGAGGATGAGGAGGCTTTGGAGGAAAGAAGAAGAAGAATCAAGGAAAAGTTGCGACAAAGGGAGCTAGAAGAAGCTGCATTCCCTGAAGAAGAAGAGGAGGAGGAACCAGAGGAAGAGGAAGAAGAGGAGTCTGAGTATGAAACTGATTCAGAAGATGAACCTACTGGGATAACAATGGTGAAGCCAATATTTGTTCCGAAATCAGAGAGAGAAACTATTGCCGAACGGGAGCGTATTGAAGAGGAGGAAAGGTCTCTTGAGGAATTGAGAAAACGGCGATTGGAGGAAAGGAAGGCAGAGACAAAGCACATTGTGGTTGAGGAGATTAGAAAGGATGAAGAGATCCAGAAGAATTTGGAAATGGAGGCAAATATTGCAGATGTGGACACTGATGATGAAATTAATGAAGCAGAAGAATATGAGGCTTGGAAGGTTAGGGAGATTGCTAGGATCAAGAGGGATAGAGAACTCCGAGATGCAATGTTGAAGGAGAGGGAGGAGATTGAGAAGGTGAGAAATATGACTGAGGAAGAGAGGAGAGAATGGGAGAGGAAGAATCCAAAAACTGCTCCACCACCTAAGCAGAAGTGGAAGTTTATGCAGAAATATTACCACAAGGGTGCGTTCTTCCAGGAAGATGCTGATGATAATGCCGGAACTGCTGGATCTGATTCTATTTTCCATCGTGATTTCTCTTCTCCAACTGGAGAAGATAAGATGGACAAGACAATATTGCCGAAGGTTATGCAGGTGAAGCACTTTGGACGTAGTGGGAGAACAAAATGGACGCATCTTGTCAATGAAGATACAACCGACTGGAACAACCCTTGGACCTATAACGATCCTCTTCGGGCAAAATACAATGCAAAAATGGCCGGAATGAATGCACCAATATTGAAACCCAAAGGAAGCAAGAAGTTAAAGGATTGGGAATCTCGTTGA

Protein sequence

MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAFPSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEEEDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERREWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILKPKGSKKLKDWESR
BLAST of ClCG04G004740 vs. Swiss-Prot
Match: MFAP1_HUMAN (Microfibrillar-associated protein 1 OS=Homo sapiens GN=MFAP1 PE=1 SV=2)

HSP 1 Score: 299.7 bits (766), Expect = 5.3e-80
Identity = 185/434 (42.63%), Postives = 266/434 (61.29%), Query Frame = 1

Query: 12  IAVRDKLRGKIG--QTKVKRYWPGKAPEWA---DDADEDGDIRMARAAALEKAFPSQEDS 71
           + VR++ +G+I   + KVKRY  GK P++A      +ED + +  + A  ++A P +++ 
Sbjct: 20  VPVRNE-KGEISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEE 79

Query: 72  NLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIV----STIEEETRKQEGLDAEEE- 131
           + S   DPRLRRL     ++ EE  A HR+I + E+V    S +E +  + E  D+ EE 
Sbjct: 80  DSS--SDPRLRRLQNRISEDVEERLARHRKIVEPEVVGESDSEVEGDAWRMEREDSSEEE 139

Query: 132 ----DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYE--TDSEDEPT 191
               D+E +E RR  ++++ ++R+ EE    E E+E    EE E ESEYE  TDSEDE  
Sbjct: 140 EEEIDDEEIERRRGMMRQRAQERKNEEMEVMEVEDEGRSGEESESESEYEEYTDSEDEME 199

Query: 192 GITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEI 251
               +KP+F+ K +R T+ ERE    +++ LE+  KR  EER+  T  IV EE +K  E+
Sbjct: 200 --PRLKPVFIRKKDRVTVQEREAEALKQKELEQEAKRMAEERRKYTLKIVEEETKK--EL 259

Query: 252 QKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMT 311
           ++N    A +  ++TDDE N+ EEYEAWKVRE+ RIKRDRE R+A+ KE+ EIE++RN+T
Sbjct: 260 EENKRSLAALDALNTDDE-NDEEEYEAWKVRELKRIKRDREDREALEKEKAEIERMRNLT 319

Query: 312 EEERREWERKNPK--TAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSS 371
           EEERR   R N K  T    K K+KF+QKYYH+GAFF ++          + ++ RDFS+
Sbjct: 320 EEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEEVYKRDFSA 379

Query: 372 PTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAK-YNAKMA 427
           PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++ W        K +  K A
Sbjct: 380 PTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNTKFFKQKAA 436

BLAST of ClCG04G004740 vs. Swiss-Prot
Match: MFAP1_BOVIN (Microfibrillar-associated protein 1 OS=Bos taurus GN=MFAP1 PE=2 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 1.2e-79
Identity = 185/434 (42.63%), Postives = 265/434 (61.06%), Query Frame = 1

Query: 12  IAVRDKLRGKIG--QTKVKRYWPGKAPEWA---DDADEDGDIRMARAAALEKAFPSQEDS 71
           + VR++ +G+I   + KVKRY  GK P++A      +ED + +  + A  ++A P +++ 
Sbjct: 20  VPVRNE-KGEISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEE 79

Query: 72  NLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIV----STIEEETRKQEGLDAEEE- 131
           + S   DPRLRRL     ++ EE  A HR+I + E+V    S +E +  + E  D+ EE 
Sbjct: 80  DSS--SDPRLRRLQNRISEDVEERLARHRKIVEPEVVGESDSEVEGDPWRMEREDSSEEE 139

Query: 132 ----DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYE--TDSEDEPT 191
               DEE +E RR  ++++ ++R+ EE    E E+E    EE E ESEYE  TDSEDE  
Sbjct: 140 EEEIDEEEIERRRGMMRQRAQERKNEELEVMEVEDEGRSGEESESESEYEEYTDSEDEME 199

Query: 192 GITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEI 251
               +KP+F+ K +R T+ ERE    +++ LE+  K   EER+  T  IV EE +K  E+
Sbjct: 200 --PRLKPVFIRKKDRVTVQEREAEALKQKELEQEAKHMAEERRKYTLKIVEEETKK--EL 259

Query: 252 QKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMT 311
           ++N    A +  ++TDDE N+ EEYEAWKVRE+ RIKRDRE R+A+ KE+ EIE++RN+T
Sbjct: 260 EENKRSLAALDALNTDDE-NDEEEYEAWKVRELKRIKRDREDREALEKEKAEIERMRNLT 319

Query: 312 EEERREWERKNPK--TAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSS 371
           EEERR   R N K  T    K K+KF+QKYYH+GAFF ++          + ++ RDFS+
Sbjct: 320 EEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEEVYKRDFSA 379

Query: 372 PTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAK-YNAKMA 427
           PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++ W        K +  K A
Sbjct: 380 PTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNTKFFKQKAA 436

BLAST of ClCG04G004740 vs. Swiss-Prot
Match: MFAP1_MOUSE (Microfibrillar-associated protein 1 OS=Mus musculus GN=Mfap1 PE=1 SV=1)

HSP 1 Score: 297.4 bits (760), Expect = 2.6e-79
Identity = 184/434 (42.40%), Postives = 266/434 (61.29%), Query Frame = 1

Query: 12  IAVRDKLRGKIG--QTKVKRYWPGKAPEWA---DDADEDGDIRMARAAALEKAFPSQEDS 71
           + VR++ +G+I   + KVKRY  GK P++A      +ED + +  + A  ++A P +++ 
Sbjct: 20  VPVRNE-KGEISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEE 79

Query: 72  NLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIV----STIEEETRKQEGLDAEEE- 131
           + S   DPRLRRL     ++ EE  A HR+I + E+V    S +E +  + E  D+ EE 
Sbjct: 80  DSS--SDPRLRRLQNRISEDVEERLARHRKIVEPEVVGESDSEVEGDAWRLEREDSSEEE 139

Query: 132 ----DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYE--TDSEDEPT 191
               D+E +E RR  ++++ ++R+ EE    E E+E    EE E ESEYE  TDSEDE  
Sbjct: 140 EEEIDDEEIERRRGMMRQRAQERKNEEMEVMEVEDEGRSGEESESESEYEEYTDSEDEME 199

Query: 192 GITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEI 251
               +KP+F+ K +R T+ ERE    +++ LE+  KR  EER+  T  IV EE +K  E+
Sbjct: 200 --PRLKPVFIRKKDRVTVQEREAEALKQKELEQEAKRMAEERRKYTLKIVEEETKK--EL 259

Query: 252 QKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMT 311
           ++N    A +  ++TDDE N+ EEYEAWKVRE+ RIKR+RE R+A+ KE+ EIE++RN+T
Sbjct: 260 EENKRSLAALDALNTDDE-NDEEEYEAWKVRELKRIKREREDREALEKEKAEIERMRNLT 319

Query: 312 EEERREWERKNPK--TAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSS 371
           EEERR   R N K  T    K K+KF+QKYYH+GAFF ++          + ++ RDFS+
Sbjct: 320 EEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMDE---------DEEVYKRDFSA 379

Query: 372 PTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAK-YNAKMA 427
           PT ED  +KTILPKVMQVK+FGRSGRTK+THLV++DTT +++ W        K +  K A
Sbjct: 380 PTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWGQESAQNTKFFKQKAA 436

BLAST of ClCG04G004740 vs. Swiss-Prot
Match: MFAP1_CHICK (Microfibrillar-associated protein 1 OS=Gallus gallus GN=MFAP1 PE=2 SV=1)

HSP 1 Score: 286.6 bits (732), Expect = 4.6e-76
Identity = 173/400 (43.25%), Postives = 248/400 (62.00%), Query Frame = 1

Query: 43  DEDGDIRMARAAALEKAFPSQEDSNLSRKDDPRLRRLAESRI-DNREEIRADHRRIRQAE 102
           +ED + +  + A  ++  P +++  ++  +DPRLRRL ++RI ++ EE  A HR+I + E
Sbjct: 56  EEDEEFQFIKKAKEQEVEPEEQEEEVA--NDPRLRRLLQNRITEDVEERLARHRKIVEPE 115

Query: 103 IVS-----TIEEETRKQEGLDAEEEDEEALEER-----RRRIKEKLRQRELEEAAFPEEE 162
           +VS      +E E  + E  D  EE+EE +++      R  ++++ ++R+ EE    E E
Sbjct: 116 VVSGESDSEVEGEAWRVEREDTSEEEEEEIDDEEIERWRGMMRQRAQERKTEELEVMELE 175

Query: 163 EEEEPEEEEEEESEYE--TDSEDEPTGITMVKPIFVPKSERETIAERERIEEEERSLEEL 222
           +E    EE E ESEYE  TDSEDE      +KP+F+ K +R T+ ERE    +++ LE+ 
Sbjct: 176 DEGRSGEESELESEYEEYTDSEDEME--PRLKPVFIRKKDRITVQEREAEALKQKELEQE 235

Query: 223 RKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWKVREIA 282
            KR  EER+  T  IV EE +K  E+++N    A +  +DTDDE N+ EEYEAWKVRE+ 
Sbjct: 236 AKRLAEERRKYTLKIVEEEAKK--ELEENKRSLAALDALDTDDE-NDEEEYEAWKVRELK 295

Query: 283 RIKRDRELRDAMLKEREEIEKVRNMTEEERREWERKNPK--TAPPPKQKWKFMQKYYHKG 342
           RIKRDRE R+AM KE+ EIE++RN+TEEERR   R N K  T    K K+KF+QKYYH+G
Sbjct: 296 RIKRDREEREAMEKEKAEIERMRNLTEEERRAELRANGKVVTNKAVKGKYKFLQKYYHRG 355

Query: 343 AFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKWTHLVN 402
           AFF ++          + ++ RDFS+PT ED  +KTILPKVMQVK+FGRSGRTK+THLV+
Sbjct: 356 AFFMDE---------DEEVYKRDFSAPTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVD 415

Query: 403 EDTTDWNNPWTYNDPLRAK-YNAKMAGMNAPILKPKGSKK 427
           +DTT +++ W        K +  K AG+     +P   K+
Sbjct: 416 QDTTSFDSAWGQESAQNTKFFKQKAAGVRDVFERPSAKKR 439

BLAST of ClCG04G004740 vs. Swiss-Prot
Match: MFAP1_DICDI (Protein MFAP1 homolog OS=Dictyostelium discoideum GN=mfap1 PE=3 SV=1)

HSP 1 Score: 199.9 bits (507), Expect = 5.7e-50
Identity = 137/446 (30.72%), Postives = 226/446 (50.67%), Query Frame = 1

Query: 14  VRDKLRGKIGQ----------TKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAFPSQ 73
           +RDKL+  +            TKV RY  G+ P++A+ A++D D   +          +Q
Sbjct: 7   IRDKLKNNLESSGGHSVVTSGTKVIRYRAGQRPDYAE-AEDDQDSHFSN-------IQNQ 66

Query: 74  EDSNLSRKDDPRLRRLAE--------SRIDNREEIRADH------------------RRI 133
           +    +  +DPRL R             ++ R+  R  H                  RRI
Sbjct: 67  KSIKEAETNDPRLARFKNRSSNQDEPQSVEERKASRRRHHDDDNDNDTTTTTTTTTSRRI 126

Query: 134 RQAEIVSTIEEETRKQEG-LDAEEEDEEALEERRRRIKEK-LRQRELEEAAFPEEEEEEE 193
           ++ EI+   ++           +  D++  ++RRRR KE+ L+++E EE    E EE+++
Sbjct: 127 QKTEIIKEDDDNNNNNNNDTKKDHNDDDEDDDRRRRAKERYLKKKEEEEQKQKELEEKQQ 186

Query: 194 P------EEEEEEESEYETDSEDEPTGI-----TMVKPIFVPKSERETIAERERIEEEER 253
           P      E EEE  SEYETDSE++          + +P F+ K +R TI   E+ E+EE+
Sbjct: 187 PFKDIEGESEEEGSSEYETDSEEDDEDEYWDQPPIFRPTFIKKDDRGTIKTDEQWEKEEQ 246

Query: 254 SLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWK 313
             +   +R  E+RK E    + +E+ +D + Q+  E+E    +   DDE  +  +   W 
Sbjct: 247 EQQAQLEREKEQRKIEAHRKLKDELDRDRKEQEAKELEQKEEEEYDDDEDQDGSKKLLWI 306

Query: 314 VREIARIKRDRELRDAMLKEREEIEKVRNMTEEE--RREWERKNPKTAPPPKQKWKFMQK 373
            RE+ R++ +   R     E++E  + R MT+++  + +  R         K++ KF+Q+
Sbjct: 307 QRELERVRLEIHTRLLAEFEKKEFARRRAMTDDQILKEDPSRSRTNIDNSQKKQLKFLQR 366

Query: 374 YYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKMDKTILPKVMQVKHFGRSGRTKW 409
            YH+GAFFQ+D          + I ++DFS+PTGEDK ++ +LPKVMQVK+FG++GRTK+
Sbjct: 367 DYHRGAFFQDD----------EYIKNKDFSAPTGEDKFNRELLPKVMQVKNFGKAGRTKY 426

BLAST of ClCG04G004740 vs. TrEMBL
Match: A0A067FS77_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013977mg PE=4 SV=1)

HSP 1 Score: 762.3 bits (1967), Expect = 3.2e-217
Identity = 373/433 (86.14%), Postives = 402/433 (92.84%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADD +ED DIRM+RAAAL+KAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDIEEDNDIRMSRAAALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P +EDS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEETR QEGLD EE
Sbjct: 61  PRKEDSDIGRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEETR-QEGLDMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRI+EKL QRE EEAA   EEEEE  EEEEEEESEYETDSE+E  GI M+
Sbjct: 121 EDEEALEERRRRIREKLLQREQEEAALLPEEEEEAVEEEEEEESEYETDSEEEQMGIAML 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FVPKSER+TIAERER+E EE++LEEL KR+LEERK ETK I+VEE+RKDEEIQKNLE
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEELAKRKLEERKVETKKILVEEVRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEVNEAEEYEAWKVREIARIKRDREAREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWER+NPK APPPKQKW+FMQKYYHKGAFFQ DA D A T  +D I+HRDFS+PTGEDKM
Sbjct: 301 EWERRNPKPAPPPKQKWRFMQKYYHKGAFFQSDAADTAATVRTDEIYHRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWE+R
Sbjct: 421 PKGSKKLKDWETR 432

BLAST of ClCG04G004740 vs. TrEMBL
Match: V4T8Z0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020229mg PE=4 SV=1)

HSP 1 Score: 760.8 bits (1963), Expect = 9.3e-217
Identity = 372/433 (85.91%), Postives = 402/433 (92.84%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADD +ED DIRM+RAAAL+KAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDIEEDNDIRMSRAAALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P +EDS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEETR QEGLD EE
Sbjct: 61  PRKEDSDIGRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEETR-QEGLDMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRI+EKL QRE EEAA   EEEEE  EEEEEEESEYETDSE+E  GI M+
Sbjct: 121 EDEEALEERRRRIREKLLQREQEEAALLPEEEEEAVEEEEEEESEYETDSEEEQMGIAML 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FVPKSER+TIAERER+E EE++LEEL KR+LEERK ETK I+VEE+RKDEEIQKNLE
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEELAKRKLEERKVETKKILVEEVRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEVNEAEEYEAWKVREIARIKRDREAREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWER+NPK APPPKQKW+FMQKYYHKGAFFQ DA D A T  +D I+HRDFS+PTGEDKM
Sbjct: 301 EWERRNPKPAPPPKQKWRFMQKYYHKGAFFQSDAADTAATVRTDEIYHRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           P+GSKKLKDWE+R
Sbjct: 421 PQGSKKLKDWETR 432

BLAST of ClCG04G004740 vs. TrEMBL
Match: A0A061EZF4_THECC (Microfibrillar-associated protein 1 OS=Theobroma cacao GN=TCM_025838 PE=4 SV=1)

HSP 1 Score: 758.1 bits (1956), Expect = 6.0e-216
Identity = 366/434 (84.33%), Postives = 403/434 (92.86%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADDADE+GDIRMARA ALEKAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDADEEGDIRMARAVALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           PS++DS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEE R+ EG++AEE
Sbjct: 61  PSRDDSDVVRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEENRRNEGVEAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAF-PEEEEEEEPEEEEEEESEYETDSEDEPTGITM 180
           EDE+ALEERRRRI+EKL QRE EE A   EEEEEEE EEEEEEESEYETDSE+E TGI M
Sbjct: 121 EDEDALEERRRRIREKLLQREQEETALLEEEEEEEEVEEEEEEESEYETDSEEEHTGIAM 180

Query: 181 VKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNL 240
           VKP+FVPKSER+TIAERER+E EER++EE  KR+LE RK ET+ IVVE+IR+DEEIQKN+
Sbjct: 181 VKPVFVPKSERDTIAERERLEAEERAIEEAEKRKLEHRKVETRQIVVEKIREDEEIQKNM 240

Query: 241 EMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEER 300
           E+EAN+ADVDTDDE+NEAEEYEAWK REIARIKRDRE R+AM+KE+EEIEKVRNMTEEER
Sbjct: 241 ELEANVADVDTDDEVNEAEEYEAWKAREIARIKRDREEREAMIKEKEEIEKVRNMTEEER 300

Query: 301 REWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDK 360
           REWERKNPK APPPKQKWKFMQKYYHKGAFFQ +ADD A   G+D+I+HRDFS PTGEDK
Sbjct: 301 REWERKNPKPAPPPKQKWKFMQKYYHKGAFFQAEADDPAAAVGADNIYHRDFSGPTGEDK 360

Query: 361 MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIL 420
           MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMA +NAP+ 
Sbjct: 361 MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAAVNAPVA 420

Query: 421 KPKGSKKLKDWESR 434
           KPKGSKKLKDWES+
Sbjct: 421 KPKGSKKLKDWESK 434

BLAST of ClCG04G004740 vs. TrEMBL
Match: A0A059AKH0_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_J03113 PE=4 SV=1)

HSP 1 Score: 756.9 bits (1953), Expect = 1.3e-215
Identity = 372/433 (85.91%), Postives = 400/433 (92.38%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGK PEWAD+ADEDGDIRMARA AL+KAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKVPEWADEADEDGDIRMARAVALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P+ E S++ +KDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVSTIEEE R+QEGL+AEE
Sbjct: 61  PTYEGSDIGKKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTIEEENRRQEGLEAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           ED EALEERRR+I+EKL  RE EEAA   EEEEEE EEEEEEESEYETDSE+E  GI MV
Sbjct: 121 EDAEALEERRRKIREKLLLREQEEAALLPEEEEEEEEEEEEEESEYETDSEEETKGIAMV 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FV KSER+TIAER+R+EEEER++EEL KRR EERKAETK IVVEEIRKDEEIQKNLE
Sbjct: 181 KPVFVVKSERDTIAERQRLEEEERAIEELMKRRQEERKAETKQIVVEEIRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWK REIARIKRDRE R+AMLK +EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDELNEAEEYEAWKAREIARIKRDREDREAMLKAKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWERKNPK +  PKQKW+FMQKYYHKGAFFQ + D++AGTAGSD I+ RDFS+PTGEDKM
Sbjct: 301 EWERKNPKPSSAPKQKWRFMQKYYHKGAFFQSEVDEHAGTAGSDYIYGRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKM GMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMGGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of ClCG04G004740 vs. TrEMBL
Match: A0A067KU23_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02667 PE=4 SV=1)

HSP 1 Score: 756.5 bits (1952), Expect = 1.8e-215
Identity = 367/433 (84.76%), Postives = 404/433 (93.30%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSD  +AVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDI+MARA ALEKAF
Sbjct: 1   MSVTAGVSDVALAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIKMARADALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P++EDS+++RKDDPRLRRLAES+IDNR+E+RADHRRIRQAEI++T EEET++QE  D EE
Sbjct: 61  PTKEDSDIARKDDPRLRRLAESKIDNRDEVRADHRRIRQAEIIATEEEETQRQEWADMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           E+EEALEERRRRIKEK R RE EEAA P EEEEEEPEEEEEEESEYETDS++E TG+ MV
Sbjct: 121 ENEEALEERRRRIKEKSRLREQEEAALPAEEEEEEPEEEEEEESEYETDSDEEMTGMAMV 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KPIFVPKSERETIAERER+E EE++LEE  KR+LEERK ETK I+VEEI+KDE IQKNLE
Sbjct: 181 KPIFVPKSERETIAERERLEAEEQALEEKAKRKLEERKVETKQILVEEIQKDELIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEA+IADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEASIADVDTDDEVNEAEEYEAWKVREIARIKRDREDREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWERKNPK APPPKQKW+FMQKYYHKGAFFQ ++DD A TAGSD I++RDFS+PTGEDKM
Sbjct: 301 EWERKNPKPAPPPKQKWRFMQKYYHKGAFFQNESDDRAATAGSDDIYNRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILK 420
           DK+ILPKVMQVKHFGRSGRTKWTHLVNEDTTDWN PWTYND LRAKYNAKMAGMNAPI K
Sbjct: 361 DKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNTPWTYNDQLRAKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDW++R
Sbjct: 421 PKGSKKLKDWDTR 433

BLAST of ClCG04G004740 vs. TAIR10
Match: AT4G08580.1 (AT4G08580.1 microfibrillar-associated protein-related)

HSP 1 Score: 658.7 bits (1698), Expect = 2.5e-189
Identity = 316/436 (72.48%), Postives = 381/436 (87.39%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVS++ IAVR+KL+G IGQTKV+RYWPGKAPEWA++A+ED D+RM + + L++AF
Sbjct: 1   MSVTAGVSESAIAVREKLKGGIGQTKVRRYWPGKAPEWAEEAEEDDDVRMQKFSVLDRAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P  +D  ++RKDDPRLRRLA+++++NR+E+RADHRRIRQAEI+ST EEE+R QE  D E+
Sbjct: 61  PKNDDLGVARKDDPRLRRLAQTKVENRDEVRADHRRIRQAEIISTEEEESRNQENRD-ED 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFP--EEEEEEEPEEEEEEESEYETDSEDEPTGIT 180
           +DE+ALEERRRRIKEK  +R  EEAA    EEE+E + EEEEEEESEYETDSED+  GI 
Sbjct: 121 DDEDALEERRRRIKEKNLRRAQEEAALLPLEEEDEIQEEEEEEEESEYETDSEDDMPGIA 180

Query: 181 MVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKN 240
           ++KP+FVPK+ER+TIAERER+E EE +LEEL KR+LE+RK ETK IVVEE+RKDEEI+KN
Sbjct: 181 LIKPVFVPKAERDTIAERERLEAEEEALEELAKRKLEQRKIETKQIVVEEVRKDEEIRKN 240

Query: 241 LEME-ANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEE 300
           + +E ANI DV+TDDE+NEAEEYE WK REI RIKR+R+ R+AML+EREEIEK+RNMTE+
Sbjct: 241 ILLEEANIGDVETDDELNEAEEYEVWKTREIGRIKRERDAREAMLREREEIEKLRNMTEQ 300

Query: 301 ERREWERKNPK-TAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTG 360
           ERR+WERKNPK ++  PK+KW FMQKYYHKGAFFQ D DD AG+AG+D IF RDFS+PTG
Sbjct: 301 ERRDWERKNPKPSSAQPKKKWNFMQKYYHKGAFFQADPDDEAGSAGTDGIFQRDFSAPTG 360

Query: 361 EDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNA 420
           ED++DK+ILPKVMQVKHFGRSGRTKWTHLVNEDTTDW+NPWT NDPLR KYN KMAGM+A
Sbjct: 361 EDRLDKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWSNPWTSNDPLREKYNKKMAGMDA 420

Query: 421 PILKPKGSKKLKDWES 433
           PI KPKGSKK+KDWE+
Sbjct: 421 PIAKPKGSKKMKDWET 435

BLAST of ClCG04G004740 vs. TAIR10
Match: AT5G17900.1 (AT5G17900.1 microfibrillar-associated protein-related)

HSP 1 Score: 657.1 bits (1694), Expect = 7.3e-189
Identity = 316/436 (72.48%), Postives = 379/436 (86.93%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVS++ IAVR+KL+G IGQTKV+RYWPGKAPEWA++A+ED D+RM + + L++AF
Sbjct: 1   MSVTAGVSESAIAVREKLKGGIGQTKVRRYWPGKAPEWAEEAEEDDDVRMQKVSVLDRAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P  +D  ++RKDDPRLRRLA+++++NR+E+RADHRRIRQAEI+ T EEE+R QE  D E+
Sbjct: 61  PKNDDLGVARKDDPRLRRLAKTKVENRDEVRADHRRIRQAEIIYTEEEESRNQENRD-ED 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFP--EEEEEEEPEEEEEEESEYETDSEDEPTGIT 180
           +DE+ALEERRRRI+EK  +R  EEAA    EEE+E + EEEEEEESEYETDSED+  GI 
Sbjct: 121 DDEDALEERRRRIREKNLRRAQEEAALLPLEEEDEIQEEEEEEEESEYETDSEDDMPGIA 180

Query: 181 MVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKN 240
           M+KP+FVPK+ER+TIAERER+E EE +LEEL KR+LE+RK ETK IVVEE+RKDEEI+KN
Sbjct: 181 MIKPVFVPKAERDTIAERERLEAEEEALEELAKRKLEQRKLETKQIVVEEVRKDEEIRKN 240

Query: 241 LEME-ANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEE 300
           + +E ANI DV+TDDE+NEAEEYE WK REI RIKR+R+ R+AML+EREEIEK+RNMTE+
Sbjct: 241 ILLEEANIGDVETDDELNEAEEYEVWKTREIGRIKRERDAREAMLREREEIEKLRNMTEQ 300

Query: 301 ERREWERKNPKT-APPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTG 360
           ERR+WERKNPK  +  PK+KW FMQKYYHKGAFFQ D DD AG+AG+D IF RDFS+PTG
Sbjct: 301 ERRDWERKNPKPLSAQPKKKWNFMQKYYHKGAFFQADPDDEAGSAGTDGIFQRDFSAPTG 360

Query: 361 EDKMDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNA 420
           ED++DK+ILPKVMQVKHFGRSGRTKWTHLVNEDTTDW+NPWT NDPLR KYN KMAGM+A
Sbjct: 361 EDRLDKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWSNPWTSNDPLREKYNKKMAGMDA 420

Query: 421 PILKPKGSKKLKDWES 433
           PI KPKGSKK+KDWES
Sbjct: 421 PIAKPKGSKKMKDWES 435

BLAST of ClCG04G004740 vs. TAIR10
Match: AT2G18540.1 (AT2G18540.1 RmlC-like cupins superfamily protein)

HSP 1 Score: 103.2 bits (256), Expect = 4.1e-22
Identity = 86/272 (31.62%), Postives = 131/272 (48.16%), Query Frame = 1

Query: 45  DGDIRMARAAALEKAFPSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVS 104
           +G++        E+    +E+    RK++   R+  E++   REE  A  R   + E   
Sbjct: 422 EGELSKLMREIEERKRREEEEIERRRKEEEEARKREEAK--RREEEEAKRREEEETERKK 481

Query: 105 TIEEETRKQEGLDAEEEDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEES 164
             EEE RK+E    EE   E  E +RR  + K R+ E E+A   EEE E+E E  ++ E 
Sbjct: 482 REEEEARKRE----EERKREEEEAKRREEERKKREEEAEQARKREEEREKEEEMAKKREE 541

Query: 165 EYETDSEDEPTGITMVKPIFVPKSERETIAERERIEEEERSL-EELRKRRLEERKAETKH 224
           E +    +E      V+     + ER+   E  R  EEER   EE+ KRR +ER+ + + 
Sbjct: 542 ERQRKEREE------VERKRREEQERKRREEEARKREEERKREEEMAKRREQERQRKERE 601

Query: 225 IVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDREL-----R 284
            V  +IR+++E ++  EM       + + +  E EE E  K  E AR KR+ E+      
Sbjct: 602 EVERKIREEQERKREEEMAKR---REQERQKKEREEMERKKREEEAR-KREEEMAKIREE 661

Query: 285 DAMLKEREEIEKVRNMTEEERREWERKNPKTA 311
           +   KERE++E+ R   E  RRE ERK  + A
Sbjct: 662 ERQRKEREDVERKRREEEAMRREEERKREEEA 677


HSP 2 Score: 88.2 bits (217), Expect = 1.4e-17
Identity = 72/258 (27.91%), Postives = 121/258 (46.90%), Query Frame = 1

Query: 63  QEDSNLSRKDDPRLRRLAESRIDNREEI--RADHRRIRQ--AEIVSTIEEETRKQEGLDA 122
           +E++   ++++   R+  E R    EE   R + R+ R+  AE     EEE  K+E +  
Sbjct: 472 EEETERKKREEEEARKREEERKREEEEAKRREEERKKREEEAEQARKREEEREKEEEMAK 531

Query: 123 EEEDEEALEER----RRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEP 182
           + E+E   +ER    R+R +E+ R+R  EEA   EEE + E E  +  E E +       
Sbjct: 532 KREEERQRKEREEVERKRREEQERKRREEEARKREEERKREEEMAKRREQERQ------- 591

Query: 183 TGITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEE 242
                       + ERE +  + R E+E +  EE+ KRR +ER+ + +   +E  +++EE
Sbjct: 592 ------------RKEREEVERKIREEQERKREEEMAKRREQERQKKERE-EMERKKREEE 651

Query: 243 IQKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNM 302
            +K  E  A I + +   +  E E+ E  +  E A  + +   R+    +R E E+ +  
Sbjct: 652 ARKREEEMAKIREEERQRK--EREDVERKRREEEAMRREEERKREEEAAKRAEEERRKKE 704

Query: 303 TEEERREWERKNPKTAPP 313
            EEE+R W    P+  PP
Sbjct: 712 EEEEKRRWP---PQPKPP 704

BLAST of ClCG04G004740 vs. TAIR10
Match: AT3G28770.1 (AT3G28770.1 Protein of unknown function (DUF1216))

HSP 1 Score: 82.0 bits (201), Expect = 9.7e-16
Identity = 69/317 (21.77%), Postives = 150/317 (47.32%), Query Frame = 1

Query: 12   IAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAFPSQEDSNLSR- 71
            I    K +GK  + K K        +  +D  E  +  + +    +K     E+S L   
Sbjct: 933  INTSSKQKGKDKKKKKKESKNSNMKKKEEDKKEYVNNELKKQEDNKKETTKSENSKLKEE 992

Query: 72   -KDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEEEDEEALEER 131
             KD+   +   +S   NRE+   + ++       S  +EE +K++    +++ EE   E 
Sbjct: 993  NKDNKEKKESEDSASKNREKKEYEEKK-------SKTKEEAKKEKKKSQDKKREEKDSEE 1052

Query: 132  RRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMVKPIFVPKSE 191
            R+  KEK   R+L+  A  +EEE +E +E E  +S+ + D ++     +M K     + +
Sbjct: 1053 RKSKKEKEESRDLK--AKKKEEETKEKKESENHKSKKKEDKKEHEDNKSMKKEEDKKEKK 1112

Query: 192  RETIAERERIEEEERSLEEL------RKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEA 251
            +   ++  + EE+++ +E+L      +K+  +  K +++H+ + +   D++ +K  E ++
Sbjct: 1113 KHEESKSRKKEEDKKDMEKLEDQNSNKKKEDKNEKKKSQHVKLVKKESDKKEKKENEEKS 1172

Query: 252  NIADVDTD-DEINEAEEYEAWKVREIARIKRDRELRDA---MLKEREEIEKVRNMTEEER 311
               ++++   + NE ++ E    ++  + K+++E++++    LK+ EE  K +   EE +
Sbjct: 1173 ETKEIESSKSQKNEVDKKEKKSSKDQQK-KKEKEMKESEEKKLKKNEEDRKKQTSVEENK 1232

Query: 312  REWERKNPKTAPPPKQK 317
            ++ E K  K  P   +K
Sbjct: 1233 KQKETKKEKNKPKDDKK 1239

BLAST of ClCG04G004740 vs. TAIR10
Match: AT2G22795.1 (AT2G22795.1 unknown protein)

HSP 1 Score: 75.1 bits (183), Expect = 1.2e-13
Identity = 68/248 (27.42%), Postives = 115/248 (46.37%), Query Frame = 1

Query: 62  SQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEEE 121
           SQE S +S +++ + +   ES   ++EE  +      +       EE + ++E +D E E
Sbjct: 416 SQETSEVSSQEESKGK---ESETKDKEESSSQEESKDRETETKEKEESSSQEETMDKETE 475

Query: 122 DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMVK 181
            +E +E   +  K + ++ E  E++F EE +E+E E +E+EES  +  +E++    T  K
Sbjct: 476 AKEKVESSSQE-KNEDKETEKIESSFLEETKEKEDETKEKEESSSQEKTEEKE---TETK 535

Query: 182 PIFVPKSERET-IAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 241
                 S+ ET   E E+IE+EE S +E  K    E K + +    EE ++ E   + +E
Sbjct: 536 DNEESSSQEETKDKENEKIEKEEASSQEESKENETETKEKEESSSQEETKEKE--NEKIE 595

Query: 242 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 301
            E +    +T ++ NE  E E    +E  + K             E  E V   TE E++
Sbjct: 596 KEESAPQEETKEKENEKIEKEESASQEETKEKETETKEKEESSSNESQENVN--TESEKK 652

Query: 302 EWERKNPK 309
           E   +N K
Sbjct: 656 EQVEENEK 652


HSP 2 Score: 72.0 bits (175), Expect = 1.0e-12
Identity = 64/273 (23.44%), Postives = 123/273 (45.05%), Query Frame = 1

Query: 62  SQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEEE 121
           SQE S +S +++ + +   ES   ++EE  +      +       EE + ++E +D E E
Sbjct: 416 SQETSEVSSQEESKGK---ESETKDKEESSSQEESKDRETETKEKEESSSQEETMDKETE 475

Query: 122 DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPE---------EEEEEESEYETDSED 181
            +E +E   +  K + ++ E  E++F EE +E+E E         +E+ EE E ET   +
Sbjct: 476 AKEKVESSSQE-KNEDKETEKIESSFLEETKEKEDETKEKEESSSQEKTEEKETETKDNE 535

Query: 182 EPTGITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKD 241
           E +     K     K E+E  + +E  +E E   +E  +   +E   ETK    E+I K+
Sbjct: 536 ESSSQEETKDKENEKIEKEEASSQEESKENETETKEKEESSSQE---ETKEKENEKIEKE 595

Query: 242 EEIQKNLEMEANIADVDTDDEINEAEEYE---AWKVREIARIKRDRELRDAMLKEREEIE 301
           E   +    E     ++ ++  ++ E  E     K +E +     +E  +   +++E++E
Sbjct: 596 ESAPQEETKEKENEKIEKEESASQEETKEKETETKEKEESSSNESQENVNTESEKKEQVE 655

Query: 302 KVRNMTEEERREWERKNPKTAPPPKQKWKFMQK 323
           +    T+E+  E  ++N  +    KQ  +  +K
Sbjct: 656 ENEKKTDEDTSESSKENSVSDTEQKQSEETSEK 681


HSP 3 Score: 67.0 bits (162), Expect = 3.2e-11
Identity = 52/199 (26.13%), Postives = 93/199 (46.73%), Query Frame = 1

Query: 62  SQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEEE 121
           ++E S+     D    ++ +    ++EE + +    ++ E  S+ +EET+++E    E+E
Sbjct: 530 NEESSSQEETKDKENEKIEKEEASSQEESKENETETKEKE-ESSSQEETKEKENEKIEKE 589

Query: 122 DEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMVK 181
           +    EE + +  EK+   E EE+A  EE +E+E E +E+EES      E+  T      
Sbjct: 590 ESAPQEETKEKENEKI---EKEESASQEETKEKETETKEKEESSSNESQENVNT------ 649

Query: 182 PIFVPKSERETIAERERIEEEERSLEELRKRRLEERK-AETKHIVVEEIRKDEEIQKNLE 241
                +SE+     +E++EE E+  +E      +E   ++T+    EE  + EE  KN E
Sbjct: 650 -----ESEK-----KEQVEENEKKTDEDTSESSKENSVSDTEQKQSEETSEKEESNKNGE 708

Query: 242 MEANIADVDTDDEINEAEE 260
            E      D+  + N  +E
Sbjct: 710 TEVTQEQSDSSSDTNLPQE 708


HSP 4 Score: 63.5 bits (153), Expect = 3.6e-10
Identity = 52/239 (21.76%), Postives = 103/239 (43.10%), Query Frame = 1

Query: 34  KAPEWADDADEDGDIRMARAAALEKAFPSQEDSNLSRKDDPRLRRLAESRI---DNREEI 93
           K    + + +ED +     ++ LE+    +ED    +++     +  E      DN E  
Sbjct: 476 KVESSSQEKNEDKETEKIESSFLEET-KEKEDETKEKEESSSQEKTEEKETETKDNEESS 535

Query: 94  RADHRRIRQAEIV----STIEEETRKQEGLDAEEEDEEALEERRRRIKEKLR------QR 153
             +  + ++ E +    ++ +EE+++ E    E+E+  + EE + +  EK+       Q 
Sbjct: 536 SQEETKDKENEKIEKEEASSQEESKENETETKEKEESSSQEETKEKENEKIEKEESAPQE 595

Query: 154 ELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMVKPIFVPKSERETIAERERIE 213
           E +E    + E+EE   +EE +E E ET  ++E +     + +     ++E + E E+  
Sbjct: 596 ETKEKENEKIEKEESASQEETKEKETETKEKEESSSNESQENVNTESEKKEQVEENEKKT 655

Query: 214 EEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEE 260
           +E+ S     +   E   ++T+    EE  + EE  KN E E      D+  + N  +E
Sbjct: 656 DEDTS-----ESSKENSVSDTEQKQSEETSEKEESNKNGETEVTQEQSDSSSDTNLPQE 708


HSP 5 Score: 62.0 bits (149), Expect = 1.0e-09
Identity = 69/310 (22.26%), Postives = 129/310 (41.61%), Query Frame = 1

Query: 12  IAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAFPSQEDSNLSRK 71
           ++ +++ +GK  +TK K        E +   +E  D         EK   S ++  + ++
Sbjct: 422 VSSQEESKGKESETKDK--------EESSSQEESKD---RETETKEKEESSSQEETMDKE 481

Query: 72  DDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEEEDEEALEERRR 131
            + +  ++  S  +  E+   +       E     E+ET+++E   ++E+ EE   E + 
Sbjct: 482 TEAK-EKVESSSQEKNEDKETEKIESSFLEETKEKEDETKEKEESSSQEKTEEKETETKD 541

Query: 132 RIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMVKPIFVPKSERE 191
             +E   Q E ++    + E+EE   +EE +E+E ET  ++E +     K     K E+E
Sbjct: 542 N-EESSSQEETKDKENEKIEKEEASSQEESKENETETKEKEESSSQEETKEKENEKIEKE 601

Query: 192 TIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLEMEANIADVDTD 251
             A +E  +E+E    E  +   +E   E +     E ++ EE   N   E    +V+T+
Sbjct: 602 ESAPQEETKEKENEKIEKEESASQEETKEKE----TETKEKEESSSNESQE----NVNTE 661

Query: 252 DEINE-AEEYEAWKVREIARIKRDRELRDAMLKEREEI----EKVRNMTEEERREWERKN 311
            E  E  EE E     + +   ++  + D   K+ EE     E  +N   E  +E    +
Sbjct: 662 SEKKEQVEENEKKTDEDTSESSKENSVSDTEQKQSEETSEKEESNKNGETEVTQEQSDSS 710

Query: 312 PKTAPPPKQK 317
             T  P + K
Sbjct: 722 SDTNLPQEVK 710


HSP 6 Score: 60.8 bits (146), Expect = 2.3e-09
Identity = 59/295 (20.00%), Postives = 123/295 (41.69%), Query Frame = 1

Query: 54  AALEKAFPSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIV---------- 113
           + ++   P+  D+  S  D+      +    D+ E I+++   + + E++          
Sbjct: 342 SVIKSVLPNTTDNGESSSDEKSTGSSSGHESDSLEGIKSEGESMEKNELLEKEFNDSNGE 401

Query: 114 -STIEEETRKQEGLDAEEEDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEE 173
            S   + T   +G   E  +  + EE + +  E    ++ EE++  EE ++ E E +E+E
Sbjct: 402 SSVTGKSTGSGDGGSQETSEVSSQEESKGKESET---KDKEESSSQEESKDRETETKEKE 461

Query: 174 ESEYETDSEDEPTGITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETK 233
           ES  + ++ D+ T              +E +    + + E++  E++    LEE K   K
Sbjct: 462 ESSSQEETMDKET------------EAKEKVESSSQEKNEDKETEKIESSFLEETK--EK 521

Query: 234 HIVVEEIRKDEEIQKNLEMEANIADVDTDDEINEAEEYEAWKV-REIARIKRDRELRDAM 293
               +E  +    +K  E E    D +      E ++ E  K+ +E A  + + +  +  
Sbjct: 522 EDETKEKEESSSQEKTEEKETETKDNEESSSQEETKDKENEKIEKEEASSQEESKENETE 581

Query: 294 LKEREEIEKVRNMTEEERREWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADD 337
            KE+EE        E+E  + E++  ++AP  + K K  +K   + +  QE+  +
Sbjct: 582 TKEKEESSSQEETKEKENEKIEKE--ESAPQEETKEKENEKIEKEESASQEETKE 617


HSP 7 Score: 47.0 bits (110), Expect = 3.5e-05
Identity = 57/230 (24.78%), Postives = 97/230 (42.17%), Query Frame = 1

Query: 62  SQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEEE 121
           S EDSN   ++      + ES +   EE R +     + E   T E E  +++     EE
Sbjct: 120 SNEDSNSEIEEKKDSGGVEESEV---EEKRDNGGGTEENEKSGTEESEVEERKDNGGTEE 179

Query: 122 DEEA-LEERRRRIKEKLRQRELEEAAFPEEEEEEE------PEEEEEEESEYETDSEDEP 181
           +E++  EE     ++     E  E +  EE E EE       EE E+  SE     E + 
Sbjct: 180 NEKSGTEESEVEERKDNGGTEENEKSGTEESEVEERKENGGTEENEKSGSEESEVEEKKD 239

Query: 182 TGITMVKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEE 241
            G T          E E   +++    EE  +EE ++ R  +   E+K   ++E    EE
Sbjct: 240 NGGTEESREKSGTEESEVEEKKDNGSSEESEVEEKKENRGIDESEESKEKDIDEKANIEE 299

Query: 242 I-QKNLEMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLK 284
             + N + +   ++V  + E   +E   + KV + + IK + E+ D+++K
Sbjct: 300 ARENNYKGDDASSEVVHESEEKTSESENSEKVEDKSGIKTE-EVEDSVIK 345

BLAST of ClCG04G004740 vs. NCBI nr
Match: gi|659082793|ref|XP_008442034.1| (PREDICTED: microfibrillar-associated protein 1 [Cucumis melo])

HSP 1 Score: 861.7 bits (2225), Expect = 5.5e-247
Identity = 428/433 (98.85%), Postives = 430/433 (99.31%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           PSQEDS+LSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETR+QEGLDAEE
Sbjct: 61  PSQEDSDLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRRQEGLDAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGI MV
Sbjct: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGIAMV 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE
Sbjct: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWERKNPK APPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKPAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of ClCG04G004740 vs. NCBI nr
Match: gi|778695198|ref|XP_011653945.1| (PREDICTED: microfibrillar-associated protein 1 [Cucumis sativus])

HSP 1 Score: 856.7 bits (2212), Expect = 1.8e-245
Identity = 424/433 (97.92%), Postives = 429/433 (99.08%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF
Sbjct: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P QEDS++SRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETR+QEGLDAEE
Sbjct: 61  PRQEDSDISRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRRQEGLDAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDE+ALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGI MV
Sbjct: 121 EDEDALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGIAMV 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE
Sbjct: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWERKNPK APPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSD+IFHRDFSSPTGEDKM
Sbjct: 301 EWERKNPKPAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDNIFHRDFSSPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPITK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWESR
Sbjct: 421 PKGSKKLKDWESR 433

BLAST of ClCG04G004740 vs. NCBI nr
Match: gi|641847174|gb|KDO66055.1| (hypothetical protein CISIN_1g013977mg [Citrus sinensis])

HSP 1 Score: 762.3 bits (1967), Expect = 4.6e-217
Identity = 373/433 (86.14%), Postives = 402/433 (92.84%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADD +ED DIRM+RAAAL+KAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDIEEDNDIRMSRAAALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P +EDS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEETR QEGLD EE
Sbjct: 61  PRKEDSDIGRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEETR-QEGLDMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRI+EKL QRE EEAA   EEEEE  EEEEEEESEYETDSE+E  GI M+
Sbjct: 121 EDEEALEERRRRIREKLLQREQEEAALLPEEEEEAVEEEEEEESEYETDSEEEQMGIAML 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FVPKSER+TIAERER+E EE++LEEL KR+LEERK ETK I+VEE+RKDEEIQKNLE
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEELAKRKLEERKVETKKILVEEVRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEVNEAEEYEAWKVREIARIKRDREAREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWER+NPK APPPKQKW+FMQKYYHKGAFFQ DA D A T  +D I+HRDFS+PTGEDKM
Sbjct: 301 EWERRNPKPAPPPKQKWRFMQKYYHKGAFFQSDAADTAATVRTDEIYHRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           PKGSKKLKDWE+R
Sbjct: 421 PKGSKKLKDWETR 432

BLAST of ClCG04G004740 vs. NCBI nr
Match: gi|567902030|ref|XP_006443503.1| (hypothetical protein CICLE_v10020229mg [Citrus clementina])

HSP 1 Score: 760.8 bits (1963), Expect = 1.3e-216
Identity = 372/433 (85.91%), Postives = 402/433 (92.84%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADD +ED DIRM+RAAAL+KAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDIEEDNDIRMSRAAALDKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           P +EDS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEETR QEGLD EE
Sbjct: 61  PRKEDSDIGRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEETR-QEGLDMEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAFPEEEEEEEPEEEEEEESEYETDSEDEPTGITMV 180
           EDEEALEERRRRI+EKL QRE EEAA   EEEEE  EEEEEEESEYETDSE+E  GI M+
Sbjct: 121 EDEEALEERRRRIREKLLQREQEEAALLPEEEEEAVEEEEEEESEYETDSEEEQMGIAML 180

Query: 181 KPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNLE 240
           KP+FVPKSER+TIAERER+E EE++LEEL KR+LEERK ETK I+VEE+RKDEEIQKNLE
Sbjct: 181 KPVFVPKSERDTIAERERLEAEEQALEELAKRKLEERKVETKKILVEEVRKDEEIQKNLE 240

Query: 241 MEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEERR 300
           MEANIADVDTDDE+NEAEEYEAWKVREIARIKRDRE R+AMLKE+EEIEKVRNMTEEERR
Sbjct: 241 MEANIADVDTDDEVNEAEEYEAWKVREIARIKRDREAREAMLKEKEEIEKVRNMTEEERR 300

Query: 301 EWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDKM 360
           EWER+NPK APPPKQKW+FMQKYYHKGAFFQ DA D A T  +D I+HRDFS+PTGEDKM
Sbjct: 301 EWERRNPKPAPPPKQKWRFMQKYYHKGAFFQSDAADTAATVRTDEIYHRDFSAPTGEDKM 360

Query: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPILK 420
           DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPI K
Sbjct: 361 DKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIAK 420

Query: 421 PKGSKKLKDWESR 434
           P+GSKKLKDWE+R
Sbjct: 421 PQGSKKLKDWETR 432

BLAST of ClCG04G004740 vs. NCBI nr
Match: gi|590640551|ref|XP_007029983.1| (Microfibrillar-associated protein 1 [Theobroma cacao])

HSP 1 Score: 758.1 bits (1956), Expect = 8.6e-216
Identity = 366/434 (84.33%), Postives = 403/434 (92.86%), Query Frame = 1

Query: 1   MSVTAGVSDTVIAVRDKLRGKIGQTKVKRYWPGKAPEWADDADEDGDIRMARAAALEKAF 60
           MSVTAGVSDT+IA+RDKLRGKIGQTKVKRYWPGKAPEWADDADE+GDIRMARA ALEKAF
Sbjct: 1   MSVTAGVSDTIIAIRDKLRGKIGQTKVKRYWPGKAPEWADDADEEGDIRMARAVALEKAF 60

Query: 61  PSQEDSNLSRKDDPRLRRLAESRIDNREEIRADHRRIRQAEIVSTIEEETRKQEGLDAEE 120
           PS++DS++ RKDDPRLRRLAESRIDNR+EIRADHRRIRQAEIVST EEE R+ EG++AEE
Sbjct: 61  PSRDDSDVVRKDDPRLRRLAESRIDNRDEIRADHRRIRQAEIVSTEEEENRRNEGVEAEE 120

Query: 121 EDEEALEERRRRIKEKLRQRELEEAAF-PEEEEEEEPEEEEEEESEYETDSEDEPTGITM 180
           EDE+ALEERRRRI+EKL QRE EE A   EEEEEEE EEEEEEESEYETDSE+E TGI M
Sbjct: 121 EDEDALEERRRRIREKLLQREQEETALLEEEEEEEEVEEEEEEESEYETDSEEEHTGIAM 180

Query: 181 VKPIFVPKSERETIAERERIEEEERSLEELRKRRLEERKAETKHIVVEEIRKDEEIQKNL 240
           VKP+FVPKSER+TIAERER+E EER++EE  KR+LE RK ET+ IVVE+IR+DEEIQKN+
Sbjct: 181 VKPVFVPKSERDTIAERERLEAEERAIEEAEKRKLEHRKVETRQIVVEKIREDEEIQKNM 240

Query: 241 EMEANIADVDTDDEINEAEEYEAWKVREIARIKRDRELRDAMLKEREEIEKVRNMTEEER 300
           E+EAN+ADVDTDDE+NEAEEYEAWK REIARIKRDRE R+AM+KE+EEIEKVRNMTEEER
Sbjct: 241 ELEANVADVDTDDEVNEAEEYEAWKAREIARIKRDREEREAMIKEKEEIEKVRNMTEEER 300

Query: 301 REWERKNPKTAPPPKQKWKFMQKYYHKGAFFQEDADDNAGTAGSDSIFHRDFSSPTGEDK 360
           REWERKNPK APPPKQKWKFMQKYYHKGAFFQ +ADD A   G+D+I+HRDFS PTGEDK
Sbjct: 301 REWERKNPKPAPPPKQKWKFMQKYYHKGAFFQAEADDPAAAVGADNIYHRDFSGPTGEDK 360

Query: 361 MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAGMNAPIL 420
           MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMA +NAP+ 
Sbjct: 361 MDKTILPKVMQVKHFGRSGRTKWTHLVNEDTTDWNNPWTYNDPLRAKYNAKMAAVNAPVA 420

Query: 421 KPKGSKKLKDWESR 434
           KPKGSKKLKDWES+
Sbjct: 421 KPKGSKKLKDWESK 434

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
MFAP1_HUMAN5.3e-8042.63Microfibrillar-associated protein 1 OS=Homo sapiens GN=MFAP1 PE=1 SV=2[more]
MFAP1_BOVIN1.2e-7942.63Microfibrillar-associated protein 1 OS=Bos taurus GN=MFAP1 PE=2 SV=1[more]
MFAP1_MOUSE2.6e-7942.40Microfibrillar-associated protein 1 OS=Mus musculus GN=Mfap1 PE=1 SV=1[more]
MFAP1_CHICK4.6e-7643.25Microfibrillar-associated protein 1 OS=Gallus gallus GN=MFAP1 PE=2 SV=1[more]
MFAP1_DICDI5.7e-5030.72Protein MFAP1 homolog OS=Dictyostelium discoideum GN=mfap1 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
A0A067FS77_CITSI3.2e-21786.14Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g013977mg PE=4 SV=1[more]
V4T8Z0_9ROSI9.3e-21785.91Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020229mg PE=4 SV=1[more]
A0A061EZF4_THECC6.0e-21684.33Microfibrillar-associated protein 1 OS=Theobroma cacao GN=TCM_025838 PE=4 SV=1[more]
A0A059AKH0_EUCGR1.3e-21585.91Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_J03113 PE=4 SV=1[more]
A0A067KU23_JATCU1.8e-21584.76Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02667 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G08580.12.5e-18972.48 microfibrillar-associated protein-related[more]
AT5G17900.17.3e-18972.48 microfibrillar-associated protein-related[more]
AT2G18540.14.1e-2231.62 RmlC-like cupins superfamily protein[more]
AT3G28770.19.7e-1621.77 Protein of unknown function (DUF1216)[more]
AT2G22795.11.2e-1327.42 unknown protein[more]
Match NameE-valueIdentityDescription
gi|659082793|ref|XP_008442034.1|5.5e-24798.85PREDICTED: microfibrillar-associated protein 1 [Cucumis melo][more]
gi|778695198|ref|XP_011653945.1|1.8e-24597.92PREDICTED: microfibrillar-associated protein 1 [Cucumis sativus][more]
gi|641847174|gb|KDO66055.1|4.6e-21786.14hypothetical protein CISIN_1g013977mg [Citrus sinensis][more]
gi|567902030|ref|XP_006443503.1|1.3e-21685.91hypothetical protein CICLE_v10020229mg [Citrus clementina][more]
gi|590640551|ref|XP_007029983.1|8.6e-21684.33Microfibrillar-associated protein 1 [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR009730MFAP1_C
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0001527 microfibril
cellular_component GO:0009507 chloroplast
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG04G004740.1ClCG04G004740.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR009730Micro-fibrillar-associated protein 1, C-terminalPFAMPF06991MFAP1coord: 169..390
score: 1.1
NoneNo IPR availableunknownCoilCoilcoord: 189..219
score: -coord: 278..305
score: -coord: 116..146
scor
NoneNo IPR availablePANTHERPTHR15327MICROFIBRIL-ASSOCIATED PROTEINcoord: 1..433
score: 3.1E