Cp4.1LG13g04650 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG13g04650
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Filament-like plant protein
Location: Cp4.1LG13 : 5957485 .. 5962872 (+)
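
As a quick sanity check on the Location field, the length of the gene span can be derived from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this record does not state explicitly; the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cp4.1LG13 : 5957485 .. 5962872 (+)")
# Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(chrom, strand, span_length)
```

Under that assumption the span works out to 5,388 bp on the forward strand.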
The following sequences are available for this feature:

Gene sequence (with intron)

AAAAATGCCAGAAATCCAGATTCTCAGTCAAGGGCGGGAATTCCGGCAGCGATATTTCAAAATATATAAACCAAAAAAAAAGAAAAAAAAAAAAACAAATTTGTATTTTTTTGTTGGTTACAAAAAAAGAAAAAACAAAGCAGGTGAGTACTGAATTTCGTTCTTTTGAGGCCCCATTCTTGAATTTCACACTTCCAATATGCCCCAATCACATGCTCTCTTTCTCTCTCACTCCATCCCTTGCCTTTCTTTTTCTTCAATATTTTGGAAGCTACCATTGTGAACAGAAATCTGAAGAACTAATTAAAATTAAACAACAAATGGTATTGGAACCCTTTATTCTCTGGCGTTCGCCGCTAATTTTTCTGCATATTTCCAGACATTCTACAGATTCTTGCAGCGATAGCTGTTCGTTTGCTTTCTTCCACTATTCTTAAACCCGAACGCCTCCTCCCAGGGATTCCGTTCTTATTGTTCTTCAGTTTAGGCTCCTGTTCGAGCTCTCGATTTTGTACATTTTTTCTGGTTTAAGGTAAATTTTCCATCTAGGTCTTCTTCATTTTGATGATTTTCTATTCATTTCTTGTACGAACTCTACTGAAACCTTTGATTCCTCAGACAAGAGGGGGAATCTTCTGTAGACTTGTTGATGGATCGCTTTTATTTCGAATGTTTGAAGGATTTTGCGTATCGTTTTCATTTTATCTGTTCTTCTCTGATGGCTTCTTCTTTTTCTAATCCATTGATGGCTTCCGAAGCTTTCACCTAATGTACTTGAGAATTGATGGGGTTTTTGCTTAGCTGAAACCCTTTTCTGTGATTTACTAGTTGTTGAGTGGGGTGATCTTAGGGTGGATTTGAAGCAAAGTTGAAAGAATGGATTGAAATTCCCGTATTAGCACAGGCAATTACAGTCTCTTTAGGCAAATTTTGAAGCTGGATTCGTAATGGATTTCTAAAGAACTGTGTATGTATGTCTGTAGATGGCGCTTTGACCAGTGGACTGATTTGGAAGTAAGGCGTTAGGATGGATAGAAGAAAATGGCCATGGAAGAAGAAGTCGTCATCTGATAAAAGTCCTGGACAAACCGAAAGTTCTGGATCAATGTCGACGTATTCCGAGAGGTTTTCCGACGAGCAGGTACGCAAATGAATCCTTAAAACATCTCCATTGTGGTGGCTGTTACATACACCCGAAAGGCGAAAACTAGTTGATAGTATCCAAATTTATGTCTGTTTTCAGGGATTTGCAAGAAACATTATGGTATTGTGTTTTGCTTGATTTGATTAACATCTTCTGTAAATTATATTGGAGAGATTCATCTTTTTGATGAAGCAAGATCTCCCAGGTGATTGTATGATTTGATCTCTTGGGGATGAAAAGAAAGGTGAAATAGTTTTGTTTCTTCCTCGTGAGGTGAAAAGGGTATGTAAGAACTTTTACAGGAGTGGAAACAAAAGCAGAATCAGAATGTTTGTTTGTTCTTAGCAACCCAAGTCCACCTCTATTAGATATTGTTCGCTTTGACCCGCTATGTATCGCCGTCAACCTCACGGTTTTAAAACTCATCTACTAGGGAGAGGTTTCCACACCATTATAAAGAATGTTTCGTTCTCCTCTCCAACCGATGTGGGAACTCACAATCCACCCCTTTGGGGGCCATATGTTCTCACTGTCACACCGTTGGGTGTCTGTCTCTAATACTATTTGTATCAGCCCAAGCCCTTGTGAGATCCCACATCGGTTCGAGAGAGGAACAAAACATTCTTTATAAAGGTGTGGAAACCTCTCCCTAACAGACGCGCTTTAAAACCGTGAAGCTGACGGCGATACGTAATGGGCCAAAGCAAACAATATCTACTAGCGGTGGGCTTGGGCTGTTACAGATGGAGGTGCTTGTAGAAGACAAAAGAAATCTTCAAGTTCTTAACTTGTAGGCAGAGCGTTTTCCTTGACAACCTAAGAGACAACTTCATTAGTTGGATTATCCTTATAGATT
AGTTGTCATGAAGAATTGATTGTCTCTGTGGTATGCATTTGGAGAATTGGCTTGACTCACCCATCATTATTGATAAACAGGAGGCAGCCAAGTTGTCGCCTAATCACGAGATTCAGTCACCAGAAGTAACGTCAAAAGCAGTATGTGATGAAGACATTGATGATGATTCACCAAAACAAGAAGAGATCAATGATAGTGTAAAGGGTCTATCTGATAGGTTATCAGCTGCTCTTTTAAATGTGAAGGCCAAAGAGGATTTGGTCAACCAGCATGCCAAAGTTGCTGAAGAAGCTATCGCAGGTATAATTGGACACTTCAAATTTGTTCATGTTTTACATCCTTAGACATGAATGAACTAAATGAATTCTGCTTATTCTAATGTGAATTTAGGATGGGAAAATGCCGAAAATGAAGTTGGACTACTAAAACAGCAGCTTGGGACTACAGTTCAGCAAAAGTCTGCATTGGAGGACCGAGTGAGCCACCTTGATGGGGCCCTCAAAGAATGTGTTAGGCAGCTTAGGCTGGCAAGGGAGGAGCAGGAGCAGAAGATTCGCGACACGGTGGAAGAAAAAACCCGAGATTGGGAGTCCATCAAGGTGAACCTTGAGAGGCAGCTCCTCGAGCTCGAGTGCAAAGCGGATGAGGCTAAATGTGAATCTCCTCAAAATGATCCTAGCCTTGGCAAGATGCTCGAGTCATTGAAAAGGGAGAATGCAGCCCTCAGGTATGAGCTTCATGCTCAATATAGGGAGTTAGAAACCAGGACTATTGAGAGGGATTTGAGTACACAAACAGCTGAAACAGCCAGTAAGCAACATCTTGAGAGCATAAAGAAAATGACGAAGCTTGAGGCCGAGTGTCGCCGGTTAAAAGTAATGTCATACAAACCATCATTGGTTTATGATCACAAATCCATAGCTGCTTCGACGACTATTTCTATCGAGTCACTTACCGACACTCAATCGGATAACGGAGAGCAGCTTAATGGCGTGGATATGGATATAAGAAGAACCGAACGAAACAAATGTGTAGCTAGTTGCTCAGACTCATGGTCATCAAACTTACTTGTTAGCAGTAATCTTCCTTCTTCTCTTGAACTTGATCTCATGGATGATTTTCTTGAGATGGAAAGGCTTGCATCATTACCCGAAACTTCTATAAGAGAGAGTCATCAAGAACCCGAAGCTTCTGCTCGTCCTACTGCTGAAGAAAATGCCATAAGAACCGAGCTCGAGACGTTGCAGCATGAGAGATCTGTAATGGAAGAGAAGCTGGTTGAGATGGAAGAAGCAAAGATTGAATTGGAAGCGAAGCTGAAACAGATAGAAATGGAGAAGGATGAAATGGAAGAGAGGCTAGAGATGATGGAAACGGAGAGAGCTGAAGTCGATGAAATGGTAGCGATGATGGAGAGCGAGATATCCGAATCGGGGCGAAAGTTAGCGACGATGGAAGCAGAGAAAGCTGAAATGAGAGAGAAATTAATGAAGTTGGAGGCAGAGAAAGATGAACTGAGGAGTGCTCTTTCTCAGAGTCAGAACTCTGTTGACATTTCACAATTTCAATTCAAGGAAACTGAAATGAAACTGGAGAAGTTGCAGAATGAGCTAACCTTTGCAAATGAATCAAAGTTAAGAATTGAGTCTCAACTTATCAGCATGGAAGCTGAATCTCTGACCATGAATGCAAAGGTTGGTATGTTGGAATCTGATATTCAAAATGAGAGGGCTTTTGCATTGGCATTGACAGTCAAGTGTCAGGGACTAGAAGAAGAGCTTTCAAGATTGAAACATGATGAACAACTATCACAAACTGAAATCTCCAAACATGAGTTGAAGATAAAGCAGGTAAAAATTTGTAGTCAGCATTTTTCATGTTTAGTCCTAATAGTCCAAGCCCACCACTAGTAGATATTGTCCTCTTTGGGCTTTCGCTCAAGGTTTTTAAGACGCGTCTGCTAGGGAGAGATTTTCACACCCTTATAAACAATGCTTTGT
TCCCCTCTCCAACCGATGTGGGATCTCACAATCGACCCCCTTTGAGGCTTAGTGTCTTTGTTGGCACATCGCCTGATGTCCACTCTCTTTCGGGGCTCAGCATTCTCGCTGGTACACTGTTCGGTGTCCACCCATTGCTAGCAATATCAAATAGGTCCCTTGTAACGGTCCAAACTCACTGCTAGCAAATATTGTCCTATTTGGGCTTTCCCTTTTGAGTTTTCCGTCTGTTAGGGTGAGGGGAACGAAACATTCTTTACAAAGGTGTGGACACCTCTCCCTAACAGACGCGTGTTTAGGGAGAGGTGTCCACACTCTTGTAAAGAATGTTTCGTTCCCCTCTCCAACCGATGTGAGATGTCACAATCCACCCCCTTCGAGGCCCAGCGTCCTCACTAGCACACGTTCCTCTCTCCAATCGATGTGAGACCCTCCACTCCATATCGGTTGGAGAGGAGAACAAAGGATTGGAGGGTCCCACATTGATTGGAGAGGGGAACGAGTGCCAATGAGGACGCTGGGCTCCGAAAGGAGGTAGACTATGAGATCCCTCATCGGTTGGAGAGGGGAATGAAGCATTCTTTATAAGAGTGTGGAAACCTCTCCCTAGCAGACGCGTTTTAAAAACCTCGAGGGAAAGTCTATAAAGGAAAGCCCAAAGAAGACGATATCTGCTAGTGGTGAGCTTGAGCCGTTACATCGATGCATGTATTTGAAATGCGGGGAAGTTGAATGAATACTCATAGTGGGGTAACATTTGCAGGAGGATCTAGCTGTAGCTGCAGGGAAACTTGCAGAGTGTCAAAAGACAATTGCTTCTCTTGGGAATCAGCTTAAATCTCTAGCATCCCTCGAAGATTTTCTGATCGATACGACGCAACTACCGGAGTTCACAGATGGCGAGGAGCATTGCAAGCATTCGAACGGGACACTTTCACCCAGAAGAGATTCAGACTATACTAAAGGAGTTGATGACAGTTCTGAACCATCATTGAATAAGAACGAAAACGACTCGCCCCCATTTTCATCTTCTTCAACTTCATCGTCGGTAATTGTAAGTCGTATCGTCAATTCTGAGAAAAACCGAAACGGTTTTGCTAAGTTTTTCTCTCGAACCAAGGGCGGGATAAAACTAGAAATTTAGTCAGGAATGCATATGGTTGGTGTCTAAAGATGATGGATTTGCTGATTAAATGCTGTTTAATCACATATGAATGTTTTCAATCTTCATATGGATAGACGTGTGTAGATTTGTAAGCAGCTACAAAATGGCCAGAGTAATTTTGTTTTCCCTCTGGCCTGATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTTTCCTTCCTAG

mRNA sequence

AAAAATGCCAGAAATCCAGATTCTCAGTCAAGGGCGGGAATTCCGGCAGCGATATTTCAAAATATATAAACCAAAAAAAAAGAAAAAAAAAAAAACAAATTTGTATTTTTTTGTTGGTTACAAAAAAAGAAAAAACAAAGCAGGTGAGTACTGAATTTCGTTCTTTTGAGGCCCCATTCTTGAATTTCACACTTCCAATATGCCCCAATCACATGCTCTCTTTCTCTCTCACTCCATCCCTTGCCTTTCTTTTTCTTCAATATTTTGGAAGCTACCATTGTGAACAGAAATCTGAAGAACTAATTAAAATTAAACAACAAATGGTATTGGAACCCTTTATTCTCTGGCGTTCGCCGCTAATTTTTCTGCATATTTCCAGACATTCTACAGATTCTTGCAGCGATAGCTGTTCGTTTGCTTTCTTCCACTATTCTTAAACCCGAACGCCTCCTCCCAGGGATTCCGTTCTTATTGTTCTTCAGTTTAGGCTCCTGTTCGAGCTCTCGATTTTGTACATTTTTTCTGGTTTAAGATGGCGCTTTGACCAGTGGACTGATTTGGAAGTAAGGCGTTAGGATGGATAGAAGAAAATGGCCATGGAAGAAGAAGTCGTCATCTGATAAAAGTCCTGGACAAACCGAAAGTTCTGGATCAATGTCGACGTATTCCGAGAGGTTTTCCGACGAGCAGGAGGCAGCCAAGTTGTCGCCTAATCACGAGATTCAGTCACCAGAAGTAACGTCAAAAGCAGTATGTGATGAAGACATTGATGATGATTCACCAAAACAAGAAGAGATCAATGATAGTGTAAAGGGTCTATCTGATAGGTTATCAGCTGCTCTTTTAAATGTGAAGGCCAAAGAGGATTTGGTCAACCAGCATGCCAAAGTTGCTGAAGAAGCTATCGCAGGATGGGAAAATGCCGAAAATGAAGTTGGACTACTAAAACAGCAGCTTGGGACTACAGTTCAGCAAAAGTCTGCATTGGAGGACCGAGTGAGCCACCTTGATGGGGCCCTCAAAGAATGTGTTAGGCAGCTTAGGCTGGCAAGGGAGGAGCAGGAGCAGAAGATTCGCGACACGGTGGAAGAAAAAACCCGAGATTGGGAGTCCATCAAGGTGAACCTTGAGAGGCAGCTCCTCGAGCTCGAGTGCAAAGCGGATGAGGCTAAATGTGAATCTCCTCAAAATGATCCTAGCCTTGGCAAGATGCTCGAGTCATTGAAAAGGGAGAATGCAGCCCTCAGGTATGAGCTTCATGCTCAATATAGGGAGTTAGAAACCAGGACTATTGAGAGGGATTTGAGTACACAAACAGCTGAAACAGCCAGTAAGCAACATCTTGAGAGCATAAAGAAAATGACGAAGCTTGAGGCCGAGTGTCGCCGGTTAAAAGTAATGTCATACAAACCATCATTGGTTTATGATCACAAATCCATAGCTGCTTCGACGACTATTTCTATCGAGTCACTTACCGACACTCAATCGGATAACGGAGAGCAGCTTAATGGCGTGGATATGGATATAAGAAGAACCGAACGAAACAAATGTGTAGCTAGTTGCTCAGACTCATGGTCATCAAACTTACTTGTTAGCAGTAATCTTCCTTCTTCTCTTGAACTTGATCTCATGGATGATTTTCTTGAGATGGAAAGGCTTGCATCATTACCCGAAACTTCTATAAGAGAGAGTCATCAAGAACCCGAAGCTTCTGCTCGTCCTACTGCTGAAGAAAATGCCATAAGAACCGAGCTCGAGACGTTGCAGCATGAGAGATCTGTAATGGAAGAGAAGCTGGTTGAGATGGAAGAAGCAAAGATTGAATTGGAAGCGAAGCTGAAACAGATAGAAATGGAGAAGGATGAAATGGAAGAGAGGCTAGAGATGATGGAAACGGAGAGAGCTGAAGTCGATGAAATGGTAGCGATGATGGAGAGCGAGATATCCGAATCGGGGCGAAAGTTAGCGACGATGGAAGCAGAGAAAGCTGAAATGAGAGA
GAAATTAATGAAGTTGGAGGCAGAGAAAGATGAACTGAGGAGTGCTCTTTCTCAGAGTCAGAACTCTGTTGACATTTCACAATTTCAATTCAAGGAAACTGAAATGAAACTGGAGAAGTTGCAGAATGAGCTAACCTTTGCAAATGAATCAAAGTTAAGAATTGAGTCTCAACTTATCAGCATGGAAGCTGAATCTCTGACCATGAATGCAAAGGAGGATCTAGCTGTAGCTGCAGGGAAACTTGCAGAGTGTCAAAAGACAATTGCTTCTCTTGGGAATCAGCTTAAATCTCTAGCATCCCTCGAAGATTTTCTGATCGATACGACGCAACTACCGGAGTTCACAGATGGCGAGGAGCATTGCAAGCATTCGAACGGGACACTTTCACCCAGAAGAGATTCAGACTATACTAAAGGAGTTGATGACAGTTCTGAACCATCATTGAATAAGAACGAAAACGACTCGCCCCCATTTTCATCTTCTTCAACTTCATCGTCGGTAATTGTAAGTCGTATCGTCAATTCTGAGAAAAACCGAAACGGTTTTGCTAAGTTTTTCTCTCGAACCAAGGGCGGGATAAAACTAGAAATTTAGTCAGGAATGCATATGGTTGGTGTCTAAAGATGATGGATTTGCTGATTAAATGCTGTTTAATCACATATGAATGTTTTCAATCTTCATATGGATAGACGTGTGTAGATTTGTAAGCAGCTACAAAATGGCCAGAGTAATTTTGTTTTCCCTCTGGCCTGATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTATTTTCCTTCCTAG

Coding sequence (CDS)

ATGGATAGAAGAAAATGGCCATGGAAGAAGAAGTCGTCATCTGATAAAAGTCCTGGACAAACCGAAAGTTCTGGATCAATGTCGACGTATTCCGAGAGGTTTTCCGACGAGCAGGAGGCAGCCAAGTTGTCGCCTAATCACGAGATTCAGTCACCAGAAGTAACGTCAAAAGCAGTATGTGATGAAGACATTGATGATGATTCACCAAAACAAGAAGAGATCAATGATAGTGTAAAGGGTCTATCTGATAGGTTATCAGCTGCTCTTTTAAATGTGAAGGCCAAAGAGGATTTGGTCAACCAGCATGCCAAAGTTGCTGAAGAAGCTATCGCAGGATGGGAAAATGCCGAAAATGAAGTTGGACTACTAAAACAGCAGCTTGGGACTACAGTTCAGCAAAAGTCTGCATTGGAGGACCGAGTGAGCCACCTTGATGGGGCCCTCAAAGAATGTGTTAGGCAGCTTAGGCTGGCAAGGGAGGAGCAGGAGCAGAAGATTCGCGACACGGTGGAAGAAAAAACCCGAGATTGGGAGTCCATCAAGGTGAACCTTGAGAGGCAGCTCCTCGAGCTCGAGTGCAAAGCGGATGAGGCTAAATGTGAATCTCCTCAAAATGATCCTAGCCTTGGCAAGATGCTCGAGTCATTGAAAAGGGAGAATGCAGCCCTCAGGTATGAGCTTCATGCTCAATATAGGGAGTTAGAAACCAGGACTATTGAGAGGGATTTGAGTACACAAACAGCTGAAACAGCCAGTAAGCAACATCTTGAGAGCATAAAGAAAATGACGAAGCTTGAGGCCGAGTGTCGCCGGTTAAAAGTAATGTCATACAAACCATCATTGGTTTATGATCACAAATCCATAGCTGCTTCGACGACTATTTCTATCGAGTCACTTACCGACACTCAATCGGATAACGGAGAGCAGCTTAATGGCGTGGATATGGATATAAGAAGAACCGAACGAAACAAATGTGTAGCTAGTTGCTCAGACTCATGGTCATCAAACTTACTTGTTAGCAGTAATCTTCCTTCTTCTCTTGAACTTGATCTCATGGATGATTTTCTTGAGATGGAAAGGCTTGCATCATTACCCGAAACTTCTATAAGAGAGAGTCATCAAGAACCCGAAGCTTCTGCTCGTCCTACTGCTGAAGAAAATGCCATAAGAACCGAGCTCGAGACGTTGCAGCATGAGAGATCTGTAATGGAAGAGAAGCTGGTTGAGATGGAAGAAGCAAAGATTGAATTGGAAGCGAAGCTGAAACAGATAGAAATGGAGAAGGATGAAATGGAAGAGAGGCTAGAGATGATGGAAACGGAGAGAGCTGAAGTCGATGAAATGGTAGCGATGATGGAGAGCGAGATATCCGAATCGGGGCGAAAGTTAGCGACGATGGAAGCAGAGAAAGCTGAAATGAGAGAGAAATTAATGAAGTTGGAGGCAGAGAAAGATGAACTGAGGAGTGCTCTTTCTCAGAGTCAGAACTCTGTTGACATTTCACAATTTCAATTCAAGGAAACTGAAATGAAACTGGAGAAGTTGCAGAATGAGCTAACCTTTGCAAATGAATCAAAGTTAAGAATTGAGTCTCAACTTATCAGCATGGAAGCTGAATCTCTGACCATGAATGCAAAGGAGGATCTAGCTGTAGCTGCAGGGAAACTTGCAGAGTGTCAAAAGACAATTGCTTCTCTTGGGAATCAGCTTAAATCTCTAGCATCCCTCGAAGATTTTCTGATCGATACGACGCAACTACCGGAGTTCACAGATGGCGAGGAGCATTGCAAGCATTCGAACGGGACACTTTCACCCAGAAGAGATTCAGACTATACTAAAGGAGTTGATGACAGTTCTGAACCATCATTGAATAAGAACGAAAACGACTCGCCCCCATTTTCATCTTCTTCAACTTCATCGTCGGTAATTGTAAGTCGTATCGTCAATTCTGAGAAAAACCGAAACGGTTTTGCTAAGTTTTTCTCTCGAACCAAGGGCGG
GATAAAACTAGAAATTTAG

Protein sequence

MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVCDEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEVGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESIKVNLERQLLELECKADEAKCESPQNDPSLGKMLESLKRENAALRYELHAQYRELETRTIERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESLTDTQSDNGEQLNGVDMDIRRTERNKCVASCSDSWSSNLLVSSNLPSSLELDLMDDFLEMERLASLPETSIRESHQEPEASARPTAEENAIRTELETLQHERSVMEEKLVEMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKLATMEAEKAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFANESKLRIESQLISMEAESLTMNAKEDLAVAAGKLAECQKTIASLGNQLKSLASLEDFLIDTTQLPEFTDGEEHCKHSNGTLSPRRDSDYTKGVDDSSEPSLNKNENDSPPFSSSSTSSSVIVSRIVNSEKNRNGFAKFFSRTKGGIKLEI
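
The protein sequence corresponds to the coding sequence read codon by codon. A minimal sketch of that relationship follows, assuming the standard nuclear genetic code (not stated in this record) and using only the first 33 bases of the CDS as input; the function name `translate` is illustrative and is not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 33 bases of the CDS listed above.
cds_prefix = "ATGGATAGAAGAAAATGGCCATGGAAGAAGAAG"
print(translate(cds_prefix))
```

The translated prefix can be compared against the first eleven residues of the protein sequence above (MDRRKWPWKKK).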
BLAST of Cp4.1LG13g04650 vs. Swiss-Prot
Match: FPP_SOLLC (Filament-like plant protein (Fragment) OS=Solanum lycopersicum GN=FPP PE=1 SV=1)

HSP 1 Score: 351.7 bits (901), Expect = 1.8e-95
Identity = 271/604 (44.87%), Positives = 363/604 (60.10%), Query Frame = 1

Query: 95  KEDLVNQHAKVAEEAIAGWENAENEVGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQ 154
           KEDLV QHAKVAEEAIAGWE AENEV +LKQQL   VQQ   LE RVSHLDGALKECVRQ
Sbjct: 1   KEDLVKQHAKVAEEAIAGWEKAENEVAVLKQQLDAAVQQNLTLEVRVSHLDGALKECVRQ 60

Query: 155 LRLAREEQEQKIRDTVEEKTRDWESIKVNLERQLLELECKADEAKCESPQN-DPSLGKML 214
           LR AR+EQE+ I+D + EK  + ES K  LE+QLL+L+ + +  K E P + DP +   L
Sbjct: 61  LRQARDEQEKMIQDAMAEKN-EMESEKTALEKQLLKLQTQVEAGKAEMPTSTDPDILVRL 120

Query: 215 ESLKRENAALRYELHAQYRELETRTIERDLSTQTAETASKQHLESIKKMTKLEAECRRLK 274
           + L++ENAAL+ EL +    LE RTIERDLSTQ AETASKQ LESIKK+TKLE ECR+L+
Sbjct: 121 KYLEKENAALKIELVSCSEVLEIRTIERDLSTQAAETASKQQLESIKKLTKLEVECRKLQ 180

Query: 275 VMSYKPSLVYDHKSIAASTTISIESLTDTQSDNGEQLNGVDMD---IRRTERNKCVASCS 334
            M+ K S   D +S A S +  ++S+TD+QSD+GE+LN VD D   + + E  +   SCS
Sbjct: 181 AMARKSSPFNDQRSSAVS-SFYVDSVTDSQSDSGERLNTVDNDALKMSKLETREYEPSCS 240

Query: 335 DSWSSNLLVSSN-------LPS-----SLELDLMDDFLEMERLASLPETSIRE------- 394
           +SW+S L+   +       +P      S+E+D+MDDFLEME+LA+L ET+ +        
Sbjct: 241 NSWASALIAELDQFKNEKAMPKTLAACSIEIDMMDDFLEMEQLAALSETANKTPSVTSDA 300

Query: 395 -SHQEPEASARPTAEENAIRTELETLQHERSVMEEKLVEMEEAKIELEAKLKQIEMEKDE 454
             H  P       AE N+I   +  L+ +   +E +  E+E A  E +  LK   ++  E
Sbjct: 301 VPHDSPNIENPLAAEYNSISQRVVELEQKLEKIEAEKAELENAFNESQDALKVSSLQLKE 360

Query: 455 MEERLEMMETERAEVDEMVAMMESEISESGRKLATMEAEKAEMREKLMKLEAEKDELRSA 514
            + RLE ++ E   V+E   ++E +       L  ME E   M   +  L+ E ++ +S 
Sbjct: 361 TQTRLEGLQKELDVVNESKELLEFQ-------LYGMEVEARTMSVNIDSLKTEVEKEKSL 420

Query: 515 LSQSQNSVDISQFQFKETEMKLEKLQNELTFANESKLRIESQLISMEAESLTMNAKEDLA 574
            S              E E K  +L+N+L    +     E+Q  S     L +  +EDLA
Sbjct: 421 SS--------------EMEAKCHELENDL---RKKSQEAEAQQTSGSNSELKIK-QEDLA 480

Query: 575 VAAGKLAECQKTIASLGNQLKSLASLEDFLIDTTQLP------EFTDGEEHCKHSNGTLS 634
           VAA KLAECQKTIASLG QL+SLA+LEDFL DT  LP          GE    H N T +
Sbjct: 481 VAADKLAECQKTIASLGKQLQSLATLEDFLTDTANLPGGGAVVAKAGGELWKLHVNETFT 540

Query: 635 PRRDSDYTKGVDDSSEPSLNKNENDSPPFSSSSTSSSVIVSRIVNSEKNRNGFAKFFSRT 669
           P+RDSD TK V+++   S N+NE +SP  SSSS++SS   +   ++ K++NGF K FSR+
Sbjct: 541 PKRDSDPTK-VEENVSHSTNENEGESPASSSSSSTSSTTQA---STGKSKNGFGKLFSRS 573
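
The percentages in an HSP header are the match counts divided by the alignment length. The sketch below reproduces the two percentages reported for the FPP_SOLLC hit from the counts in its header; it assumes the denominator (604) is the alignment length in columns, gaps included, as is usual for BLAST reports. The helper name `percent` is illustrative.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)

# Counts from the FPP_SOLLC HSP header: 271 identical and 363 positive
# positions over 604 alignment columns (assumed to include gap columns).
identity = percent(271, 604)
positives = percent(363, 604)
print(f"identity = {identity:.2f}%, positives = {positives:.2f}%")
```

The same calculation applies to the header of every other hit in this report.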

BLAST of Cp4.1LG13g04650 vs. Swiss-Prot
Match: FPP3_ARATH (Filament-like plant protein 3 OS=Arabidopsis thaliana GN=FPP3 PE=2 SV=2)

HSP 1 Score: 297.7 bits (761), Expect = 3.1e-79
Identity = 228/569 (40.07%), Positives = 338/569 (59.40%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           MDRR W W++KSS +KSPG+TES+GS+S++SERFSD+Q +         QSPE+ SK V 
Sbjct: 1   MDRRSWLWRRKSS-EKSPGETESTGSVSSHSERFSDDQRS---------QSPELNSKPVT 60

Query: 61  DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEV 120
                    ++EE    +K L++RLSAALLNV  KEDL  QHAKVAEEA++GWE AENE 
Sbjct: 61  ---------REEEATADIKILTERLSAALLNVSLKEDLAKQHAKVAEEAVSGWEKAENEA 120

Query: 121 GLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESI 180
             LKQQL  +  + SALEDR SHLD ALKECVRQL   REEQ QKI + +  K ++WE+ 
Sbjct: 121 AALKQQLDASTSKVSALEDRNSHLDSALKECVRQLWQGREEQNQKIEEAINNKCKEWETT 180

Query: 181 KVNLERQLLELECKADEAKCESPQNDPSLGKMLESLKRENAALRYELHAQYRELETRTIE 240
           K  LE ++ EL+ + D       ++   L   LE+L++EN+AL+ +L ++  E++ RTIE
Sbjct: 181 KSQLEARIEELQARQDVTTSSVHED---LYPKLEALEKENSALKLQLLSKSEEVKIRTIE 240

Query: 241 RDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESLT 300
           RDLSTQ AE+ASKQ LE IKK+TKLEAECR+L+VM  +           +  +  ++S  
Sbjct: 241 RDLSTQAAESASKQQLEGIKKLTKLEAECRKLRVMVRR-----------SDNSSDLKSSI 300

Query: 301 DTQSDNGEQLNGVDMDIRRTERNKCVASCSDSWSSNLLVSSNLPSSLELDLMDDFLEMER 360
           D QSD   +++  D +++             S S  ++  S++ +S+++ LMDDFLEME+
Sbjct: 301 DNQSDYSGRVSFSDNEMQ-------------SPSEKIIGKSSMATSVDIGLMDDFLEMEK 360

Query: 361 LASLPETSIRESHQEPEASA-RPTAEENAIRTELETLQHERSVMEEKLVEMEEAKIELEA 420
           LA+LP +     H E      +  A  N ++ EL+T     S +EEK+  +E  K++LE 
Sbjct: 361 LAALPHSEPGRKHSESNKELEKSNAHVNQLKHELKTSLRRISELEEKVEMVEVEKLQLEM 420

Query: 421 KLKQIEMEKDEMEERLEMMETERAEVDEMVAM---MESEISESGRKLATMEAEKAEMREK 480
            L   + + + ++ RL+ +E + +E+ ++ A    +E  + ESG+++  ++ +  + +  
Sbjct: 421 ALNGSKEQIEALQSRLKEIEGKLSEMKKLEAENQELELLLGESGKQMEDLQRQLNKAQVN 480

Query: 481 LMKLE---AEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQ--------------NEL 540
           L +LE   AEK EL   L+ ++  ++ SQ + KETE KL +LQ              + L
Sbjct: 481 LSELETRRAEKLELTMCLNGTKKQLETSQNRLKETERKLTELQTLLHLTKDAKEAAEDGL 523

Query: 541 TFANESKLRIESQL--ISMEAESLTMNAK 547
             AN     IES+L  +  EAESL +  K
Sbjct: 541 KAANGKTEAIESRLKDVEAEAESLILKIK 523

BLAST of Cp4.1LG13g04650 vs. Swiss-Prot
Match: FPP1_ARATH (Filament-like plant protein 1 OS=Arabidopsis thaliana GN=FPP1 PE=2 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 3.9e-74
Identity = 230/565 (40.71%), Positives = 326/565 (57.70%), Query Frame = 1

Query: 9   KKKSSSDKSPGQTESSGSMSTYSERFSDEQ-EAAKLSPNHEIQSPEVTSKAVCDEDIDDD 68
           +K+ SS++S G++ES   +S+ SE+ S+ Q E+   S + EIQSP V+ +          
Sbjct: 4   RKRESSERSFGESES---VSSLSEKDSEIQPESTMESRDDEIQSPTVSLEV--------- 63

Query: 69  SPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEVGLLKQQL 128
             ++EE+ DS+K L+++LSAAL NV AK+DLV QH KVAEEA+AGWE AENEV  LK++L
Sbjct: 64  ETEKEELKDSMKTLAEKLSAALANVSAKDDLVKQHVKVAEEAVAGWEKAENEVVELKEKL 123

Query: 129 GTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESIKVNLERQ 188
                +   LEDRVSHLDGALKECVRQLR AR+EQEQ+I+D V E+T++ +S + +LE Q
Sbjct: 124 EAADDKNRVLEDRVSHLDGALKECVRQLRQARDEQEQRIQDAVIERTQELQSSRTSLENQ 183

Query: 189 LLELECKADEAKCESPQNDPSLGKMLESLKRENAALRYELHAQYRELETRTIERDLSTQT 248
           + E   K++E           L +M ES+ +EN  LR+EL A+  ELE RTIERDLSTQ 
Sbjct: 184 IFETATKSEE-----------LSQMAESVAKENVMLRHELLARCEELEIRTIERDLSTQA 243

Query: 249 AETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESLTDTQSDNG 308
           AETASKQ L+SIKK+ KLEAECR+ ++++   +   DH+S            TD+ SD G
Sbjct: 244 AETASKQQLDSIKKVAKLEAECRKFRMLAKSSASFNDHRS------------TDSHSDGG 303

Query: 309 EQLNGVDMDIRRTERNKCVASCSDSWSSNLLVSSN----LPSSLELDLMDDFLEMERLAS 368
           E+++                SCSDSW+S+ L+         SS+ELDLM DFLEMERL +
Sbjct: 304 ERMD---------------VSCSDSWASSTLIEKRSLQGTSSSIELDLMGDFLEMERLVA 363

Query: 369 LPETSIRESHQEPEASARPTA--EENAIRTELETLQHERSVMEEKLVEMEEAKIELEAKL 428
           LPET        PE+         EN++ +E+E       V+  ++ E+EE       KL
Sbjct: 364 LPETPDGNGKSGPESVTEEVVVPSENSLASEIE-------VLTSRIKELEE-------KL 423

Query: 429 KQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKLATMEAEKAEMREKLMKLE 488
           +++E EK E+E  ++    E      +V +  SE+  S  K         E+ EKL KLE
Sbjct: 424 EKLEAEKHELENEVKCNREEA-----VVHIENSEVLTSRTK---------ELEEKLEKLE 483

Query: 489 AEKDELRS--------ALSQSQNS----VDISQFQFKETEMKLEKLQNE-LTFANESKLR 548
           AEK+EL+S        A+   +NS    +++   + KE E +LEKL+ E +   +E K  
Sbjct: 484 AEKEELKSEVKCNREKAVVHVENSLAAEIEVLTSRTKELEEQLEKLEAEKVELESEVKCN 490

BLAST of Cp4.1LG13g04650 vs. Swiss-Prot
Match: FPP2_ARATH (Filament-like plant protein 2 OS=Arabidopsis thaliana GN=FPP2 PE=1 SV=1)

HSP 1 Score: 233.8 bits (595), Expect = 5.5e-60
Identity = 223/668 (33.38%), Positives = 328/668 (49.10%), Query Frame = 1

Query: 94  AKEDLVNQHAKVAEEAIAGWENAENEVGLLKQQLGTTVQQKSALEDRVSHLDGALKECVR 153
           +K++LV QHAKVAE+A+AGWE AENEV  LKQ+L     +   LEDRVSHLDGALKECVR
Sbjct: 15  SKDELVKQHAKVAEDAVAGWEKAENEVVELKQKLEDAADKNIVLEDRVSHLDGALKECVR 74

Query: 154 QLRLAREEQEQKIRDTVEEKTRDWESIKVNLERQLLELECKADEAKCESPQNDPSLGKML 213
           QLR  R+EQE+ I+  V E T++  S    LE+++LEL+ +A+ AK E            
Sbjct: 75  QLRQFRDEQEKNIQAAVTESTKELHSANTGLEKRVLELQKEAEAAKSE------------ 134

Query: 214 ESLKRENAALRYELHAQYRELETRTIERDLSTQTAETASKQHLESIKKMTKLEAECRRLK 273
                 N  LR E   Q  +LE   IERDLSTQ AETASKQHL+ IKK+ KLEAECR+L+
Sbjct: 135 ------NMMLRREFLTQREDLEIVMIERDLSTQAAETASKQHLDIIKKLAKLEAECRKLR 194

Query: 274 VMSYKPSLVYDHKSIAASTTISIESLTDTQSDNGEQLNGVDMDIRRTERNKCVASCSDSW 333
           +++              S+++S     D+ SD G              R +   SCSDSW
Sbjct: 195 ILA------------KTSSSLSSNQSVDSHSDGG--------------RERVEGSCSDSW 254

Query: 334 SSNLLVSS---------------NLPSSLELDLMDDFLEMERLASLP-ETSIRESHQEPE 393
           +S+  +S                   SS E+DLMDDFLEMERL +LP ET  + S    E
Sbjct: 255 ASSAFISELDQIKNEKGGNRSLQGTTSSTEIDLMDDFLEMERLVALPTETQAKNSKDGYE 314

Query: 394 ASARPTAE-------------------ENAIRTELETLQHERSVMEEKLVEMEEAKIELE 453
            S     E                   E  +  E+E +  ++  +E+ L  +E  K EL+
Sbjct: 315 LSLMEKLEKIQAEKDDLEREVKCCREAEKRLSLEIEAVVGDKMELEDMLKRVEAEKAELK 374

Query: 454 A-------KLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKLATMEAEKA 513
                   K ++  +   E++ +LE ++ E+ E+D  V   +        +L  +  +K 
Sbjct: 375 TSFDVLKDKYQESRVCFQEVDTKLEKLQAEKDELDSEVICCKEAEKRFSLELEAVVGDKI 434

Query: 514 EMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFANESKLRIES 573
           EM ++L K+EAEK EL+ +    ++    S+  F+E EMKLE ++ EL  ANESK + ES
Sbjct: 435 EMEDELEKMEAEKAELKISFDVIKDQYQESRVCFQEVEMKLEAMKRELKLANESKTQAES 494

Query: 574 QLISMEAESLTMNAKEDLAVAAGKLAECQKTIASLGNQLKSLASLEDFLID--------T 633
           ++  MEAE      +++  V+ G   +C+     L  +++    ++   ++         
Sbjct: 495 RVTRMEAE-----VRKERIVSDGLKEKCETFEEELRREIEEKTMIKREKVEPKIKQEDIA 554

Query: 634 TQLPEFTDGEEHCKHSNGTLSPRRDSDYTKG---VDDSSEPS-----------LNKNEND 673
           T   +F D    C+ +  +L  +  S  T     +D +S P            L K+ ++
Sbjct: 555 TAAGKFAD----CQKTIASLGKQLQSLATLEEFLIDTASIPGSARSVHNKEALLGKDPHE 614

BLAST of Cp4.1LG13g04650 vs. Swiss-Prot
Match: FPP5_ARATH (Filament-like plant protein 5 OS=Arabidopsis thaliana GN=FPP5 PE=2 SV=2)

HSP 1 Score: 149.1 bits (375), Expect = 1.8e-34
Identity = 107/302 (35.43%), Positives = 163/302 (53.97%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFS-----DEQEAAKLSPNHEIQSPEVT 60
           M+ R WPWK+KSS DK+  +    G  ST     S     + QE  K +   +I     T
Sbjct: 1   MEGRGWPWKRKSS-DKATTEKPVVGIESTPVCSLSYLASLENQEKCKNTNYVQITMDSYT 60

Query: 61  SKAVCDEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWEN 120
             +  ++ +     K  E+   VK L ++L+ A   +  KE L+ QHAKVAEEA++GWE 
Sbjct: 61  HMSRMEDQV-----KLFEVQ--VKDLKEKLTLAHSEINTKESLILQHAKVAEEAVSGWEK 120

Query: 121 AENEVGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTR 180
           A+ E   LK+QL +    K   EDR SHLD ALKEC RQ+R+ +EE ++K++D +  KT 
Sbjct: 121 ADAETLALKRQLESVTLLKLTAEDRASHLDDALKECTRQIRIVKEESDKKLQDVILAKTS 180

Query: 181 DWESIKVNLERQLLELE------------------------CKADEAKCESPQNDPSLGK 240
            W+ IK  LE ++ EL                          +  E + ++  +   L  
Sbjct: 181 QWDKIKAELEGKIDELSEGLHRAASDNAALTRSLQERSEMIVRISEERSKAEADVEKLKT 240

Query: 241 MLESLKRENAALRYELHAQYRELETRTIERDLSTQTAETASKQHLESIKKMTKLEAECRR 274
            L+  ++E + L+Y+LH   +E+E R  E+++S ++A+ A+KQHLE +KK+ KLEAEC R
Sbjct: 241 NLQLAEKEISYLKYDLHVASKEVEIRNEEKNMSLKSADIANKQHLEGVKKIAKLEAECHR 294

BLAST of Cp4.1LG13g04650 vs. TrEMBL
Match: A0A0A0KDU7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G385120 PE=4 SV=1)

HSP 1 Score: 755.4 bits (1949), Expect = 6.1e-215
Identity = 446/561 (79.50%), Positives = 487/561 (86.81%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           M+RRKWPWK+KSS DKSPG+TESSGSMS+YSERFSDEQ+AAK SPNHE QSPEV+SKA+C
Sbjct: 1   MERRKWPWKRKSS-DKSPGETESSGSMSSYSERFSDEQDAAKSSPNHETQSPEVSSKAIC 60

Query: 61  -DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENE 120
            +EDIDDD PKQEEINDSVK LS+RLSAAL+NVKAKEDLV QHAKVAEEAIAGWE AENE
Sbjct: 61  KEEDIDDDLPKQEEINDSVKSLSERLSAALVNVKAKEDLVKQHAKVAEEAIAGWEKAENE 120

Query: 121 VGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWES 180
           V  LKQQLGTTVQQKSALEDRVSHLDGALKECVRQLR AREEQEQKI D VEEKTRDW+S
Sbjct: 121 VTHLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRQAREEQEQKIHDAVEEKTRDWQS 180

Query: 181 IKVNLERQLLELECKADEAKCESPQNDPSLGKMLESLKRENAALRYELHAQYRELETRTI 240
            KV+LERQLL L+  AD AKCESP+ DPSLGKMLE LKRENAALR+ELHAQYRELETRTI
Sbjct: 181 TKVDLERQLLALQSIADTAKCESPKVDPSLGKMLELLKRENAALRHELHAQYRELETRTI 240

Query: 241 ERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESL 300
           ERDLSTQTAETASKQHLESIKKM KLEAECRRLK MS KPS V DHKSIAAS TISIESL
Sbjct: 241 ERDLSTQTAETASKQHLESIKKMAKLEAECRRLKFMSCKPSFV-DHKSIAAS-TISIESL 300

Query: 301 TDTQSDNGEQLNGVDMDIRRTERNKCVASCSDSWSSNLL-----------VSSNLPSSLE 360
           TDTQSDNGEQL+ VD+DI RTERNK   SCS   +S LL           VSSNLPSSLE
Sbjct: 301 TDTQSDNGEQLSAVDIDI-RTERNKGEPSCSHPRASTLLAELNQLGNEKAVSSNLPSSLE 360

Query: 361 LDLMDDFLEMERLASLPETSIRESHQEPEASARPTAEENAIRTELETLQHERSVMEEKLV 420
           LDLMDDFLEMERLASLPET   +S QE EA  R TAEENA+RTELE L+HERS+ME+KL 
Sbjct: 361 LDLMDDFLEMERLASLPETDTGKSRQESEAFPRSTAEENALRTELEALRHERSLMEKKLG 420

Query: 421 EMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKLATMEA 480
           EMEEAKIELE KLKQ+E+EKDE+EERLEMME ER E ++M+A ME++  E G+KL  ME 
Sbjct: 421 EMEEAKIELEEKLKQMEVEKDELEERLEMMEIERDEANQMLAKMETKQYELGQKLVKMEE 480

Query: 481 EKAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFANESKLR 540
           EK EM EKLMKLE +KDEL +ALS+SQNSV+ISQFQ KET+MKLEKLQNELT A+ESKLR
Sbjct: 481 EKVEMGEKLMKLETQKDELETALSRSQNSVEISQFQLKETQMKLEKLQNELTIADESKLR 540

Query: 541 IESQLISMEAESLTMNAKEDL 550
           IESQLISMEAESLTM+AK ++
Sbjct: 541 IESQLISMEAESLTMSAKVEM 557

BLAST of Cp4.1LG13g04650 vs. TrEMBL
Match: A0A067KCT1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12095 PE=4 SV=1)

HSP 1 Score: 553.1 bits (1424), Expect = 4.6e-154
Identity = 362/698 (51.86%), Positives = 477/698 (68.34%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           M++RKW WK+KSS ++SPG+TESSGS+S+ SERFSDEQ+  K SPN+E QSPEVTSK V 
Sbjct: 1   MEKRKWLWKRKSS-ERSPGETESSGSISSQSERFSDEQDNLKASPNNETQSPEVTSKTVV 60

Query: 61  DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEV 120
                    + E++NDSV+ L+++LSAAL+NV AK+DLV QH+KVAEEA+AGWE AENEV
Sbjct: 61  ---------RDEDVNDSVRILTEKLSAALVNVSAKDDLVKQHSKVAEEAVAGWEKAENEV 120

Query: 121 GLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESI 180
             LK+QL   +QQ  ALEDRVSHLDGALKECVRQLR AREE E+K+ + V +KT +WES+
Sbjct: 121 AALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQAREEHEEKVYEAVTKKTIEWESV 180

Query: 181 KVNLERQLLELECKADEAKCESP-QNDPSLGKMLESLKRENAALRYELHAQYRELETRTI 240
           K  LE QLLEL+ KA+  K ESP Q  P L   LE L+++NA+L+ E+ +   ELE R I
Sbjct: 181 KSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEKDNASLKLEILSLSEELELRII 240

Query: 241 ERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESL 300
           ERDLSTQ AETASKQHL+SIKK+ KLEAECRRLK ++ K S + DHK+  AS+ + +ESL
Sbjct: 241 ERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACKSSSLNDHKTSIASS-MYVESL 300

Query: 301 TDTQSDNGEQLNGVDMDIRRT---ERNKCVASCSDSWSSNLL-----------VSSNLP- 360
           TD+QSD+GE+LN V++D  +    E +KC  SCSDSW+S L+           V+ NLP 
Sbjct: 301 TDSQSDSGERLNAVELDAHKISCLEPSKCEPSCSDSWASALIAELDQFKNEKAVNRNLPA 360

Query: 361 SSLELDLMDDFLEMERLASLPETSIRESHQEPEASARPTAE-ENAIRTELETLQHERSVM 420
           SS+E+DLMDDFLEMERLASLPE        EPE  A  + + E+++R ELE + H  + +
Sbjct: 361 SSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQSTDVESSLRAELEIMIHRTAEL 420

Query: 421 EEKLVEMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKL 480
           E++L +ME  K+ELE KL++I +E+ E+E  L +   +  E    +   E ++ +  ++L
Sbjct: 421 EKQLQKMEGEKVELEEKLEKILVERTELEMSLTISREKNEEFQIQLGEAELKMKQLHQEL 480

Query: 481 ATMEAEKAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFAN 540
           +     K ++  +L+ +E E   + S +   +  ++  +    E  +K   L+ EL+  N
Sbjct: 481 SIANESKQQIESQLVSMEVEARTMASKVDSLEAELEKEKVLSAELAVKCRTLEEELSEKN 540

Query: 541 ESKLRIESQLISMEAESLTMNAKEDLAVAAGKLAECQKTIASLGNQLKSLASLEDFLIDT 600
           +    +E Q  +     L +  +EDLAVAAGKLAECQKTIASLG QLKSLA+LEDFLIDT
Sbjct: 541 KE---VELQKSASSNGELKIK-QEDLAVAAGKLAECQKTIASLGKQLKSLATLEDFLIDT 600

Query: 601 TQLPEFTDG--------EEHCK-HSNGTLSPRRDSDYTKGVDDSSEPSLNKNENDSPPFS 660
             LPEFT G        EE  K HS+ TLSP+RDS  ++   ++S PS+NKNE  S P S
Sbjct: 601 ASLPEFTAGGALMPKATEEPWKLHSSDTLSPKRDSSSSRIASENSGPSVNKNEGHSTPSS 660

Query: 661 SSSTSSSVIVSRIVNSEKNRNGFAKFFSRTKGGIKLEI 673
           SSS SS  + S  +NSEKNRNGFAKFFSR K GI+LEI
Sbjct: 661 SSSASS--VSSIHINSEKNRNGFAKFFSRNKNGIQLEI 681

BLAST of Cp4.1LG13g04650 vs. TrEMBL
Match: B9GU37_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s08400g PE=4 SV=1)

HSP 1 Score: 532.7 bits (1371), Expect = 6.4e-148
Identity = 359/716 (50.14%), Positives = 484/716 (67.60%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           M++RKW WK+KSS ++SPG+T+SSGS+S++SERFSD+Q+ +K SP    QSPEVTSK + 
Sbjct: 1   MEKRKWLWKRKSS-ERSPGETDSSGSISSHSERFSDDQDPSKASPTDSAQSPEVTSKTIT 60

Query: 61  DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEV 120
            +         E++ND +K L+D+LSAAL+NV AK+DLV QH KVAEEA+AGWE AENEV
Sbjct: 61  TD---------EDVNDRIKSLTDKLSAALVNVSAKDDLVKQHVKVAEEAVAGWEKAENEV 120

Query: 121 GLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESI 180
             LK+QL   +QQK+ LEDRVSHLDGALKECVRQLR AREE E+KI + V +K+ +WESI
Sbjct: 121 TALKKQLEVAIQQKAGLEDRVSHLDGALKECVRQLRQAREELEEKIHEAVVQKSLEWESI 180

Query: 181 KVNLERQLLELECKADEAKCESPQN-DPSLGKMLESLKRENAALRYELHAQYRELETRTI 240
           K  LE Q +EL+ K   AK ESP      L + LE L++ENA L+ EL +Q  ELE RTI
Sbjct: 181 KSELENQFIELKSKEAAAKSESPAPIVDELCQKLEYLEQENATLKLELLSQSEELEIRTI 240

Query: 241 ERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESL 300
           ERDLSTQ AE ASKQHLESIKK+ KLEAECRRLK  + KPS V DHK+ AAS +I +ESL
Sbjct: 241 ERDLSTQAAEAASKQHLESIKKVAKLEAECRRLKAAACKPSSVNDHKTSAAS-SIYVESL 300

Query: 301 TDTQSDNGEQLNGVDMDIRR---TERNKCVASCSDSWSSNLL-----------VSSNLP- 360
            D+QSD+GE+LN V++D R+   +E  K   SC DSW+S L+           ++ NLP 
Sbjct: 301 PDSQSDSGEKLNAVELDARKVSCSEPYKSEQSCLDSWASTLISELNQFKNEKSINRNLPA 360

Query: 361 SSLELDLMDDFLEMERLASLPETSIRESHQEPEASARPTAE-ENAIRTELETLQHERSVM 420
           SS+E+DLMDDFLEME+LA+L E      + + EA  + + + E+++R ELE +    + +
Sbjct: 361 SSVEIDLMDDFLEMEQLAALSENETGTDNSKAEAVIKQSVDAESSLRAELEVMAKRTAEL 420

Query: 421 EEKLVEMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEM-VAMMES-------- 480
           EEKL ++E  K ELE KL+++E EK E+EE+LE +   +AE+DE+ +A+ ES        
Sbjct: 421 EEKLQKVEGEKFELEEKLQKVEGEKFELEEKLERI---KAEMDELEMALNESQDRNEASQ 480

Query: 481 -EISESGRKLATMEAE-------KAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFK 540
            ++SE+ +KL  ++ E       K ++  +L+ +EAE   + + ++  Q  ++  +    
Sbjct: 481 LQLSEAQQKLVELQEELLLTNESKQQIEFQLVSMEAEARTMSAKVNSIQGEIEKERVLSA 540

Query: 541 ETEMKLEKLQNELTFANESKLRIESQLISMEAESLTMNAK-EDLAVAAGKLAECQKTIAS 600
           E  +K  +L+ EL     S+ + E +L    + S     K ED  VAA KLAECQKTIAS
Sbjct: 541 EIALKYHELEEEL-----SRKKQEEELQQNVSSSGEPKIKQEDFDVAANKLAECQKTIAS 600

Query: 601 LGNQLKSLASLEDFLIDTTQLPEFT---------DGEEHCKHSNGTLSPRRDSDYTKGVD 660
           LGNQLKSLA+L+DFLIDT  +PEF+         +GE    HSN T SP+RDS   +  +
Sbjct: 601 LGNQLKSLATLKDFLIDTASIPEFSAGGSAIPKGNGEPWKLHSNETFSPKRDSGSLRIDN 660

Query: 661 DSSEPSLNKNENDSPPFSSSSTSSSVIVSRIVNSEKNRNGFAKFFSRTKGGIKLEI 673
           ++S P++  NE DSPP S SS++SS + S  V+SEKNRNGFAKFFSR+K GI+LEI
Sbjct: 661 ENSGPAVKINEGDSPP-SVSSSASSAVSSNHVSSEKNRNGFAKFFSRSKNGIQLEI 696

BLAST of Cp4.1LG13g04650 vs. TrEMBL
Match: A0A0B0MHL2_GOSAR (Filament-like plant protein OS=Gossypium arboreum GN=F383_15938 PE=4 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 6.8e-142
Identity = 359/733 (48.98%), Positives = 480/733 (65.48%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLS-PNHEIQSPEVTSKAV 60
           M++R W WK+KSS ++SPG+TESSGS+S+ SERFSD+QEA K S PN   +SPEV+SKA 
Sbjct: 1   MEKRSWLWKRKSS-ERSPGETESSGSISSQSERFSDDQEAFKASSPNDCTKSPEVSSKA- 60

Query: 61  CDEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENE 120
                   S   EE+NDS++ L+++LSAAL+NV AKEDLV QHAKVAEEAIAGWE AENE
Sbjct: 61  --------SAVPEEVNDSIRSLTEKLSAALVNVSAKEDLVKQHAKVAEEAIAGWEKAENE 120

Query: 121 VGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWES 180
           V +LKQ+L T VQQ SALEDRV+HLDGALKECVRQLR AREEQEQKI + V + TRDWE+
Sbjct: 121 VVVLKQKLETAVQQNSALEDRVTHLDGALKECVRQLRQAREEQEQKINEAVAKTTRDWET 180

Query: 181 IKVNLERQLLELECKADEAKCESPQN-DPSLGKMLESLKRENAALRYELHAQYRELETRT 240
            +  LE QLLEL+ KA+  K E P    P+L    E+LK+EN+AL+ EL +Q  EL+ RT
Sbjct: 181 TQFELESQLLELQNKAESVKSEPPPPFSPNLLHKFEALKQENSALKLELSSQLEELQIRT 240

Query: 241 IERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIES 300
           IERDLSTQ AETASKQHLESIK+  KLEAECRRLK +  K S   D KS AAS+ I++ES
Sbjct: 241 IERDLSTQAAETASKQHLESIKRAAKLEAECRRLKAIGSKLSFTNDCKSPAASS-INVES 300

Query: 301 LTDTQSDNGEQLNGVDMDIRRT---ERNKCVASCSDSWSSNLL-----------VSSNLP 360
              +QSD+GE+L+ VD D ++    E NK   SCSDSW+S L+           ++ N+P
Sbjct: 301 FIGSQSDSGERLHVVDTDTQKMSGLEANKGEPSCSDSWASALIAELDQFKNEKVINRNVP 360

Query: 361 SS-LELDLMDDFLEMERLASLPETSIRESHQEPEASARPTAE-ENAIRTELETLQHERSV 420
           SS +E+DLMDDFLEME+LA+LP+T       E +A+ + + + +++++ ELE + H  + 
Sbjct: 361 SSSIEIDLMDDFLEMEQLAALPDTKNENQCLESKATVKQSNDGDSSLKAELEAMIHRTTE 420

Query: 421 MEEKLVEMEEAKIELEA---------------------KLKQIEME-------KDEMEER 480
           +EEKL ++E  K ELE                      KL++++ E       K  +E +
Sbjct: 421 LEEKLEKIEAEKAELEIALAESQESLDASELELRDNELKLEELQRELSKASEAKQHLESQ 480

Query: 481 LEMMETER----AEVDEMVAMMESEISESGRKLATMEAEKAEMREKLMKLEAEKDELRSA 540
           L +MET+     A++D + A +E E + S +  A     K  +  +L+ +EAE   + + 
Sbjct: 481 LSIMETDAETMSAKIDALGAEIEKERALSVQISADANESKQLLESQLVSIEAEARMMSAK 540

Query: 541 LSQSQNSVDISQFQFKETEMKLEKLQNELTFANESKLRIESQLISMEAESLTMNAK-EDL 600
           +   +  V+  +    E  +K ++L+ EL     S+ R E++L      ++ +  K EDL
Sbjct: 541 VGSLETEVEKEKALSAEITVKCQELEEEL-----SRTRQEAELQQTANSNVEVKIKQEDL 600

Query: 601 AVAAGKLAECQKTIASLGNQLKSLASLEDFLIDTTQLPEFTDG--------EEHCKHSNG 660
           AVAAGKLAECQKTIASLG QLKSLA+LEDFLIDTT +PEF+ G        E    HSN 
Sbjct: 601 AVAAGKLAECQKTIASLGQQLKSLATLEDFLIDTTSIPEFSRGVSLIPKSSEPWKLHSNE 660

Query: 661 TLSPRRDSDYTKGVDDSSEPSLNKNEND--SPPFSSSSTSSSVIVSRIVNSEKNRNGFAK 673
           T SP+ D + T+   D S P +NKN+ +  +PP   SS+SSS++ S   + EKNRNGFAK
Sbjct: 661 TYSPKADPESTRVGADHSSPQVNKNDGNGNTPP---SSSSSSIVSSTHASLEKNRNGFAK 714
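
The percentages in each HSP summary follow directly from the counts printed beside them (for the Gossypium arboreum hit above, 359 identical and 480 positive positions over an alignment length of 733). A minimal Python sketch of that arithmetic is given here; `parse_hsp` and `HSP_RE` are hypothetical names introduced only for illustration and are not part of the database or of any BLAST tool, and the pattern assumes the two-line summary layout used on this page.

```python
import re

# Hypothetical helper (an assumption, not a tool provided by the database):
# parse the two-line HSP summary as it is printed in the reports on this page.
# The label after "Identity" is matched loosely ("Pos...") so that spelling
# variants of "Positives" are accepted.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Pos[a-z]+\s*=\s*(?P<pos>\d+)/\d+\s*\([\d.]+%\)"
)


def parse_hsp(text):
    """Return bit score, E-value and recomputed percentages for one HSP."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    ident = int(m["ident"])
    length = int(m["length"])
    pos = int(m["pos"])
    return {
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
        # Percent identity = identical positions / alignment length.
        "identity_pct": round(100 * ident / length, 2),
        # Percent positives = conservative + identical positions / length.
        "positives_pct": round(100 * pos / length, 2),
    }


# Summary lines of the Gossypium arboreum hit, copied from the report above.
example = (
    "HSP 1 Score: 512.7 bits (1319), Expect = 6.8e-142\n"
    "Identity = 359/733 (48.98%), Positives = 480/733 (65.48%), Query Frame = 1"
)
hsp = parse_hsp(example)
print(hsp)
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, is a cheap consistency check when such reports are scraped from a web page.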

BLAST of Cp4.1LG13g04650 vs. TrEMBL
Match: U5GP73_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s08400g PE=4 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 2.6e-141
Identity = 349/707 (49.36%), Positives = 468/707 (66.20%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           M++RKW WK+KSS ++SPG+T+SSGS+S++SERFSD+Q+ +K SP    QSPEVTSK + 
Sbjct: 1   MEKRKWLWKRKSS-ERSPGETDSSGSISSHSERFSDDQDPSKASPTDSAQSPEVTSKTIT 60

Query: 61  DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEV 120
            +         E++ND +K L+D+LSAAL+NV AK+DLV QH KVAEEA+AGWE AENEV
Sbjct: 61  TD---------EDVNDRIKSLTDKLSAALVNVSAKDDLVKQHVKVAEEAVAGWEKAENEV 120

Query: 121 GLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESI 180
             LK+QL   +QQK+ LEDRVSHLDGALKECVRQLR AREE E+KI + V +K+ +WESI
Sbjct: 121 TALKKQLEVAIQQKAGLEDRVSHLDGALKECVRQLRQAREELEEKIHEAVVQKSLEWESI 180

Query: 181 KVNLERQLLELECKADEAKCESPQN-DPSLGKMLESLKRENAALRYELHAQYRELETRTI 240
           K  LE Q +EL+ K   AK ESP      L + LE L++ENA L+ EL +Q  ELE RTI
Sbjct: 181 KSELENQFIELKSKEAAAKSESPAPIVDELCQKLEYLEQENATLKLELLSQSEELEIRTI 240

Query: 241 ERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESL 300
           ERDLSTQ AE ASKQHLESIKK+ KLEAECRRLK  + KPS V DHK+ AAS +I +ESL
Sbjct: 241 ERDLSTQAAEAASKQHLESIKKVAKLEAECRRLKAAACKPSSVNDHKTSAAS-SIYVESL 300

Query: 301 TDTQSDNGEQLNGVDMDIRR---TERNKCVASCSDSWSSNLL-----------VSSNLP- 360
            D+QSD+GE+LN V++D R+   +E  K   SC DSW+S L+           ++ NLP 
Sbjct: 301 PDSQSDSGEKLNAVELDARKVSCSEPYKSEQSCLDSWASTLISELNQFKNEKSINRNLPA 360

Query: 361 SSLELDLMDDFLEMERLASLPETSIRESHQEPEASARPTAE-ENAIRTELETLQHERSVM 420
           SS+E+DLMDDFLEME+LA+L E      + + EA  + + + E+++R ELE +    + +
Sbjct: 361 SSVEIDLMDDFLEMEQLAALSENETGTDNSKAEAVIKQSVDAESSLRAELEVMAKRTAEL 420

Query: 421 EEKLVEMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEM-VAMMES-------- 480
           EEKL ++E  K ELE KL+++E EK E+EE+LE +   +AE+DE+ +A+ ES        
Sbjct: 421 EEKLQKVEGEKFELEEKLQKVEGEKFELEEKLERI---KAEMDELEMALNESQDRNEASQ 480

Query: 481 -EISESGRKLATMEAE-------KAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFK 540
            ++SE+ +KL  ++ E       K ++  +L+ +EAE   + + ++  Q  ++  +    
Sbjct: 481 LQLSEAQQKLVELQEELLLTNESKQQIEFQLVSMEAEARTMSAKVNSIQGEIEKERVLSA 540

Query: 541 ETEMKLEKLQNELTFANESKLRIESQLISMEAESLTMNAK-EDLAVAAGKLAECQKTIAS 600
           E  +K  +L+ EL     S+ + E +L    + S     K ED  VAA KLAECQKTIAS
Sbjct: 541 EIALKYHELEEEL-----SRKKQEEELQQNVSSSGEPKIKQEDFDVAANKLAECQKTIAS 600

Query: 601 LGNQLKSLASLEDFLIDTTQLPEFTDGEEHCKHSNGTLSPRRDSDYTKGVDDSSEPSLNK 660
           LGNQLKSLA+L+DFLIDT  +PEF+ G                         S+ P +  
Sbjct: 601 LGNQLKSLATLKDFLIDTASIPEFSAG------------------------GSAIPKVKI 660

Query: 661 NENDSPPFSSSSTSSSVIVSRIVNSEKNRNGFAKFFSRTKGGIKLEI 673
           NE DSPP S SS++SS + S  V+SEKNRNGFAKFFSR+K GI+LEI
Sbjct: 661 NEGDSPP-SVSSSASSAVSSNHVSSEKNRNGFAKFFSRSKNGIQLEI 663

BLAST of Cp4.1LG13g04650 vs. TAIR10
Match: AT3G05270.1 (AT3G05270.1 Plant protein of unknown function (DUF869))

HSP 1 Score: 297.7 bits (761), Expect = 1.7e-80
Identity = 228/569 (40.07%), Positives = 338/569 (59.40%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           MDRR W W++KSS +KSPG+TES+GS+S++SERFSD+Q +         QSPE+ SK V 
Sbjct: 1   MDRRSWLWRRKSS-EKSPGETESTGSVSSHSERFSDDQRS---------QSPELNSKPVT 60

Query: 61  DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEV 120
                    ++EE    +K L++RLSAALLNV  KEDL  QHAKVAEEA++GWE AENE 
Sbjct: 61  ---------REEEATADIKILTERLSAALLNVSLKEDLAKQHAKVAEEAVSGWEKAENEA 120

Query: 121 GLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESI 180
             LKQQL  +  + SALEDR SHLD ALKECVRQL   REEQ QKI + +  K ++WE+ 
Sbjct: 121 AALKQQLDASTSKVSALEDRNSHLDSALKECVRQLWQGREEQNQKIEEAINNKCKEWETT 180

Query: 181 KVNLERQLLELECKADEAKCESPQNDPSLGKMLESLKRENAALRYELHAQYRELETRTIE 240
           K  LE ++ EL+ + D       ++   L   LE+L++EN+AL+ +L ++  E++ RTIE
Sbjct: 181 KSQLEARIEELQARQDVTTSSVHED---LYPKLEALEKENSALKLQLLSKSEEVKIRTIE 240

Query: 241 RDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESLT 300
           RDLSTQ AE+ASKQ LE IKK+TKLEAECR+L+VM  +           +  +  ++S  
Sbjct: 241 RDLSTQAAESASKQQLEGIKKLTKLEAECRKLRVMVRR-----------SDNSSDLKSSI 300

Query: 301 DTQSDNGEQLNGVDMDIRRTERNKCVASCSDSWSSNLLVSSNLPSSLELDLMDDFLEMER 360
           D QSD   +++  D +++             S S  ++  S++ +S+++ LMDDFLEME+
Sbjct: 301 DNQSDYSGRVSFSDNEMQ-------------SPSEKIIGKSSMATSVDIGLMDDFLEMEK 360

Query: 361 LASLPETSIRESHQEPEASA-RPTAEENAIRTELETLQHERSVMEEKLVEMEEAKIELEA 420
           LA+LP +     H E      +  A  N ++ EL+T     S +EEK+  +E  K++LE 
Sbjct: 361 LAALPHSEPGRKHSESNKELEKSNAHVNQLKHELKTSLRRISELEEKVEMVEVEKLQLEM 420

Query: 421 KLKQIEMEKDEMEERLEMMETERAEVDEMVAM---MESEISESGRKLATMEAEKAEMREK 480
            L   + + + ++ RL+ +E + +E+ ++ A    +E  + ESG+++  ++ +  + +  
Sbjct: 421 ALNGSKEQIEALQSRLKEIEGKLSEMKKLEAENQELELLLGESGKQMEDLQRQLNKAQVN 480

Query: 481 LMKLE---AEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQ--------------NEL 540
           L +LE   AEK EL   L+ ++  ++ SQ + KETE KL +LQ              + L
Sbjct: 481 LSELETRRAEKLELTMCLNGTKKQLETSQNRLKETERKLTELQTLLHLTKDAKEAAEDGL 523

Query: 541 TFANESKLRIESQL--ISMEAESLTMNAK 547
             AN     IES+L  +  EAESL +  K
Sbjct: 541 KAANGKTEAIESRLKDVEAEAESLILKIK 523

BLAST of Cp4.1LG13g04650 vs. TAIR10
Match: AT1G77580.2 (AT1G77580.2 Plant protein of unknown function (DUF869))

HSP 1 Score: 280.8 bits (717), Expect = 2.2e-75
Identity = 230/565 (40.71%), Positives = 326/565 (57.70%), Query Frame = 1

Query: 9   KKKSSSDKSPGQTESSGSMSTYSERFSDEQ-EAAKLSPNHEIQSPEVTSKAVCDEDIDDD 68
           +K+ SS++S G++ES   +S+ SE+ S+ Q E+   S + EIQSP V+ +          
Sbjct: 4   RKRESSERSFGESES---VSSLSEKDSEIQPESTMESRDDEIQSPTVSLEV--------- 63

Query: 69  SPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEVGLLKQQL 128
             ++EE+ DS+K L+++LSAAL NV AK+DLV QH KVAEEA+AGWE AENEV  LK++L
Sbjct: 64  ETEKEELKDSMKTLAEKLSAALANVSAKDDLVKQHVKVAEEAVAGWEKAENEVVELKEKL 123

Query: 129 GTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESIKVNLERQ 188
                +   LEDRVSHLDGALKECVRQLR AR+EQEQ+I+D V E+T++ +S + +LE Q
Sbjct: 124 EAADDKNRVLEDRVSHLDGALKECVRQLRQARDEQEQRIQDAVIERTQELQSSRTSLENQ 183

Query: 189 LLELECKADEAKCESPQNDPSLGKMLESLKRENAALRYELHAQYRELETRTIERDLSTQT 248
           + E   K++E           L +M ES+ +EN  LR+EL A+  ELE RTIERDLSTQ 
Sbjct: 184 IFETATKSEE-----------LSQMAESVAKENVMLRHELLARCEELEIRTIERDLSTQA 243

Query: 249 AETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESLTDTQSDNG 308
           AETASKQ L+SIKK+ KLEAECR+ ++++   +   DH+S            TD+ SD G
Sbjct: 244 AETASKQQLDSIKKVAKLEAECRKFRMLAKSSASFNDHRS------------TDSHSDGG 303

Query: 309 EQLNGVDMDIRRTERNKCVASCSDSWSSNLLVSSN----LPSSLELDLMDDFLEMERLAS 368
           E+++                SCSDSW+S+ L+         SS+ELDLM DFLEMERL +
Sbjct: 304 ERMD---------------VSCSDSWASSTLIEKRSLQGTSSSIELDLMGDFLEMERLVA 363

Query: 369 LPETSIRESHQEPEASARPTA--EENAIRTELETLQHERSVMEEKLVEMEEAKIELEAKL 428
           LPET        PE+         EN++ +E+E       V+  ++ E+EE       KL
Sbjct: 364 LPETPDGNGKSGPESVTEEVVVPSENSLASEIE-------VLTSRIKELEE-------KL 423

Query: 429 KQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKLATMEAEKAEMREKLMKLE 488
           +++E EK E+E  ++    E      +V +  SE+  S  K         E+ EKL KLE
Sbjct: 424 EKLEAEKHELENEVKCNREEA-----VVHIENSEVLTSRTK---------ELEEKLEKLE 483

Query: 489 AEKDELRS--------ALSQSQNS----VDISQFQFKETEMKLEKLQNE-LTFANESKLR 548
           AEK+EL+S        A+   +NS    +++   + KE E +LEKL+ E +   +E K  
Sbjct: 484 AEKEELKSEVKCNREKAVVHVENSLAAEIEVLTSRTKELEEQLEKLEAEKVELESEVKCN 490

BLAST of Cp4.1LG13g04650 vs. TAIR10
Match: AT1G21810.1 (AT1G21810.1 Plant protein of unknown function (DUF869))

HSP 1 Score: 233.8 bits (595), Expect = 3.1e-61
Identity = 223/668 (33.38%), Positives = 328/668 (49.10%), Query Frame = 1

Query: 94  AKEDLVNQHAKVAEEAIAGWENAENEVGLLKQQLGTTVQQKSALEDRVSHLDGALKECVR 153
           +K++LV QHAKVAE+A+AGWE AENEV  LKQ+L     +   LEDRVSHLDGALKECVR
Sbjct: 14  SKDELVKQHAKVAEDAVAGWEKAENEVVELKQKLEDAADKNIVLEDRVSHLDGALKECVR 73

Query: 154 QLRLAREEQEQKIRDTVEEKTRDWESIKVNLERQLLELECKADEAKCESPQNDPSLGKML 213
           QLR  R+EQE+ I+  V E T++  S    LE+++LEL+ +A+ AK E            
Sbjct: 74  QLRQFRDEQEKNIQAAVTESTKELHSANTGLEKRVLELQKEAEAAKSE------------ 133

Query: 214 ESLKRENAALRYELHAQYRELETRTIERDLSTQTAETASKQHLESIKKMTKLEAECRRLK 273
                 N  LR E   Q  +LE   IERDLSTQ AETASKQHL+ IKK+ KLEAECR+L+
Sbjct: 134 ------NMMLRREFLTQREDLEIVMIERDLSTQAAETASKQHLDIIKKLAKLEAECRKLR 193

Query: 274 VMSYKPSLVYDHKSIAASTTISIESLTDTQSDNGEQLNGVDMDIRRTERNKCVASCSDSW 333
           +++              S+++S     D+ SD G              R +   SCSDSW
Sbjct: 194 ILA------------KTSSSLSSNQSVDSHSDGG--------------RERVEGSCSDSW 253

Query: 334 SSNLLVSS---------------NLPSSLELDLMDDFLEMERLASLP-ETSIRESHQEPE 393
           +S+  +S                   SS E+DLMDDFLEMERL +LP ET  + S    E
Sbjct: 254 ASSAFISELDQIKNEKGGNRSLQGTTSSTEIDLMDDFLEMERLVALPTETQAKNSKDGYE 313

Query: 394 ASARPTAE-------------------ENAIRTELETLQHERSVMEEKLVEMEEAKIELE 453
            S     E                   E  +  E+E +  ++  +E+ L  +E  K EL+
Sbjct: 314 LSLMEKLEKIQAEKDDLEREVKCCREAEKRLSLEIEAVVGDKMELEDMLKRVEAEKAELK 373

Query: 454 A-------KLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKLATMEAEKA 513
                   K ++  +   E++ +LE ++ E+ E+D  V   +        +L  +  +K 
Sbjct: 374 TSFDVLKDKYQESRVCFQEVDTKLEKLQAEKDELDSEVICCKEAEKRFSLELEAVVGDKI 433

Query: 514 EMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFANESKLRIES 573
           EM ++L K+EAEK EL+ +    ++    S+  F+E EMKLE ++ EL  ANESK + ES
Sbjct: 434 EMEDELEKMEAEKAELKISFDVIKDQYQESRVCFQEVEMKLEAMKRELKLANESKTQAES 493

Query: 574 QLISMEAESLTMNAKEDLAVAAGKLAECQKTIASLGNQLKSLASLEDFLID--------T 633
           ++  MEAE      +++  V+ G   +C+     L  +++    ++   ++         
Sbjct: 494 RVTRMEAE-----VRKERIVSDGLKEKCETFEEELRREIEEKTMIKREKVEPKIKQEDIA 553

Query: 634 TQLPEFTDGEEHCKHSNGTLSPRRDSDYTKG---VDDSSEPS-----------LNKNEND 673
           T   +F D    C+ +  +L  +  S  T     +D +S P            L K+ ++
Sbjct: 554 TAAGKFAD----CQKTIASLGKQLQSLATLEEFLIDTASIPGSARSVHNKEALLGKDPHE 613

BLAST of Cp4.1LG13g04650 vs. TAIR10
Match: AT4G36120.1 (AT4G36120.1 Plant protein of unknown function (DUF869))

HSP 1 Score: 149.1 bits (375), Expect = 1.0e-35
Identity = 107/302 (35.43%), Positives = 163/302 (53.97%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFS-----DEQEAAKLSPNHEIQSPEVT 60
           M+ R WPWK+KSS DK+  +    G  ST     S     + QE  K +   +I     T
Sbjct: 1   MEGRGWPWKRKSS-DKATTEKPVVGIESTPVCSLSYLASLENQEKCKNTNYVQITMDSYT 60

Query: 61  SKAVCDEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWEN 120
             +  ++ +     K  E+   VK L ++L+ A   +  KE L+ QHAKVAEEA++GWE 
Sbjct: 61  HMSRMEDQV-----KLFEVQ--VKDLKEKLTLAHSEINTKESLILQHAKVAEEAVSGWEK 120

Query: 121 AENEVGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTR 180
           A+ E   LK+QL +    K   EDR SHLD ALKEC RQ+R+ +EE ++K++D +  KT 
Sbjct: 121 ADAETLALKRQLESVTLLKLTAEDRASHLDDALKECTRQIRIVKEESDKKLQDVILAKTS 180

Query: 181 DWESIKVNLERQLLELE------------------------CKADEAKCESPQNDPSLGK 240
            W+ IK  LE ++ EL                          +  E + ++  +   L  
Sbjct: 181 QWDKIKAELEGKIDELSEGLHRAASDNAALTRSLQERSEMIVRISEERSKAEADVEKLKT 240

Query: 241 MLESLKRENAALRYELHAQYRELETRTIERDLSTQTAETASKQHLESIKKMTKLEAECRR 274
            L+  ++E + L+Y+LH   +E+E R  E+++S ++A+ A+KQHLE +KK+ KLEAEC R
Sbjct: 241 NLQLAEKEISYLKYDLHVASKEVEIRNEEKNMSLKSADIANKQHLEGVKKIAKLEAECHR 294

BLAST of Cp4.1LG13g04650 vs. TAIR10
Match: AT1G19835.1 (AT1G19835.1 Plant protein of unknown function (DUF869))

HSP 1 Score: 142.9 bits (359), Expect = 7.2e-34
Identity = 156/509 (30.65%), Positives = 233/509 (45.78%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           MDR+ WPWKKKSS +K+   TE              +QE  K     +I   + T+    
Sbjct: 1   MDRKSWPWKKKSS-EKTATVTEVV------------DQENGKKPSYIQISFDQYTNLNGL 60

Query: 61  DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEV 120
            +++     K  ++ D +K L  +LS A  ++ AKE LV QH+KVAEEA+ GWE AE E 
Sbjct: 61  KDEVKSYEEKVTKLEDQIKDLDLKLSTANADIVAKEVLVKQHSKVAEEAVTGWEKAEAEA 120

Query: 121 GLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESI 180
             LK  L T    K  +EDR +HLDGALKEC+RQ+R  +EE EQK+ D +  KT   +++
Sbjct: 121 SALKTHLETITLAKLTVEDRAAHLDGALKECMRQIRSLKEENEQKLHDVIATKTNQMDNL 180

Query: 181 KVNLERQLLE-----LECKAD-------------------EAKCESPQNDPSLGKMLESL 240
           +   E ++ E     L C A+                   E K ++      L   +ES 
Sbjct: 181 RAEFESRIGEYEEELLRCGAENDALSRSLQERSNMLMRISEEKSQAESEIEHLKNNIESC 240

Query: 241 KRENAALRYELHAQYRELETRTIERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMS 300
           +RE   L+YE H   +ELE R  E+++S ++AE A+KQHLE +KK+ KLEAEC+RL+ + 
Sbjct: 241 EREINTLKYETHVITKELEIRNEEKNMSMRSAEAANKQHLEGVKKIAKLEAECQRLRTLV 300

Query: 301 YKPSLVYDHKSIAASTTISIESL--TDTQSDNGEQLNGVDMDIRRTERNKCVASCSDSWS 360
            K        +  A   + +ESL   D + D+              +R   V   S   S
Sbjct: 301 RKK---LPGPAALAQMKMEVESLGFGDHRQDH-------------RQRRSPVRPSSPLMS 360

Query: 361 SNLLVSSNLPSSLELDLMDDF-----LEMERLASLPETSIRESHQEPEASARPTAEENAI 420
               +S    S   LD M  F     L  ERL ++ E    E+    EA A+  +E    
Sbjct: 361 PMSHMSQ--VSEFSLDNMQKFHKENDLLTERLLAMEE----ETKMLKEALAKRNSELQVS 420

Query: 421 RTELETLQHERSVMEEKLVEMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEMV 478
           R       +    +E +++     K   E   +    +       +  M  +  E    V
Sbjct: 421 RNLCAKTANRLQTLEAQMMSKSPTKRGFEMPAEIFSRQNASNPPSMASMSEDGNEDARSV 474

BLAST of Cp4.1LG13g04650 vs. NCBI nr
Match: gi|778715691|ref|XP_011657435.1| (PREDICTED: filament-like plant protein 3 [Cucumis sativus])

HSP 1 Score: 755.4 bits (1949), Expect = 8.7e-215
Identity = 446/561 (79.50%), Positives = 487/561 (86.81%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           M+RRKWPWK+KSS DKSPG+TESSGSMS+YSERFSDEQ+AAK SPNHE QSPEV+SKA+C
Sbjct: 1   MERRKWPWKRKSS-DKSPGETESSGSMSSYSERFSDEQDAAKSSPNHETQSPEVSSKAIC 60

Query: 61  -DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENE 120
            +EDIDDD PKQEEINDSVK LS+RLSAAL+NVKAKEDLV QHAKVAEEAIAGWE AENE
Sbjct: 61  KEEDIDDDLPKQEEINDSVKSLSERLSAALVNVKAKEDLVKQHAKVAEEAIAGWEKAENE 120

Query: 121 VGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWES 180
           V  LKQQLGTTVQQKSALEDRVSHLDGALKECVRQLR AREEQEQKI D VEEKTRDW+S
Sbjct: 121 VTHLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRQAREEQEQKIHDAVEEKTRDWQS 180

Query: 181 IKVNLERQLLELECKADEAKCESPQNDPSLGKMLESLKRENAALRYELHAQYRELETRTI 240
            KV+LERQLL L+  AD AKCESP+ DPSLGKMLE LKRENAALR+ELHAQYRELETRTI
Sbjct: 181 TKVDLERQLLALQSIADTAKCESPKVDPSLGKMLELLKRENAALRHELHAQYRELETRTI 240

Query: 241 ERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESL 300
           ERDLSTQTAETASKQHLESIKKM KLEAECRRLK MS KPS V DHKSIAAS TISIESL
Sbjct: 241 ERDLSTQTAETASKQHLESIKKMAKLEAECRRLKFMSCKPSFV-DHKSIAAS-TISIESL 300

Query: 301 TDTQSDNGEQLNGVDMDIRRTERNKCVASCSDSWSSNLL-----------VSSNLPSSLE 360
           TDTQSDNGEQL+ VD+DI RTERNK   SCS   +S LL           VSSNLPSSLE
Sbjct: 301 TDTQSDNGEQLSAVDIDI-RTERNKGEPSCSHPRASTLLAELNQLGNEKAVSSNLPSSLE 360

Query: 361 LDLMDDFLEMERLASLPETSIRESHQEPEASARPTAEENAIRTELETLQHERSVMEEKLV 420
           LDLMDDFLEMERLASLPET   +S QE EA  R TAEENA+RTELE L+HERS+ME+KL 
Sbjct: 361 LDLMDDFLEMERLASLPETDTGKSRQESEAFPRSTAEENALRTELEALRHERSLMEKKLG 420

Query: 421 EMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKLATMEA 480
           EMEEAKIELE KLKQ+E+EKDE+EERLEMME ER E ++M+A ME++  E G+KL  ME 
Sbjct: 421 EMEEAKIELEEKLKQMEVEKDELEERLEMMEIERDEANQMLAKMETKQYELGQKLVKMEE 480

Query: 481 EKAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFANESKLR 540
           EK EM EKLMKLE +KDEL +ALS+SQNSV+ISQFQ KET+MKLEKLQNELT A+ESKLR
Sbjct: 481 EKVEMGEKLMKLETQKDELETALSRSQNSVEISQFQLKETQMKLEKLQNELTIADESKLR 540

Query: 541 IESQLISMEAESLTMNAKEDL 550
           IESQLISMEAESLTM+AK ++
Sbjct: 541 IESQLISMEAESLTMSAKVEM 557

BLAST of Cp4.1LG13g04650 vs. NCBI nr
Match: gi|659133397|ref|XP_008466711.1| (PREDICTED: filament-like plant protein 3 [Cucumis melo])

HSP 1 Score: 753.1 bits (1943), Expect = 4.3e-214
Identity = 444/561 (79.14%), Positives = 486/561 (86.63%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           M+RRKWPWK+KSS DKSPG+TESSGSMS+YSERFSDEQ+AAK SPNHE QSPEVTSKA+C
Sbjct: 1   MERRKWPWKRKSS-DKSPGETESSGSMSSYSERFSDEQDAAKSSPNHETQSPEVTSKAIC 60

Query: 61  -DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENE 120
            +EDIDDD PKQEEINDSVK LS+RLSAAL+NVKAKEDLV QHAKVAEEAIAGWE AENE
Sbjct: 61  KEEDIDDDLPKQEEINDSVKSLSERLSAALVNVKAKEDLVKQHAKVAEEAIAGWEKAENE 120

Query: 121 VGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWES 180
           V  LKQQLGTTVQQKSALE+RVSHLDGALKECVRQLR AREEQEQKI D VEEK RDWES
Sbjct: 121 VTHLKQQLGTTVQQKSALENRVSHLDGALKECVRQLRQAREEQEQKIHDAVEEKIRDWES 180

Query: 181 IKVNLERQLLELECKADEAKCESPQNDPSLGKMLESLKRENAALRYELHAQYRELETRTI 240
            KV+LERQLL L+ KAD AKCESP+ DPS+GK LE LKRENAALR+ELHAQYRELETRTI
Sbjct: 181 TKVDLERQLLALQSKADTAKCESPKVDPSIGKRLELLKRENAALRHELHAQYRELETRTI 240

Query: 241 ERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESL 300
           ERDLSTQTAETASKQHLESIKKM KLEAECRRLK MS KPSLV DHKSIAAS TISIESL
Sbjct: 241 ERDLSTQTAETASKQHLESIKKMAKLEAECRRLKFMSCKPSLV-DHKSIAAS-TISIESL 300

Query: 301 TDTQSDNGEQLNGVDMDIRRTERNKCVASCSDSWSSNLL-----------VSSNLPSSLE 360
           TDTQSDNGEQL+ VD++I RTERNK   SCS   +S LL           VSSNLPSSLE
Sbjct: 301 TDTQSDNGEQLSAVDIEI-RTERNKGEPSCSHPRASTLLAELNQLGNEKPVSSNLPSSLE 360

Query: 361 LDLMDDFLEMERLASLPETSIRESHQEPEASARPTAEENAIRTELETLQHERSVMEEKLV 420
           LDLMDDFLEMERLASLPET   +S QE EA  R TAEENA+RTELE L+HERS+MEEKL 
Sbjct: 361 LDLMDDFLEMERLASLPETDTGKSRQESEAFPRSTAEENALRTELEALRHERSLMEEKLG 420

Query: 421 EMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKLATMEA 480
           EMEEAKIELE KLKQ+E+EKDE+EERLEMME ER E ++M+A ME+E  + G++L  ME 
Sbjct: 421 EMEEAKIELEEKLKQMEVEKDELEERLEMMEIERDEANQMLAKMETEQYKLGQELVKMEE 480

Query: 481 EKAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFANESKLR 540
           EK EM EKLMKLE +KDEL +ALS+SQNSV++SQFQ KET+MKLEKLQNELT  NESKLR
Sbjct: 481 EKVEMGEKLMKLETQKDELETALSRSQNSVELSQFQLKETQMKLEKLQNELTVGNESKLR 540

Query: 541 IESQLISMEAESLTMNAKEDL 550
           IESQLISMEAESLTM+AK ++
Sbjct: 541 IESQLISMEAESLTMSAKVEM 557

BLAST of Cp4.1LG13g04650 vs. NCBI nr
Match: gi|802636036|ref|XP_012078245.1| (PREDICTED: filament-like plant protein isoform X2 [Jatropha curcas])

HSP 1 Score: 553.1 bits (1424), Expect = 6.5e-154
Identity = 362/698 (51.86%), Positives = 477/698 (68.34%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           M++RKW WK+KSS ++SPG+TESSGS+S+ SERFSDEQ+  K SPN+E QSPEVTSK V 
Sbjct: 1   MEKRKWLWKRKSS-ERSPGETESSGSISSQSERFSDEQDNLKASPNNETQSPEVTSKTVV 60

Query: 61  DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEV 120
                    + E++NDSV+ L+++LSAAL+NV AK+DLV QH+KVAEEA+AGWE AENEV
Sbjct: 61  ---------RDEDVNDSVRILTEKLSAALVNVSAKDDLVKQHSKVAEEAVAGWEKAENEV 120

Query: 121 GLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESI 180
             LK+QL   +QQ  ALEDRVSHLDGALKECVRQLR AREE E+K+ + V +KT +WES+
Sbjct: 121 AALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQAREEHEEKVYEAVTKKTIEWESV 180

Query: 181 KVNLERQLLELECKADEAKCESP-QNDPSLGKMLESLKRENAALRYELHAQYRELETRTI 240
           K  LE QLLEL+ KA+  K ESP Q  P L   LE L+++NA+L+ E+ +   ELE R I
Sbjct: 181 KSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEKDNASLKLEILSLSEELELRII 240

Query: 241 ERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESL 300
           ERDLSTQ AETASKQHL+SIKK+ KLEAECRRLK ++ K S + DHK+  AS+ + +ESL
Sbjct: 241 ERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACKSSSLNDHKTSIASS-MYVESL 300

Query: 301 TDTQSDNGEQLNGVDMDIRRT---ERNKCVASCSDSWSSNLL-----------VSSNLP- 360
           TD+QSD+GE+LN V++D  +    E +KC  SCSDSW+S L+           V+ NLP 
Sbjct: 301 TDSQSDSGERLNAVELDAHKISCLEPSKCEPSCSDSWASALIAELDQFKNEKAVNRNLPA 360

Query: 361 SSLELDLMDDFLEMERLASLPETSIRESHQEPEASARPTAE-ENAIRTELETLQHERSVM 420
           SS+E+DLMDDFLEMERLASLPE        EPE  A  + + E+++R ELE + H  + +
Sbjct: 361 SSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQSTDVESSLRAELEIMIHRTAEL 420

Query: 421 EEKLVEMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRKL 480
           E++L +ME  K+ELE KL++I +E+ E+E  L +   +  E    +   E ++ +  ++L
Sbjct: 421 EKQLQKMEGEKVELEEKLEKILVERTELEMSLTISREKNEEFQIQLGEAELKMKQLHQEL 480

Query: 481 ATMEAEKAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFAN 540
           +     K ++  +L+ +E E   + S +   +  ++  +    E  +K   L+ EL+  N
Sbjct: 481 SIANESKQQIESQLVSMEVEARTMASKVDSLEAELEKEKVLSAELAVKCRTLEEELSEKN 540

Query: 541 ESKLRIESQLISMEAESLTMNAKEDLAVAAGKLAECQKTIASLGNQLKSLASLEDFLIDT 600
           +    +E Q  +     L +  +EDLAVAAGKLAECQKTIASLG QLKSLA+LEDFLIDT
Sbjct: 541 KE---VELQKSASSNGELKIK-QEDLAVAAGKLAECQKTIASLGKQLKSLATLEDFLIDT 600

Query: 601 TQLPEFTDG--------EEHCK-HSNGTLSPRRDSDYTKGVDDSSEPSLNKNENDSPPFS 660
             LPEFT G        EE  K HS+ TLSP+RDS  ++   ++S PS+NKNE  S P S
Sbjct: 601 ASLPEFTAGGALMPKATEEPWKLHSSDTLSPKRDSSSSRIASENSGPSVNKNEGHSTPSS 660

Query: 661 SSSTSSSVIVSRIVNSEKNRNGFAKFFSRTKGGIKLEI 673
           SSS SS  + S  +NSEKNRNGFAKFFSR K GI+LEI
Sbjct: 661 SSSASS--VSSIHINSEKNRNGFAKFFSRNKNGIQLEI 681

BLAST of Cp4.1LG13g04650 vs. NCBI nr
Match: gi|802635967|ref|XP_012078241.1| (PREDICTED: filament-like plant protein isoform X1 [Jatropha curcas])

HSP 1 Score: 548.5 bits (1412), Expect = 1.6e-152
Identity = 362/699 (51.79%), Positives = 477/699 (68.24%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEA-AKLSPNHEIQSPEVTSKAV 60
           M++RKW WK+KSS ++SPG+TESSGS+S+ SERFSDEQ+   K SPN+E QSPEVTSK V
Sbjct: 1   MEKRKWLWKRKSS-ERSPGETESSGSISSQSERFSDEQQDNLKASPNNETQSPEVTSKTV 60

Query: 61  CDEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENE 120
                     + E++NDSV+ L+++LSAAL+NV AK+DLV QH+KVAEEA+AGWE AENE
Sbjct: 61  V---------RDEDVNDSVRILTEKLSAALVNVSAKDDLVKQHSKVAEEAVAGWEKAENE 120

Query: 121 VGLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWES 180
           V  LK+QL   +QQ  ALEDRVSHLDGALKECVRQLR AREE E+K+ + V +KT +WES
Sbjct: 121 VAALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQAREEHEEKVYEAVTKKTIEWES 180

Query: 181 IKVNLERQLLELECKADEAKCESP-QNDPSLGKMLESLKRENAALRYELHAQYRELETRT 240
           +K  LE QLLEL+ KA+  K ESP Q  P L   LE L+++NA+L+ E+ +   ELE R 
Sbjct: 181 VKSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEKDNASLKLEILSLSEELELRI 240

Query: 241 IERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIES 300
           IERDLSTQ AETASKQHL+SIKK+ KLEAECRRLK ++ K S + DHK+  AS+ + +ES
Sbjct: 241 IERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACKSSSLNDHKTSIASS-MYVES 300

Query: 301 LTDTQSDNGEQLNGVDMDIRRT---ERNKCVASCSDSWSSNLL-----------VSSNLP 360
           LTD+QSD+GE+LN V++D  +    E +KC  SCSDSW+S L+           V+ NLP
Sbjct: 301 LTDSQSDSGERLNAVELDAHKISCLEPSKCEPSCSDSWASALIAELDQFKNEKAVNRNLP 360

Query: 361 -SSLELDLMDDFLEMERLASLPETSIRESHQEPEASARPTAE-ENAIRTELETLQHERSV 420
            SS+E+DLMDDFLEMERLASLPE        EPE  A  + + E+++R ELE + H  + 
Sbjct: 361 ASSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQSTDVESSLRAELEIMIHRTAE 420

Query: 421 MEEKLVEMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEMVAMMESEISESGRK 480
           +E++L +ME  K+ELE KL++I +E+ E+E  L +   +  E    +   E ++ +  ++
Sbjct: 421 LEKQLQKMEGEKVELEEKLEKILVERTELEMSLTISREKNEEFQIQLGEAELKMKQLHQE 480

Query: 481 LATMEAEKAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFKETEMKLEKLQNELTFA 540
           L+     K ++  +L+ +E E   + S +   +  ++  +    E  +K   L+ EL+  
Sbjct: 481 LSIANESKQQIESQLVSMEVEARTMASKVDSLEAELEKEKVLSAELAVKCRTLEEELSEK 540

Query: 541 NESKLRIESQLISMEAESLTMNAKEDLAVAAGKLAECQKTIASLGNQLKSLASLEDFLID 600
           N+    +E Q  +     L +  +EDLAVAAGKLAECQKTIASLG QLKSLA+LEDFLID
Sbjct: 541 NKE---VELQKSASSNGELKIK-QEDLAVAAGKLAECQKTIASLGKQLKSLATLEDFLID 600

Query: 601 TTQLPEFTDG--------EEHCK-HSNGTLSPRRDSDYTKGVDDSSEPSLNKNENDSPPF 660
           T  LPEFT G        EE  K HS+ TLSP+RDS  ++   ++S PS+NKNE  S P 
Sbjct: 601 TASLPEFTAGGALMPKATEEPWKLHSSDTLSPKRDSSSSRIASENSGPSVNKNEGHSTPS 660

Query: 661 SSSSTSSSVIVSRIVNSEKNRNGFAKFFSRTKGGIKLEI 673
           SSSS SS  + S  +NSEKNRNGFAKFFSR K GI+LEI
Sbjct: 661 SSSSASS--VSSIHINSEKNRNGFAKFFSRNKNGIQLEI 682

BLAST of Cp4.1LG13g04650 vs. NCBI nr
Match: gi|743845598|ref|XP_011027524.1| (PREDICTED: filament-like plant protein isoform X3 [Populus euphratica])

HSP 1 Score: 533.5 bits (1373), Expect = 5.4e-148
Identity = 356/715 (49.79%), Positives = 483/715 (67.55%), Query Frame = 1

Query: 1   MDRRKWPWKKKSSSDKSPGQTESSGSMSTYSERFSDEQEAAKLSPNHEIQSPEVTSKAVC 60
           M++RKW WK+KSS ++SPG+T+SSGS+S++SERFSD+QE +K SP    QSPEVTSK + 
Sbjct: 1   MEKRKWLWKRKSS-ERSPGETDSSGSISSHSERFSDDQEPSKASPTDSAQSPEVTSKTIT 60

Query: 61  DEDIDDDSPKQEEINDSVKGLSDRLSAALLNVKAKEDLVNQHAKVAEEAIAGWENAENEV 120
            +         E++ND +K L+D+LSAAL+NV AK+DLV QH KVAEEA+AGWE AENEV
Sbjct: 61  TD---------EDVNDRIKSLTDKLSAALVNVSAKDDLVKQHVKVAEEAVAGWEKAENEV 120

Query: 121 GLLKQQLGTTVQQKSALEDRVSHLDGALKECVRQLRLAREEQEQKIRDTVEEKTRDWESI 180
             LK+QL   +QQK+ LEDRVSHLDGALKECVRQLR AREE E+KI + V +K+ +WESI
Sbjct: 121 TALKKQLEVAIQQKAGLEDRVSHLDGALKECVRQLRQAREELEEKIHEAVVQKSLEWESI 180

Query: 181 KVNLERQLLELECKADEAKCESPQND-PSLGKMLESLKRENAALRYELHAQYRELETRTI 240
           K  LE Q +EL+ K    K ESP      L + LE L++ENA L+ EL +Q  ELE RTI
Sbjct: 181 KSELENQFIELKTKEPATKSESPAPIVDELCQKLEYLEQENATLKLELLSQSEELEIRTI 240

Query: 241 ERDLSTQTAETASKQHLESIKKMTKLEAECRRLKVMSYKPSLVYDHKSIAASTTISIESL 300
           ERDLSTQ AE ASKQHLESIKK+ KLEAECRRLK  + KPS V DHK+ AAS+ I +ESL
Sbjct: 241 ERDLSTQAAEAASKQHLESIKKVAKLEAECRRLKAAACKPSSVNDHKTSAASS-IYVESL 300

Query: 301 TDTQSDNGEQLNGVDMDIRR---TERNKCVASCSDSWSSNLL-----------VSSNLP- 360
            D+QSD+GE+LN V++D R+   ++  K   +C DSW+S L+           ++ NLP 
Sbjct: 301 PDSQSDSGEKLNAVELDARKVSCSDPYKSEQNCLDSWASTLISELNQFKNEKSINRNLPA 360

Query: 361 SSLELDLMDDFLEMERLASLPETSIRESHQEPEASARPTAE-ENAIRTELETLQHERSVM 420
           SS+E+DLMDDFLEME+LA+L E      + + EA  + + + E+ +R ELE +    + +
Sbjct: 361 SSVEIDLMDDFLEMEQLAALSENETGTDYSKAEAVIKQSVDAESLLRAELEVMAKRTAEL 420

Query: 421 EEKLVEMEEAKIELEAKLKQIEMEKDEMEERLEMMETERAEVDEM-VAMMESE------- 480
           EEKL ++E  K ELE KL+++E EK E+EE+LE +   +AE+DE+ +A+ ES+       
Sbjct: 421 EEKLQKVEGEKFELEEKLQKVEGEKFELEEKLERI---KAEMDELEMALNESQDRNEASQ 480

Query: 481 --ISESGRKLATMEAE-------KAEMREKLMKLEAEKDELRSALSQSQNSVDISQFQFK 540
             +SE+ +KL  ++ E       K ++  +L+ +EAE   + + ++  Q  +D  +    
Sbjct: 481 LQLSEAQQKLVELQEELLLTNESKQQIEFQLVSMEAEARTMSAKVNSIQGEIDKERVLSA 540

Query: 541 ETEMKLEKLQNELTFANESKLRIESQLISMEAESLTMNAKEDLAVAAGKLAECQKTIASL 600
           E  +K  +L+ EL+   + +   ++   S E +   +  +ED  VAA KLAECQKTIASL
Sbjct: 541 EIALKYHELEEELSRKKQEEEHQQNVSSSGEPK---IKQQEDFDVAANKLAECQKTIASL 600

Query: 601 GNQLKSLASLEDFLIDTTQLPEFT---------DGEEHCKHSNGTLSPRRDSDYTKGVDD 660
           GNQLKSLA+L+DFLIDT  +PEF+         +GE    HSN T SP+RDS   +   +
Sbjct: 601 GNQLKSLATLKDFLIDTASIPEFSTEGSAIPKGNGEPWKLHSNETFSPKRDSGSLRIDFE 660

Query: 661 SSEPSLNKNENDSPPFSSSSTSSSVIVSRIVNSEKNRNGFAKFFSRTKGGIKLEI 673
           +S P++N NE DSPP S SS++SS + S  V+SEKNRNGFAKFFSR+K GI+LEI
Sbjct: 661 NSGPAVNINEGDSPP-SVSSSASSAVSSNHVSSEKNRNGFAKFFSRSKNGIQLEI 697

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)   Description
FPP_SOLLC    1.8e-95   44.87          Filament-like plant protein (Fragment) OS=Solanum lycopersicum GN=FPP PE=1 SV=1
FPP3_ARATH   3.1e-79   40.07          Filament-like plant protein 3 OS=Arabidopsis thaliana GN=FPP3 PE=2 SV=2
FPP1_ARATH   3.9e-74   40.71          Filament-like plant protein 1 OS=Arabidopsis thaliana GN=FPP1 PE=2 SV=1
FPP2_ARATH   5.5e-60   33.38          Filament-like plant protein 2 OS=Arabidopsis thaliana GN=FPP2 PE=1 SV=1
FPP5_ARATH   1.8e-34   35.43          Filament-like plant protein 5 OS=Arabidopsis thaliana GN=FPP5 PE=2 SV=2
Match Name         E-value    Identity (%)   Description
A0A0A0KDU7_CUCSA   6.1e-215   79.50          Uncharacterized protein OS=Cucumis sativus GN=Csa_6G385120 PE=4 SV=1
A0A067KCT1_JATCU   4.6e-154   51.86          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12095 PE=4 SV=1
B9GU37_POPTR       6.4e-148   50.14          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s08400g PE=4 SV=1
A0A0B0MHL2_GOSAR   6.8e-142   48.98          Filament-like plant protein OS=Gossypium arboreum GN=F383_15938 PE=4 SV=1
U5GP73_POPTR       2.6e-141   49.36          Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s08400g PE=4 SV=1
Match Name    E-value   Identity (%)   Description
AT3G05270.1   1.7e-80   40.07          Plant protein of unknown function (DUF869)
AT1G77580.2   2.2e-75   40.71          Plant protein of unknown function (DUF869)
AT1G21810.1   3.1e-61   33.38          Plant protein of unknown function (DUF869)
AT4G36120.1   1.0e-35   35.43          Plant protein of unknown function (DUF869)
AT1G19835.1   7.2e-34   30.65          Plant protein of unknown function (DUF869)
Match Name                         E-value    Identity (%)   Description
gi|778715691|ref|XP_011657435.1|   8.7e-215   79.50          PREDICTED: filament-like plant protein 3 [Cucumis sativus]
gi|659133397|ref|XP_008466711.1|   4.3e-214   79.14          PREDICTED: filament-like plant protein 3 [Cucumis melo]
gi|802636036|ref|XP_012078245.1|   6.5e-154   51.86          PREDICTED: filament-like plant protein isoform X2 [Jatropha curcas]
gi|802635967|ref|XP_012078241.1|   1.6e-152   51.79          PREDICTED: filament-like plant protein isoform X1 [Jatropha curcas]
gi|743845598|ref|XP_011027524.1|   5.4e-148   49.79          PREDICTED: filament-like plant protein isoform X3 [Populus euphratica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR008587   FPP_plant
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG13g04650.1   Cp4.1LG13g04650.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description               Source    Source Term     Source Description                      Alignment
IPR008587   Filament-like plant protein   PFAM      PF05911         FPP                                     coord: 95..200, score: 1.5E-36; coord: 209..275, score: 5.8
None        No IPR available              unknown   Coil            Coil                                    coord: 181..201, score: -; coord: 213..233, score: -; coord: 152..172, score: -; coord: 386..532, scor
None        No IPR available              PANTHER   PTHR31580       FAMILY NOT NAMED                        coord: 2..672, score: 1.4E
None        No IPR available              PANTHER   PTHR31580:SF2   FILAMENT-LIKE PLANT PROTEIN 1-RELATED   coord: 2..672, score: 1.4E
None        No IPR available              unknown   SSF57997        Tropomyosin                             coord: 387..574, score: 2.4

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                    Block
Cp4.1LG13g04650   Cp4.1LG01g20390   Cucurbita pepo (Zucchini)   cpecpeB199