Csa2G404900 (gene) Cucumber (Chinese Long) v2

Name: Csa2G404900
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Myosin-4; contains IPR008545 (Protein of unknown function DUF827, plant)
Location: Chr2 : 20859225 .. 20862437 (+)
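
The Location field can be turned into a chromosome, a coordinate pair, and a strand. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of the database, and it assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr2 : 20859225 .. 20862437 (+)'.

    Assumes 1-based, inclusive coordinates (GFF3-style); that is an
    assumption, not something the record itself confirms.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates: both end points count.
        "length": end - start + 1,
    }

loc = parse_location("Chr2 : 20859225 .. 20862437 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr2 + 3213
```

Under that assumption the gene spans 3213 bp on the forward strand of Chr2.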
The following sequences are available for this feature:

Gene sequence (with intron)

GCTCTCTGTTATCTGTTGCATTGCGATTGTGGTTCATCTTTGCTATGCCCTTCTTCACTTAAACCCATTCCCCAATCTCTCCACAACCACAATCCCCCATGCCTTTCCCTCCCAATTTTCAGGTAATATCCATTTCTGCTATCATCGTCTTTTCATTTTTTTCCCATTTTTTTTTTTTTTTTACCTTTAATGGACTGACCACCAAGCCTATGGAATGCCGGTTTTTGCTTTACTTTATTCCCTACCTTCTTCAAGTCTTTGGTTCTGAGAATTTTATGATTTTCTTTTGAGCTTAACCTTTTTGTCATTGGTCTATCCGTGATTTCTTTTCTTTTTTTGTAAAAAAAATAATATTGTAAGTAAGGAGAGAGGGCGATACCCTCTGCTGATACTGACTTTGAGGTTGATAGTACAAGTAGTGAATCTGTTAAAGGACGCTCGTGTTGGGTCAACCAATGATTTAAGAGAACCTTTCCTTCTTGTGGATGAATGAATAGGTGGAGATGGAGAGGAGAGAGTTTGACAGCAAAATTAGAGGTGGATTAGTCAGAGCAGCTGTTAATCAATATGGAGATGGTAAAGAGAATGGTATTTCATGGAAGAATTCACTCACCCAAGATTCCCCAGAGGTATGCAAATGCATGATTTTAAGAGATTATAGGTGGTTTCAAATGGGCATTACTTTACATTTCATGTCTTTTTGACCTGTCTTTTTTAGAAGCATTTAACTGATTTTCTTTCTATAATAAGTGCTTCTACGAGGATTTTTCGTAAAAGTGCTTCCATACTCACTCAACATGCGTTTAGATTATAAAGATATTAAATGCTTCCACTGACCAACCATCCAAAATGTATTTAAAATATAAAAAAAAAAGTAATATATTTGAGTGAATTCTATTATGGATATGAAGTTCATCACATATATTAAAAAGTGAATTAAAATATTTTGAAAAGACATGACAAATTATGATTAGATAATTTGATGGATACTTTATTCTATTTTAGAATGCACTACAATATTTTCCTTTTAAAAAGTACTCTCCAATGTAGCATCTGGAAAAGGTTTAAATGGAAAGTAGTTAAATACCATGTTATCCAATTTCAGTATTCCTTAAAGGCAAGAGAGCTCCAAAAGGCAAAGACAGACATTGACCATTATAAAAAGAGTAGAAATGCAGCAGATTCCTCTAGTGCCCAAGCTCAACTTGAACTTCTCAATGCCAAGAACACAGTGAAAAAGCTCTCCTCACTTTTTGACAAATCAAATGCCATGGCTCGGGCGCATAAGCAGGAACTCGAAACGTTGAAGAAGTCAGCTTCTGTCCAGGGCACGCGGTTGGCTGTGGCAAGCAGTGAAAATCGTGAGTATGCAGAATTGATGCGGGAACTGGAATCTGCAAAACTAGAATTAAGCAAACTCAAACTTGATATGGCTTCTGTTTTTCATGAGAAATTGCTGGCAGAGAAGGAAAAAGAAGAAACCATTTCGAAATTTCAATCCCTGTCGAGCTCTATTGAAGAGCTAAGGAAGGAGATCGATGAAATAAATGAAGAACAAGTACTAGTTGAGTTAGCTCAGATAGAGGCTTTGAAGGAGTTTCAAGAGATAGAAGCCCAGCGAAGCATGGAAGCCAAAGAATTCTTATGTGCCATTGAAAACAAGAGGAAGATTATTGATGAACTTGTTCAAGAGGTTGAAGGCTTAAAAGAGCTAGAAAAGCAATTGAGTCTCACAACATCGGATGTGAATGTGTTACAGAGGGAACTAAAGTTGGTGAAGGAATTAGAAATCAAATCTCATAGAAAAGTTAAGATGATAGAACTGGAAAAAAAATCTCAGGTAGGAGAAGATGAACTTTTGTTGCAGTCCATCACAGAAGAACTCAAGACTGCAAAGAAGGATTTGGCCTTAATACGAGATGAAGGTTTTCAGTTCATGACATCAATGGATGCTGTACGAAGGGAACTAAGGCATGTCAAGGAAGAGATTGCTAGTTTGA
AAAAACCTAATGAAAAAACAGATTCAATTGTTCAAAAACTGAACTCTAAACTGCTTAGAGCAAAAGCAAAATTGGAGGCTGTATCTTCTGCTGAAGATAAAGTCAAAGCAATTGCTTCTAATCTGTCTTTAAGCATAGAACAAATGAAGAAAGAAACGGAAGCTGCAAAGAAAGAAGAAGAGCTTACTGAAGAAGAAATTAAAAACAGTAAAGCAGAAATCCAAAAGATCGAATCTGAAATCGACTTAAACGAGATATGCTTACAAGATGCCTTGCAAGAGCTTGAAAAAGTGAAGTCCTCTGAGGCTTTGGTACTTGAAAATTTGAAGTCACTATCAGAAAGTACAATGAGATCTAGAGCTTGTGCAACTAATAATAGTTCCTTTATCACCATCTCTCGTTTCGAATACGAGTACTTAGCTGGTCATGCGGTTGCTGCCCAAGAAGTTGCTGAGAAAAAAGTTGCAGCAGCTCAGGCTTGGATTGAAGCCATTAAAGCAAGTGAAGTTGAAACAACAAAGAAAATTGAATTGGCCGAACTTGAAATCGAAGAGATGAGAATGGAAGAAGAGAAACAAGTATACAGGGCAAACAGATCTCTATCTGCAAAAAGAATGGTGGAGGGAGAGTTACAGAAGAGACAAAAGCGTGAGAATAATGTAGATGATGAAAATGGGGAACCAACAAATCGTCAGAAAACTATTAGGAGAAATGGAAGTATGACTCCATCAAGGCGATTAAAGTTCAGAATATCAGCCTCACCATCGCCTCATATGATGAATGGAAGAACCGACTCCTTTTCCACGCAGAAGAGAACAAAGGTTGTGAAAAATCTTGCCAAATTCTTCAATGGCAAGCAAGCTAAAATGAATCCTTGAATTGGGTGAAAGATTGTTCCTACTGATTGTGATTGTTGCAGTCAGAAGCTCCATGGCCTCATATTTTGCAATCAATGGGAGCGAGGAGTTCAGATAAGGATTTTCTTTTCCCTTTATAATTTGATGTGCATAGAAACAGAATTCATTCAACTTTGGATTTTGCTTATTGGAGTAAACTAAATTCCAAAATCCAGATTATACCAAAATCTGCCTTGAATAATTGCCTTTATGCAGCTTCTTGGCTATGTATGCTCTAACTCACTAACTGCAGTTTTCATGAACTCTCCTATTATGAATTATCATACACAGAAGTCAGGTTTTTGTCTTCTTGACCA

mRNA sequence

ATGCCTTTCCCTCCCAATTTTCAGGTGGAGATGGAGAGGAGAGAGTTTGACAGCAAAATTAGAGGTGGATTAGTCAGAGCAGCTGTTAATCAATATGGAGATGGTAAAGAGAATGGTATTTCATGGAAGAATTCACTCACCCAAGATTCCCCAGAGTATTCCTTAAAGGCAAGAGAGCTCCAAAAGGCAAAGACAGACATTGACCATTATAAAAAGAGTAGAAATGCAGCAGATTCCTCTAGTGCCCAAGCTCAACTTGAACTTCTCAATGCCAAGAACACAGTGAAAAAGCTCTCCTCACTTTTTGACAAATCAAATGCCATGGCTCGGGCGCATAAGCAGGAACTCGAAACGTTGAAGAAGTCAGCTTCTGTCCAGGGCACGCGGTTGGCTGTGGCAAGCAGTGAAAATCGTGAGTATGCAGAATTGATGCGGGAACTGGAATCTGCAAAACTAGAATTAAGCAAACTCAAACTTGATATGGCTTCTGTTTTTCATGAGAAATTGCTGGCAGAGAAGGAAAAAGAAGAAACCATTTCGAAATTTCAATCCCTGTCGAGCTCTATTGAAGAGCTAAGGAAGGAGATCGATGAAATAAATGAAGAACAAGTACTAGTTGAGTTAGCTCAGATAGAGGCTTTGAAGGAGTTTCAAGAGATAGAAGCCCAGCGAAGCATGGAAGCCAAAGAATTCTTATGTGCCATTGAAAACAAGAGGAAGATTATTGATGAACTTGTTCAAGAGGTTGAAGGCTTAAAAGAGCTAGAAAAGCAATTGAGTCTCACAACATCGGATGTGAATGTGTTACAGAGGGAACTAAAGTTGGTGAAGGAATTAGAAATCAAATCTCATAGAAAAGTTAAGATGATAGAACTGGAAAAAAAATCTCAGGTAGGAGAAGATGAACTTTTGTTGCAGTCCATCACAGAAGAACTCAAGACTGCAAAGAAGGATTTGGCCTTAATACGAGATGAAGGTTTTCAGTTCATGACATCAATGGATGCTGTACGAAGGGAACTAAGGCATGTCAAGGAAGAGATTGCTAGTTTGAAAAAACCTAATGAAAAAACAGATTCAATTGTTCAAAAACTGAACTCTAAACTGCTTAGAGCAAAAGCAAAATTGGAGGCTGTATCTTCTGCTGAAGATAAAGTCAAAGCAATTGCTTCTAATCTGTCTTTAAGCATAGAACAAATGAAGAAAGAAACGGAAGCTGCAAAGAAAGAAGAAGAGCTTACTGAAGAAGAAATTAAAAACAGTAAAGCAGAAATCCAAAAGATCGAATCTGAAATCGACTTAAACGAGATATGCTTACAAGATGCCTTGCAAGAGCTTGAAAAAGTGAAGTCCTCTGAGGCTTTGGTACTTGAAAATTTGAAGTCACTATCAGAAAGTACAATGAGATCTAGAGCTTGTGCAACTAATAATAGTTCCTTTATCACCATCTCTCGTTTCGAATACGAGTACTTAGCTGGTCATGCGGTTGCTGCCCAAGAAGTTGCTGAGAAAAAAGTTGCAGCAGCTCAGGCTTGGATTGAAGCCATTAAAGCAAGTGAAGTTGAAACAACAAAGAAAATTGAATTGGCCGAACTTGAAATCGAAGAGATGAGAATGGAAGAAGAGAAACAAGTATACAGGGCAAACAGATCTCTATCTGCAAAAAGAATGGTGGAGGGAGAGTTACAGAAGAGACAAAAGCGTGAGAATAATGTAGATGATGAAAATGGGGAACCAACAAATCGTCAGAAAACTATTAGGAGAAATGGAAGTATGACTCCATCAAGGCGATTAAAGTTCAGAATATCAGCCTCACCATCGCCTCATATGATGAATGGAAGAACCGACTCCTTTTCCACGCAGAAGAGAACAAAGGTTGTGAAAAATCTTGCCAAATTCTTCAATGGCAAGCAAGCTAAAATGAATCCTTGA

Coding sequence (CDS)

ATGCCTTTCCCTCCCAATTTTCAGGTGGAGATGGAGAGGAGAGAGTTTGACAGCAAAATTAGAGGTGGATTAGTCAGAGCAGCTGTTAATCAATATGGAGATGGTAAAGAGAATGGTATTTCATGGAAGAATTCACTCACCCAAGATTCCCCAGAGTATTCCTTAAAGGCAAGAGAGCTCCAAAAGGCAAAGACAGACATTGACCATTATAAAAAGAGTAGAAATGCAGCAGATTCCTCTAGTGCCCAAGCTCAACTTGAACTTCTCAATGCCAAGAACACAGTGAAAAAGCTCTCCTCACTTTTTGACAAATCAAATGCCATGGCTCGGGCGCATAAGCAGGAACTCGAAACGTTGAAGAAGTCAGCTTCTGTCCAGGGCACGCGGTTGGCTGTGGCAAGCAGTGAAAATCGTGAGTATGCAGAATTGATGCGGGAACTGGAATCTGCAAAACTAGAATTAAGCAAACTCAAACTTGATATGGCTTCTGTTTTTCATGAGAAATTGCTGGCAGAGAAGGAAAAAGAAGAAACCATTTCGAAATTTCAATCCCTGTCGAGCTCTATTGAAGAGCTAAGGAAGGAGATCGATGAAATAAATGAAGAACAAGTACTAGTTGAGTTAGCTCAGATAGAGGCTTTGAAGGAGTTTCAAGAGATAGAAGCCCAGCGAAGCATGGAAGCCAAAGAATTCTTATGTGCCATTGAAAACAAGAGGAAGATTATTGATGAACTTGTTCAAGAGGTTGAAGGCTTAAAAGAGCTAGAAAAGCAATTGAGTCTCACAACATCGGATGTGAATGTGTTACAGAGGGAACTAAAGTTGGTGAAGGAATTAGAAATCAAATCTCATAGAAAAGTTAAGATGATAGAACTGGAAAAAAAATCTCAGGTAGGAGAAGATGAACTTTTGTTGCAGTCCATCACAGAAGAACTCAAGACTGCAAAGAAGGATTTGGCCTTAATACGAGATGAAGGTTTTCAGTTCATGACATCAATGGATGCTGTACGAAGGGAACTAAGGCATGTCAAGGAAGAGATTGCTAGTTTGAAAAAACCTAATGAAAAAACAGATTCAATTGTTCAAAAACTGAACTCTAAACTGCTTAGAGCAAAAGCAAAATTGGAGGCTGTATCTTCTGCTGAAGATAAAGTCAAAGCAATTGCTTCTAATCTGTCTTTAAGCATAGAACAAATGAAGAAAGAAACGGAAGCTGCAAAGAAAGAAGAAGAGCTTACTGAAGAAGAAATTAAAAACAGTAAAGCAGAAATCCAAAAGATCGAATCTGAAATCGACTTAAACGAGATATGCTTACAAGATGCCTTGCAAGAGCTTGAAAAAGTGAAGTCCTCTGAGGCTTTGGTACTTGAAAATTTGAAGTCACTATCAGAAAGTACAATGAGATCTAGAGCTTGTGCAACTAATAATAGTTCCTTTATCACCATCTCTCGTTTCGAATACGAGTACTTAGCTGGTCATGCGGTTGCTGCCCAAGAAGTTGCTGAGAAAAAAGTTGCAGCAGCTCAGGCTTGGATTGAAGCCATTAAAGCAAGTGAAGTTGAAACAACAAAGAAAATTGAATTGGCCGAACTTGAAATCGAAGAGATGAGAATGGAAGAAGAGAAACAAGTATACAGGGCAAACAGATCTCTATCTGCAAAAAGAATGGTGGAGGGAGAGTTACAGAAGAGACAAAAGCGTGAGAATAATGTAGATGATGAAAATGGGGAACCAACAAATCGTCAGAAAACTATTAGGAGAAATGGAAGTATGACTCCATCAAGGCGATTAAAGTTCAGAATATCAGCCTCACCATCGCCTCATATGATGAATGGAAGAACCGACTCCTTTTCCACGCAGAAGAGAACAAAGGTTGTGAAAAATCTTGCCAAATTCTTCAATGGCAAGCAAGCTAAAATGAATCCTTGA

Protein sequence

MPFPPNFQVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKARELQKAKTDIDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGEDELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFRISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP*
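
The protein sequence above corresponds to the coding sequence read codon by codon, with `*` marking the stop codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code; `translate` and `CODON_TABLE` are names introduced here for illustration and are not taken from the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' denotes a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First eight codons and last three codons of the CDS listed above.
print(translate("ATGCCTTTCCCTCCCAATTTTCAG"))  # MPFPPNFQ
print(translate("AATCCTTGA"))                 # NP*
```

The two outputs match the start (`MPFPPNFQ`) and the end (`NP*`) of the protein sequence shown above.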
BLAST of Csa2G404900 vs. Swiss-Prot
Match: PMI2_ARATH (Protein PLASTID MOVEMENT IMPAIRED 2 OS=Arabidopsis thaliana GN=PMI2 PE=1 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 5.7e-107
Identity = 245/631 (38.83%), Positives = 367/631 (58.16%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKARELQKAKTDIDHY 70
           M  R  D  +    V+A +N+YG      +  K+S+ +D          L K+  ++  Y
Sbjct: 1   MGERNLDGTVS---VKATINKYGQKATRSVI-KSSVAED----------LHKSGRELGIY 60

Query: 71  KKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTRL 130
           ++SR  A+S+ A+A++EL  AK  VK+L+   ++SN   ++ + ++E +   + + G   
Sbjct: 61  RESRRVAESAKAKAEVELCKAKKIVKELTLRIEESNRRLKSRRIDIEAVMNESRIDG--- 120

Query: 131 AVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIE 190
                 N  Y  +MRELE  K ELSKLKLD+  V  EK++AEKE  E  S+ +     +E
Sbjct: 121 ------NGGYVRIMRELEDMKQELSKLKLDVVYVSREKVVAEKEVMELESRMEENLKLLE 180

Query: 191 ELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVE 250
            L+ E+D  NEE VLVE+A+IEALKE +E+E QR  E KE   ++  ++K I E+++E+E
Sbjct: 181 SLKLEVDVANEEHVLVEVAKIEALKECKEVEEQREKERKEVSESLHKRKKRIREMIREIE 240

Query: 251 GLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKS-QVGEDEL-LLQSI 310
             K  E +L+ T  D+ +L+ +LKLVKE+E K  R   M   + ++ + G+D L +L+ +
Sbjct: 241 RSKNFENELAETLLDIEMLETQLKLVKEMERKVQRNESMSRSKNRAFERGKDNLSVLKEV 300

Query: 311 TEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSKL 370
           TE  +  K +LA I  E F  + +MD +R+E  H K+E A L K  +K D ++++LN+KL
Sbjct: 301 TEATEAKKAELASINAELFCLVNTMDTLRKEFDHAKKETAWLDKMIQKDDVMLERLNTKL 360

Query: 371 LRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKIE 430
           L AK +LEAVS AE+++  +A NL+ S E++K + EAAKKEE    EE +    EIQK E
Sbjct: 361 LIAKDQLEAVSKAEERISYLADNLTTSFEKLKSDREAAKKEELKLREEARIINNEIQKTE 420

Query: 431 SEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITISRFEYE 490
           +  D  E  L   L ELEK K +E+L LE L+++ E TM +R   +  +S ITISRFEYE
Sbjct: 421 TGFDGKEKELLSKLDELEKAKHAESLALEKLETMVEKTMETREMESRRNSTITISRFEYE 480

Query: 491 YLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEKQVYRAN 550
           YL+G A  A+E AEKKV AA AW+EA+KAS      K E  +    +  +EEE++ +R  
Sbjct: 481 YLSGKACHAEETAEKKVEAAMAWVEALKASTKAIMIKTESLKRVSGKTMLEEERESFRMQ 540

Query: 551 RSLSAKRMVEGELQKRQKRENNVDDEN--GEPTNRQKTIRRNGSMTPSRRLKFRISASPS 610
           RSLS KR+V+ E+   QK + N +D      P   +K++R +G   P +  K R  +S  
Sbjct: 541 RSLSIKRLVQDEI---QKFKGNSEDNGLINSPKPVRKSVRLSGKFAPVQGGKSRRYSSG- 600

Query: 611 PHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQ 638
               N  T +F   K+ K V N+ KFF+ K+
Sbjct: 601 ----NRATPTFFVIKKKKKVPNMVKFFSRKR 600
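
The percentages on each HSP summary line are the listed counts divided by the alignment length. A small sketch that recomputes them from the summary line is given below; `hsp_percentages` is a hypothetical helper, and it simply reads every `count/length` fraction on the line in order rather than relying on the field labels.

```python
import re

def hsp_percentages(stats_line):
    """Return each 'count/length' fraction on the line as a percentage."""
    fractions = re.findall(r"(\d+)/(\d+)", stats_line)
    return [round(100 * int(num) / int(den), 2) for num, den in fractions]

line = "Identity = 245/631 (38.83%), Positives = 367/631 (58.16%), Query Frame = 1"
print(hsp_percentages(line))  # [38.83, 58.16]
```

For the PMI2_ARATH hit this reproduces the reported 38.83% identity and 58.16% positives over a 631-column alignment.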

BLAST of Csa2G404900 vs. Swiss-Prot
Match: PMI15_ARATH (Protein PLASTID MOVEMENT IMPAIRED 15 OS=Arabidopsis thaliana GN=PMI15 PE=2 SV=3)

HSP 1 Score: 350.9 bits (899), Expect = 3.0e-95
Identity = 231/601 (38.44%), Positives = 341/601 (56.74%), Query Frame = 1

Query: 37  ENGISWKNSLTQ-DSP--EYSLKARELQKAKTDIDHYKKSRNAADSSSAQAQLELLNAKN 96
           EN    +NS T  D P  + SL    +  ++  +  Y +SR  +++  A+ +  L   K 
Sbjct: 7   ENSDMKRNSSTLLDLPVVKSSLVVEAIHMSRKKLGWYNESRRDSETVKARVEAGLSEVKK 66

Query: 97  TVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTRLAVASSENREYAELMRELESAKLE 156
           +V++L+ L  +SN  A   ++++E LK                  +YAE+MR LE  K E
Sbjct: 67  SVEELALLIKRSNRSAGFQEKDMEVLKME---------------EKYAEVMRVLEVVKEE 126

Query: 157 LSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIEELRKEIDEINEEQVLVELAQIEA 216
           +S++KLD++SV  E++ AE++ EE   K +     +E L+KEI+  NEE ++V L +IEA
Sbjct: 127 VSRVKLDVSSVLIERVAAEEKVEELRFKTEGGLRLLESLKKEIEVANEEHLMVALGKIEA 186

Query: 217 LKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVEGLKELEKQLSLTTSDVNVLQREL 276
           LK ++EIE QR  +A + L  +  + K I  +++E E  K++E +L  T++DV +L+ +L
Sbjct: 187 LKGYKEIERQREGKAIKVLDLLVERNKRIKNMLEEAERSKDIEIELFETSTDVEMLETQL 246

Query: 277 KLVKELEIKSHRKVKMIELEKKSQVGEDELLLQSITEELKTAKKDLALIRDEGFQFMTSM 336
           KL K++E +   +            G  +  L  + E  +  K++LA ++ E F+ MT M
Sbjct: 247 KLFKKMERRVQGRDSSSMSRSNRSFGRGKYSLSVLKEVTEGKKEELASVKVEIFRVMTVM 306

Query: 337 DAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLS 396
           DA+R E+   ++E A L K   + D  ++KLNSK+L  K+KLE VS AE+++ ++A N  
Sbjct: 307 DALRNEIIRARDETACLGKILREDDVKIEKLNSKILIEKSKLEVVSIAEERISSLAENFV 366

Query: 397 LSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKIESEIDLNEICLQDALQELEKVKSSEA 456
            S+E++KK   AAKKEE L +EE   +KAE QK + +ID  E  L   L ELEKVK +EA
Sbjct: 367 GSLEKIKKSRNAAKKEEFLFKEEKTVTKAETQKTKLDIDKKESELNSKLDELEKVKHTEA 426

Query: 457 LVLENLKSLSESTMRSRACATNNSSFITISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIE 516
           LVLE L+SL E  M SR   + + S ITISRFEYEYL+ HA  A+E AEKKVAAA AW+E
Sbjct: 427 LVLEKLESLVEDMMESREMESEHCSTITISRFEYEYLSKHASQAEETAEKKVAAAAAWVE 486

Query: 517 AIKASEVETTKKIELAELEIEEMRMEEEKQVYRANRSLSAKRMVEGELQKRQKRENNVDD 576
           A+KAS      K E    E E  + EEE++V+R  RSLS KR+VEGE+QK ++       
Sbjct: 487 ALKASTKSFLMKTETLMRESEMTKAEEEREVFRMERSLSTKRLVEGEIQKIKRNSEAEGY 546

Query: 577 ENGEPTNRQKTIRRNGSMTPSRRLKFRISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFF 635
            + +P          G  TP +R K R  +S         T +F   K+ K V  LAKFF
Sbjct: 547 ISPKPV---------GKFTPVQRGKPRRYSSVG-------TPTFFVIKKKKKVPRLAKFF 576

BLAST of Csa2G404900 vs. Swiss-Prot
Match: Y5586_ARATH (WEB family protein At5g55860 OS=Arabidopsis thaliana GN=At5g55860 PE=1 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 9.9e-59
Identity = 178/642 (27.73%), Positives = 321/642 (50.00%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDGKENGIS--WKNSLTQDSPEYSLKARELQKAKTDID 70
           +E  E D+      V+ AVN +G+   +     ++    Q + +  +K  EL  A+ +++
Sbjct: 17  VEVGEIDTSAPFQSVKDAVNLFGEAAFSAEKPVFRKPNPQSAEKVLVKQTELHLAQKELN 76

Query: 71  HYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLF-------DKSNAMARAHKQELETLKK 130
             K+    A++   QA  EL  +K TV +L+          D +N    A K  +E  K 
Sbjct: 77  KLKEQLKNAETIREQALSELEWSKRTVDELTRKLEAVNESRDSANKATEAAKSLIEEAKP 136

Query: 131 SASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISK 190
                 +     + +  EY E+ +EL++AK EL K++     +   K +A  + EE    
Sbjct: 137 GNVSVASSSDAQTRDMEEYGEVCKELDTAKQELRKIRQVSNEILETKTVALSKVEEAKKV 196

Query: 191 FQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKI 250
            +  S  IE LRKEI  +NE     +LA  +A KE  EI A++ ++ K +   +E   K 
Sbjct: 197 SKVHSEKIELLRKEIAAVNESVEQTKLACSQARKEQSEIFAEKEIQQKSYKAGMEESAKK 256

Query: 251 IDELVQEV--EGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVG 310
              L  E   E  K+LE QL+ T ++++ LQ++++  K  +I S   V +       ++ 
Sbjct: 257 SLALKNEFDPEFAKKLEVQLTETYNEIDELQKQMETAKASDIDSVNGVSL-------ELN 316

Query: 311 EDELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDS 370
           E + L + + EE K+ ++               +++++ EL++VK E   ++    + +S
Sbjct: 317 EAKGLFEKLVEEEKSLQE--------------LVESLKAELKNVKMEHDEVEAKEAEIES 376

Query: 371 IVQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKN 430
           +   L+ KL R+K++LE   + E K KA   ++ L+I Q+  ETEAA++E E    + K 
Sbjct: 377 VAGDLHLKLSRSKSELEQCVTEESKAKAALEDMMLTINQISSETEAARREAEGMRNKAKE 436

Query: 431 SKAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSEST--MRSRACATNNS 490
              E +     ++ +E+ L+ AL E E+ K++E   LE +KS+SE T   R+   + + S
Sbjct: 437 LMKEAESAHLALEDSELHLRVALDEAEEAKAAETKALEQIKSMSEKTNAARNSTSSESGS 496

Query: 491 SFITISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMR 550
             IT+S+ E++ L+  A    ++AE KVAAA A +EA++ASE ET KK+E  + EI++++
Sbjct: 497 QSITLSQEEFKSLSKRAEVFDKLAEMKVAAALAQVEAVRASENETLKKLETTQEEIKKLK 556

Query: 551 MEEEKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRL 610
              E+ + +A  + +AK+ VEGEL++ ++R+    +E       +  ++     +P +  
Sbjct: 557 TATEEALKKAAMADAAKKAVEGELRRWRERDQKKAEEAATRILAEAEMKMASESSPQQHY 616

Query: 611 KFRISASPSPHMMNGRTDSFSTQKRTK--VVKNLAKFFNGKQ 638
           K     +P    +N + +   T   +K  ++ NL+  FN K+
Sbjct: 617 K-----APKQKPVNNKLEKTKTSVVSKKVLMPNLSGIFNRKK 632

BLAST of Csa2G404900 vs. Swiss-Prot
Match: WEB1_ARATH (Protein WEAK CHLOROPLAST MOVEMENT UNDER BLUE LIGHT 1 OS=Arabidopsis thaliana GN=WEB1 PE=1 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 2.2e-37
Identity = 150/641 (23.40%), Positives = 289/641 (45.09%), Query Frame = 1

Query: 9   VEMERREFDSKIRGGLVRAAVNQYGDGKENGIS-WKNSLTQDSPEYSLKARELQKAKTDI 68
           V+  R   D+      V+ AV+++G     GI+ WK+   Q      L   EL+K   +I
Sbjct: 158 VDSHRGLIDTAAPFESVKEAVSKFG-----GITDWKSHRMQAVERRKLIEEELKKIHEEI 217

Query: 69  DHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQ--ELETLKKSASV 128
             YK     A+++  Q   EL + K  +++L    DK+    +  KQ  EL  L+     
Sbjct: 218 PEYKTHSETAEAAKLQVLKELESTKRLIEQLKLNLDKAQTEEQQAKQDSELAKLRVEEME 277

Query: 129 QGTR--LAVASSENREYAEL-----MRELESAKLELSKLKLDMASVFHEKLLAEKEKEET 188
           QG    ++VA+    E A+      + EL S K EL  L  +  ++  +K +A K+ EE 
Sbjct: 278 QGIAEDVSVAAKAQLEVAKARHTTAITELSSVKEELETLHKEYDALVQDKDVAVKKVEEA 337

Query: 189 ISKFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENK 248
           +   + +  ++EEL  E+    E       + +EA ++       R  +   +   ++  
Sbjct: 338 MLASKEVEKTVEELTIELIATKESLESAHASHLEAEEQRIGAAMARDQDTHRWEKELKQA 397

Query: 249 RKIIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQ- 308
            + +  L Q++   K+L+ +L   ++ +  L+ EL    E ++K          +  ++ 
Sbjct: 398 EEELQRLNQQIHSSKDLKSKLDTASALLLDLKAELVAYMESKLKQEACDSTTNTDPSTEN 457

Query: 309 VGEDEL--LLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNE 368
           +   +L   + S  +EL+    ++     E      +  +++ EL   K  +AS+K+   
Sbjct: 458 MSHPDLHAAVASAKKELEEVNVNIEKAAAEVSCLKLASSSLQLELEKEKSTLASIKQREG 517

Query: 369 KTDSIVQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEE 428
                V  + +++ R ++++ +V S E   +     L   ++Q  +E + AK   E+  E
Sbjct: 518 MASIAVASIEAEIDRTRSEIASVQSKEKDAREKMVELPKQLQQAAEEADEAKSLAEVARE 577

Query: 429 EIKNSKAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATN 488
           E++ +K E ++ ++     E  L  A +E+E  K+SE L L  +K+L ES    +A  T+
Sbjct: 578 ELRKAKEEAEQAKAGASTMESRLFAAQKEIEAAKASERLALAAIKALEESESTLKANDTD 637

Query: 489 NSSFITISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEE 548
           +   +T+S  EY  L+  A  A+E+A  +VAAA + IE  K +E+ + +K+E    +++ 
Sbjct: 638 SPRSVTLSLEEYYELSKRAHEAEELANARVAAAVSRIEEAKETEMRSLEKLEEVNRDMDA 697

Query: 549 MRMEEEKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRN---GSMT 608
            +   ++   +A ++   K  VE EL+K  + E+    + G+  N +K ++ +   G M 
Sbjct: 698 RKKALKEATEKAEKAKEGKLGVEQELRK-WRAEHEQKRKAGDGVNTEKNLKESFEGGKME 757

Query: 609 PS-RRLKFRISASPSPHMMNGRTDSFSTQKRTKVVKNLAKF 633
            S   + +  S S S         + S Q +++  K    F
Sbjct: 758 QSPEAVVYASSPSESYGTEENSETNLSPQTKSRKKKKKLSF 792

BLAST of Csa2G404900 vs. Swiss-Prot
Match: Y1215_ARATH (WEB family protein At1g12150 OS=Arabidopsis thaliana GN=At1g12150 PE=2 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 1.0e-34
Identity = 150/573 (26.18%), Positives = 254/573 (44.33%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGD---GKENGISWKNSLTQDS-----PEYSLKARELQK 70
           ME  E D++     V+AAV+ +G+    K+     ++ L+ +S      +  L  +E  K
Sbjct: 16  MEVGEIDTRAPFQSVKAAVSLFGEVAVSKQRSTPRRSRLSSESVCDKETQLMLVHKEFMK 75

Query: 71  AKTDIDHYKKSRNAA--DSSSAQAQLE-LLNAKNTVKKLSSLFDKSNAMARAHKQELETL 130
            K  +D+ + +R+ A  D S A+  +E L N   TV K       +    +  +++LE  
Sbjct: 76  IKQKLDNAESTRSRALDDLSKAKKTMEDLSNKLETVNKSKQSAIDTKETVQQREEQLEHD 135

Query: 131 KKSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETI 190
           K   S         + E  +Y     EL++AK +L+K++    S    K  A  +  E  
Sbjct: 136 KCHGSPPHHHELDVARE--QYISTTVELDAAKQQLNKIRQSFDSAMDFKATALNQAAEAQ 195

Query: 191 SKFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKR 250
              Q  S+ + EL KEI ++ +    ++LA  + L+E   I  ++    + +  A+E   
Sbjct: 196 RALQVNSAKVNELSKEISDMKDAIHQLKLAAAQNLQEHANIVKEKDDLRECYRTAVEEAE 255

Query: 251 KIIDELVQEVEG--LKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQ 310
           K +  L +E E    + LE +L  TTS++ VL+ E+K   E E+ +              
Sbjct: 256 KKLLVLRKEYEPELSRTLEAKLLETTSEIEVLREEMKKAHESEMNT-------------- 315

Query: 311 VGEDELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLK-KPNEK 370
                  ++ IT EL  A   L    D+     + ++++R EL  ++ E   L+ K  E+
Sbjct: 316 -------VKIITNELNEATMRLQEAADDECSLRSLVNSLRMELEDLRREREELQQKEAER 375

Query: 371 TDSIVQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEE 430
            +    K    L +   KLE + +   + +  A+N++  IE +KKETEAA    E  E+ 
Sbjct: 376 LEIEETKKLEALKQESLKLEQMKTEAIEARNEAANMNRKIESLKKETEAAMIAAEEAEKR 435

Query: 431 IKNSKAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNN 490
                                L+  ++E+E+ KS+E  V E +K +S+     +    ++
Sbjct: 436 ---------------------LELVIREVEEAKSAEEKVREEMKMISQKQESKKQDEESS 495

Query: 491 SSFITISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEM 550
            S I I+  E+E L   A   +   EKK+A   A +E I     E   K+E     IEEM
Sbjct: 496 GSKIKITIQEFESLKRGAGETEAAIEKKLATIAAELEEINKRRAEADNKLEANLKAIEEM 544

Query: 551 RMEEEKQVYRANRSLSAKRMVEGELQKRQKREN 570
           +   E     A  + +AKRMVE ELQ+ +++EN
Sbjct: 556 KQATELAQKSAESAEAAKRMVESELQRWRQQEN 544

BLAST of Csa2G404900 vs. TrEMBL
Match: A0A0A0LQ40_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G404900 PE=4 SV=1)

HSP 1 Score: 1207.6 bits (3123), Expect = 0.0e+00
Identity = 642/642 (100.00%), Positives = 642/642 (100.00%), Query Frame = 1

Query: 1   MPFPPNFQVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKAREL 60
           MPFPPNFQVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKAREL
Sbjct: 1   MPFPPNFQVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKAREL 60

Query: 61  QKAKTDIDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLK 120
           QKAKTDIDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLK
Sbjct: 61  QKAKTDIDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLK 120

Query: 121 KSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETIS 180
           KSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETIS
Sbjct: 121 KSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETIS 180

Query: 181 KFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK 240
           KFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK
Sbjct: 181 KFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK 240

Query: 241 IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGE 300
           IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGE
Sbjct: 241 IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGE 300

Query: 301 DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSI 360
           DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSI
Sbjct: 301 DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSI 360

Query: 361 VQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNS 420
           VQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNS
Sbjct: 361 VQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNS 420

Query: 421 KAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFI 480
           KAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFI
Sbjct: 421 KAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFI 480

Query: 481 TISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEE 540
           TISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEE
Sbjct: 481 TISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEE 540

Query: 541 EKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFR 600
           EKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFR
Sbjct: 541 EKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFR 600

Query: 601 ISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP 643
           ISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP
Sbjct: 601 ISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP 642

BLAST of Csa2G404900 vs. TrEMBL
Match: W9S0V7_9ROSA (Protein PLASTID MOVEMENT IMPAIRED 2 OS=Morus notabilis GN=L484_027082 PE=4 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 1.1e-162
Identity = 343/640 (53.59%), Positives = 441/640 (68.91%), Query Frame = 1

Query: 8   QVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSL-KARELQKAKTD 67
           QV M R EFD + R G V+AA+  YGD   NG S       D  E S  K REL  A+ D
Sbjct: 6   QVIMNRGEFDDRRRTGSVKAAIRLYGDKILNGRSSLKKPEIDVSEKSYSKTRELHMARRD 65

Query: 68  IDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELET-LKKSASV 127
           ID YK++R  ADS  AQA+ ELL+AK TV  LSSL  +S++ A+A KQE+ET L+KS   
Sbjct: 66  IDRYKETRAEADSLKAQAEFELLDAKTTVTNLSSLLRESDSKAKAQKQEIETTLRKSTRK 125

Query: 128 QGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSL 187
               LA    E  +Y+E+M+ELE+ K EL  LKLDMASV  EK  AEK+ E + SK +S 
Sbjct: 126 DKRALAFGDMETHKYSEVMKELEAVKQELRMLKLDMASVLEEKSRAEKQIEASRSKIRSH 185

Query: 188 SSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDEL 247
           SSS+E ++KE +E+NEEQVLVELA+IEALKE+ EIEA+R+ EA +F  AIE  R+ I+++
Sbjct: 186 SSSLEAVKKETEEVNEEQVLVELARIEALKEYGEIEAERAKEASQFASAIEQTRRKINDI 245

Query: 248 VQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGED---E 307
           V EVE  KELE +L++T +DV++LQ EL+ VKE+E +  R   +  LE   + GE+    
Sbjct: 246 VDEVEHSKELESKLAITIADVDMLQNELQSVKEMEKRIQRNDNLKRLETSFRGGEELDSS 305

Query: 308 LLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQ 367
           L LQS+TEEL+ AKK+LA ++ EGFQ+M SMD +R E +HVK+E A L++  +K D  VQ
Sbjct: 306 LSLQSVTEELEAAKKELASLKAEGFQYMASMDIIRNERKHVKKETARLEEIEKKGDLAVQ 365

Query: 368 KLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKA 427
            LNSKLLRAKAKLEAVS+AE+K K+I SNLSL++EQ+K E + A++E+ L  +E    K 
Sbjct: 366 NLNSKLLRAKAKLEAVSAAEEKAKSIVSNLSLTLEQLKTEAKTARREKVLVCQEAATIKE 425

Query: 428 EIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITI 487
           EI + ESEID  E  LQ A+QELE  KSSEAL L+NLKS  E+T+ +R     +SS ITI
Sbjct: 426 EIGRTESEIDSTEERLQAAMQELEAAKSSEALALKNLKSRIENTVGARTSVLKHSSSITI 485

Query: 488 SRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEK 547
           S FEYEYL G AV A+E+A+KKVAAAQAWIEAIKA+E E   KI+ A+ EI EMR+EEE+
Sbjct: 486 SNFEYEYLTGRAVGAEELADKKVAAAQAWIEAIKANEKEILMKIDFAQREIREMRLEEER 545

Query: 548 QVYRANRSLSAKRMVEGELQK-RQKRENNVDDENGEPTNRQKTIRRNG--SMTPSRRLKF 607
           + YR  RS SAKR VE ELQ  R KRE N   EN +    +K+IR NG  ++TPSRR KF
Sbjct: 546 EAYRMERSFSAKRTVERELQSWRTKREKNATPENLQLAMHKKSIRGNGNANLTPSRRAKF 605

Query: 608 RISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAK 640
           R SASP+        +SF  +KRT+V+  +AKFF GK  K
Sbjct: 606 RKSASPAAR------NSFPVKKRTQVMPLIAKFFKGKTDK 639

BLAST of Csa2G404900 vs. TrEMBL
Match: A0A061EGI3_THECC (Uncharacterized protein isoform 5 OS=Theobroma cacao GN=TCM_019366 PE=4 SV=1)

HSP 1 Score: 574.3 bits (1479), Expect = 1.8e-160
Identity = 325/633 (51.34%), Positives = 450/633 (71.09%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKARELQKAKTDIDHY 70
           M+R  ++ + R G V+AAVN YG+   +G        +D PE S +AREL  A+ D+  Y
Sbjct: 1   MDRTRYEGRRRNGTVKAAVNIYGERILDGNFSLKKPQEDFPEPSSRARELHMARRDMSRY 60

Query: 71  KKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTRL 130
           K+SR AA+S+ ++A+ EL +A  TVK L+S+ ++SN  A+A  +++E+L+KS + +   L
Sbjct: 61  KESRRAAESAKSKAESELFSATKTVKDLASMIEESNFKAKARMRDIESLRKSGNREEKAL 120

Query: 131 AVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIE 190
           AV S E+  YAE+MREL+  K ELSKLKLDMASV  EK  AEKE E++  K  S SSS+E
Sbjct: 121 AVRSIESYHYAEVMRELDLVKQELSKLKLDMASVKGEKARAEKEFEDSSLKMWSNSSSVE 180

Query: 191 ELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVE 250
            LRK+I+  NEE VLVELA+IEALKE  E+EAQR  E   F  ++E  ++ + E+++E++
Sbjct: 181 ALRKQIEAANEEHVLVELARIEALKEVGELEAQREKEFGGFSFSMEETKEKMKEIIEEID 240

Query: 251 GLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELE---KKSQVGEDELLLQS 310
             KELEK+L++T SDVN+L+ +LK VK+L+ +  R   + + E   + +   E    LQS
Sbjct: 241 QSKELEKKLAVTLSDVNLLENKLKQVKKLDKRVQRSDDLKQSEHSFRSAAEVEGSPSLQS 300

Query: 311 ITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSK 370
           IT+EL+ AKK+LA IR+EGFQ+M+SMD +R EL+HV+EE A  KK  EK D  VQ LNSK
Sbjct: 301 ITKELEVAKKELASIREEGFQYMSSMDIIRNELKHVREETARSKKTGEKADLKVQNLNSK 360

Query: 371 LLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKI 430
           LLRAK+KLEAV++A +K ++I +NLSL++EQ+K E EAA+KE+ L  E+    KAEIQK 
Sbjct: 361 LLRAKSKLEAVTAAGEKAESIVTNLSLTLEQLKTEAEAARKEKALITEDTATIKAEIQKT 420

Query: 431 ESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITISRFEY 490
           ESEIDL E  L  A+QELE VK+SEA  LE L+SL E+TM+SRA A+N S  ITIS+FEY
Sbjct: 421 ESEIDLTEERLNAAVQELEAVKASEASALEKLRSLIETTMQSRASASNQSYTITISKFEY 480

Query: 491 EYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEKQVYRA 550
           EYL G AV A+E+A+KKVAA QAWIEA+KASE E   K E+A  ++ +MR+EEE +V+R 
Sbjct: 481 EYLTGRAVGAEEIADKKVAATQAWIEALKASEREILMKTEIANRDLRDMRVEEEHEVHRT 540

Query: 551 NRSLSAKRMVEGELQ-KRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFRISASPS 610
             SLSAK+MVE EL+ +RQ RE N + +N +   R+++++ NG+++PSR+ KFR SASP+
Sbjct: 541 EWSLSAKKMVETELRNRRQTREKNAEAQNRQSPFRRRSMKSNGNLSPSRQAKFRKSASPA 600

Query: 611 PHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAK 640
             +  G +  F  +K+ KVV NLAKFF GK+ +
Sbjct: 601 --IRAGGSTPFIIKKKRKVVPNLAKFFLGKKVE 631

BLAST of Csa2G404900 vs. TrEMBL
Match: U5GG24_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s15190g PE=4 SV=1)

HSP 1 Score: 566.2 bits (1458), Expect = 5.0e-158
Identity = 332/645 (51.47%), Positives = 434/645 (67.29%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDG--KENGISWKNSLTQDSPEYSL-KARELQKAKTDI 70
           M+RR FD + R G V+AAVN YG+   + +  S K     D PE S  +A+EL  AK D+
Sbjct: 1   MDRRVFDDRRRIGTVKAAVNMYGERILESSSSSLKTPAQMDLPEKSSSRAKELHMAKRDL 60

Query: 71  DHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLKKSASVQG 130
             YK++R AA+S+  +A+ EL  AK TVK+L    +KSN   +A  +++E L K +  Q 
Sbjct: 61  VRYKENRRAAESAKVKAESELSEAKRTVKELVLQIEKSNLKVKAQVRDMERLNKLSKRQD 120

Query: 131 TRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSS 190
             L V S E+ +YAE++RELE  K ELSKLKL+MASV   K  AEKE   +ISK  S  S
Sbjct: 121 MALIVGSDESHQYAEVIRELEGVKQELSKLKLEMASVLEAKTRAEKEIATSISKLSSNMS 180

Query: 191 SIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIE---NKRKIIDE 250
             E LRK+IDE NEEQVLVELAQIEALKEF EI+AQR  EA+EF  A++   NKRK + E
Sbjct: 181 HAEALRKKIDEANEEQVLVELAQIEALKEFGEIQAQREKEAREFSSAMQETKNKRKNVKE 240

Query: 251 LVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVG---ED 310
              E+    +LE +L++T  DVN++Q ELKL K+ + K  R   M  L    + G   ED
Sbjct: 241 ---EISSSTDLESKLAVTLYDVNLIQHELKLAKDKDAKVQRNDSMKHLGGSFREGKQLED 300

Query: 311 ELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIV 370
             LL+SITEEL+ AKK+LA  R+EGFQFMTSMD VR EL+HV EE   LKK  EK D   
Sbjct: 301 SSLLKSITEELQAAKKELASTREEGFQFMTSMDIVRNELKHVTEETVQLKKVKEKADITA 360

Query: 371 QKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSK 430
           Q LNSKLLRAK+KLE  ++ E+K ++  S+LS+++EQ+K E E A+KE++L  EE    K
Sbjct: 361 QNLNSKLLRAKSKLETATAVEEKARSTLSSLSVTLEQLKTEAEVARKEKKLICEETAKIK 420

Query: 431 AEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFIT 490
           AEI+  +S+IDL E  LQ A+QEL+ VK SE+  L+NLK++ E+TMRSRA A+ +SS IT
Sbjct: 421 AEIRNTDSQIDLTEEKLQYAIQELDAVKKSESSALQNLKNVIENTMRSRASASQHSSSIT 480

Query: 491 ISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEE 550
           IS+FEYEYL GHA  A+E+A+KKVAAA AWIEA+KASE E   KIELA  +I E R+EEE
Sbjct: 481 ISKFEYEYLTGHAAMAEEIADKKVAAAHAWIEALKASEKEILMKIELAHGDIRETRVEEE 540

Query: 551 KQVYRANRSLSAKRMVEGELQK-RQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFR 610
           K++YR   SLSAKRMVEGEL K RQ  + N + EN +    +K+++ NG++T SRR K R
Sbjct: 541 KEIYRTESSLSAKRMVEGELPKWRQVSKKNTEAENQQQPLPRKSMKANGNLTLSRRSKLR 600

Query: 611 ISASPSPHM---MNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP 643
            + SPS  M   +  R+ S + +K+  +V NLAK F GK+   +P
Sbjct: 601 NAGSPSVRMTPRITPRSTSIAIRKKRTIVPNLAKLFIGKKVDKDP 642

BLAST of Csa2G404900 vs. TrEMBL
Match: A0A061EGW1_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_019366 PE=4 SV=1)

HSP 1 Score: 566.2 bits (1458), Expect = 5.0e-158
Identity = 323/628 (51.43%), Positives = 446/628 (71.02%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEY-SLKARELQKAKTDIDH 70
           M+R  ++ + R G V+AAVN YG+   +G        +D PE  S +AREL  A+ D+  
Sbjct: 1   MDRTRYEGRRRNGTVKAAVNIYGERILDGNFSLKKPQEDFPEKPSSRARELHMARRDMSR 60

Query: 71  YKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTR 130
           YK+SR AA+S+ ++A+ EL +A  TVK L+S+ ++SN  A+A  +++E+L+KS + +   
Sbjct: 61  YKESRRAAESAKSKAESELFSATKTVKDLASMIEESNFKAKARMRDIESLRKSGNREEKA 120

Query: 131 LAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSI 190
           LAV S E+  YAE+MREL+  K ELSKLKLDMASV  EK  AEKE E++  K  S SSS+
Sbjct: 121 LAVRSIESYHYAEVMRELDLVKQELSKLKLDMASVKGEKARAEKEFEDSSLKMWSNSSSV 180

Query: 191 EELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEV 250
           E LRK+I+  NEE VLVELA+IEALKE  E+EAQR  E   F  ++E  ++ + E+++E+
Sbjct: 181 EALRKQIEAANEEHVLVELARIEALKEVGELEAQREKEFGGFSFSMEETKEKMKEIIEEI 240

Query: 251 EGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELE---KKSQVGEDELLLQ 310
           +  KELEK+L++T SDVN+L+ +LK VK+L+ +  R   + + E   + +   E    LQ
Sbjct: 241 DQSKELEKKLAVTLSDVNLLENKLKQVKKLDKRVQRSDDLKQSEHSFRSAAEVEGSPSLQ 300

Query: 311 SITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQKLNS 370
           SIT+EL+ AKK+LA IR+EGFQ+M+SMD +R EL+HV+EE A  KK  EK D  VQ LNS
Sbjct: 301 SITKELEVAKKELASIREEGFQYMSSMDIIRNELKHVREETARSKKTGEKADLKVQNLNS 360

Query: 371 KLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKAEIQK 430
           KLLRAK+KLEAV++A +K ++I +NLSL++EQ+K E EAA+KE+ L  E+    KAEIQK
Sbjct: 361 KLLRAKSKLEAVTAAGEKAESIVTNLSLTLEQLKTEAEAARKEKALITEDTATIKAEIQK 420

Query: 431 IESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITISRFE 490
            ESEIDL E  L  A+QELE VK+SEA  LE L+SL E+TM+SRA A+N S  ITIS+FE
Sbjct: 421 TESEIDLTEERLNAAVQELEAVKASEASALEKLRSLIETTMQSRASASNQSYTITISKFE 480

Query: 491 YEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEKQVYR 550
           YEYL G AV A+E+A+KKVAA QAWIEA+KASE E   K E+A  ++ +MR+EEE +V+R
Sbjct: 481 YEYLTGRAVGAEEIADKKVAATQAWIEALKASEREILMKTEIANRDLRDMRVEEEHEVHR 540

Query: 551 ANRSLSAKRMVEGELQ-KRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFRISASP 610
              SLSAK+MVE EL+ +RQ RE N + +N +   R+++++ NG+++PSR+ KFR SASP
Sbjct: 541 TEWSLSAKKMVETELRNRRQTREKNAEAQNRQSPFRRRSMKSNGNLSPSRQAKFRKSASP 600

Query: 611 SPHMMNGRTDSFSTQKRTKVVKNLAKFF 634
           +  +  G +  F  +K+ KVV NLAKFF
Sbjct: 601 A--IRAGGSTPFIIKKKRKVVPNLAKFF 626

BLAST of Csa2G404900 vs. TAIR10
Match: AT1G66840.1 (AT1G66840.1 Plant protein of unknown function (DUF827))

HSP 1 Score: 389.8 bits (1000), Expect = 3.2e-108
Identity = 245/631 (38.83%), Positives = 367/631 (58.16%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKARELQKAKTDIDHY 70
           M  R  D  +    V+A +N+YG      +  K+S+ +D          L K+  ++  Y
Sbjct: 1   MGERNLDGTVS---VKATINKYGQKATRSVI-KSSVAED----------LHKSGRELGIY 60

Query: 71  KKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTRL 130
           ++SR  A+S+ A+A++EL  AK  VK+L+   ++SN   ++ + ++E +   + + G   
Sbjct: 61  RESRRVAESAKAKAEVELCKAKKIVKELTLRIEESNRRLKSRRIDIEAVMNESRIDG--- 120

Query: 131 AVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIE 190
                 N  Y  +MRELE  K ELSKLKLD+  V  EK++AEKE  E  S+ +     +E
Sbjct: 121 ------NGGYVRIMRELEDMKQELSKLKLDVVYVSREKVVAEKEVMELESRMEENLKLLE 180

Query: 191 ELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVE 250
            L+ E+D  NEE VLVE+A+IEALKE +E+E QR  E KE   ++  ++K I E+++E+E
Sbjct: 181 SLKLEVDVANEEHVLVEVAKIEALKECKEVEEQREKERKEVSESLHKRKKRIREMIREIE 240

Query: 251 GLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKS-QVGEDEL-LLQSI 310
             K  E +L+ T  D+ +L+ +LKLVKE+E K  R   M   + ++ + G+D L +L+ +
Sbjct: 241 RSKNFENELAETLLDIEMLETQLKLVKEMERKVQRNESMSRSKNRAFERGKDNLSVLKEV 300

Query: 311 TEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSKL 370
           TE  +  K +LA I  E F  + +MD +R+E  H K+E A L K  +K D ++++LN+KL
Sbjct: 301 TEATEAKKAELASINAELFCLVNTMDTLRKEFDHAKKETAWLDKMIQKDDVMLERLNTKL 360

Query: 371 LRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKIE 430
           L AK +LEAVS AE+++  +A NL+ S E++K + EAAKKEE    EE +    EIQK E
Sbjct: 361 LIAKDQLEAVSKAEERISYLADNLTTSFEKLKSDREAAKKEELKLREEARIINNEIQKTE 420

Query: 431 SEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITISRFEYE 490
           +  D  E  L   L ELEK K +E+L LE L+++ E TM +R   +  +S ITISRFEYE
Sbjct: 421 TGFDGKEKELLSKLDELEKAKHAESLALEKLETMVEKTMETREMESRRNSTITISRFEYE 480

Query: 491 YLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEKQVYRAN 550
           YL+G A  A+E AEKKV AA AW+EA+KAS      K E  +    +  +EEE++ +R  
Sbjct: 481 YLSGKACHAEETAEKKVEAAMAWVEALKASTKAIMIKTESLKRVSGKTMLEEERESFRMQ 540

Query: 551 RSLSAKRMVEGELQKRQKRENNVDDEN--GEPTNRQKTIRRNGSMTPSRRLKFRISASPS 610
           RSLS KR+V+ E+   QK + N +D      P   +K++R +G   P +  K R  +S  
Sbjct: 541 RSLSIKRLVQDEI---QKFKGNSEDNGLINSPKPVRKSVRLSGKFAPVQGGKSRRYSSG- 600

Query: 611 PHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQ 638
               N  T +F   K+ K V N+ KFF+ K+
Sbjct: 601 ----NRATPTFFVIKKKKKVPNMVKFFSRKR 600

BLAST of Csa2G404900 vs. TAIR10
Match: AT5G38150.1 (AT5G38150.1 Plant protein of unknown function (DUF827))

HSP 1 Score: 350.9 bits (899), Expect = 1.7e-96
Identity = 231/601 (38.44%), Positives = 341/601 (56.74%), Query Frame = 1

Query: 37  ENGISWKNSLTQ-DSP--EYSLKARELQKAKTDIDHYKKSRNAADSSSAQAQLELLNAKN 96
           EN    +NS T  D P  + SL    +  ++  +  Y +SR  +++  A+ +  L   K 
Sbjct: 7   ENSDMKRNSSTLLDLPVVKSSLVVEAIHMSRKKLGWYNESRRDSETVKARVEAGLSEVKK 66

Query: 97  TVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTRLAVASSENREYAELMRELESAKLE 156
           +V++L+ L  +SN  A   ++++E LK                  +YAE+MR LE  K E
Sbjct: 67  SVEELALLIKRSNRSAGFQEKDMEVLKME---------------EKYAEVMRVLEVVKEE 126

Query: 157 LSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIEELRKEIDEINEEQVLVELAQIEA 216
           +S++KLD++SV  E++ AE++ EE   K +     +E L+KEI+  NEE ++V L +IEA
Sbjct: 127 VSRVKLDVSSVLIERVAAEEKVEELRFKTEGGLRLLESLKKEIEVANEEHLMVALGKIEA 186

Query: 217 LKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVEGLKELEKQLSLTTSDVNVLQREL 276
           LK ++EIE QR  +A + L  +  + K I  +++E E  K++E +L  T++DV +L+ +L
Sbjct: 187 LKGYKEIERQREGKAIKVLDLLVERNKRIKNMLEEAERSKDIEIELFETSTDVEMLETQL 246

Query: 277 KLVKELEIKSHRKVKMIELEKKSQVGEDELLLQSITEELKTAKKDLALIRDEGFQFMTSM 336
           KL K++E +   +            G  +  L  + E  +  K++LA ++ E F+ MT M
Sbjct: 247 KLFKKMERRVQGRDSSSMSRSNRSFGRGKYSLSVLKEVTEGKKEELASVKVEIFRVMTVM 306

Query: 337 DAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLS 396
           DA+R E+   ++E A L K   + D  ++KLNSK+L  K+KLE VS AE+++ ++A N  
Sbjct: 307 DALRNEIIRARDETACLGKILREDDVKIEKLNSKILIEKSKLEVVSIAEERISSLAENFV 366

Query: 397 LSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKIESEIDLNEICLQDALQELEKVKSSEA 456
            S+E++KK   AAKKEE L +EE   +KAE QK + +ID  E  L   L ELEKVK +EA
Sbjct: 367 GSLEKIKKSRNAAKKEEFLFKEEKTVTKAETQKTKLDIDKKESELNSKLDELEKVKHTEA 426

Query: 457 LVLENLKSLSESTMRSRACATNNSSFITISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIE 516
           LVLE L+SL E  M SR   + + S ITISRFEYEYL+ HA  A+E AEKKVAAA AW+E
Sbjct: 427 LVLEKLESLVEDMMESREMESEHCSTITISRFEYEYLSKHASQAEETAEKKVAAAAAWVE 486

Query: 517 AIKASEVETTKKIELAELEIEEMRMEEEKQVYRANRSLSAKRMVEGELQKRQKRENNVDD 576
           A+KAS      K E    E E  + EEE++V+R  RSLS KR+VEGE+QK ++       
Sbjct: 487 ALKASTKSFLMKTETLMRESEMTKAEEEREVFRMERSLSTKRLVEGEIQKIKRNSEAEGY 546

Query: 577 ENGEPTNRQKTIRRNGSMTPSRRLKFRISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFF 635
            + +P          G  TP +R K R  +S         T +F   K+ K V  LAKFF
Sbjct: 547 ISPKPV---------GKFTPVQRGKPRRYSSVG-------TPTFFVIKKKKKVPRLAKFF 576

BLAST of Csa2G404900 vs. TAIR10
Match: AT5G55860.1 (AT5G55860.1 Plant protein of unknown function (DUF827))

HSP 1 Score: 229.6 bits (584), Expect = 5.6e-60
Identity = 178/642 (27.73%), Positives = 321/642 (50.00%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDGKENGIS--WKNSLTQDSPEYSLKARELQKAKTDID 70
           +E  E D+      V+ AVN +G+   +     ++    Q + +  +K  EL  A+ +++
Sbjct: 17  VEVGEIDTSAPFQSVKDAVNLFGEAAFSAEKPVFRKPNPQSAEKVLVKQTELHLAQKELN 76

Query: 71  HYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLF-------DKSNAMARAHKQELETLKK 130
             K+    A++   QA  EL  +K TV +L+          D +N    A K  +E  K 
Sbjct: 77  KLKEQLKNAETIREQALSELEWSKRTVDELTRKLEAVNESRDSANKATEAAKSLIEEAKP 136

Query: 131 SASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISK 190
                 +     + +  EY E+ +EL++AK EL K++     +   K +A  + EE    
Sbjct: 137 GNVSVASSSDAQTRDMEEYGEVCKELDTAKQELRKIRQVSNEILETKTVALSKVEEAKKV 196

Query: 191 FQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKI 250
            +  S  IE LRKEI  +NE     +LA  +A KE  EI A++ ++ K +   +E   K 
Sbjct: 197 SKVHSEKIELLRKEIAAVNESVEQTKLACSQARKEQSEIFAEKEIQQKSYKAGMEESAKK 256

Query: 251 IDELVQEV--EGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVG 310
              L  E   E  K+LE QL+ T ++++ LQ++++  K  +I S   V +       ++ 
Sbjct: 257 SLALKNEFDPEFAKKLEVQLTETYNEIDELQKQMETAKASDIDSVNGVSL-------ELN 316

Query: 311 EDELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDS 370
           E + L + + EE K+ ++               +++++ EL++VK E   ++    + +S
Sbjct: 317 EAKGLFEKLVEEEKSLQE--------------LVESLKAELKNVKMEHDEVEAKEAEIES 376

Query: 371 IVQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKN 430
           +   L+ KL R+K++LE   + E K KA   ++ L+I Q+  ETEAA++E E    + K 
Sbjct: 377 VAGDLHLKLSRSKSELEQCVTEESKAKAALEDMMLTINQISSETEAARREAEGMRNKAKE 436

Query: 431 SKAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSEST--MRSRACATNNS 490
              E +     ++ +E+ L+ AL E E+ K++E   LE +KS+SE T   R+   + + S
Sbjct: 437 LMKEAESAHLALEDSELHLRVALDEAEEAKAAETKALEQIKSMSEKTNAARNSTSSESGS 496

Query: 491 SFITISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMR 550
             IT+S+ E++ L+  A    ++AE KVAAA A +EA++ASE ET KK+E  + EI++++
Sbjct: 497 QSITLSQEEFKSLSKRAEVFDKLAEMKVAAALAQVEAVRASENETLKKLETTQEEIKKLK 556

Query: 551 MEEEKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRL 610
              E+ + +A  + +AK+ VEGEL++ ++R+    +E       +  ++     +P +  
Sbjct: 557 TATEEALKKAAMADAAKKAVEGELRRWRERDQKKAEEAATRILAEAEMKMASESSPQQHY 616

Query: 611 KFRISASPSPHMMNGRTDSFSTQKRTK--VVKNLAKFFNGKQ 638
           K     +P    +N + +   T   +K  ++ NL+  FN K+
Sbjct: 617 K-----APKQKPVNNKLEKTKTSVVSKKVLMPNLSGIFNRKK 632

BLAST of Csa2G404900 vs. TAIR10
Match: AT2G26570.1 (AT2G26570.1 Plant protein of unknown function (DUF827))

HSP 1 Score: 158.7 bits (400), Expect = 1.2e-38
Identity = 150/641 (23.40%), Positives = 289/641 (45.09%), Query Frame = 1

Query: 9   VEMERREFDSKIRGGLVRAAVNQYGDGKENGIS-WKNSLTQDSPEYSLKARELQKAKTDI 68
           V+  R   D+      V+ AV+++G     GI+ WK+   Q      L   EL+K   +I
Sbjct: 158 VDSHRGLIDTAAPFESVKEAVSKFG-----GITDWKSHRMQAVERRKLIEEELKKIHEEI 217

Query: 69  DHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQ--ELETLKKSASV 128
             YK     A+++  Q   EL + K  +++L    DK+    +  KQ  EL  L+     
Sbjct: 218 PEYKTHSETAEAAKLQVLKELESTKRLIEQLKLNLDKAQTEEQQAKQDSELAKLRVEEME 277

Query: 129 QGTR--LAVASSENREYAEL-----MRELESAKLELSKLKLDMASVFHEKLLAEKEKEET 188
           QG    ++VA+    E A+      + EL S K EL  L  +  ++  +K +A K+ EE 
Sbjct: 278 QGIAEDVSVAAKAQLEVAKARHTTAITELSSVKEELETLHKEYDALVQDKDVAVKKVEEA 337

Query: 189 ISKFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENK 248
           +   + +  ++EEL  E+    E       + +EA ++       R  +   +   ++  
Sbjct: 338 MLASKEVEKTVEELTIELIATKESLESAHASHLEAEEQRIGAAMARDQDTHRWEKELKQA 397

Query: 249 RKIIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQ- 308
            + +  L Q++   K+L+ +L   ++ +  L+ EL    E ++K          +  ++ 
Sbjct: 398 EEELQRLNQQIHSSKDLKSKLDTASALLLDLKAELVAYMESKLKQEACDSTTNTDPSTEN 457

Query: 309 VGEDEL--LLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNE 368
           +   +L   + S  +EL+    ++     E      +  +++ EL   K  +AS+K+   
Sbjct: 458 MSHPDLHAAVASAKKELEEVNVNIEKAAAEVSCLKLASSSLQLELEKEKSTLASIKQREG 517

Query: 369 KTDSIVQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEE 428
                V  + +++ R ++++ +V S E   +     L   ++Q  +E + AK   E+  E
Sbjct: 518 MASIAVASIEAEIDRTRSEIASVQSKEKDAREKMVELPKQLQQAAEEADEAKSLAEVARE 577

Query: 429 EIKNSKAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATN 488
           E++ +K E ++ ++     E  L  A +E+E  K+SE L L  +K+L ES    +A  T+
Sbjct: 578 ELRKAKEEAEQAKAGASTMESRLFAAQKEIEAAKASERLALAAIKALEESESTLKANDTD 637

Query: 489 NSSFITISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEE 548
           +   +T+S  EY  L+  A  A+E+A  +VAAA + IE  K +E+ + +K+E    +++ 
Sbjct: 638 SPRSVTLSLEEYYELSKRAHEAEELANARVAAAVSRIEEAKETEMRSLEKLEEVNRDMDA 697

Query: 549 MRMEEEKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRN---GSMT 608
            +   ++   +A ++   K  VE EL+K  + E+    + G+  N +K ++ +   G M 
Sbjct: 698 RKKALKEATEKAEKAKEGKLGVEQELRK-WRAEHEQKRKAGDGVNTEKNLKESFEGGKME 757

Query: 609 PS-RRLKFRISASPSPHMMNGRTDSFSTQKRTKVVKNLAKF 633
            S   + +  S S S         + S Q +++  K    F
Sbjct: 758 QSPEAVVYASSPSESYGTEENSETNLSPQTKSRKKKKKLSF 792

BLAST of Csa2G404900 vs. TAIR10
Match: AT4G33390.1 (AT4G33390.1 Plant protein of unknown function (DUF827))

HSP 1 Score: 157.5 bits (397), Expect = 2.7e-38
Identity = 145/564 (25.71%), Positives = 255/564 (45.21%), Query Frame = 1

Query: 25  VRAAVNQYGDGKENGIS-WKNSLTQDSPEYSLKARELQKAKTDIDHYKKSRNAADSSSAQ 84
           V+ AV+++G     GI+ WK    +     +   +EL K + +I  YKK     + S   
Sbjct: 165 VKEAVSKFG-----GITDWKAHRMKVLERRNFVEQELDKIQEEIPEYKKKSEMVEMSKML 224

Query: 85  AQLELLNAKNTVKKLSSLFDKSNAMARAHKQ--ELETLKKSASVQGT--RLAVASSENRE 144
           A  EL + K  +++L    +K+    +  KQ  EL  L+     QG     +VAS    E
Sbjct: 225 AVEELESTKRLIEELKLNLEKAETEEQQAKQDSELAKLRVQEMEQGIADEASVASKAQLE 284

Query: 145 YAEL-----MRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIEELRK 204
            A+      + ELES K EL  L+ +  ++  EK LA KE EE +   + +   +EEL  
Sbjct: 285 VAQARHTSAISELESVKEELQTLQNEYDALVKEKDLAVKEAEEAVIASKEVERKVEELTI 344

Query: 205 EIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVEGLKE 264
           E+    E       + +EA +        R  E   +   ++   + +  L Q +   KE
Sbjct: 345 ELIATKESLECAHSSHLEAEEHRIGAAMLRDQETHRWEKELKQAEEELQRLKQHLVSTKE 404

Query: 265 LEKQLSLTTSDVNVLQRELKLVKE----LEIKSHRKVKMIELEKKSQVGEDELLLQSITE 324
           L+ +L   ++ +  L++EL   KE     E  S   V  IE+  + +  + +  + S  +
Sbjct: 405 LQVKLEFASALLLDLKKELADHKESSKVKEETSETVVTNIEISLQEKTTDIQKAVASAKK 464

Query: 325 ELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSKLLR 384
           EL+    ++     E      +  ++R E+   K  + SLK+        V  L +++  
Sbjct: 465 ELEEVNANVEKATSEVNCLKVASSSLRLEIDKEKSALDSLKQREGMASVTVASLEAEIDI 524

Query: 385 AKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKIESE 444
            + ++  V S E + +     L   ++Q  +E + AK   EL  EE++ S+ E ++ ++ 
Sbjct: 525 TRCEIALVKSKEKETREEMVELPKQLQQASQEADEAKSFAELAREELRKSQEEAEQAKAG 584

Query: 445 IDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITISRFEYEYL 504
               E  L  A +E+E +K+SE L L  +K+L ES   S+  A ++   +T++  EY  L
Sbjct: 585 ASTMESRLFAAQKEIEAIKASERLALAAIKALQESESSSKENAVDSPRTVTLTIEEYYEL 644

Query: 505 AGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEKQVYRANRS 564
           +  A  A+E A  +VAAA + +   K +E  + +K+E    E+ E +      + +A ++
Sbjct: 645 SKRAHEAEEAANARVAAAVSEVGEAKETEKRSLEKLEEVNKEMVERKATLAGAMEKAEKA 704

Query: 565 LSAKRMVEGELQK-----RQKREN 570
              K  VE EL+K      +KR+N
Sbjct: 705 KEGKLGVEQELRKWREVSEKKRKN 723

BLAST of Csa2G404900 vs. NCBI nr
Match: gi|449442066|ref|XP_004138803.1| (PREDICTED: protein PLASTID MOVEMENT IMPAIRED 2 [Cucumis sativus])

HSP 1 Score: 1207.6 bits (3123), Expect = 0.0e+00
Identity = 642/642 (100.00%), Positives = 642/642 (100.00%), Query Frame = 1

Query: 1   MPFPPNFQVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKAREL 60
           MPFPPNFQVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKAREL
Sbjct: 1   MPFPPNFQVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKAREL 60

Query: 61  QKAKTDIDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLK 120
           QKAKTDIDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLK
Sbjct: 61  QKAKTDIDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLK 120

Query: 121 KSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETIS 180
           KSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETIS
Sbjct: 121 KSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETIS 180

Query: 181 KFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK 240
           KFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK
Sbjct: 181 KFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK 240

Query: 241 IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGE 300
           IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGE
Sbjct: 241 IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGE 300

Query: 301 DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSI 360
           DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSI
Sbjct: 301 DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSI 360

Query: 361 VQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNS 420
           VQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNS
Sbjct: 361 VQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNS 420

Query: 421 KAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFI 480
           KAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFI
Sbjct: 421 KAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFI 480

Query: 481 TISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEE 540
           TISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEE
Sbjct: 481 TISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEE 540

Query: 541 EKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFR 600
           EKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFR
Sbjct: 541 EKQVYRANRSLSAKRMVEGELQKRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFR 600

Query: 601 ISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP 643
           ISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP
Sbjct: 601 ISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP 642

BLAST of Csa2G404900 vs. NCBI nr
Match: gi|659081354|ref|XP_008441290.1| (PREDICTED: protein PLASTID MOVEMENT IMPAIRED 2 isoform X1 [Cucumis melo])

HSP 1 Score: 1117.1 bits (2888), Expect = 0.0e+00
Identity = 597/643 (92.85%), Positives = 617/643 (95.96%), Query Frame = 1

Query: 1   MPFPPNFQVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKAREL 60
           MPFPP+FQV+MERREFDSKIRGGLVRAA+NQYGDGKENGISWK SLTQDS EYSLKAREL
Sbjct: 1   MPFPPDFQVKMERREFDSKIRGGLVRAAINQYGDGKENGISWKKSLTQDSSEYSLKAREL 60

Query: 61  QKAKTDIDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLK 120
           QKAKTD+DHYKK+RNAADS SAQAQLELLNAKNTVKKLSSLFDKSNA ARAHKQELETLK
Sbjct: 61  QKAKTDVDHYKKTRNAADSFSAQAQLELLNAKNTVKKLSSLFDKSNATARAHKQELETLK 120

Query: 121 KSASVQGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETIS 180
           KSASVQG +LAV+SSEN +YAELMRELESAKLELSKLKLDM SVFHEKLLAEKEKEE IS
Sbjct: 121 KSASVQGLQLAVSSSENHQYAELMRELESAKLELSKLKLDMGSVFHEKLLAEKEKEEAIS 180

Query: 181 KFQSLSSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK 240
           KFQSLS+SIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK
Sbjct: 181 KFQSLSNSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRK 240

Query: 241 IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGE 300
           IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKS RKVKMIELEK SQV E
Sbjct: 241 IIDELVQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSQRKVKMIELEKNSQVEE 300

Query: 301 DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSI 360
           DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRREL+HVKEE+ASLKKPN KTDSI
Sbjct: 301 DELLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELKHVKEEVASLKKPNGKTDSI 360

Query: 361 VQKLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNS 420
           VQKLNSKLLRAKAKLEAVSSAE+KVKAIASNLSLSIEQMKKETEAAKKE+EL +EEIKN+
Sbjct: 361 VQKLNSKLLRAKAKLEAVSSAEEKVKAIASNLSLSIEQMKKETEAAKKEKELIDEEIKNT 420

Query: 421 KAEIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFI 480
           KAEIQKIESEIDLNEICLQDALQELEKVKSSEA VLENLKSL+ESTMRSRA AT NSSF+
Sbjct: 421 KAEIQKIESEIDLNEICLQDALQELEKVKSSEAFVLENLKSLTESTMRSRASATKNSSFV 480

Query: 481 TISRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEE 540
           TISRFEYEYLAGHAVAAQEVA+KKVAAAQAWIEAIKASEVETTK IELAELEIEEMRMEE
Sbjct: 481 TISRFEYEYLAGHAVAAQEVAKKKVAAAQAWIEAIKASEVETTKTIELAELEIEEMRMEE 540

Query: 541 EKQVYRANRSLSAKRMVEGELQK-RQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKF 600
           EKQ YRANRSLSAKRMVEGELQ  RQ RE NV+DENGE TNR KTIRRNGSMTP RRLKF
Sbjct: 541 EKQAYRANRSLSAKRMVEGELQNWRQNREKNVEDENGEATNRPKTIRRNGSMTP-RRLKF 600

Query: 601 RISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP 643
           RISASPSPHMMNGRTDSFS QKRTKVVKNLAKFFNGK+AKMNP
Sbjct: 601 RISASPSPHMMNGRTDSFSMQKRTKVVKNLAKFFNGKKAKMNP 642

BLAST of Csa2G404900 vs. NCBI nr
Match: gi|659081356|ref|XP_008441291.1| (PREDICTED: protein PLASTID MOVEMENT IMPAIRED 2 isoform X2 [Cucumis melo])

HSP 1 Score: 1098.2 bits (2839), Expect = 0.0e+00
Identity = 589/633 (93.05%), Positives = 607/633 (95.89%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKARELQKAKTDIDHY 70
           MERREFDSKIRGGLVRAA+NQYGDGKENGISWK SLTQDS EYSLKARELQKAKTD+DHY
Sbjct: 1   MERREFDSKIRGGLVRAAINQYGDGKENGISWKKSLTQDSSEYSLKARELQKAKTDVDHY 60

Query: 71  KKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTRL 130
           KK+RNAADS SAQAQLELLNAKNTVKKLSSLFDKSNA ARAHKQELETLKKSASVQG +L
Sbjct: 61  KKTRNAADSFSAQAQLELLNAKNTVKKLSSLFDKSNATARAHKQELETLKKSASVQGLQL 120

Query: 131 AVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIE 190
           AV+SSEN +YAELMRELESAKLELSKLKLDM SVFHEKLLAEKEKEE ISKFQSLS+SIE
Sbjct: 121 AVSSSENHQYAELMRELESAKLELSKLKLDMGSVFHEKLLAEKEKEEAISKFQSLSNSIE 180

Query: 191 ELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVE 250
           ELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVE
Sbjct: 181 ELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVE 240

Query: 251 GLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGEDELLLQSITE 310
           GLKELEKQLSLTTSDVNVLQRELKLVKELEIKS RKVKMIELEK SQV EDELLLQSITE
Sbjct: 241 GLKELEKQLSLTTSDVNVLQRELKLVKELEIKSQRKVKMIELEKNSQVEEDELLLQSITE 300

Query: 311 ELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSKLLR 370
           ELKTAKKDLALIRDEGFQFMTSMDAVRREL+HVKEE+ASLKKPN KTDSIVQKLNSKLLR
Sbjct: 301 ELKTAKKDLALIRDEGFQFMTSMDAVRRELKHVKEEVASLKKPNGKTDSIVQKLNSKLLR 360

Query: 371 AKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKIESE 430
           AKAKLEAVSSAE+KVKAIASNLSLSIEQMKKETEAAKKE+EL +EEIKN+KAEIQKIESE
Sbjct: 361 AKAKLEAVSSAEEKVKAIASNLSLSIEQMKKETEAAKKEKELIDEEIKNTKAEIQKIESE 420

Query: 431 IDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITISRFEYEYL 490
           IDLNEICLQDALQELEKVKSSEA VLENLKSL+ESTMRSRA AT NSSF+TISRFEYEYL
Sbjct: 421 IDLNEICLQDALQELEKVKSSEAFVLENLKSLTESTMRSRASATKNSSFVTISRFEYEYL 480

Query: 491 AGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEKQVYRANRS 550
           AGHAVAAQEVA+KKVAAAQAWIEAIKASEVETTK IELAELEIEEMRMEEEKQ YRANRS
Sbjct: 481 AGHAVAAQEVAKKKVAAAQAWIEAIKASEVETTKTIELAELEIEEMRMEEEKQAYRANRS 540

Query: 551 LSAKRMVEGELQK-RQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFRISASPSPHM 610
           LSAKRMVEGELQ  RQ RE NV+DENGE TNR KTIRRNGSMTP RRLKFRISASPSPHM
Sbjct: 541 LSAKRMVEGELQNWRQNREKNVEDENGEATNRPKTIRRNGSMTP-RRLKFRISASPSPHM 600

Query: 611 MNGRTDSFSTQKRTKVVKNLAKFFNGKQAKMNP 643
           MNGRTDSFS QKRTKVVKNLAKFFNGK+AKMNP
Sbjct: 601 MNGRTDSFSMQKRTKVVKNLAKFFNGKKAKMNP 632

BLAST of Csa2G404900 vs. NCBI nr
Match: gi|703146779|ref|XP_010108888.1| (Protein PLASTID MOVEMENT IMPAIRED 2 [Morus notabilis])

HSP 1 Score: 581.6 bits (1498), Expect = 1.6e-162
Identity = 343/640 (53.59%), Positives = 441/640 (68.91%), Query Frame = 1

Query: 8   QVEMERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSL-KARELQKAKTD 67
           QV M R EFD + R G V+AA+  YGD   NG S       D  E S  K REL  A+ D
Sbjct: 6   QVIMNRGEFDDRRRTGSVKAAIRLYGDKILNGRSSLKKPEIDVSEKSYSKTRELHMARRD 65

Query: 68  IDHYKKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELET-LKKSASV 127
           ID YK++R  ADS  AQA+ ELL+AK TV  LSSL  +S++ A+A KQE+ET L+KS   
Sbjct: 66  IDRYKETRAEADSLKAQAEFELLDAKTTVTNLSSLLRESDSKAKAQKQEIETTLRKSTRK 125

Query: 128 QGTRLAVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSL 187
               LA    E  +Y+E+M+ELE+ K EL  LKLDMASV  EK  AEK+ E + SK +S 
Sbjct: 126 DKRALAFGDMETHKYSEVMKELEAVKQELRMLKLDMASVLEEKSRAEKQIEASRSKIRSH 185

Query: 188 SSSIEELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDEL 247
           SSS+E ++KE +E+NEEQVLVELA+IEALKE+ EIEA+R+ EA +F  AIE  R+ I+++
Sbjct: 186 SSSLEAVKKETEEVNEEQVLVELARIEALKEYGEIEAERAKEASQFASAIEQTRRKINDI 245

Query: 248 VQEVEGLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELEKKSQVGED---E 307
           V EVE  KELE +L++T +DV++LQ EL+ VKE+E +  R   +  LE   + GE+    
Sbjct: 246 VDEVEHSKELESKLAITIADVDMLQNELQSVKEMEKRIQRNDNLKRLETSFRGGEELDSS 305

Query: 308 LLLQSITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQ 367
           L LQS+TEEL+ AKK+LA ++ EGFQ+M SMD +R E +HVK+E A L++  +K D  VQ
Sbjct: 306 LSLQSVTEELEAAKKELASLKAEGFQYMASMDIIRNERKHVKKETARLEEIEKKGDLAVQ 365

Query: 368 KLNSKLLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKA 427
            LNSKLLRAKAKLEAVS+AE+K K+I SNLSL++EQ+K E + A++E+ L  +E    K 
Sbjct: 366 NLNSKLLRAKAKLEAVSAAEEKAKSIVSNLSLTLEQLKTEAKTARREKVLVCQEAATIKE 425

Query: 428 EIQKIESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITI 487
           EI + ESEID  E  LQ A+QELE  KSSEAL L+NLKS  E+T+ +R     +SS ITI
Sbjct: 426 EIGRTESEIDSTEERLQAAMQELEAAKSSEALALKNLKSRIENTVGARTSVLKHSSSITI 485

Query: 488 SRFEYEYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEK 547
           S FEYEYL G AV A+E+A+KKVAAAQAWIEAIKA+E E   KI+ A+ EI EMR+EEE+
Sbjct: 486 SNFEYEYLTGRAVGAEELADKKVAAAQAWIEAIKANEKEILMKIDFAQREIREMRLEEER 545

Query: 548 QVYRANRSLSAKRMVEGELQK-RQKRENNVDDENGEPTNRQKTIRRNG--SMTPSRRLKF 607
           + YR  RS SAKR VE ELQ  R KRE N   EN +    +K+IR NG  ++TPSRR KF
Sbjct: 546 EAYRMERSFSAKRTVERELQSWRTKREKNATPENLQLAMHKKSIRGNGNANLTPSRRAKF 605

Query: 608 RISASPSPHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAK 640
           R SASP+        +SF  +KRT+V+  +AKFF GK  K
Sbjct: 606 RKSASPAAR------NSFPVKKRTQVMPLIAKFFKGKTDK 639

BLAST of Csa2G404900 vs. NCBI nr
Match: gi|590652560|ref|XP_007033184.1| (Uncharacterized protein isoform 5 [Theobroma cacao])

HSP 1 Score: 574.3 bits (1479), Expect = 2.6e-160
Identity = 325/633 (51.34%), Positives = 450/633 (71.09%), Query Frame = 1

Query: 11  MERREFDSKIRGGLVRAAVNQYGDGKENGISWKNSLTQDSPEYSLKARELQKAKTDIDHY 70
           M+R  ++ + R G V+AAVN YG+   +G        +D PE S +AREL  A+ D+  Y
Sbjct: 1   MDRTRYEGRRRNGTVKAAVNIYGERILDGNFSLKKPQEDFPEPSSRARELHMARRDMSRY 60

Query: 71  KKSRNAADSSSAQAQLELLNAKNTVKKLSSLFDKSNAMARAHKQELETLKKSASVQGTRL 130
           K+SR AA+S+ ++A+ EL +A  TVK L+S+ ++SN  A+A  +++E+L+KS + +   L
Sbjct: 61  KESRRAAESAKSKAESELFSATKTVKDLASMIEESNFKAKARMRDIESLRKSGNREEKAL 120

Query: 131 AVASSENREYAELMRELESAKLELSKLKLDMASVFHEKLLAEKEKEETISKFQSLSSSIE 190
           AV S E+  YAE+MREL+  K ELSKLKLDMASV  EK  AEKE E++  K  S SSS+E
Sbjct: 121 AVRSIESYHYAEVMRELDLVKQELSKLKLDMASVKGEKARAEKEFEDSSLKMWSNSSSVE 180

Query: 191 ELRKEIDEINEEQVLVELAQIEALKEFQEIEAQRSMEAKEFLCAIENKRKIIDELVQEVE 250
            LRK+I+  NEE VLVELA+IEALKE  E+EAQR  E   F  ++E  ++ + E+++E++
Sbjct: 181 ALRKQIEAANEEHVLVELARIEALKEVGELEAQREKEFGGFSFSMEETKEKMKEIIEEID 240

Query: 251 GLKELEKQLSLTTSDVNVLQRELKLVKELEIKSHRKVKMIELE---KKSQVGEDELLLQS 310
             KELEK+L++T SDVN+L+ +LK VK+L+ +  R   + + E   + +   E    LQS
Sbjct: 241 QSKELEKKLAVTLSDVNLLENKLKQVKKLDKRVQRSDDLKQSEHSFRSAAEVEGSPSLQS 300

Query: 311 ITEELKTAKKDLALIRDEGFQFMTSMDAVRRELRHVKEEIASLKKPNEKTDSIVQKLNSK 370
           IT+EL+ AKK+LA IR+EGFQ+M+SMD +R EL+HV+EE A  KK  EK D  VQ LNSK
Sbjct: 301 ITKELEVAKKELASIREEGFQYMSSMDIIRNELKHVREETARSKKTGEKADLKVQNLNSK 360

Query: 371 LLRAKAKLEAVSSAEDKVKAIASNLSLSIEQMKKETEAAKKEEELTEEEIKNSKAEIQKI 430
           LLRAK+KLEAV++A +K ++I +NLSL++EQ+K E EAA+KE+ L  E+    KAEIQK 
Sbjct: 361 LLRAKSKLEAVTAAGEKAESIVTNLSLTLEQLKTEAEAARKEKALITEDTATIKAEIQKT 420

Query: 431 ESEIDLNEICLQDALQELEKVKSSEALVLENLKSLSESTMRSRACATNNSSFITISRFEY 490
           ESEIDL E  L  A+QELE VK+SEA  LE L+SL E+TM+SRA A+N S  ITIS+FEY
Sbjct: 421 ESEIDLTEERLNAAVQELEAVKASEASALEKLRSLIETTMQSRASASNQSYTITISKFEY 480

Query: 491 EYLAGHAVAAQEVAEKKVAAAQAWIEAIKASEVETTKKIELAELEIEEMRMEEEKQVYRA 550
           EYL G AV A+E+A+KKVAA QAWIEA+KASE E   K E+A  ++ +MR+EEE +V+R 
Sbjct: 481 EYLTGRAVGAEEIADKKVAATQAWIEALKASEREILMKTEIANRDLRDMRVEEEHEVHRT 540

Query: 551 NRSLSAKRMVEGELQ-KRQKRENNVDDENGEPTNRQKTIRRNGSMTPSRRLKFRISASPS 610
             SLSAK+MVE EL+ +RQ RE N + +N +   R+++++ NG+++PSR+ KFR SASP+
Sbjct: 541 EWSLSAKKMVETELRNRRQTREKNAEAQNRQSPFRRRSMKSNGNLSPSRQAKFRKSASPA 600

Query: 611 PHMMNGRTDSFSTQKRTKVVKNLAKFFNGKQAK 640
             +  G +  F  +K+ KVV NLAKFF GK+ +
Sbjct: 601 --IRAGGSTPFIIKKKRKVVPNLAKFFLGKKVE 631

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
PMI2_ARATH    5.7e-107   38.83       Protein PLASTID MOVEMENT IMPAIRED 2 OS=Arabidopsis thaliana GN=PMI2 PE=1 SV=1
PMI15_ARATH   3.0e-95    38.44       Protein PLASTID MOVEMENT IMPAIRED 15 OS=Arabidopsis thaliana GN=PMI15 PE=2 SV=3
Y5586_ARATH   9.9e-59    27.73       WEB family protein At5g55860 OS=Arabidopsis thaliana GN=At5g55860 PE=1 SV=1
WEB1_ARATH    2.2e-37    23.40       Protein WEAK CHLOROPLAST MOVEMENT UNDER BLUE LIGHT 1 OS=Arabidopsis thaliana GN=...
Y1215_ARATH   1.0e-34    26.18       WEB family protein At1g12150 OS=Arabidopsis thaliana GN=At1g12150 PE=2 SV=1

Match Name         E-value    Identity    Description
A0A0A0LQ40_CUCSA   0.0e+00    100.00      Uncharacterized protein OS=Cucumis sativus GN=Csa_2G404900 PE=4 SV=1
W9S0V7_9ROSA       1.1e-162   53.59       Protein PLASTID MOVEMENT IMPAIRED 2 OS=Morus notabilis GN=L484_027082 PE=4 SV=1
A0A061EGI3_THECC   1.8e-160   51.34       Uncharacterized protein isoform 5 OS=Theobroma cacao GN=TCM_019366 PE=4 SV=1
U5GG24_POPTR       5.0e-158   51.47       Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s15190g PE=4 SV=1
A0A061EGW1_THECC   5.0e-158   51.43       Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_019366 PE=4 SV=1

Match Name    E-value    Identity    Description
AT1G66840.1   3.2e-108   38.83       Plant protein of unknown function (DUF827)
AT5G38150.1   1.7e-96    38.44       Plant protein of unknown function (DUF827)
AT5G55860.1   5.6e-60    27.73       Plant protein of unknown function (DUF827)
AT2G26570.1   1.2e-38    23.40       Plant protein of unknown function (DUF827)
AT4G33390.1   2.7e-38    25.71       Plant protein of unknown function (DUF827)

Match Name                         E-value    Identity    Description
gi|449442066|ref|XP_004138803.1|   0.0e+00    100.00      PREDICTED: protein PLASTID MOVEMENT IMPAIRED 2 [Cucumis sativus]
gi|659081354|ref|XP_008441290.1|   0.0e+00    92.85       PREDICTED: protein PLASTID MOVEMENT IMPAIRED 2 isoform X1 [Cucumis melo]
gi|659081356|ref|XP_008441291.1|   0.0e+00    93.05       PREDICTED: protein PLASTID MOVEMENT IMPAIRED 2 isoform X2 [Cucumis melo]
gi|703146779|ref|XP_010108888.1|   1.6e-162   53.59       Protein PLASTID MOVEMENT IMPAIRED 2 [Morus notabilis]
gi|590652560|ref|XP_007033184.1|   2.6e-160   51.34       Uncharacterized protein isoform 5 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR008545   Web
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function
This gene is associated with the following unigenes:
Unigene Name   Analysis Name                         Sequence type in Unigene
CU160907       cucumber EST collection version 3.0   transcribed_cluster
CU173131       cucumber EST collection version 3.0   transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Csa2G404900.1   Csa2G404900.1   mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name   Unique Name   Type
CU160907       CU160907      transcribed_cluster
CU173131       CU173131      transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term    IPR Description    Source    Source Term     Source Description                             Alignment
IPR008545   WEB family         PFAM      PF05701         WEMBL                                          coord: 25..567; score: 2.3
None        No IPR available   unknown   Coil            Coil                                           coord: 175..209; score: -
                                                                                                        coord: 305..325; score: -
                                                                                                        coord: 361..381; score: -
                                                                                                        coord: 389..430; score: -
                                                                                                        coord: 519..548; score: -
                                                                                                        coord: 235..272; score: -
                                                                                                        coord: 130..160; scor
None        No IPR available   PANTHER   PTHR32054       FAMILY NOT NAMED                               coord: 4..637; score: 3.6E
None        No IPR available   PANTHER   PTHR32054:SF5   PROTEIN PLASTID MOVEMENT IMPAIRED 15-RELATED   coord: 4..637; score: 3.6E