Csa1G073780 (gene) Cucumber (Chinese Long) v2

NameCsa1G073780
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionPentatricopeptide repeat-containing protein, putative; contains IPR002885 (Pentatricopeptide repeat), IPR011990 (Tetratricopeptide-like helical)
LocationChr1 : 7546815 .. 7549078 (-)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGTATATGATTTCTCACTCTGCTTCTCTTCTTCTTCTTCGTTCTAAGAGGAGCTCTCTTCGGTTCTCATTCATGGATCAGAAGCTCTTCTCCAAAGCTTTAACACGCTATGCTCTAGCTAGCCGATCTTACCACACGACTCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGAATCAGCGTGGAGTCGGAACTCGACGGTTGGGTCCAGGAGGGGAAGAAGCTACGAGTCGCTGAGCTTCAGAGAATCATTCGCGACTTTCGCAAGCGCAATCGTTTTAGCCAAGCTCTTGAGGTTCGAATTATTCTTTTCTTCTGGTTTGAGTATTTAGATGTGGAGAAAGAAATGAACTGTATAAAAATGATGCATCGTGTGAGAAGAATGTGCCTTTTAGACAATTCTTACTTCGAATCTCGAGATTACGGTGAATGGGGAATGGGGTTGTTTCAAATTATCGAAAGATTTAGTTTGGAGAAATGGAAAGCACGTAGGTTTTTTATGCTAAGAAATCCGAAGCTCTTACCTTACGTTCTTTGATTAAGTGAGCTTGTTCGAGTAACTTCGTTAGTTCACAGAATATGGACTTTTCAGAAATGGAATTATGTCATTTCTCCCCTACATGATCAGGAATTTGATTTCACTCGCAAGGAATTTAACTCTTTGTAGATAATGATATATGAGCTAGAAGTATGTGGGTTTGAACATGTTTTTCGTGGAGTTTCACTGTAATTGAAGATGAGTATTACTGCAGGTGTCCGAATGGATGAAAAAAAGTGGTGCCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATTTGATTGGCCGAGTACGAGGTTCTCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTATTAAAACATATGGTGCTCTTCTGAATTGCTATGTTCGACAGCGGCAAGTGGACAAATCCCTCTCCCATTTTCAAAAAATGAAAGAGTTGGGATTTGCAACTTCAGAGCTCACTTACAATGACATCATGTGTTTGTACACAAGAGTTGGCCAGCATGAGAAGGTCCCTGAGGTGCTAGCAGAGATGAAAGAGAATAATGTTTCTCCCGACAACTTTAGTTATAGAATCTGCATCAGTTCGTATGGTGCAAGAAAAGATATTGAGGGGATGGAGAATGTATTGAAAGAGATGGAATCTCAACCTCATATTGTAATGGACTGGAACACATATGCAGTAGTTGCAAACTTCTTTATAAAAGCTGCTCTTACTGATAAGGCAGTTGATGCCTTGAAAAAATCAGAAGAGAAACTGAAAAGCAGTAACGATAGAATCGGCCATAACCAGCTGATCTCGCTTTATGCAACCTTAGGTAACAAGGAAAAGGTGCTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGAATCATCAATAGGGACTACATCACGATGCTTGAATCTTTGGTGAGACTAGGTGAACTTGAAGAAGCTGAGAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACATTGTCATTATTGGATATATTGACAAGGGAATGTGTGAGAGAGGTGAAACACTTCTTGAAAACTTGAAGCAGAATGAAAAGGCTACCACACCAAACAGTTGGGGTGCTCTGGCTGTTAAGTATCTGGACCTGGGTGAGACCGAAAAAGCCTTAGAGTGTATGATGACAGCCCTTTCTGTAAACATTGGTAAAGGATGGAAGCCTAATCTTCGGGTGATCACAGGACTATTGAATTGGCTTGGTGATAAGGGCATTGTAGAAGAAGTAGAAGCTTTTGTAAGCGCATTGAGGTCTGTCACTCCAGTGAATAGAGAGATGTATCATGCCTTGTTAAAGGTTTATATAAGAGCTGATAAAGAAGTAAAGGAGGTGTTAAACAACATGAAGGCTGATAAAATAGATGAAGATGAAGAAACCAAGAAAATTCTTGGCACTTAGGAAGAAACAACTGAACGTAAGAGCGTTGCCTCATTATTTTTGTTATTGTATCTCATTATTGATGGTGTCCATCTGCCAAATCTTGCTAGGGTTTCAACAATGCAGACGCTTGTTGATCAAATTCTTCACTTGAAATGAGTAGAAGTAGCTGATTTCATTGCCAAGTTTGCTGCTTTTGCTTCGCTTAGTTTTTAATTAAAGTTTTGTGGGCTAATAAAGCGAGGTTGCATGCAGGGATGATTGTGTATTTCATACGTAGGGGG

mRNA sequence

ATGATTTCTCACTCTGCTTCTCTTCTTCTTCTTCGTTCTAAGAGGAGCTCTCTTCGGTTCTCATTCATGGATCAGAAGCTCTTCTCCAAAGCTTTAACACGCTATGCTCTAGCTAGCCGATCTTACCACACGACTCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGAATCAGCGTGGAGTCGGAACTCGACGGTTGGGTCCAGGAGGGGAAGAAGCTACGAGTCGCTGAGCTTCAGAGAATCATTCGCGACTTTCGCAAGCGCAATCGTTTTAGCCAAGCTCTTGAGGTGTCCGAATGGATGAAAAAAAGTGGTGCCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATTTGATTGGCCGAGTACGAGGTTCTCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTATTAAAACATATGGTGCTCTTCTGAATTGCTATGTTCGACAGCGGCAAGTGGACAAATCCCTCTCCCATTTTCAAAAAATGAAAGAGTTGGGATTTGCAACTTCAGAGCTCACTTACAATGACATCATGTGTTTGTACACAAGAGTTGGCCAGCATGAGAAGGTCCCTGAGGTGCTAGCAGAGATGAAAGAGAATAATGTTTCTCCCGACAACTTTAGTTATAGAATCTGCATCAGTTCGTATGGTGCAAGAAAAGATATTGAGGGGATGGAGAATGTATTGAAAGAGATGGAATCTCAACCTCATATTGTAATGGACTGGAACACATATGCAGTAGTTGCAAACTTCTTTATAAAAGCTGCTCTTACTGATAAGGCAGTTGATGCCTTGAAAAAATCAGAAGAGAAACTGAAAAGCAGTAACGATAGAATCGGCCATAACCAGCTGATCTCGCTTTATGCAACCTTAGGTAACAAGGAAAAGGTGCTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGAATCATCAATAGGGACTACATCACGATGCTTGAATCTTTGGTGAGACTAGGTGAACTTGAAGAAGCTGAGAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACATTGTCATTATTGGATATATTGACAAGGGAATGTGTGAGAGAGGTGAAACACTTCTTGAAAACTTGAAGCAGAATGAAAAGGCTACCACACCAAACAGTTGGGGTGCTCTGGCTGTTAAGTATCTGGACCTGGGTGAGACCGAAAAAGCCTTAGAGTGTATGATGACAGCCCTTTCTGTAAACATTGGTAAAGGATGGAAGCCTAATCTTCGGGTGATCACAGGACTATTGAATTGGCTTGGTGATAAGGGCATTGTAGAAGAAGTAGAAGCTTTTGTAAGCGCATTGAGGTCTGTCACTCCAGTGAATAGAGAGATGTATCATGCCTTGTTAAAGGTTTATATAAGAGCTGATAAAGAAGTAAAGGAGGTGTTAAACAACATGAAGGCTGATAAAATAGATGAAGATGAAGAAACCAAGAAAATTCTTGGCACTTAG

Coding sequence (CDS)

ATGATTTCTCACTCTGCTTCTCTTCTTCTTCTTCGTTCTAAGAGGAGCTCTCTTCGGTTCTCATTCATGGATCAGAAGCTCTTCTCCAAAGCTTTAACACGCTATGCTCTAGCTAGCCGATCTTACCACACGACTCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGAATCAGCGTGGAGTCGGAACTCGACGGTTGGGTCCAGGAGGGGAAGAAGCTACGAGTCGCTGAGCTTCAGAGAATCATTCGCGACTTTCGCAAGCGCAATCGTTTTAGCCAAGCTCTTGAGGTGTCCGAATGGATGAAAAAAAGTGGTGCCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATTTGATTGGCCGAGTACGAGGTTCTCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTATTAAAACATATGGTGCTCTTCTGAATTGCTATGTTCGACAGCGGCAAGTGGACAAATCCCTCTCCCATTTTCAAAAAATGAAAGAGTTGGGATTTGCAACTTCAGAGCTCACTTACAATGACATCATGTGTTTGTACACAAGAGTTGGCCAGCATGAGAAGGTCCCTGAGGTGCTAGCAGAGATGAAAGAGAATAATGTTTCTCCCGACAACTTTAGTTATAGAATCTGCATCAGTTCGTATGGTGCAAGAAAAGATATTGAGGGGATGGAGAATGTATTGAAAGAGATGGAATCTCAACCTCATATTGTAATGGACTGGAACACATATGCAGTAGTTGCAAACTTCTTTATAAAAGCTGCTCTTACTGATAAGGCAGTTGATGCCTTGAAAAAATCAGAAGAGAAACTGAAAAGCAGTAACGATAGAATCGGCCATAACCAGCTGATCTCGCTTTATGCAACCTTAGGTAACAAGGAAAAGGTGCTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGAATCATCAATAGGGACTACATCACGATGCTTGAATCTTTGGTGAGACTAGGTGAACTTGAAGAAGCTGAGAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACATTGTCATTATTGGATATATTGACAAGGGAATGTGTGAGAGAGGTGAAACACTTCTTGAAAACTTGAAGCAGAATGAAAAGGCTACCACACCAAACAGTTGGGGTGCTCTGGCTGTTAAGTATCTGGACCTGGGTGAGACCGAAAAAGCCTTAGAGTGTATGATGACAGCCCTTTCTGTAAACATTGGTAAAGGATGGAAGCCTAATCTTCGGGTGATCACAGGACTATTGAATTGGCTTGGTGATAAGGGCATTGTAGAAGAAGTAGAAGCTTTTGTAAGCGCATTGAGGTCTGTCACTCCAGTGAATAGAGAGATGTATCATGCCTTGTTAAAGGTTTATATAAGAGCTGATAAAGAAGTAAAGGAGGTGTTAAACAACATGAAGGCTGATAAAATAGATGAAGATGAAGAAACCAAGAAAATTCTTGGCACTTAG

Protein sequence

MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDEDEETKKILGT*
BLAST of Csa1G073780 vs. Swiss-Prot
Match: PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 561.6 bits (1446), Expect = 8.8e-159
Identity = 279/478 (58.37%), Postives = 356/478 (74.48%), Query Frame = 1

Query: 37  LASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRN 96
           +ASR Y+T R+KK TLY+KISPLGDP+ SV  EL  WVQ GKK+ VAEL RI+ D R+R 
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 97  RFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGA 156
           RF  ALEVS+WM ++G C+FSPTEHAV LDLIGRV G ++AE YF  LKEQ +  KTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 157 LLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENN 216
           LLNCYVRQ+ V+KSL HF+KMKE+GF TS LTYN+IMCLYT +GQHEKVP+VL EMKE N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 217 VSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKA 276
           V+PDN+SYRICI+++GA  D+E +   L++ME +  I MDWNTYAV A F+I     D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 277 VDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLE 336
           V+ LK SE +L+   D  G+N LI+LYA LG K +VLRLW+L+K    R IN+DY+T+L+
Sbjct: 252 VELLKMSENRLE-KKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 311

Query: 337 SLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKAT 396
           SLV++  L EAE+VL EW+SSGNCYDFRVPN VI GYI K M E+ E +LE+L +  KAT
Sbjct: 312 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 371

Query: 397 TPNSWGALAVKYLDLGETEKALECMMTALSVNIG-KGWKPNLRVITGLLNWLGDKGIVEE 456
           TP SW  +A  Y + G  E A +CM TAL V +G + W+P L ++T +L+W+GD+G ++E
Sbjct: 372 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 431

Query: 457 VEAFVSALRSVTPVNREMYHALLKVYIR-ADKEVKEVLNNMKADKIDEDEETKKILGT 513
           VE+FV++LR+   VN++MYHAL+K  IR   + +  +L  MK DKI+ DEET  IL T
Sbjct: 432 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of Csa1G073780 vs. Swiss-Prot
Match: PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 9.3e-76
Identity = 152/428 (35.51%), Postives = 247/428 (57.71%), Query Frame = 1

Query: 51  TLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKK 110
           TL  +++  GDP  S+   LDGW+ +G  ++ +EL  II+  RK +RFS AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 111 SGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKS 170
                 S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + K+
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 171 LSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISS 230
              FQ+MKELGF    L YN ++ LY R G++  V ++L EM++  V PD F+    + +
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 231 YGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSS 290
           Y    D+EGME  L   E+   + +DW TYA  AN +IKA LT+KA++ L+KSE+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 291 NDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKV 350
             +  +  L+S Y   G KE+V RLW+L K       N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 351 LKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLD 410
           ++EWE+  + +D R+P+++I GY  KGM E+ E ++  L Q  +    ++W  LA+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 411 LGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVN 470
            G+ EKA+E    A+ V+   GW+P+  V+   +++L  +    ++E     LR ++   
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYLEGQ---RDMEGLRKILRLLSERG 458

Query: 471 REMYHALL 479
              Y  LL
Sbjct: 459 HISYDQLL 461

BLAST of Csa1G073780 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 275.4 bits (703), Expect = 1.3e-72
Identity = 154/453 (34.00%), Postives = 254/453 (56.07%), Query Frame = 1

Query: 52  LYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKS 111
           +Y KIS +  P +   S L+ W + G+KL   EL R++++ RK  R +QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 112 GACI-FSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKS 171
           G     S ++ A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  +K+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 172 LSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISS 231
            +    M++ G+A   L +N +M LY  + +++KV  ++ EMK+ ++  D +SY I +SS
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 232 YGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSS 291
            G+   +E ME V ++M+S   I  +W T++ +A  +IK   T+KA DAL+K E ++   
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 292 NDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKV 351
           N RI ++ L+SLY +LGNK+++ R+W++ K+    I N  Y  ++ SLVR+G++E AEKV
Sbjct: 309 N-RIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKV 368

Query: 352 LKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLD 411
            +EW    + YD R+PN+++  Y+     E  E L +++ +     + ++W  LAV +  
Sbjct: 369 YEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTR 428

Query: 412 LGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVN 471
                +AL C+  A S      W+P + +++G      ++  V   EA +  LR    + 
Sbjct: 429 KRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLE 488

Query: 472 REMYHALLKVYIRADKEVKEVLNNMKADKIDED 504
            + Y AL+      D +    +NN + D  + D
Sbjct: 489 DKSYLALI------DVDENRTVNNSEIDAHETD 514

BLAST of Csa1G073780 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 1.0e-69
Identity = 146/470 (31.06%), Postives = 252/470 (53.62%), Query Frame = 1

Query: 44  TTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALE 103
           T +  +  LY ++   G   + V  +L+ +++  K +   E+   I+  R R  +  AL+
Sbjct: 17  TKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALK 76

Query: 104 VSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVR 163
           +SE M++ G    + ++ A+ LDL+ + R   + ENYF  L E  +T  TYG+LLNCY +
Sbjct: 77  LSEVMEERGMNK-TVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCK 136

Query: 164 QRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFS 223
           +   +K+     KMKEL    S ++YN +M LYT+ G+ EKVP ++ E+K  NV PD+++
Sbjct: 137 ELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYT 196

Query: 224 YRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKS 283
           Y + + +  A  DI G+E V++EM     +  DW TY+ +A+ ++ A L+ KA  AL++ 
Sbjct: 197 YNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQEL 256

Query: 284 EEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGE 343
           E K  +  D   +  LI+LY  LG   +V R+W   + A  +  N  Y+ M++ LV+L +
Sbjct: 257 EMK-NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLND 316

Query: 344 LEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGA 403
           L  AE + KEW+++ + YD R+ N++I  Y  +G+ ++   L E   +        +W  
Sbjct: 317 LPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEI 376

Query: 404 LAVKYLDLGETEKALECMMTALSVNIGKG--WKPNLRVITGLLNWLGDKGIVEEVEAFVS 463
               Y+  G+  +ALECM  A+S+  G G  W P+   +  L+++   K  V   E  + 
Sbjct: 377 FMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLE 436

Query: 464 ALRSVTP-VNREMYHALLKVYIRADKEVKEVLNNMKADKIDEDEETKKIL 511
            L++ T  +  E++  L++ Y  A K    +   +K + ++ +E TKK+L
Sbjct: 437 ILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

BLAST of Csa1G073780 vs. Swiss-Prot
Match: PP302_ARATH (Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidopsis thaliana GN=At4g02820 PE=2 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 3.2e-68
Identity = 150/439 (34.17%), Postives = 248/439 (56.49%), Query Frame = 1

Query: 73  WVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVR 132
           W +EG  +R  EL RI+R+ RK  R+  ALE+ EWM           ++AV LDLI ++R
Sbjct: 84  WKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDIKLQAGDYAVHLDLISKIR 143

Query: 133 GSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDI 192
           G  SAE +F  + +Q +      +LL+ YV+ +  DK+ + F+KM E GF  S L YN +
Sbjct: 144 GLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALFEKMGECGFLKSCLPYNHM 203

Query: 193 MCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENV-LKEMESQP 252
           + +Y   GQ EKVP ++ E+K    SPD  +Y + ++++ +  D+EG E V LK  E + 
Sbjct: 204 LSMYISRGQFEKVPVLIKELK-IRTSPDIVTYNLWLTAFASGNDVEGAEKVYLKAKEEK- 263

Query: 253 HIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEK 312
            +  DW TY+V+ N + K    +KA  ALK+  EKL S  +R+ +  LISL+A LG+K+ 
Sbjct: 264 -LNPDWVTYSVLTNLYAKTDNVEKARLALKEM-EKLVSKKNRVAYASLISLHANLGDKDG 323

Query: 313 VLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVII 372
           V   W   K++  ++ + +Y++M+ ++V+LGE E+A+ +  EWES     D R+PN+++ 
Sbjct: 324 VNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEWESVSGTGDARIPNLILA 383

Query: 373 GYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGK 432
            Y+++     GE   E + +     + ++W  L   YL   + EK L+C   A  ++  K
Sbjct: 384 EYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDMEKVLDCFGKA--IDSVK 443

Query: 433 GWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEV 492
            W  N+R++ G    L ++G V+  E  ++ L+    VN ++Y++LL+ Y +A +    V
Sbjct: 444 KWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQLYNSLLRTYAKAGEMALIV 503

Query: 493 LNNMKADKIDEDEETKKIL 511
              M  D ++ DEETK+++
Sbjct: 504 EERMAKDNVELDEETKELI 516

BLAST of Csa1G073780 vs. TrEMBL
Match: A0A0A0LXD9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073780 PE=4 SV=1)

HSP 1 Score: 1011.9 bits (2615), Expect = 2.7e-292
Identity = 512/512 (100.00%), Postives = 512/512 (100.00%), Query Frame = 1

Query: 1   MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG 60
           MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG
Sbjct: 1   MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG 60

Query: 61  DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE 120
           DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE
Sbjct: 61  DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE 120

Query: 121 HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL 180
           HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL
Sbjct: 121 HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL 180

Query: 181 GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM 240
           GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM
Sbjct: 181 GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM 240

Query: 241 ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI 300
           ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI
Sbjct: 241 ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI 300

Query: 301 SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC 360
           SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC
Sbjct: 301 SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC 360

Query: 361 YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC 420
           YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC
Sbjct: 361 YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC 420

Query: 421 MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV 480
           MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV
Sbjct: 421 MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV 480

Query: 481 YIRADKEVKEVLNNMKADKIDEDEETKKILGT 513
           YIRADKEVKEVLNNMKADKIDEDEETKKILGT
Sbjct: 481 YIRADKEVKEVLNNMKADKIDEDEETKKILGT 512

BLAST of Csa1G073780 vs. TrEMBL
Match: F6HRW8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0597g00040 PE=4 SV=1)

HSP 1 Score: 693.0 bits (1787), Expect = 2.8e-196
Identity = 350/502 (69.72%), Postives = 408/502 (81.27%), Query Frame = 1

Query: 23  MDQKLFS---KALTRYA--------LASRSYHTTRLKKATLYAKISPLGDPRISVESELD 82
           MD +LFS   +++ +Y         +++R+Y+T+R  K +LY KISPLGDP  SV  ELD
Sbjct: 1   MDSRLFSLLRQSIQQYPQSLIRKNPISNRTYYTSRYGKISLYNKISPLGDPNTSVVPELD 60

Query: 83  GWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRV 142
            WVQ G K+ VAELQRII D RKR RFSQALE+SEWM K G C FSPTEHAVQLDLIGRV
Sbjct: 61  NWVQNGNKVWVAELQRIIHDLRKRKRFSQALEISEWMSKKGICAFSPTEHAVQLDLIGRV 120

Query: 143 RGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYND 202
           RG LSAE+YFN L+  D+T KTYGALLNCYVRQRQ DKSLSH QKMKE+GFA+S LTYND
Sbjct: 121 RGFLSAESYFNSLQNHDKTDKTYGALLNCYVRQRQTDKSLSHLQKMKEMGFASSPLTYND 180

Query: 203 IMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQP 262
           IMCLYT VGQHEKVP+VL EMK++NV PDNFSYRICI+SYGA+ DI+GMENVLKEME QP
Sbjct: 181 IMCLYTNVGQHEKVPDVLTEMKQSNVYPDNFSYRICINSYGAQSDIQGMENVLKEMERQP 240

Query: 263 HIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEK 322
           HIVMDWNTYAV ANF+IKA L DKA++ALKKSEE+L    D +G+N LISLYA+LGNK +
Sbjct: 241 HIVMDWNTYAVAANFYIKAGLPDKAIEALKKSEERL-DKRDGLGYNHLISLYASLGNKAE 300

Query: 323 VLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVII 382
           VLRLW+L+K+A  R INRDYITMLESLVRLGELEEAEKVL+EWESSGNCYDFRVPNIVII
Sbjct: 301 VLRLWSLEKSACKRNINRDYITMLESLVRLGELEEAEKVLREWESSGNCYDFRVPNIVII 360

Query: 383 GYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNI-G 442
           GY +KG+ E+ E +L+ L +  K TTPNSWG +A  Y+D GE EKA+ECM  A+S+++  
Sbjct: 361 GYSEKGLFEKAEAMLKELMEKGKITTPNSWGTVASGYMDEGEMEKAVECMKAAISLHVNN 420

Query: 443 KGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKE 502
           KG KPN RVI G+L+WLGDKG VE+VEAFV +LR V P+NR MYH L+   IRA KEV  
Sbjct: 421 KGRKPNSRVIAGILSWLGDKGRVEDVEAFVGSLRIVIPMNRRMYHTLIMANIRAGKEVDG 480

Query: 503 VLNNMKADKIDEDEETKKILGT 513
           +L +MKADKI EDEETKKILGT
Sbjct: 481 LLASMKADKIVEDEETKKILGT 501

BLAST of Csa1G073780 vs. TrEMBL
Match: M5WLN5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022486mg PE=4 SV=1)

HSP 1 Score: 689.5 bits (1778), Expect = 3.1e-195
Identity = 337/489 (68.92%), Postives = 401/489 (82.00%), Query Frame = 1

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MD KLF+K L R A+ SRSY+T+R KK TLY KISPLG+P ++V  ELD WV +G K+RV
Sbjct: 1   MDPKLFAKTLIRSAMTSRSYYTSRTKKPTLYTKISPLGNPSLNVVPELDDWVYKGHKVRV 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRII D RKR RFSQAL++SEWMK+ G CIFSP EHAVQLDLIG+VRG +SAE YFN
Sbjct: 61  AELQRIIHDLRKRKRFSQALQISEWMKQKGICIFSPVEHAVQLDLIGKVRGLVSAEEYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
            L+E+D+ +KTYGALLNCYVRQ Q DKSL+H +KMKE+GFA+S LTYNDIMCLYT VG+H
Sbjct: 121 NLREEDKNLKTYGALLNCYVRQLQTDKSLAHLRKMKEMGFASSPLTYNDIMCLYTNVGEH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           EKVP VL EMKENNV PDNFSYRICI+SYG R D+EGME VL+EMESQPHIVMDWNTYAV
Sbjct: 181 EKVPGVLTEMKENNVPPDNFSYRICINSYGVRSDLEGMEKVLEEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANF+IK   T KA++ALKKSEE+L  + D +GHN LISLYA++GNK++VLRLW L+K+A
Sbjct: 241 VANFYIKEGQTHKAINALKKSEERL-DNKDGLGHNHLISLYASMGNKDEVLRLWGLEKSA 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
             R INRDYI +L SLVRLGEL+EAEKV+KEWE SGNCYDFRVP  VIIGY  KG+ ER 
Sbjct: 301 CKRCINRDYIGLLISLVRLGELDEAEKVVKEWELSGNCYDFRVPQTVIIGYTVKGLYERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           E +L +L +  KATTP SW  +A  Y++ GETEKA +CM  AL ++  KGWKPNLRV T 
Sbjct: 361 EAMLGDLMEKGKATTPKSWEIVAAGYVNKGETEKAFQCMKAALCLSAEKGWKPNLRVSTT 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +L+WLGDKG VE+ EAFV  LR+V PVN++MYHALLK Y+R  KEV  VL+ MKADK+++
Sbjct: 421 ILSWLGDKGSVEDAEAFVGLLRNVIPVNKQMYHALLKAYMRGGKEVNSVLDRMKADKVED 480

Query: 503 DE-ETKKIL 511
           D+ ETKK+L
Sbjct: 481 DDIETKKVL 488

BLAST of Csa1G073780 vs. TrEMBL
Match: A5C3G3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_006333 PE=4 SV=1)

HSP 1 Score: 684.5 bits (1765), Expect = 1.0e-193
Identity = 346/502 (68.92%), Postives = 406/502 (80.88%), Query Frame = 1

Query: 23  MDQKLFS---KALTRYA--------LASRSYHTTRLKKATLYAKISPLGDPRISVESELD 82
           MD +LFS   +++ +Y         +++R+Y+T+R  K +LY KISPLGDP  SV  ELD
Sbjct: 1   MDSRLFSLLRQSIQQYPQSLIRKNPISNRTYYTSRYGKISLYNKISPLGDPNTSVVPELD 60

Query: 83  GWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRV 142
            WVQ G K+ VAELQRII D RKR RFSQALE+SEWM K G C FSPTEHAVQLDLIGRV
Sbjct: 61  NWVQNGNKVWVAELQRIIHDLRKRKRFSQALEISEWMSKKGICAFSPTEHAVQLDLIGRV 120

Query: 143 RGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYND 202
           RG LSAE+YFN L+  D+T KTYGALLNCYVRQRQ DKSLSH QKMKE+GFA+S LTYND
Sbjct: 121 RGFLSAESYFNSLQNHDKTDKTYGALLNCYVRQRQTDKSLSHLQKMKEMGFASSPLTYND 180

Query: 203 IMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQP 262
           IMCLYT VGQHEKVP+VL EMK+++V PDNFSYRICI+SY A+ DI+GME VLKEME QP
Sbjct: 181 IMCLYTNVGQHEKVPDVLTEMKQSHVYPDNFSYRICINSYAAQSDIQGMEKVLKEMERQP 240

Query: 263 HIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEK 322
           HIVMDWNTYAV ANF+IKA L DKA++ALKKSEE+L    D +G+N LISLYA+LGNK +
Sbjct: 241 HIVMDWNTYAVAANFYIKAGLPDKAIEALKKSEERL-DKRDGLGYNHLISLYASLGNKAE 300

Query: 323 VLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVII 382
           VLRLW+L+K+A  R INRDYITMLESLVRLGELEEAEKVL+EWESSGNCYDFRVPNIVII
Sbjct: 301 VLRLWSLEKSACKRNINRDYITMLESLVRLGELEEAEKVLREWESSGNCYDFRVPNIVII 360

Query: 383 GYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNI-G 442
           GY +KG+ E+ E +L+ L +  K TTP+SWG +A  Y+D GE EKA+ECM  A+S+++  
Sbjct: 361 GYSEKGLFEKAEAMLKELMEKGKITTPDSWGTVASGYMDEGEMEKAVECMKAAISLHVNN 420

Query: 443 KGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKE 502
           KG KPN RVI G+L+WLGDKG VE+VEAFV +LR V P+NR MYH L+   IRA KEV  
Sbjct: 421 KGRKPNSRVIAGILSWLGDKGRVEDVEAFVGSLRIVIPMNRRMYHTLIMANIRAGKEVDG 480

Query: 503 VLNNMKADKIDEDEETKKILGT 513
           +L +MKADKI EDEETKKILGT
Sbjct: 481 LLASMKADKIVEDEETKKILGT 501

BLAST of Csa1G073780 vs. TrEMBL
Match: A0A0D2VHH2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G027300 PE=4 SV=1)

HSP 1 Score: 675.6 bits (1742), Expect = 4.7e-191
Identity = 328/483 (67.91%), Postives = 395/483 (81.78%), Query Frame = 1

Query: 29  SKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRI 88
           SK+L +  + SR Y+T R  KATLY++ISPLG P  SVE+ELD W++ G  +RVAELQRI
Sbjct: 10  SKSLAQNGVFSRFYYTNRFNKATLYSRISPLGSPDKSVEAELDDWLKHGNNIRVAELQRI 69

Query: 89  IRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQD 148
           I D RKR RF+QAL+VSEWM K G C FSPTEHAVQLDLIG+VRG LSAE+YFN+LK+QD
Sbjct: 70  IHDLRKRKRFTQALQVSEWMNKKGLCAFSPTEHAVQLDLIGKVRGFLSAESYFNKLKDQD 129

Query: 149 QTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEV 208
           +T KTYGALLNCYVRQRQ+DKSLSH QKMKELGFA+S LTYNDIMCLYT +GQHEKVP+V
Sbjct: 130 KTEKTYGALLNCYVRQRQIDKSLSHLQKMKELGFASSTLTYNDIMCLYTNIGQHEKVPDV 189

Query: 209 LAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFI 268
           L EMKENNVSPDNFSYRICI+S G R D+EG+E +L EME QPHI MDWNTYAVVA+F+I
Sbjct: 190 LREMKENNVSPDNFSYRICINSLGVRSDLEGIEEILTEMEDQPHIKMDWNTYAVVASFYI 249

Query: 269 KAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIIN 328
           KA LT+KA+DALKKSE+KL  + D  G+N LISLY +LGNK +VLRLW L+K A  R IN
Sbjct: 250 KAGLTEKAIDALKKSEQKL-DNKDGTGYNHLISLYTSLGNKAEVLRLWGLEKEACKRYIN 309

Query: 329 RDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLEN 388
           +D+I ML+SLV+L E EEAEK+LKEWESSGN YDFR+PNI+I+GY+ KG+ E+ ET+LEN
Sbjct: 310 KDFIIMLQSLVKLDEFEEAEKILKEWESSGNYYDFRIPNIIIVGYVKKGLHEKAETMLEN 369

Query: 389 LKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSV-NIGKGWKPNLRVITGLLNWL 448
           LK+  K T PNSWG +A  YLD G+ +KA +CM  ALS+    KGWKPNLRV+T +L+WL
Sbjct: 370 LKEKGKTTIPNSWGIVAASYLDKGQAKKAFKCMKAALSLFTENKGWKPNLRVVTSILDWL 429

Query: 449 GDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDEDEETK 508
           GD+G V+EVE FV +L+   PV+R+MYH LLK  +R  + V +VL+ MKADKI+EDEETK
Sbjct: 430 GDEGSVQEVEEFVESLKRTVPVDRKMYHTLLKANVRHGERVDKVLDLMKADKINEDEETK 489

Query: 509 KIL 511
            IL
Sbjct: 490 SIL 491

BLAST of Csa1G073780 vs. TAIR10
Match: AT4G21705.1 (AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 561.6 bits (1446), Expect = 5.0e-160
Identity = 279/478 (58.37%), Postives = 356/478 (74.48%), Query Frame = 1

Query: 37  LASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRN 96
           +ASR Y+T R+KK TLY+KISPLGDP+ SV  EL  WVQ GKK+ VAEL RI+ D R+R 
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 97  RFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGA 156
           RF  ALEVS+WM ++G C+FSPTEHAV LDLIGRV G ++AE YF  LKEQ +  KTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 157 LLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENN 216
           LLNCYVRQ+ V+KSL HF+KMKE+GF TS LTYN+IMCLYT +GQHEKVP+VL EMKE N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 217 VSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKA 276
           V+PDN+SYRICI+++GA  D+E +   L++ME +  I MDWNTYAV A F+I     D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 277 VDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLE 336
           V+ LK SE +L+   D  G+N LI+LYA LG K +VLRLW+L+K    R IN+DY+T+L+
Sbjct: 252 VELLKMSENRLE-KKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 311

Query: 337 SLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKAT 396
           SLV++  L EAE+VL EW+SSGNCYDFRVPN VI GYI K M E+ E +LE+L +  KAT
Sbjct: 312 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 371

Query: 397 TPNSWGALAVKYLDLGETEKALECMMTALSVNIG-KGWKPNLRVITGLLNWLGDKGIVEE 456
           TP SW  +A  Y + G  E A +CM TAL V +G + W+P L ++T +L+W+GD+G ++E
Sbjct: 372 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 431

Query: 457 VEAFVSALRSVTPVNREMYHALLKVYIR-ADKEVKEVLNNMKADKIDEDEETKKILGT 513
           VE+FV++LR+   VN++MYHAL+K  IR   + +  +L  MK DKI+ DEET  IL T
Sbjct: 432 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of Csa1G073780 vs. TAIR10
Match: AT2G20710.1 (AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 285.8 bits (730), Expect = 5.2e-77
Identity = 152/428 (35.51%), Postives = 247/428 (57.71%), Query Frame = 1

Query: 51  TLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKK 110
           TL  +++  GDP  S+   LDGW+ +G  ++ +EL  II+  RK +RFS AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 111 SGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKS 170
                 S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + K+
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 171 LSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISS 230
              FQ+MKELGF    L YN ++ LY R G++  V ++L EM++  V PD F+    + +
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 231 YGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSS 290
           Y    D+EGME  L   E+   + +DW TYA  AN +IKA LT+KA++ L+KSE+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 291 NDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKV 350
             +  +  L+S Y   G KE+V RLW+L K       N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 351 LKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLD 410
           ++EWE+  + +D R+P+++I GY  KGM E+ E ++  L Q  +    ++W  LA+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 411 LGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVN 470
            G+ EKA+E    A+ V+   GW+P+  V+   +++L  +    ++E     LR ++   
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYLEGQ---RDMEGLRKILRLLSERG 458

Query: 471 REMYHALL 479
              Y  LL
Sbjct: 459 HISYDQLL 461

BLAST of Csa1G073780 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 275.4 bits (703), Expect = 7.1e-74
Identity = 154/453 (34.00%), Postives = 254/453 (56.07%), Query Frame = 1

Query: 52  LYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKS 111
           +Y KIS +  P +   S L+ W + G+KL   EL R++++ RK  R +QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 112 GACI-FSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKS 171
           G     S ++ A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  +K+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 172 LSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISS 231
            +    M++ G+A   L +N +M LY  + +++KV  ++ EMK+ ++  D +SY I +SS
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 232 YGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSS 291
            G+   +E ME V ++M+S   I  +W T++ +A  +IK   T+KA DAL+K E ++   
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 292 NDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKV 351
           N RI ++ L+SLY +LGNK+++ R+W++ K+    I N  Y  ++ SLVR+G++E AEKV
Sbjct: 309 N-RIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKV 368

Query: 352 LKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLD 411
            +EW    + YD R+PN+++  Y+     E  E L +++ +     + ++W  LAV +  
Sbjct: 369 YEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTR 428

Query: 412 LGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVN 471
                +AL C+  A S      W+P + +++G      ++  V   EA +  LR    + 
Sbjct: 429 KRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLE 488

Query: 472 REMYHALLKVYIRADKEVKEVLNNMKADKIDED 504
            + Y AL+      D +    +NN + D  + D
Sbjct: 489 DKSYLALI------DVDENRTVNNSEIDAHETD 514

BLAST of Csa1G073780 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 265.8 bits (678), Expect = 5.6e-71
Identity = 146/470 (31.06%), Postives = 252/470 (53.62%), Query Frame = 1

Query: 44  TTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALE 103
           T +  +  LY ++   G   + V  +L+ +++  K +   E+   I+  R R  +  AL+
Sbjct: 17  TKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALK 76

Query: 104 VSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVR 163
           +SE M++ G    + ++ A+ LDL+ + R   + ENYF  L E  +T  TYG+LLNCY +
Sbjct: 77  LSEVMEERGMNK-TVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCK 136

Query: 164 QRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFS 223
           +   +K+     KMKEL    S ++YN +M LYT+ G+ EKVP ++ E+K  NV PD+++
Sbjct: 137 ELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYT 196

Query: 224 YRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKS 283
           Y + + +  A  DI G+E V++EM     +  DW TY+ +A+ ++ A L+ KA  AL++ 
Sbjct: 197 YNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQEL 256

Query: 284 EEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGE 343
           E K  +  D   +  LI+LY  LG   +V R+W   + A  +  N  Y+ M++ LV+L +
Sbjct: 257 EMK-NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLND 316

Query: 344 LEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGA 403
           L  AE + KEW+++ + YD R+ N++I  Y  +G+ ++   L E   +        +W  
Sbjct: 317 LPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEI 376

Query: 404 LAVKYLDLGETEKALECMMTALSVNIGKG--WKPNLRVITGLLNWLGDKGIVEEVEAFVS 463
               Y+  G+  +ALECM  A+S+  G G  W P+   +  L+++   K  V   E  + 
Sbjct: 377 FMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLE 436

Query: 464 ALRSVTP-VNREMYHALLKVYIRADKEVKEVLNNMKADKIDEDEETKKIL 511
            L++ T  +  E++  L++ Y  A K    +   +K + ++ +E TKK+L
Sbjct: 437 ILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

BLAST of Csa1G073780 vs. TAIR10
Match: AT4G02820.1 (AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 260.8 bits (665), Expect = 1.8e-69
Identity = 150/439 (34.17%), Postives = 248/439 (56.49%), Query Frame = 1

Query: 73  WVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVR 132
           W +EG  +R  EL RI+R+ RK  R+  ALE+ EWM           ++AV LDLI ++R
Sbjct: 84  WKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDIKLQAGDYAVHLDLISKIR 143

Query: 133 GSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDI 192
           G  SAE +F  + +Q +      +LL+ YV+ +  DK+ + F+KM E GF  S L YN +
Sbjct: 144 GLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALFEKMGECGFLKSCLPYNHM 203

Query: 193 MCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENV-LKEMESQP 252
           + +Y   GQ EKVP ++ E+K    SPD  +Y + ++++ +  D+EG E V LK  E + 
Sbjct: 204 LSMYISRGQFEKVPVLIKELK-IRTSPDIVTYNLWLTAFASGNDVEGAEKVYLKAKEEK- 263

Query: 253 HIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEK 312
            +  DW TY+V+ N + K    +KA  ALK+  EKL S  +R+ +  LISL+A LG+K+ 
Sbjct: 264 -LNPDWVTYSVLTNLYAKTDNVEKARLALKEM-EKLVSKKNRVAYASLISLHANLGDKDG 323

Query: 313 VLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVII 372
           V   W   K++  ++ + +Y++M+ ++V+LGE E+A+ +  EWES     D R+PN+++ 
Sbjct: 324 VNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEWESVSGTGDARIPNLILA 383

Query: 373 GYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGK 432
            Y+++     GE   E + +     + ++W  L   YL   + EK L+C   A  ++  K
Sbjct: 384 EYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDMEKVLDCFGKA--IDSVK 443

Query: 433 GWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEV 492
            W  N+R++ G    L ++G V+  E  ++ L+    VN ++Y++LL+ Y +A +    V
Sbjct: 444 KWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQLYNSLLRTYAKAGEMALIV 503

Query: 493 LNNMKADKIDEDEETKKIL 511
              M  D ++ DEETK+++
Sbjct: 504 EERMAKDNVELDEETKELI 516

BLAST of Csa1G073780 vs. NCBI nr
Match: gi|700209575|gb|KGN64671.1| (hypothetical protein Csa_1G073780 [Cucumis sativus])

HSP 1 Score: 1011.9 bits (2615), Expect = 3.9e-292
Identity = 512/512 (100.00%), Postives = 512/512 (100.00%), Query Frame = 1

Query: 1   MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG 60
           MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG
Sbjct: 1   MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG 60

Query: 61  DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE 120
           DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE
Sbjct: 61  DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE 120

Query: 121 HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL 180
           HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL
Sbjct: 121 HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL 180

Query: 181 GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM 240
           GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM
Sbjct: 181 GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM 240

Query: 241 ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI 300
           ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI
Sbjct: 241 ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI 300

Query: 301 SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC 360
           SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC
Sbjct: 301 SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC 360

Query: 361 YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC 420
           YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC
Sbjct: 361 YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC 420

Query: 421 MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV 480
           MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV
Sbjct: 421 MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV 480

Query: 481 YIRADKEVKEVLNNMKADKIDEDEETKKILGT 513
           YIRADKEVKEVLNNMKADKIDEDEETKKILGT
Sbjct: 481 YIRADKEVKEVLNNMKADKIDEDEETKKILGT 512

BLAST of Csa1G073780 vs. NCBI nr
Match: gi|778658722|ref|XP_011653147.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Cucumis sativus])

HSP 1 Score: 973.0 bits (2514), Expect = 2.0e-280
Identity = 490/490 (100.00%), Postives = 490/490 (100.00%), Query Frame = 1

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV
Sbjct: 1   MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN
Sbjct: 61  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH
Sbjct: 121 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA
Sbjct: 241 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG
Sbjct: 301 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG
Sbjct: 361 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE
Sbjct: 421 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 490

BLAST of Csa1G073780 vs. NCBI nr
Match: gi|659068038|ref|XP_008442422.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Cucumis melo])

HSP 1 Score: 904.0 bits (2335), Expect = 1.2e-259
Identity = 455/490 (92.86%), Postives = 467/490 (95.31%), Query Frame = 1

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQKLFSKALT YALASRSYHTTRLKKATLYAKISPLGDP ISVESELDGWVQEGKK+RV
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRIIRDFRKR+RFSQAL+VSEWMKKSGACIFSPTEHAVQLDLIGRVRG LSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQ IKTYGALLNCYVRQ+QVDKSLSH QKMKELGFATSELTYNDIMCLYTRVGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           EKVPEVLAEMK NNVSPDNFSYRICI+SYGARKD+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA LTDKAVDAL+KSEEKLK S DRIGHN LISLYATLGNKEKVLR+WNLDKTA
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLK-SKDRIGHNHLISLYATLGNKEKVLRVWNLDKTA 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           ETLLENL QNEKATTPNSWGA+AVKYLD GETEKALECM  ALSVN  KGWKPN RVITG
Sbjct: 361 ETLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLGDKGIVEEVEAFVSALRSV PVNREMYHALLKVYIRADKEV EVLN MKADKI+E
Sbjct: 421 VLNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 489

BLAST of Csa1G073780 vs. NCBI nr
Match: gi|359483464|ref|XP_003632962.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Vitis vinifera])

HSP 1 Score: 693.0 bits (1787), Expect = 4.0e-196
Identity = 350/502 (69.72%), Postives = 408/502 (81.27%), Query Frame = 1

Query: 23  MDQKLFS---KALTRYA--------LASRSYHTTRLKKATLYAKISPLGDPRISVESELD 82
           MD +LFS   +++ +Y         +++R+Y+T+R  K +LY KISPLGDP  SV  ELD
Sbjct: 1   MDSRLFSLLRQSIQQYPQSLIRKNPISNRTYYTSRYGKISLYNKISPLGDPNTSVVPELD 60

Query: 83  GWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRV 142
            WVQ G K+ VAELQRII D RKR RFSQALE+SEWM K G C FSPTEHAVQLDLIGRV
Sbjct: 61  NWVQNGNKVWVAELQRIIHDLRKRKRFSQALEISEWMSKKGICAFSPTEHAVQLDLIGRV 120

Query: 143 RGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYND 202
           RG LSAE+YFN L+  D+T KTYGALLNCYVRQRQ DKSLSH QKMKE+GFA+S LTYND
Sbjct: 121 RGFLSAESYFNSLQNHDKTDKTYGALLNCYVRQRQTDKSLSHLQKMKEMGFASSPLTYND 180

Query: 203 IMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQP 262
           IMCLYT VGQHEKVP+VL EMK++NV PDNFSYRICI+SYGA+ DI+GMENVLKEME QP
Sbjct: 181 IMCLYTNVGQHEKVPDVLTEMKQSNVYPDNFSYRICINSYGAQSDIQGMENVLKEMERQP 240

Query: 263 HIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEK 322
           HIVMDWNTYAV ANF+IKA L DKA++ALKKSEE+L    D +G+N LISLYA+LGNK +
Sbjct: 241 HIVMDWNTYAVAANFYIKAGLPDKAIEALKKSEERL-DKRDGLGYNHLISLYASLGNKAE 300

Query: 323 VLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVII 382
           VLRLW+L+K+A  R INRDYITMLESLVRLGELEEAEKVL+EWESSGNCYDFRVPNIVII
Sbjct: 301 VLRLWSLEKSACKRNINRDYITMLESLVRLGELEEAEKVLREWESSGNCYDFRVPNIVII 360

Query: 383 GYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNI-G 442
           GY +KG+ E+ E +L+ L +  K TTPNSWG +A  Y+D GE EKA+ECM  A+S+++  
Sbjct: 361 GYSEKGLFEKAEAMLKELMEKGKITTPNSWGTVASGYMDEGEMEKAVECMKAAISLHVNN 420

Query: 443 KGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKE 502
           KG KPN RVI G+L+WLGDKG VE+VEAFV +LR V P+NR MYH L+   IRA KEV  
Sbjct: 421 KGRKPNSRVIAGILSWLGDKGRVEDVEAFVGSLRIVIPMNRRMYHTLIMANIRAGKEVDG 480

Query: 503 VLNNMKADKIDEDEETKKILGT 513
           +L +MKADKI EDEETKKILGT
Sbjct: 481 LLASMKADKIVEDEETKKILGT 501

BLAST of Csa1G073780 vs. NCBI nr
Match: gi|595884470|ref|XP_007212982.1| (hypothetical protein PRUPE_ppa022486mg [Prunus persica])

HSP 1 Score: 689.5 bits (1778), Expect = 4.5e-195
Identity = 337/489 (68.92%), Postives = 401/489 (82.00%), Query Frame = 1

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MD KLF+K L R A+ SRSY+T+R KK TLY KISPLG+P ++V  ELD WV +G K+RV
Sbjct: 1   MDPKLFAKTLIRSAMTSRSYYTSRTKKPTLYTKISPLGNPSLNVVPELDDWVYKGHKVRV 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRII D RKR RFSQAL++SEWMK+ G CIFSP EHAVQLDLIG+VRG +SAE YFN
Sbjct: 61  AELQRIIHDLRKRKRFSQALQISEWMKQKGICIFSPVEHAVQLDLIGKVRGLVSAEEYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
            L+E+D+ +KTYGALLNCYVRQ Q DKSL+H +KMKE+GFA+S LTYNDIMCLYT VG+H
Sbjct: 121 NLREEDKNLKTYGALLNCYVRQLQTDKSLAHLRKMKEMGFASSPLTYNDIMCLYTNVGEH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           EKVP VL EMKENNV PDNFSYRICI+SYG R D+EGME VL+EMESQPHIVMDWNTYAV
Sbjct: 181 EKVPGVLTEMKENNVPPDNFSYRICINSYGVRSDLEGMEKVLEEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANF+IK   T KA++ALKKSEE+L  + D +GHN LISLYA++GNK++VLRLW L+K+A
Sbjct: 241 VANFYIKEGQTHKAINALKKSEERL-DNKDGLGHNHLISLYASMGNKDEVLRLWGLEKSA 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
             R INRDYI +L SLVRLGEL+EAEKV+KEWE SGNCYDFRVP  VIIGY  KG+ ER 
Sbjct: 301 CKRCINRDYIGLLISLVRLGELDEAEKVVKEWELSGNCYDFRVPQTVIIGYTVKGLYERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           E +L +L +  KATTP SW  +A  Y++ GETEKA +CM  AL ++  KGWKPNLRV T 
Sbjct: 361 EAMLGDLMEKGKATTPKSWEIVAAGYVNKGETEKAFQCMKAALCLSAEKGWKPNLRVSTT 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +L+WLGDKG VE+ EAFV  LR+V PVN++MYHALLK Y+R  KEV  VL+ MKADK+++
Sbjct: 421 ILSWLGDKGSVEDAEAFVGLLRNVIPVNKQMYHALLKAYMRGGKEVNSVLDRMKADKVED 480

Query: 503 DE-ETKKIL 511
           D+ ETKK+L
Sbjct: 481 DDIETKKVL 488

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP334_ARATH8.8e-15958.37Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop... [more]
PP166_ARATH9.3e-7635.51Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop... [more]
PPR3_ARATH1.3e-7234.00Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN... [more]
PPR86_ARATH1.0e-6931.06Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN... [more]
PP302_ARATH3.2e-6834.17Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LXD9_CUCSA2.7e-292100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073780 PE=4 SV=1[more]
F6HRW8_VITVI2.8e-19669.72Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0597g00040 PE=4 SV=... [more]
M5WLN5_PRUPE3.1e-19568.92Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022486mg PE=4 SV=1[more]
A5C3G3_VITVI1.0e-19368.92Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_006333 PE=4 SV=1[more]
A0A0D2VHH2_GOSRA4.7e-19167.91Uncharacterized protein OS=Gossypium raimondii GN=B456_011G027300 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G21705.15.0e-16058.37 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G20710.15.2e-7735.51 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G02150.17.1e-7434.00 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G60770.15.6e-7131.06 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G02820.11.8e-6934.17 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|700209575|gb|KGN64671.1|3.9e-292100.00hypothetical protein Csa_1G073780 [Cucumis sativus][more]
gi|778658722|ref|XP_011653147.1|2.0e-280100.00PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial ... [more]
gi|659068038|ref|XP_008442422.1|1.2e-25992.86PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial ... [more]
gi|359483464|ref|XP_003632962.1|4.0e-19669.72PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial ... [more]
gi|595884470|ref|XP_007212982.1|4.5e-19568.92hypothetical protein PRUPE_ppa022486mg [Prunus persica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0009451 RNA modification
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
CU159301cucumber EST collection version 3.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa1G073780.1Csa1G073780.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
CU159301CU159301transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 153..182
score: 4.2E-5coord: 331..358
score: 0.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 187..231
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 331..361
score: 3.9E-4coord: 153..182
score: 2.6E-5coord: 188..220
score: 3.6E-6coord: 222..256
score: 0.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 150..184
score: 10.106coord: 362..392
score: 5.59coord: 220..254
score: 7.224coord: 81..115
score: 5.864coord: 327..361
score: 7.322coord: 397..431
score: 5.02coord: 256..286
score: 5.251coord: 185..219
score: 10.326coord: 292..326
score: 5
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 331..427
score: 8.9E-5coord: 137..221
score: 6.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 143..228
score: 1.98E-5coord: 396..428
score: 1.98E-5coord: 260..294
score: 1.98E-5coord: 334..427
score: 1.5
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 37..509
score: 6.0E
NoneNo IPR availablePANTHERPTHR24015:SF845SUBFAMILY NOT NAMEDcoord: 37..509
score: 6.0E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Csa1G073780Csa3G104890Cucumber (Chinese Long) v2cucuB015
Csa1G073780Csa7G047430Cucumber (Chinese Long) v2cucuB045