Csa3G730820 (gene) Cucumber (Chinese Long) v2

NameCsa3G730820
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionPentatricopeptide repeat protein; contains IPR002885 (Pentatricopeptide repeat), IPR011990 (Tetratricopeptide-like helical)
LocationChr3 : 27361093 .. 27363159 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACCATTCTCACCAGTCCAACTTCTCCCGTCTTCTCCAAGGCATTAGAGATCAAGAATTACCTCTCCAATGGCCTCAACTTCTTCAACCAGCTCAAGCATATCCATGCCCGCCTTCTCCGCCTTCACCTCGACCAAGATAACTACCTTCTCAACCTGATTCTTTGTTGTGCCTTAGATTTTGGTTCCACAAATTACTCTAAACTCGTATTTTCTCAAGTTAAAGAGCCCAACATTTTCCTCTGGAATACCATGATCCGTGGCTTGGTTTCCAAGGATTGTTTCGATGACGCTATCCATCTTTATGGGTCGATGCGTGGAGGGGGATTCTTACCCAACAATTTCACAATCCCTTTTGTTTTGAAAGCTTGTGCTAGGAAATTGGATGTTCGATTGGGATTGAAGATTCACTCCCTCTTGGTGAAAGCAGGTTATGATCATGATGTGTTTGTGAAGACCAGTTTGCTTTCTCTCTATGTTAAGTGTGATAATTTTGACGATGCACTCAAGGTGTTTGATGATATTCCTGACAAAAATGTGGTCTCTTGGACTGCTATAATCACTGGATATATAAGCTCTGGCCACTTTAGAGAAGCTATTGGTGCATTTAAAAAACTACTTGAAATGGGATTAAAGCCCGACAGCTTCTCTCTAGTCAAGGTCCTGGCTGCCTGTGCTAGGCTTGGTGATTGTACAAGTGGCGAGTGGATTGATAGATATATAAGTGATAGTGGTATGGGGAGGAATGTTTTTGTGGCTACTTCTTTGTTGGATATGTATGTCAAGTGTGGAAACTTGGAGCGAGCAAATCTTATCTTTAGTGCCATGCCAGAGAAAGATATTGTTTCTTGGAGTACAATGATTCAAGGCTATGCATTCAATGGATTGCCTCAACAAGCGCTAGATCTTTTCTTTCAAATGCAGTCTGAGAATTTGAAGCCTGATTGTTATACGATGGTTGGTGTTCTGTCTGCTTGTGCAACATTGGGAGCTTTAGATTTAGGCATCTGGGCTAGCAGCCTGATGGATAGAAATGAGTTCTTGTCAAATCCCGTCCTCGGTACAGCTTTGATTGACATGTACTCCAAATGTGGTAGCGTCACTCAAGCTTGGGAAATCTTCACAGCGATGAAAAGGAAAGACCGTGTAGTTTGGAATGCCATGATGGTGGGTCTCTCCATGAACGGGCATGCCAAAGCTGTATTTTCATTGTTCAGTCTTGTGGAGAAACATGGAATTCGGCCTGATGAAAACACCTTTATTGGCCTGCTCTGTGGGTGCACTCATGGCGGTTTTGTCAATGAGGGGCGTCAATTTTTCAATAACATGAAGCGAGTATTCTCATTAACCCCTTCCATCGAGCATTACGGATGTATGGTGGATTTGCTTGGGCGTGCAGGGTTATTAAATGAGGCTCATCAGTTGATAAACAACATGCCAATGAAGCCTAATGCGGTTGTTTGGGGTGCATTGTTGGGTGGATGTAAATTGCACAAAGATACCCACTTGGCTGAGCAAGTACTGAAAAAGCTTATTGAATTAGAGCCATGGAACTCAGGAAACTATGTTCAGTTATCAAATATTTACTCTGGAAATCATCGATGGGAGGAAGCCGAAAAGATACGGTCAACAATGAAGGAACAGCAGATTCAGAAGATCCGTGCCTGCAGTTGGATTGAGATAGATGGAATTGTTCACGAGTTTCTGGTAGGTGACAAATCACATTGGTTATCGGAGAAAATATATGCAAAACTTGATGAATTAGGTAGAGAATTGAAAGCAGTTGGTCATGTACCAACTACAGAGTTTGTTCTTTTCGACATAGAAGAGGAGGAGAAGGAACATTTCCTTGGTTACCACAGTGAGAAGCTAGCTGTGGCTTTTGGTTTGATAGCCTCTCCTCCAAATCATGTTATTCGCGTTGTTAAAAACCTTCGCGTATGTGGTGATTGCCACGATGCCATAAAGCTCATTTCTAAGATCACTAAAAGAGAGATTATCATAAGGGATACCAATCGGTTCCATACATTTATTGACGGCTCTTGTTCTTGTAGAGACTATTGGTGA

mRNA sequence

ATGACCATTCTCACCAGTCCAACTTCTCCCGTCTTCTCCAAGGCATTAGAGATCAAGAATTACCTCTCCAATGGCCTCAACTTCTTCAACCAGCTCAAGCATATCCATGCCCGCCTTCTCCGCCTTCACCTCGACCAAGATAACTACCTTCTCAACCTGATTCTTTGTTGTGCCTTAGATTTTGGTTCCACAAATTACTCTAAACTCGTATTTTCTCAAGTTAAAGAGCCCAACATTTTCCTCTGGAATACCATGATCCGTGGCTTGGTTTCCAAGGATTGTTTCGATGACGCTATCCATCTTTATGGGTCGATGCGTGGAGGGGGATTCTTACCCAACAATTTCACAATCCCTTTTGTTTTGAAAGCTTGTGCTAGGAAATTGGATGTTCGATTGGGATTGAAGATTCACTCCCTCTTGGTGAAAGCAGGTTATGATCATGATGTGTTTGTGAAGACCAGTTTGCTTTCTCTCTATGTTAAGTGTGATAATTTTGACGATGCACTCAAGGTGTTTGATGATATTCCTGACAAAAATGTGGTCTCTTGGACTGCTATAATCACTGGATATATAAGCTCTGGCCACTTTAGAGAAGCTATTGGTGCATTTAAAAAACTACTTGAAATGGGATTAAAGCCCGACAGCTTCTCTCTAGTCAAGGTCCTGGCTGCCTGTGCTAGGCTTGGTGATTGTACAAGTGGCGAGTGGATTGATAGATATATAAGTGATAGTGGTATGGGGAGGAATGTTTTTGTGGCTACTTCTTTGTTGGATATGTATGTCAAGTGTGGAAACTTGGAGCGAGCAAATCTTATCTTTAGTGCCATGCCAGAGAAAGATATTGTTTCTTGGAGTACAATGATTCAAGGCTATGCATTCAATGGATTGCCTCAACAAGCGCTAGATCTTTTCTTTCAAATGCAGTCTGAGAATTTGAAGCCTGATTGTTATACGATGGTTGGTGTTCTGTCTGCTTGTGCAACATTGGGAGCTTTAGATTTAGGCATCTGGGCTAGCAGCCTGATGGATAGAAATGAGTTCTTGTCAAATCCCGTCCTCGGTACAGCTTTGATTGACATGTACTCCAAATGTGGTAGCGTCACTCAAGCTTGGGAAATCTTCACAGCGATGAAAAGGAAAGACCGTGTAGTTTGGAATGCCATGATGGTGGGTCTCTCCATGAACGGGCATGCCAAAGCTCGAGTATTCTCATTAACCCCTTCCATCGAGCATTACGGATGTATGGTGGATTTGCTTGGGCGTGCAGGGTTATTAAATGAGGCTCATCAGTTGATAAACAACATGCCAATGAAGCCTAATGCGGTTGTTTGGGGTGCATTGTTGGGTGGATGTAAATTGCACAAAGATACCCACTTGGCTGAGCAAGTACTGAAAAAGCTTATTGAATTAGAGCCATGGAACTCAGGAAACTATGTTCAGTTATCAAATATTTACTCTGGAAATCATCGATGGGAGGAAGCCGAAAAGATACGGTCAACAATGAAGGAACAGCAGATTCAGAAGATCCGTGCCTGCAGTTGGATTGAGATAGATGGAATTGTTCACGAGTTTCTGGTAGGTGACAAATCACATTGGTTATCGGAGAAAATATATGCAAAACTTGATGAATTAGGTAGAGAATTGAAAGCAGTTGGTCATGTACCAACTACAGAGTTTGTTCTTTTCGACATAGAAGAGGAGGAGAAGGAACATTTCCTTGGTTACCACAGTGAGAAGCTAGCTGTGGCTTTTGGTTTGATAGCCTCTCCTCCAAATCATGTTATTCGCGTTGTTAAAAACCTTCGCGTATGTGGTGATTGCCACGATGCCATAAAGCTCATTTCTAAGATCACTAAAAGAGAGATTATCATAAGGGATACCAATCGGTTCCATACATTTATTGACGGCTCTTGTTCTTGTAGAGACTATTGGTGA

Coding sequence (CDS)

ATGACCATTCTCACCAGTCCAACTTCTCCCGTCTTCTCCAAGGCATTAGAGATCAAGAATTACCTCTCCAATGGCCTCAACTTCTTCAACCAGCTCAAGCATATCCATGCCCGCCTTCTCCGCCTTCACCTCGACCAAGATAACTACCTTCTCAACCTGATTCTTTGTTGTGCCTTAGATTTTGGTTCCACAAATTACTCTAAACTCGTATTTTCTCAAGTTAAAGAGCCCAACATTTTCCTCTGGAATACCATGATCCGTGGCTTGGTTTCCAAGGATTGTTTCGATGACGCTATCCATCTTTATGGGTCGATGCGTGGAGGGGGATTCTTACCCAACAATTTCACAATCCCTTTTGTTTTGAAAGCTTGTGCTAGGAAATTGGATGTTCGATTGGGATTGAAGATTCACTCCCTCTTGGTGAAAGCAGGTTATGATCATGATGTGTTTGTGAAGACCAGTTTGCTTTCTCTCTATGTTAAGTGTGATAATTTTGACGATGCACTCAAGGTGTTTGATGATATTCCTGACAAAAATGTGGTCTCTTGGACTGCTATAATCACTGGATATATAAGCTCTGGCCACTTTAGAGAAGCTATTGGTGCATTTAAAAAACTACTTGAAATGGGATTAAAGCCCGACAGCTTCTCTCTAGTCAAGGTCCTGGCTGCCTGTGCTAGGCTTGGTGATTGTACAAGTGGCGAGTGGATTGATAGATATATAAGTGATAGTGGTATGGGGAGGAATGTTTTTGTGGCTACTTCTTTGTTGGATATGTATGTCAAGTGTGGAAACTTGGAGCGAGCAAATCTTATCTTTAGTGCCATGCCAGAGAAAGATATTGTTTCTTGGAGTACAATGATTCAAGGCTATGCATTCAATGGATTGCCTCAACAAGCGCTAGATCTTTTCTTTCAAATGCAGTCTGAGAATTTGAAGCCTGATTGTTATACGATGGTTGGTGTTCTGTCTGCTTGTGCAACATTGGGAGCTTTAGATTTAGGCATCTGGGCTAGCAGCCTGATGGATAGAAATGAGTTCTTGTCAAATCCCGTCCTCGGTACAGCTTTGATTGACATGTACTCCAAATGTGGTAGCGTCACTCAAGCTTGGGAAATCTTCACAGCGATGAAAAGGAAAGACCGTGTAGTTTGGAATGCCATGATGGTGGGTCTCTCCATGAACGGGCATGCCAAAGCTCGAGTATTCTCATTAACCCCTTCCATCGAGCATTACGGATGTATGGTGGATTTGCTTGGGCGTGCAGGGTTATTAAATGAGGCTCATCAGTTGATAAACAACATGCCAATGAAGCCTAATGCGGTTGTTTGGGGTGCATTGTTGGGTGGATGTAAATTGCACAAAGATACCCACTTGGCTGAGCAAGTACTGAAAAAGCTTATTGAATTAGAGCCATGGAACTCAGGAAACTATGTTCAGTTATCAAATATTTACTCTGGAAATCATCGATGGGAGGAAGCCGAAAAGATACGGTCAACAATGAAGGAACAGCAGATTCAGAAGATCCGTGCCTGCAGTTGGATTGAGATAGATGGAATTGTTCACGAGTTTCTGGTAGGTGACAAATCACATTGGTTATCGGAGAAAATATATGCAAAACTTGATGAATTAGGTAGAGAATTGAAAGCAGTTGGTCATGTACCAACTACAGAGTTTGTTCTTTTCGACATAGAAGAGGAGGAGAAGGAACATTTCCTTGGTTACCACAGTGAGAAGCTAGCTGTGGCTTTTGGTTTGATAGCCTCTCCTCCAAATCATGTTATTCGCGTTGTTAAAAACCTTCGCGTATGTGGTGATTGCCACGATGCCATAAAGCTCATTTCTAAGATCACTAAAAGAGAGATTATCATAAGGGATACCAATCGGTTCCATACATTTATTGACGGCTCTTGTTCTTGTAGAGACTATTGGTGA

Protein sequence

MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALDFGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFVLKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNVVSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRYISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQALDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW*
BLAST of Csa3G730820 vs. Swiss-Prot
Match: PP219_ARATH (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 1.2e-208
Identity = 364/688 (52.91%), Postives = 479/688 (69.62%), Query Frame = 1

Query: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60
           M+I+T P++   SK  +IK  +S      N LK IH  L+  HL  D +L+NL+L   L 
Sbjct: 1   MSIVTVPSAT--SKVQQIKTLISVACTV-NHLKQIHVSLINHHLHHDTFLVNLLLKRTLF 60

Query: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120
           F  T YS L+FS  + PNIFL+N++I G V+   F + + L+ S+R  G   + FT P V
Sbjct: 61  FRQTKYSYLLFSHTQFPNIFLYNSLINGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLV 120

Query: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180
           LKAC R    +LG+ +HSL+VK G++HDV   TSLLS+Y      +DA K+FD+IPD++V
Sbjct: 121 LKACTRASSRKLGIDLHSLVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSV 180

Query: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240
           V+WTA+ +GY +SG  REAI  FKK++EMG+KPDS+ +V+VL+AC  +GD  SGEWI +Y
Sbjct: 181 VTWTALFSGYTTSGRHREAIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKY 240

Query: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300
           + +  M +N FV T+L+++Y KCG +E+A  +F +M EKDIV+WSTMIQGYA N  P++ 
Sbjct: 241 MEEMEMQKNSFVRTTLVNLYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEG 300

Query: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360
           ++LF QM  ENLKPD +++VG LS+CA+LGALDLG W  SL+DR+EFL+N  +  ALIDM
Sbjct: 301 IELFLQMLQENLKPDQFSIVGFLSSCASLGALDLGEWGISLIDRHEFLTNLFMANALIDM 360

Query: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKA--RVFSLTPSIE-------- 420
           Y+KCG++ + +E+F  MK KD V+ NA + GL+ NGH K    VF  T  +         
Sbjct: 361 YAKCGAMARGFEVFKEMKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTF 420

Query: 421 ----------------------------------HYGCMVDLLGRAGLLNEAHQLINNMP 480
                                             HYGCMVDL GRAG+L++A++LI +MP
Sbjct: 421 LGLLCGCVHAGLIQDGLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMP 480

Query: 481 MKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEK 540
           M+PNA+VWGALL GC+L KDT LAE VLK+LI LEPWN+GNYVQLSNIYS   RW+EA +
Sbjct: 481 MRPNAIVWGALLSGCRLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAE 540

Query: 541 IRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPT 600
           +R  M ++ ++KI   SWIE++G VHEFL  DKSH LS+KIYAKL++LG E++ +G VPT
Sbjct: 541 VRDMMNKKGMKKIPGYSWIELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLMGFVPT 600

Query: 601 TEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISK 645
           TEFV FD+EEEEKE  LGYHSEKLAVA GLI++    VIRVVKNLRVCGDCH+ +KLISK
Sbjct: 601 TEFVFFDVEEEEKERVLGYHSEKLAVALGLISTDHGQVIRVVKNLRVCGDCHEVMKLISK 660

BLAST of Csa3G730820 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 502.7 bits (1293), Expect = 6.1e-141
Identity = 276/693 (39.83%), Postives = 397/693 (57.29%), Query Frame = 1

Query: 32  LKHIHARLLRLHLDQDNYLLN-LILCCALD--FGSTNYSKLVFSQVKEPNIFLWNTMIRG 91
           L+ IHA+++++ L   NY L+ LI  C L   F    Y+  VF  ++EPN+ +WNTM RG
Sbjct: 49  LRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRG 108

Query: 92  LVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFVLKACARKLDVRLGLKIHSLLVKAGYDHD 151
                    A+ LY  M   G LPN++T PFVLK+CA+    + G +IH  ++K G D D
Sbjct: 109 HALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLD 168

Query: 152 VFVKTSLLSL-------------------------------YVKCDNFDDALKVFDDIPD 211
           ++V TSL+S+                               Y      ++A K+FD+IP 
Sbjct: 169 LYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPV 228

Query: 212 KNVVSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWI 271
           K+VVSW A+I+GY  +G+++EA+  FK +++  ++PD  ++V V++ACA+ G    G  +
Sbjct: 229 KDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQV 288

Query: 272 DRYISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLP 331
             +I D G G N+ +  +L+D+Y KCG LE A  +F  +P KD++SW+T+I GY    L 
Sbjct: 289 HLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY 348

Query: 332 QQALDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDR--NEFLSNPVLGT 391
           ++AL LF +M      P+  TM+ +L ACA LGA+D+G W    +D+      +   L T
Sbjct: 349 KEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRT 408

Query: 392 ALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKAR--VFS--------- 451
           +LIDMY+KCG +  A ++F ++  K    WNAM+ G +M+G A A   +FS         
Sbjct: 409 SLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQP 468

Query: 452 -------LTPSIEH--------------------------YGCMVDLLGRAGLLNEAHQL 511
                  L  +  H                          YGCM+DLLG +GL  EA ++
Sbjct: 469 DDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEM 528

Query: 512 INNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRW 571
           IN M M+P+ V+W +LL  CK+H +  L E   + LI++EP N G+YV LSNIY+   RW
Sbjct: 529 INMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRW 588

Query: 572 EEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAV 631
            E  K R+ + ++ ++K+  CS IEID +VHEF++GDK H  + +IY  L+E+   L+  
Sbjct: 589 NEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKA 648

Query: 632 GHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAI 645
           G VP T  VL ++EEE KE  L +HSEKLA+AFGLI++ P   + +VKNLRVC +CH+A 
Sbjct: 649 GFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEAT 708

BLAST of Csa3G730820 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 473.4 bits (1217), Expect = 4.0e-132
Identity = 245/660 (37.12%), Postives = 378/660 (57.27%), Query Frame = 1

Query: 31  QLKHIHARLLRLHLDQDNYLLNLILCCALDFGSTNYSKLVFSQVKEPNIFLWNTMIRGLV 90
           QLK IHARLL L L    +L+  ++  +  FG   +++ VF  +  P IF WN +IRG  
Sbjct: 36  QLKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYS 95

Query: 91  SKDCFDDAIHLYGSMRGGGFLPNNFTIPFVLKACARKLDVRLGLKIHSLLVKAGYDHDVF 150
             + F DA+ +Y +M+     P++FT P +LKAC+    +++G  +H+ + + G+D DVF
Sbjct: 96  RNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVF 155

Query: 151 VKTSLLSLYVKCDNFDDALKVFDDIP--DKNVVSWTAIITGYISSGHFREAIGAFKKLLE 210
           V+  L++LY KC     A  VF+ +P  ++ +VSWTAI++ Y  +G   EA+  F ++ +
Sbjct: 156 VQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRK 215

Query: 211 MGLKPDSFSLVKVLAACARLGDCTSGEWIDRYISDSGMGRNVFVATSLLDMYVKCGNLER 270
           M +KPD  +LV VL A   L D   G  I   +   G+     +  SL  MY KCG +  
Sbjct: 216 MDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVAT 275

Query: 271 ANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQALDLFFQMQSENLKPDCYTMVGVLSACAT 330
           A ++F  M   +++ W+ MI GYA NG  ++A+D+F +M +++++PD  ++   +SACA 
Sbjct: 276 AKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQ 335

Query: 331 LGALDLGIWASSLMDRNEFLSNPVLGTALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAM 390
           +G+L+        + R+++  +  + +ALIDM++KCGSV  A  +F     +D VVW+AM
Sbjct: 336 VGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAM 395

Query: 391 MVGLSMNGHAKARVFSLTPSIE-------------------------------------- 450
           +VG  ++G A+  + SL  ++E                                      
Sbjct: 396 IVGYGLHGRAREAI-SLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMADHK 455

Query: 451 ------HYGCMVDLLGRAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVL 510
                 HY C++DLLGRAG L++A+++I  MP++P   VWGALL  CK H+   L E   
Sbjct: 456 INPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAA 515

Query: 511 KKLIELEPWNSGNYVQLSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEF 570
           ++L  ++P N+G+YVQLSN+Y+    W+   ++R  MKE+ + K   CSW+E+ G +  F
Sbjct: 516 QQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAF 575

Query: 571 LVGDKSHWLSEKIYAKLDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAF 630
            VGDKSH   E+I  +++ +   LK  G V   +  L D+ +EE E  L  HSE++A+A+
Sbjct: 576 RVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLCSHSERIAIAY 635

Query: 631 GLIASPPNHVIRVVKNLRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW 645
           GLI++P    +R+ KNLR C +CH A KLISK+  REI++RDTNRFH F DG CSC DYW
Sbjct: 636 GLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of Csa3G730820 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 444.9 bits (1143), Expect = 1.5e-123
Identity = 240/657 (36.53%), Postives = 372/657 (56.62%), Query Frame = 1

Query: 33  KHIHARLLRLHLDQDNYLLNLILCCALDFGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSK 92
           + +H  +L+    + N + N ++   L     + ++ VF ++ E ++  WN++I G VS 
Sbjct: 215 EQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSN 274

Query: 93  DCFDDAIHLYGSMRGGGFLPNNFTIPFVLKACARKLDVRLGLKIHSLLVKAGYDHDVFVK 152
              +  + ++  M   G   +  TI  V   CA    + LG  +HS+ VKA +  +    
Sbjct: 275 GLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFC 334

Query: 153 TSLLSLYVKCDNFDDALKVFDDIPDKNVVSWTAIITGYISSGHFREAIGAFKKLLEMGLK 212
            +LL +Y KC + D A  VF ++ D++VVS+T++I GY   G   EA+  F+++ E G+ 
Sbjct: 335 NTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGIS 394

Query: 213 PDSFSLVKVLAACARLGDCTSGEWIDRYISDSGMGRNVFVATSLLDMYVKCGNLERANLI 272
           PD +++  VL  CAR      G+ +  +I ++ +G ++FV+ +L+DMY KCG+++ A L+
Sbjct: 395 PDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 454

Query: 273 FSAMPEKDIVSWSTMIQGYAFNGLPQQALDLF-FQMQSENLKPDCYTMVGVLSACATLGA 332
           FS M  KDI+SW+T+I GY+ N    +AL LF   ++ +   PD  T+  VL ACA+L A
Sbjct: 455 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 514

Query: 333 LDLGIWASSLMDRNEFLSNPVLGTALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAMMVG 392
            D G      + RN + S+  +  +L+DMY+KCG++  A  +F  +  KD V W  M+ G
Sbjct: 515 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 574

Query: 393 LSMNGHAKARV------------------FSLTPSIEH---------------------- 452
             M+G  K  +                   SL  +  H                      
Sbjct: 575 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 634

Query: 453 ----YGCMVDLLGRAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKL 512
               Y C+VD+L R G L +A++ I NMP+ P+A +WGALL GC++H D  LAE+V +K+
Sbjct: 635 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKV 694

Query: 513 IELEPWNSGNYVQLSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVG 572
            ELEP N+G YV ++NIY+   +WE+ +++R  + ++ ++K   CSWIEI G V+ F+ G
Sbjct: 695 FELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAG 754

Query: 573 DKSHWLSEKIYAKLDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLI 632
           D S+  +E I A L ++   +   G+ P T++ L D EE EKE  L  HSEKLA+A G+I
Sbjct: 755 DSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGII 814

Query: 633 ASPPNHVIRVVKNLRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW 645
           +S    +IRV KNLRVCGDCH+  K +SK+T+REI++RD+NRFH F DG CSCR +W
Sbjct: 815 SSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871


HSP 2 Score: 203.8 bits (517), Expect = 5.9e-51
Identity = 110/337 (32.64%), Postives = 180/337 (53.41%), Query Frame = 1

Query: 62  GSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFVL 121
           G    +  VF +VK      WN ++  L     F  +I L+  M   G   +++T   V 
Sbjct: 143 GDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVS 202

Query: 122 KACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNVV 181
           K+ +    V  G ++H  ++K+G+     V  SL++ Y+K    D A KVFD++ +++V+
Sbjct: 203 KSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVI 262

Query: 182 SWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRYI 241
           SW +II GY+S+G   + +  F ++L  G++ D  ++V V A CA     + G  +    
Sbjct: 263 SWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIG 322

Query: 242 SDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQAL 301
             +   R      +LLDMY KCG+L+ A  +F  M ++ +VS+++MI GYA  GL  +A+
Sbjct: 323 VKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAV 382

Query: 302 DLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDMY 361
            LF +M+ E + PD YT+  VL+ CA    LD G      +  N+   +  +  AL+DMY
Sbjct: 383 KLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMY 442

Query: 362 SKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHA 399
           +KCGS+ +A  +F+ M+ KD + WN ++ G S N +A
Sbjct: 443 AKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYA 479


HSP 3 Score: 151.8 bits (382), Expect = 2.6e-35
Identity = 90/311 (28.94%), Postives = 152/311 (48.87%), Query Frame = 1

Query: 116 TIPFVLKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDI 175
           T+  VL+ CA    ++ G ++ + +   G+  D  + + L  +Y  C +  +A +VFD++
Sbjct: 96  TLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEV 155

Query: 176 PDKNVVSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGE 235
             +  + W  ++     SG F  +IG FKK++  G++ DS++   V  + + L     GE
Sbjct: 156 KIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGE 215

Query: 236 WIDRYISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNG 295
            +  +I  SG G    V  SL+  Y+K   ++ A  +F  M E+D++SW+++I GY  NG
Sbjct: 216 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 275

Query: 296 LPQQALDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGT 355
           L ++ L +F QM    ++ D  T+V V + CA    + LG    S+  +  F        
Sbjct: 276 LAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCN 335

Query: 356 ALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCM 415
            L+DMYSKCG +  A  +F  M  +  V + +M+ G +  G A   V  L   +E  G  
Sbjct: 336 TLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAV-KLFEEMEEEGIS 395

Query: 416 VDLLGRAGLLN 427
            D+     +LN
Sbjct: 396 PDVYTVTAVLN 405

BLAST of Csa3G730820 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 418.3 bits (1074), Expect = 1.5e-115
Identity = 233/626 (37.22%), Postives = 353/626 (56.39%), Query Frame = 1

Query: 25  GLNFFNQLKHIHARLLRLHLD-QDNYLLNLILCCALDFGST---NYSKLVFSQVKEP-NI 84
           G++   +L+ IHA  +R  +   D  L   ++   +   S    +Y+  VFS++++P N+
Sbjct: 26  GVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINV 85

Query: 85  FLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFL-PNNFTIPFVLKACARKLDVRLGLKIHS 144
           F+WNT+IRG         A  LY  MR  G + P+  T PF++KA     DVRLG  IHS
Sbjct: 86  FIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHS 145

Query: 145 LLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNVVSWTAIITGYISSGHFRE 204
           +++++G+   ++V+ SLL LY  C +   A KVFD +P+K++V+W ++I G+  +G   E
Sbjct: 146 VVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEE 205

Query: 205 AIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRYISDSGMGRNVFVATSLLD 264
           A+  + ++   G+KPD F++V +L+ACA++G  T G+ +  Y+   G+ RN+  +  LLD
Sbjct: 206 ALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLD 265

Query: 265 MYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQALDLFFQMQSENLKPDCYT 324
           +Y +CG +E A                                 LF +M  +N       
Sbjct: 266 LYARCGRVEEAKT-------------------------------LFDEMVDKNSVSWTSL 325

Query: 325 MVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDMYSKCGSVTQAWEIFTAMK 384
           +VG+        A++L  +  S       L   +    ++   S CG V + +E F  M+
Sbjct: 326 IVGLAVNGFGKEAIELFKYMEST---EGLLPCEITFVGILYACSHCGMVKEGFEYFRRMR 385

Query: 385 RKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINNMPMK 444
            +                      + + P IEH+GCMVDLL RAG + +A++ I +MPM+
Sbjct: 386 EE----------------------YKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQ 445

Query: 445 PNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEKIR 504
           PN V+W  LLG C +H D+ LAE    ++++LEP +SG+YV LSN+Y+   RW + +KIR
Sbjct: 446 PNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIR 505

Query: 505 STMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPTTE 564
             M    ++K+   S +E+   VHEFL+GDKSH  S+ IYAKL E+   L++ G+VP   
Sbjct: 506 KQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQIS 565

Query: 565 FVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISKIT 624
            V  D+EEEEKE+ + YHSEK+A+AF LI++P    I VVKNLRVC DCH AIKL+SK+ 
Sbjct: 566 NVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVY 595

Query: 625 KREIIIRDTNRFHTFIDGSCSCRDYW 645
            REI++RD +RFH F +GSCSC+DYW
Sbjct: 626 NREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Csa3G730820 vs. TrEMBL
Match: A0A0A0LA44_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G730820 PE=4 SV=1)

HSP 1 Score: 1314.3 bits (3400), Expect = 0.0e+00
Identity = 644/644 (100.00%), Postives = 644/644 (100.00%), Query Frame = 1

Query: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60
           MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD
Sbjct: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60

Query: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120
           FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV
Sbjct: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120

Query: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180
           LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV
Sbjct: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180

Query: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240
           VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY
Sbjct: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240

Query: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300
           ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA
Sbjct: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300

Query: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360
           LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM
Sbjct: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360

Query: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLG 420
           YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLG
Sbjct: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLG 420

Query: 421 RAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQ 480
           RAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQ
Sbjct: 421 RAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQ 480

Query: 481 LSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAK 540
           LSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAK
Sbjct: 481 LSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAK 540

Query: 541 LDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKN 600
           LDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKN
Sbjct: 541 LDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKN 600

Query: 601 LRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW 645
           LRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW
Sbjct: 601 LRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW 644

BLAST of Csa3G730820 vs. TrEMBL
Match: F6GY00_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0058g00760 PE=4 SV=1)

HSP 1 Score: 924.1 bits (2387), Expect = 9.4e-266
Identity = 452/687 (65.79%), Postives = 527/687 (76.71%), Query Frame = 1

Query: 3   ILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALDFG 62
           +L+ PTSP  SK LEIK  +  G N F  LKH+HA LLR  L  DNYLLN+IL C+ DF 
Sbjct: 1   MLSRPTSPPISKGLEIKKLILQGFNSFKHLKHLHAHLLRFGLCHDNYLLNMILRCSFDFS 60

Query: 63  STNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFVLK 122
            TNY++ +F Q+K+PNIFLWNTMIRGLVS DCFDDAI  YG MR  GFLPNNFT PFVLK
Sbjct: 61  DTNYTRFLFHQIKQPNIFLWNTMIRGLVSNDCFDDAIEFYGLMRSEGFLPNNFTFPFVLK 120

Query: 123 ACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNVVS 182
           ACAR LD++LG+KIH+L+VK G+D DVFVKTSL+ LY KC   +DA KVFDDIPDKNVVS
Sbjct: 121 ACARLLDLQLGVKIHTLVVKGGFDCDVFVKTSLVCLYAKCGYLEDAHKVFDDIPDKNVVS 180

Query: 183 WTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRYIS 242
           WTAII+GYI  G FREAI  F++LLEM L PDSF++V+VL+AC +LGD  SGEWI + I 
Sbjct: 181 WTAIISGYIGVGKFREAIDMFRRLLEMNLAPDSFTIVRVLSACTQLGDLNSGEWIHKCIM 240

Query: 243 DSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQALD 302
           + GM RNVFV TSL+DMY KCGN+E+A  +F  MPEKDIVSW  MIQGYA NGLP++A+D
Sbjct: 241 EMGMVRNVFVGTSLVDMYAKCGNMEKARSVFDGMPEKDIVSWGAMIQGYALNGLPKEAID 300

Query: 303 LFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDMYS 362
           LF QMQ EN+KPDCYT+VGVLSACA LGAL+LG W S L+DRNEFL NPVLGTALID+Y+
Sbjct: 301 LFLQMQRENVKPDCYTVVGVLSACARLGALELGEWVSGLVDRNEFLYNPVLGTALIDLYA 360

Query: 363 KCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEH----------- 422
           KCGS+++AWE+F  MK KDRVVWNA++ GL+MNG+ K   F L   +E            
Sbjct: 361 KCGSMSRAWEVFKGMKEKDRVVWNAIISGLAMNGYVKIS-FGLFGQVEKLGIKPDGNTFI 420

Query: 423 ----------------------------------YGCMVDLLGRAGLLNEAHQLINNMPM 482
                                             YGCMVDLLGRAGLL+EAHQLI NMPM
Sbjct: 421 GLLCGCTHAGLVDEGRRYFNSMYRFFSLTPSIEHYGCMVDLLGRAGLLDEAHQLIRNMPM 480

Query: 483 KPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEKI 542
           + NA+VWGALLG C++H+DT LAE  LK+LIELEPWNSGNYV LSNIYS N +W+EA K+
Sbjct: 481 EANAIVWGALLGACRIHRDTQLAELALKQLIELEPWNSGNYVLLSNIYSANLKWDEAAKV 540

Query: 543 RSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPTT 602
           R +M E++IQK   CSWIE+DGIVHEFLVGDK H LSEKIYAKLDEL +++K  G+VPTT
Sbjct: 541 RLSMNEKRIQKPPGCSWIEVDGIVHEFLVGDKYHPLSEKIYAKLDELTKKMKVAGYVPTT 600

Query: 603 EFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISKI 645
           +FVLFDIEEEEKEHFLG HSEKLA+AFGLI++ P  VIRVVKNLRVCGDCH AIKLIS I
Sbjct: 601 DFVLFDIEEEEKEHFLGCHSEKLAIAFGLISATPTAVIRVVKNLRVCGDCHMAIKLISSI 660

BLAST of Csa3G730820 vs. TrEMBL
Match: A0A061FR98_THECC (Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_043921 PE=4 SV=1)

HSP 1 Score: 912.5 bits (2357), Expect = 2.8e-262
Identity = 453/688 (65.84%), Postives = 524/688 (76.16%), Query Frame = 1

Query: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60
           ++ L S TS   SK  EI   +  G      LK +HA L RL L Q NYLLN+IL     
Sbjct: 2   LSTLNSATSSC-SKVTEITKRILGGFTSVRHLKQVHAALFRLGLHQHNYLLNIILKATFH 61

Query: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120
           FG TNY+ L+F+Q K+PNI+LWNTMI+GLVS DCF +A   Y SMR  GFLPN+FT PFV
Sbjct: 62  FGQTNYACLIFNQTKQPNIYLWNTMIQGLVSGDCFLEAAQFYASMRSQGFLPNSFTFPFV 121

Query: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180
           LKA AR LD++LG++IH+L+VK G+D D+FVKT LL LY KC   D A+KVFDDIP+KNV
Sbjct: 122 LKAYARLLDLQLGIRIHALVVKLGFDCDIFVKTGLLCLYAKCGCLDRAIKVFDDIPEKNV 181

Query: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240
           VSWTA+I+GYI  G +REA+  F KLLEMGL+PDSFSLV+VLAACA LGD  SGEWIDR 
Sbjct: 182 VSWTAMISGYIDVGRYREAVNMFSKLLEMGLRPDSFSLVRVLAACAHLGDLNSGEWIDRS 241

Query: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300
           I+  G+ R+VFVATS++DMY KCGN+E+A L F  +PEKDIV+WSTMIQGYA NGLP++A
Sbjct: 242 ITQFGLSRDVFVATSVVDMYAKCGNMEKARLAFDGIPEKDIVTWSTMIQGYASNGLPKEA 301

Query: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360
           LDLFFQMQ E L PDCY MVGVLSACA LGAL+LG WAS LMDR EFLSNPVLGTALIDM
Sbjct: 302 LDLFFQMQKEKLAPDCYVMVGVLSACARLGALELGDWASKLMDRAEFLSNPVLGTALIDM 361

Query: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKA-------------------- 420
           ++KCGS+ QA+EIF  MK KD VVWNA + GL+MNGH KA                    
Sbjct: 362 FAKCGSIAQAFEIFKRMKEKDLVVWNAAISGLAMNGHVKAAFGLFSQMEKSGVLPNGNTF 421

Query: 421 ------------------------RVFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINNMP 480
                                   RVFSLTP+IEHYGCMVDLLGRAGLL+EAHQLI NMP
Sbjct: 422 IGLLCCCTHVGLVDDGHRYFDSMSRVFSLTPTIEHYGCMVDLLGRAGLLDEAHQLIKNMP 481

Query: 481 MKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEK 540
           M+ N++VWGALLGGC+LHKDT L E VLKKLIELEPWNSGNYV LSNIYS +H+W++A K
Sbjct: 482 MEANSIVWGALLGGCRLHKDTQLVEHVLKKLIELEPWNSGNYVLLSNIYSASHKWDDAAK 541

Query: 541 IRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPT 600
           IRS M E+ IQK+   SWIE++G VHEFLVGDKSH LSE IY KL EL +ELKA G+VPT
Sbjct: 542 IRSIMNERGIQKVPGYSWIEVNGFVHEFLVGDKSHPLSEMIYTKLGELAKELKAAGYVPT 601

Query: 601 TEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISK 645
           TE+VLFDIEEEEKEHFLG HSEKLA+AFGLI++ P  VIRVVKNLRVCGDCH+ IKL S+
Sbjct: 602 TEYVLFDIEEEEKEHFLGCHSEKLAIAFGLISTAPTDVIRVVKNLRVCGDCHEVIKLFSR 661

BLAST of Csa3G730820 vs. TrEMBL
Match: M5W3F9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppb002198mg PE=4 SV=1)

HSP 1 Score: 876.7 bits (2264), Expect = 1.7e-251
Identity = 428/636 (67.30%), Postives = 497/636 (78.14%), Query Frame = 1

Query: 53  LILCCALDFGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLP 112
           ++L    DFG  +YS+LVF Q  +PNIFLWNTMIRGLVS DCFDDAI  + SMR  G LP
Sbjct: 1   MVLRSGFDFGHASYSRLVFDQTTQPNIFLWNTMIRGLVSDDCFDDAIEFFISMRTEGILP 60

Query: 113 NNFTIPFVLKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVF 172
           N+FT PFVLKACAR+ D  LGL IH+L+VK G++ DV+VKTSLL LY KC   + A KVF
Sbjct: 61  NSFTFPFVLKACARRSDFPLGLNIHTLVVKTGFNFDVYVKTSLLCLYAKCGYLEHAHKVF 120

Query: 173 DDIPDKNVVSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCT 232
           DDIPDKNVVSWTAII GYI +G +REAI  F++LLEMGL+PDSFSLV+VL+AC +LGD +
Sbjct: 121 DDIPDKNVVSWTAIICGYIGAGQYREAIDTFRRLLEMGLRPDSFSLVRVLSACGKLGDLS 180

Query: 233 SGEWIDRYISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYA 292
           SGEWIDRYI++ GMG+NVFVATSL+D+Y KCG +E+A  IF  M EKDIVSWS+MIQGYA
Sbjct: 181 SGEWIDRYITEIGMGKNVFVATSLVDLYAKCGQMEKARGIFDGMLEKDIVSWSSMIQGYA 240

Query: 293 FNGLPQQALDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPV 352
            NGLP++A+DLFFQMQ ENLKPDCY MVGVLSACA LGAL+LG WA SLMD++EF  NPV
Sbjct: 241 SNGLPKEAIDLFFQMQKENLKPDCYAMVGVLSACARLGALELGEWAGSLMDKHEFFVNPV 300

Query: 353 LGTALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKA------------ 412
           LGTALIDMY+KCG + QAWE+F  MK++D VVWNA M GL+MNGH K             
Sbjct: 301 LGTALIDMYAKCGCMIQAWEVFKGMKKRDHVVWNAAMSGLAMNGHVKTVFGLFGQVEKNG 360

Query: 413 --------------------------------RVFSLTPSIEHYGCMVDLLGRAGLLNEA 472
                                            VFSL  +IEHYGCMVDLL RAGLL+EA
Sbjct: 361 IRPDGNTFMGLLCGCSHAGLVDEGRRYFNNMTSVFSLAHTIEHYGCMVDLLSRAGLLDEA 420

Query: 473 HQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGN 532
           + LI  MPMK N+VVWGALLGGC+LH+ T LAE VLK+LIELEPWNS +YV LSNIYS +
Sbjct: 421 YNLIKTMPMKANSVVWGALLGGCRLHRQTQLAELVLKQLIELEPWNSAHYVLLSNIYSAS 480

Query: 533 HRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGREL 592
           H+W+EA   RS M EQ ++KI  CSWIE++G+V EFLVGDKSH LSEKIYAKLDEL +EL
Sbjct: 481 HKWDEAADTRSRMNEQGMKKIPGCSWIEVNGVVQEFLVGDKSHALSEKIYAKLDELAKEL 540

Query: 593 KAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCH 645
           KA G+VPTT+FVLFDIEEEEKEHFLG HSEKLA+AFGLI++ P   IRVVKNLRVCGDCH
Sbjct: 541 KAAGYVPTTDFVLFDIEEEEKEHFLGCHSEKLAIAFGLISTAPKDTIRVVKNLRVCGDCH 600

BLAST of Csa3G730820 vs. TrEMBL
Match: B9IGL4_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0016s13660g PE=4 SV=2)

HSP 1 Score: 869.4 bits (2245), Expect = 2.8e-249
Identity = 428/681 (62.85%), Postives = 515/681 (75.62%), Query Frame = 1

Query: 8   TSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALDFGSTNYS 67
           +S + +K+  IKN L  G +    LKHIHA LLRL LD+D YLLN +L  + +FG+TNYS
Sbjct: 2   SSLIVTKSAGIKNRLIQGFSCLKHLKHIHAALLRLGLDEDTYLLNKVLRFSFNFGNTNYS 61

Query: 68  KLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFVLKACARK 127
             +  Q KEPNIFL+NTMIRGLV  DCF ++I +Y SMR  G  P++FT PFVLKACAR 
Sbjct: 62  FRILDQTKEPNIFLFNTMIRGLVLNDCFQESIEIYHSMRKEGLSPDSFTFPFVLKACARV 121

Query: 128 LDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNVVSWTAII 187
           LD  LG+K+HSL+VKAG + D FVK SL++LY KC   D+A KVFDDIPDKN  SWTA I
Sbjct: 122 LDSELGVKMHSLVVKAGCEADAFVKISLINLYTKCGFIDNAFKVFDDIPDKNFASWTATI 181

Query: 188 TGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRYISDSGMG 247
           +GY+  G  REAI  F++LLEMGL+PDSFSLV+VL+AC R GD  SGEWID YI+++GM 
Sbjct: 182 SGYVGVGKCREAIDMFRRLLEMGLRPDSFSLVEVLSACKRTGDLRSGEWIDEYITENGMA 241

Query: 248 RNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQALDLFFQM 307
           RNVFVAT+L+D Y KCGN+ERA  +F  M EK+IVSWS+MIQGYA NGLP++ALDLFF+M
Sbjct: 242 RNVFVATALVDFYGKCGNMERARSVFDGMLEKNIVSWSSMIQGYASNGLPKEALDLFFKM 301

Query: 308 QSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDMYSKCGSV 367
            +E LKPDCY MVGVL +CA LGAL+LG WAS+L++ NEFL N VLGTALIDMY+KCG +
Sbjct: 302 LNEGLKPDCYAMVGVLCSCARLGALELGDWASNLINGNEFLDNSVLGTALIDMYAKCGRM 361

Query: 368 TQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKAR-------------------------- 427
            +AWE+F  M++KDRVVWNA + GL+M+GH K                            
Sbjct: 362 DRAWEVFRGMRKKDRVVWNAAISGLAMSGHVKDALGLFGQMEKSGIKPDRNTFVGLLCAC 421

Query: 428 ------------------VFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINNMPMKPNAVV 487
                             V++LTP IEHYGCMVDLLGRAG L+EAHQLI +MPM+ NA+V
Sbjct: 422 THAGLVEEGRRYFNSMECVYTLTPEIEHYGCMVDLLGRAGCLDEAHQLIKSMPMEANAIV 481

Query: 488 WGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEKIRSTMKE 547
           WGALLGGC+LH+DT L E VLKKLI LEPW+SGNYV LSNIY+ +H+WEEA KIRS M E
Sbjct: 482 WGALLGGCRLHRDTQLVEVVLKKLIALEPWHSGNYVLLSNIYAASHKWEEAAKIRSIMSE 541

Query: 548 QQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPTTEFVLFD 607
           + ++KI   SWIE+DG+VH+FLVGD SH LSEKIYAKL EL ++LKA G+VPTT+ VLFD
Sbjct: 542 RGVKKIPGYSWIEVDGVVHQFLVGDTSHPLSEKIYAKLGELAKDLKAAGYVPTTDHVLFD 601

Query: 608 IEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISKITKREII 645
           IEEEEKEHF+G HSEKLAVAFGLI++ PN  I VVKNLRVCGDCH+AIK IS+I  REII
Sbjct: 602 IEEEEKEHFIGCHSEKLAVAFGLISTAPNDKILVVKNLRVCGDCHEAIKHISRIAGREII 661

BLAST of Csa3G730820 vs. TAIR10
Match: AT3G08820.1 (AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 727.6 bits (1877), Expect = 6.6e-210
Identity = 364/688 (52.91%), Postives = 479/688 (69.62%), Query Frame = 1

Query: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60
           M+I+T P++   SK  +IK  +S      N LK IH  L+  HL  D +L+NL+L   L 
Sbjct: 1   MSIVTVPSAT--SKVQQIKTLISVACTV-NHLKQIHVSLINHHLHHDTFLVNLLLKRTLF 60

Query: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120
           F  T YS L+FS  + PNIFL+N++I G V+   F + + L+ S+R  G   + FT P V
Sbjct: 61  FRQTKYSYLLFSHTQFPNIFLYNSLINGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLV 120

Query: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180
           LKAC R    +LG+ +HSL+VK G++HDV   TSLLS+Y      +DA K+FD+IPD++V
Sbjct: 121 LKACTRASSRKLGIDLHSLVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSV 180

Query: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240
           V+WTA+ +GY +SG  REAI  FKK++EMG+KPDS+ +V+VL+AC  +GD  SGEWI +Y
Sbjct: 181 VTWTALFSGYTTSGRHREAIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKY 240

Query: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300
           + +  M +N FV T+L+++Y KCG +E+A  +F +M EKDIV+WSTMIQGYA N  P++ 
Sbjct: 241 MEEMEMQKNSFVRTTLVNLYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEG 300

Query: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360
           ++LF QM  ENLKPD +++VG LS+CA+LGALDLG W  SL+DR+EFL+N  +  ALIDM
Sbjct: 301 IELFLQMLQENLKPDQFSIVGFLSSCASLGALDLGEWGISLIDRHEFLTNLFMANALIDM 360

Query: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKA--RVFSLTPSIE-------- 420
           Y+KCG++ + +E+F  MK KD V+ NA + GL+ NGH K    VF  T  +         
Sbjct: 361 YAKCGAMARGFEVFKEMKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTF 420

Query: 421 ----------------------------------HYGCMVDLLGRAGLLNEAHQLINNMP 480
                                             HYGCMVDL GRAG+L++A++LI +MP
Sbjct: 421 LGLLCGCVHAGLIQDGLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMP 480

Query: 481 MKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEK 540
           M+PNA+VWGALL GC+L KDT LAE VLK+LI LEPWN+GNYVQLSNIYS   RW+EA +
Sbjct: 481 MRPNAIVWGALLSGCRLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAE 540

Query: 541 IRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPT 600
           +R  M ++ ++KI   SWIE++G VHEFL  DKSH LS+KIYAKL++LG E++ +G VPT
Sbjct: 541 VRDMMNKKGMKKIPGYSWIELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLMGFVPT 600

Query: 601 TEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISK 645
           TEFV FD+EEEEKE  LGYHSEKLAVA GLI++    VIRVVKNLRVCGDCH+ +KLISK
Sbjct: 601 TEFVFFDVEEEEKERVLGYHSEKLAVALGLISTDHGQVIRVVKNLRVCGDCHEVMKLISK 660

BLAST of Csa3G730820 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 502.7 bits (1293), Expect = 3.4e-142
Identity = 276/693 (39.83%), Postives = 397/693 (57.29%), Query Frame = 1

Query: 32  LKHIHARLLRLHLDQDNYLLN-LILCCALD--FGSTNYSKLVFSQVKEPNIFLWNTMIRG 91
           L+ IHA+++++ L   NY L+ LI  C L   F    Y+  VF  ++EPN+ +WNTM RG
Sbjct: 49  LRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRG 108

Query: 92  LVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFVLKACARKLDVRLGLKIHSLLVKAGYDHD 151
                    A+ LY  M   G LPN++T PFVLK+CA+    + G +IH  ++K G D D
Sbjct: 109 HALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLD 168

Query: 152 VFVKTSLLSL-------------------------------YVKCDNFDDALKVFDDIPD 211
           ++V TSL+S+                               Y      ++A K+FD+IP 
Sbjct: 169 LYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPV 228

Query: 212 KNVVSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWI 271
           K+VVSW A+I+GY  +G+++EA+  FK +++  ++PD  ++V V++ACA+ G    G  +
Sbjct: 229 KDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQV 288

Query: 272 DRYISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLP 331
             +I D G G N+ +  +L+D+Y KCG LE A  +F  +P KD++SW+T+I GY    L 
Sbjct: 289 HLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY 348

Query: 332 QQALDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDR--NEFLSNPVLGT 391
           ++AL LF +M      P+  TM+ +L ACA LGA+D+G W    +D+      +   L T
Sbjct: 349 KEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRT 408

Query: 392 ALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKAR--VFS--------- 451
           +LIDMY+KCG +  A ++F ++  K    WNAM+ G +M+G A A   +FS         
Sbjct: 409 SLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQP 468

Query: 452 -------LTPSIEH--------------------------YGCMVDLLGRAGLLNEAHQL 511
                  L  +  H                          YGCM+DLLG +GL  EA ++
Sbjct: 469 DDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEM 528

Query: 512 INNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRW 571
           IN M M+P+ V+W +LL  CK+H +  L E   + LI++EP N G+YV LSNIY+   RW
Sbjct: 529 INMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRW 588

Query: 572 EEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAV 631
            E  K R+ + ++ ++K+  CS IEID +VHEF++GDK H  + +IY  L+E+   L+  
Sbjct: 589 NEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKA 648

Query: 632 GHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAI 645
           G VP T  VL ++EEE KE  L +HSEKLA+AFGLI++ P   + +VKNLRVC +CH+A 
Sbjct: 649 GFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEAT 708

BLAST of Csa3G730820 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 473.4 bits (1217), Expect = 2.2e-133
Identity = 245/660 (37.12%), Postives = 378/660 (57.27%), Query Frame = 1

Query: 31  QLKHIHARLLRLHLDQDNYLLNLILCCALDFGSTNYSKLVFSQVKEPNIFLWNTMIRGLV 90
           QLK IHARLL L L    +L+  ++  +  FG   +++ VF  +  P IF WN +IRG  
Sbjct: 36  QLKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYS 95

Query: 91  SKDCFDDAIHLYGSMRGGGFLPNNFTIPFVLKACARKLDVRLGLKIHSLLVKAGYDHDVF 150
             + F DA+ +Y +M+     P++FT P +LKAC+    +++G  +H+ + + G+D DVF
Sbjct: 96  RNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVF 155

Query: 151 VKTSLLSLYVKCDNFDDALKVFDDIP--DKNVVSWTAIITGYISSGHFREAIGAFKKLLE 210
           V+  L++LY KC     A  VF+ +P  ++ +VSWTAI++ Y  +G   EA+  F ++ +
Sbjct: 156 VQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRK 215

Query: 211 MGLKPDSFSLVKVLAACARLGDCTSGEWIDRYISDSGMGRNVFVATSLLDMYVKCGNLER 270
           M +KPD  +LV VL A   L D   G  I   +   G+     +  SL  MY KCG +  
Sbjct: 216 MDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVAT 275

Query: 271 ANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQALDLFFQMQSENLKPDCYTMVGVLSACAT 330
           A ++F  M   +++ W+ MI GYA NG  ++A+D+F +M +++++PD  ++   +SACA 
Sbjct: 276 AKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQ 335

Query: 331 LGALDLGIWASSLMDRNEFLSNPVLGTALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAM 390
           +G+L+        + R+++  +  + +ALIDM++KCGSV  A  +F     +D VVW+AM
Sbjct: 336 VGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAM 395

Query: 391 MVGLSMNGHAKARVFSLTPSIE-------------------------------------- 450
           +VG  ++G A+  + SL  ++E                                      
Sbjct: 396 IVGYGLHGRAREAI-SLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMADHK 455

Query: 451 ------HYGCMVDLLGRAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVL 510
                 HY C++DLLGRAG L++A+++I  MP++P   VWGALL  CK H+   L E   
Sbjct: 456 INPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAA 515

Query: 511 KKLIELEPWNSGNYVQLSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEF 570
           ++L  ++P N+G+YVQLSN+Y+    W+   ++R  MKE+ + K   CSW+E+ G +  F
Sbjct: 516 QQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAF 575

Query: 571 LVGDKSHWLSEKIYAKLDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAF 630
            VGDKSH   E+I  +++ +   LK  G V   +  L D+ +EE E  L  HSE++A+A+
Sbjct: 576 RVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLCSHSERIAIAY 635

Query: 631 GLIASPPNHVIRVVKNLRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW 645
           GLI++P    +R+ KNLR C +CH A KLISK+  REI++RDTNRFH F DG CSC DYW
Sbjct: 636 GLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of Csa3G730820 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 444.9 bits (1143), Expect = 8.5e-125
Identity = 240/657 (36.53%), Postives = 372/657 (56.62%), Query Frame = 1

Query: 33  KHIHARLLRLHLDQDNYLLNLILCCALDFGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSK 92
           + +H  +L+    + N + N ++   L     + ++ VF ++ E ++  WN++I G VS 
Sbjct: 215 EQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSN 274

Query: 93  DCFDDAIHLYGSMRGGGFLPNNFTIPFVLKACARKLDVRLGLKIHSLLVKAGYDHDVFVK 152
              +  + ++  M   G   +  TI  V   CA    + LG  +HS+ VKA +  +    
Sbjct: 275 GLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFC 334

Query: 153 TSLLSLYVKCDNFDDALKVFDDIPDKNVVSWTAIITGYISSGHFREAIGAFKKLLEMGLK 212
            +LL +Y KC + D A  VF ++ D++VVS+T++I GY   G   EA+  F+++ E G+ 
Sbjct: 335 NTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGIS 394

Query: 213 PDSFSLVKVLAACARLGDCTSGEWIDRYISDSGMGRNVFVATSLLDMYVKCGNLERANLI 272
           PD +++  VL  CAR      G+ +  +I ++ +G ++FV+ +L+DMY KCG+++ A L+
Sbjct: 395 PDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 454

Query: 273 FSAMPEKDIVSWSTMIQGYAFNGLPQQALDLF-FQMQSENLKPDCYTMVGVLSACATLGA 332
           FS M  KDI+SW+T+I GY+ N    +AL LF   ++ +   PD  T+  VL ACA+L A
Sbjct: 455 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 514

Query: 333 LDLGIWASSLMDRNEFLSNPVLGTALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAMMVG 392
            D G      + RN + S+  +  +L+DMY+KCG++  A  +F  +  KD V W  M+ G
Sbjct: 515 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 574

Query: 393 LSMNGHAKARV------------------FSLTPSIEH---------------------- 452
             M+G  K  +                   SL  +  H                      
Sbjct: 575 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 634

Query: 453 ----YGCMVDLLGRAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKL 512
               Y C+VD+L R G L +A++ I NMP+ P+A +WGALL GC++H D  LAE+V +K+
Sbjct: 635 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKV 694

Query: 513 IELEPWNSGNYVQLSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVG 572
            ELEP N+G YV ++NIY+   +WE+ +++R  + ++ ++K   CSWIEI G V+ F+ G
Sbjct: 695 FELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAG 754

Query: 573 DKSHWLSEKIYAKLDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLI 632
           D S+  +E I A L ++   +   G+ P T++ L D EE EKE  L  HSEKLA+A G+I
Sbjct: 755 DSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGII 814

Query: 633 ASPPNHVIRVVKNLRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW 645
           +S    +IRV KNLRVCGDCH+  K +SK+T+REI++RD+NRFH F DG CSCR +W
Sbjct: 815 SSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871


HSP 2 Score: 203.8 bits (517), Expect = 3.3e-52
Identity = 110/337 (32.64%), Postives = 180/337 (53.41%), Query Frame = 1

Query: 62  GSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFVL 121
           G    +  VF +VK      WN ++  L     F  +I L+  M   G   +++T   V 
Sbjct: 143 GDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVS 202

Query: 122 KACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNVV 181
           K+ +    V  G ++H  ++K+G+     V  SL++ Y+K    D A KVFD++ +++V+
Sbjct: 203 KSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVI 262

Query: 182 SWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRYI 241
           SW +II GY+S+G   + +  F ++L  G++ D  ++V V A CA     + G  +    
Sbjct: 263 SWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIG 322

Query: 242 SDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQAL 301
             +   R      +LLDMY KCG+L+ A  +F  M ++ +VS+++MI GYA  GL  +A+
Sbjct: 323 VKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAV 382

Query: 302 DLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDMY 361
            LF +M+ E + PD YT+  VL+ CA    LD G      +  N+   +  +  AL+DMY
Sbjct: 383 KLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMY 442

Query: 362 SKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHA 399
           +KCGS+ +A  +F+ M+ KD + WN ++ G S N +A
Sbjct: 443 AKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYA 479


HSP 3 Score: 151.8 bits (382), Expect = 1.5e-36
Identity = 90/311 (28.94%), Postives = 152/311 (48.87%), Query Frame = 1

Query: 116 TIPFVLKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDI 175
           T+  VL+ CA    ++ G ++ + +   G+  D  + + L  +Y  C +  +A +VFD++
Sbjct: 96  TLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEV 155

Query: 176 PDKNVVSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGE 235
             +  + W  ++     SG F  +IG FKK++  G++ DS++   V  + + L     GE
Sbjct: 156 KIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGE 215

Query: 236 WIDRYISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNG 295
            +  +I  SG G    V  SL+  Y+K   ++ A  +F  M E+D++SW+++I GY  NG
Sbjct: 216 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 275

Query: 296 LPQQALDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGT 355
           L ++ L +F QM    ++ D  T+V V + CA    + LG    S+  +  F        
Sbjct: 276 LAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCN 335

Query: 356 ALIDMYSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCM 415
            L+DMYSKCG +  A  +F  M  +  V + +M+ G +  G A   V  L   +E  G  
Sbjct: 336 TLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAV-KLFEEMEEEGIS 395

Query: 416 VDLLGRAGLLN 427
            D+     +LN
Sbjct: 396 PDVYTVTAVLN 405

BLAST of Csa3G730820 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 418.3 bits (1074), Expect = 8.5e-117
Identity = 233/626 (37.22%), Postives = 353/626 (56.39%), Query Frame = 1

Query: 25  GLNFFNQLKHIHARLLRLHLD-QDNYLLNLILCCALDFGST---NYSKLVFSQVKEP-NI 84
           G++   +L+ IHA  +R  +   D  L   ++   +   S    +Y+  VFS++++P N+
Sbjct: 26  GVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINV 85

Query: 85  FLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFL-PNNFTIPFVLKACARKLDVRLGLKIHS 144
           F+WNT+IRG         A  LY  MR  G + P+  T PF++KA     DVRLG  IHS
Sbjct: 86  FIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHS 145

Query: 145 LLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNVVSWTAIITGYISSGHFRE 204
           +++++G+   ++V+ SLL LY  C +   A KVFD +P+K++V+W ++I G+  +G   E
Sbjct: 146 VVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEE 205

Query: 205 AIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRYISDSGMGRNVFVATSLLD 264
           A+  + ++   G+KPD F++V +L+ACA++G  T G+ +  Y+   G+ RN+  +  LLD
Sbjct: 206 ALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLD 265

Query: 265 MYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQALDLFFQMQSENLKPDCYT 324
           +Y +CG +E A                                 LF +M  +N       
Sbjct: 266 LYARCGRVEEAKT-------------------------------LFDEMVDKNSVSWTSL 325

Query: 325 MVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDMYSKCGSVTQAWEIFTAMK 384
           +VG+        A++L  +  S       L   +    ++   S CG V + +E F  M+
Sbjct: 326 IVGLAVNGFGKEAIELFKYMEST---EGLLPCEITFVGILYACSHCGMVKEGFEYFRRMR 385

Query: 385 RKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINNMPMK 444
            +                      + + P IEH+GCMVDLL RAG + +A++ I +MPM+
Sbjct: 386 EE----------------------YKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQ 445

Query: 445 PNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEKIR 504
           PN V+W  LLG C +H D+ LAE    ++++LEP +SG+YV LSN+Y+   RW + +KIR
Sbjct: 446 PNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIR 505

Query: 505 STMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPTTE 564
             M    ++K+   S +E+   VHEFL+GDKSH  S+ IYAKL E+   L++ G+VP   
Sbjct: 506 KQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQIS 565

Query: 565 FVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISKIT 624
            V  D+EEEEKE+ + YHSEK+A+AF LI++P    I VVKNLRVC DCH AIKL+SK+ 
Sbjct: 566 NVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVY 595

Query: 625 KREIIIRDTNRFHTFIDGSCSCRDYW 645
            REI++RD +RFH F +GSCSC+DYW
Sbjct: 626 NREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Csa3G730820 vs. NCBI nr
Match: gi|700203585|gb|KGN58718.1| (hypothetical protein Csa_3G730820 [Cucumis sativus])

HSP 1 Score: 1314.3 bits (3400), Expect = 0.0e+00
Identity = 644/644 (100.00%), Postives = 644/644 (100.00%), Query Frame = 1

Query: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60
           MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD
Sbjct: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60

Query: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120
           FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV
Sbjct: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120

Query: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180
           LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV
Sbjct: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180

Query: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240
           VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY
Sbjct: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240

Query: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300
           ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA
Sbjct: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300

Query: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360
           LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM
Sbjct: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360

Query: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLG 420
           YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLG
Sbjct: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEHYGCMVDLLG 420

Query: 421 RAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQ 480
           RAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQ
Sbjct: 421 RAGLLNEAHQLINNMPMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQ 480

Query: 481 LSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAK 540
           LSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAK
Sbjct: 481 LSNIYSGNHRWEEAEKIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAK 540

Query: 541 LDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKN 600
           LDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKN
Sbjct: 541 LDELGRELKAVGHVPTTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKN 600

Query: 601 LRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW 645
           LRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW
Sbjct: 601 LRVCGDCHDAIKLISKITKREIIIRDTNRFHTFIDGSCSCRDYW 644

BLAST of Csa3G730820 vs. NCBI nr
Match: gi|449440243|ref|XP_004137894.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Cucumis sativus])

HSP 1 Score: 1274.2 bits (3296), Expect = 0.0e+00
Identity = 638/689 (92.60%), Postives = 639/689 (92.74%), Query Frame = 1

Query: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60
           MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD
Sbjct: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60

Query: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120
           FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV
Sbjct: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120

Query: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180
           LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV
Sbjct: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180

Query: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240
           VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY
Sbjct: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240

Query: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300
           ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA
Sbjct: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300

Query: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360
           LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM
Sbjct: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360

Query: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEH--------- 420
           YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKA VFSL   +E          
Sbjct: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKA-VFSLFSLVEKHGIRPDENT 420

Query: 421 ------------------------------------YGCMVDLLGRAGLLNEAHQLINNM 480
                                               YGCMVDLLGRAGLLNEAHQLINNM
Sbjct: 421 FIGLLCGCTHGGFVNEGRQFFNNMKRVFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINNM 480

Query: 481 PMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAE 540
           PMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAE
Sbjct: 481 PMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAE 540

Query: 541 KIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVP 600
           KIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVP
Sbjct: 541 KIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVP 600

Query: 601 TTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLIS 645
           TTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLIS
Sbjct: 601 TTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLIS 660

BLAST of Csa3G730820 vs. NCBI nr
Match: gi|659083446|ref|XP_008442361.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Cucumis melo])

HSP 1 Score: 1208.4 bits (3125), Expect = 0.0e+00
Identity = 604/689 (87.66%), Postives = 624/689 (90.57%), Query Frame = 1

Query: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60
           MTILT+P+SPVFSKAL+IKNYLSNG+NFF QLKHIHARLLRLHL QDNYLLN+ILCC LD
Sbjct: 1   MTILTNPSSPVFSKALDIKNYLSNGVNFFKQLKHIHARLLRLHLHQDNYLLNMILCCGLD 60

Query: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120
           FGST+Y+KLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFT PFV
Sbjct: 61  FGSTDYTKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTFPFV 120

Query: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180
           LKACARKLD+RLGLKIH+LLVKAGYD DVFVKTSLLSLYVKCDN DDALKVFDDIPDKNV
Sbjct: 121 LKACARKLDIRLGLKIHTLLVKAGYDCDVFVKTSLLSLYVKCDNLDDALKVFDDIPDKNV 180

Query: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240
           VSWTAIITGYISSGHFREAIGAF+KLLEMGLKPDSFSLVKVLAACARLGD TSGEWIDRY
Sbjct: 181 VSWTAIITGYISSGHFREAIGAFRKLLEMGLKPDSFSLVKVLAACARLGDFTSGEWIDRY 240

Query: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300
           ISD+GMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKD+VSWSTMIQGYAFNGLPQQA
Sbjct: 241 ISDNGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDVVSWSTMIQGYAFNGLPQQA 300

Query: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360
           LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDR+EFLSNPVLGTALIDM
Sbjct: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRHEFLSNPVLGTALIDM 360

Query: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKARVFSLTPSIEH--------- 420
           YSKCGSVT+AWEIF  M++KDRVVWNAMMVGLSMNGHAKA VFSL   +E          
Sbjct: 361 YSKCGSVTRAWEIFRTMEKKDRVVWNAMMVGLSMNGHAKA-VFSLFSLVEKHGIRPDENT 420

Query: 421 ------------------------------------YGCMVDLLGRAGLLNEAHQLINNM 480
                                               YGCMVDLLGRAGLLNEAHQLIN+M
Sbjct: 421 FIGLLCGCTHGGFVNEGRQFFNNMKRVFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINDM 480

Query: 481 PMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAE 540
           PMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYV LSNIYS NHRWEEAE
Sbjct: 481 PMKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVLLSNIYSANHRWEEAE 540

Query: 541 KIRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVP 600
           KIR TMKEQQIQKIRACSWIEI+GIVHEFLVGD SH LSEKIYAKLDELGRELKAVGHVP
Sbjct: 541 KIRLTMKEQQIQKIRACSWIEINGIVHEFLVGDNSHSLSEKIYAKLDELGRELKAVGHVP 600

Query: 601 TTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLIS 645
           TTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPN VIRVVKNLRVCGDCHDAIKLIS
Sbjct: 601 TTEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNCVIRVVKNLRVCGDCHDAIKLIS 660

BLAST of Csa3G730820 vs. NCBI nr
Match: gi|645279612|ref|XP_008244805.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Prunus mume])

HSP 1 Score: 943.3 bits (2437), Expect = 2.2e-271
Identity = 462/688 (67.15%), Postives = 534/688 (77.62%), Query Frame = 1

Query: 1   MTILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALD 60
           MT+L +   PV SKALE K  L  G N F  LKH HARLLRL LDQDNYLLN++L    D
Sbjct: 1   MTVLPNRAFPVLSKALETKQCLLQGFNSFKHLKHAHARLLRLGLDQDNYLLNVVLRSGFD 60

Query: 61  FGSTNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFV 120
           FG  +YS+LVF Q  +PNIFLWNTMIRGLVS DCFDDAI  + SMR  G LPN+FT PFV
Sbjct: 61  FGHASYSRLVFRQTTQPNIFLWNTMIRGLVSDDCFDDAIEFFSSMRTEGILPNSFTFPFV 120

Query: 121 LKACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNV 180
           LKACAR+ D +LGL IH+L+VK G+D DV+VKTSLL LY KC   + A KVFDDIPDKNV
Sbjct: 121 LKACARRSDFQLGLNIHTLVVKTGFDFDVYVKTSLLYLYAKCGYLEHAHKVFDDIPDKNV 180

Query: 181 VSWTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRY 240
           VSWTAII GYI +G +REAI  F++LLEMGL+PDSFSLV+VL+AC +LGD +SGEWIDRY
Sbjct: 181 VSWTAIICGYIGAGQYREAIDTFRRLLEMGLRPDSFSLVRVLSACGKLGDLSSGEWIDRY 240

Query: 241 ISDSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQA 300
           I++ GMG+NVFVATSL+D+Y KCG +E+A  IF  M EKDIVSWS+MIQGYA NGLP++A
Sbjct: 241 ITEIGMGKNVFVATSLVDLYAKCGQMEKARGIFDGMLEKDIVSWSSMIQGYASNGLPKEA 300

Query: 301 LDLFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDM 360
           +DLFFQMQ ENLKPDCY MVGVLSACA LGAL+LG WAS+LMD++EF  NPVLGTALIDM
Sbjct: 301 IDLFFQMQKENLKPDCYAMVGVLSACARLGALELGEWASNLMDKHEFFVNPVLGTALIDM 360

Query: 361 YSKCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSMNGHAKA-------------------- 420
           Y+KCG + QAWE+F  MK++D VVWNA M GL+MNGH K                     
Sbjct: 361 YAKCGCMIQAWEVFKGMKKRDHVVWNAAMSGLAMNGHVKTVFGLFGQVEKNGIRPDGNTF 420

Query: 421 ------------------------RVFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINNMP 480
                                     FSL P+IEHYGCMVDLL RA LL+EA+ LI  MP
Sbjct: 421 MGLLCGCSHAGLVDEGRRYFNNMTSAFSLAPTIEHYGCMVDLLSRADLLDEAYNLIKTMP 480

Query: 481 MKPNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEK 540
           MK N+VVWGALLGGC+LH+ T LAE VLK+LIELEPWNS +YV LSNIYS +H+W+EA  
Sbjct: 481 MKANSVVWGALLGGCRLHRQTQLAELVLKQLIELEPWNSAHYVLLSNIYSASHKWDEAAD 540

Query: 541 IRSTMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPT 600
            RS M EQ ++KI  CSWIE+ G+VHEFLVGDKSH LSEKIYAKLDEL +ELKAVG+VPT
Sbjct: 541 TRSRMNEQGMKKIPGCSWIEVKGVVHEFLVGDKSHALSEKIYAKLDELAKELKAVGYVPT 600

Query: 601 TEFVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISK 645
           T+FVLFDIEEEEKEHFLG HSEKLA+AFGLI++ P   IRVVKNLRVCGDCH+AIKLISK
Sbjct: 601 TDFVLFDIEEEEKEHFLGCHSEKLAIAFGLISTAPKDTIRVVKNLRVCGDCHEAIKLISK 660

BLAST of Csa3G730820 vs. NCBI nr
Match: gi|694395578|ref|XP_009373109.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Pyrus x bretschneideri])

HSP 1 Score: 928.3 bits (2398), Expect = 7.2e-267
Identity = 457/686 (66.62%), Postives = 532/686 (77.55%), Query Frame = 1

Query: 3   ILTSPTSPVFSKALEIKNYLSNGLNFFNQLKHIHARLLRLHLDQDNYLLNLILCCALDFG 62
           I    +SP FSKALE K  L  G   FN LKH HARLLRL LDQDNYLLNL+L    DFG
Sbjct: 4   IANRASSPAFSKALETKQCLLQGFTSFNHLKHAHARLLRLGLDQDNYLLNLVLRSGFDFG 63

Query: 63  STNYSKLVFSQVKEPNIFLWNTMIRGLVSKDCFDDAIHLYGSMRGGGFLPNNFTIPFVLK 122
            T YS+ VF Q  +PNIFLWNTMIRGLVS DCFDDAI  YGSMR  GFLPN+FT PFVLK
Sbjct: 64  HTAYSRHVFRQTAQPNIFLWNTMIRGLVSNDCFDDAIQFYGSMRKDGFLPNSFTYPFVLK 123

Query: 123 ACARKLDVRLGLKIHSLLVKAGYDHDVFVKTSLLSLYVKCDNFDDALKVFDDIPDKNVVS 182
           ACAR+ D +LGL IH+L+VK G+D DV+VKTSLL LY KC   + A KVFD++P+KNVVS
Sbjct: 124 ACARRSDFQLGLNIHTLVVKTGFDFDVYVKTSLLCLYAKCGYLEHAHKVFDEMPEKNVVS 183

Query: 183 WTAIITGYISSGHFREAIGAFKKLLEMGLKPDSFSLVKVLAACARLGDCTSGEWIDRYIS 242
           WTA+I GYI +  +REAI  F+ LLEMGL+PDSFSLV+VL+AC RLGD  SGEWID YI 
Sbjct: 184 WTAVICGYIEARRYREAIDTFRGLLEMGLRPDSFSLVRVLSACGRLGDIGSGEWIDGYIM 243

Query: 243 DSGMGRNVFVATSLLDMYVKCGNLERANLIFSAMPEKDIVSWSTMIQGYAFNGLPQQALD 302
           + GMGRNVFVATSL+DM+ KCGN+E+A  +F  MPEKDIVSWS+MIQGYA NGLP++A+D
Sbjct: 244 EIGMGRNVFVATSLVDMFTKCGNMEKARRVFDVMPEKDIVSWSSMIQGYASNGLPKEAID 303

Query: 303 LFFQMQSENLKPDCYTMVGVLSACATLGALDLGIWASSLMDRNEFLSNPVLGTALIDMYS 362
           LFFQMQ ENLKPDCY MVGVLSACA LGAL+LG WAS LMD++E  +NPVLGTALIDMY+
Sbjct: 304 LFFQMQKENLKPDCYAMVGVLSACARLGALELGDWASHLMDKDECFTNPVLGTALIDMYA 363

Query: 363 KCGSVTQAWEIFTAMKRKDRVVWNAMMVGLSM---------------------NGHA--- 422
           KCG++  AWE+F  MK+KD VVWNA+M GL+M                     +G+    
Sbjct: 364 KCGNMVLAWEVFKGMKKKDHVVWNAVMSGLAMNGHVKAVFGLFGQVVKIGIRPDGNTFMG 423

Query: 423 --------------------KARVFSLTPSIEHYGCMVDLLGRAGLLNEAHQLINNMPMK 482
                                  VFSLTP+IEHYGCMVDLLGRAGLL+EA++LI +MPM+
Sbjct: 424 LLCGCCHAGLVDEGRRYFNNMTSVFSLTPTIEHYGCMVDLLGRAGLLDEAYELIKSMPME 483

Query: 483 PNAVVWGALLGGCKLHKDTHLAEQVLKKLIELEPWNSGNYVQLSNIYSGNHRWEEAEKIR 542
            N++VWGALLGGC+LH++T LAE VLK+LI LEPWNS +YV LSNIYS +H+W EA   R
Sbjct: 484 ANSIVWGALLGGCRLHRNTQLAELVLKQLIGLEPWNSAHYVLLSNIYSASHKWNEAADTR 543

Query: 543 STMKEQQIQKIRACSWIEIDGIVHEFLVGDKSHWLSEKIYAKLDELGRELKAVGHVPTTE 602
           S M  Q ++KI  CSWIE++G+VHEFLVGD+SH LS+KIYAKL EL +ELKA G+VPTT+
Sbjct: 544 SQMSRQGMKKIPGCSWIEVNGVVHEFLVGDESHTLSDKIYAKLHELAKELKAAGYVPTTD 603

Query: 603 FVLFDIEEEEKEHFLGYHSEKLAVAFGLIASPPNHVIRVVKNLRVCGDCHDAIKLISKIT 645
           FVLFDIEEEEKEHFLG HSEKLAVAFGLI++ P   IRVVKNLRVCGDCH+AIKLISKIT
Sbjct: 604 FVLFDIEEEEKEHFLGCHSEKLAVAFGLISTAPTDTIRVVKNLRVCGDCHEAIKLISKIT 663

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP219_ARATH1.2e-20852.91Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th... [more]
PPR21_ARATH6.1e-14139.83Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
PP224_ARATH4.0e-13237.12Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN... [more]
PP320_ARATH1.5e-12336.53Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
PP330_ARATH1.5e-11537.22Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0LA44_CUCSA0.0e+00100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_3G730820 PE=4 SV=1[more]
F6GY00_VITVI9.4e-26665.79Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0058g00760 PE=4 SV=... [more]
A0A061FR98_THECC2.8e-26265.84Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 OS=Theobr... [more]
M5W3F9_PRUPE1.7e-25167.30Uncharacterized protein OS=Prunus persica GN=PRUPE_ppb002198mg PE=4 SV=1[more]
B9IGL4_POPTR2.8e-24962.85Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
Match NameE-valueIdentityDescription
AT3G08820.16.6e-21052.91 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G08070.13.4e-14239.83 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G12770.12.2e-13337.12 mitochondrial editing factor 22[more]
AT4G18750.18.5e-12536.53 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G21065.18.5e-11737.22 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|700203585|gb|KGN58718.1|0.0e+00100.00hypothetical protein Csa_3G730820 [Cucumis sativus][more]
gi|449440243|ref|XP_004137894.1|0.0e+0092.60PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Cucum... [more]
gi|659083446|ref|XP_008442361.1|0.0e+0087.66PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Cucum... [more]
gi|645279612|ref|XP_008244805.1|2.2e-27167.15PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Prunu... [more]
gi|694395578|ref|XP_009373109.1|7.2e-26766.62PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Pyrus... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:1901363 heterocyclic compound binding
molecular_function GO:0097159 organic cyclic compound binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa3G730820.1Csa3G730820.1mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 411..436
score: 0.082coord: 355..380
score: 0.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 279..327
score: 7.3E-11coord: 178..226
score: 5.6E-8coord: 77..126
score: 3.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 282..316
score: 4.2E-6coord: 181..215
score: 6.9E-6coord: 81..113
score: 7.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 350..384
score: 8.78coord: 280..314
score: 11.772coord: 474..508
score: 7.267coord: 113..147
score: 5.579coord: 440..470
score: 6.303coord: 148..178
score: 8.024coord: 214..248
score: 5.415coord: 179..213
score: 11.477coord: 249..279
score: 7.991coord: 78..112
score: 10.731coord: 408..438
score: 7
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 157..231
score: 7.8E-10coord: 441..495
score: 7.8E-10coord: 283..316
score: 7.8
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 447..496
score: 6.54E-7coord: 158..232
score: 6.5
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 34..515
score:
NoneNo IPR availablePANTHERPTHR24015:SF880SUBFAMILY NOT NAMEDcoord: 34..515
score:

The following gene(s) are paralogous to this gene:

None