Cp4.1LG20g03740 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG20g03740
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG20 : 2108000 .. 2109217 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGCTTCTTTATCGCCTCCGCAATGCTTTCCCTTCGAATTCGAGTTACATCAACCATCGTCTTCACTACCGTTGTCTATCGACAATCCTTTCCCCTGATTCCACTAATCCTTTGTCAGCTAAACAGAAATCGAGGGCCGCATTGTCCCTACTGAAAGTTGAGGAAAATCCTGAGCGTATAATCGATATTTGCCGAGCTGCTACTCTCACTCCAGAATCACATCTCGATCGCATCGCTTTCTCTGTTGCTATTTCTAAGCTTTCAGAGTCCAAGCATTTCGATGGGATTCGCCTGTTCCTCGAGGAATTGAAGTCTCGTCCCGACTTGAAAAACGAGCGTTTTGCTTGCCACGCTATTATTCTCTATGGCCAGGCCAATATGCTCGATCATGCTATTCGCACTTTCAAGCAAATTGATGAACTGGGTGTGCGTCCTTCGGTTAAATCGCTTAATGCATTGCTATTTGCTTGTAATGTAGCTAAGGACTACAAGGAACTGAAGCGGGTTTTTATGGAGTTTCCTAAGATTTATGGTATCGAACCAGATCTCGATACTTATAACAGAGTAATCAAGGCGTTTTCAAAGTCGGGTTCCGCGAGTGCAGTGTATTCGATTGTGGCGGAAATGGACAGGAAGGGTGTCAAACCAAATGCGACTACTTTTGCGAACTGGCTTGCTGGATGCTATAAGGAAGAGAAGTACGAGGATGTTGAGAAAGTCATAAATTTGATGGGAAAATATGGTGTGCGGCGTGGAGTTAGTACATATAATGCAAGAATACTGAGTCTTTGTAAATTGAAGAAATCATCAGAGGCGAAGGCTTTATTTGATGGGATGTTGTCGAGAGGTATGAAACCAAACTCTGTTACATACTGTAAGTTGATTCATGGATTCTGTAGGGAAGGAAATCTGGATGAAGCTAAGAATCTTTTTAAGAGAATGATCAACAGCGGTTGCCAACCTGACAGTGATTGCTATTTCACTTTGGTTTATTTCCATTGTCGAGGAGGAGATTATGATACTGCTTTCAAGATCTGTGGCGAAAGCATGGAGAAGGGGTGGGTTCCAAATATCACTACGATGAAGTCTCTTGTTTATGGATTAGTTAGCATTTCAAAGGTCGACGAGGCAAAGCAACTTATTGGGCAAATCAAGGAGAGGTTCTCAAAGAATGTTGAAAAGTGGAATGAAATTGAAGCTGGATTACCTCAGTGA

mRNA sequence

ATGGCGCTTCTTTATCGCCTCCGCAATGCTTTCCCTTCGAATTCGAGTTACATCAACCATCGTCTTCACTACCGTTGTCTATCGACAATCCTTTCCCCTGATTCCACTAATCCTTTGTCAGCTAAACAGAAATCGAGGGCCGCATTGTCCCTACTGAAAGTTGAGGAAAATCCTGAGCGTATAATCGATATTTGCCGAGCTGCTACTCTCACTCCAGAATCACATCTCGATCGCATCGCTTTCTCTGTTGCTATTTCTAAGCTTTCAGAGTCCAAGCATTTCGATGGGATTCGCCTGTTCCTCGAGGAATTGAAGTCTCGTCCCGACTTGAAAAACGAGCGTTTTGCTTGCCACGCTATTATTCTCTATGGCCAGGCCAATATGCTCGATCATGCTATTCGCACTTTCAAGCAAATTGATGAACTGGGTGTGCGTCCTTCGGTTAAATCGCTTAATGCATTGCTATTTGCTTGTAATGTAGCTAAGGACTACAAGGAACTGAAGCGGGTTTTTATGGAGTTTCCTAAGATTTATGGTATCGAACCAGATCTCGATACTTATAACAGAGTAATCAAGGCGTTTTCAAAGTCGGGTTCCGCGAGTGCAGTGTATTCGATTGTGGCGGAAATGGACAGGAAGGGTGTCAAACCAAATGCGACTACTTTTGCGAACTGGCTTGCTGGATGCTATAAGGAAGAGAAGTACGAGGATGTTGAGAAAGTCATAAATTTGATGGGAAAATATGGTGTGCGGCGTGGAGTTAGTACATATAATGCAAGAATACTGAGTCTTTGTAAATTGAAGAAATCATCAGAGGCGAAGGCTTTATTTGATGGGATGTTGTCGAGAGGTATGAAACCAAACTCTGTTACATACTGTAAGTTGATTCATGGATTCTGTAGGGAAGGAAATCTGGATGAAGCTAAGAATCTTTTTAAGAGAATGATCAACAGCGGTTGCCAACCTGACAGTGATTGCTATTTCACTTTGGTTTATTTCCATTGTCGAGGAGGAGATTATGATACTGCTTTCAAGATCTGTGGCGAAAGCATGGAGAAGGGGTGGGTTCCAAATATCACTACGATGAAGTCTCTTGTTTATGGATTAGTTAGCATTTCAAAGGTCGACGAGGCAAAGCAACTTATTGGGCAAATCAAGGAGAGGTTCTCAAAGAATGTTGAAAAGTGGAATGAAATTGAAGCTGGATTACCTCAGTGA

Coding sequence (CDS)

ATGGCGCTTCTTTATCGCCTCCGCAATGCTTTCCCTTCGAATTCGAGTTACATCAACCATCGTCTTCACTACCGTTGTCTATCGACAATCCTTTCCCCTGATTCCACTAATCCTTTGTCAGCTAAACAGAAATCGAGGGCCGCATTGTCCCTACTGAAAGTTGAGGAAAATCCTGAGCGTATAATCGATATTTGCCGAGCTGCTACTCTCACTCCAGAATCACATCTCGATCGCATCGCTTTCTCTGTTGCTATTTCTAAGCTTTCAGAGTCCAAGCATTTCGATGGGATTCGCCTGTTCCTCGAGGAATTGAAGTCTCGTCCCGACTTGAAAAACGAGCGTTTTGCTTGCCACGCTATTATTCTCTATGGCCAGGCCAATATGCTCGATCATGCTATTCGCACTTTCAAGCAAATTGATGAACTGGGTGTGCGTCCTTCGGTTAAATCGCTTAATGCATTGCTATTTGCTTGTAATGTAGCTAAGGACTACAAGGAACTGAAGCGGGTTTTTATGGAGTTTCCTAAGATTTATGGTATCGAACCAGATCTCGATACTTATAACAGAGTAATCAAGGCGTTTTCAAAGTCGGGTTCCGCGAGTGCAGTGTATTCGATTGTGGCGGAAATGGACAGGAAGGGTGTCAAACCAAATGCGACTACTTTTGCGAACTGGCTTGCTGGATGCTATAAGGAAGAGAAGTACGAGGATGTTGAGAAAGTCATAAATTTGATGGGAAAATATGGTGTGCGGCGTGGAGTTAGTACATATAATGCAAGAATACTGAGTCTTTGTAAATTGAAGAAATCATCAGAGGCGAAGGCTTTATTTGATGGGATGTTGTCGAGAGGTATGAAACCAAACTCTGTTACATACTGTAAGTTGATTCATGGATTCTGTAGGGAAGGAAATCTGGATGAAGCTAAGAATCTTTTTAAGAGAATGATCAACAGCGGTTGCCAACCTGACAGTGATTGCTATTTCACTTTGGTTTATTTCCATTGTCGAGGAGGAGATTATGATACTGCTTTCAAGATCTGTGGCGAAAGCATGGAGAAGGGGTGGGTTCCAAATATCACTACGATGAAGTCTCTTGTTTATGGATTAGTTAGCATTTCAAAGGTCGACGAGGCAAAGCAACTTATTGGGCAAATCAAGGAGAGGTTCTCAAAGAATGTTGAAAAGTGGAATGAAATTGAAGCTGGATTACCTCAGTGA

Protein sequence

MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGIEPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ
BLAST of Cp4.1LG20g03740 vs. Swiss-Prot
Match: PPR87_ARATH (Pentatricopeptide repeat-containing protein At1g61870, mitochondrial OS=Arabidopsis thaliana GN=PPR336 PE=2 SV=2)

HSP 1 Score: 511.1 bits (1315), Expect = 1.1e-143
Identity = 250/409 (61.12%), Postives = 324/409 (79.22%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLS---TILSPDSTNPLSAKQKSRAALSLLKVEEN 60
           MALL R+R++  S   ++N     R LS   TILSPDS  PL++K+KS+AALSLLK E++
Sbjct: 1   MALLSRIRSS-TSLFRHLNASPQIRSLSSASTILSPDSKTPLTSKEKSKAALSLLKSEKD 60

Query: 61  PERIIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEE-LKSRPDLKNERFA 120
           P+RI++ICRAA+LTP+  +DRIAFS A+  L+E KHF  +   L+  +++RPDLK+ERFA
Sbjct: 61  PDRILEICRAASLTPDCRIDRIAFSAAVENLAEKKHFSAVSNLLDGFIENRPDLKSERFA 120

Query: 121 CHAIILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPK 180
            HAI+LY QANMLDH++R F+ +++  +  +VKSLNALLFAC VAKDYKE KRV++E PK
Sbjct: 121 AHAIVLYAQANMLDHSLRVFRDLEKFEISRTVKSLNALLFACLVAKDYKEAKRVYIEMPK 180

Query: 181 IYGIEPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYE 240
           +YGIEPDL+TYNR+IK F +SGSAS+ YSIVAEM+RKG+KPN+++F   ++G Y E+K +
Sbjct: 181 MYGIEPDLETYNRMIKVFCESGSASSSYSIVAEMERKGIKPNSSSFGLMISGFYAEDKSD 240

Query: 241 DVEKVINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLI 300
           +V KV+ +M   GV  GVSTYN RI SLCK KKS EAKAL DGMLS GMKPN+VTY  LI
Sbjct: 241 EVGKVLAMMKDRGVNIGVSTYNIRIQSLCKRKKSKEAKALLDGMLSAGMKPNTVTYSHLI 300

Query: 301 HGFCREGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWV 360
           HGFC E + +EAK LFK M+N GC+PDS+CYFTL+Y+ C+GGD++TA  +C ESMEK WV
Sbjct: 301 HGFCNEDDFEEAKKLFKIMVNRGCKPDSECYFTLIYYLCKGGDFETALSLCKESMEKNWV 360

Query: 361 PNITTMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           P+ + MKSLV GL   SKV+EAK+LIGQ+KE+F++NVE WNE+EA LPQ
Sbjct: 361 PSFSIMKSLVNGLAKDSKVEEAKELIGQVKEKFTRNVELWNEVEAALPQ 408

BLAST of Cp4.1LG20g03740 vs. Swiss-Prot
Match: PPR33_ARATH (Pentatricopeptide repeat-containing protein At1g11630, mitochondrial OS=Arabidopsis thaliana GN=At1g11630 PE=2 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 1.4e-122
Identity = 220/406 (54.19%), Postives = 301/406 (74.14%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQK-SRAALSLLKVEENPE 60
           MA L+R+R    ++   +     +R  S+  S  +   L++KQK SR  LSLLK E NP+
Sbjct: 1   MAFLFRIR----TSEFILQKATQFRLKSSSSSIFTLKSLTSKQKKSRDTLSLLKSENNPD 60

Query: 61  RIIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEE-LKSRPDLKNERFACH 120
           RI++ICR+ +L+P+ H+DRI FSVA+  L+  KHF  +   L+  ++++PD K+E FA  
Sbjct: 61  RILEICRSTSLSPDYHVDRIIFSVAVVTLAREKHFVAVSQLLDGFIQNQPDPKSESFAVR 120

Query: 121 AIILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIY 180
           AIILYG+ANMLD +I+TF+ +++  +  +VKSLNALLFAC +AKDYKE  RV++E PK+Y
Sbjct: 121 AIILYGRANMLDRSIQTFRNLEQYEIPRTVKSLNALLFACLMAKDYKEANRVYLEMPKMY 180

Query: 181 GIEPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDV 240
           GIEPDL+TYNR+I+   +SGS S+ YSIVAEM+RK +KP A +F   + G YKEEK+++V
Sbjct: 181 GIEPDLETYNRMIRVLCESGSTSSSYSIVAEMERKWIKPTAASFGLMIDGFYKEEKFDEV 240

Query: 241 EKVINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHG 300
            KV+ +M ++GV  GV+TYN  I  LCK KKS+EAKAL DG++S  M+PNSVTY  LIHG
Sbjct: 241 RKVMRMMDEFGVHVGVATYNIMIQCLCKRKKSAEAKALIDGVMSCRMRPNSVTYSLLIHG 300

Query: 301 FCREGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPN 360
           FC E NLDEA NLF+ M+ +G +PDS+CYFTL++  C+GGD++TA  +C ESMEK WVP+
Sbjct: 301 FCSEENLDEAMNLFEVMVCNGYKPDSECYFTLIHCLCKGGDFETALILCRESMEKNWVPS 360

Query: 361 ITTMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLP 405
            + MK LV GL S SKVDEAK+LI  +KE+F++NV+ WNE+EA LP
Sbjct: 361 FSVMKWLVNGLASRSKVDEAKELIAVVKEKFTRNVDLWNEVEAALP 402

BLAST of Cp4.1LG20g03740 vs. Swiss-Prot
Match: PP352_ARATH (Pentatricopeptide repeat-containing protein At4g36680, mitochondrial OS=Arabidopsis thaliana GN=At4g36680 PE=2 SV=1)

HSP 1 Score: 203.8 bits (517), Expect = 3.7e-51
Identity = 126/392 (32.14%), Postives = 208/392 (53.06%), Query Frame = 1

Query: 15  SSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDICRAATLTPES 74
           SS I+ RL  R  S      +T P S K     A S L+ E +P++ + I    +    S
Sbjct: 3   SSRISLRLVRRFASAAADGTTTAPSSGKISVSKAKSTLRKEHDPDKALKIYANVSDHSAS 62

Query: 75  HLD-RIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYGQANMLDHAI 134
            +  R A  + + +L++ + F  I   +E  K+ P +K E F    I  YGQA+M +HA+
Sbjct: 63  PVSSRYAQELTVRRLAKCRRFSDIETLIESHKNDPKIKEEPFYSTLIRSYGQASMFNHAM 122

Query: 135 RTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYG-IEPDLDTYNRVIK 194
           RTF+Q+D+ G   S  S NALL AC  +K++ ++ ++F E P+ Y  I PD  +Y  +IK
Sbjct: 123 RTFEQMDQYGTPRSAVSFNALLNACLHSKNFDKVPQLFDEIPQRYNKIIPDKISYGILIK 182

Query: 195 AFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINLMGKYGVRR 254
           ++  SG+      I+ +M  KG++     F   L+  YK+ + E  + + N M K G   
Sbjct: 183 SYCDSGTPEKAIEIMRQMQGKGMEVTTIAFTTILSSLYKKGELEVADNLWNEMVKKGCEL 242

Query: 255 GVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGNLDEAKNLF 314
             + YN RI+S  K +     K L + M S G+KP++++Y  L+  +C  G LDEAK ++
Sbjct: 243 DNAAYNVRIMSAQK-ESPERVKELIEEMSSMGLKPDTISYNYLMTAYCERGMLDEAKKVY 302

Query: 315 KRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKSLVYGLVSI 374
           + +  + C P++  + TL++  C    Y+  + I  +S+    +P+  T+K LV GLV  
Sbjct: 303 EGLEGNNCAPNAATFRTLIFHLCYSRLYEQGYAIFKKSVYMHKIPDFNTLKHLVVGLVEN 362

Query: 375 SKVDEAKQLIGQIKERFSKN-VEKWNEIEAGL 404
            K D+AK LI  +K++F  + +  W ++E  L
Sbjct: 363 KKRDDAKGLIRTVKKKFPPSFLNAWKKLEEEL 393

BLAST of Cp4.1LG20g03740 vs. Swiss-Prot
Match: PP226_ARATH (Pentatricopeptide repeat-containing protein At3g13160, mitochondrial OS=Arabidopsis thaliana GN=At3g13160 PE=2 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 9.4e-47
Identity = 111/378 (29.37%), Postives = 196/378 (51.85%), Query Frame = 1

Query: 5   YRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDI 64
           + LR  F S S++ N R      +   +P    P        + ++L+  E +P+ I + 
Sbjct: 7   FLLRGNF-SFSTHTNRRFFSAVTAAAATPSPPKP--------SLITLVNDERDPKFITEK 66

Query: 65  CRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYG 124
            + A        +   +   + +L+ +K F+ +   LEE    P++  E F    I LYG
Sbjct: 67  FKKACQAEWFRKNIAVYERTVRRLAAAKKFEWVEEILEEQNKYPNMSKEGFVARIINLYG 126

Query: 125 QANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGIEPDL 184
           +  M ++A + F ++ E   + +  S NALL AC  +K +  ++ +F E P    IEPD+
Sbjct: 127 RVGMFENAQKVFDEMPERNCKRTALSFNALLNACVNSKKFDLVEGIFKELPGKLSIEPDV 186

Query: 185 DTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINL 244
            +YN +IK     GS +   +++ E++ KG+KP+  TF   L   Y + K+E+ E++   
Sbjct: 187 ASYNTLIKGLCGKGSFTEAVALIDEIENKGLKPDHITFNILLHESYTKGKFEEGEQIWAR 246

Query: 245 MGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGN 304
           M +  V+R + +YNAR+L L    KS E  +LFD +    +KP+  T+  +I GF  EG 
Sbjct: 247 MVEKNVKRDIRSYNARLLGLAMENKSEEMVSLFDKLKGNELKPDVFTFTAMIKGFVSEGK 306

Query: 305 LDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKS 364
           LDEA   +K +  +GC+P    + +L+   C+ GD ++A+++C E   K  + +   ++ 
Sbjct: 307 LDEAITWYKEIEKNGCRPLKFVFNSLLPAICKAGDLESAYELCKEIFAKRLLVDEAVLQE 366

Query: 365 LVYGLVSISKVDEAKQLI 383
           +V  LV  SK DEA++++
Sbjct: 367 VVDALVKGSKQDEAEEIV 375

BLAST of Cp4.1LG20g03740 vs. Swiss-Prot
Match: PPR82_ARATH (Pentatricopeptide repeat-containing protein At1g55890, mitochondrial OS=Arabidopsis thaliana GN=At1g55890 PE=2 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 1.6e-46
Identity = 105/355 (29.58%), Postives = 190/355 (53.52%), Query Frame = 1

Query: 28  STILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDICRAATLTPESHLDRIAFSVAISK 87
           +T++S  +    +     ++  SL+  E NP+RI++  + A  +     +   +   + +
Sbjct: 24  ATVVSEPTAVTAAISPPQKSLTSLVNGERNPKRIVEKFKKACESERFRTNIAVYDRTVRR 83

Query: 88  LSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYGQANMLDHAIRTFKQIDELGVRPS 147
           L  +K    +   LEE K   D+  E FA   I LYG+A M ++A + F+++     + S
Sbjct: 84  LVAAKRLHYVEEILEEQKKYRDMSKEGFAARIISLYGKAGMFENAQKVFEEMPNRDCKRS 143

Query: 148 VKSLNALLFACNVAKDYKELKRVFMEFPKIYGIEPDLDTYNRVIKAFSKSGSASAVYSIV 207
           V S NALL A  ++K +  ++ +F E P    I+PD+ +YN +IKA  +  S     +++
Sbjct: 144 VLSFNALLSAYRLSKKFDVVEELFNELPGKLSIKPDIVSYNTLIKALCEKDSLPEAVALL 203

Query: 208 AEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINLMGKYGVRRGVSTYNARILSLCKL 267
            E++ KG+KP+  TF   L   Y + ++E  E++   M +  V   + TYNAR+L L   
Sbjct: 204 DEIENKGLKPDIVTFNTLLLSSYLKGQFELGEEIWAKMVEKNVAIDIRTYNARLLGLANE 263

Query: 268 KKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGNLDEAKNLFKRMINSGCQPDSDCY 327
            KS E   LF  + + G+KP+  ++  +I G   EG +DEA+  +K ++  G +PD   +
Sbjct: 264 AKSKELVNLFGELKASGLKPDVFSFNAMIRGSINEGKMDEAEAWYKEIVKHGYRPDKATF 323

Query: 328 FTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKSLVYGLVSISKVDEAKQLI 383
             L+   C+ GD+++A ++  E+  K ++   TT++ LV  LV  SK +EA++++
Sbjct: 324 ALLLPAMCKAGDFESAIELFKETFSKRYLVGQTTLQQLVDELVKGSKREEAEEIV 378

BLAST of Cp4.1LG20g03740 vs. TrEMBL
Match: A0A0A0LRL9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G063590 PE=4 SV=1)

HSP 1 Score: 731.9 bits (1888), Expect = 4.3e-208
Identity = 359/405 (88.64%), Postives = 388/405 (95.80%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPER 60
           MALLYRLR+AFPSNS+YIN+RLHYR LSTILSPDS+NPLSAKQKSRAALSLLK EENPER
Sbjct: 1   MALLYRLRSAFPSNSTYINYRLHYRSLSTILSPDSSNPLSAKQKSRAALSLLKTEENPER 60

Query: 61  IIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAI 120
           IIDICRAA+LTPE HLDRIAFSVAISKLS+ KHFDGIR FLEELKSRPDLKNERFACHAI
Sbjct: 61  IIDICRAASLTPEFHLDRIAFSVAISKLSKFKHFDGIRRFLEELKSRPDLKNERFACHAI 120

Query: 121 ILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGI 180
           +LYGQANMLDHAIRTFKQIDELGVR SVK+LNALLFACN+AKDYKELKRV+MEFPKIYGI
Sbjct: 121 VLYGQANMLDHAIRTFKQIDELGVRHSVKTLNALLFACNLAKDYKELKRVYMEFPKIYGI 180

Query: 181 EPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEK 240
           EPD+DTYNRVIKAFS+SGS+S+V SIVAEMDRK VKPNATTFANWLAGCY EEK+EDVEK
Sbjct: 181 EPDIDTYNRVIKAFSESGSSSSVSSIVAEMDRKDVKPNATTFANWLAGCYMEEKFEDVEK 240

Query: 241 VINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFC 300
           V+NLM KYGVRRGV+TYNARI SLCKLK+S+EAKALFDGMLSRGM PNSVTYC+LIHGFC
Sbjct: 241 VLNLMEKYGVRRGVATYNARIRSLCKLKRSTEAKALFDGMLSRGMDPNSVTYCELIHGFC 300

Query: 301 REGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNIT 360
           +EGNLDEAK++FKRMINSGCQPDS+CYFTL YF CRGGDY+TAFKIC ESM+KGWVPN +
Sbjct: 301 KEGNLDEAKSIFKRMINSGCQPDSECYFTLTYFLCRGGDYETAFKICLESMKKGWVPNFS 360

Query: 361 TMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           TMKSLV GLVSISKV+EAKQLIGQIKERFSKNVEKW+EIEAGLPQ
Sbjct: 361 TMKSLVDGLVSISKVEEAKQLIGQIKERFSKNVEKWSEIEAGLPQ 405

BLAST of Cp4.1LG20g03740 vs. TrEMBL
Match: A0A059CAY2_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E04079 PE=4 SV=1)

HSP 1 Score: 565.1 bits (1455), Expect = 7.0e-158
Identity = 272/378 (71.96%), Postives = 320/378 (84.66%), Query Frame = 1

Query: 28  STILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDICRAATLTPESHLDRIAFSVAISK 87
           S+ILSPDS+ PLS+K+K+RAALSLLK E+NPERIIDICRAA+LTP+SHLDR+AFSVAISK
Sbjct: 29  SSILSPDSSAPLSSKEKTRAALSLLKAEKNPERIIDICRAASLTPQSHLDRVAFSVAISK 88

Query: 88  LSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYGQANMLDHAIRTFKQIDELGVRPS 147
           L+ES +FDGIR FLEE K RPDL+NERF CHAI+LYGQA ML+ AI TFK ++ELG+R +
Sbjct: 89  LTESSYFDGIRRFLEESKGRPDLRNERFMCHAIVLYGQAGMLNEAIDTFKHVEELGIRRT 148

Query: 148 VKSLNALLFACNVAKDYKELKRVFMEFPKIYGIEPDLDTYNRVIKAFSKSGSASAVYSIV 207
           VKSLNALLFA  VAKD+KE KR+FMEFP+IY + PDL+T+N VIKAFS+S S S+ YS +
Sbjct: 149 VKSLNALLFASIVAKDFKETKRIFMEFPRIYSVAPDLETFNTVIKAFSESESTSSAYSAL 208

Query: 208 AEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINLMGKYGVRRGVSTYNARILSLCKL 267
           AEMDRKGVKPNATTF   LAG YKEEKYEDV KV+ LM KYGV RGVS YN RI SLCKL
Sbjct: 209 AEMDRKGVKPNATTFGTMLAGFYKEEKYEDVGKVLELMQKYGVARGVSIYNIRIHSLCKL 268

Query: 268 KKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGNLDEAKNLFKRMINSGCQPDSDCY 327
           +KS EAK L DGML+RGMKPNS TY  LIHGFC E   DEAK +FK M+N GC+P SDCY
Sbjct: 269 RKSDEAKVLLDGMLARGMKPNSETYAHLIHGFCSEERYDEAKKMFKSMMNHGCRPTSDCY 328

Query: 328 FTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKSLVYGLVSISKVDEAKQLIGQIKE 387
           FT +++ C+GG++DTA +IC ESMEKGW+PN  TMKSLV GLVSI KVDEA++LI Q+KE
Sbjct: 329 FTFIHYLCKGGEFDTALQICKESMEKGWIPNFGTMKSLVNGLVSIDKVDEARELIKQVKE 388

Query: 388 RFSKNVEKWNEIEAGLPQ 406
           RFS+N + W+E+EAGLPQ
Sbjct: 389 RFSRNTDLWDEVEAGLPQ 406

BLAST of Cp4.1LG20g03740 vs. TrEMBL
Match: F6HRE2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0116g00030 PE=4 SV=1)

HSP 1 Score: 565.1 bits (1455), Expect = 7.0e-158
Identity = 280/405 (69.14%), Postives = 341/405 (84.20%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPER 60
           MA L RLR   P +S    HR  +   S+ILSPDS  PLS+K+KSRAALSLLK E++P+R
Sbjct: 1   MAFLSRLR---PISS----HRCRF--FSSILSPDSATPLSSKEKSRAALSLLKSEQDPQR 60

Query: 61  IIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAI 120
           I++ICRAA LTPESHLDR+AFSVAISKL++SKHFD IR FL+ELK+RPDL+ ERF  HAI
Sbjct: 61  ILEICRAAALTPESHLDRVAFSVAISKLADSKHFDSIRHFLDELKARPDLRTERFVSHAI 120

Query: 121 ILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGI 180
           +L+GQA ML+ A+RTF+Q+ +LGV  +V+SLNALLF+C +AK+YKE  R+F+EFPK YGI
Sbjct: 121 VLFGQAGMLNDAVRTFEQMHQLGVDRTVRSLNALLFSCILAKNYKEANRIFLEFPKTYGI 180

Query: 181 EPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEK 240
           E +LD+YN V+KAFS+SGS+S+ YSI+AEM RKGVKPNAT+F   LAG Y EEKYEDV K
Sbjct: 181 ELNLDSYNTVLKAFSESGSSSSGYSILAEMGRKGVKPNATSFGILLAGFYNEEKYEDVGK 240

Query: 241 VINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFC 300
           V+ +M +Y ++ G+STYN RI SLCKLKKSSEAKAL DG+L+R MKPNS TYC LIHGFC
Sbjct: 241 VLKMMEEYKMQPGISTYNIRIQSLCKLKKSSEAKALLDGILARRMKPNSETYCHLIHGFC 300

Query: 301 REGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNIT 360
           +EGNLDEAK LFK M+N GC+PDSDCYFTLVYF C+GGD+++A + C E MEKGW PNI+
Sbjct: 301 KEGNLDEAKKLFKDMVNRGCKPDSDCYFTLVYFLCQGGDFESALRFCKECMEKGWFPNIS 360

Query: 361 TMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           TM SLV GLVSISKV+EA++LIGQIKE+FS+NV+KWNEIEAGLPQ
Sbjct: 361 TMTSLVNGLVSISKVEEARELIGQIKEKFSRNVDKWNEIEAGLPQ 396

BLAST of Cp4.1LG20g03740 vs. TrEMBL
Match: A5AGX1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027645 PE=4 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 9.1e-158
Identity = 280/405 (69.14%), Postives = 341/405 (84.20%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPER 60
           MA L RLR   P +S    HR  +   S+ILSPDS  PLS+K+KSRAALSLLK E++P+R
Sbjct: 1   MAFLSRLR---PISS----HRCRF--FSSILSPDSATPLSSKEKSRAALSLLKSEQDPQR 60

Query: 61  IIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAI 120
           I++ICRAA LTPESHLDR+AFSVAISKL++SKHFD IR FL+ELK+RPDL+ ERF  HAI
Sbjct: 61  ILEICRAAALTPESHLDRVAFSVAISKLADSKHFDSIRHFLDELKARPDLRTERFVSHAI 120

Query: 121 ILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGI 180
           +L+GQA ML+ A+RTF+Q+ +LGV  +V+SLNALLF+C +AK+YKE  R+F+EFPK YGI
Sbjct: 121 VLFGQAGMLNDAVRTFEQMHQLGVDRTVRSLNALLFSCILAKNYKEANRIFLEFPKTYGI 180

Query: 181 EPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEK 240
           E +LD+YN V+KAFS+SGS+S+ YSI+AEM RKGVKPNAT+F   LAG Y EEKYEDV K
Sbjct: 181 ELNLDSYNTVLKAFSESGSSSSGYSILAEMGRKGVKPNATSFGILLAGFYNEEKYEDVGK 240

Query: 241 VINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFC 300
           V+ +M +Y ++ G+STYN RI SLCKLKKSSEAKAL DG+L+R MKPNS TYC LIHGFC
Sbjct: 241 VLKMMEEYKMQPGISTYNIRIQSLCKLKKSSEAKALLDGILARRMKPNSETYCHLIHGFC 300

Query: 301 REGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNIT 360
           +EGNLDEAK LFK M+N GC+PDSDCYFTLVYF C+GGD+++A + C E MEKGW PNI+
Sbjct: 301 KEGNLDEAKKLFKDMVNRGCKPDSDCYFTLVYFLCQGGDFESALRFCKECMEKGWFPNIS 360

Query: 361 TMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           TM SLV GLVSISKV+EA++LIGQIKE+FS+NV+KWNEIEAGLPQ
Sbjct: 361 TMTSLVNGLVSISKVEEAQELIGQIKEKFSRNVDKWNEIEAGLPQ 396

BLAST of Cp4.1LG20g03740 vs. TrEMBL
Match: W9RQY2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012870 PE=4 SV=1)

HSP 1 Score: 530.8 bits (1366), Expect = 1.5e-147
Identity = 259/405 (63.95%), Postives = 327/405 (80.74%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPER 60
           M+L+ RLR A  S+  +          ST+LSPDS  PLSAK+K+RAAL+L+K E+NP R
Sbjct: 1   MSLISRLRQASLSHCRF----------STLLSPDS-KPLSAKEKTRAALALIKTEKNPSR 60

Query: 61  IIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAI 120
           I+++C+AA+LTPE++LDRI  SVA+SKL++S HFD IR FL++LK+R DLK ERF  H I
Sbjct: 61  IVELCKAASLTPETYLDRITLSVAVSKLADSNHFDAIRQFLDDLKTRADLKTERFVSHVI 120

Query: 121 ILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGI 180
           +LYGQA M+D A+R+FKQ DELGV  SV+ LN+L+FAC +AK+YKE   VF+EFPKIYGI
Sbjct: 121 VLYGQAKMIDCAVRSFKQCDELGVARSVRVLNSLIFACILAKNYKEANHVFVEFPKIYGI 180

Query: 181 EPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEK 240
           EPD+DTYN VI+AF++SGS SA YS++ EMDRKGVKPN+TTF N L G   EEK+EDV K
Sbjct: 181 EPDVDTYNWVIRAFAESGSTSAAYSVLGEMDRKGVKPNSTTFGNMLPGFSSEEKFEDVGK 240

Query: 241 VINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFC 300
           VINLM KYGVR+G+STYN RI SLCK K++SEAKAL D M+SRGMKPNSV++  LI+G+C
Sbjct: 241 VINLMKKYGVRQGLSTYNIRIQSLCKRKRTSEAKALLDSMISRGMKPNSVSFNHLIYGYC 300

Query: 301 REGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNIT 360
           +EG L+EAK LFK M+  GC+P+S+CYFTLVYF C+G D+D A +IC ES+ K WVPN +
Sbjct: 301 KEGKLEEAKKLFKEMVYRGCKPESNCYFTLVYFMCQGKDFDAALEICKESIAKNWVPNFS 360

Query: 361 TMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           TMKSLV GLVS S+V EA++LI Q+KE+F+ NV+ WNEIEAGLPQ
Sbjct: 361 TMKSLVEGLVSASRVTEARELISQVKEKFTVNVDMWNEIEAGLPQ 394

BLAST of Cp4.1LG20g03740 vs. TAIR10
Match: AT1G61870.1 (AT1G61870.1 pentatricopeptide repeat 336)

HSP 1 Score: 511.1 bits (1315), Expect = 6.1e-145
Identity = 250/409 (61.12%), Postives = 324/409 (79.22%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLS---TILSPDSTNPLSAKQKSRAALSLLKVEEN 60
           MALL R+R++  S   ++N     R LS   TILSPDS  PL++K+KS+AALSLLK E++
Sbjct: 1   MALLSRIRSS-TSLFRHLNASPQIRSLSSASTILSPDSKTPLTSKEKSKAALSLLKSEKD 60

Query: 61  PERIIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEE-LKSRPDLKNERFA 120
           P+RI++ICRAA+LTP+  +DRIAFS A+  L+E KHF  +   L+  +++RPDLK+ERFA
Sbjct: 61  PDRILEICRAASLTPDCRIDRIAFSAAVENLAEKKHFSAVSNLLDGFIENRPDLKSERFA 120

Query: 121 CHAIILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPK 180
            HAI+LY QANMLDH++R F+ +++  +  +VKSLNALLFAC VAKDYKE KRV++E PK
Sbjct: 121 AHAIVLYAQANMLDHSLRVFRDLEKFEISRTVKSLNALLFACLVAKDYKEAKRVYIEMPK 180

Query: 181 IYGIEPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYE 240
           +YGIEPDL+TYNR+IK F +SGSAS+ YSIVAEM+RKG+KPN+++F   ++G Y E+K +
Sbjct: 181 MYGIEPDLETYNRMIKVFCESGSASSSYSIVAEMERKGIKPNSSSFGLMISGFYAEDKSD 240

Query: 241 DVEKVINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLI 300
           +V KV+ +M   GV  GVSTYN RI SLCK KKS EAKAL DGMLS GMKPN+VTY  LI
Sbjct: 241 EVGKVLAMMKDRGVNIGVSTYNIRIQSLCKRKKSKEAKALLDGMLSAGMKPNTVTYSHLI 300

Query: 301 HGFCREGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWV 360
           HGFC E + +EAK LFK M+N GC+PDS+CYFTL+Y+ C+GGD++TA  +C ESMEK WV
Sbjct: 301 HGFCNEDDFEEAKKLFKIMVNRGCKPDSECYFTLIYYLCKGGDFETALSLCKESMEKNWV 360

Query: 361 PNITTMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           P+ + MKSLV GL   SKV+EAK+LIGQ+KE+F++NVE WNE+EA LPQ
Sbjct: 361 PSFSIMKSLVNGLAKDSKVEEAKELIGQVKEKFTRNVELWNEVEAALPQ 408

BLAST of Cp4.1LG20g03740 vs. TAIR10
Match: AT1G11630.1 (AT1G11630.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 441.0 bits (1133), Expect = 7.7e-124
Identity = 220/406 (54.19%), Postives = 301/406 (74.14%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQK-SRAALSLLKVEENPE 60
           MA L+R+R    ++   +     +R  S+  S  +   L++KQK SR  LSLLK E NP+
Sbjct: 1   MAFLFRIR----TSEFILQKATQFRLKSSSSSIFTLKSLTSKQKKSRDTLSLLKSENNPD 60

Query: 61  RIIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEE-LKSRPDLKNERFACH 120
           RI++ICR+ +L+P+ H+DRI FSVA+  L+  KHF  +   L+  ++++PD K+E FA  
Sbjct: 61  RILEICRSTSLSPDYHVDRIIFSVAVVTLAREKHFVAVSQLLDGFIQNQPDPKSESFAVR 120

Query: 121 AIILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIY 180
           AIILYG+ANMLD +I+TF+ +++  +  +VKSLNALLFAC +AKDYKE  RV++E PK+Y
Sbjct: 121 AIILYGRANMLDRSIQTFRNLEQYEIPRTVKSLNALLFACLMAKDYKEANRVYLEMPKMY 180

Query: 181 GIEPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDV 240
           GIEPDL+TYNR+I+   +SGS S+ YSIVAEM+RK +KP A +F   + G YKEEK+++V
Sbjct: 181 GIEPDLETYNRMIRVLCESGSTSSSYSIVAEMERKWIKPTAASFGLMIDGFYKEEKFDEV 240

Query: 241 EKVINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHG 300
            KV+ +M ++GV  GV+TYN  I  LCK KKS+EAKAL DG++S  M+PNSVTY  LIHG
Sbjct: 241 RKVMRMMDEFGVHVGVATYNIMIQCLCKRKKSAEAKALIDGVMSCRMRPNSVTYSLLIHG 300

Query: 301 FCREGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPN 360
           FC E NLDEA NLF+ M+ +G +PDS+CYFTL++  C+GGD++TA  +C ESMEK WVP+
Sbjct: 301 FCSEENLDEAMNLFEVMVCNGYKPDSECYFTLIHCLCKGGDFETALILCRESMEKNWVPS 360

Query: 361 ITTMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLP 405
            + MK LV GL S SKVDEAK+LI  +KE+F++NV+ WNE+EA LP
Sbjct: 361 FSVMKWLVNGLASRSKVDEAKELIAVVKEKFTRNVDLWNEVEAALP 402

BLAST of Cp4.1LG20g03740 vs. TAIR10
Match: AT4G36680.1 (AT4G36680.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 203.8 bits (517), Expect = 2.1e-52
Identity = 126/392 (32.14%), Postives = 208/392 (53.06%), Query Frame = 1

Query: 15  SSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDICRAATLTPES 74
           SS I+ RL  R  S      +T P S K     A S L+ E +P++ + I    +    S
Sbjct: 3   SSRISLRLVRRFASAAADGTTTAPSSGKISVSKAKSTLRKEHDPDKALKIYANVSDHSAS 62

Query: 75  HLD-RIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYGQANMLDHAI 134
            +  R A  + + +L++ + F  I   +E  K+ P +K E F    I  YGQA+M +HA+
Sbjct: 63  PVSSRYAQELTVRRLAKCRRFSDIETLIESHKNDPKIKEEPFYSTLIRSYGQASMFNHAM 122

Query: 135 RTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYG-IEPDLDTYNRVIK 194
           RTF+Q+D+ G   S  S NALL AC  +K++ ++ ++F E P+ Y  I PD  +Y  +IK
Sbjct: 123 RTFEQMDQYGTPRSAVSFNALLNACLHSKNFDKVPQLFDEIPQRYNKIIPDKISYGILIK 182

Query: 195 AFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINLMGKYGVRR 254
           ++  SG+      I+ +M  KG++     F   L+  YK+ + E  + + N M K G   
Sbjct: 183 SYCDSGTPEKAIEIMRQMQGKGMEVTTIAFTTILSSLYKKGELEVADNLWNEMVKKGCEL 242

Query: 255 GVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGNLDEAKNLF 314
             + YN RI+S  K +     K L + M S G+KP++++Y  L+  +C  G LDEAK ++
Sbjct: 243 DNAAYNVRIMSAQK-ESPERVKELIEEMSSMGLKPDTISYNYLMTAYCERGMLDEAKKVY 302

Query: 315 KRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKSLVYGLVSI 374
           + +  + C P++  + TL++  C    Y+  + I  +S+    +P+  T+K LV GLV  
Sbjct: 303 EGLEGNNCAPNAATFRTLIFHLCYSRLYEQGYAIFKKSVYMHKIPDFNTLKHLVVGLVEN 362

Query: 375 SKVDEAKQLIGQIKERFSKN-VEKWNEIEAGL 404
            K D+AK LI  +K++F  + +  W ++E  L
Sbjct: 363 KKRDDAKGLIRTVKKKFPPSFLNAWKKLEEEL 393

BLAST of Cp4.1LG20g03740 vs. TAIR10
Match: AT3G13160.1 (AT3G13160.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 189.1 bits (479), Expect = 5.3e-48
Identity = 111/378 (29.37%), Postives = 196/378 (51.85%), Query Frame = 1

Query: 5   YRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDI 64
           + LR  F S S++ N R      +   +P    P        + ++L+  E +P+ I + 
Sbjct: 7   FLLRGNF-SFSTHTNRRFFSAVTAAAATPSPPKP--------SLITLVNDERDPKFITEK 66

Query: 65  CRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYG 124
            + A        +   +   + +L+ +K F+ +   LEE    P++  E F    I LYG
Sbjct: 67  FKKACQAEWFRKNIAVYERTVRRLAAAKKFEWVEEILEEQNKYPNMSKEGFVARIINLYG 126

Query: 125 QANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGIEPDL 184
           +  M ++A + F ++ E   + +  S NALL AC  +K +  ++ +F E P    IEPD+
Sbjct: 127 RVGMFENAQKVFDEMPERNCKRTALSFNALLNACVNSKKFDLVEGIFKELPGKLSIEPDV 186

Query: 185 DTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINL 244
            +YN +IK     GS +   +++ E++ KG+KP+  TF   L   Y + K+E+ E++   
Sbjct: 187 ASYNTLIKGLCGKGSFTEAVALIDEIENKGLKPDHITFNILLHESYTKGKFEEGEQIWAR 246

Query: 245 MGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGN 304
           M +  V+R + +YNAR+L L    KS E  +LFD +    +KP+  T+  +I GF  EG 
Sbjct: 247 MVEKNVKRDIRSYNARLLGLAMENKSEEMVSLFDKLKGNELKPDVFTFTAMIKGFVSEGK 306

Query: 305 LDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKS 364
           LDEA   +K +  +GC+P    + +L+   C+ GD ++A+++C E   K  + +   ++ 
Sbjct: 307 LDEAITWYKEIEKNGCRPLKFVFNSLLPAICKAGDLESAYELCKEIFAKRLLVDEAVLQE 366

Query: 365 LVYGLVSISKVDEAKQLI 383
           +V  LV  SK DEA++++
Sbjct: 367 VVDALVKGSKQDEAEEIV 375

BLAST of Cp4.1LG20g03740 vs. TAIR10
Match: AT1G55890.1 (AT1G55890.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 188.3 bits (477), Expect = 9.0e-48
Identity = 105/355 (29.58%), Postives = 190/355 (53.52%), Query Frame = 1

Query: 28  STILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDICRAATLTPESHLDRIAFSVAISK 87
           +T++S  +    +     ++  SL+  E NP+RI++  + A  +     +   +   + +
Sbjct: 24  ATVVSEPTAVTAAISPPQKSLTSLVNGERNPKRIVEKFKKACESERFRTNIAVYDRTVRR 83

Query: 88  LSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYGQANMLDHAIRTFKQIDELGVRPS 147
           L  +K    +   LEE K   D+  E FA   I LYG+A M ++A + F+++     + S
Sbjct: 84  LVAAKRLHYVEEILEEQKKYRDMSKEGFAARIISLYGKAGMFENAQKVFEEMPNRDCKRS 143

Query: 148 VKSLNALLFACNVAKDYKELKRVFMEFPKIYGIEPDLDTYNRVIKAFSKSGSASAVYSIV 207
           V S NALL A  ++K +  ++ +F E P    I+PD+ +YN +IKA  +  S     +++
Sbjct: 144 VLSFNALLSAYRLSKKFDVVEELFNELPGKLSIKPDIVSYNTLIKALCEKDSLPEAVALL 203

Query: 208 AEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINLMGKYGVRRGVSTYNARILSLCKL 267
            E++ KG+KP+  TF   L   Y + ++E  E++   M +  V   + TYNAR+L L   
Sbjct: 204 DEIENKGLKPDIVTFNTLLLSSYLKGQFELGEEIWAKMVEKNVAIDIRTYNARLLGLANE 263

Query: 268 KKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGNLDEAKNLFKRMINSGCQPDSDCY 327
            KS E   LF  + + G+KP+  ++  +I G   EG +DEA+  +K ++  G +PD   +
Sbjct: 264 AKSKELVNLFGELKASGLKPDVFSFNAMIRGSINEGKMDEAEAWYKEIVKHGYRPDKATF 323

Query: 328 FTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKSLVYGLVSISKVDEAKQLI 383
             L+   C+ GD+++A ++  E+  K ++   TT++ LV  LV  SK +EA++++
Sbjct: 324 ALLLPAMCKAGDFESAIELFKETFSKRYLVGQTTLQQLVDELVKGSKREEAEEIV 378

BLAST of Cp4.1LG20g03740 vs. NCBI nr
Match: gi|778658444|ref|XP_011652746.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g61870, mitochondrial [Cucumis sativus])

HSP 1 Score: 731.9 bits (1888), Expect = 6.2e-208
Identity = 359/405 (88.64%), Postives = 388/405 (95.80%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPER 60
           MALLYRLR+AFPSNS+YIN+RLHYR LSTILSPDS+NPLSAKQKSRAALSLLK EENPER
Sbjct: 1   MALLYRLRSAFPSNSTYINYRLHYRSLSTILSPDSSNPLSAKQKSRAALSLLKTEENPER 60

Query: 61  IIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAI 120
           IIDICRAA+LTPE HLDRIAFSVAISKLS+ KHFDGIR FLEELKSRPDLKNERFACHAI
Sbjct: 61  IIDICRAASLTPEFHLDRIAFSVAISKLSKFKHFDGIRRFLEELKSRPDLKNERFACHAI 120

Query: 121 ILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGI 180
           +LYGQANMLDHAIRTFKQIDELGVR SVK+LNALLFACN+AKDYKELKRV+MEFPKIYGI
Sbjct: 121 VLYGQANMLDHAIRTFKQIDELGVRHSVKTLNALLFACNLAKDYKELKRVYMEFPKIYGI 180

Query: 181 EPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEK 240
           EPD+DTYNRVIKAFS+SGS+S+V SIVAEMDRK VKPNATTFANWLAGCY EEK+EDVEK
Sbjct: 181 EPDIDTYNRVIKAFSESGSSSSVSSIVAEMDRKDVKPNATTFANWLAGCYMEEKFEDVEK 240

Query: 241 VINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFC 300
           V+NLM KYGVRRGV+TYNARI SLCKLK+S+EAKALFDGMLSRGM PNSVTYC+LIHGFC
Sbjct: 241 VLNLMEKYGVRRGVATYNARIRSLCKLKRSTEAKALFDGMLSRGMDPNSVTYCELIHGFC 300

Query: 301 REGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNIT 360
           +EGNLDEAK++FKRMINSGCQPDS+CYFTL YF CRGGDY+TAFKIC ESM+KGWVPN +
Sbjct: 301 KEGNLDEAKSIFKRMINSGCQPDSECYFTLTYFLCRGGDYETAFKICLESMKKGWVPNFS 360

Query: 361 TMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           TMKSLV GLVSISKV+EAKQLIGQIKERFSKNVEKW+EIEAGLPQ
Sbjct: 361 TMKSLVDGLVSISKVEEAKQLIGQIKERFSKNVEKWSEIEAGLPQ 405

BLAST of Cp4.1LG20g03740 vs. NCBI nr
Match: gi|659067768|ref|XP_008441198.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g61870, mitochondrial [Cucumis melo])

HSP 1 Score: 710.3 bits (1832), Expect = 1.9e-201
Identity = 351/405 (86.67%), Postives = 376/405 (92.84%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPER 60
           MALLYRLR+AF SNS+YIN+ LHYR LSTILSPDS+ PLSAKQKSRAALSLLK EENPER
Sbjct: 1   MALLYRLRSAFSSNSTYINYHLHYRSLSTILSPDSSTPLSAKQKSRAALSLLKTEENPER 60

Query: 61  IIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAI 120
           IIDICRAA LTPE HLDRIAFSVAISKLS+SKH+DGI  FLEELKSRPDLKNERFACH I
Sbjct: 61  IIDICRAAALTPEFHLDRIAFSVAISKLSKSKHYDGIHRFLEELKSRPDLKNERFACHVI 120

Query: 121 ILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGI 180
            LYGQANMLDHAIRTFKQIDELGVR SVK LN+LLFACNVAKDYKELKRVFMEFPKIYGI
Sbjct: 121 ALYGQANMLDHAIRTFKQIDELGVRHSVKLLNSLLFACNVAKDYKELKRVFMEFPKIYGI 180

Query: 181 EPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEK 240
           EPD+DTYNRVIKAFS+SGS+S+V SIVAEMDRK VKPNAT+FANWLAGCY EEKYEDVEK
Sbjct: 181 EPDIDTYNRVIKAFSESGSSSSVSSIVAEMDRKNVKPNATSFANWLAGCYMEEKYEDVEK 240

Query: 241 VINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFC 300
           V+ LM KYGVRRGV+TYNARI SLCKLKKS+EAKALFDGMLSRG KPN VTYC+LIHGF 
Sbjct: 241 VLKLMEKYGVRRGVATYNARIQSLCKLKKSAEAKALFDGMLSRGTKPNCVTYCELIHGFS 300

Query: 301 REGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNIT 360
           +EGNLDEAK+LFKRMINSGC+PDS+CYFTL+YF CRGGDY+TA KIC ESMEKGWVPN  
Sbjct: 301 KEGNLDEAKSLFKRMINSGCKPDSNCYFTLIYFLCRGGDYETALKICSESMEKGWVPNFG 360

Query: 361 TMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           TMKSLV GLVSISKV+EAKQLIGQIKERFS NVEKW+E+EAGLPQ
Sbjct: 361 TMKSLVDGLVSISKVEEAKQLIGQIKERFSNNVEKWSEMEAGLPQ 405

BLAST of Cp4.1LG20g03740 vs. NCBI nr
Match: gi|702351964|ref|XP_010057943.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g61870, mitochondrial-like [Eucalyptus grandis])

HSP 1 Score: 565.1 bits (1455), Expect = 1.0e-157
Identity = 272/378 (71.96%), Postives = 320/378 (84.66%), Query Frame = 1

Query: 28  STILSPDSTNPLSAKQKSRAALSLLKVEENPERIIDICRAATLTPESHLDRIAFSVAISK 87
           S+ILSPDS+ PLS+K+K+RAALSLLK E+NPERIIDICRAA+LTP+SHLDR+AFSVAISK
Sbjct: 29  SSILSPDSSAPLSSKEKTRAALSLLKAEKNPERIIDICRAASLTPQSHLDRVAFSVAISK 88

Query: 88  LSESKHFDGIRLFLEELKSRPDLKNERFACHAIILYGQANMLDHAIRTFKQIDELGVRPS 147
           L+ES +FDGIR FLEE K RPDL+NERF CHAI+LYGQA ML+ AI TFK ++ELG+R +
Sbjct: 89  LTESSYFDGIRRFLEESKGRPDLRNERFMCHAIVLYGQAGMLNEAIDTFKHVEELGIRRT 148

Query: 148 VKSLNALLFACNVAKDYKELKRVFMEFPKIYGIEPDLDTYNRVIKAFSKSGSASAVYSIV 207
           VKSLNALLFA  VAKD+KE KR+FMEFP+IY + PDL+T+N VIKAFS+S S S+ YS +
Sbjct: 149 VKSLNALLFASIVAKDFKETKRIFMEFPRIYSVAPDLETFNTVIKAFSESESTSSAYSAL 208

Query: 208 AEMDRKGVKPNATTFANWLAGCYKEEKYEDVEKVINLMGKYGVRRGVSTYNARILSLCKL 267
           AEMDRKGVKPNATTF   LAG YKEEKYEDV KV+ LM KYGV RGVS YN RI SLCKL
Sbjct: 209 AEMDRKGVKPNATTFGTMLAGFYKEEKYEDVGKVLELMQKYGVARGVSIYNIRIHSLCKL 268

Query: 268 KKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFCREGNLDEAKNLFKRMINSGCQPDSDCY 327
           +KS EAK L DGML+RGMKPNS TY  LIHGFC E   DEAK +FK M+N GC+P SDCY
Sbjct: 269 RKSDEAKVLLDGMLARGMKPNSETYAHLIHGFCSEERYDEAKKMFKSMMNHGCRPTSDCY 328

Query: 328 FTLVYFHCRGGDYDTAFKICGESMEKGWVPNITTMKSLVYGLVSISKVDEAKQLIGQIKE 387
           FT +++ C+GG++DTA +IC ESMEKGW+PN  TMKSLV GLVSI KVDEA++LI Q+KE
Sbjct: 329 FTFIHYLCKGGEFDTALQICKESMEKGWIPNFGTMKSLVNGLVSIDKVDEARELIKQVKE 388

Query: 388 RFSKNVEKWNEIEAGLPQ 406
           RFS+N + W+E+EAGLPQ
Sbjct: 389 RFSRNTDLWDEVEAGLPQ 406

BLAST of Cp4.1LG20g03740 vs. NCBI nr
Match: gi|731435181|ref|XP_003634851.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g61870, mitochondrial-like [Vitis vinifera])

HSP 1 Score: 565.1 bits (1455), Expect = 1.0e-157
Identity = 280/405 (69.14%), Postives = 341/405 (84.20%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPER 60
           MA L RLR   P +S    HR  +   S+ILSPDS  PLS+K+KSRAALSLLK E++P+R
Sbjct: 27  MAFLSRLR---PISS----HRCRF--FSSILSPDSATPLSSKEKSRAALSLLKSEQDPQR 86

Query: 61  IIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAI 120
           I++ICRAA LTPESHLDR+AFSVAISKL++SKHFD IR FL+ELK+RPDL+ ERF  HAI
Sbjct: 87  ILEICRAAALTPESHLDRVAFSVAISKLADSKHFDSIRHFLDELKARPDLRTERFVSHAI 146

Query: 121 ILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGI 180
           +L+GQA ML+ A+RTF+Q+ +LGV  +V+SLNALLF+C +AK+YKE  R+F+EFPK YGI
Sbjct: 147 VLFGQAGMLNDAVRTFEQMHQLGVDRTVRSLNALLFSCILAKNYKEANRIFLEFPKTYGI 206

Query: 181 EPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEK 240
           E +LD+YN V+KAFS+SGS+S+ YSI+AEM RKGVKPNAT+F   LAG Y EEKYEDV K
Sbjct: 207 ELNLDSYNTVLKAFSESGSSSSGYSILAEMGRKGVKPNATSFGILLAGFYNEEKYEDVGK 266

Query: 241 VINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFC 300
           V+ +M +Y ++ G+STYN RI SLCKLKKSSEAKAL DG+L+R MKPNS TYC LIHGFC
Sbjct: 267 VLKMMEEYKMQPGISTYNIRIQSLCKLKKSSEAKALLDGILARRMKPNSETYCHLIHGFC 326

Query: 301 REGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNIT 360
           +EGNLDEAK LFK M+N GC+PDSDCYFTLVYF C+GGD+++A + C E MEKGW PNI+
Sbjct: 327 KEGNLDEAKKLFKDMVNRGCKPDSDCYFTLVYFLCQGGDFESALRFCKECMEKGWFPNIS 386

Query: 361 TMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           TM SLV GLVSISKV+EA++LIGQIKE+FS+NV+KWNEIEAGLPQ
Sbjct: 387 TMTSLVNGLVSISKVEEARELIGQIKEKFSRNVDKWNEIEAGLPQ 422

BLAST of Cp4.1LG20g03740 vs. NCBI nr
Match: gi|147767812|emb|CAN77919.1| (hypothetical protein VITISV_027645 [Vitis vinifera])

HSP 1 Score: 564.7 bits (1454), Expect = 1.3e-157
Identity = 280/405 (69.14%), Postives = 341/405 (84.20%), Query Frame = 1

Query: 1   MALLYRLRNAFPSNSSYINHRLHYRCLSTILSPDSTNPLSAKQKSRAALSLLKVEENPER 60
           MA L RLR   P +S    HR  +   S+ILSPDS  PLS+K+KSRAALSLLK E++P+R
Sbjct: 1   MAFLSRLR---PISS----HRCRF--FSSILSPDSATPLSSKEKSRAALSLLKSEQDPQR 60

Query: 61  IIDICRAATLTPESHLDRIAFSVAISKLSESKHFDGIRLFLEELKSRPDLKNERFACHAI 120
           I++ICRAA LTPESHLDR+AFSVAISKL++SKHFD IR FL+ELK+RPDL+ ERF  HAI
Sbjct: 61  ILEICRAAALTPESHLDRVAFSVAISKLADSKHFDSIRHFLDELKARPDLRTERFVSHAI 120

Query: 121 ILYGQANMLDHAIRTFKQIDELGVRPSVKSLNALLFACNVAKDYKELKRVFMEFPKIYGI 180
           +L+GQA ML+ A+RTF+Q+ +LGV  +V+SLNALLF+C +AK+YKE  R+F+EFPK YGI
Sbjct: 121 VLFGQAGMLNDAVRTFEQMHQLGVDRTVRSLNALLFSCILAKNYKEANRIFLEFPKTYGI 180

Query: 181 EPDLDTYNRVIKAFSKSGSASAVYSIVAEMDRKGVKPNATTFANWLAGCYKEEKYEDVEK 240
           E +LD+YN V+KAFS+SGS+S+ YSI+AEM RKGVKPNAT+F   LAG Y EEKYEDV K
Sbjct: 181 ELNLDSYNTVLKAFSESGSSSSGYSILAEMGRKGVKPNATSFGILLAGFYNEEKYEDVGK 240

Query: 241 VINLMGKYGVRRGVSTYNARILSLCKLKKSSEAKALFDGMLSRGMKPNSVTYCKLIHGFC 300
           V+ +M +Y ++ G+STYN RI SLCKLKKSSEAKAL DG+L+R MKPNS TYC LIHGFC
Sbjct: 241 VLKMMEEYKMQPGISTYNIRIQSLCKLKKSSEAKALLDGILARRMKPNSETYCHLIHGFC 300

Query: 301 REGNLDEAKNLFKRMINSGCQPDSDCYFTLVYFHCRGGDYDTAFKICGESMEKGWVPNIT 360
           +EGNLDEAK LFK M+N GC+PDSDCYFTLVYF C+GGD+++A + C E MEKGW PNI+
Sbjct: 301 KEGNLDEAKKLFKDMVNRGCKPDSDCYFTLVYFLCQGGDFESALRFCKECMEKGWFPNIS 360

Query: 361 TMKSLVYGLVSISKVDEAKQLIGQIKERFSKNVEKWNEIEAGLPQ 406
           TM SLV GLVSISKV+EA++LIGQIKE+FS+NV+KWNEIEAGLPQ
Sbjct: 361 TMTSLVNGLVSISKVEEAQELIGQIKEKFSRNVDKWNEIEAGLPQ 396

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR87_ARATH1.1e-14361.12Pentatricopeptide repeat-containing protein At1g61870, mitochondrial OS=Arabidop... [more]
PPR33_ARATH1.4e-12254.19Pentatricopeptide repeat-containing protein At1g11630, mitochondrial OS=Arabidop... [more]
PP352_ARATH3.7e-5132.14Pentatricopeptide repeat-containing protein At4g36680, mitochondrial OS=Arabidop... [more]
PP226_ARATH9.4e-4729.37Pentatricopeptide repeat-containing protein At3g13160, mitochondrial OS=Arabidop... [more]
PPR82_ARATH1.6e-4629.58Pentatricopeptide repeat-containing protein At1g55890, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LRL9_CUCSA4.3e-20888.64Uncharacterized protein OS=Cucumis sativus GN=Csa_1G063590 PE=4 SV=1[more]
A0A059CAY2_EUCGR7.0e-15871.96Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_E04079 PE=4 SV=1[more]
F6HRE2_VITVI7.0e-15869.14Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0116g00030 PE=4 SV=... [more]
A5AGX1_VITVI9.1e-15869.14Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027645 PE=4 SV=1[more]
W9RQY2_9ROSA1.5e-14763.95Uncharacterized protein OS=Morus notabilis GN=L484_012870 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G61870.16.1e-14561.12 pentatricopeptide repeat 336[more]
AT1G11630.17.7e-12454.19 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G36680.12.1e-5232.14 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G13160.15.3e-4829.37 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G55890.19.0e-4829.58 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778658444|ref|XP_011652746.1|6.2e-20888.64PREDICTED: pentatricopeptide repeat-containing protein At1g61870, mitochondrial ... [more]
gi|659067768|ref|XP_008441198.1|1.9e-20186.67PREDICTED: pentatricopeptide repeat-containing protein At1g61870, mitochondrial ... [more]
gi|702351964|ref|XP_010057943.1|1.0e-15771.96PREDICTED: pentatricopeptide repeat-containing protein At1g61870, mitochondrial-... [more]
gi|731435181|ref|XP_003634851.2|1.0e-15769.14PREDICTED: pentatricopeptide repeat-containing protein At1g61870, mitochondrial-... [more]
gi|147767812|emb|CAN77919.1|1.3e-15769.14hypothetical protein VITISV_027645 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006333 chromatin assembly or disassembly
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g03740.1Cp4.1LG20g03740.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 221..250
score:
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 253..301
score: 1.1
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 136..195
score: 2.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 256..289
score: 3.7E-7coord: 290..324
score: 1.0E-11coord: 329..359
score: 4.5E-4coord: 186..219
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 77..107
score: 5.7coord: 323..357
score: 8.988coord: 183..217
score: 11.192coord: 112..146
score: 7.004coord: 218..252
score: 7.662coord: 253..287
score: 10.863coord: 147..182
score: 6.193coord: 288..322
score: 14.447coord: 358..388
score: 6
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 256..398
score: 4.8E-9coord: 39..164
score: 4.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 72..395
score: 6.9
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 25..399
score: 3.6E
NoneNo IPR availablePANTHERPTHR24015:SF579SUBFAMILY NOT NAMEDcoord: 25..399
score: 3.6E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG20g03740Cp4.1LG09g09080Cucurbita pepo (Zucchini)cpecpeB048