Cp4.1LG10g12370 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g12370
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG10 : 9668418 .. 9669941 (-)
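
The location field can be turned into a feature length with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the page does not state this), and the helper names `parse_location` and `feature_length` are hypothetical, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid : start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def feature_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG10 : 9668418 .. 9669941 (-)")
length = feature_length(start, end)
print(seqid, strand, length)
```

Under that assumption the feature spans 1524 nucleotides on the minus strand of Cp4.1LG10.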
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGTATAGAAATCTCCATGGCTGTAAGCTCAAATCTCCTCTCCCTTCAGCCATTTCCTCCCTCCGCTCTCTCTGTTCTCTTCTTCCCCAAATCGAAGCCTCAAGAGATGCCGATTTGGTTTCCCAAGTCCTCGTCCACCACCACAACCCCTTCCACGCCATGGAATCCTCCCTCCAGCTCCACTCAATCTCCTTCTCCTCCCACCTCCTCGATCAAACCCTCCTCCGCCTCACCCACCACTCCAAAATCGCCCTCTCCCTCTTCCACTACGCCAAATCCCTCCCCTTCAACCCCCTCTCCACTTCCTCCTACAACATACTCATCGACATCCTCGCCAAAGTTCGCCAGTTCGACGCCGCCTGGCATCTCATCCTCCAAATGGATCACAAGGGTACCGAGACTTTTCTCTTGCTGATTCGCCGCTTGATTTCTGCTGGCCTAACTCGCCAGGCCGTTCGAGCTTTTGATGACATTGAGGGGCTTACTGGAAACAAGGTACGAACCGATGAATTTTGCTATTTATTGGATACCCTTTGTAAGTATGGCTATGTGAAAGTTGCGGCGGAGGTTTTTAACAAGAGGAAGGCGGAGTTTCATGTTGATGTCAAGATTTATACGGTTTTGATATATGGGTGGTGTAAGATTGGGCGATTTGAGATGGCGGAGAGGTTTTTGAAGGATATGATCGGGCGTGGGATTGAACCGAATGTGGTTACTTATAATGTTTTGTTGAATGGGGTTTGCAGGAGGGCGAGTTTGCATCCGGAGGGGAGGTTTGAGCAGACAATTAGGACTGCAGAGAAGGTGTTCGATGAAATGCGTCAGAGAGGGATTGATCCTGATGTGACCAGTTTCTCTATTGTGCTTCATGTGTATAGCCGTGCCCACAAGCCGGAGCTATCACTTGATAAGTTGAAGCAGATGAAAGAGCTTGGCATCTCCCCGACCGTCGCTACATATACTTCGGTGATCAAATGCCTCTGTTCTTGTGGGAGACTCGAGGAGGCCGAAAACTTGGTAGGAGAAATGGTGAGAAATGGAATCTCTCCATCCCCTGCAACTTATAATTGTTTCTTCAAAGAGTACAGAGGTAGAAAGGATGGGGCAGGTGCTTTGAGATTGTACAAGAAGATGAGGGAGGATTGTCTATATGCTGCTCCTAGTTTGCATACCTATAACATATTACTGAGCTTGTTTCTAAATCTTGACAAGAAGGAGACACTGAAAGAGGTGTGGAATGATATGAAGGAGAGCGGGATTGGGCCGGATTTGGATTCGTATACGACGATGATCCACGGGTTGTGCGAGAAGCAGAGATGGAGAGAGGCCTGTCAATTTTTTGTTGAGATGATTGAAAGGGGGTTTCTTCCCCAGAAGGTGACCTTTGAGATGCTTTACAGAGGGTTGATACAGTCTGATATGTTGAGAACTTGGAGGAGATTGAAGAAGAAACTGGAAGAAGAGTCTATAACTTATGCCTCTGAGTTCAAAAACTATCACATTAAGCCTTACAGGAGATGA

mRNA sequence

ATGGCGTATAGAAATCTCCATGGCTGTAAGCTCAAATCTCCTCTCCCTTCAGCCATTTCCTCCCTCCGCTCTCTCTGTTCTCTTCTTCCCCAAATCGAAGCCTCAAGAGATGCCGATTTGGTTTCCCAAGTCCTCGTCCACCACCACAACCCCTTCCACGCCATGGAATCCTCCCTCCAGCTCCACTCAATCTCCTTCTCCTCCCACCTCCTCGATCAAACCCTCCTCCGCCTCACCCACCACTCCAAAATCGCCCTCTCCCTCTTCCACTACGCCAAATCCCTCCCCTTCAACCCCCTCTCCACTTCCTCCTACAACATACTCATCGACATCCTCGCCAAAGTTCGCCAGTTCGACGCCGCCTGGCATCTCATCCTCCAAATGGATCACAAGGGTACCGAGACTTTTCTCTTGCTGATTCGCCGCTTGATTTCTGCTGGCCTAACTCGCCAGGCCGTTCGAGCTTTTGATGACATTGAGGGGCTTACTGGAAACAAGGTACGAACCGATGAATTTTGCTATTTATTGGATACCCTTTGTAAGTATGGCTATGTGAAAGTTGCGGCGGAGGTTTTTAACAAGAGGAAGGCGGAGTTTCATGTTGATGTCAAGATTTATACGGTTTTGATATATGGGTGGTGTAAGATTGGGCGATTTGAGATGGCGGAGAGGTTTTTGAAGGATATGATCGGGCGTGGGATTGAACCGAATGTGGTTACTTATAATGTTTTGTTGAATGGGGTTTGCAGGAGGGCGAGTTTGCATCCGGAGGGGAGGTTTGAGCAGACAATTAGGACTGCAGAGAAGGTGTTCGATGAAATGCGTCAGAGAGGGATTGATCCTGATGTGACCAGTTTCTCTATTGTGCTTCATGTGTATAGCCGTGCCCACAAGCCGGAGCTATCACTTGATAAGTTGAAGCAGATGAAAGAGCTTGGCATCTCCCCGACCGTCGCTACATATACTTCGGTGATCAAATGCCTCTGTTCTTGTGGGAGACTCGAGGAGGCCGAAAACTTGGTAGGAGAAATGGTGAGAAATGGAATCTCTCCATCCCCTGCAACTTATAATTGTTTCTTCAAAGAGTACAGAGGTAGAAAGGATGGGGCAGGTGCTTTGAGATTGTACAAGAAGATGAGGGAGGATTGTCTATATGCTGCTCCTAGTTTGCATACCTATAACATATTACTGAGCTTGTTTCTAAATCTTGACAAGAAGGAGACACTGAAAGAGGTGTGGAATGATATGAAGGAGAGCGGGATTGGGCCGGATTTGGATTCGTATACGACGATGATCCACGGGTTGTGCGAGAAGCAGAGATGGAGAGAGGCCTGTCAATTTTTTGTTGAGATGATTGAAAGGGGGTTTCTTCCCCAGAAGGTGACCTTTGAGATGCTTTACAGAGGGTTGATACAGTCTGATATGTTGAGAACTTGGAGGAGATTGAAGAAGAAACTGGAAGAAGAGTCTATAACTTATGCCTCTGAGTTCAAAAACTATCACATTAAGCCTTACAGGAGATGA

Coding sequence (CDS)

ATGGCGTATAGAAATCTCCATGGCTGTAAGCTCAAATCTCCTCTCCCTTCAGCCATTTCCTCCCTCCGCTCTCTCTGTTCTCTTCTTCCCCAAATCGAAGCCTCAAGAGATGCCGATTTGGTTTCCCAAGTCCTCGTCCACCACCACAACCCCTTCCACGCCATGGAATCCTCCCTCCAGCTCCACTCAATCTCCTTCTCCTCCCACCTCCTCGATCAAACCCTCCTCCGCCTCACCCACCACTCCAAAATCGCCCTCTCCCTCTTCCACTACGCCAAATCCCTCCCCTTCAACCCCCTCTCCACTTCCTCCTACAACATACTCATCGACATCCTCGCCAAAGTTCGCCAGTTCGACGCCGCCTGGCATCTCATCCTCCAAATGGATCACAAGGGTACCGAGACTTTTCTCTTGCTGATTCGCCGCTTGATTTCTGCTGGCCTAACTCGCCAGGCCGTTCGAGCTTTTGATGACATTGAGGGGCTTACTGGAAACAAGGTACGAACCGATGAATTTTGCTATTTATTGGATACCCTTTGTAAGTATGGCTATGTGAAAGTTGCGGCGGAGGTTTTTAACAAGAGGAAGGCGGAGTTTCATGTTGATGTCAAGATTTATACGGTTTTGATATATGGGTGGTGTAAGATTGGGCGATTTGAGATGGCGGAGAGGTTTTTGAAGGATATGATCGGGCGTGGGATTGAACCGAATGTGGTTACTTATAATGTTTTGTTGAATGGGGTTTGCAGGAGGGCGAGTTTGCATCCGGAGGGGAGGTTTGAGCAGACAATTAGGACTGCAGAGAAGGTGTTCGATGAAATGCGTCAGAGAGGGATTGATCCTGATGTGACCAGTTTCTCTATTGTGCTTCATGTGTATAGCCGTGCCCACAAGCCGGAGCTATCACTTGATAAGTTGAAGCAGATGAAAGAGCTTGGCATCTCCCCGACCGTCGCTACATATACTTCGGTGATCAAATGCCTCTGTTCTTGTGGGAGACTCGAGGAGGCCGAAAACTTGGTAGGAGAAATGGTGAGAAATGGAATCTCTCCATCCCCTGCAACTTATAATTGTTTCTTCAAAGAGTACAGAGGTAGAAAGGATGGGGCAGGTGCTTTGAGATTGTACAAGAAGATGAGGGAGGATTGTCTATATGCTGCTCCTAGTTTGCATACCTATAACATATTACTGAGCTTGTTTCTAAATCTTGACAAGAAGGAGACACTGAAAGAGGTGTGGAATGATATGAAGGAGAGCGGGATTGGGCCGGATTTGGATTCGTATACGACGATGATCCACGGGTTGTGCGAGAAGCAGAGATGGAGAGAGGCCTGTCAATTTTTTGTTGAGATGATTGAAAGGGGGTTTCTTCCCCAGAAGGTGACCTTTGAGATGCTTTACAGAGGGTTGATACAGTCTGATATGTTGAGAACTTGGAGGAGATTGAAGAAGAAACTGGAAGAAGAGTCTATAACTTATGCCTCTGAGTTCAAAAACTATCACATTAAGCCTTACAGGAGATGA

Protein sequence

MAYRNLHGCKLKSPLPSAISSLRSLCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKGTETFLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNKRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEESITYASEFKNYHIKPYRR
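
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code and reading frame 1, the first and last codons of the CDS can be translated and compared with the ends of the protein above. The function name `translate` and the short excerpts `cds_start` and `cds_end` are chosen here for illustration; they are copied from the beginning and end of the CDS rather than covering the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCGTATAGAAATCTCCATGGC"  # first 8 codons of the CDS
cds_end = "CCTTACAGGAGATGA"             # last 5 codons, including the stop codon

print(translate(cds_start))  # MAYRNLHG
print(translate(cds_end))    # PYRR
```

Both results agree with the start (MAYRNLHG...) and end (...PYRR) of the protein sequence listed above.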
BLAST of Cp4.1LG10g12370 vs. Swiss-Prot
Match: PP150_ARATH (Pentatricopeptide repeat-containing protein At2g13420, mitochondrial OS=Arabidopsis thaliana GN=At2g13420 PE=2 SV=1)

HSP 1 Score: 657.1 bits (1694), Expect = 1.5e-187
Identity = 317/480 (66.04%), Positives = 389/480 (81.04%), Query Frame = 1

Query: 29  LPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKIALSL 88
           LP++E S DA+L+SQ+L+ +HNPFH MESSLQL+ IS + +L+ QTLLRL H+SKIALS 
Sbjct: 32  LPKLEPSSDAELISQMLITNHNPFHFMESSLQLNGISLTPNLIHQTLLRLRHNSKIALSF 91

Query: 89  FHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKGTETFLLLIRRLISAGL 148
           F Y +SLP    + +S+N++IDIL +VRQFD    LI++MD    ETFL+L++RLI+AGL
Sbjct: 92  FQYLRSLPSPSTTPTSFNLIIDILGRVRQFDVVRQLIVEMDQTSPETFLILVKRLIAAGL 151

Query: 149 TRQAVRAFDDIEGLTGNK-VRTDEFCYLLDTLCKYGYVKVAAEVFNKRKAEFHVDVKIYT 208
           TRQAVRAFDD      N+  R  EF +LLDTLCKYGY K+A  VFN+RK EF  D K+YT
Sbjct: 152 TRQAVRAFDDAPCFLENRRFRLVEFGFLLDTLCKYGYTKMAVGVFNERKEEFGSDEKVYT 211

Query: 209 VLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPEGRFEQTIRTA 268
           +LI GWCK+ R +MAE+FL +MI  GIEPNVVTYNVLLNG+CR ASLHPE RFE+ +R A
Sbjct: 212 ILIAGWCKLRRIDMAEKFLVEMIESGIEPNVVTYNVLLNGICRTASLHPEERFERNVRNA 271

Query: 269 EKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPTVATYTSVIKC 328
           EKVFDEMRQRGI+PDVTSFSIVLH+YSRAHK EL+LDK+K MK  GISPT+ TYTSV+KC
Sbjct: 272 EKVFDEMRQRGIEPDVTSFSIVLHMYSRAHKAELTLDKMKLMKAKGISPTIETYTSVVKC 331

Query: 329 LCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYKKMREDCLYAA 388
           LCSCGRLEEAE L+  MV +GISPS ATYNCFFKEY+GRKD  GA+ LY+KM+       
Sbjct: 332 LCSCGRLEEAEELLETMVESGISPSSATYNCFFKEYKGRKDANGAMNLYRKMKNG--LCK 391

Query: 389 PSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCEKQRWREACQF 448
           PS  TYN+LL  F+NL K ET+KE+W+D+K S  GPDLDSYT+++HGLC K++W+EAC +
Sbjct: 392 PSTQTYNVLLGTFINLGKMETVKEIWDDLKASETGPDLDSYTSLVHGLCSKEKWKEACGY 451

Query: 449 FVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEESITYASEFKNYHIKPYRR 508
           FVEMIERGFLPQK+TFE LY+GLIQS+ +RTWRRLKKKL+EESIT+ SEF+ Y  +PY+R
Sbjct: 452 FVEMIERGFLPQKLTFETLYKGLIQSNKMRTWRRLKKKLDEESITFGSEFQRYPFEPYKR 509
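
The percentages in an HSP summary line are the match counts divided by the alignment length. The following sketch recomputes them from the counts for the first hit; it assumes the line layout shown in this record, and the names `HSP_FIELD` and `check_hsp_line` are hypothetical helpers, not part of BLAST.

```python
import re

# Matches fields such as "Identity = 317/480 (66.04%)".
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line):
    """Return {label: (matches, aligned, reported_pct, recomputed_pct)}."""
    stats = {}
    for label, matches, aligned, pct in HSP_FIELD.findall(line):
        matches, aligned = int(matches), int(aligned)
        recomputed = round(100.0 * matches / aligned, 2)
        stats[label] = (matches, aligned, float(pct), recomputed)
    return stats

line = "Identity = 317/480 (66.04%), Positives = 389/480 (81.04%), Query Frame = 1"
stats = check_hsp_line(line)
for label, (matches, aligned, reported, recomputed) in stats.items():
    print(label, matches, aligned, reported, recomputed)
```

For this hit the recomputed values (66.04 and 81.04) agree with the reported ones.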

BLAST of Cp4.1LG10g12370 vs. Swiss-Prot
Match: PPR54_ARATH (Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidopsis thaliana GN=At1g20300 PE=2 SV=1)

HSP 1 Score: 210.7 bits (535), Expect = 3.8e-53
Identity = 128/415 (30.84%), Positives = 211/415 (50.84%), Query Frame = 1

Query: 85  ALSLFHYAKSLP-FNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKGTE----TFLLL 144
           +L+ F++A S   ++  S   YN +ID+  KVRQFD AWHLI  M  +  E    TF +L
Sbjct: 133 SLAFFNWATSRDDYDHKSPHPYNEMIDLSGKVRQFDLAWHLIDLMKSRNVEISIETFTIL 192

Query: 145 IRRLISAGLTRQAVRAFDDIE--GLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNKRKA 204
           IRR + AGL  +AV  F+ +E  G   +K+    F  ++  L +      A   F+  K 
Sbjct: 193 IRRYVRAGLASEAVHCFNRMEDYGCVPDKIA---FSIVISNLSRKRRASEAQSFFDSLKD 252

Query: 205 EFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPE 264
            F  DV +YT L+ GWC+ G    AE+  K+M   GIEPNV TY+++++ +CR       
Sbjct: 253 RFEPDVIVYTNLVRGWCRAGEISEAEKVFKEMKLAGIEPNVYTYSIVIDALCRCGQ---- 312

Query: 265 GRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPT 324
                 I  A  VF +M   G  P+  +F+ ++ V+ +A + E  L    QMK+LG  P 
Sbjct: 313 ------ISRAHDVFADMLDSGCAPNAITFNNLMRVHVKAGRTEKVLQVYNQMKKLGCEPD 372

Query: 325 VATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYK 384
             TY  +I+  C    LE A  ++  M++     + +T+N  F+    ++D  GA R+Y 
Sbjct: 373 TITYNFLIEAHCRDENLENAVKVLNTMIKKKCEVNASTFNTIFRYIEKKRDVNGAHRMYS 432

Query: 385 KMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCE 444
           KM E      P+  TYNIL+ +F+     + + ++  +M +  + P++++Y  ++   C 
Sbjct: 433 KMME--AKCEPNTVTYNILMRMFVGSKSTDMVLKMKKEMDDKEVEPNVNTYRLLVTMFCG 492

Query: 445 KQRWREACQFFVEMIERGFL-PQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEESI 492
              W  A + F EM+E   L P    +EM+   L ++  L+    L +K+ ++ +
Sbjct: 493 MGHWNNAYKLFKEMVEEKCLTPSLSLYEMVLAQLRRAGQLKKHEELVEKMIQKGL 532

BLAST of Cp4.1LG10g12370 vs. Swiss-Prot
Match: PPR78_ARATH (Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidopsis thaliana GN=At1g52640 PE=2 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 1.4e-52
Identity = 145/478 (30.33%), Positives = 231/478 (48.33%), Query Frame = 1

Query: 21  SLRSLCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTH 80
           S R   +LL    +    + +S+VL  H NP   +E +L  +S   SS+L++Q L R  +
Sbjct: 21  SFRIFSTLLHDPPSPDLVNEISRVLSDHRNPKDDLEHTLVAYSPRVSSNLVEQVLKRCKN 80

Query: 81  HSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKG-----TET 140
               A   F +A+ +P    S  SY+IL++IL   +QF   W  +++          ++ 
Sbjct: 81  LGFPAHRFFLWARRIPDFAHSLESYHILVEILGSSKQFALLWDFLIEAREYNYFEISSKV 140

Query: 141 FLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNKR 200
           F ++ R    A L  +A RAF+ +    G K   D+   LL +LC   +V  A E F K 
Sbjct: 141 FWIVFRAYSRANLPSEACRAFNRMVEF-GIKPCVDDLDQLLHSLCDKKHVNHAQEFFGKA 200

Query: 201 KAEFHV-DVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASL 260
           K    V   K Y++L+ GW +I     A +   +M+ R    +++ YN LL+ +C+   +
Sbjct: 201 KGFGIVPSAKTYSILVRGWARIRDASGARKVFDEMLERNCVVDLLAYNALLDALCKSGDV 260

Query: 261 HPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGI 320
             +G +        K+F EM   G+ PD  SF+I +H Y  A     +   L +MK   +
Sbjct: 261 --DGGY--------KMFQEMGNLGLKPDAYSFAIFIHAYCDAGDVHSAYKVLDRMKRYDL 320

Query: 321 SPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALR 380
            P V T+  +IK LC   ++++A  L+ EM++ G +P   TYN     +    +   A +
Sbjct: 321 VPNVYTFNHIIKTLCKNEKVDDAYLLLDEMIQKGANPDTWTYNSIMAYHCDHCEVNRATK 380

Query: 381 LYKKM-REDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIH 440
           L  +M R  CL   P  HTYN++L L + + + +   E+W  M E    P + +YT MIH
Sbjct: 381 LLSRMDRTKCL---PDRHTYNMVLKLLIRIGRFDRATEIWEGMSERKFYPTVATYTVMIH 440

Query: 441 GLCEKQ-RWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEES 491
           GL  K+ +  EAC++F  MI+ G  P   T EML   L+    +     L  K+E  S
Sbjct: 441 GLVRKKGKLEEACRYFEMMIDEGIPPYSTTVEMLRNRLVGWGQMDVVDVLAGKMERSS 484

BLAST of Cp4.1LG10g12370 vs. Swiss-Prot
Match: PP112_ARATH (Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidopsis thaliana GN=At1g71060 PE=2 SV=1)

HSP 1 Score: 202.6 bits (514), Expect = 1.0e-50
Identity = 132/463 (28.51%), Positives = 235/463 (50.76%), Query Frame = 1

Query: 33  EASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKIALSLFHYA 92
           +AS+DA+ + ++L    +    +E+ L   S+  S  L+++ L +L++   +ALS+F +A
Sbjct: 61  DASQDAERICKILTKFTDS--KVETLLNEASVKLSPALIEEVLKKLSNAGVLALSVFKWA 120

Query: 93  KSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHK---GTETFLLLIRRLISAGLT 152
           ++      +TS+YN LI+ L K++QF   W L+  M  K     ETF L+ RR   A   
Sbjct: 121 ENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKAKKLLSKETFALISRRYARARKV 180

Query: 153 RQAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNK-RKAEFHVDVKIYTV 212
           ++A+ AF  +E   G K+ + +F  +LDTL K   V  A +VF+K +K  F  D+K YT+
Sbjct: 181 KEAIGAFHKMEEF-GFKMESSDFNRMLDTLSKSRNVGDAQKVFDKMKKKRFEPDIKSYTI 240

Query: 213 LIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPEGRFEQTIRTAE 272
           L+ GW +       +   ++M   G EP+VV Y +++N  C+        ++E+ IR   
Sbjct: 241 LLEGWGQELNLLRVDEVNREMKDEGFEPDVVAYGIIINAHCKAK------KYEEAIR--- 300

Query: 273 KVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPTVATYTSVIKCL 332
             F+EM QR   P    F  +++      K   +L+  ++ K  G      TY +++   
Sbjct: 301 -FFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNALVGAY 360

Query: 333 CSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYKKMREDCLYAAP 392
           C   R+E+A   V EM   G+ P+  TY+         +    A  +Y+ M        P
Sbjct: 361 CWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTMS-----CEP 420

Query: 393 SLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCEKQRWREACQFF 452
           ++ TY I++ +F N ++ +   ++W++MK  G+ P +  ++++I  LC + +  EAC++F
Sbjct: 421 TVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDEACEYF 480

Query: 453 VEMIERGFLPQKVTFEMLYRGLIQ-------SDMLRTWRRLKK 485
            EM++ G  P    F  L + L+        +D++    RL+K
Sbjct: 481 NEMLDVGIRPPGHMFSRLKQTLLDEGRKDKVTDLVVKMDRLRK 505

BLAST of Cp4.1LG10g12370 vs. Swiss-Prot
Match: PP275_ARATH (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 5.1e-50
Identity = 127/463 (27.43%), Positives = 228/463 (49.24%), Query Frame = 1

Query: 25  LCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKI 84
           +C    + E + + + + ++L +HH+    +E +L    I     L+ + L R      +
Sbjct: 54  VCPEKHEDEFAGEVEKIYRILRNHHSRVPKLELALNESGIDLRPGLIIRVLSRCGDAGNL 113

Query: 85  ALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKGTET-----FLLL 144
               F +A   P    S      ++ IL+K+RQF A W LI +M     E      F++L
Sbjct: 114 GYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVL 173

Query: 145 IRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCY--LLDTLCKYGYVKVAAEVFNKRKA 204
           +RR  SA + ++AV   D++       +  DE+ +  LLD LCK G VK A++VF   + 
Sbjct: 174 MRRFASANMVKKAVEVLDEMPKYG---LEPDEYVFGCLLDALCKNGSVKEASKVFEDMRE 233

Query: 205 EFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPE 264
           +F  +++ +T L+YGWC+ G+   A+  L  M   G+EP++V +  LL+G      +   
Sbjct: 234 KFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKM--- 293

Query: 265 GRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHK-PELSLDKLKQMKELGISP 324
                    A  + ++MR+RG +P+V  +++++    R  K  + ++    +M+  G   
Sbjct: 294 -------ADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEA 353

Query: 325 TVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLY 384
            + TYT++I   C  G +++  +++ +M + G+ PS  TY      +  ++     L L 
Sbjct: 354 DIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELI 413

Query: 385 KKM-REDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGL 444
           +KM R  C    P L  YN+++ L   L + +    +WN+M+ +G+ P +D++  MI+G 
Sbjct: 414 EKMKRRGC---HPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGF 473

Query: 445 CEKQRWREACQFFVEMIERGFL--PQKVTFEMLYRGLIQSDML 477
             +    EAC  F EM+ RG    PQ  T + L   L++ D L
Sbjct: 474 TSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKL 500

BLAST of Cp4.1LG10g12370 vs. TrEMBL
Match: A0A0A0KZ73_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G506840 PE=4 SV=1)

HSP 1 Score: 914.1 bits (2361), Expect = 7.7e-263
Identity = 448/507 (88.36%), Positives = 474/507 (93.49%), Query Frame = 1

Query: 1   MAYRNLHGCKLKSPLPSAISSLRSLCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQ 60
           MA+  LH   L SPLPS ISS+R L SLLPQI+ S+DADLVSQ+L+HHHNPFH+MESSLQ
Sbjct: 1   MAFTKLHSSSLISPLPSIISSIRFLSSLLPQIQPSKDADLVSQILLHHHNPFHSMESSLQ 60

Query: 61  LHSISFSSHLLDQTLLRLTHHSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDA 120
           LHSISFSSHLLDQTLLRLTHHSKIALS F YA SLP NP+ST+SYNIL+DILAKVRQFDA
Sbjct: 61  LHSISFSSHLLDQTLLRLTHHSKIALSFFDYANSLPSNPISTTSYNILLDILAKVRQFDA 120

Query: 121 AWHLILQMDHKGTETFLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLC 180
           AWHLILQMDHKGT+TFLLLIRRLIS+G TRQA+RAFDDIEGLTGNKV  D+FCYLLD LC
Sbjct: 121 AWHLILQMDHKGTDTFLLLIRRLISSGRTRQAIRAFDDIEGLTGNKVGIDDFCYLLDVLC 180

Query: 181 KYGYVKVAAEVFNKRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVT 240
           KYGYVKVA EVFNKRK EF VDVKIYT+LIYGWCKIGRFEMAERFLKDM+ RGIEPNVVT
Sbjct: 181 KYGYVKVAVEVFNKRKEEFGVDVKIYTILIYGWCKIGRFEMAERFLKDMVERGIEPNVVT 240

Query: 241 YNVLLNGVCRRASLHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPE 300
           YNVLLNGVCRRASLHPEGRFE+TIR AEKVFDEMR+RGI+PDVTSFSIVLHVYSRAHKPE
Sbjct: 241 YNVLLNGVCRRASLHPEGRFEKTIRHAEKVFDEMRKRGIEPDVTSFSIVLHVYSRAHKPE 300

Query: 301 LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFF 360
           LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLE+ ENL+ EMVR+GISPSP TYNCFF
Sbjct: 301 LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLEDGENLIEEMVRSGISPSPTTYNCFF 360

Query: 361 KEYRGRKDGAGALRLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESG 420
           KEYRGRKDGAGALRLYKKMREDCL  APSLHTYNILL+LFLNLDKKETLKE+WNDMKESG
Sbjct: 361 KEYRGRKDGAGALRLYKKMREDCL-CAPSLHTYNILLALFLNLDKKETLKELWNDMKESG 420

Query: 421 IGPDLDSYTTMIHGLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR 480
           +GPDLDSYTT+IHGLCEKQRW EACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR
Sbjct: 421 VGPDLDSYTTIIHGLCEKQRWSEACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR 480

Query: 481 RLKKKLEEESITYASEFKNYHIKPYRR 508
           RLKKKLEEES TY SE KNYHIKPY R
Sbjct: 481 RLKKKLEEESKTYGSELKNYHIKPYMR 506

BLAST of Cp4.1LG10g12370 vs. TrEMBL
Match: M5VHD4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015078mg PE=4 SV=1)

HSP 1 Score: 723.8 bits (1867), Expect = 1.5e-205
Identity = 348/477 (72.96%), Positives = 412/477 (86.37%), Query Frame = 1

Query: 35  SRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKIALSLFHYAKS 94
           S DA+L+S++LVHHHNPFH+MESSLQLH I+ S  LL+ TLLRL H+SKIAL+ F+YAKS
Sbjct: 5   SNDAELISKILVHHHNPFHSMESSLQLHGITLSPQLLNHTLLRLIHNSKIALAFFNYAKS 64

Query: 95  LPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKG----TETFLLLIRRLISAGLTR 154
           LP  PLST+S+N+LIDILAKVRQ+D AW LIL+MD+        TFL+LIRRLIS+GLTR
Sbjct: 65  LPDPPLSTASFNLLIDILAKVRQYDVAWQLILEMDNFNLTPTASTFLILIRRLISSGLTR 124

Query: 155 QAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNKRKAEFHVDVKIYTVLI 214
           QAVRAF+D+E     K  + +FC LLDTL KYG+VKVAAEVFNK+K  F  DVK+YTVL+
Sbjct: 125 QAVRAFEDMETFVQTKPSSQDFCCLLDTLSKYGHVKVAAEVFNKKKNGFVPDVKMYTVLV 184

Query: 215 YGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPEGRFEQTIRTAEKV 274
           YGWCKIGRF+MAERFL+DMI RGIEPNVVTYNV LNG+CRRASLHPE RFE+TIR AEKV
Sbjct: 185 YGWCKIGRFDMAERFLRDMIERGIEPNVVTYNVFLNGICRRASLHPEERFERTIRNAEKV 244

Query: 275 FDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPTVATYTSVIKCLCS 334
           F EM +RGI+PDVTSFSIVLHVYSRAHKPELSL+KLK M+E GI PT+ TYTSV+KCLCS
Sbjct: 245 FKEMWERGIEPDVTSFSIVLHVYSRAHKPELSLEKLKLMRERGICPTLETYTSVVKCLCS 304

Query: 335 CGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYKKMREDCLYAAPSL 394
           CGRLE+AE L+G+MV +G+SP  ATYNCFFKEYRGRKD  GAL+LY+KM+E+ L   PS+
Sbjct: 305 CGRLEDAEELLGKMVTSGVSPCAATYNCFFKEYRGRKDSEGALKLYRKMKEEGL-CVPSM 364

Query: 395 HTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCEKQRWREACQFFVE 454
           HTYNIL+ + L L++ E ++E+WNDMKESG+GPDLDSYT +IHGLC KQ+WREACQ FVE
Sbjct: 365 HTYNILVGMLLELNRMEIVREIWNDMKESGVGPDLDSYTMLIHGLCGKQKWREACQLFVE 424

Query: 455 MIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEESITYASEFKNYHIKPYRR 508
           MIE+G LPQK+TFE LY+GLIQSDMLRTWRRLKKKL+EESI++ SEF+NYH+KPYRR
Sbjct: 425 MIEKGLLPQKITFETLYKGLIQSDMLRTWRRLKKKLDEESISFGSEFQNYHLKPYRR 480

BLAST of Cp4.1LG10g12370 vs. TrEMBL
Match: V4T738_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000792mg PE=4 SV=1)

HSP 1 Score: 709.5 bits (1830), Expect = 2.9e-201
Identity = 343/494 (69.43%), Positives = 410/494 (83.00%), Query Frame = 1

Query: 29  LPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKIALSL 88
           LP++E S DADL+SQ+++ HHNPFHAMESSLQLH I+ S HLL+QTLLRL H SK+AL+ 
Sbjct: 48  LPKLEPSNDADLLSQIVLAHHNPFHAMESSLQLHGITLSPHLLNQTLLRLQHSSKVALAF 107

Query: 89  FHYAKSLPFNP------LSTSSYNILIDILAKVRQFDAAWHLILQMDHKG----TETFLL 148
           FHY++SLP N        S SSYN++IDIL+KVRQFD  W LI+QMD       +  FL+
Sbjct: 108 FHYSQSLPSNSNNPRPSTSASSYNLIIDILSKVRQFDVVWQLIVQMDQNNITPISSAFLI 167

Query: 149 LIRRLISAGLTRQAVRAFDDIEGLT-----GNKVRTDEFCYLLDTLCKYGYVKVAAEVFN 208
           LIRRLI+AGLTRQA+RAFDD++         N    + FC+LLDTLCKYGYVKVA EVFN
Sbjct: 168 LIRRLIAAGLTRQAIRAFDDMKCFVQTENQNNNNNVNFFCFLLDTLCKYGYVKVAVEVFN 227

Query: 209 KRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRAS 268
           + K E   +VK+YT LIYGWCKI R +MAERFL +MI RG+EPNVVTYNVLLNGVCRRAS
Sbjct: 228 RNKHEIIPNVKMYTSLIYGWCKINRIDMAERFLGEMIERGVEPNVVTYNVLLNGVCRRAS 287

Query: 269 LHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELG 328
           LHP  RFE+TIR AEKVFDEMR RGI+PDVTSFSIVLHVYSRAH+P+LSLDKL  MKE G
Sbjct: 288 LHPSERFEKTIRNAEKVFDEMRVRGIEPDVTSFSIVLHVYSRAHQPQLSLDKLNFMKEKG 347

Query: 329 ISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGAL 388
           I PTVATY+SV+KCLCSCGR+E+AE L+GEMVRNG+ PS  TYNCFFKEYRGRKD  GA+
Sbjct: 348 ICPTVATYSSVVKCLCSCGRIEDAEELLGEMVRNGVCPSAETYNCFFKEYRGRKDANGAM 407

Query: 389 RLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIH 448
           +LY++M+ED L   P++H+YNIL+ +F+ L++ + ++E+WND+K SG+GPDLDSYT +IH
Sbjct: 408 KLYRQMKEDGL-CVPNMHSYNILIGMFMALNRMDMVREIWNDVKGSGLGPDLDSYTMLIH 467

Query: 449 GLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEESITY 508
           GLCEKQ+W+EACQ+FVEMIE+G LPQKVTFE LYRGLIQSDMLRTWRRLKKKL+EESIT+
Sbjct: 468 GLCEKQKWKEACQYFVEMIEKGLLPQKVTFETLYRGLIQSDMLRTWRRLKKKLDEESITF 527

BLAST of Cp4.1LG10g12370 vs. TrEMBL
Match: B9RFN9_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1436140 PE=4 SV=1)

HSP 1 Score: 706.8 bits (1823), Expect = 1.9e-200
Identity = 339/501 (67.66%), Positives = 416/501 (83.03%), Query Frame = 1

Query: 11  LKSPLPSAISSLRSLCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHL 70
           L SP    +  L S  SL P +E S DA++VS++L++HHNPFHAMESSLQLH I+ SS L
Sbjct: 13  LTSPPNLRVQCLFSTFSL-PSLEPSNDAEIVSEILLNHHNPFHAMESSLQLHGITLSSSL 72

Query: 71  LDQTLLRLTHHSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDH 130
           L QTLLRL H+SKIALS FHY+ SLP + ++T++YN++IDIL+KVRQFD +W LI+QMD 
Sbjct: 73  LHQTLLRLRHNSKIALSFFHYSLSLPSSSVNTTTYNLIIDILSKVRQFDVSWQLIIQMDQ 132

Query: 131 KGTE----TFLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVK 190
              +    TFL+LIRRLISAG TRQA+RAFDD+E      V    FC+LLDTLCKYGY+K
Sbjct: 133 NNLQPNSHTFLILIRRLISAGFTRQAIRAFDDMESFIAETVNQTHFCFLLDTLCKYGYIK 192

Query: 191 VAAEVFNKRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLN 250
           VA EVFNKRK  F  +V+IYTVLIYGWCKIGR +MAERF+++M   GIE NVVTYNVLL+
Sbjct: 193 VAVEVFNKRKFRFLPNVRIYTVLIYGWCKIGRIDMAERFIREMDEMGIEANVVTYNVLLD 252

Query: 251 GVCRRASLHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKL 310
           G+CRRA L PEGRFE+TI  A+KVFDEMRQ+GI+PDVTSFSI+LHVYSRAHKP+L++DKL
Sbjct: 253 GICRRAKLQPEGRFERTIMKADKVFDEMRQKGIEPDVTSFSILLHVYSRAHKPQLTVDKL 312

Query: 311 KQMKELGISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGR 370
           K M+E+GI PTVATYTSV+KCLCSCGR+++AE L+ +MVRNG+SP+ ATYNCFFKEYRGR
Sbjct: 313 KLMEEMGICPTVATYTSVLKCLCSCGRIDDAEELLEQMVRNGVSPNAATYNCFFKEYRGR 372

Query: 371 KDGAGALRLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLD 430
           KD   AL+LY+K+R++ L   PS+HTYNILL +F+ L++   + E+WND++ SG GPDLD
Sbjct: 373 KDPETALKLYRKIRQENL-CDPSVHTYNILLGMFMKLNRFNIVNEIWNDLRSSGSGPDLD 432

Query: 431 SYTTMIHGLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKL 490
           SYT ++HGLCEKQ+W++ACQFFVEMIE+G LPQK TFEMLY GLIQS+MLRTWRRLKKKL
Sbjct: 433 SYTLLVHGLCEKQKWQKACQFFVEMIEKGLLPQKATFEMLYAGLIQSNMLRTWRRLKKKL 492

Query: 491 EEESITYASEFKNYHIKPYRR 508
           +EESI + SEF NYH+KPYRR
Sbjct: 493 DEESIAFGSEFSNYHLKPYRR 511

BLAST of Cp4.1LG10g12370 vs. TrEMBL
Match: A0A067KE50_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12786 PE=4 SV=1)

HSP 1 Score: 705.3 bits (1819), Expect = 5.4e-200
Identity = 345/510 (67.65%), Positives = 413/510 (80.98%), Query Frame = 1

Query: 1   MAYRNLHGCKLKSPLPSAISSLR-SLCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSL 60
           MA R  H   L    P  I  L  S  S LP +E S DA+L+SQ+L+ H+NPFHAMESSL
Sbjct: 1   MAIRLFHSRALIPSTPFCIHRLFCSSSSSLPSLEPSNDAELLSQILLRHYNPFHAMESSL 60

Query: 61  QLHSISFSSHLLDQTLLRLTHHSKIALSLFHYAKSLPFNP--LSTSSYNILIDILAKVRQ 120
           QLH IS ++ LL QTLLRL HHSKIALSLFHYA SLP +   ++++SY+I+IDIL+KV Q
Sbjct: 61  QLHGISLTTPLLQQTLLRLRHHSKIALSLFHYALSLPSSQSTVTSTSYDIIIDILSKVHQ 120

Query: 121 FDAAWHLILQMDHKGTETFLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLD 180
           FD +W LI+QMD   + TF +LIRRLI+AGLTRQA+RAFDD+E      V    FC+LLD
Sbjct: 121 FDVSWQLIIQMDEPTSHTFFVLIRRLIAAGLTRQAIRAFDDMESFITENVDETHFCFLLD 180

Query: 181 TLCKYGYVKVAAEVFNKRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPN 240
           TLCKYGY KVA E+FNKRK  F  +V++YT+LIYGWCKIGR +MAERFL++M  RGIEPN
Sbjct: 181 TLCKYGYPKVAVEIFNKRKYRFSPNVRMYTILIYGWCKIGRIDMAERFLREMDERGIEPN 240

Query: 241 VVTYNVLLNGVCRRASLHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAH 300
           VVTYNVLLNG+CRRA L PE RFE+TI  AE VFDEMR RGI+PDVTSFSI+LHVYSRAH
Sbjct: 241 VVTYNVLLNGICRRAKLQPESRFERTITLAENVFDEMRHRGIEPDVTSFSILLHVYSRAH 300

Query: 301 KPELSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYN 360
           KP+L+LDKLK M+E GI PTV TYTSV+KCLCSCGR+E+AE L+ EM+RNG+SP+  TYN
Sbjct: 301 KPQLTLDKLKLMEEKGICPTVTTYTSVVKCLCSCGRVEDAEELLVEMIRNGVSPNAVTYN 360

Query: 361 CFFKEYRGRKDGAGALRLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMK 420
           CFFKEYRGRKD   AL+LY++MRED  +A  S+ TY ILL +F+ L++ E + E+WND+ 
Sbjct: 361 CFFKEYRGRKDAESALKLYRRMREDDPFAL-SMQTYKILLGMFMKLNRMEIVNEIWNDLC 420

Query: 421 ESGIGPDLDSYTTMIHGLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLR 480
            SG GPDLDSYT +I GLCEKQ+W+EAC++FVEMIERGFLPQKVTFE LY+GLIQSDMLR
Sbjct: 421 TSGPGPDLDSYTMLIRGLCEKQKWKEACKYFVEMIERGFLPQKVTFETLYKGLIQSDMLR 480

Query: 481 TWRRLKKKLEEESITYASEFKNYHIKPYRR 508
           TWRRLKKKL+EESI + SEF+NYH+KPYRR
Sbjct: 481 TWRRLKKKLDEESIAFGSEFQNYHLKPYRR 509

BLAST of Cp4.1LG10g12370 vs. TAIR10
Match: AT1G20300.1 (AT1G20300.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 210.7 bits (535), Expect = 2.1e-54
Identity = 128/415 (30.84%), Positives = 211/415 (50.84%), Query Frame = 1

Query: 85  ALSLFHYAKSLP-FNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKGTE----TFLLL 144
           +L+ F++A S   ++  S   YN +ID+  KVRQFD AWHLI  M  +  E    TF +L
Sbjct: 133 SLAFFNWATSRDDYDHKSPHPYNEMIDLSGKVRQFDLAWHLIDLMKSRNVEISIETFTIL 192

Query: 145 IRRLISAGLTRQAVRAFDDIE--GLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNKRKA 204
           IRR + AGL  +AV  F+ +E  G   +K+    F  ++  L +      A   F+  K 
Sbjct: 193 IRRYVRAGLASEAVHCFNRMEDYGCVPDKIA---FSIVISNLSRKRRASEAQSFFDSLKD 252

Query: 205 EFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPE 264
            F  DV +YT L+ GWC+ G    AE+  K+M   GIEPNV TY+++++ +CR       
Sbjct: 253 RFEPDVIVYTNLVRGWCRAGEISEAEKVFKEMKLAGIEPNVYTYSIVIDALCRCGQ---- 312

Query: 265 GRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPT 324
                 I  A  VF +M   G  P+  +F+ ++ V+ +A + E  L    QMK+LG  P 
Sbjct: 313 ------ISRAHDVFADMLDSGCAPNAITFNNLMRVHVKAGRTEKVLQVYNQMKKLGCEPD 372

Query: 325 VATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYK 384
             TY  +I+  C    LE A  ++  M++     + +T+N  F+    ++D  GA R+Y 
Sbjct: 373 TITYNFLIEAHCRDENLENAVKVLNTMIKKKCEVNASTFNTIFRYIEKKRDVNGAHRMYS 432

Query: 385 KMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCE 444
           KM E      P+  TYNIL+ +F+     + + ++  +M +  + P++++Y  ++   C 
Sbjct: 433 KMME--AKCEPNTVTYNILMRMFVGSKSTDMVLKMKKEMDDKEVEPNVNTYRLLVTMFCG 492

Query: 445 KQRWREACQFFVEMIERGFL-PQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEESI 492
              W  A + F EM+E   L P    +EM+   L ++  L+    L +K+ ++ +
Sbjct: 493 MGHWNNAYKLFKEMVEEKCLTPSLSLYEMVLAQLRRAGQLKKHEELVEKMIQKGL 532

BLAST of Cp4.1LG10g12370 vs. TAIR10
Match: AT1G52640.1 (AT1G52640.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 208.8 bits (530), Expect = 8.1e-54
Identity = 145/478 (30.33%), Positives = 231/478 (48.33%), Query Frame = 1

Query: 21  SLRSLCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTH 80
           S R   +LL    +    + +S+VL  H NP   +E +L  +S   SS+L++Q L R  +
Sbjct: 21  SFRIFSTLLHDPPSPDLVNEISRVLSDHRNPKDDLEHTLVAYSPRVSSNLVEQVLKRCKN 80

Query: 81  HSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKG-----TET 140
               A   F +A+ +P    S  SY+IL++IL   +QF   W  +++          ++ 
Sbjct: 81  LGFPAHRFFLWARRIPDFAHSLESYHILVEILGSSKQFALLWDFLIEAREYNYFEISSKV 140

Query: 141 FLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNKR 200
           F ++ R    A L  +A RAF+ +    G K   D+   LL +LC   +V  A E F K 
Sbjct: 141 FWIVFRAYSRANLPSEACRAFNRMVEF-GIKPCVDDLDQLLHSLCDKKHVNHAQEFFGKA 200

Query: 201 KAEFHV-DVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASL 260
           K    V   K Y++L+ GW +I     A +   +M+ R    +++ YN LL+ +C+   +
Sbjct: 201 KGFGIVPSAKTYSILVRGWARIRDASGARKVFDEMLERNCVVDLLAYNALLDALCKSGDV 260

Query: 261 HPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGI 320
             +G +        K+F EM   G+ PD  SF+I +H Y  A     +   L +MK   +
Sbjct: 261 --DGGY--------KMFQEMGNLGLKPDAYSFAIFIHAYCDAGDVHSAYKVLDRMKRYDL 320

Query: 321 SPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALR 380
            P V T+  +IK LC   ++++A  L+ EM++ G +P   TYN     +    +   A +
Sbjct: 321 VPNVYTFNHIIKTLCKNEKVDDAYLLLDEMIQKGANPDTWTYNSIMAYHCDHCEVNRATK 380

Query: 381 LYKKM-REDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIH 440
           L  +M R  CL   P  HTYN++L L + + + +   E+W  M E    P + +YT MIH
Sbjct: 381 LLSRMDRTKCL---PDRHTYNMVLKLLIRIGRFDRATEIWEGMSERKFYPTVATYTVMIH 440

Query: 441 GLCEKQ-RWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEES 491
           GL  K+ +  EAC++F  MI+ G  P   T EML   L+    +     L  K+E  S
Sbjct: 441 GLVRKKGKLEEACRYFEMMIDEGIPPYSTTVEMLRNRLVGWGQMDVVDVLAGKMERSS 484

BLAST of Cp4.1LG10g12370 vs. TAIR10
Match: AT1G71060.1 (AT1G71060.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 202.6 bits (514), Expect = 5.8e-52
Identity = 132/463 (28.51%), Positives = 235/463 (50.76%), Query Frame = 1

Query: 33  EASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKIALSLFHYA 92
           +AS+DA+ + ++L    +    +E+ L   S+  S  L+++ L +L++   +ALS+F +A
Sbjct: 61  DASQDAERICKILTKFTDS--KVETLLNEASVKLSPALIEEVLKKLSNAGVLALSVFKWA 120

Query: 93  KSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHK---GTETFLLLIRRLISAGLT 152
           ++      +TS+YN LI+ L K++QF   W L+  M  K     ETF L+ RR   A   
Sbjct: 121 ENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKAKKLLSKETFALISRRYARARKV 180

Query: 153 RQAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNK-RKAEFHVDVKIYTV 212
           ++A+ AF  +E   G K+ + +F  +LDTL K   V  A +VF+K +K  F  D+K YT+
Sbjct: 181 KEAIGAFHKMEEF-GFKMESSDFNRMLDTLSKSRNVGDAQKVFDKMKKKRFEPDIKSYTI 240

Query: 213 LIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPEGRFEQTIRTAE 272
           L+ GW +       +   ++M   G EP+VV Y +++N  C+        ++E+ IR   
Sbjct: 241 LLEGWGQELNLLRVDEVNREMKDEGFEPDVVAYGIIINAHCKAK------KYEEAIR--- 300

Query: 273 KVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPTVATYTSVIKCL 332
             F+EM QR   P    F  +++      K   +L+  ++ K  G      TY +++   
Sbjct: 301 -FFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNALVGAY 360

Query: 333 CSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYKKMREDCLYAAP 392
           C   R+E+A   V EM   G+ P+  TY+         +    A  +Y+ M        P
Sbjct: 361 CWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTMS-----CEP 420

Query: 393 SLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCEKQRWREACQFF 452
           ++ TY I++ +F N ++ +   ++W++MK  G+ P +  ++++I  LC + +  EAC++F
Sbjct: 421 TVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDEACEYF 480

Query: 453 VEMIERGFLPQKVTFEMLYRGLIQ-------SDMLRTWRRLKK 485
            EM++ G  P    F  L + L+        +D++    RL+K
Sbjct: 481 NEMLDVGIRPPGHMFSRLKQTLLDEGRKDKVTDLVVKMDRLRK 505
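The percentages on each HSP statistics line are the identical and positive residue counts divided by the alignment length (here 132/463 and 235/463). A minimal Python sketch, assuming the line layout shown in this record, that parses such a line and recomputes both percentages; the function name `parse_hsp_stats` and the dictionary keys are illustrative choices, not part of any BLAST tool:

```python
import re

# Matches the per-HSP statistics line as laid out in this record.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the raw counts, rounded to two decimals
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 132/463 (28.51%), Positives = 235/463 (50.76%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick consistency check when these lines are copied or re-keyed by hand.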

BLAST of Cp4.1LG10g12370 vs. TAIR10
Match: AT3G49730.1 (AT3G49730.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 200.3 bits (508), Expect = 2.9e-51
Identity = 127/463 (27.43%), Positives = 228/463 (49.24%), Query Frame = 1

Query: 25  LCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKI 84
           +C    + E + + + + ++L +HH+    +E +L    I     L+ + L R      +
Sbjct: 54  VCPEKHEDEFAGEVEKIYRILRNHHSRVPKLELALNESGIDLRPGLIIRVLSRCGDAGNL 113

Query: 85  ALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKGTET-----FLLL 144
               F +A   P    S      ++ IL+K+RQF A W LI +M     E      F++L
Sbjct: 114 GYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVL 173

Query: 145 IRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCY--LLDTLCKYGYVKVAAEVFNKRKA 204
           +RR  SA + ++AV   D++       +  DE+ +  LLD LCK G VK A++VF   + 
Sbjct: 174 MRRFASANMVKKAVEVLDEMPKYG---LEPDEYVFGCLLDALCKNGSVKEASKVFEDMRE 233

Query: 205 EFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPE 264
           +F  +++ +T L+YGWC+ G+   A+  L  M   G+EP++V +  LL+G      +   
Sbjct: 234 KFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKM--- 293

Query: 265 GRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHK-PELSLDKLKQMKELGISP 324
                    A  + ++MR+RG +P+V  +++++    R  K  + ++    +M+  G   
Sbjct: 294 -------ADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEA 353

Query: 325 TVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLY 384
            + TYT++I   C  G +++  +++ +M + G+ PS  TY      +  ++     L L 
Sbjct: 354 DIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELI 413

Query: 385 KKM-REDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGL 444
           +KM R  C    P L  YN+++ L   L + +    +WN+M+ +G+ P +D++  MI+G 
Sbjct: 414 EKMKRRGC---HPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGF 473

Query: 445 CEKQRWREACQFFVEMIERGFL--PQKVTFEMLYRGLIQSDML 477
             +    EAC  F EM+ RG    PQ  T + L   L++ D L
Sbjct: 474 TSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKL 500

BLAST of Cp4.1LG10g12370 vs. TAIR10
Match: AT5G65820.1 (AT5G65820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 200.3 bits (508), Expect = 2.9e-51
Identity = 119/453 (26.27%), Positives = 225/453 (49.67%), Query Frame = 1

Query: 33  EASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKIALSLFHYA 92
           E + D +   ++L   H+    +E +L    +     L+++ L R      +    F +A
Sbjct: 78  EFASDVEKSYRILRKFHSRVPKLELALNESGVELRPGLIERVLNRCGDAGNLGYRFFVWA 137

Query: 93  KSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKGT-----ETFLLLIRRLISAG 152
              P    S   Y  ++ IL+K+RQF A W LI +M  +       E F++L++R  SA 
Sbjct: 138 AKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASAD 197

Query: 153 LTRQAVRAFDDIEGLTGNKVRTDEFCY--LLDTLCKYGYVKVAAEVFNKRKAEFHVDVKI 212
           + ++A+   D++          DE+ +  LLD LCK+G VK AA++F   +  F V+++ 
Sbjct: 198 MVKKAIEVLDEMPKFG---FEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLRY 257

Query: 213 YTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPEGRFEQTIR 272
           +T L+YGWC++G+   A+  L  M   G EP++V Y  LL+G      +           
Sbjct: 258 FTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKM----------A 317

Query: 273 TAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPTVATYTSVI 332
            A  +  +MR+RG +P+   +++++    +  + E ++    +M+       V TYT+++
Sbjct: 318 DAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALV 377

Query: 333 KCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYKKMREDCLY 392
              C  G++++   ++ +M++ G+ PS  TY      +  ++     L L +KMR+  + 
Sbjct: 378 SGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQ--IE 437

Query: 393 AAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCEKQRWREAC 452
             P +  YN+++ L   L + +    +WN+M+E+G+ P +D++  MI+GL  +    EA 
Sbjct: 438 YHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLASQGCLLEAS 497

Query: 453 QFFVEMIERGF--LPQKVTFEMLYRGLIQSDML 477
             F EM+ RG   + Q  T ++L   +++   L
Sbjct: 498 DHFKEMVTRGLFSVSQYGTLKLLLNTVLKDKKL 515

BLAST of Cp4.1LG10g12370 vs. NCBI nr
Match: gi|449457341|ref|XP_004146407.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial [Cucumis sativus])

HSP 1 Score: 914.1 bits (2361), Expect = 1.1e-262
Identity = 448/507 (88.36%), Positives = 474/507 (93.49%), Query Frame = 1

Query: 1   MAYRNLHGCKLKSPLPSAISSLRSLCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQ 60
           MA+  LH   L SPLPS ISS+R L SLLPQI+ S+DADLVSQ+L+HHHNPFH+MESSLQ
Sbjct: 1   MAFTKLHSSSLISPLPSIISSIRFLSSLLPQIQPSKDADLVSQILLHHHNPFHSMESSLQ 60

Query: 61  LHSISFSSHLLDQTLLRLTHHSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDA 120
           LHSISFSSHLLDQTLLRLTHHSKIALS F YA SLP NP+ST+SYNIL+DILAKVRQFDA
Sbjct: 61  LHSISFSSHLLDQTLLRLTHHSKIALSFFDYANSLPSNPISTTSYNILLDILAKVRQFDA 120

Query: 121 AWHLILQMDHKGTETFLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLC 180
           AWHLILQMDHKGT+TFLLLIRRLIS+G TRQA+RAFDDIEGLTGNKV  D+FCYLLD LC
Sbjct: 121 AWHLILQMDHKGTDTFLLLIRRLISSGRTRQAIRAFDDIEGLTGNKVGIDDFCYLLDVLC 180

Query: 181 KYGYVKVAAEVFNKRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVT 240
           KYGYVKVA EVFNKRK EF VDVKIYT+LIYGWCKIGRFEMAERFLKDM+ RGIEPNVVT
Sbjct: 181 KYGYVKVAVEVFNKRKEEFGVDVKIYTILIYGWCKIGRFEMAERFLKDMVERGIEPNVVT 240

Query: 241 YNVLLNGVCRRASLHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPE 300
           YNVLLNGVCRRASLHPEGRFE+TIR AEKVFDEMR+RGI+PDVTSFSIVLHVYSRAHKPE
Sbjct: 241 YNVLLNGVCRRASLHPEGRFEKTIRHAEKVFDEMRKRGIEPDVTSFSIVLHVYSRAHKPE 300

Query: 301 LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFF 360
           LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLE+ ENL+ EMVR+GISPSP TYNCFF
Sbjct: 301 LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLEDGENLIEEMVRSGISPSPTTYNCFF 360

Query: 361 KEYRGRKDGAGALRLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESG 420
           KEYRGRKDGAGALRLYKKMREDCL  APSLHTYNILL+LFLNLDKKETLKE+WNDMKESG
Sbjct: 361 KEYRGRKDGAGALRLYKKMREDCL-CAPSLHTYNILLALFLNLDKKETLKELWNDMKESG 420

Query: 421 IGPDLDSYTTMIHGLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR 480
           +GPDLDSYTT+IHGLCEKQRW EACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR
Sbjct: 421 VGPDLDSYTTIIHGLCEKQRWSEACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR 480

Query: 481 RLKKKLEEESITYASEFKNYHIKPYRR 508
           RLKKKLEEES TY SE KNYHIKPY R
Sbjct: 481 RLKKKLEEESKTYGSELKNYHIKPYMR 506

BLAST of Cp4.1LG10g12370 vs. NCBI nr
Match: gi|659082952|ref|XP_008442112.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial-like [Cucumis melo])

HSP 1 Score: 914.1 bits (2361), Expect = 1.1e-262
Identity = 449/507 (88.56%), Positives = 474/507 (93.49%), Query Frame = 1

Query: 1   MAYRNLHGCKLKSPLPSAISSLRSLCSLLPQIEASRDADLVSQVLVHHHNPFHAMESSLQ 60
           MA+  LH   L SPLPS IS +R L SLLPQI+ S DADLVSQ+L+HHHNPFH+MESSLQ
Sbjct: 1   MAFAKLHCSTLISPLPSTISFIRFLSSLLPQIQPSEDADLVSQILLHHHNPFHSMESSLQ 60

Query: 61  LHSISFSSHLLDQTLLRLTHHSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDA 120
           LHSISFSSHLLDQTLLRLTHHSKIALS F YA +LP NP+ST+SYNIL+DILAKVRQFDA
Sbjct: 61  LHSISFSSHLLDQTLLRLTHHSKIALSFFDYANTLPSNPISTTSYNILLDILAKVRQFDA 120

Query: 121 AWHLILQMDHKGTETFLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLC 180
           AWHLILQMDHKGT+TFLLLIRRLIS+G TRQA+RAFDDIEGLTGNKV  D+FCYLLD LC
Sbjct: 121 AWHLILQMDHKGTDTFLLLIRRLISSGRTRQAIRAFDDIEGLTGNKVGIDDFCYLLDVLC 180

Query: 181 KYGYVKVAAEVFNKRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVT 240
           KYGYVKVA EVFNKRK EF VDVKIYTVLIYGWCKIGRFEMAERFLKDM+ RGIEPNVVT
Sbjct: 181 KYGYVKVAVEVFNKRKEEFGVDVKIYTVLIYGWCKIGRFEMAERFLKDMVERGIEPNVVT 240

Query: 241 YNVLLNGVCRRASLHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPE 300
           YNVLLNGVCRRASLHPEGRFE+TIR AEKVFDEMR+RGI+PDVTSFSIVLHVYSRAHKPE
Sbjct: 241 YNVLLNGVCRRASLHPEGRFEKTIRNAEKVFDEMRKRGIEPDVTSFSIVLHVYSRAHKPE 300

Query: 301 LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFF 360
           LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLE+AENLVGEMVR+GISPSP TYNCFF
Sbjct: 301 LSLDKLKQMKELGISPTVATYTSVIKCLCSCGRLEDAENLVGEMVRSGISPSPTTYNCFF 360

Query: 361 KEYRGRKDGAGALRLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESG 420
           KEYRGRKDGAGALRLYK+MREDCL  APSLHTYNILL+LFLNLDKK+TLKE+WNDMK SG
Sbjct: 361 KEYRGRKDGAGALRLYKRMREDCL-CAPSLHTYNILLALFLNLDKKQTLKELWNDMKASG 420

Query: 421 IGPDLDSYTTMIHGLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR 480
           +GPDLDSYTTMIHGLCEKQRW EACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR
Sbjct: 421 VGPDLDSYTTMIHGLCEKQRWSEACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWR 480

Query: 481 RLKKKLEEESITYASEFKNYHIKPYRR 508
           RLKKKLEEES TY SEFKNYHIKPY R
Sbjct: 481 RLKKKLEEESKTYGSEFKNYHIKPYMR 506

BLAST of Cp4.1LG10g12370 vs. NCBI nr
Match: gi|1009146967|ref|XP_015891158.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial-like [Ziziphus jujuba])

HSP 1 Score: 740.7 bits (1911), Expect = 1.7e-210
Identity = 359/498 (72.09%), Positives = 427/498 (85.74%), Query Frame = 1

Query: 15  LPSAISSLRSLCS-LLPQIEASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQ 74
           LPS++    SL    LP ++ S DA+LVS +L+ HHNPFHAMESSL+LH +S S+HLL Q
Sbjct: 22  LPSSLRPFSSLSDDALPSLQPSNDAELVSNILLQHHNPFHAMESSLELHGVSLSTHLLHQ 81

Query: 75  TLLRLTHHSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKGT 134
           TLLR+ H SKIALS FHY+KSLP  PL+ +SYN+LIDILAKVRQFD AW LI++MD    
Sbjct: 82  TLLRIRHSSKIALSFFHYSKSLPTPPLTATSYNLLIDILAKVRQFDVAWQLIVEMDQNKL 141

Query: 135 ----ETFLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAA 194
                 FL+LIRRLISAGLTRQAVRAF+DIE   G K  T++F +LLDTLCKYGYVKVAA
Sbjct: 142 CPPPTPFLILIRRLISAGLTRQAVRAFNDIESFIGTKPTTEDFRFLLDTLCKYGYVKVAA 201

Query: 195 EVFNKRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVC 254
           EVFN+RK  F  DVK+YTV++YGWCKIGR +MAERFL+DMIGRGIEPNVVTYNVLLNG+C
Sbjct: 202 EVFNERKHGFVPDVKMYTVMVYGWCKIGRVDMAERFLRDMIGRGIEPNVVTYNVLLNGIC 261

Query: 255 RRASLHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQM 314
           RRASLHPE RFE+TIR+A+KVFDEM +RGI+PDVTS+SIVLHVYSRAHKP+LSLDKLK M
Sbjct: 262 RRASLHPEERFERTIRSADKVFDEMWERGIEPDVTSYSIVLHVYSRAHKPQLSLDKLKLM 321

Query: 315 KELGISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDG 374
           +E GISPTVATYTSV+KCL SCGRL++AE L+ EMV NG+SP  ATYNCFFKEYRGRKD 
Sbjct: 322 RERGISPTVATYTSVVKCLSSCGRLDDAEELLSEMVSNGVSPCAATYNCFFKEYRGRKDT 381

Query: 375 AGALRLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYT 434
            GAL+LYKKM++D L   P++HTYNIL+ +FL L++ E +KE+WNDMK++G GPDLDSYT
Sbjct: 382 DGALKLYKKMKQDGL-CMPNMHTYNILVGMFLTLNRMEIVKELWNDMKDNGTGPDLDSYT 441

Query: 435 TMIHGLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEE 494
            +IHGLCEK++WR+AC+FFVEMIE+GFLPQKVTFE LYRGLIQS+ LRTWRRLKKKL++E
Sbjct: 442 VLIHGLCEKKKWRKACKFFVEMIEKGFLPQKVTFETLYRGLIQSNKLRTWRRLKKKLDQE 501

Query: 495 SITYASEFKNYHIKPYRR 508
           SIT+ SEF++YH+KPYRR
Sbjct: 502 SITFGSEFQSYHLKPYRR 518

BLAST of Cp4.1LG10g12370 vs. NCBI nr
Match: gi|658015096|ref|XP_008342880.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial-like [Malus domestica])

HSP 1 Score: 734.9 bits (1896), Expect = 9.2e-209
Identity = 355/497 (71.43%), Positives = 421/497 (84.71%), Query Frame = 1

Query: 22  LRSLCSLLPQI-------EASRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQT 81
           +R  CS  P +       + + DA+L+S++LVHHHNPFHAMESSLQLH I+ SS L+  T
Sbjct: 30  IRRSCSSFPNLNDPNGDTKDANDAELISKILVHHHNPFHAMESSLQLHGITLSSQLVHHT 89

Query: 82  LLRLTHHSKIALSLFHYAKSLPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKG-- 141
           LLRLT++SKIAL+ FHYAKSLP  PL+T+S+N+LIDILAKVRQ+D AW LI++MD+    
Sbjct: 90  LLRLTNNSKIALAFFHYAKSLPNPPLTTASFNLLIDILAKVRQYDVAWQLIVEMDNFNLT 149

Query: 142 --TETFLLLIRRLISAGLTRQAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAAE 201
               TFL LIRRLIS+GLTRQA+RAFDDIE     K  + +FC+LLDTLCKYG+VKVA E
Sbjct: 150 PTATTFLTLIRRLISSGLTRQAIRAFDDIETFVQTKPSSQDFCFLLDTLCKYGHVKVATE 209

Query: 202 VFNKRKAEFHVDVKIYTVLIYGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCR 261
           VFNK+K EF  DVK+YTVLIYGWCKIGRF+MAERFL+DMI   IEPNVVTYNV LNG+CR
Sbjct: 210 VFNKKKHEFVPDVKMYTVLIYGWCKIGRFDMAERFLRDMIEHSIEPNVVTYNVFLNGICR 269

Query: 262 RASLHPEGRFEQTIRTAEKVFDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMK 321
           RASLHP+ RFE+TIR AEKVF+EM +RGI+PDVTSFSIVLHVYSRAHKPELSLDKLK M+
Sbjct: 270 RASLHPQERFEKTIRNAEKVFEEMWKRGIEPDVTSFSIVLHVYSRAHKPELSLDKLKLMR 329

Query: 322 ELGISPTVATYTSVIKCLCSCGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGA 381
           E GI PTV TYTSV+KCLCSCGRLE+AE L+GEMVR+G+SP  ATYNCFFKEYRGRK+G 
Sbjct: 330 ERGIFPTVETYTSVVKCLCSCGRLEDAEELLGEMVRSGVSPCAATYNCFFKEYRGRKNGE 389

Query: 382 GALRLYKKMREDCLYAAPSLHTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTT 441
           GAL+LY+KM+ED     PS+HTYNIL+ + L L++ E +KE+WNDMKESG+GPDLDSYT 
Sbjct: 390 GALKLYRKMKEDG-XCVPSMHTYNILVGMLLELNRMEIVKEIWNDMKESGVGPDLDSYTM 449

Query: 442 MIHGLCEKQRWREACQFFVEMIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEES 501
           +IHGLC KQ+WREACQFFVEMIE+G LPQK+TFE LY+GLIQSDMLRTWRRLKKKL+EES
Sbjct: 450 LIHGLCAKQKWREACQFFVEMIEKGLLPQKITFETLYKGLIQSDMLRTWRRLKKKLDEES 509

Query: 502 ITYASEFKNYHIKPYRR 508
           I++ SEF+ YH+KP+RR
Sbjct: 510 ISFGSEFQKYHLKPFRR 525

BLAST of Cp4.1LG10g12370 vs. NCBI nr
Match: gi|694326786|ref|XP_009354296.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial-like [Pyrus x bretschneideri])

HSP 1 Score: 733.0 bits (1891), Expect = 3.5e-208
Identity = 353/477 (74.00%), Positives = 413/477 (86.58%), Query Frame = 1

Query: 35  SRDADLVSQVLVHHHNPFHAMESSLQLHSISFSSHLLDQTLLRLTHHSKIALSLFHYAKS 94
           + DA+L+S++LVHHHNPFHAMESSLQLH I+ SS L+  TLLRLT++SKIAL+ F YAKS
Sbjct: 50  ANDAELISKILVHHHNPFHAMESSLQLHGITLSSQLVHHTLLRLTNNSKIALAFFQYAKS 109

Query: 95  LPFNPLSTSSYNILIDILAKVRQFDAAWHLILQMDHKG----TETFLLLIRRLISAGLTR 154
           LP  PL+T+S+N+LIDILAKVRQ+D AW LI++MD+      T TFL LIRRLIS+GLTR
Sbjct: 110 LPNPPLTTASFNLLIDILAKVRQYDVAWQLIVEMDNFNLTPTTATFLTLIRRLISSGLTR 169

Query: 155 QAVRAFDDIEGLTGNKVRTDEFCYLLDTLCKYGYVKVAAEVFNKRKAEFHVDVKIYTVLI 214
           QAVRAFDDIE     K    +FC+LLDTLCKYG+VKVA EVFNKRK EF  DVK+YTVLI
Sbjct: 170 QAVRAFDDIETFVQTKPSCQDFCFLLDTLCKYGHVKVATEVFNKRKHEFVPDVKMYTVLI 229

Query: 215 YGWCKIGRFEMAERFLKDMIGRGIEPNVVTYNVLLNGVCRRASLHPEGRFEQTIRTAEKV 274
           YGWCKIGRF+MAERFL DMI RGIEPNVVTYNV LNG+CRRASLHP+ RFE+TIR AEKV
Sbjct: 230 YGWCKIGRFDMAERFLSDMIERGIEPNVVTYNVFLNGICRRASLHPQERFEKTIRNAEKV 289

Query: 275 FDEMRQRGIDPDVTSFSIVLHVYSRAHKPELSLDKLKQMKELGISPTVATYTSVIKCLCS 334
           F+EMR+RGI+PDVTSFSIVLHVYSRAHKPELSLDKLK M+E GI PTV TYTSV+KCLCS
Sbjct: 290 FEEMRKRGIEPDVTSFSIVLHVYSRAHKPELSLDKLKLMRERGIFPTVETYTSVVKCLCS 349

Query: 335 CGRLEEAENLVGEMVRNGISPSPATYNCFFKEYRGRKDGAGALRLYKKMREDCLYAAPSL 394
           CGRLE+AE L+G+MVR+G+SP  ATYNCFFKEYRGRK+G  AL+LY+KM+ED L   PS+
Sbjct: 350 CGRLEDAEELLGDMVRSGVSPCAATYNCFFKEYRGRKNGECALKLYRKMKEDGL-CMPSM 409

Query: 395 HTYNILLSLFLNLDKKETLKEVWNDMKESGIGPDLDSYTTMIHGLCEKQRWREACQFFVE 454
           HTYNIL+ + L L++ E + E+WNDMKE G+GPDLDSYT +IHGLC KQ+WREACQFFVE
Sbjct: 410 HTYNILVGMLLELNRMEIVNEIWNDMKEGGVGPDLDSYTMLIHGLCAKQKWREACQFFVE 469

Query: 455 MIERGFLPQKVTFEMLYRGLIQSDMLRTWRRLKKKLEEESITYASEFKNYHIKPYRR 508
           MIE+G LPQK+TFE LY+GLIQSDMLRTWRRLKKKL+EESI++ SEF+ YH+KP+RR
Sbjct: 470 MIEKGLLPQKITFETLYKGLIQSDMLRTWRRLKKKLDEESISFGSEFQKYHLKPFRR 525

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PP150_ARATH  1.5e-187  66.04         Pentatricopeptide repeat-containing protein At2g13420, mitochondrial OS=Arabidop...
PPR54_ARATH  3.8e-53   30.84         Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidop...
PPR78_ARATH  1.4e-52   30.33         Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidop...
PP112_ARATH  1.0e-50   28.51         Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidop...
PP275_ARATH  5.1e-50   27.43         Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity (%)  Description
A0A0A0KZ73_CUCSA  7.7e-263  88.36         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G506840 PE=4 SV=1
M5VHD4_PRUPE      1.5e-205  72.96         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015078mg PE=4 SV=1
V4T738_9ROSI      2.9e-201  69.43         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000792mg PE=4 SV=1
B9RFN9_RICCO      1.9e-200  67.66         Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
A0A067KE50_JATCU  5.4e-200  67.65         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12786 PE=4 SV=1

Match Name   E-value  Identity (%)  Description
AT1G20300.1  2.1e-54  30.84         Pentatricopeptide repeat (PPR) superfamily protein
AT1G52640.1  8.1e-54  30.33         Pentatricopeptide repeat (PPR) superfamily protein
AT1G71060.1  5.8e-52  28.51         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G49730.1  2.9e-51  27.43         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G65820.1  2.9e-51  26.27         Pentatricopeptide repeat (PPR) superfamily protein

Match Name                         E-value   Identity (%)  Description
gi|449457341|ref|XP_004146407.1|   1.1e-262  88.36         PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial ...
gi|659082952|ref|XP_008442112.1|   1.1e-262  88.56         PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial-...
gi|1009146967|ref|XP_015891158.1|  1.7e-210  72.09         PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial-...
gi|658015096|ref|XP_008342880.1|   9.2e-209  71.43         PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial-...
gi|694326786|ref|XP_009354296.1|   3.5e-208  74.00         PREDICTED: pentatricopeptide repeat-containing protein At2g13420, mitochondrial-...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g12370.1  Cp4.1LG10g12370.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term      Source Description
    Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR
    coord: 104..132, score: 0.01
    coord: 285..314, score:
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2
    coord: 316..362, score: 7.2E-13
    coord: 202..250, score: 3.1E-15
    coord: 388..436, score: 9.1
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756
    coord: 427..458, score: 1.9E-9
    coord: 285..318, score: 6.4E-4
    coord: 320..352, score: 1.0E-9
    coord: 205..238, score: 2.1E-8
    coord: 104..132, score: 1.0E-4
    coord: 391..424, score: 9.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR
    coord: 317..351, score: 13.187
    coord: 389..423, score: 9.778
    coord: 352..386, score: 7.037
    coord: 237..281, score: 8.572
    coord: 202..236, score: 12.43
    coord: 424..458, score: 12.978
    coord: 168..198, score: 5.207
    coord: 101..135, score: 9.449
    coord: 282..316, score: 10
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED
    coord: 10..500, score: 1.3E
None       No IPR available          PANTHER   PTHR24015:SF516  SUBFAMILY NOT NAMED
    coord: 10..500, score: 1.3E

The following gene(s) are paralogous to this gene:

None