Cp4.1LG04g07750 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG04g07750
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG04 : 3919784 .. 3920803 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGATAAGGTTGTTCTTCCCCTCCTCCTCCCCAATCCACCGCCCTCCAAACCCCTCATCTCCGTCTTCCACCACCAGCCCCCTTCGCCCTCTTCCCCGCATACAACACTATCCTTTCCTCCGCCGCAGCCGCAGCCGCTCTCTTCCCCATCATCCCCCATAGCCCCTCTTCTCCAAGGCATCCTTCATCCCCACCAAGACCCCTCTTCCCCTCAAACCCATAACCCCAAATCCATTTTCAGAACCCACACCCGAAAAGGCCGATCCCGCGGGAACCCATGGTCGCACCACCGTCTCTCTACCAAGGGTAAACAAATTCTCGATTCTTTACTCAACCCAGAATTAGATTCCTCCTATTTTAACGAAATTTTGCTTCAATTCTTTGAAACCAGTCCCGTAGAGCTTAATTTCACCCCTGAATCTGTTTCATTCGACATTTTGGGGATAATCAAGGGCTTAGTGTTTCGCAATAAGAATGAAGTAGCCTTGCGTAATCAGAAGGATTTTGCATCGATTTTGAATAGCTCTGTCATTGCTGTGATTATTAGTGTTCTTGGTAAACAGGGTCGGGCTTCTTTTGCAGCTTCTTTGCTTCATGAGCTTCGAAACAATGGACTAATTATTGACATTTATGCTTATACTTCTTTGATAACTGCTTATGCTAGCAATGGTAGATATAGAGAGGCTGTGATGTTGTTTAACAAGCTAGAACAAGAAGGTTATACACCAACTTTAATTACTTATAATATCATCTTGAATGCCTATGGGAAAATGGGTATGCCTTGGAGTAAAATTGCTGCCATTGTTGATAGCATGAAGAGTTTCGGGGTTGCCCCAGATTTGTGTACATATAATATGCTTATTAGCAGTTGTTGCAGAGGGTCATTGTATAAAGAAGCAGCAGAGGTGTTTGAAGAAATGAAAGCAGCTGGGTTTATTCCTGATAAGGTTACTTACAATGCGTTGTTGGATGTGTATGGGAAGTCTCGACGACCTAAGGAGCGATGGAGGTTTTGA

mRNA sequence

ATGGCGGATAAGGTTGTTCTTCCCCTCCTCCTCCCCAATCCACCGCCCTCCAAACCCCTCATCTCCGTCTTCCACCACCAGCCCCCTTCGCCCTCTTCCCCGCATACAACACTATCCTTTCCTCCGCCGCAGCCGCAGCCGCTCTCTTCCCCATCATCCCCCATAGCCCCTCTTCTCCAAGGCATCCTTCATCCCCACCAAGACCCCTCTTCCCCTCAAACCCATAACCCCAAATCCATTTTCAGAACCCACACCCGAAAAGGCCGATCCCGCGGGAACCCATGGTCGCACCACCGTCTCTCTACCAAGGGTAAACAAATTCTCGATTCTTTACTCAACCCAGAATTAGATTCCTCCTATTTTAACGAAATTTTGCTTCAATTCTTTGAAACCAGTCCCGTAGAGCTTAATTTCACCCCTGAATCTGTTTCATTCGACATTTTGGGGATAATCAAGGGCTTAGTGTTTCGCAATAAGAATGAAGTAGCCTTGCGTAATCAGAAGGATTTTGCATCGATTTTGAATAGCTCTGTCATTGCTGTGATTATTAGTGTTCTTGGTAAACAGGGTCGGGCTTCTTTTGCAGCTTCTTTGCTTCATGAGCTTCGAAACAATGGACTAATTATTGACATTTATGCTTATACTTCTTTGATAACTGCTTATGCTAGCAATGGTAGATATAGAGAGGCTGTGATGTTGTTTAACAAGCTAGAACAAGAAGGTTATACACCAACTTTAATTACTTATAATATCATCTTGAATGCCTATGGGAAAATGGGTATGCCTTGGAGTAAAATTGCTGCCATTGTTGATAGCATGAAGAGTTTCGGGGTTGCCCCAGATTTGTGTACATATAATATGCTTATTAGCAGTTGTTGCAGAGGGTCATTGTATAAAGAAGCAGCAGAGGTGTTTGAAGAAATGAAAGCAGCTGGGTTTATTCCTGATAAGGTTACTTACAATGCGTTGTTGGATGTGTATGGGAAGTCTCGACGACCTAAGGAGCGATGGAGGTTTTGA

Coding sequence (CDS)

ATGGCGGATAAGGTTGTTCTTCCCCTCCTCCTCCCCAATCCACCGCCCTCCAAACCCCTCATCTCCGTCTTCCACCACCAGCCCCCTTCGCCCTCTTCCCCGCATACAACACTATCCTTTCCTCCGCCGCAGCCGCAGCCGCTCTCTTCCCCATCATCCCCCATAGCCCCTCTTCTCCAAGGCATCCTTCATCCCCACCAAGACCCCTCTTCCCCTCAAACCCATAACCCCAAATCCATTTTCAGAACCCACACCCGAAAAGGCCGATCCCGCGGGAACCCATGGTCGCACCACCGTCTCTCTACCAAGGGTAAACAAATTCTCGATTCTTTACTCAACCCAGAATTAGATTCCTCCTATTTTAACGAAATTTTGCTTCAATTCTTTGAAACCAGTCCCGTAGAGCTTAATTTCACCCCTGAATCTGTTTCATTCGACATTTTGGGGATAATCAAGGGCTTAGTGTTTCGCAATAAGAATGAAGTAGCCTTGCGTAATCAGAAGGATTTTGCATCGATTTTGAATAGCTCTGTCATTGCTGTGATTATTAGTGTTCTTGGTAAACAGGGTCGGGCTTCTTTTGCAGCTTCTTTGCTTCATGAGCTTCGAAACAATGGACTAATTATTGACATTTATGCTTATACTTCTTTGATAACTGCTTATGCTAGCAATGGTAGATATAGAGAGGCTGTGATGTTGTTTAACAAGCTAGAACAAGAAGGTTATACACCAACTTTAATTACTTATAATATCATCTTGAATGCCTATGGGAAAATGGGTATGCCTTGGAGTAAAATTGCTGCCATTGTTGATAGCATGAAGAGTTTCGGGGTTGCCCCAGATTTGTGTACATATAATATGCTTATTAGCAGTTGTTGCAGAGGGTCATTGTATAAAGAAGCAGCAGAGGTGTTTGAAGAAATGAAAGCAGCTGGGTTTATTCCTGATAAGGTTACTTACAATGCGTTGTTGGATGTGTATGGGAAGTCTCGACGACCTAAGGAGCGATGGAGGTTTTGA

Protein sequence

MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQGILHPHQDPSSPQTHNPKSIFRTHTRKGRSRGNPWSHHRLSTKGKQILDSLLNPELDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVALRNQKDFASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLISSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKERWRF
BLAST of Cp4.1LG04g07750 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 2.3e-83
Identity = 177/346 (51.16%), Postives = 235/346 (67.92%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MADK+ LPLLLP  P SKP     +H         T+LS PPP P         + PLL 
Sbjct: 1   MADKLALPLLLPCTPSSKPYSHDQNHHISRTPFLTTSLSSPPPPP---------VEPLLH 60

Query: 61  GILHPHQDPSSPQTHNPK-SIFRTHTRKGRSR----GNPWSHHRLSTKGKQILDSLLNPE 120
            +   HQ+P+S Q  + + S  R  TR G+SR    G PWS+H LS +G+Q+L SL+ P 
Sbjct: 61  DVFL-HQNPNSRQPISSQTSRNRNRTRIGKSRDPNLGKPWSYHGLSPQGQQVLRSLIEPN 120

Query: 121 LDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVALR------NQKD 180
            DS   + +L + FE         PES S ++L  +KGL F  K ++ALR       QKD
Sbjct: 121 FDSGQLDSVLSELFEP----FKDKPESTSSELLAFLKGLGFHKKFDLALRAFDWFMKQKD 180

Query: 181 FASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYRE 240
           + S+L++SV+A+IIS+LGK+GR S AA++ + L+ +G  +D+Y+YTSLI+A+A++GRYRE
Sbjct: 181 YQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFANSGRYRE 240

Query: 241 AVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLI 300
           AV +F K+E++G  PTLITYN+ILN +GKMG PW+KI ++V+ MKS G+APD  TYN LI
Sbjct: 241 AVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLVEKMKSDGIAPDAYTYNTLI 300

Query: 301 SSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           + C RGSL++EAA+VFEEMKAAGF  DKVTYNALLDVYGKS RPKE
Sbjct: 301 TCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKE 332

BLAST of Cp4.1LG04g07750 vs. Swiss-Prot
Match: PP163_ARATH (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 140.6 bits (353), Expect = 3.2e-32
Identity = 103/328 (31.40%), Postives = 167/328 (50.91%), Query Frame = 1

Query: 10  LLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQGILHPHQDP 69
           L P  PPS PL S+ HH           LS PPP+    ++   P   +           
Sbjct: 38  LPPPSPPSFPLDSLLHHL--------VHLSSPPPRHSNSAAARFPSLEV----------- 97

Query: 70  SSPQTHNPKSIFRTHTRKGRSRGNPWSHHRLSTKGKQILDSLLNPELDSSYFNEILLQFF 129
            S  + + K I        R+     S   L  K   +++S++   L        L +FF
Sbjct: 98  -STDSSSSKPILGIEIENERNG----SLKLLCKKEVVLVNSIVEQPLTG------LSRFF 157

Query: 130 ETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL-------RNQKDFASILNSSVIAVI 189
           ++   EL  T      D++ ++KGL      E A+        +    A  L+  VI + 
Sbjct: 158 DSVKSELLRT------DLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIF 217

Query: 190 ISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLFNKLEQEGY 249
           + +LG++ + S AA LL ++     ++D+ AYT+++ AY+  G+Y +A+ LF ++++ G 
Sbjct: 218 VRILGRESQYSVAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGP 277

Query: 250 TPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLISSCCRGSLYKEAA 309
           +PTL+TYN+IL+ +GKMG  W KI  ++D M+S G+  D  T + ++S+C R  L +EA 
Sbjct: 278 SPTLVTYNVILDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSACAREGLLREAK 329

Query: 310 EVFEEMKAAGFIPDKVTYNALLDVYGKS 331
           E F E+K+ G+ P  VTYNALL V+GK+
Sbjct: 338 EFFAELKSCGYEPGTVTYNALLQVFGKA 329

BLAST of Cp4.1LG04g07750 vs. Swiss-Prot
Match: PP124_ARATH (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 123.6 bits (309), Expect = 4.0e-27
Identity = 54/158 (34.18%), Postives = 94/158 (59.49%), Query Frame = 1

Query: 175 NSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLF 234
           N  +  ++IS+LG++G       +  E+ + G+   +++YT+LI AY  NGRY  ++ L 
Sbjct: 140 NEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSLELL 199

Query: 235 NKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLISSCCR 294
           ++++ E  +P+++TYN ++NA  + G+ W  +  +   M+  G+ PD+ TYN L+S+C  
Sbjct: 200 DRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDIVTYNTLLSACAI 259

Query: 295 GSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRR 333
             L  EA  VF  M   G +PD  TY+ L++ +GK RR
Sbjct: 260 RGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRR 297

BLAST of Cp4.1LG04g07750 vs. Swiss-Prot
Match: PP178_ARATH (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana GN=At2g31400 PE=2 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 1.8e-22
Identity = 89/317 (28.08%), Postives = 148/317 (46.69%), Query Frame = 1

Query: 24  FHHQPPSPSSPHTTLSFPPPQ------PQPLSSPSSPIAPLLQGILHPHQDPSSPQTHNP 83
           ++H+P   SS     S PPP       P  LS P     P    +  P  D SS  +   
Sbjct: 87  YNHRPYGASSSPRG-SAPPPSSVATVAPAQLSQP-----PNFSPLQTPKSDLSSDFSGRR 146

Query: 84  KSIFRTHTRKGRSRGNPWSHHRLSTKGKQILDSLLNPELDSSYFNEILLQFFETSPVELN 143
            + F +    GR +    + H  S+  +  L + ++   D   F+ ++L F         
Sbjct: 147 STRFVSKMHFGRQKTTMATRH--SSAAEDALQNAIDFSGDDEMFHSLMLSFESKL----- 206

Query: 144 FTPESVSFDILGIIKGLVFRNKNEVAL-----RNQKDFASILNSSVIAVIISVLGKQGRA 203
                 S D   II+ L  RN+ + A+       +++        + + +IS LG+ G+ 
Sbjct: 207 ----CGSDDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKV 266

Query: 204 SFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLFNKLEQEGYTPTLITYNII 263
           + A  +       G    +YA+++LI+AY  +G + EA+ +FN +++ G  P L+TYN +
Sbjct: 267 TIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAV 326

Query: 264 LNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLISSCCRGSLYKEAAEVFEEMKAAG 323
           ++A GK GM + ++A   D M+  GV PD  T+N L++ C RG L++ A  +F+EM    
Sbjct: 327 IDACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRR 386

Query: 324 FIPDKVTYNALLDVYGK 330
              D  +YN LLD   K
Sbjct: 387 IEQDVFSYNTLLDAICK 386

BLAST of Cp4.1LG04g07750 vs. Swiss-Prot
Match: PP342_ARATH (Pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Arabidopsis thaliana GN=At4g30825 PE=2 SV=2)

HSP 1 Score: 99.0 bits (245), Expect = 1.1e-19
Identity = 56/194 (28.87%), Postives = 106/194 (54.64%), Query Frame = 1

Query: 134 VELNFTPESVSFDILGII--KGLVFRNKNEVALRNQKDFASILNSSVIAVIISVLGKQGR 193
           +   FTP +V+F++L  +  K  +F+  NE+ L  ++    +++      II+  GK   
Sbjct: 691 IRYGFTPNTVTFNVLLDVYGKAKLFKKVNELFLLAKRH--GVVDVISYNTIIAAYGKNKD 750

Query: 194 ASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLFNKLEQEGYTPTLITYNI 253
            +  +S +  ++ +G  + + AY +L+ AY  + +  +   +  ++++    P   TYNI
Sbjct: 751 YTNMSSAIKNMQFDGFSVSLEAYNTLLDAYGKDKQMEKFRSILKRMKKSTSGPDHYTYNI 810

Query: 254 ILNAYGKMGMPW-SKIAAIVDSMKSFGVAPDLCTYNMLISSCCRGSLYKEAAEVFEEMKA 313
           ++N YG+ G  W  ++A ++  +K  G+ PDLC+YN LI +   G + +EA  + +EM+ 
Sbjct: 811 MINIYGEQG--WIDEVADVLKELKESGLGPDLCSYNTLIKAYGIGGMVEEAVGLVKEMRG 870

Query: 314 AGFIPDKVTYNALL 325
              IPDKVTY  L+
Sbjct: 871 RNIIPDKVTYTNLV 880

BLAST of Cp4.1LG04g07750 vs. TrEMBL
Match: A0A0A0K4H7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G071530 PE=4 SV=1)

HSP 1 Score: 473.8 bits (1218), Expect = 1.8e-130
Identity = 263/346 (76.01%), Postives = 286/346 (82.66%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MADKV LPLLLPNPPPSK    VFHHQP SPSSP       PP P  LSS SSP+APLLQ
Sbjct: 1   MADKVSLPLLLPNPPPSKSHFPVFHHQPLSPSSPPPPPLTFPPTPH-LSSASSPLAPLLQ 60

Query: 61  GILHPHQDPSSP-QTHNPKSIFRTHTRKGRS----RGNPWSHHRLSTKGKQILDSLLNPE 120
            +L PHQ PSS  Q H PK  FRT TR GRS    RG PWSHHRLST+G++ILDSLLNPE
Sbjct: 61  DLL-PHQHPSSSTQPHLPKPTFRTRTRIGRSHDPNRGKPWSHHRLSTQGQRILDSLLNPE 120

Query: 121 LDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL------RNQKD 180
            DSS  +EILLQ FETS   LNFT +SVSFDILGIIKGLVF  KNE+AL      RN++D
Sbjct: 121 FDSSSLDEILLQLFETSSDGLNFTSDSVSFDILGIIKGLVFYKKNELALCVFYFVRNRED 180

Query: 181 FASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYRE 240
           FASIL++SV+AVIISVLGK+GRASFAASLLH+LRN+G+ IDIYAYTSLITAYASNGRYRE
Sbjct: 181 FASILSNSVVAVIISVLGKEGRASFAASLLHDLRNDGVHIDIYAYTSLITAYASNGRYRE 240

Query: 241 AVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLI 300
           AVM+F KLE+EG  PTLITYN+ILN YGKMGMPWSKIA +VDSMKS GVAPDL TYN LI
Sbjct: 241 AVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKIAGLVDSMKSSGVAPDLYTYNTLI 300

Query: 301 SSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           SSC RGSLY+EAAEVFEEMKAAGF PDKVTYNALLDVYGKSRRP+E
Sbjct: 301 SSCRRGSLYEEAAEVFEEMKAAGFSPDKVTYNALLDVYGKSRRPRE 344

BLAST of Cp4.1LG04g07750 vs. TrEMBL
Match: M5VJ93_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001449mg PE=4 SV=1)

HSP 1 Score: 347.4 bits (890), Expect = 1.9e-92
Identity = 195/348 (56.03%), Postives = 245/348 (70.40%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MA+++ LPLLL NPPPS       HHQ  +P++P      PPP P P   P  P+ PLLQ
Sbjct: 1   MAEQIALPLLLHNPPPSSRPFFQNHHQNQNPATPT-----PPPLPPP---PPMPVTPLLQ 60

Query: 61  GILHPHQDPSSPQTHNPKS---IFRTHTRKGRSR----GNPWSHHRLSTKGKQILDSLLN 120
            +L  H +PS+PQT NP S   + R  TR G+SR    G PWSHHRLS++G+ IL S L+
Sbjct: 61  ELLL-HPNPSTPQTQNPTSPPTLPRARTRIGKSRDSNRGKPWSHHRLSSQGQHILHSFLD 120

Query: 121 PELDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL------RNQ 180
           P+ DSS  +E LL   +    E   + +S+S D+LGI+KGL F  K ++A+      + +
Sbjct: 121 PQFDSSKLDEQLLGLVDLHRDEFGSSLDSLSLDVLGIVKGLGFHKKFDLAIDVFEWFKKR 180

Query: 181 KDFASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRY 240
           +D  SIL+ SV+AVIIS+LGK GR S A SL   L  +G  +D+YAYTSLITA ASNGRY
Sbjct: 181 EDCDSILSGSVVAVIISILGKVGRVSSATSLFQSLHKDGFALDVYAYTSLITACASNGRY 240

Query: 241 REAVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNM 300
           REAV +F K+E+EG  PTLITYN+ILN YGKMGMPW+KI A+V+ MKS G+APD  TYN 
Sbjct: 241 REAVTVFKKMEEEGCMPTLITYNVILNVYGKMGMPWNKIRALVECMKSAGIAPDSYTYNT 300

Query: 301 LISSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           LI+ C RGSL+ EAAEVF+EMK+AG++PDKVTYNALLDVYGKSRR KE
Sbjct: 301 LITCCRRGSLHVEAAEVFQEMKSAGYVPDKVTYNALLDVYGKSRRTKE 339

BLAST of Cp4.1LG04g07750 vs. TrEMBL
Match: B9RY68_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0810720 PE=4 SV=1)

HSP 1 Score: 336.3 bits (861), Expect = 4.4e-89
Identity = 189/351 (53.85%), Postives = 242/351 (68.95%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MA+KV LPLL+ NP P+KP      H    PSSP ++           ++ S P  PL+Q
Sbjct: 1   MAEKVALPLLISNPLPTKPQFPTQSHNLQQPSSPISS----------SNTSSPPFTPLIQ 60

Query: 61  GILHPHQ--DPSSPQTHNPKSIFRTHTR----KGRSRGNPWSHHRLSTKGKQILDSLLNP 120
            +L  H    P SP+  NP    R  TR    +  +RG PW+ HRLST G+Q+LDSL++P
Sbjct: 61  NMLINHHKIQPHSPKFINPNVSIRPRTRISKARDPNRGKPWASHRLSTLGQQVLDSLIDP 120

Query: 121 ELDSSYFNEILLQFFETSPVE----LNFTPESVSFDILGIIKGLVFRNKNEVAL------ 180
             + S  +++L Q FE    E     + T  S+S D+LGIIKGL F  K ++A+      
Sbjct: 121 CFEGSELDKVLSQLFEYYHKEELSLSSGTWNSLSMDVLGIIKGLGFYKKCDMAMSVFSWV 180

Query: 181 RNQKDFASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASN 240
           R ++DF S+LN SV+AVII++LGK+G+ S A+S+L+ LR +G  +D+YAYTSLITAYASN
Sbjct: 181 REREDFESVLNCSVVAVIITMLGKEGKVSAASSILNNLRKDGFDLDVYAYTSLITAYASN 240

Query: 241 GRYREAVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCT 300
           GRYR+AV++F K+E+EG  PTLITYN+ILN YGKMGMPWSKI+ +V  MKS GVAPD  T
Sbjct: 241 GRYRDAVLVFKKMEEEGCKPTLITYNVILNVYGKMGMPWSKISGLVHGMKSSGVAPDDYT 300

Query: 301 YNMLISSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           YN LIS C RGSLY+EAA+VFEEMK +GF PDKVT+N LLDVYGKSRRPKE
Sbjct: 301 YNTLISCCRRGSLYEEAAQVFEEMKLSGFSPDKVTFNTLLDVYGKSRRPKE 341

BLAST of Cp4.1LG04g07750 vs. TrEMBL
Match: A0A067F3R3_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003451mg PE=4 SV=1)

HSP 1 Score: 333.2 bits (853), Expect = 3.7e-88
Identity = 189/345 (54.78%), Postives = 232/345 (67.25%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLI--SVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPL 60
           MA  + LPLLLP PPP+KPL      HH  P PS P         Q  P +SP++ I+PL
Sbjct: 1   MAQNLSLPLLLPTPPPAKPLFLTQTNHHNLPPPSPPQQ-------QTTPSASPTT-ISPL 60

Query: 61  LQGILHPHQDPSSPQTHNPKSIFRTHTRKGRSRGNPWSHHRLSTKGKQILDSLLNPELDS 120
           LQ + +   + SS   H P+S  R    +  +RG PWSHHRLS KG+Q+L SL++   D 
Sbjct: 61  LQDLYN--NNSSSQPIHQPRSRTRLGKSRDSNRGKPWSHHRLSAKGQQVLQSLIDDSFDV 120

Query: 121 SYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVALRN--------QKDF 180
              + +L Q  + +P E +   E +  D+LGI+KGL F  K ++AL           KD 
Sbjct: 121 KDIDSVLSQLLDQNPGEKS---EDLGADLLGIVKGLGFHKKTDLALDVFEWFRSCCSKDG 180

Query: 181 ASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREA 240
             +L  SVIAV+IS+LGK+G+ S AASLLH L  +G  ID+YAYTSLIT YASNGRYREA
Sbjct: 181 NLVLRGSVIAVLISMLGKEGKVSVAASLLHGLHKDGFDIDVYAYTSLITTYASNGRYREA 240

Query: 241 VMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLIS 300
           VM+F K+E+EG  PTLITYN+ILN YGKMGMPW+KI A+V+ MKS GV PD  T+N LIS
Sbjct: 241 VMVFKKMEEEGCKPTLITYNVILNVYGKMGMPWNKIMALVEGMKSAGVKPDSYTFNTLIS 300

Query: 301 SCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
            C RGSL++EAA VFEEMK AGF PDKVTYNALLDVYGK RRPKE
Sbjct: 301 CCRRGSLHEEAAGVFEEMKLAGFSPDKVTYNALLDVYGKCRRPKE 332

BLAST of Cp4.1LG04g07750 vs. TrEMBL
Match: A0A067EVB9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003451mg PE=4 SV=1)

HSP 1 Score: 333.2 bits (853), Expect = 3.7e-88
Identity = 189/345 (54.78%), Postives = 232/345 (67.25%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLI--SVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPL 60
           MA  + LPLLLP PPP+KPL      HH  P PS P         Q  P +SP++ I+PL
Sbjct: 1   MAQNLSLPLLLPTPPPAKPLFLTQTNHHNLPPPSPPQQ-------QTTPSASPTT-ISPL 60

Query: 61  LQGILHPHQDPSSPQTHNPKSIFRTHTRKGRSRGNPWSHHRLSTKGKQILDSLLNPELDS 120
           LQ + +   + SS   H P+S  R    +  +RG PWSHHRLS KG+Q+L SL++   D 
Sbjct: 61  LQDLYN--NNSSSQPIHQPRSRTRLGKSRDSNRGKPWSHHRLSAKGQQVLQSLIDDSFDV 120

Query: 121 SYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVALRN--------QKDF 180
              + +L Q  + +P E +   E +  D+LGI+KGL F  K ++AL           KD 
Sbjct: 121 KDIDSVLSQLLDQNPGEKS---EDLGADLLGIVKGLGFHKKTDLALDVFEWFRSCCSKDG 180

Query: 181 ASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREA 240
             +L  SVIAV+IS+LGK+G+ S AASLLH L  +G  ID+YAYTSLIT YASNGRYREA
Sbjct: 181 NLVLRGSVIAVLISMLGKEGKVSVAASLLHGLHKDGFDIDVYAYTSLITTYASNGRYREA 240

Query: 241 VMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLIS 300
           VM+F K+E+EG  PTLITYN+ILN YGKMGMPW+KI A+V+ MKS GV PD  T+N LIS
Sbjct: 241 VMVFKKMEEEGCKPTLITYNVILNVYGKMGMPWNKIMALVEGMKSAGVKPDSYTFNTLIS 300

Query: 301 SCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
            C RGSL++EAA VFEEMK AGF PDKVTYNALLDVYGK RRPKE
Sbjct: 301 CCRRGSLHEEAAGVFEEMKLAGFSPDKVTYNALLDVYGKCRRPKE 332

BLAST of Cp4.1LG04g07750 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 310.5 bits (794), Expect = 1.3e-84
Identity = 177/346 (51.16%), Postives = 235/346 (67.92%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MADK+ LPLLLP  P SKP     +H         T+LS PPP P         + PLL 
Sbjct: 1   MADKLALPLLLPCTPSSKPYSHDQNHHISRTPFLTTSLSSPPPPP---------VEPLLH 60

Query: 61  GILHPHQDPSSPQTHNPK-SIFRTHTRKGRSR----GNPWSHHRLSTKGKQILDSLLNPE 120
            +   HQ+P+S Q  + + S  R  TR G+SR    G PWS+H LS +G+Q+L SL+ P 
Sbjct: 61  DVFL-HQNPNSRQPISSQTSRNRNRTRIGKSRDPNLGKPWSYHGLSPQGQQVLRSLIEPN 120

Query: 121 LDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVALR------NQKD 180
            DS   + +L + FE         PES S ++L  +KGL F  K ++ALR       QKD
Sbjct: 121 FDSGQLDSVLSELFEP----FKDKPESTSSELLAFLKGLGFHKKFDLALRAFDWFMKQKD 180

Query: 181 FASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYRE 240
           + S+L++SV+A+IIS+LGK+GR S AA++ + L+ +G  +D+Y+YTSLI+A+A++GRYRE
Sbjct: 181 YQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFANSGRYRE 240

Query: 241 AVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLI 300
           AV +F K+E++G  PTLITYN+ILN +GKMG PW+KI ++V+ MKS G+APD  TYN LI
Sbjct: 241 AVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLVEKMKSDGIAPDAYTYNTLI 300

Query: 301 SSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           + C RGSL++EAA+VFEEMKAAGF  DKVTYNALLDVYGKS RPKE
Sbjct: 301 TCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKE 332

BLAST of Cp4.1LG04g07750 vs. TAIR10
Match: AT2G18940.1 (AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 140.6 bits (353), Expect = 1.8e-33
Identity = 103/328 (31.40%), Postives = 167/328 (50.91%), Query Frame = 1

Query: 10  LLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQGILHPHQDP 69
           L P  PPS PL S+ HH           LS PPP+    ++   P   +           
Sbjct: 38  LPPPSPPSFPLDSLLHHL--------VHLSSPPPRHSNSAAARFPSLEV----------- 97

Query: 70  SSPQTHNPKSIFRTHTRKGRSRGNPWSHHRLSTKGKQILDSLLNPELDSSYFNEILLQFF 129
            S  + + K I        R+     S   L  K   +++S++   L        L +FF
Sbjct: 98  -STDSSSSKPILGIEIENERNG----SLKLLCKKEVVLVNSIVEQPLTG------LSRFF 157

Query: 130 ETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL-------RNQKDFASILNSSVIAVI 189
           ++   EL  T      D++ ++KGL      E A+        +    A  L+  VI + 
Sbjct: 158 DSVKSELLRT------DLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIF 217

Query: 190 ISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLFNKLEQEGY 249
           + +LG++ + S AA LL ++     ++D+ AYT+++ AY+  G+Y +A+ LF ++++ G 
Sbjct: 218 VRILGRESQYSVAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGP 277

Query: 250 TPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLISSCCRGSLYKEAA 309
           +PTL+TYN+IL+ +GKMG  W KI  ++D M+S G+  D  T + ++S+C R  L +EA 
Sbjct: 278 SPTLVTYNVILDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSACAREGLLREAK 329

Query: 310 EVFEEMKAAGFIPDKVTYNALLDVYGKS 331
           E F E+K+ G+ P  VTYNALL V+GK+
Sbjct: 338 EFFAELKSCGYEPGTVTYNALLQVFGKA 329

BLAST of Cp4.1LG04g07750 vs. TAIR10
Match: AT1G74850.1 (AT1G74850.1 plastid transcriptionally active 2)

HSP 1 Score: 123.6 bits (309), Expect = 2.3e-28
Identity = 54/158 (34.18%), Postives = 94/158 (59.49%), Query Frame = 1

Query: 175 NSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLF 234
           N  +  ++IS+LG++G       +  E+ + G+   +++YT+LI AY  NGRY  ++ L 
Sbjct: 140 NEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSLELL 199

Query: 235 NKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLISSCCR 294
           ++++ E  +P+++TYN ++NA  + G+ W  +  +   M+  G+ PD+ TYN L+S+C  
Sbjct: 200 DRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDIVTYNTLLSACAI 259

Query: 295 GSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRR 333
             L  EA  VF  M   G +PD  TY+ L++ +GK RR
Sbjct: 260 RGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRR 297

BLAST of Cp4.1LG04g07750 vs. TAIR10
Match: AT2G31400.1 (AT2G31400.1 genomes uncoupled 1)

HSP 1 Score: 108.2 bits (269), Expect = 9.9e-24
Identity = 89/317 (28.08%), Postives = 148/317 (46.69%), Query Frame = 1

Query: 24  FHHQPPSPSSPHTTLSFPPPQ------PQPLSSPSSPIAPLLQGILHPHQDPSSPQTHNP 83
           ++H+P   SS     S PPP       P  LS P     P    +  P  D SS  +   
Sbjct: 87  YNHRPYGASSSPRG-SAPPPSSVATVAPAQLSQP-----PNFSPLQTPKSDLSSDFSGRR 146

Query: 84  KSIFRTHTRKGRSRGNPWSHHRLSTKGKQILDSLLNPELDSSYFNEILLQFFETSPVELN 143
            + F +    GR +    + H  S+  +  L + ++   D   F+ ++L F         
Sbjct: 147 STRFVSKMHFGRQKTTMATRH--SSAAEDALQNAIDFSGDDEMFHSLMLSFESKL----- 206

Query: 144 FTPESVSFDILGIIKGLVFRNKNEVAL-----RNQKDFASILNSSVIAVIISVLGKQGRA 203
                 S D   II+ L  RN+ + A+       +++        + + +IS LG+ G+ 
Sbjct: 207 ----CGSDDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGKV 266

Query: 204 SFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLFNKLEQEGYTPTLITYNII 263
           + A  +       G    +YA+++LI+AY  +G + EA+ +FN +++ G  P L+TYN +
Sbjct: 267 TIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNAV 326

Query: 264 LNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLISSCCRGSLYKEAAEVFEEMKAAG 323
           ++A GK GM + ++A   D M+  GV PD  T+N L++ C RG L++ A  +F+EM    
Sbjct: 327 IDACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRR 386

Query: 324 FIPDKVTYNALLDVYGK 330
              D  +YN LLD   K
Sbjct: 387 IEQDVFSYNTLLDAICK 386

BLAST of Cp4.1LG04g07750 vs. TAIR10
Match: AT4G30825.1 (AT4G30825.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 99.0 bits (245), Expect = 6.0e-21
Identity = 56/194 (28.87%), Postives = 106/194 (54.64%), Query Frame = 1

Query: 134 VELNFTPESVSFDILGII--KGLVFRNKNEVALRNQKDFASILNSSVIAVIISVLGKQGR 193
           +   FTP +V+F++L  +  K  +F+  NE+ L  ++    +++      II+  GK   
Sbjct: 691 IRYGFTPNTVTFNVLLDVYGKAKLFKKVNELFLLAKRH--GVVDVISYNTIIAAYGKNKD 750

Query: 194 ASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYREAVMLFNKLEQEGYTPTLITYNI 253
            +  +S +  ++ +G  + + AY +L+ AY  + +  +   +  ++++    P   TYNI
Sbjct: 751 YTNMSSAIKNMQFDGFSVSLEAYNTLLDAYGKDKQMEKFRSILKRMKKSTSGPDHYTYNI 810

Query: 254 ILNAYGKMGMPW-SKIAAIVDSMKSFGVAPDLCTYNMLISSCCRGSLYKEAAEVFEEMKA 313
           ++N YG+ G  W  ++A ++  +K  G+ PDLC+YN LI +   G + +EA  + +EM+ 
Sbjct: 811 MINIYGEQG--WIDEVADVLKELKESGLGPDLCSYNTLIKAYGIGGMVEEAVGLVKEMRG 870

Query: 314 AGFIPDKVTYNALL 325
              IPDKVTY  L+
Sbjct: 871 RNIIPDKVTYTNLV 880

BLAST of Cp4.1LG04g07750 vs. NCBI nr
Match: gi|659110047|ref|XP_008455020.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Cucumis melo])

HSP 1 Score: 492.3 bits (1266), Expect = 6.9e-136
Identity = 268/347 (77.23%), Postives = 292/347 (84.15%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHT-TLSFPPPQPQPLSSPSSPIAPLL 60
           MADKV LPLLLPNPPPSK L  VFHHQPP PSSP   +L+FPPP   P SS SSP+APLL
Sbjct: 17  MADKVALPLLLPNPPPSKSLFPVFHHQPPLPSSPSPPSLTFPPPPQSPPSSSSSPLAPLL 76

Query: 61  QGILHPHQDPSSP-QTHNPKSIFRTHTRKGRSR----GNPWSHHRLSTKGKQILDSLLNP 120
           Q +L PHQ PSS  Q H PK  FRT TR GRSR    G PWSHHRLST+G++ILDSLLNP
Sbjct: 77  QDLL-PHQHPSSSAQPHLPKPTFRTRTRIGRSRDPNRGKPWSHHRLSTQGQRILDSLLNP 136

Query: 121 ELDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL------RNQK 180
           E DSS  +EILLQ FETSP  LNFT +SVSFDILGIIKGLVF  KNE+AL      RN++
Sbjct: 137 EFDSSSLDEILLQLFETSPDGLNFTSDSVSFDILGIIKGLVFNKKNELALGVFDFVRNRE 196

Query: 181 DFASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYR 240
           DFASIL++SVIAVIISVLGK+GRASFAASLLHELRN+G+ IDIYAYTSLITAYASNGRYR
Sbjct: 197 DFASILSNSVIAVIISVLGKEGRASFAASLLHELRNDGVHIDIYAYTSLITAYASNGRYR 256

Query: 241 EAVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNML 300
           EAVM+F KLE+EG  PTLITYN+ILN YGKMGMPWSKI+A+VDSMKS GV PDL TYN L
Sbjct: 257 EAVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKISALVDSMKSSGVVPDLYTYNTL 316

Query: 301 ISSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           ISSC RGSLY+EAAE+FEEMKAAGF PDKVTYNALLDVYGKSRRPKE
Sbjct: 317 ISSCRRGSLYEEAAEIFEEMKAAGFSPDKVTYNALLDVYGKSRRPKE 362

BLAST of Cp4.1LG04g07750 vs. NCBI nr
Match: gi|449438627|ref|XP_004137089.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Cucumis sativus])

HSP 1 Score: 473.8 bits (1218), Expect = 2.5e-130
Identity = 263/346 (76.01%), Postives = 286/346 (82.66%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MADKV LPLLLPNPPPSK    VFHHQP SPSSP       PP P  LSS SSP+APLLQ
Sbjct: 1   MADKVSLPLLLPNPPPSKSHFPVFHHQPLSPSSPPPPPLTFPPTPH-LSSASSPLAPLLQ 60

Query: 61  GILHPHQDPSSP-QTHNPKSIFRTHTRKGRS----RGNPWSHHRLSTKGKQILDSLLNPE 120
            +L PHQ PSS  Q H PK  FRT TR GRS    RG PWSHHRLST+G++ILDSLLNPE
Sbjct: 61  DLL-PHQHPSSSTQPHLPKPTFRTRTRIGRSHDPNRGKPWSHHRLSTQGQRILDSLLNPE 120

Query: 121 LDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL------RNQKD 180
            DSS  +EILLQ FETS   LNFT +SVSFDILGIIKGLVF  KNE+AL      RN++D
Sbjct: 121 FDSSSLDEILLQLFETSSDGLNFTSDSVSFDILGIIKGLVFYKKNELALCVFYFVRNRED 180

Query: 181 FASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYRE 240
           FASIL++SV+AVIISVLGK+GRASFAASLLH+LRN+G+ IDIYAYTSLITAYASNGRYRE
Sbjct: 181 FASILSNSVVAVIISVLGKEGRASFAASLLHDLRNDGVHIDIYAYTSLITAYASNGRYRE 240

Query: 241 AVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLI 300
           AVM+F KLE+EG  PTLITYN+ILN YGKMGMPWSKIA +VDSMKS GVAPDL TYN LI
Sbjct: 241 AVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKIAGLVDSMKSSGVAPDLYTYNTLI 300

Query: 301 SSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           SSC RGSLY+EAAEVFEEMKAAGF PDKVTYNALLDVYGKSRRP+E
Sbjct: 301 SSCRRGSLYEEAAEVFEEMKAAGFSPDKVTYNALLDVYGKSRRPRE 344

BLAST of Cp4.1LG04g07750 vs. NCBI nr
Match: gi|700188631|gb|KGN43864.1| (hypothetical protein Csa_7G071530 [Cucumis sativus])

HSP 1 Score: 473.8 bits (1218), Expect = 2.5e-130
Identity = 263/346 (76.01%), Postives = 286/346 (82.66%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MADKV LPLLLPNPPPSK    VFHHQP SPSSP       PP P  LSS SSP+APLLQ
Sbjct: 1   MADKVSLPLLLPNPPPSKSHFPVFHHQPLSPSSPPPPPLTFPPTPH-LSSASSPLAPLLQ 60

Query: 61  GILHPHQDPSSP-QTHNPKSIFRTHTRKGRS----RGNPWSHHRLSTKGKQILDSLLNPE 120
            +L PHQ PSS  Q H PK  FRT TR GRS    RG PWSHHRLST+G++ILDSLLNPE
Sbjct: 61  DLL-PHQHPSSSTQPHLPKPTFRTRTRIGRSHDPNRGKPWSHHRLSTQGQRILDSLLNPE 120

Query: 121 LDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL------RNQKD 180
            DSS  +EILLQ FETS   LNFT +SVSFDILGIIKGLVF  KNE+AL      RN++D
Sbjct: 121 FDSSSLDEILLQLFETSSDGLNFTSDSVSFDILGIIKGLVFYKKNELALCVFYFVRNRED 180

Query: 181 FASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRYRE 240
           FASIL++SV+AVIISVLGK+GRASFAASLLH+LRN+G+ IDIYAYTSLITAYASNGRYRE
Sbjct: 181 FASILSNSVVAVIISVLGKEGRASFAASLLHDLRNDGVHIDIYAYTSLITAYASNGRYRE 240

Query: 241 AVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNMLI 300
           AVM+F KLE+EG  PTLITYN+ILN YGKMGMPWSKIA +VDSMKS GVAPDL TYN LI
Sbjct: 241 AVMVFKKLEEEGCRPTLITYNVILNVYGKMGMPWSKIAGLVDSMKSSGVAPDLYTYNTLI 300

Query: 301 SSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           SSC RGSLY+EAAEVFEEMKAAGF PDKVTYNALLDVYGKSRRP+E
Sbjct: 301 SSCRRGSLYEEAAEVFEEMKAAGFSPDKVTYNALLDVYGKSRRPRE 344

BLAST of Cp4.1LG04g07750 vs. NCBI nr
Match: gi|657962926|ref|XP_008373065.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Malus domestica])

HSP 1 Score: 348.6 bits (893), Expect = 1.2e-92
Identity = 194/348 (55.75%), Postives = 239/348 (68.68%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MA+++ LPLLLPNPPPS       HHQP +P++P      PPP          P+ PLLQ
Sbjct: 1   MAEQIALPLLLPNPPPSSRPFFQTHHQPQNPATP------PPP----------PMTPLLQ 60

Query: 61  GILHPHQDPSSPQTHNPKS---IFRTHTRKGRSR----GNPWSHHRLSTKGKQILDSLLN 120
            +L  H +PS+PQT NP S   + R  TR GRSR    G PWSHHRLS++G+ IL S L+
Sbjct: 61  ELLL-HPNPSTPQTQNPSSPSTLPRARTRIGRSRDSNRGKPWSHHRLSSQGQHILHSFLD 120

Query: 121 PELDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL------RNQ 180
           P+ DSS   E L+   E    E   + +S++ D+LGI+KGL F  K ++A+      + +
Sbjct: 121 PQFDSSXLGEKLVGLVEMHRDEFGSSLDSLALDLLGIVKGLSFHKKFDLAISVFEWFKKR 180

Query: 181 KDFASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRY 240
           +D  S+L+ SV+AVIIS+LGK GR S A SL   L   G  +D+YAYTSLITA ASNGRY
Sbjct: 181 EDCDSVLSGSVVAVIISILGKVGRVSNATSLFQNLHKEGFALDVYAYTSLITACASNGRY 240

Query: 241 REAVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNM 300
           REAV +F K+E+EG  PTLITYN+ILN YGKMGMPW KI A+V+ MKS G+ PD  TYN 
Sbjct: 241 REAVSVFKKMEEEGCRPTLITYNVILNVYGKMGMPWHKIRALVEGMKSAGITPDSYTYNT 300

Query: 301 LISSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           LI+ C RGSLY EAAEVF+EMK AGF+PDKVTYNALLDVYGKSRR KE
Sbjct: 301 LITCCRRGSLYVEAAEVFQEMKTAGFVPDKVTYNALLDVYGKSRRTKE 331

BLAST of Cp4.1LG04g07750 vs. NCBI nr
Match: gi|645261687|ref|XP_008236415.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Prunus mume])

HSP 1 Score: 347.4 bits (890), Expect = 2.8e-92
Identity = 195/348 (56.03%), Postives = 245/348 (70.40%), Query Frame = 1

Query: 1   MADKVVLPLLLPNPPPSKPLISVFHHQPPSPSSPHTTLSFPPPQPQPLSSPSSPIAPLLQ 60
           MA+++ LPLLL NPPPS       HHQ  +P++P      PPP P P   P  P+ PLLQ
Sbjct: 1   MAEQIALPLLLHNPPPSSRPFFQNHHQNQNPATPT-----PPPLPPP---PPMPVTPLLQ 60

Query: 61  GILHPHQDPSSPQTHNPKS---IFRTHTRKGRSR----GNPWSHHRLSTKGKQILDSLLN 120
            +L  H +PS+PQT NP S   + R  TR G+SR    G PWSHHRLS++G+ IL S L+
Sbjct: 61  ELLL-HPNPSTPQTQNPTSPPTLPRARTRIGKSRDSNRGKPWSHHRLSSQGQHILHSFLD 120

Query: 121 PELDSSYFNEILLQFFETSPVELNFTPESVSFDILGIIKGLVFRNKNEVAL------RNQ 180
           P+ DSS  +E LL   +    E   + +S+S D+LGI+KGL F  K ++A+      + +
Sbjct: 121 PQFDSSKLDEQLLGLVDLHRDEFGSSLDSLSLDVLGIVKGLGFHKKFDLAIDVFEWFKKR 180

Query: 181 KDFASILNSSVIAVIISVLGKQGRASFAASLLHELRNNGLIIDIYAYTSLITAYASNGRY 240
           +D  SIL+ SV+AVIIS+LGK GR S A SL   L  +G  +D+YAYTSLIT+ ASNGRY
Sbjct: 181 EDCDSILSGSVVAVIISILGKVGRVSSATSLFQSLHKDGFALDVYAYTSLITSCASNGRY 240

Query: 241 REAVMLFNKLEQEGYTPTLITYNIILNAYGKMGMPWSKIAAIVDSMKSFGVAPDLCTYNM 300
           REAV +F K+E+EG  PTLITYN+ILN YGKMGMPW+KI A+V+ MKS G+APD  TYN 
Sbjct: 241 REAVTVFKKMEEEGCRPTLITYNVILNVYGKMGMPWNKIRALVECMKSAGIAPDSYTYNT 300

Query: 301 LISSCCRGSLYKEAAEVFEEMKAAGFIPDKVTYNALLDVYGKSRRPKE 336
           LI+ C RGSL+ EAAEVF+EMK+AGF+PDKVTYNALLDVYGKSRR KE
Sbjct: 301 LITCCRRGSLHVEAAEVFQEMKSAGFVPDKVTYNALLDVYGKSRRTKE 339

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP362_ARATH2.3e-8351.16Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN... [more]
PP163_ARATH3.2e-3231.40Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop... [more]
PP124_ARATH4.0e-2734.18Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop... [more]
PP178_ARATH1.8e-2228.08Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop... [more]
PP342_ARATH1.1e-1928.87Pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0K4H7_CUCSA1.8e-13076.01Uncharacterized protein OS=Cucumis sativus GN=Csa_7G071530 PE=4 SV=1[more]
M5VJ93_PRUPE1.9e-9256.03Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001449mg PE=4 SV=1[more]
B9RY68_RICCO4.4e-8953.85Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A067F3R3_CITSI3.7e-8854.78Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003451mg PE=4 SV=1[more]
A0A067EVB9_CITSI3.7e-8854.78Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003451mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G02860.11.3e-8451.16 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT2G18940.11.8e-3331.40 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G74850.12.3e-2834.18 plastid transcriptionally active 2[more]
AT2G31400.19.9e-2428.08 genomes uncoupled 1[more]
AT4G30825.16.0e-2128.87 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659110047|ref|XP_008455020.1|6.9e-13677.23PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Cucumis melo][more]
gi|449438627|ref|XP_004137089.1|2.5e-13076.01PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Cucumis sativu... [more]
gi|700188631|gb|KGN43864.1|2.5e-13076.01hypothetical protein Csa_7G071530 [Cucumis sativus][more]
gi|657962926|ref|XP_008373065.1|1.2e-9255.75PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Malus domestic... [more]
gi|645261687|ref|XP_008236415.1|2.8e-9256.03PREDICTED: pentatricopeptide repeat-containing protein At5g02860 [Prunus mume][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g07750.1Cp4.1LG04g07750.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 280..327
score: 8.8
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 197..259
score: 8.7
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 212..245
score: 6.1E-7coord: 247..281
score: 3.5E-6coord: 283..316
score: 1.5
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 210..244
score: 12.321coord: 245..280
score: 10.49coord: 281..315
score: 13.197coord: 316..339
score: 8.024coord: 175..209
score: 7
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 210..311
score: 2.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 8..78
score: 1.4E-69coord: 113..145
score: 1.4E-69coord: 180..335
score: 1.4

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG04g07750Cp4.1LG15g04550Cucurbita pepo (Zucchini)cpecpeB269