Cp4.1LG15g01850 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG15g01850
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein family
Location: Cp4.1LG15 : 1504820 .. 1507430 (+)
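The span of the feature on the linkage group follows from the two coordinates by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page); the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = feature_length(1504820, 1507430)
print(gene_length)
```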
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGGGGACATCAGTCCTTAACCACAACCATCATTTATTGCCCTCTAAAGACATACAACAAAGTTTAGATTTGAGTTTGAAGCAGAAGGAGCAGGAATGTTTGTGCCTTCTAAAGAAATGCAAGAGCTTAGAAGAATTCAAACAAGTTCATGTTCAAATTCTGAAGTTGGGTCTTTTCTGGGATTCTTTCTGCTCAAGCAGTCTTTTGAGCACTTGTGCGCTCTCAGATTGGAGCAGCATGGACTATGCCTGCTCCATTTTCCAACAGCTAGATGAACCTACCACATTTCATTTCAACACAATGATCAAAGGCTATGTTAACAACATGAACTTTGAGAGTGCTCTAAATCTGTATGCCGATATGTTTCAAAGAGAAGTAGAACCCGACAACTTCACGTACCCGGTAGTTCTCAAGGCTTGTGCTCGGTTAGCAGCGATCGAGGAAGGGATGCAGATTCATGGTCATGTATTCAAGCTAGGTTTGGAAGATGATCTATTTGTACAGAATAGCTTAATCAATATGTATGGGAAATGTGGGGATATCGAACTGTCTTGTGCTGTTTTTCGACGTATGGAGGAAAAGAGTGTGGCTTCTTGGAGTGCTATAATTTCAGCTCATGCTCGTGTTGGATTGTGGCGGGAATGTTTGATGTTGTTTGAGGATATGAGTACAGAAGGATGTTGGAGGGCTGAGGAAAGTATATTAGTCAGTGTGGTCTCTGCTTGCACCCATTTGGGTGCTCTTCATTTAGGAAGATGTGCCCATGGTGCTCTATTGAGAAACATAACTGAACTAAATGTTGCGGTTAGGACTTCCTTAATGGATATGTATGTGAAATGTGGGTCGCTTCAGAAAGGATTATGTCTCTTCCAGAACATGACCAAAAGGAATCAACTATCCTATAGTGTCATAATCTCAGGGCTTGGCTTACATGGACATGGTAGACAAGCTCTAAGAATCTTCTCAGAAATGGTTGAAGAAGGCTTAGAGCCTGATGATGTTATCTATGTTGGTGTGCTTAGCGCTTGTAGTCATTCCGGCCTTGTCGAAGAAGGTCTTGATCTCTTCAATAGGATGAAGAACGAGTGCGGGATTAAACCAACAATGCAGCATTATGGCTGCGTGGTAGACCTGATGGGACGAGCTGGTTTGCTTGAAGAAGCATTTGAGCTTGTGAAAGGTATGCCTATAAAAGCAAACGATATAATTTGGCGGAGTATTCTAAGTGCTTGTAAGATTCATGACAACTTAAAGCTTGGTGAGGTAGCTGCAGAGAATCTATTTCGATCGTCTTCGCATAATCCTAGCGATTACCTAGTTTTGTCTAATATGTATGCAAGAGCTCAACAATGGGAGAATGTAGCTAAGATCAGGACAAAAATGTTCGATGATGGCTTTGTCCAGACACCAGGGTATAGCTTGGTGGAGGTGAAAAGGAAGGTATACCAATTTGTTTCACAGGATAAATCGAACTGCAAATCGGGTAAAATCTACGAGATGATTCATCAGATGGAATGGCAATTGAGATTTGAAGGCTATATGGCAGATACATCACAGGTTATGCTTAACGTAGATGAAGAAGAAAAGAGAGAGAGATTGAAAGGTCATAGCCAGAAGTTGGCTATAGCTTTTGCGCTCATTCATACTTCACAGGGATCTGCAATAAGGATAACTAGAAACCTGAGAATGTGTACTGACTGTCATACATACACTAAACTGATTTCAATGATCTATGAACGAGAAATTACTGTAAGAGACCGGAATCGGTTCCACCGTTTTAAAGATGGAAACTGCTCGTGTAGGGATTACTGGTGAATCACTGAACTGTTCAACTATGATTTATGGGAGAAAAACTTTCCCATATATTTTTATTTTGAATCTCTAGAATGTCTCAGCAGCTAGTGTTAGGAATCACGGCTCTCTACAATGGTATGATATTGTCCACTTTGAGCATAAGCTCTCGTAGTTTTGCTTTGAGCTTCCCAAAAGGCCTCGTA
CCAATGAAGATGTATTCCTTACTTACAAACCTATGATCATTCCCTAAATCAGCCAATGTGGGACTCCCTCCCAACAATCTTCCCCTCAAACAAAGTACACCATAGAGCCTCCCCGGAGGTCGATGTAGCCCTCGACAGTCTCCCCTTAATCGAGACTCGACTCTTTCTCTAGAGCCCTCGAACAAAGTACACCCTTTGTTCGACACATGAGTCACTTTTGACTAGACCTTCGAGGCTAGCAATTTATTTGTTCAACACTTGAGGATTTGATTGATATGGCTTAACTAAGGGCATGATTCTGATACCATGTTAGGAATCACAGATCTCCACAATGGTATAATATTGTCCACTTTGAGCATAAGCTCTCATGGCTTTGCTTTGGGCTTCCCCAAAGGGCCTCATACCAATAGATATATTTCTTACTTATAAACCCATGATCATTCCGTAAATTAGCCAATGTGAGACTCATTCCCAACAATCCTTGACCGTTAGCACCTAGGGAGGAAGAAATGGTAGGAAGCAAATAATAGACAGATGGATGACATTAACAAGATGATACCTAATTTTGCTTCTGAATGACACCAAAACAGGTATGCTACCGCCCTGTATAA

mRNA sequence

ATGATGGGGACATCAGTCCTTAACCACAACCATCATTTATTGCCCTCTAAAGACATACAACAAAGTTTAGATTTGAGTTTGAAGCAGAAGGAGCAGGAATGTTTGTGCCTTCTAAAGAAATGCAAGAGCTTAGAAGAATTCAAACAAGTTCATGTTCAAATTCTGAAGTTGGGTCTTTTCTGGGATTCTTTCTGCTCAAGCAGTCTTTTGAGCACTTGTGCGCTCTCAGATTGGAGCAGCATGGACTATGCCTGCTCCATTTTCCAACAGCTAGATGAACCTACCACATTTCATTTCAACACAATGATCAAAGGCTATGTTAACAACATGAACTTTGAGAGTGCTCTAAATCTGTATGCCGATATGTTTCAAAGAGAAGTAGAACCCGACAACTTCACGTACCCGGTAGTTCTCAAGGCTTGTGCTCGGTTAGCAGCGATCGAGGAAGGGATGCAGATTCATGGTCATGTATTCAAGCTAGGTTTGGAAGATGATCTATTTGTACAGAATAGCTTAATCAATATGTATGGGAAATGTGGGGATATCGAACTGTCTTGTGCTGTTTTTCGACGTATGGAGGAAAAGAGTGTGGCTTCTTGGAGTGCTATAATTTCAGCTCATGCTCGTGTTGGATTGTGGCGGGAATGTTTGATGTTGTTTGAGGATATGAGTACAGAAGGATGTTGGAGGGCTGAGGAAAGTATATTAGTCAGTGTGGTATGCTACCGCCCTGTATAA

Coding sequence (CDS)

ATGATGGGGACATCAGTCCTTAACCACAACCATCATTTATTGCCCTCTAAAGACATACAACAAAGTTTAGATTTGAGTTTGAAGCAGAAGGAGCAGGAATGTTTGTGCCTTCTAAAGAAATGCAAGAGCTTAGAAGAATTCAAACAAGTTCATGTTCAAATTCTGAAGTTGGGTCTTTTCTGGGATTCTTTCTGCTCAAGCAGTCTTTTGAGCACTTGTGCGCTCTCAGATTGGAGCAGCATGGACTATGCCTGCTCCATTTTCCAACAGCTAGATGAACCTACCACATTTCATTTCAACACAATGATCAAAGGCTATGTTAACAACATGAACTTTGAGAGTGCTCTAAATCTGTATGCCGATATGTTTCAAAGAGAAGTAGAACCCGACAACTTCACGTACCCGGTAGTTCTCAAGGCTTGTGCTCGGTTAGCAGCGATCGAGGAAGGGATGCAGATTCATGGTCATGTATTCAAGCTAGGTTTGGAAGATGATCTATTTGTACAGAATAGCTTAATCAATATGTATGGGAAATGTGGGGATATCGAACTGTCTTGTGCTGTTTTTCGACGTATGGAGGAAAAGAGTGTGGCTTCTTGGAGTGCTATAATTTCAGCTCATGCTCGTGTTGGATTGTGGCGGGAATGTTTGATGTTGTTTGAGGATATGAGTACAGAAGGATGTTGGAGGGCTGAGGAAAGTATATTAGTCAGTGTGGTATGCTACCGCCCTGTATAA

Protein sequence

MMGTSVLNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLFWDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYADMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVVCYRPV
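The protein sequence can be reproduced from the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; `translate` and `CODON_TABLE` are names introduced here for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last eight codons of the CDS shown above.
print(translate("ATGATGGGGACATCAGTCCTTAAC"))
print(translate("GTGGTATGCTACCGCCCTGTATAA"))
```

Passing the full CDS string from this record to `translate` should give the protein sequence listed above, provided the standard-code assumption holds.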
BLAST of Cp4.1LG15g01850 vs. Swiss-Prot
Match: PPR68_ARATH (Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana GN=PCMP-H11 PE=2 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 1.8e-69
Identity = 130/213 (61.03%), Positives = 167/213 (78.40%), Query Frame = 1

Query: 30  KEQECLCLLKKCKSLEEFKQVHVQILKLGLFWDS-FCSSSLLSTCALSDW-SSMDYACSI 89
           KEQECL LLK+C +++EFKQVH + +KL LF+ S F +SS+L+ CA S W +SM+YA SI
Sbjct: 29  KEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASI 88

Query: 90  FQQLDEPTTFHFNTMIKGYVNNMNFESALNLYADMFQREVEPDNFTYPVVLKACARLAAI 149
           F+ +D+P TF FNTMI+GYVN M+FE AL  Y +M QR  EPDNFTYP +LKAC RL +I
Sbjct: 89  FRGIDDPCTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLKACTRLKSI 148

Query: 150 EEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCAVFRRMEEKSVASWSAIISAH 209
            EG QIHG VFKLGLE D+FVQNSLINMYG+CG++ELS AVF ++E K+ ASWS+++SA 
Sbjct: 149 REGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSAR 208

Query: 210 ARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 241
           A +G+W ECL+LF  M +E   +AEES +VS +
Sbjct: 209 AGMGMWSECLLLFRGMCSETNLKAEESGMVSAL 241
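The identity and positives percentages in each HSP header are the match counts divided by the alignment length. As a rough sketch, the helper below parses one such header line and recomputes both values; the function name `hsp_percentages` and the regular expression are assumptions made for this example, and the pattern accepts either spelling of "Positives".

```python
import re

# Matches e.g. "Identity = 130/213 (61.03%), Positives = 167/213 (78.40%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)


def hsp_percentages(line: str):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return 100 * ident / ident_len, 100 * pos / pos_len


identity, positives = hsp_percentages(
    "Identity = 130/213 (61.03%), Positives = 167/213 (78.40%), Query Frame = 1"
)
print(f"{identity:.2f} {positives:.2f}")
```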

BLAST of Cp4.1LG15g01850 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 4.7e-33
Identity = 85/238 (35.71%), Positives = 130/238 (54.62%), Query Frame = 1

Query: 35  LCLLKKCKSLEEFKQVHVQILKLGLFWDSFCSSSLLSTCALSD-WSSMDYACSIFQQLDE 94
           L LL  CK+L+  + +H Q++K+GL   ++  S L+  C LS  +  + YA S+F+ + E
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 96

Query: 95  PTTFHFNTMIKGYVNNMNFESALNLYADMFQREVEPDNFTYPVVLKACARLAAIEEGMQI 154
           P    +NTM +G+  + +  SAL LY  M    + P+++T+P VLK+CA+  A +EG QI
Sbjct: 97  PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 156

Query: 155 HGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCAVFRR---------------------- 214
           HGHV KLG + DL+V  SLI+MY + G +E +  VF +                      
Sbjct: 157 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 216

Query: 215 ---------MEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 241
                    +  K V SW+A+IS +A  G ++E L LF+DM      R +ES +V+VV
Sbjct: 217 ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNV-RPDESTMVTVV 273

BLAST of Cp4.1LG15g01850 vs. Swiss-Prot
Match: PP321_ARATH (Pentatricopeptide repeat-containing protein At4g18840 OS=Arabidopsis thaliana GN=PCMP-E101 PE=3 SV=2)

HSP 1 Score: 137.5 bits (345), Expect = 2.0e-31
Identity = 79/216 (36.57%), Positives = 121/216 (56.02%), Query Frame = 1

Query: 21  QSLDLSLKQKEQ------------ECLCLLKKCKSLEEFKQVHVQILKLGLFWDSFCSSS 80
           Q+ +L L QKE               L   ++ KSL E +Q H  +LK GLF D+F +S 
Sbjct: 17  QAYNLRLLQKENLKKMSVCSSTPVPILSFTERAKSLTEIQQAHAFMLKTGLFHDTFSASK 76

Query: 81  LLSTCALS-DWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYADMFQREV 140
           L++  A + +  ++ YA SI  ++  P  F  N++I+ Y N+   E AL ++ +M    V
Sbjct: 77  LVAFAATNPEPKTVSYAHSILNRIGSPNGFTHNSVIRAYANSSTPEVALTVFREMLLGPV 136

Query: 141 EPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCA 200
            PD +++  VLKACA     EEG QIHG   K GL  D+FV+N+L+N+YG+ G  E++  
Sbjct: 137 FPDKYSFTFVLKACAAFCGFEEGRQIHGLFIKSGLVTDVFVENTLVNVYGRSGYFEIARK 196

Query: 201 VFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDM 224
           V  RM  +   SW++++SA+   GL  E   LF++M
Sbjct: 197 VLDRMPVRDAVSWNSLLSAYLEKGLVDEARALFDEM 232

BLAST of Cp4.1LG15g01850 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 7.4e-31
Identity = 63/193 (32.64%), Positives = 119/193 (61.66%), Query Frame = 1

Query: 35  LCLLKKCKSLEEFKQVHVQILKLGLFWDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEP 94
           + L+++C SL + KQ H  +++ G F D + +S L +  ALS ++S++YA  +F ++ +P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 95  TTFHFNTMIKGYVNNMNFESALNLYADMF-QREVEPDNFTYPVVLKACARLAAIEEGMQI 154
            +F +NT+I+ Y +  +   ++  + DM  + +  P+ +T+P ++KA A ++++  G  +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 155 HGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCAVFRRMEEKSVASWSAIISAHARVGLW 214
           HG   K  +  D+FV NSLI+ Y  CGD++ +C VF  ++EK V SW+++I+   + G  
Sbjct: 154 HGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSP 213

Query: 215 RECLMLFEDMSTE 227
            + L LF+ M +E
Sbjct: 214 DKALELFKKMESE 226

BLAST of Cp4.1LG15g01850 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 1.3e-30
Identity = 73/201 (36.32%), Positives = 118/201 (58.71%), Query Frame = 1

Query: 46  EFKQVHVQILKLGLFWDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKG 105
           + KQ+H ++L LGL +  F  + L+   A S +  + +A  +F  L  P  F +N +I+G
Sbjct: 36  QLKQIHARLLVLGLQFSGFLITKLIH--ASSSFGDITFARQVFDDLPRPQIFPWNAIIRG 95

Query: 106 YVNNMNFESALNLYADMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDD 165
           Y  N +F+ AL +Y++M    V PD+FT+P +LKAC+ L+ ++ G  +H  VF+LG + D
Sbjct: 96  YSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDAD 155

Query: 166 LFVQNSLINMYGKCGDIELSCAVFR--RMEEKSVASWSAIISAHARVGLWRECLMLFED- 225
           +FVQN LI +Y KC  +  +  VF    + E+++ SW+AI+SA+A+ G   E L +F   
Sbjct: 156 VFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQM 215

Query: 226 --MSTEGCWRAEESILVSVVC 242
             M  +  W A  S+L +  C
Sbjct: 216 RKMDVKPDWVALVSVLNAFTC 234

BLAST of Cp4.1LG15g01850 vs. TrEMBL
Match: A0A0A0LE81_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G895910 PE=4 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 1.1e-113
Identity = 202/241 (83.82%), Positives = 224/241 (92.95%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDI-QQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGL 60
           MMGTSVLN+NHHLLPSKD+ Q S +L+LKQKEQE LCL+KKCKSLEEFKQVHVQILK GL
Sbjct: 1   MMGTSVLNYNHHLLPSKDLPQSSSELNLKQKEQEYLCLVKKCKSLEEFKQVHVQILKFGL 60

Query: 61  FWDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLY 120
           F DSFCSSS+L+TCALSDW+SMDYACSIFQQLDEPTTF FNTMI+GYVNNMNFE+A+ LY
Sbjct: 61  FLDSFCSSSVLATCALSDWNSMDYACSIFQQLDEPTTFDFNTMIRGYVNNMNFENAIYLY 120

Query: 121 ADMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKC 180
            DM QREVEPDNFTYPVVLKACARLA I+EGMQIHGHVFKLGLEDD++VQNSLINMYGKC
Sbjct: 121 NDMLQREVEPDNFTYPVVLKACARLAVIQEGMQIHGHVFKLGLEDDVYVQNSLINMYGKC 180

Query: 181 GDIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSV 240
            DIE+SCA+FRRME+KSVASWSAII+AHA + +W ECL LFEDMS EGCWRAEESILV+V
Sbjct: 181 RDIEMSCAIFRRMEQKSVASWSAIIAAHASLAMWWECLALFEDMSREGCWRAEESILVNV 240

BLAST of Cp4.1LG15g01850 vs. TrEMBL
Match: W9SFP7_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022794 PE=4 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 1.7e-90
Identity = 165/240 (68.75%), Positives = 197/240 (82.08%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLF 60
           M GTSVLN  H LLP+K+  QS +  L  KEQECL LLK+CKS+ E KQ+HVQILK+GL 
Sbjct: 1   MTGTSVLNQTHLLLPAKEPIQSPEFHLSLKEQECLSLLKRCKSVRELKQIHVQILKIGLL 60

Query: 61  WDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYA 120
            DSFC+ +L++TCALSDW SMDYACSIF+ + EP TF FNTM++G+V + N+  AL LY 
Sbjct: 61  GDSFCAGNLVATCALSDWGSMDYACSIFRHVKEPQTFLFNTMMRGHVKDGNWGQALILYF 120

Query: 121 DMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCG 180
           DM +  VEPDNFTYPV+LKACARL+A EEGMQIHGH  KLGL+ DLFVQNSLINMYGKCG
Sbjct: 121 DMLKSGVEPDNFTYPVLLKACARLSATEEGMQIHGHTSKLGLQGDLFVQNSLINMYGKCG 180

Query: 181 DIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 240
            IEL+CAVF +M++KSVASW AII+AHA +G+W ECL+LF DM+ EGCWRAEES LVSV+
Sbjct: 181 KIELACAVFDQMDQKSVASWGAIIAAHASLGMWWECLVLFGDMNREGCWRAEESTLVSVL 240

BLAST of Cp4.1LG15g01850 vs. TrEMBL
Match: A0A075M181_CAMSI (Tetratricopeptide repeat-like superfamily protein OS=Camellia sinensis var. sinensis PE=2 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 5.1e-87
Identity = 150/234 (64.10%), Positives = 193/234 (82.48%), Query Frame = 1

Query: 7   LNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLFWDSFCS 66
           ++  H L+P +D  QS + + + +EQEC+ L+K+CK+LEEFKQ H QILK G+FW SFC+
Sbjct: 7   VHQTHFLIPQEDRPQSPESNFRLREQECVSLIKQCKNLEEFKQAHAQILKFGMFWSSFCA 66

Query: 67  SSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYADMFQRE 126
           ++L++TCALSDW SMDYA SIFQQ++EP +F FN MI+G+V +MN E AL +Y +M +  
Sbjct: 67  NNLVATCALSDWGSMDYASSIFQQINEPGSFAFNHMIRGHVKDMNLEEALLMYDEMLELG 126

Query: 127 VEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCGDIELSC 186
           VEPDNFTYP +LKACA L A+EEGMQIHGH FKLG EDD+FVQNSLINMYGKCG+I LSC
Sbjct: 127 VEPDNFTYPTLLKACANLPALEEGMQIHGHSFKLGFEDDVFVQNSLINMYGKCGEIGLSC 186

Query: 187 AVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 241
           AVF +ME+++VASWSA+I+AHA +GLW ECL +F +MS EGCWR EES+LV+V+
Sbjct: 187 AVFEKMEQRTVASWSALIAAHANLGLWCECLEIFGEMSREGCWRVEESVLVNVL 240

BLAST of Cp4.1LG15g01850 vs. TrEMBL
Match: A0A061G7X6_THECC (Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_016613 PE=4 SV=1)

HSP 1 Score: 327.4 bits (838), Expect = 1.5e-86
Identity = 156/240 (65.00%), Positives = 189/240 (78.75%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLF 60
           M GTSVL          D  QSL+LSL+ KEQEC  +LK+CK++EEF+Q H QI+K G F
Sbjct: 99  MPGTSVLQQTKFFSLPADPPQSLELSLRLKEQECFSILKRCKNMEEFRQAHAQIVKWGFF 158

Query: 61  WDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYA 120
           W+SFC+S+L++ CALSD  SMDYACSIFQQ+DEP TF FNTMI+ +V +M FE AL  Y 
Sbjct: 159 WNSFCASNLVAACALSDGGSMDYACSIFQQIDEPGTFEFNTMIRAHVKDMTFEEALVFYY 218

Query: 121 DMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCG 180
           +M ++ VEPDNFTYP + KACA L A EEG QIHGH FKLGLE DL+VQNSLINMYGKCG
Sbjct: 219 EMLEKGVEPDNFTYPALFKACACLQAQEEGKQIHGHAFKLGLESDLYVQNSLINMYGKCG 278

Query: 181 DIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 240
           +IE SCA+F +M++KSVASWSAII+AHA  G W ECLM+F +MS+EGCWR EES LV+V+
Sbjct: 279 EIEHSCAIFEQMDQKSVASWSAIIAAHASFGKWYECLMMFGNMSSEGCWRPEESTLVTVL 338

BLAST of Cp4.1LG15g01850 vs. TrEMBL
Match: A0A0D2PE14_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G181600 PE=4 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 1.9e-86
Identity = 154/240 (64.17%), Positives = 191/240 (79.58%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLF 60
           M GTSVL   +   P  D  Q  +L+L+ KEQ+CL LLK+CK+LE+FKQ H QI+K G F
Sbjct: 1   MAGTSVLQQTNFFSPPADPPQFSELNLRLKEQQCLSLLKRCKNLEDFKQAHAQIIKWGFF 60

Query: 61  WDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYA 120
           W+SF +S+L++ CALSDW S+DYACSIFQQ  EP TF FNTMI+ +V +MNF+ AL  Y 
Sbjct: 61  WNSFSASNLVAACALSDWGSLDYACSIFQQFHEPGTFEFNTMIRAHVKDMNFQDALVFYY 120

Query: 121 DMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCG 180
           +M +R VEPDNFTYP + KACA L A EEGMQIHGHVFK G E DL+VQNSLINMYGKCG
Sbjct: 121 EMLERGVEPDNFTYPALFKACAWLKAREEGMQIHGHVFKFGFESDLYVQNSLINMYGKCG 180

Query: 181 DIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 240
           +I+ SCAVF +M+EKSVASWSAII+A+A +G+W ECLM+F +MS+EGCWR EES LV+++
Sbjct: 181 EIQHSCAVFEQMDEKSVASWSAIIAANASLGMWYECLMVFGNMSSEGCWRPEESTLVTLL 240

BLAST of Cp4.1LG15g01850 vs. TAIR10
Match: AT1G31920.1 (AT1G31920.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 263.8 bits (673), Expect = 1.0e-70
Identity = 130/213 (61.03%), Positives = 167/213 (78.40%), Query Frame = 1

Query: 30  KEQECLCLLKKCKSLEEFKQVHVQILKLGLFWDS-FCSSSLLSTCALSDW-SSMDYACSI 89
           KEQECL LLK+C +++EFKQVH + +KL LF+ S F +SS+L+ CA S W +SM+YA SI
Sbjct: 29  KEQECLYLLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASI 88

Query: 90  FQQLDEPTTFHFNTMIKGYVNNMNFESALNLYADMFQREVEPDNFTYPVVLKACARLAAI 149
           F+ +D+P TF FNTMI+GYVN M+FE AL  Y +M QR  EPDNFTYP +LKAC RL +I
Sbjct: 89  FRGIDDPCTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLKACTRLKSI 148

Query: 150 EEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCAVFRRMEEKSVASWSAIISAH 209
            EG QIHG VFKLGLE D+FVQNSLINMYG+CG++ELS AVF ++E K+ ASWS+++SA 
Sbjct: 149 REGKQIHGQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSAR 208

Query: 210 ARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 241
           A +G+W ECL+LF  M +E   +AEES +VS +
Sbjct: 209 AGMGMWSECLLLFRGMCSETNLKAEESGMVSAL 241

BLAST of Cp4.1LG15g01850 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 142.9 bits (359), Expect = 2.6e-34
Identity = 85/238 (35.71%), Positives = 130/238 (54.62%), Query Frame = 1

Query: 35  LCLLKKCKSLEEFKQVHVQILKLGLFWDSFCSSSLLSTCALSD-WSSMDYACSIFQQLDE 94
           L LL  CK+L+  + +H Q++K+GL   ++  S L+  C LS  +  + YA S+F+ + E
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 96

Query: 95  PTTFHFNTMIKGYVNNMNFESALNLYADMFQREVEPDNFTYPVVLKACARLAAIEEGMQI 154
           P    +NTM +G+  + +  SAL LY  M    + P+++T+P VLK+CA+  A +EG QI
Sbjct: 97  PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 156

Query: 155 HGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCAVFRR---------------------- 214
           HGHV KLG + DL+V  SLI+MY + G +E +  VF +                      
Sbjct: 157 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 216

Query: 215 ---------MEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 241
                    +  K V SW+A+IS +A  G ++E L LF+DM      R +ES +V+VV
Sbjct: 217 ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNV-RPDESTMVTVV 273

BLAST of Cp4.1LG15g01850 vs. TAIR10
Match: AT4G18840.1 (AT4G18840.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 137.5 bits (345), Expect = 1.1e-32
Identity = 79/216 (36.57%), Positives = 121/216 (56.02%), Query Frame = 1

Query: 21  QSLDLSLKQKEQ------------ECLCLLKKCKSLEEFKQVHVQILKLGLFWDSFCSSS 80
           Q+ +L L QKE               L   ++ KSL E +Q H  +LK GLF D+F +S 
Sbjct: 17  QAYNLRLLQKENLKKMSVCSSTPVPILSFTERAKSLTEIQQAHAFMLKTGLFHDTFSASK 76

Query: 81  LLSTCALS-DWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYADMFQREV 140
           L++  A + +  ++ YA SI  ++  P  F  N++I+ Y N+   E AL ++ +M    V
Sbjct: 77  LVAFAATNPEPKTVSYAHSILNRIGSPNGFTHNSVIRAYANSSTPEVALTVFREMLLGPV 136

Query: 141 EPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCA 200
            PD +++  VLKACA     EEG QIHG   K GL  D+FV+N+L+N+YG+ G  E++  
Sbjct: 137 FPDKYSFTFVLKACAAFCGFEEGRQIHGLFIKSGLVTDVFVENTLVNVYGRSGYFEIARK 196

Query: 201 VFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDM 224
           V  RM  +   SW++++SA+   GL  E   LF++M
Sbjct: 197 VLDRMPVRDAVSWNSLLSAYLEKGLVDEARALFDEM 232

BLAST of Cp4.1LG15g01850 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 135.6 bits (340), Expect = 4.2e-32
Identity = 63/193 (32.64%), Positives = 119/193 (61.66%), Query Frame = 1

Query: 35  LCLLKKCKSLEEFKQVHVQILKLGLFWDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEP 94
           + L+++C SL + KQ H  +++ G F D + +S L +  ALS ++S++YA  +F ++ +P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 95  TTFHFNTMIKGYVNNMNFESALNLYADMF-QREVEPDNFTYPVVLKACARLAAIEEGMQI 154
            +F +NT+I+ Y +  +   ++  + DM  + +  P+ +T+P ++KA A ++++  G  +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 155 HGHVFKLGLEDDLFVQNSLINMYGKCGDIELSCAVFRRMEEKSVASWSAIISAHARVGLW 214
           HG   K  +  D+FV NSLI+ Y  CGD++ +C VF  ++EK V SW+++I+   + G  
Sbjct: 154 HGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSP 213

Query: 215 RECLMLFEDMSTE 227
            + L LF+ M +E
Sbjct: 214 DKALELFKKMESE 226

BLAST of Cp4.1LG15g01850 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 134.8 bits (338), Expect = 7.1e-32
Identity = 73/201 (36.32%), Positives = 118/201 (58.71%), Query Frame = 1

Query: 46  EFKQVHVQILKLGLFWDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKG 105
           + KQ+H ++L LGL +  F  + L+   A S +  + +A  +F  L  P  F +N +I+G
Sbjct: 36  QLKQIHARLLVLGLQFSGFLITKLIH--ASSSFGDITFARQVFDDLPRPQIFPWNAIIRG 95

Query: 106 YVNNMNFESALNLYADMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDD 165
           Y  N +F+ AL +Y++M    V PD+FT+P +LKAC+ L+ ++ G  +H  VF+LG + D
Sbjct: 96  YSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDAD 155

Query: 166 LFVQNSLINMYGKCGDIELSCAVFR--RMEEKSVASWSAIISAHARVGLWRECLMLFED- 225
           +FVQN LI +Y KC  +  +  VF    + E+++ SW+AI+SA+A+ G   E L +F   
Sbjct: 156 VFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQM 215

Query: 226 --MSTEGCWRAEESILVSVVC 242
             M  +  W A  S+L +  C
Sbjct: 216 RKMDVKPDWVALVSVLNAFTC 234

BLAST of Cp4.1LG15g01850 vs. NCBI nr
Match: gi|659132121|ref|XP_008466029.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g31920 [Cucumis melo])

HSP 1 Score: 419.9 bits (1078), Expect = 3.2e-114
Identity = 203/240 (84.58%), Positives = 224/240 (93.33%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLF 60
           MMGTSVLN+NHHLLPSKD+ QS +L+LKQKEQE L LLKKCKSLEEFKQVHVQILK GLF
Sbjct: 1   MMGTSVLNYNHHLLPSKDLPQSSELNLKQKEQEFLRLLKKCKSLEEFKQVHVQILKFGLF 60

Query: 61  WDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYA 120
            DSFCSSS+L+TCALSDW+SMDYACSIFQQLDEPTTF FNTMI+GYVNNMNFE+A+ LY 
Sbjct: 61  LDSFCSSSILATCALSDWNSMDYACSIFQQLDEPTTFDFNTMIRGYVNNMNFENAIYLYN 120

Query: 121 DMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCG 180
           DM QREVEPDNFTYPVVLKACARLAAI+EGMQIHGHVFKLGLEDD+FVQNSLINMYGKC 
Sbjct: 121 DMLQREVEPDNFTYPVVLKACARLAAIQEGMQIHGHVFKLGLEDDVFVQNSLINMYGKCR 180

Query: 181 DIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 240
           DI++SCA+FRRME+KSVASWSAII+AHA + +W ECL LFEDMS EGCWRAEESILV+V+
Sbjct: 181 DIKMSCAIFRRMEQKSVASWSAIIAAHASLAMWWECLALFEDMSREGCWRAEESILVNVL 240

BLAST of Cp4.1LG15g01850 vs. NCBI nr
Match: gi|778687802|ref|XP_011652628.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g31920 [Cucumis sativus])

HSP 1 Score: 417.5 bits (1072), Expect = 1.6e-113
Identity = 202/241 (83.82%), Positives = 224/241 (92.95%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDI-QQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGL 60
           MMGTSVLN+NHHLLPSKD+ Q S +L+LKQKEQE LCL+KKCKSLEEFKQVHVQILK GL
Sbjct: 1   MMGTSVLNYNHHLLPSKDLPQSSSELNLKQKEQEYLCLVKKCKSLEEFKQVHVQILKFGL 60

Query: 61  FWDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLY 120
           F DSFCSSS+L+TCALSDW+SMDYACSIFQQLDEPTTF FNTMI+GYVNNMNFE+A+ LY
Sbjct: 61  FLDSFCSSSVLATCALSDWNSMDYACSIFQQLDEPTTFDFNTMIRGYVNNMNFENAIYLY 120

Query: 121 ADMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKC 180
            DM QREVEPDNFTYPVVLKACARLA I+EGMQIHGHVFKLGLEDD++VQNSLINMYGKC
Sbjct: 121 NDMLQREVEPDNFTYPVVLKACARLAVIQEGMQIHGHVFKLGLEDDVYVQNSLINMYGKC 180

Query: 181 GDIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSV 240
            DIE+SCA+FRRME+KSVASWSAII+AHA + +W ECL LFEDMS EGCWRAEESILV+V
Sbjct: 181 RDIEMSCAIFRRMEQKSVASWSAIIAAHASLAMWWECLALFEDMSREGCWRAEESILVNV 240

BLAST of Cp4.1LG15g01850 vs. NCBI nr
Match: gi|1009158478|ref|XP_015897312.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like [Ziziphus jujuba])

HSP 1 Score: 342.0 bits (876), Expect = 8.4e-91
Identity = 158/240 (65.83%), Positives = 199/240 (82.92%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLF 60
           M+GT+VLN  H LLP+KD  Q+ + +L  KEQECL LLK+CKS+EEFK+VHV  +K GLF
Sbjct: 11  MIGTTVLNQTHLLLPTKDPPQNPEFNLSLKEQECLSLLKRCKSIEEFKRVHVHFIKFGLF 70

Query: 61  WDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYA 120
           W SFC+ +L++TCALSDW S+DYACSIFQQ+DEP TF +NTMI+G+V  MN+  AL LY 
Sbjct: 71  WGSFCAGNLVATCALSDWGSLDYACSIFQQIDEPDTFLYNTMIRGHVKGMNWGQALLLYH 130

Query: 121 DMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCG 180
           +M +R VEPDNFTYP +LKAC+ L  +E+G QIHGH+FKLGL+DD+FVQNSLINMYGKC 
Sbjct: 131 EMLERGVEPDNFTYPALLKACSLLRFLEDGKQIHGHIFKLGLQDDVFVQNSLINMYGKCK 190

Query: 181 DIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 240
           + +LSCAVF +M +K++ASWSAII+AHA +G+W ECL+LF DM +EG WR EESILVSV+
Sbjct: 191 ETDLSCAVFEQMNQKTIASWSAIIAAHASLGMWSECLILFGDMRSEGYWRPEESILVSVL 250

BLAST of Cp4.1LG15g01850 vs. NCBI nr
Match: gi|1009174393|ref|XP_015868324.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like [Ziziphus jujuba])

HSP 1 Score: 341.7 bits (875), Expect = 1.1e-90
Identity = 158/240 (65.83%), Positives = 198/240 (82.50%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLF 60
           M+GT+VLN  H LLP+KD  Q+ + +L  KEQECL LLK+CKS+EEFK+VHV  +K GLF
Sbjct: 11  MIGTTVLNQTHLLLPTKDPPQNPEFNLSLKEQECLSLLKRCKSIEEFKRVHVHFIKFGLF 70

Query: 61  WDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYA 120
           W SFC  +L++TCALSDW S+DYACSIFQQ+DEP TF +NTMI+G+V  MN+  AL LY 
Sbjct: 71  WGSFCEGNLVATCALSDWGSLDYACSIFQQIDEPDTFLYNTMIRGHVKGMNWGQALLLYH 130

Query: 121 DMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCG 180
           +M +R VEPDNFTYP +LKAC+ L  +E+G QIHGH+FKLGL+DD+FVQNSLINMYGKC 
Sbjct: 131 EMLERGVEPDNFTYPALLKACSLLRFLEDGKQIHGHIFKLGLQDDVFVQNSLINMYGKCK 190

Query: 181 DIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 240
           + +LSCAVF +M +K++ASWSAII+AHA +G+W ECL+LF DM +EG WR EESILVSV+
Sbjct: 191 ETDLSCAVFEQMNQKTIASWSAIIAAHASLGMWSECLILFGDMRSEGFWRPEESILVSVL 250

BLAST of Cp4.1LG15g01850 vs. NCBI nr
Match: gi|703152375|ref|XP_010110391.1| (hypothetical protein L484_022794 [Morus notabilis])

HSP 1 Score: 340.5 bits (872), Expect = 2.4e-90
Identity = 165/240 (68.75%), Positives = 197/240 (82.08%), Query Frame = 1

Query: 1   MMGTSVLNHNHHLLPSKDIQQSLDLSLKQKEQECLCLLKKCKSLEEFKQVHVQILKLGLF 60
           M GTSVLN  H LLP+K+  QS +  L  KEQECL LLK+CKS+ E KQ+HVQILK+GL 
Sbjct: 1   MTGTSVLNQTHLLLPAKEPIQSPEFHLSLKEQECLSLLKRCKSVRELKQIHVQILKIGLL 60

Query: 61  WDSFCSSSLLSTCALSDWSSMDYACSIFQQLDEPTTFHFNTMIKGYVNNMNFESALNLYA 120
            DSFC+ +L++TCALSDW SMDYACSIF+ + EP TF FNTM++G+V + N+  AL LY 
Sbjct: 61  GDSFCAGNLVATCALSDWGSMDYACSIFRHVKEPQTFLFNTMMRGHVKDGNWGQALILYF 120

Query: 121 DMFQREVEPDNFTYPVVLKACARLAAIEEGMQIHGHVFKLGLEDDLFVQNSLINMYGKCG 180
           DM +  VEPDNFTYPV+LKACARL+A EEGMQIHGH  KLGL+ DLFVQNSLINMYGKCG
Sbjct: 121 DMLKSGVEPDNFTYPVLLKACARLSATEEGMQIHGHTSKLGLQGDLFVQNSLINMYGKCG 180

Query: 181 DIELSCAVFRRMEEKSVASWSAIISAHARVGLWRECLMLFEDMSTEGCWRAEESILVSVV 240
            IEL+CAVF +M++KSVASW AII+AHA +G+W ECL+LF DM+ EGCWRAEES LVSV+
Sbjct: 181 KIELACAVFDQMDQKSVASWGAIIAAHASLGMWWECLVLFGDMNREGCWRAEESTLVSVL 240

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name     E-value    Identity (%)   Description
PPR68_ARATH    1.8e-69    61.03          Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana GN...
PPR21_ARATH    4.7e-33    35.71          Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PP321_ARATH    2.0e-31    36.57          Pentatricopeptide repeat-containing protein At4g18840 OS=Arabidopsis thaliana GN...
PP175_ARATH    7.4e-31    32.64          Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP224_ARATH    1.3e-30    36.32          Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...

vs. TrEMBL
Match Name          E-value     Identity (%)   Description
A0A0A0LE81_CUCSA    1.1e-113    83.82          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G895910 PE=4 SV=1
W9SFP7_9ROSA        1.7e-90     68.75          Uncharacterized protein OS=Morus notabilis GN=L484_022794 PE=4 SV=1
A0A075M181_CAMSI    5.1e-87     64.10          Tetratricopeptide repeat-like superfamily protein OS=Camellia sinensis var. sine...
A0A061G7X6_THECC    1.5e-86     65.00          Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_0166...
A0A0D2PE14_GOSRA    1.9e-86     64.17          Uncharacterized protein OS=Gossypium raimondii GN=B456_004G181600 PE=4 SV=1

vs. TAIR10
Match Name     E-value    Identity (%)   Description
AT1G31920.1    1.0e-70    61.03          Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1    2.6e-34    35.71          Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18840.1    1.1e-32    36.57          Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G29760.1    4.2e-32    32.64          Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1    7.1e-32    36.32          mitochondrial editing factor 22

vs. NCBI nr
Match Name                           E-value     Identity (%)   Description
gi|659132121|ref|XP_008466029.1|     3.2e-114    84.58          PREDICTED: pentatricopeptide repeat-containing protein At1g31920 [Cucumis melo]
gi|778687802|ref|XP_011652628.1|     1.6e-113    83.82          PREDICTED: pentatricopeptide repeat-containing protein At1g31920 [Cucumis sativu...
gi|1009158478|ref|XP_015897312.1|    8.4e-91     65.83          PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like [Ziziphus ...
gi|1009174393|ref|XP_015868324.1|    1.1e-90     65.83          PREDICTED: pentatricopeptide repeat-containing protein At1g31920-like [Ziziphus ...
gi|703152375|ref|XP_010110391.1|     2.4e-90     68.75          hypothetical protein L484_022794 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0016556        mRNA modification
biological_process    GO:0010075        regulation of meristem growth
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0005488        binding
molecular_function    GO:0008568        microtubule-severing ATPase activity
molecular_function    GO:0008270        zinc ion binding
molecular_function    GO:0005515        protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG15g01850.1    Cp4.1LG15g01850.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description             Source      Source Term        Source Description     Alignment
IPR002885    Pentatricopeptide repeat    PFAM        PF01535            PPR                    coord: 170..196, score: 8.1E-5
                                                                                               coord: 199..228, score: 1.
IPR002885    Pentatricopeptide repeat    PFAM        PF13041            PPR_2                  coord: 94..143, score: 7.6
IPR002885    Pentatricopeptide repeat    TIGRFAMs    TIGR00756          TIGR00756              coord: 99..130, score: 9.6E-7
                                                                                               coord: 168..197, score: 2.4E-5
                                                                                               coord: 199..228, score: 5.
IPR002885    Pentatricopeptide repeat    PROFILE     PS51375            PPR                    coord: 95..129, score: 10.358
                                                                                               coord: 165..199, score: 9.887
                                                                                               coord: 130..164, score: 7.815
                                                                                               coord: 200..230, score: 6
None         No IPR available            PANTHER     PTHR24015          FAMILY NOT NAMED       coord: 23..240, score: 4.2E
None         No IPR available            PANTHER     PTHR24015:SF552    SUBFAMILY NOT NAMED    coord: 23..240, score: 4.2E
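
The PS51375 profile hits listed above tile the protein in consecutive windows, which fits the roughly 35-residue length of a PPR motif. As a small sketch, the snippet below converts the `start..end` coordinates into lengths, assuming 1-based inclusive protein coordinates; `span_length` is a helper name made up for this example.

```python
def span_length(coord: str) -> int:
    """Length of a 'start..end' span, assuming 1-based inclusive coordinates."""
    start, end = (int(part) for part in coord.split(".."))
    return end - start + 1


# PS51375 (PPR profile) hits as listed in the InterPro table of this record.
ps51375_hits = ["95..129", "165..199", "130..164", "200..230"]

# Sort by start position so the repeats read in order along the protein.
ordered = sorted(ps51375_hits, key=lambda c: int(c.split("..")[0]))
lengths = [span_length(c) for c in ordered]
print(list(zip(ordered, lengths)))
```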

The following gene(s) are paralogous to this gene:

None