Cp4.1LG01g17540 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g17540
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG01 : 13592108 .. 13598043 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTTTGATTCAACGCAAACATCACATTTGTGTGTTATGACCGTCAAAAATGGAGAGCAGTAGTCCTAATCTGAAGATTCTATGCCGAGAACAGCAGTGGGTTTCGTGAATCTTGTTGACCGAAGTGGCCGAAGAATCGTAAGTGATTTCCGTTCCTTTTAAATTCTCTCTTCTTAATCAAACGAACAATCTTTGGTTGAAACTCGTAGTGATTAATGTAGGAGTACGAGTTATGAGTTGTAAATTACTTCCCCTTCCAACCTCTCTGAACAGCCTTATCAATTCAGGGCGGACCTTGAAACATGCCACTCAAATTCATTCCCATCTCATCACCACTGCCTTCCTTTCCCTCCCTTTCCTCTTCAATAACCTCCTCAACTTGTACGCCAAATCTGGTTCTCTCGACCAAACCTTGTTGTTCTTCTCCTCAACCCCACATGACTCCAAGAACGTTGTCTCTTGGACTTCTCTTATCACCCAATTTTCCCGTTCTAAAAGACCCTTTCACGCCTTGGCTTTCTTTAACCATATGAGGTGTTCTGGGGTTTATCCCAATCACTACACCTTCTCTGCTGTCTTATCTGCTTGCACTGACACTATGGCTTCTGTTCATGGGGATCAGATGCATTCTTTGATTTGGAAACACGGGTTCCATGCTGACGTCTTTGTTGCCAGTGCGTTACTTGACATGTATGCCAAGTGTTGTGATATGTGCAGGGCCGAGAAGGTGTTCGAGGAAATGCCCCTGAGAAACCTAGTGTCTTGGAACTCTATGATTGTGGGATTCTTGCAGAACAAGCTATATGATCGGGCCATTCTCTTCTTTAAGACGCTTCTACTCGAGAGTCTAACTGCTGTTGATGAGGTAAGCTTTTCTAGTGCTTTGAGTGCTTGTGCCAATGTTGGTAACTGGGAGTTTGGGAAACAAGTTCATGGAGTTGCTCTCAAGCTTGGTGTCAGGAATTTGGTTTATGTAAATAACTCGCTGATTGACATGTATAGTAAGTGTGGCTTGTTTGACGACGTTGTGAAGTTGTTTTCAAACATTGGAGCTAGAGATGTTGTTACTTGGAACATTATGATCATGGCATGCGTTTATAATAACAATTATGAAGATGCCTGCAATAATTTTTGGATGATGAGGAGGGAAGGTTTAATACCTGATGAAGCGTCATACTCCTCTGTTCTTCATTCTTGTGCAAATCTTGCAGCATTATATCAGGGAGCACTCATCCATAATCAGATCATAAAATCTGGATTCGTGAAGAATTTGTGTGTTGCAAGCTCTTTGGTCACGATGTATGCAAAGTGCGGCAGCTTGGTAGATGCTTTTCAGATATTTGAAGAGACTGTGGACCGTAATGTGGTTTGTTGGACAGCCATAATTGCAGCTTGTCAACAACACGGTCATGCTAACCAGGTCGTTCAGTTATTCGAGCAAATGTTGAGAGAGGGGATTAAACCTGACTATATTACTTTTGTTTCTGTTCTCTCTGCTTGCAGCCACACTGGTCGGGTTGAAGAAGGGTTCTTCTACTTTAATTCAATGACTAAAGTGCATGGTATTTACCCTGGATATGAGCATTATGCGTGTATAGTCGACTTGCTTGGCCGTGCTGGGCAGTTGGATAGAGCTAAGAGGTTTATAGAACTGATGCCTATCAAACCAGACGCCTCTGTATGGGGTGCTCTGCTTAGTGCTTGCAGGAATCATAGCAACCTTGAAATGGGCAAGGAAGTGGCTCAGGAACTTTTTGAATTGGAACCAGATAATCCTGGAAATTATGTGCTGCTTTGTAACATCTTGACACGTAATGGGATGTTACGCGAGGCCGACGAGGTTAGAAGAAAGATGGAAGCCATTGGAGTGAGGAAGGAACCAGGATGCAGCTGGATTGACATAAAGAATTCAACATATGTGTTCACAGTGCATGATAAGTCGCATGAGAAAACGCAGGAGATTTACGAGATGTTGGAGAAGGTAAAGGAGTTAGTAAA
GAAGAAAGGGTATGTGGCTGAAACTGAATTTGCAATAAACATGGCAGAAGCATACAAGGAGCAGAGTTTGTGGTACCATAGCGAGAAACTAGCTCTTGCACTTGGGCTGCTCAGCCTTCCCGCCGGTGCTCCAATTAGGATAAAAAAGAACTTAAGAACTTGTGGAGACTGTCATACCGTAATGAAGTTTGCATCAGAAGTTTTTGGGCGAGAGATTATAGTAAGAGACATAAGCAGATTTCATCATTTCACCAATGGCATTTGTTCTTGTGGAGATTACTGGTGATTGATTGGAGGTTGATGATCCGGTGGATCACCCTTAGGAGCCCATTGAGCTAAAACTAGCTCATGAAACACCAGTACAACAAATATAACTTTTGAAGTACGGTCTAGAGTTGCAAGATTAATTTAGTTAGAGTTGGTAATATGGATCATCAAGTTTTTGTTCATCTTGATTTGTTCTTCTATTCATCATTGTTACGTTTCAATTGAGGCAACTGTGATTTATCTGATATCATAATGCTTGCAGTGTGAACGCAGGGATACAAATTGGACACTGTTGAATACTCAAATGTGTTCCCCATCAGAAATTCACTGCAGCAAGGCAATTTAAACACAATGGGGAGCTAACTTTCAACATTTGATTAGGTATTATACCTTCCACCTTCAATACTACACCTTCATGTATGCTAACTCATGAAACATTGAGTTTGTCCCTTAATCTACAAGTCATCAAAGATTATCATCAACTATAAGTGCTCACATAGCCCTTGTGTAATAGCTTGAAACTCCACATTTTATATTTTATATTTTATATTTTATATTTTTTACTTCATCATATCACTAAATTACCTCTAAGATGAGTATATACCTTGTCGTAGATCTTTATCTACTATTAATCCAACATAGTCAATGTCCGTGTAAGCTTCCAATGCAAGGGGATGGTTTTTGTTTAAATAAGGCTCCTTTTCCTGAAGTCTCTTGCAAGTAATGTAATACTCTATATGTTGTCTGAAGATGACATCTTCGGTAATGCATAAATTGACTAATTACACTTACGGAGTATGCTATGTCATGTCGAATGTGAGTTAGATATACAAGTCTCCCCAATAGCCTCTAGTAAAACTCTGTTCATTGTAAGTTCTTTTGACACTCCCCATCTTTGATGTGGATCCATTTCCATTGCATGATAAGATAAGGAATCACTACAAGGGGAGTAAGATTTGGATTACCTTGTTGATCGAAGATCTTAAGGCAAGAACACTTGTTTGAGATTCGAATCACTCCACAAGCAAGATCGATCATGTCTAGCTTGAATGATTCTTGTTGATCAAATATCTCAGCGCAAGACCACTTGTTTGAGATTCGAATCACTCCACAAGCAAGATTGATCATGTCGAGCTTGAATGATTCTACATGCAACCTAAACTACATAGAATTGCAAAGAAACTTAGCCATTGGCTAAAGAAAAGCACAAATGCTTCTTTTACTACATTTTCCAACTACCTTACAAATACAACATACATGGCTTTATATAGTCTCAAAATGAAACTATTAAAGACATTCCAAGAGTTGTAACATTCATACGTAATGACCATAATTAACCATTATGTAATTGTAACCTAAAGTAAATAAAAACTCTTAAACTACGTTAATGAAATACAATAACTCTAAATTGTAATCCACCCAAAATTTATCATATGAAACTTTATTCTTCTTCAATGTGGCATGAATTGAAATATCTTTTGATAATTTCAACAATATTTTCTTCACATCTTCATTGAAGCATATTGTATGATTTATGTCTCATTCTGCTTTTTCTTATTTCCATTCATAAATTTGTTACTTATTTCTATTGTGATATGCAAATGTCCTAAGTTGAGTATGCATCTTCCTTGTTGAGAAAGTACTTTAGCCTTCTAGGTTATTTGATCTCAAATTCTCTCATTAATTTCCTTCTAAGTTCAATCTTTTCTTCTTTATTATCTCCCGTAACTATG
ATATCATCTACATACACTAGAAGTATTGTAACTCTTCTTGGTTTTGAGTGCTTGATAGATAGAGTAATCTCCTTAACTCTACATATACTCATCGTCTTTCATTACCTTCGTGAATCTTCATAACCACACACTTAGGTGATTGTTTTAATTCATACCGGACTTAGACTGCACACTTTACCTCTTGAGCTTGTTAGGATCGCACAACAACGCACACACTCGATCTAGATGAACACAAAAAATAGGATAGAGAAAATGCAAGGAGAATATTGGCTAAAGGTTTATATTGATGACTTCAGACATGTAGAGTAAGAGTGTAACGCCTCAAATCCACAGCTAGCCGATATTGTTCTCTTTGGGCTGCCCCTCAAGGCTTTAAAACGCGTATGCTAGGGGAAGGTTTCCACACAGTTGGGGAGGAGAACAAACCACCCTTTGGGGTCCAGCGTCCTCGCTGGCATCTTTCCTTCCTCCAATCGATGTGGGACCGCCCCCAAATCCACCCCTCTCTGGGGCCCAGCGTCCTTACTGGCACACCGCCTCTTCGGGGAACAGCGAGAAGGCTGACACATCGTCTGGTGTCTGGCTCTGATACCATTTGTAACGTCCCAAATCCACCGCTAGTAGATATTATCCTCTTTGAGCTTTCCCTTTCGGGCTTTCCCTCAAGGCTTTAAAACGTGTCGGGCTTTCCCTCAAGGCTTTAAAACGTGTCGGGCTTTCCCTCAAGGCTTTAAAACGTGTCGGGCTTTCCCTCAAGGCTTTAAAACGTGTCTGCTAGGGGAAGGTTTTCACACCCTTATAAAGGGTGGTTTGTTCTCCTTCCTCCAATAGAGTGCCAGAGAGGACGCTGGGGTCCGAAGGGGGGTGGATTGTGATGTCCCACATTGGTTGGGAAGGAGAACAAACCACCATTTATAAGGGTGTGGAAACCTTCCCCTAGCAGACGCGTTTTAAAGCCTTGAGTGGAAGCCCGAAAGGGAAAACCCAAAGAGGACAATATCTGTTAGCGGTGGATCTGGGTCGTTACAAAGAGAGAATACAGAGAATACCTAAAGTACATTTAGGGATATATATGGGTTTAGCTTACTATATATAACTTTACTAAATATAACCATCTACCTATATAATTATTTACCATATATAGCTTTACCTTTTTGCTATTTACTATTTATGGCAATTTACTATATATCATTTACCAAATATAGTTTTACTATGTATTATCGACTAAGCTATAAATAAGCAGCAGACTCCATAGCTTACAAATCCTAAAGGAATGCACTCATAGCTTTCCTCGTCTAGATCACTGCAAAAGAACACATTCTTACATGTGTAAGAGCTGCCAATTTAAGTTGGCAACTAAAGATAGCAATATACTTGTAGTGTTCATCTTTGTTACTAAGGCAAATGTCTCCCAGTAACTATGTCGTCAAGTTTGAATAGATCCCTTAGCTACTAACCTTACATTAAGAGAGGCAAGGTTATTCAGTGTGTCCAAAGGGAGGCTTTAGTGAGAACAATTTACACTCGGTGAATGTCACATCAATTGAGATGTAAGACTTCCTATATTTAAGATTGTAACATTTCGTACCCCTTTTTGGTAGAAGAATATCCTAAAAAGACACATTTTATTGCTCTTTGAGTGTTGTTCAAGGCTAGGCACCAGAACATGGGGGTGTGAGGTCATTTGATATTAGAAAGTGAAGATAAAATTGGTTGCACACTTTTACTTGGAAAGATGCACTAATACCATCTTTTACTTCGATGACCTTCTTTCCTTGTTTCTTGTCTTCTTCAGTACCCAATTTTCAATCATTGGGTTCTTGACCTCTTCTTGACTGTACACCAACACCTGTCACAGAACAAGAATAGCATGCTCTTTCACACACACAGAGGTGTAATAATGAAAACTAAGAAGCGCCTGAAGAAACTTAAGTAG

mRNA sequence

ATGGATTCTATGCCGAGAACAGCAGTGGGTTTCGTGAATCTTGTTGACCGAAGTGGCCGAAGAATCGTAAGAGTACGAGTTATGAGTTGTAAATTACTTCCCCTTCCAACCTCTCTGAACAGCCTTATCAATTCAGGGCGGACCTTGAAACATGCCACTCAAATTCATTCCCATCTCATCACCACTGCCTTCCTTTCCCTCCCTTTCCTCTTCAATAACCTCCTCAACTTGTACGCCAAATCTGGTTCTCTCGACCAAACCTTGTTGTTCTTCTCCTCAACCCCACATGACTCCAAGAACGTTGTCTCTTGGACTTCTCTTATCACCCAATTTTCCCGTTCTAAAAGACCCTTTCACGCCTTGGCTTTCTTTAACCATATGAGTGCTTTGAGTGCTTGTGCCAATGTTGGTAACTGGGAGTTTGGGAAACAAGTTCATGGAGTTGCTCTCAAGCTTGGTGTCAGGAATTTGGTTTATGTAAATAACTCGCTGATTGACATGTATAGTAAGTGTGGCTTGTTTGACGACGTTGTGAAGTTGTTTTCAAACATTGGAGCTAGAGATGTTGTTACTTGGAACATTATGATCATGGCATGCGTTTATAATAACAATTATGAAGATGCCTGCAATAATTTTTGGATGATGAGGAGGGAAGGTTTAATACCTGATGAAGCGTCATACTCCTCTTACCCAATTTTCAATCATTGGGTTCTTGACCTCTTCTTGACTGTACACCAACACCTGTCACAGAACAAGAATAGCATGCTCTTTCACACACACAGAGGTGTAATAATGAAAACTAAGAAGCGCCTGAAGAAACTTAAGTAG

Coding sequence (CDS)

ATGGATTCTATGCCGAGAACAGCAGTGGGTTTCGTGAATCTTGTTGACCGAAGTGGCCGAAGAATCGTAAGAGTACGAGTTATGAGTTGTAAATTACTTCCCCTTCCAACCTCTCTGAACAGCCTTATCAATTCAGGGCGGACCTTGAAACATGCCACTCAAATTCATTCCCATCTCATCACCACTGCCTTCCTTTCCCTCCCTTTCCTCTTCAATAACCTCCTCAACTTGTACGCCAAATCTGGTTCTCTCGACCAAACCTTGTTGTTCTTCTCCTCAACCCCACATGACTCCAAGAACGTTGTCTCTTGGACTTCTCTTATCACCCAATTTTCCCGTTCTAAAAGACCCTTTCACGCCTTGGCTTTCTTTAACCATATGAGTGCTTTGAGTGCTTGTGCCAATGTTGGTAACTGGGAGTTTGGGAAACAAGTTCATGGAGTTGCTCTCAAGCTTGGTGTCAGGAATTTGGTTTATGTAAATAACTCGCTGATTGACATGTATAGTAAGTGTGGCTTGTTTGACGACGTTGTGAAGTTGTTTTCAAACATTGGAGCTAGAGATGTTGTTACTTGGAACATTATGATCATGGCATGCGTTTATAATAACAATTATGAAGATGCCTGCAATAATTTTTGGATGATGAGGAGGGAAGGTTTAATACCTGATGAAGCGTCATACTCCTCTTACCCAATTTTCAATCATTGGGTTCTTGACCTCTTCTTGACTGTACACCAACACCTGTCACAGAACAAGAATAGCATGCTCTTTCACACACACAGAGGTGTAATAATGAAAACTAAGAAGCGCCTGAAGAAACTTAAGTAG

Protein sequence

MDSMPRTAVGFVNLVDRSGRRIVRVRVMSCKLLPLPTSLNSLINSGRTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLITQFSRSKRPFHALAFFNHMSALSACANVGNWEFGKQVHGVALKLGVRNLVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRREGLIPDEASYSSYPIFNHWVLDLFLTVHQHLSQNKNSMLFHTHRGVIMKTKKRLKKLK
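
The protein sequence above should follow from the coding sequence by codon-wise translation with the standard genetic code. The following is a minimal sketch of that check, not the pipeline used by the database. The helper name `translate` and the choice to test only the first 30 nucleotides of the CDS are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed in this record.
cds_prefix = "ATGGATTCTATGCCGAGAACAGCAGTGGGT"
print(translate(cds_prefix))  # MDSMPRTAVG, the first ten residues of the protein
```

Passing the full CDS string to the same function gives a way to compare the complete translation against the listed protein.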
BLAST of Cp4.1LG01g17540 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 3.2e-22
Identity = 65/200 (32.50%), Positives = 103/200 (51.50%), Query Frame = 1

Query: 43  INSGRTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVV 102
           +++ R +    +IH + + + F SL  +   L+++YAK GSL+     F       +NVV
Sbjct: 246 VSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLE--RNVV 305

Query: 103 SWTSLITQFSRSKRPFHALAFFNHM-------------SALSACANVGNWEFGKQVHGVA 162
           SW S+I  + +++ P  A+  F  M              AL ACA++G+ E G+ +H ++
Sbjct: 306 SWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLS 365

Query: 163 LKLGVRNLVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDAC 222
           ++LG+   V V NSLI MY KC   D    +F  + +R +V+WN MI+    N    DA 
Sbjct: 366 VELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDAL 425

Query: 223 NNFWMMRREGLIPDEASYSS 230
           N F  MR   + PD  +Y S
Sbjct: 426 NYFSQMRSRTVKPDTFTYVS 443
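
The percentages in an HSP summary line are the matching counts divided by the alignment length. The sketch below recomputes them from the summary line of the alignment above. The function name `check_hsp_percentages` is hypothetical, and the pattern is written to also accept the "Postives" spelling that some BLAST report renderings use.

```python
import re

# Matches e.g. "Identity = 65/200 (32.50%)" and "Positives = 103/200 (51.50%)".
HSP_PATTERN = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def check_hsp_percentages(line: str) -> dict:
    """Return {label: (recomputed_percent, reported_percent)} for one HSP summary line."""
    results = {}
    for label, num, den, reported in HSP_PATTERN.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        computed = round(100 * int(num) / int(den), 2)
        results[key] = (computed, float(reported))
    return results


summary = "Identity = 65/200 (32.50%), Positives = 103/200 (51.50%), Query Frame = 1"
for key, (computed, reported) in check_hsp_percentages(summary).items():
    print(key, computed, reported, "OK" if computed == reported else "MISMATCH")
```

The same check can be run over every HSP summary line in the record to spot transcription errors in the reported percentages.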

BLAST of Cp4.1LG01g17540 vs. Swiss-Prot
Match: PP252_ARATH (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 5.4e-22
Identity = 63/188 (33.51%), Positives = 99/188 (52.66%), Query Frame = 1

Query: 55  IHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLITQFSRS 114
           +H+H++ + F     + N LLN+YAK GSL++    F   P   ++ V+WT+LI+ +S+ 
Sbjct: 82  VHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQ--RDFVTWTTLISGYSQH 141

Query: 115 KRPFHALAFFNHM-------------SALSACANVGNWEFGKQVHGVALKLGVRNLVYVN 174
            RP  AL FFN M             S + A A       G Q+HG  +K G  + V+V 
Sbjct: 142 DRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVG 201

Query: 175 NSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRREGLI 230
           ++L+D+Y++ GL DD   +F  + +R+ V+WN +I      +  E A   F  M R+G  
Sbjct: 202 SALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFR 261

BLAST of Cp4.1LG01g17540 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 100.5 bits (249), Expect = 3.0e-20
Identity = 60/187 (32.09%), Positives = 101/187 (54.01%), Query Frame = 1

Query: 55  IHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLITQFSRS 114
           IHS +I + F SL ++ N+LL+LYA  G +      F   P   K++V+W S+I  F+ +
Sbjct: 143 IHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPE--KDLVAWNSVINGFAEN 202

Query: 115 KRPFHALAFFNHM-------------SALSACANVGNWEFGKQVHGVALKLGVRNLVYVN 174
            +P  ALA +  M             S LSACA +G    GK+VH   +K+G+   ++ +
Sbjct: 203 GKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSS 262

Query: 175 NSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNF-WMMRREGL 228
           N L+D+Y++CG  ++   LF  +  ++ V+W  +I+    N   ++A   F +M   EGL
Sbjct: 263 NVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGL 322

BLAST of Cp4.1LG01g17540 vs. Swiss-Prot
Match: PP353_ARATH (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 3.9e-20
Identity = 53/194 (27.32%), Positives = 97/194 (50.00%), Query Frame = 1

Query: 47  RTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTS 106
           + ++   +IH H++     S   L+++L+++Y K G +D+    F       K+VVSWTS
Sbjct: 232 KCIRRGKEIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVE--KDVVSWTS 291

Query: 107 LITQFSRSKRPFHALAFFNHM-------------SALSACANVGNWEFGKQVHGVALKLG 166
           +I ++ +S R     + F+ +               L+ACA++   E GKQVHG   ++G
Sbjct: 292 MIDRYFKSSRWREGFSLFSELVGSCERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVG 351

Query: 167 VRNLVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFW 226
                + ++SL+DMY+KCG  +    +       D+V+W  +I  C  N   ++A   F 
Sbjct: 352 FDPYSFASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFD 411

Query: 227 MMRREGLIPDEASY 228
           ++ + G  PD  ++
Sbjct: 412 LLLKSGTKPDHVTF 423

BLAST of Cp4.1LG01g17540 vs. Swiss-Prot
Match: PPR8_ARATH (Pentatricopeptide repeat-containing protein At1g03540 OS=Arabidopsis thaliana GN=PCMP-E4 PE=2 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 5.1e-20
Identity = 59/190 (31.05%), Positives = 98/190 (51.58%), Query Frame = 1

Query: 47  RTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTS 106
           R LK   +IH  LIT    S   + ++LL++Y K GS+ +    F+      KN VSW++
Sbjct: 279 RRLKQGKEIHGKLITNGIGSNVVVESSLLDMYGKCGSVREARQVFNGM--SKKNSVSWSA 338

Query: 107 LITQFSRSKRPFHALAFFNHM---------SALSACANVGNWEFGKQVHGVALKLGVRNL 166
           L+  + ++     A+  F  M         + L ACA +     GK++HG  ++ G    
Sbjct: 339 LLGGYCQNGEHEKAIEIFREMEEKDLYCFGTVLKACAGLAAVRLGKEIHGQYVRRGCFGN 398

Query: 167 VYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRR 226
           V V ++LID+Y K G  D   +++S +  R+++TWN M+ A   N   E+A + F  M +
Sbjct: 399 VIVESALIDLYGKSGCIDSASRVYSKMSIRNMITWNAMLSALAQNGRGEEAVSFFNDMVK 458

Query: 227 EGLIPDEASY 228
           +G+ PD  S+
Sbjct: 459 KGIKPDYISF 466

BLAST of Cp4.1LG01g17540 vs. TrEMBL
Match: A0A0A0LVV0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G560780 PE=4 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 3.9e-43
Identity = 88/110 (80.00%), Positives = 94/110 (85.45%), Query Frame = 1

Query: 120 ALAFFNHMSALSACANVGNWEFGKQVHGVALKLGVRNLVYVNNSLIDMYSKCGLFDDVVK 179
           AL   +  S  SACAN GN EFGKQVHGVALKLGV NLVY+NNSL DMY KCGLF+DV K
Sbjct: 185 ALDEVSFSSVFSACANAGNLEFGKQVHGVALKLGVWNLVYINNSLSDMYGKCGLFNDVAK 244

Query: 180 LFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRREGLIPDEASYSS 230
           LFSN GARDVVTWNIMIMA VYN+NYEDACN+FWMMRR+G IPDEASYSS
Sbjct: 245 LFSNTGARDVVTWNIMIMAYVYNHNYEDACNSFWMMRRKGSIPDEASYSS 294

BLAST of Cp4.1LG01g17540 vs. TrEMBL
Match: A0A0S3S001_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G378000 PE=4 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 6.4e-38
Identity = 91/192 (47.40%), Positives = 115/192 (59.90%), Query Frame = 1

Query: 49  LKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLI 108
           L    QIH+ L+  AF +  F+   LL++YA  GS+      F   PH  +N+VSW S+I
Sbjct: 143 LSQGQQIHALLLKHAFHTDTFVATALLHMYASCGSMSLAENVFDQMPH--RNLVSWNSMI 202

Query: 109 TQFSRSKRPFHALAFFNHM-----------SALSACANVGNWEFGKQVHGVALKLGVRNL 168
             F ++K    A+ FF  +           S LSACA +    FGKQVHG  +K G+  L
Sbjct: 203 VGFLKNKMYGRAIGFFREVLSLEPDQVSFSSVLSACAGLVELGFGKQVHGSIVKRGLVGL 262

Query: 169 VYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRR 228
           VYV NSL+DMY KCGLF+D  KLF   G RDVVTWN+MI  CV++ N+E AC  F  M R
Sbjct: 263 VYVKNSLVDMYCKCGLFEDATKLFCGGGERDVVTWNVMITGCVHSQNFEQACTFFQAMIR 322

Query: 229 EGLIPDEASYSS 230
           EG+ PDEASYSS
Sbjct: 323 EGVEPDEASYSS 332

BLAST of Cp4.1LG01g17540 vs. TrEMBL
Match: A0A0L9U9B3_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g269300 PE=4 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 6.4e-38
Identity = 91/192 (47.40%), Positives = 115/192 (59.90%), Query Frame = 1

Query: 49  LKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLI 108
           L    QIH+ L+  AF +  F+   LL++YA  GS+      F   PH  +N+VSW S+I
Sbjct: 146 LSQGQQIHALLLKHAFHTDTFVATALLHMYASCGSMSLAENVFDQMPH--RNLVSWNSMI 205

Query: 109 TQFSRSKRPFHALAFFNHM-----------SALSACANVGNWEFGKQVHGVALKLGVRNL 168
             F ++K    A+ FF  +           S LSACA +    FGKQVHG  +K G+  L
Sbjct: 206 VGFLKNKMYGRAIGFFREVLSLEPDQVSFSSVLSACAGLVELGFGKQVHGSIVKRGLVGL 265

Query: 169 VYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRR 228
           VYV NSL+DMY KCGLF+D  KLF   G RDVVTWN+MI  CV++ N+E AC  F  M R
Sbjct: 266 VYVKNSLVDMYCKCGLFEDATKLFCGGGERDVVTWNVMITGCVHSQNFEQACTFFQAMIR 325

Query: 229 EGLIPDEASYSS 230
           EG+ PDEASYSS
Sbjct: 326 EGVEPDEASYSS 335

BLAST of Cp4.1LG01g17540 vs. TrEMBL
Match: A0A0B2PYI8_GLYSO (Pentatricopeptide repeat-containing protein, chloroplastic OS=Glycine soja GN=glysoja_028603 PE=4 SV=1)

HSP 1 Score: 165.2 bits (417), Expect = 1.1e-37
Identity = 91/192 (47.40%), Positives = 115/192 (59.90%), Query Frame = 1

Query: 49  LKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLI 108
           L    QIH+ +    FL+ PF+   LL++YAK GS+      F   PH  +N+VSW S+I
Sbjct: 145 LSEGQQIHALIHKHCFLNDPFVATALLDMYAKCGSMLLAENVFDEMPH--RNLVSWNSMI 204

Query: 109 TQFSRSKRPFHALAFFNHM-----------SALSACANVGNWEFGKQVHGVALKLGVRNL 168
             F ++K    A+  F  +           S LSACA +   +FGKQVHG  +K G+  L
Sbjct: 205 VGFVKNKLYGRAIGVFREVLSLGPDQVSISSVLSACAGLVELDFGKQVHGSIVKRGLVGL 264

Query: 169 VYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRR 228
           VYV NSL+DMY KCGLF+D  KLF   G RDVVTWN+MIM C +  N+E AC  F  M R
Sbjct: 265 VYVKNSLVDMYCKCGLFEDATKLFCGGGDRDVVTWNVMIMGCFHCQNFEQACTYFQAMIR 324

Query: 229 EGLIPDEASYSS 230
           EG+ PDEASYSS
Sbjct: 325 EGVEPDEASYSS 334

BLAST of Cp4.1LG01g17540 vs. TrEMBL
Match: I1JR12_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_03G227900 PE=4 SV=2)

HSP 1 Score: 163.7 bits (413), Expect = 3.2e-37
Identity = 91/192 (47.40%), Positives = 114/192 (59.38%), Query Frame = 1

Query: 49  LKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLI 108
           L    QIH+ +    FL+ PF+   LL++YAK GS+      F   PH  +N+VSW S+I
Sbjct: 148 LSEGQQIHALIHKHCFLNDPFVATALLDMYAKCGSMLLAENVFDEMPH--RNLVSWNSMI 207

Query: 109 TQFSRSKRPFHALAFFNHM-----------SALSACANVGNWEFGKQVHGVALKLGVRNL 168
             F ++K    A+  F  +           S LSACA +   +FGKQVHG  +K G+  L
Sbjct: 208 VGFVKNKLYGRAIGVFREVLSLGPDQVSISSVLSACAGLVELDFGKQVHGSIVKRGLVGL 267

Query: 169 VYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRR 228
           VYV NSL+DMY KCGLF+D  KLF   G RDVVTWN+MIM C    N+E AC  F  M R
Sbjct: 268 VYVKNSLVDMYCKCGLFEDATKLFCGGGDRDVVTWNVMIMGCFRCRNFEQACTYFQAMIR 327

Query: 229 EGLIPDEASYSS 230
           EG+ PDEASYSS
Sbjct: 328 EGVEPDEASYSS 337

BLAST of Cp4.1LG01g17540 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 107.1 bits (266), Expect = 1.8e-23
Identity = 65/200 (32.50%), Positives = 103/200 (51.50%), Query Frame = 1

Query: 43  INSGRTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVV 102
           +++ R +    +IH + + + F SL  +   L+++YAK GSL+     F       +NVV
Sbjct: 246 VSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLE--RNVV 305

Query: 103 SWTSLITQFSRSKRPFHALAFFNHM-------------SALSACANVGNWEFGKQVHGVA 162
           SW S+I  + +++ P  A+  F  M              AL ACA++G+ E G+ +H ++
Sbjct: 306 SWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLS 365

Query: 163 LKLGVRNLVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDAC 222
           ++LG+   V V NSLI MY KC   D    +F  + +R +V+WN MI+    N    DA 
Sbjct: 366 VELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDAL 425

Query: 223 NNFWMMRREGLIPDEASYSS 230
           N F  MR   + PD  +Y S
Sbjct: 426 NYFSQMRSRTVKPDTFTYVS 443

BLAST of Cp4.1LG01g17540 vs. TAIR10
Match: AT3G61170.1 (AT3G61170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 106.7 bits (265), Expect = 2.3e-23
Identity = 69/208 (33.17%), Positives = 107/208 (51.44%), Query Frame = 1

Query: 35  LPTSLNSLINSGRTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSST 94
           +P+ LN    S   +K A+  H  ++ T + +   + N L+++YAK G +D  L  F   
Sbjct: 331 IPSILNCFALSRTEMKIASSAHCLIVKTGYATYKLVNNALVDMYAKRGIMDSALKVFEGM 390

Query: 95  PHDSKNVVSWTSLITQFSRSKRPFHALAFFNHM-------------SALSACANVGNWEF 154
               K+V+SWT+L+T  + +     AL  F +M             S LSA A +   EF
Sbjct: 391 IE--KDVISWTALVTGNTHNGSYDEALKLFCNMRVGGITPDKIVTASVLSASAELTLLEF 450

Query: 155 GKQVHGVALKLGVRNLVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVY 214
           G+QVHG  +K G  + + VNNSL+ MY+KCG  +D   +F+++  RD++TW  +I+    
Sbjct: 451 GQQVHGNYIKSGFPSSLSVNNSLVTMYTKCGSLEDANVIFNSMEIRDLITWTCLIVGYAK 510

Query: 215 NNNYEDACNNFWMMRR-EGLIPDEASYS 229
           N   EDA   F  MR   G+ P    Y+
Sbjct: 511 NGLLEDAQRYFDSMRTVYGITPGPEHYA 536

BLAST of Cp4.1LG01g17540 vs. TAIR10
Match: AT3G24000.1 (AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 106.3 bits (264), Expect = 3.1e-23
Identity = 63/188 (33.51%), Positives = 99/188 (52.66%), Query Frame = 1

Query: 55  IHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLITQFSRS 114
           +H+H++ + F     + N LLN+YAK GSL++    F   P   ++ V+WT+LI+ +S+ 
Sbjct: 82  VHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQ--RDFVTWTTLISGYSQH 141

Query: 115 KRPFHALAFFNHM-------------SALSACANVGNWEFGKQVHGVALKLGVRNLVYVN 174
            RP  AL FFN M             S + A A       G Q+HG  +K G  + V+V 
Sbjct: 142 DRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVG 201

Query: 175 NSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRREGLI 230
           ++L+D+Y++ GL DD   +F  + +R+ V+WN +I      +  E A   F  M R+G  
Sbjct: 202 SALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFR 261

BLAST of Cp4.1LG01g17540 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 100.5 bits (249), Expect = 1.7e-21
Identity = 60/187 (32.09%), Positives = 101/187 (54.01%), Query Frame = 1

Query: 55  IHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLITQFSRS 114
           IHS +I + F SL ++ N+LL+LYA  G +      F   P   K++V+W S+I  F+ +
Sbjct: 143 IHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPE--KDLVAWNSVINGFAEN 202

Query: 115 KRPFHALAFFNHM-------------SALSACANVGNWEFGKQVHGVALKLGVRNLVYVN 174
            +P  ALA +  M             S LSACA +G    GK+VH   +K+G+   ++ +
Sbjct: 203 GKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSS 262

Query: 175 NSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNF-WMMRREGL 228
           N L+D+Y++CG  ++   LF  +  ++ V+W  +I+    N   ++A   F +M   EGL
Sbjct: 263 NVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGL 322

BLAST of Cp4.1LG01g17540 vs. TAIR10
Match: AT4G37170.1 (AT4G37170.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 100.1 bits (248), Expect = 2.2e-21
Identity = 53/194 (27.32%), Positives = 97/194 (50.00%), Query Frame = 1

Query: 47  RTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTS 106
           + ++   +IH H++     S   L+++L+++Y K G +D+    F       K+VVSWTS
Sbjct: 232 KCIRRGKEIHGHIVRAGLDSDEVLWSSLMDMYGKCGCIDEARNIFDKIVE--KDVVSWTS 291

Query: 107 LITQFSRSKRPFHALAFFNHM-------------SALSACANVGNWEFGKQVHGVALKLG 166
           +I ++ +S R     + F+ +               L+ACA++   E GKQVHG   ++G
Sbjct: 292 MIDRYFKSSRWREGFSLFSELVGSCERPNEYTFAGVLNACADLTTEELGKQVHGYMTRVG 351

Query: 167 VRNLVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFW 226
                + ++SL+DMY+KCG  +    +       D+V+W  +I  C  N   ++A   F 
Sbjct: 352 FDPYSFASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFD 411

Query: 227 MMRREGLIPDEASY 228
           ++ + G  PD  ++
Sbjct: 412 LLLKSGTKPDHVTF 423

BLAST of Cp4.1LG01g17540 vs. NCBI nr
Match: gi|778662180|ref|XP_011659482.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 194.9 bits (494), Expect = 1.9e-46
Identity = 111/207 (53.62%), Positives = 134/207 (64.73%), Query Frame = 1

Query: 37  TSLNSLINSGRTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPH 96
           TSLNSL+N  RT KHATQIHS LITTA LSLPFLFNNLLNLYAK GS+DQTLL FSS P 
Sbjct: 31  TSLNSLLNCSRTSKHATQIHSQLITTALLSLPFLFNNLLNLYAKCGSVDQTLLLFSSAPD 90

Query: 97  DSKNVVSWTSLITQFSRSKRPFHALAFFNHM-------------SALSACANVGNWEFGK 156
           DSKNVVSWTSLITQ +R KRPF AL FFNHM             + LSAC +      G+
Sbjct: 91  DSKNVVSWTSLITQLTRFKRPFKALTFFNHMRRSGVYPNHYTFSAVLSACTDTTASVHGE 150

Query: 157 QVHGVALKLGVRNLVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNN 216
           Q+H +  K G    V+V ++L+DMY+KC       K+F  +  R++V+WN MI+  + N 
Sbjct: 151 QMHSLVWKHGFLAEVFVVSALVDMYAKCCDMLMAEKVFEEMPVRNLVSWNTMIVGFLQNK 210

Query: 217 NYEDACNNFWMMRREGLIP-DEASYSS 230
            Y+ A   F  +  E L   DE S+SS
Sbjct: 211 LYDQAIFFFKTLLLENLTALDEVSFSS 237

BLAST of Cp4.1LG01g17540 vs. NCBI nr
Match: gi|659099256|ref|XP_008450508.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like [Cucumis melo])

HSP 1 Score: 191.4 bits (485), Expect = 2.0e-45
Identity = 109/208 (52.40%), Positives = 134/208 (64.42%), Query Frame = 1

Query: 36  PTSLNSLINSGRTLKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTP 95
           PTS N+L+NSGRT KHATQIHS LITTA LSLPFLFNNLLNLYAK G +DQTLL FSS P
Sbjct: 30  PTSFNNLLNSGRTSKHATQIHSQLITTALLSLPFLFNNLLNLYAKCGCVDQTLLLFSSAP 89

Query: 96  HDSKNVVSWTSLITQFSRSKRPFHALAFFNHM-------------SALSACANVGNWEFG 155
             SK+VVSWTSLITQ +RSKRPF AL FFN M             + LSAC +      G
Sbjct: 90  DVSKDVVSWTSLITQLTRSKRPFKALTFFNQMRLSGVYPNHYTLSAVLSACTDTMVSVHG 149

Query: 156 KQVHGVALKLGVRNLVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYN 215
           +Q+H +  K G    V+V ++L+DMY+KC       K+F  +  R++V+WN MI+  + N
Sbjct: 150 EQMHSLVWKHGFHAEVFVVSALVDMYAKCCDMRMAEKVFEEMPVRNLVSWNSMIVGFLQN 209

Query: 216 NNYEDACNNFWMMRREGLIP-DEASYSS 230
             Y+ A   F  +  E L   DE S+SS
Sbjct: 210 KLYDQAIFFFKTLLLENLTALDEVSFSS 237

BLAST of Cp4.1LG01g17540 vs. NCBI nr
Match: gi|700210899|gb|KGN65995.1| (hypothetical protein Csa_1G560780 [Cucumis sativus])

HSP 1 Score: 183.3 bits (464), Expect = 5.6e-43
Identity = 88/110 (80.00%), Positives = 94/110 (85.45%), Query Frame = 1

Query: 120 ALAFFNHMSALSACANVGNWEFGKQVHGVALKLGVRNLVYVNNSLIDMYSKCGLFDDVVK 179
           AL   +  S  SACAN GN EFGKQVHGVALKLGV NLVY+NNSL DMY KCGLF+DV K
Sbjct: 185 ALDEVSFSSVFSACANAGNLEFGKQVHGVALKLGVWNLVYINNSLSDMYGKCGLFNDVAK 244

Query: 180 LFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRREGLIPDEASYSS 230
           LFSN GARDVVTWNIMIMA VYN+NYEDACN+FWMMRR+G IPDEASYSS
Sbjct: 245 LFSNTGARDVVTWNIMIMAYVYNHNYEDACNSFWMMRRKGSIPDEASYSS 294

BLAST of Cp4.1LG01g17540 vs. NCBI nr
Match: gi|720076691|ref|XP_010240804.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial [Nelumbo nucifera])

HSP 1 Score: 172.2 bits (435), Expect = 1.3e-39
Identity = 87/193 (45.08%), Positives = 123/193 (63.73%), Query Frame = 1

Query: 51  HATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLITQ 110
           H  QIHS +    F +  F+ + ++++YAK  ++D     F   P   +N+VSW ++I  
Sbjct: 162 HGQQIHSLIHKHGFGTDVFVGSAMIDMYAKCSNMDSAEKVFDEMPE--RNLVSWNAMIVG 221

Query: 111 FSRSKRPFHALAFFNHM--------------SALSACANVGNWEFGKQVHGVALKLGVRN 170
           FS +K    A+  F  +              S LSACANVG+ +FG+QVHGV +K G+ +
Sbjct: 222 FSHNKIFDRAIDVFKEVLRDKSVSPDQVSFSSVLSACANVGSLDFGRQVHGVVVKHGLMH 281

Query: 171 LVYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMR 230
           L YV NSL+DMY+KCG F D ++LF +I  RDVVTWN+M M CV N+ +E+ACN FW+MR
Sbjct: 282 LAYVKNSLVDMYNKCGCFQDSIQLFGSIPDRDVVTWNVMAMGCVQNDRFEEACNYFWVMR 341

BLAST of Cp4.1LG01g17540 vs. NCBI nr
Match: gi|950948959|ref|XP_014495064.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like [Vigna radiata var. radiata])

HSP 1 Score: 168.7 bits (426), Expect = 1.4e-38
Identity = 92/192 (47.92%), Positives = 116/192 (60.42%), Query Frame = 1

Query: 49  LKHATQIHSHLITTAFLSLPFLFNNLLNLYAKSGSLDQTLLFFSSTPHDSKNVVSWTSLI 108
           L    QIH+ L+  AF +  F+   LL++YA  GS+      F   PH  +N+VSW S+I
Sbjct: 143 LSQGQQIHALLLKHAFYTDTFVATALLHMYASCGSMSFAENVFDQMPH--RNLVSWNSMI 202

Query: 109 TQFSRSKRPFHALAFFNHM-----------SALSACANVGNWEFGKQVHGVALKLGVRNL 168
             F ++K    A+ FF  +           S LSACA +    FGKQVHG  +K G+  L
Sbjct: 203 VGFVKNKMYCRAIGFFREVLSLDPDQVSFSSVLSACAGLVELGFGKQVHGSIVKRGLVGL 262

Query: 169 VYVNNSLIDMYSKCGLFDDVVKLFSNIGARDVVTWNIMIMACVYNNNYEDACNNFWMMRR 228
           VYV NSL+DMY KCGLF+D  KLF   G RDVVTWN+MIM CV++ N+E AC  F  M R
Sbjct: 263 VYVKNSLVDMYCKCGLFEDATKLFCGGGDRDVVTWNVMIMGCVHSQNFEQACTFFQAMIR 322

Query: 229 EGLIPDEASYSS 230
           EG+ PDEASYSS
Sbjct: 323 EGVEPDEASYSS 332

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
PPR32_ARATH   3.2e-22   32.50         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PP252_ARATH   5.4e-22   33.51         Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...
PP330_ARATH   3.0e-20   32.09         Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PP353_ARATH   3.9e-20   27.32         Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana GN...
PPR8_ARATH    5.1e-20   31.05         Pentatricopeptide repeat-containing protein At1g03540 OS=Arabidopsis thaliana GN...
Match Name         E-value   Identity (%)  Description
A0A0A0LVV0_CUCSA   3.9e-43   80.00         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G560780 PE=4 SV=1
A0A0S3S001_PHAAN   6.4e-38   47.40         Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G378000 PE=...
A0A0L9U9B3_PHAAN   6.4e-38   47.40         Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g269300 PE=4 SV=1
A0A0B2PYI8_GLYSO   1.1e-37   47.40         Pentatricopeptide repeat-containing protein, chloroplastic OS=Glycine soja GN=gl...
I1JR12_SOYBN       3.2e-37   47.40         Uncharacterized protein OS=Glycine max GN=GLYMA_03G227900 PE=4 SV=2
Match Name    E-value   Identity (%)  Description
AT1G11290.1   1.8e-23   32.50         Pentatricopeptide repeat (PPR) superfamily protein
AT3G61170.1   2.3e-23   33.17         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G24000.1   3.1e-23   33.51         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21065.1   1.7e-21   32.09         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G37170.1   2.2e-21   27.32         Pentatricopeptide repeat (PPR) superfamily protein
Match Name                         E-value   Identity (%)  Description
gi|778662180|ref|XP_011659482.1|   1.9e-46   53.62         PREDICTED: pentatricopeptide repeat-containing protein At3g24000, mitochondrial-...
gi|659099256|ref|XP_008450508.1|   2.0e-45   52.40         PREDICTED: pentatricopeptide repeat-containing protein At3g24000, mitochondrial-...
gi|700210899|gb|KGN65995.1|        5.6e-43   80.00         hypothetical protein Csa_1G560780 [Cucumis sativus]
gi|720076691|ref|XP_010240804.1|   1.3e-39   45.08         PREDICTED: putative pentatricopeptide repeat-containing protein At3g13770, mitoc...
gi|950948959|ref|XP_014495064.1|   1.4e-38   47.92         PREDICTED: pentatricopeptide repeat-containing protein At3g24000, mitochondrial-...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG01g17540.1   Cp4.1LG01g17540.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description            Source     Source Term   Source Description   Alignment
IPR002885   Pentatricopeptide repeat   PFAM       PF01535       PPR                  coord: 102..127, score: 0.0049
                                                                                     coord: 162..182, score: 0.
IPR002885   Pentatricopeptide repeat   PFAM       PF13041       PPR_2                coord: 188..229, score: 2.
IPR002885   Pentatricopeptide repeat   TIGRFAMs   TIGR00756     TIGR00756            coord: 190..224, score: 4.
IPR002885   Pentatricopeptide repeat   PROFILE    PS51375       PPR                  coord: 100..134, score: 7.805
                                                                                     coord: 188..222, score: 10.698
                                                                                     coord: 157..187, score: 6.741
                                                                                     coord: 67..97, score: 5
None        No IPR available           unknown    Coil          Coil                 coord: 264..275, scor
None        No IPR available           PANTHER    PTHR24015     FAMILY NOT NAMED     coord: 4..229, score: 3.0

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None