Cp4.1LG08g02150 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG08g02150
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing family protein
LocationCp4.1LG08 : 2956465 .. 2959368 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAAACAGACAAGGGCAAATAAGTAAATTCACAAAACCTTATCGAGAAACTGAAAAGCACAGAGCTTGCGCGCATTCGCAGCGCACTCTCTTCAACCCTAACAGAGACGGCGGTGGAGACGACGACGACGACGACGAGGATGCTTCTCCAAACTTCCGTTCACCACTGCAATGTCTCTCTCTCCTCTTCAACTTCTTATTCGCATAAATTGCCGCCTGTAATTCAATCTCAGGCTTTTTTCCGGAATGTAAATTATCGGAGGCTTCCACTCTCCATAACCTGCTCAGTTTCCCAAACTCATAGCTATGGAACTGTGGATTTCGAGCGAAGGCCTATGGTGAAATGGAACGCTATTTACAGAAGGATATCGCTGATGGAAAATCCTGAGTTAGGCTCTGCGAGTGTACTGAACCAGTGGGAGAACGAGGGCAAAAAGATTACGAAATGGGAGATTTTTCGAGTCATTAAGGAGTTGAGAAAGTACAGGCGCTATGAAAGGGCTTTGGAGGTAAAGTAGATAATGATTTTAGGTAGTTTTTGGATGCACATATTGGAGTTCATGAATTGGTTCAATTTTCTTGCGAATTGCGATGTTCGATTCATCTTTTATTGGAGAAAAAGGAGGTAGAAATTATCATCGATTCAAATAGGTTTCTGCTAACTCTAGTTTAGCTACTTGGATTGCATTCTTATGTTTAATTTGCTTCTTTAGCAACCATGGAAGCTATTGAAGCTTTAATCTTCGGATTTTGAACTGGATTAGTTCATTCTTATGTGAAATTCTACTGTTGTATGAGTTGTTTGTGCAAATCTTAAGTGTACAAGCTTTGATGATCATCTAATTCTTGGTCGAAATCCTCACTCTTGCATGAACTAATACTGATTCGCTTCTTAGTAAACCGAAATATACAAGAAATCAAGCTTTTCTGTTTGTTAATTGGGAGGCATTGATATATTTCATTGAAGAACATTGATGGAGAAATTATACGTTTTCATCATTTAGCTGAACTCAATGCTGGTTTATGCATGTTCTTCATTTTTCTGTTCATACATATGATGTGCATCTGATAATTTGAATACGGTTCATTAAGATGATGAACGATCGTTCTTAAGTTTGCAATTTGTTGCGGTTCGGTGATCGTGTCGGTATGTAGATTCTCGAAGCTGTTTCGATTCTAAAATTTGATGTTTTTGTTTGTAGATATATGATTGGATGAGCAATAGAGGAGAGACATTTAAATTAACAGCTAGTGATGCTGCTATACAATTGGATCTTATTTCAAAAGTCTATGGAATTGCGAGTGCTGAAGATTACTTCGTAAGGTTGCCTAAAAATTTGAAGGATAGAAGAACATATGGAGCTCTTCTGAATGCCTATGTGAAAGCAAGACAGAGAGAAAAGGCAGAATCTCTACTTGCAAAAATGAGAACTAAAGGTTATGCGATTCACACGCTTTCGTTCAATGTAATGATGACTCTCTACGTGAACTTCAAAGAGTACGAGAAAGTCGAGTCGTTAGTGTCGGAAATGATAGAAAAGGGCATTCGCCTTGACATTTACTCGTATAATATATGGATATCATCTCGTGGATTACAAGGATCGATTGAGAAAATGGAAGAAGTGTATGAGCGGATGAAGCAAGACAGGACCATCAATGCTAATTGGACTACATTCAGTACCATGGCCACGATGTATATTAAGACGGGAATGATCGAAAAGGCTGAAGAATGCTTGAGGAGGGTCGAAAGTAGAATTGTTGGTCGGGATCGAATACCGTTTCATTATCTGATGAGTTTGTATGGTAGTGTTGGTAACAAAGAAGAAACTTATCGAGTGTGGAAGGTTTACAAAACTATTTTCCCGACCATTCCAAATTTGGGATACCATGCTATAATCTCTGCTCTAATCAGGGCAGGTGACATTGAAGGTGCCGAAAAAATTTACGAGGAGTGGCTATCCGTTAAGACAGCGTACGACCCTAGAATTGCCAATCTTTTCATGGGGTGGTATGTTAAGGAAGGCATGTCGAGCAAAGCTGAGAGCTTCTTTAATCACATGGTTGAAGCTGGGGGAAAGCCAAATTCGAGTTCATGGGAGATTCTCGCCGATGGGCTTTCGAAAGAGGGTCGGGTTTCTGACGCTTTAGCTAGTTGGAAAGAAGCGTTTTCTGCCGAGGGTTCGAGGACTTGGAGGCCAAAGCACTTTAAGGTGTTGGCTTTCTTCAACCTCTGTGAGAAAGAAGGCGACATCGCCAGCAAGGAGGTTCTTGTTGGATTGTTAAGGCAGTCGAAATGTCTTCAAGACAAAGCATATGTATCACTCATTGGTTTGTCGGATGAGACAATCGACAACGATGTAGTTTCGGGGGAAGGTAGCAATATCGACGACGAAAGTGACAAAACCGTGTACGAGTCTGATGATTCTGAGATGCTTCTCGAGATGTGACAGATTCAATGACATTGTCATCTATCTATGGTTTCTGCGATTTGTAATCAGAAATTTGGAAAATATATTTTTATCTGCATGTCGTCCCGGTACCGGTCCCGGTCCCGGTCTTGGAGGGTTGAGAACGATCATCAGTGGTTCAGTGTTGAAGTGTAGAGGAGGTGAGAGCAACTTGGGATTTTGCTTTGGGGTGTGGGAGTTTGGTTAAACCCAAAGCTATACTCAGCAGAGGTATCTTTATTTAGATCTTTGTATGAATTTCTAAATATAATTTTCCCAGATAGAAATCTTGGCATTTCCTGAATAGGGTGGAAATTTTAGTGCAATTCCCTCATGGGCTTTGTGATTAATAGCTTTTAAAAAGTATCGATTTCCATCTTATAAGTGTTAGATGAACACGACTGAACACGACTCTCCATAATGGTATGATATTGTTCACTTTGAACATAAGCTCTCGT

mRNA sequence

GGAAACAGACAAGGGCAAATAAGTAAATTCACAAAACCTTATCGAGAAACTGAAAAGCACAGAGCTTGCGCGCATTCGCAGCGCACTCTCTTCAACCCTAACAGAGACGGCGGTGGAGACGACGACGACGACGACGAGGATGCTTCTCCAAACTTCCGTTCACCACTGCAATGTCTCTCTCTCCTCTTCAACTTCTTATTCGCATAAATTGCCGCCTGTAATTCAATCTCAGGCTTTTTTCCGGAATGTAAATTATCGGAGGCTTCCACTCTCCATAACCTGCTCAGTTTCCCAAACTCATAGCTATGGAACTGTGGATTTCGAGCGAAGGCCTATGGTGAAATGGAACGCTATTTACAGAAGGATATCGCTGATGGAAAATCCTGAGTTAGGCTCTGCGAGTGTACTGAACCAGTGGGAGAACGAGGGCAAAAAGATTACGAAATGGGAGATTTTTCGAGTCATTAAGGAGTTGAGAAAGTACAGGCGCTATGAAAGGGCTTTGGAGATATATGATTGGATGAGCAATAGAGGAGAGACATTTAAATTAACAGCTAGTGATGCTGCTATACAATTGGATCTTATTTCAAAAGTCTATGGAATTGCGAGTGCTGAAGATTACTTCGTAAGGTTGCCTAAAAATTTGAAGGATAGAAGAACATATGGAGCTCTTCTGAATGCCTATGTGAAAGCAAGACAGAGAGAAAAGGCAGAATCTCTACTTGCAAAAATGAGAACTAAAGGTTATGCGATTCACACGCTTTCGTTCAATGTAATGATGACTCTCTACGTGAACTTCAAAGAGTACGAGAAAGTCGAGTCGTTAGTGTCGGAAATGATAGAAAAGGGCATTCGCCTTGACATTTACTCGTATAATATATGGATATCATCTCGTGGATTACAAGGATCGATTGAGAAAATGGAAGAAGTGTATGAGCGGATGAAGCAAGACAGGACCATCAATGCTAATTGGACTACATTCAGTACCATGGCCACGATGTATATTAAGACGGGAATGATCGAAAAGGCTGAAGAATGCTTGAGGAGGGTCGAAAGTAGAATTGTTGGTCGGGATCGAATACCGTTTCATTATCTGATGAGTTTGTATGGTAGTGTTGGTAACAAAGAAGAAACTTATCGAGTGTGGAAGGTTTACAAAACTATTTTCCCGACCATTCCAAATTTGGGATACCATGCTATAATCTCTGCTCTAATCAGGGCAGGTGACATTGAAGGTGCCGAAAAAATTTACGAGGAGTGGCTATCCGTTAAGACAGCGTACGACCCTAGAATTGCCAATCTTTTCATGGGGTGGTATGTTAAGGAAGGCATGTCGAGCAAAGCTGAGAGCTTCTTTAATCACATGGTTGAAGCTGGGGGAAAGCCAAATTCGAGTTCATGGGAGATTCTCGCCGATGGGCTTTCGAAAGAGGGTCGGGTTTCTGACGCTTTAGCTAGTTGGAAAGAAGCGTTTTCTGCCGAGGGTTCGAGGACTTGGAGGCCAAAGCACTTTAAGGTGTTGGCTTTCTTCAACCTCTGTGAGAAAGAAGGCGACATCGCCAGCAAGGAGGTTCTTGTTGGATTGTTAAGGCAGTCGAAATGTCTTCAAGACAAAGCATATGTATCACTCATTGGTTTGTCGGATGAGACAATCGACAACGATGTAGTTTCGGGGGAAGGTAGCAATATCGACGACGAAAGTGACAAAACCGTGTACGAGTCTGATGATTCTGAGATGCTTCTCGAGATGTGACAGATTCAATGACATTGTCATCTATCTATGGTTTCTGCGATTTGTAATCAGAAATTTGGAAAATATATTTTTATCTGCATGTCGTCCCGGTACCGGTCCCGGTCCCGGTCTTGGAGGGTTGAGAACGATCATCAGTGGTTCAGTGTTGAAGTGTAGAGGAGGTGAGAGCAACTTGGGATTTTGCTTTGGGGTGTGGGAGTTTGGTTAAACCCAAAGCTATACTCAGCAGAGGTATCTTTATTTAGATCTTTGTATGAATTTCTAAATATAATTTTCCCAGATAGAAATCTTGGCATTTCCTGAATAGGGTGGAAATTTTAGTGCAATTCCCTCATGGGCTTTGTGATTAATAGCTTTTAAAAAGTATCGATTTCCATCTTATAAGTGTTAGATGAACACGACTGAACACGACTCTCCATAATGGTATGATATTGTTCACTTTGAACATAAGCTCTCGT

Coding sequence (CDS)

ATGCTTCTCCAAACTTCCGTTCACCACTGCAATGTCTCTCTCTCCTCTTCAACTTCTTATTCGCATAAATTGCCGCCTGTAATTCAATCTCAGGCTTTTTTCCGGAATGTAAATTATCGGAGGCTTCCACTCTCCATAACCTGCTCAGTTTCCCAAACTCATAGCTATGGAACTGTGGATTTCGAGCGAAGGCCTATGGTGAAATGGAACGCTATTTACAGAAGGATATCGCTGATGGAAAATCCTGAGTTAGGCTCTGCGAGTGTACTGAACCAGTGGGAGAACGAGGGCAAAAAGATTACGAAATGGGAGATTTTTCGAGTCATTAAGGAGTTGAGAAAGTACAGGCGCTATGAAAGGGCTTTGGAGATATATGATTGGATGAGCAATAGAGGAGAGACATTTAAATTAACAGCTAGTGATGCTGCTATACAATTGGATCTTATTTCAAAAGTCTATGGAATTGCGAGTGCTGAAGATTACTTCGTAAGGTTGCCTAAAAATTTGAAGGATAGAAGAACATATGGAGCTCTTCTGAATGCCTATGTGAAAGCAAGACAGAGAGAAAAGGCAGAATCTCTACTTGCAAAAATGAGAACTAAAGGTTATGCGATTCACACGCTTTCGTTCAATGTAATGATGACTCTCTACGTGAACTTCAAAGAGTACGAGAAAGTCGAGTCGTTAGTGTCGGAAATGATAGAAAAGGGCATTCGCCTTGACATTTACTCGTATAATATATGGATATCATCTCGTGGATTACAAGGATCGATTGAGAAAATGGAAGAAGTGTATGAGCGGATGAAGCAAGACAGGACCATCAATGCTAATTGGACTACATTCAGTACCATGGCCACGATGTATATTAAGACGGGAATGATCGAAAAGGCTGAAGAATGCTTGAGGAGGGTCGAAAGTAGAATTGTTGGTCGGGATCGAATACCGTTTCATTATCTGATGAGTTTGTATGGTAGTGTTGGTAACAAAGAAGAAACTTATCGAGTGTGGAAGGTTTACAAAACTATTTTCCCGACCATTCCAAATTTGGGATACCATGCTATAATCTCTGCTCTAATCAGGGCAGGTGACATTGAAGGTGCCGAAAAAATTTACGAGGAGTGGCTATCCGTTAAGACAGCGTACGACCCTAGAATTGCCAATCTTTTCATGGGGTGGTATGTTAAGGAAGGCATGTCGAGCAAAGCTGAGAGCTTCTTTAATCACATGGTTGAAGCTGGGGGAAAGCCAAATTCGAGTTCATGGGAGATTCTCGCCGATGGGCTTTCGAAAGAGGGTCGGGTTTCTGACGCTTTAGCTAGTTGGAAAGAAGCGTTTTCTGCCGAGGGTTCGAGGACTTGGAGGCCAAAGCACTTTAAGGTGTTGGCTTTCTTCAACCTCTGTGAGAAAGAAGGCGACATCGCCAGCAAGGAGGTTCTTGTTGGATTGTTAAGGCAGTCGAAATGTCTTCAAGACAAAGCATATGTATCACTCATTGGTTTGTCGGATGAGACAATCGACAACGATGTAGTTTCGGGGGAAGGTAGCAATATCGACGACGAAAGTGACAAAACCGTGTACGAGTCTGATGATTCTGAGATGCTTCTCGAGATGTGA

Protein sequence

MLLQTSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGTVDFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYERALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALLNAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRLDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEECLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIRAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSSWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVLVGLLRQSKCLQDKAYVSLIGLSDETIDNDVVSGEGSNIDDESDKTVYESDDSEMLLEM
BLAST of Cp4.1LG08g02150 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 647.1 bits (1668), Expect = 1.7e-184
Identity = 317/514 (61.67%), Postives = 404/514 (78.60%), Query Frame = 1

Query: 1   MLLQTSVHHCNVSLSSSTSYSHKL---PPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYG 60
           MLLQ +V + NV L+SS SYS  L    PV+   A  +         +I CS+SQ + YG
Sbjct: 1   MLLQAAVQNRNVPLASSASYSRLLRCRSPVVSVAALSKKT------AAIVCSISQVYGYG 60

Query: 61  TVDFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRR 120
           TVD+ERRP+V+WNAIY++ISLME PELG+ASVLNQWE  G+K+TKWE+ RV+KELRKY+R
Sbjct: 61  TVDYERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKR 120

Query: 121 YERALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGA 180
             +ALE+YDWM+NRGE F+L+ASDAAIQLDLI KV GI  AE++F++LP+N KDRR YG+
Sbjct: 121 ANQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGS 180

Query: 181 LLNAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKG 240
           LLNAYV+A+ REKAE+LL  MR KGYA+H L FNVMMTLY+N +EY+KV+++V EM +K 
Sbjct: 181 LLNAYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKD 240

Query: 241 IRLDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKA 300
           IRLDIYSYNIW+SS G  GS+EKME VY++MK D +I  NWTTFSTMATMYIK G  EKA
Sbjct: 241 IRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKA 300

Query: 301 EECLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISA 360
           E+ LR+VE+RI GR+RIP+HYL+SLYGS+GNK+E YRVW VYK++ P+IPNLGYHA++S+
Sbjct: 301 EDALRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSS 360

Query: 361 LIRAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPN 420
           L+R GDIEGAEK+YEEWL VK++YDPRI NL M  YVK      AE  F+HMVE GGKP+
Sbjct: 361 LVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPS 420

Query: 421 SSSWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKE 480
           SS+WEILA G +++  +S+AL   + AFSAEGS  WRPK   +  FF LCE+E D+ SKE
Sbjct: 421 SSTWEILAVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKE 480

Query: 481 VLVGLLRQSKCLQDKAYVSLIGLSD-ETIDNDVV 511
            ++ LLRQS  L+DK+Y++LI + +  T++N  +
Sbjct: 481 AVLELLRQSGDLEDKSYLALIDVDENRTVNNSEI 508

BLAST of Cp4.1LG08g02150 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 3.0e-69
Identity = 149/440 (33.86%), Postives = 235/440 (53.41%), Query Frame = 1

Query: 50  VSQTHSYGTVDFERRPMVKW--NAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFR 109
           ++  H   + D  +R   K+    +Y R+      E+     LNQ+    K + KWE+  
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 110 VIKELRKYRRYERALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPK 169
            IK+LR    Y  AL++ + M  RG     T SD AI LDL++K   I + E+YFV LP+
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERG--MNKTVSDQAIHLDLVAKAREITAGENYFVDLPE 120

Query: 170 NLKDRRTYGALLNAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVE 229
             K   TYG+LLN Y K    EKAE LL KM+       ++S+N +MTLY    E EKV 
Sbjct: 121 TSKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVP 180

Query: 230 SLVSEMIEKGIRLDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATM 289
           +++ E+  + +  D Y+YN+W+ +      I  +E V E M +D  +  +WTT+S MA++
Sbjct: 181 AMIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASI 240

Query: 290 YIKTGMIEKAEECLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIP 349
           Y+  G+ +KAE+ L+ +E +   RD   + +L++LYG +G   E YR+W+  +   P   
Sbjct: 241 YVDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTS 300

Query: 350 NLGYHAIISALIRAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFN 409
           N+ Y  +I  L++  D+ GAE +++EW +  + YD RI N+ +G Y +EG+  KA     
Sbjct: 301 NVAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKE 360

Query: 410 HMVEAGGKPNSSSWEILADGLSKEGRVSDALASWKEAFS---AEGSRTWRPKHFKVLAFF 469
                GGK N+ +WEI  D   K G ++ AL    +A S    +G + W P    V A  
Sbjct: 361 KAPRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGK-WLPSPETVRALM 420

Query: 470 NLCEKEGDIASKEVLVGLLR 485
           +  E++ D+   E L+ +L+
Sbjct: 421 SYFEQKKDVNGAENLLEILK 437

BLAST of Cp4.1LG08g02150 vs. Swiss-Prot
Match: PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 262.3 bits (669), Expect = 1.2e-68
Identity = 145/422 (34.36%), Postives = 234/422 (55.45%), Query Frame = 1

Query: 65  PMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYERALEI 124
           P+  ++ + RR++   +P      VL+ W ++G  +   E+  +IK LRK+ R+  AL+I
Sbjct: 33  PLDPYDTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQI 92

Query: 125 YDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALLNAYVK 184
            DWMS      +++  D AI+LDLI+KV G+  AE +F  +P   ++   YGALLN Y  
Sbjct: 93  SDWMSEH-RVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYAS 152

Query: 185 ARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRLDIYS 244
            +   KAE +  +M+  G+    L +NVM+ LYV   +Y  VE L+ EM ++ ++ DI++
Sbjct: 153 KKVLHKAEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFT 212

Query: 245 YNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEECLRRV 304
            N  + +  +   +E ME+   R + D+ ++ +W T++  A  YIK G+ EKA E LR+ 
Sbjct: 213 VNTRLHAYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKS 272

Query: 305 ESRIVGRDR-IPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIRAGD 364
           E  +  + R   +  LMS YG+ G KEE YR+W +YK +     N GY ++ISAL++  D
Sbjct: 273 EQMVNAQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKEL-DGFYNTGYISVISALLKMDD 332

Query: 365 IEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSSWEI 424
           IE  EKI EEW +  + +D RI +L +  Y K+GM  KAE   N +V+     ++S+WE 
Sbjct: 333 IEEVEKIMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWER 392

Query: 425 LADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVLVGLL 484
           LA G    G++  A+  WK A        WRP    +++  +  E + D+     ++ LL
Sbjct: 393 LALGYKMAGKMEKAVEKWKRAIEV-SKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL 451

Query: 485 RQ 486
            +
Sbjct: 453 SE 451

BLAST of Cp4.1LG08g02150 vs. Swiss-Prot
Match: PPR4_ARATH (Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidopsis thaliana GN=At1g02370 PE=2 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 8.6e-64
Identity = 147/430 (34.19%), Postives = 229/430 (53.26%), Query Frame = 1

Query: 72  IYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYERALEIYDWMSNR 131
           +Y+++S++       A  LNQ+  EG  + K ++FR  K LRK+RR + A EI+DWM  R
Sbjct: 73  LYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWMEKR 132

Query: 132 GETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRR-TYGALLNAYVKARQREK 191
             TF +  SD AI LDLI K  G+ +AE+YF  L  + K+ + TYGAL+N Y    + EK
Sbjct: 133 KMTFSV--SDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEEK 192

Query: 192 AESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRLDIYSYNIWIS 251
           A++    M    +  ++L FN MM++Y+   + EKV  LV  M ++GI     +Y+IW+ 
Sbjct: 193 AKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWMQ 252

Query: 252 SRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEECLRRVESRIVG 311
           S G    ++ +E++ + M +D      W TFS +A +Y K G+ EKA+  L+ +E ++  
Sbjct: 253 SCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKMNP 312

Query: 312 RDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIRAGDIEGAEKI 371
            +R   H+LMSLY  +    E YRVW+  K   P + NL Y  ++ A+ + GD++G +KI
Sbjct: 313 NNRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDGIKKI 372

Query: 372 YEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSSWEILADGLSK 431
           + EW S   AYD R+AN+ +  Y+K  M  +AE   +  ++    P S + ++L   L +
Sbjct: 373 FTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMIHLLE 432

Query: 432 EGRVSDALASWKEAF--SAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVLVGLLRQSKC 491
             +   A+   + A   SAE    W      V  FF   EK  D+   E    +L   K 
Sbjct: 433 NDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFCKILSNWKP 492

Query: 492 LQDKAYVSLI 499
           L  +    LI
Sbjct: 493 LDSETMTFLI 500

BLAST of Cp4.1LG08g02150 vs. Swiss-Prot
Match: PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 245.0 bits (624), Expect = 1.9e-63
Identity = 135/433 (31.18%), Postives = 230/433 (53.12%), Query Frame = 1

Query: 67  VKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYERALEIYD 126
           VK   +Y +IS + +P+      L  W   GKK++  E+ R++ +LR+ +R+  ALE+  
Sbjct: 22  VKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRKRFLHALEVSK 81

Query: 127 WMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALLNAYVKAR 186
           WM+  G     + ++ A+ LDLI +VYG  +AE+YF  L +  K+ +TYGALLN YV+ +
Sbjct: 82  WMNETGVCV-FSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGALLNCYVRQQ 141

Query: 187 QREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRLDIYSYN 246
             EK+     KM+  G+   +L++N +M LY N  ++EKV  ++ EM E+ +  D YSY 
Sbjct: 142 NVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEENVAPDNYSYR 201

Query: 247 IWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEECLRRVES 306
           I I++ G    +E++      M++ + I  +W T++  A  YI  G  ++A E L+  E+
Sbjct: 202 ICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRAVELLKMSEN 261

Query: 307 RIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIRAGDIEG 366
           R+  +D   +++L++LY  +G K E  R+W + K +     N  Y  ++ +L++   +  
Sbjct: 262 RLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQSLVKIDALVE 321

Query: 367 AEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSSWEILAD 426
           AE++  EW S    YD R+ N  +  Y+ + M  KAE+    +   G      SWE++A 
Sbjct: 322 AEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATTPESWELVAT 381

Query: 427 GLSKEGRVSDALASWKEAFSAE-GSRTWRPKHFKVLAFFNLCEKEGDIASKEVLVGLLRQ 486
             +++G + +A    K A   E GSR WRP    V +  +    EG +   E  V  LR 
Sbjct: 382 AYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKEVESFVASLRN 441

Query: 487 SKCLQDKAYVSLI 499
              +  + Y +L+
Sbjct: 442 CIGVNKQMYHALV 453

BLAST of Cp4.1LG08g02150 vs. TrEMBL
Match: A0A0A0L6I4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G127020 PE=4 SV=1)

HSP 1 Score: 887.9 bits (2293), Expect = 6.2e-255
Identity = 439/537 (81.75%), Postives = 480/537 (89.39%), Query Frame = 1

Query: 1   MLLQTSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGTVD 60
           MLLQTSVHH  VSL S  SYS K  P+IQSQ FF NVNYRRLPLS+TCS+SQ HSYGTVD
Sbjct: 1   MLLQTSVHHRTVSLPSPISYSRKFLPLIQSQPFFHNVNYRRLPLSVTCSISQVHSYGTVD 60

Query: 61  FERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYER 120
           FERRPM KWNAIYRRISLMENPELGSASVLNQWENEGK ITKWE+ RV+KELRKY+R+ER
Sbjct: 61  FERRPMFKWNAIYRRISLMENPELGSASVLNQWENEGKNITKWELSRVVKELRKYKRFER 120

Query: 121 ALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALLN 180
           ALEIYDWMSNR E F+LT SDAAIQLDLISKV GI SAE+YF+RLP +LKDRR YGALLN
Sbjct: 121 ALEIYDWMSNREERFRLTTSDAAIQLDLISKVRGIKSAEEYFLRLPNHLKDRRIYGALLN 180

Query: 181 AYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRL 240
           AY K RQREKAE+LL KMRTKG+  H L FNVMMTLY+N KEYEKVESLVSEM E  I+L
Sbjct: 181 AYAKGRQREKAENLLEKMRTKGFTTHPLPFNVMMTLYMNVKEYEKVESLVSEMTENSIQL 240

Query: 241 DIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEEC 300
           DIYSYNIW+SS GLQGS EKMEEVYE+MKQDRTINANWTTFSTMATMYIK G++EKAEEC
Sbjct: 241 DIYSYNIWLSSCGLQGSTEKMEEVYEQMKQDRTINANWTTFSTMATMYIKMGLMEKAEEC 300

Query: 301 LRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIR 360
           LRRVESRIVGRDRIP+HYL+SLYGSVGNKEE YRVW +YK +FPTIPNLGYHAIISALIR
Sbjct: 301 LRRVESRIVGRDRIPYHYLISLYGSVGNKEEMYRVWNIYKNVFPTIPNLGYHAIISALIR 360

Query: 361 AGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSS 420
            GD+EGAEKIYEEWL+VK+ YDPRIANLF+GWYVKEG +SKAESFF+HMVE GGKPNSS+
Sbjct: 361 VGDVEGAEKIYEEWLTVKSTYDPRIANLFIGWYVKEGNTSKAESFFDHMVEVGGKPNSST 420

Query: 421 WEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVLV 480
           WEIL D  +KEGRVSDALASWKEAFSAEGS++WRPK + VLA+F+LCEKEGDIASKEVLV
Sbjct: 421 WEILVDRHTKEGRVSDALASWKEAFSAEGSKSWRPKPYNVLAYFDLCEKEGDIASKEVLV 480

Query: 481 GLLRQSKCLQDKAYVSLIGLSDETIDNDVVSGEGSNIDDESDKTVYESDDSEMLLEM 538
           GLLRQ K LQDK Y SLIGL DETIDN+ VS +GSNI+DE DKT YESDDSEM L++
Sbjct: 481 GLLRQPKYLQDKTYASLIGLLDETIDNNEVSEKGSNINDEIDKTEYESDDSEMFLKL 537

BLAST of Cp4.1LG08g02150 vs. TrEMBL
Match: A0A061DUM9_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_005749 PE=4 SV=1)

HSP 1 Score: 740.7 bits (1911), Expect = 1.2e-210
Identity = 361/534 (67.60%), Postives = 447/534 (83.71%), Query Frame = 1

Query: 1   MLLQ-TSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGTV 60
           MLLQ +S+ +  VSLSS++SYS +LP  +      +  +Y++LP  +TCS+SQ HSYGTV
Sbjct: 1   MLLQPSSLLNHRVSLSSTSSYSRQLPCQVPQLILSQTQSYQKLP--VTCSISQIHSYGTV 60

Query: 61  DFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYE 120
           D+ERRPM+KWNAIY++ISLMENPELGSASVLN+WE  G+K+TKWE+ RV+KELRKY+RY+
Sbjct: 61  DYERRPMIKWNAIYKKISLMENPELGSASVLNEWEKGGRKLTKWELCRVVKELRKYKRYK 120

Query: 121 RALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALL 180
           +ALE+YDWM+NRGE F+L+ASDAAIQLDLI+KV G++SAED+FV+LP  +KD+R YGALL
Sbjct: 121 QALEVYDWMNNRGERFRLSASDAAIQLDLIAKVRGVSSAEDFFVQLPDTMKDKRIYGALL 180

Query: 181 NAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIR 240
           NAYV+A+ R+KAE+L+  MR KGYA+H L FNVMMTLY+N KEY+KVES+VSEM+EK IR
Sbjct: 181 NAYVRAKMRDKAETLIDNMRGKGYAMHPLPFNVMMTLYMNLKEYDKVESMVSEMMEKNIR 240

Query: 241 LDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEE 300
           LDIYSYNIW+SS G QGS+EKMEEVYE+MKQD++IN NWTTFSTMATMYIK G+ EKAEE
Sbjct: 241 LDIYSYNIWLSSCGSQGSVEKMEEVYEQMKQDQSINPNWTTFSTMATMYIKMGLTEKAEE 300

Query: 301 CLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALI 360
           CLR VESRI GRDRIP+HYL+SLYG VGN+EE YRVWKVYK+IFP+IPNLG+HA+IS+L+
Sbjct: 301 CLRNVESRITGRDRIPYHYLISLYGGVGNREEVYRVWKVYKSIFPSIPNLGFHAVISSLV 360

Query: 361 RAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSS 420
           RAGDI+GAE+IYEEWL+VKT+YDPRIANL MGWYVKEG   KAES F+H+ E GGKPNSS
Sbjct: 361 RAGDIQGAERIYEEWLTVKTSYDPRIANLLMGWYVKEGNLDKAESLFSHIAEVGGKPNSS 420

Query: 421 SWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVL 480
           SWEILA+G   E R+ DAL+  K+AF+ EGSR WRPK   V AFFNLCE++ D+AS+EV 
Sbjct: 421 SWEILAEGHILEKRIPDALSCLKDAFATEGSRGWRPKPTSVSAFFNLCEEKVDMASREVF 480

Query: 481 VGLLRQSKCLQDKAYVSLIGLSDETIDNDVVSGEGSNIDDESDKTVYESDDSEM 534
           VGLLRQS CL+++AY SLIGLS+E +    +  + +     S     + D SE+
Sbjct: 481 VGLLRQSGCLKNEAYASLIGLSEEALSESELPRDKNRKSSYSSSDENQDDGSEV 532

BLAST of Cp4.1LG08g02150 vs. TrEMBL
Match: A0A0B0NQQ4_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_17735 PE=4 SV=1)

HSP 1 Score: 726.9 bits (1875), Expect = 1.8e-206
Identity = 360/536 (67.16%), Postives = 448/536 (83.58%), Query Frame = 1

Query: 1   MLLQ-TSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGTV 60
           MLLQ +S+ +  VS+SS+ SYS +LP  I      R  ++++LP  ITCS+SQ HSYGTV
Sbjct: 1   MLLQPSSLLNHRVSISSADSYSRQLPCRIPKFILSRTPSFQKLP--ITCSISQVHSYGTV 60

Query: 61  DFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYE 120
           DFERRPMVKWNA+Y++ISLMENPELGSASVLN+WE  G+K+TKWE+ RV+KELRKY+R++
Sbjct: 61  DFERRPMVKWNALYKKISLMENPELGSASVLNEWEKGGRKLTKWELCRVVKELRKYKRFK 120

Query: 121 RALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALL 180
           +ALE+Y+WM+NRGE F+ +ASDAAIQLDLISKV G++SAED+F++LP  LKD+R YGALL
Sbjct: 121 QALEVYEWMNNRGERFRFSASDAAIQLDLISKVRGVSSAEDFFLQLPDTLKDKRIYGALL 180

Query: 181 NAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIR 240
           NAYV+AR +EKAESL+  MR KGYA+H L FNVMMTLY+N KEY+KVES++SEM+EK +R
Sbjct: 181 NAYVRARMQEKAESLIDNMRNKGYALHPLPFNVMMTLYMNLKEYDKVESMISEMMEKNVR 240

Query: 241 LDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEE 300
           LDIYSYNIW+SS G QG +E+ME+VYE+MK+DR+IN NWTTFSTMATMYIK G+ EKAEE
Sbjct: 241 LDIYSYNIWLSSCGSQGYVERMEQVYEQMKEDRSINPNWTTFSTMATMYIKMGLSEKAEE 300

Query: 301 CLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALI 360
           CLR VESRI GRDRIP+HYL++LYG+VGNKEE YR+WKVYK+IFP+IPNLGYHA+IS+L+
Sbjct: 301 CLRNVESRITGRDRIPYHYLITLYGTVGNKEEVYRIWKVYKSIFPSIPNLGYHAMISSLV 360

Query: 361 RAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSS 420
           RA DIEGAEKIYEEWLSVKT+YDPRIANL MG YVKEG   KA+SFFNH+ + GGKPNSS
Sbjct: 361 RASDIEGAEKIYEEWLSVKTSYDPRIANLLMGLYVKEGNLDKAQSFFNHIADVGGKPNSS 420

Query: 421 SWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVL 480
           SWEILA+G  +E R+ +AL+  KEAF+ EGSR+WRPK   V AFFNLC+++ D  ++EV+
Sbjct: 421 SWEILAEGNIQEERIDEALSCLKEAFATEGSRSWRPKPTNVSAFFNLCDEKEDTETREVV 480

Query: 481 VGLLRQSKCLQDKAYVSLIGLSDETIDNDVVSGEGSNIDDESDKTVYESDDSEMLL 536
           VGLLRQS  L+++AY S IGLSD  +++  V    S+ D+  D      DDSE+LL
Sbjct: 481 VGLLRQSGYLKNEAYASQIGLSDGAVES--VLPTYSSGDENQD------DDSEVLL 526

BLAST of Cp4.1LG08g02150 vs. TrEMBL
Match: M5XM86_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003822mg PE=4 SV=1)

HSP 1 Score: 726.5 bits (1874), Expect = 2.4e-206
Identity = 352/539 (65.31%), Postives = 448/539 (83.12%), Query Frame = 1

Query: 1   MLLQTSVHHC--NVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGT 60
           M LQTSVHH   N+ LSSS SYS  LP  I +     ++N++RLP SI+CS+SQ H+YGT
Sbjct: 1   MPLQTSVHHHHHNLPLSSSLSYSTLLPCKIPTLPLPSSINFQRLP-SISCSISQVHNYGT 60

Query: 61  VDFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRY 120
           VD+ERRPMVKWNAIYR+ISL ++PE+ SA VLNQWE EG+K+TKWE+ RV+KELRKY+RY
Sbjct: 61  VDYERRPMVKWNAIYRKISLTDDPEVRSADVLNQWEKEGRKLTKWELCRVVKELRKYKRY 120

Query: 121 ERALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGAL 180
           +RALE+YDWMSNRGE F+++ SDAAIQLDL++KV G+ASAE+YF+ LP  LKDRR YGAL
Sbjct: 121 DRALEVYDWMSNRGERFRISTSDAAIQLDLVAKVRGVASAENYFLSLPDTLKDRRIYGAL 180

Query: 181 LNAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGI 240
           LNAYV+ R +EKAESLL KMR+KG+A+ +L FNVMMTLY+N KEY+KV+S++SEM+EK I
Sbjct: 181 LNAYVRTRMKEKAESLLDKMRSKGHALQSLPFNVMMTLYMNLKEYDKVDSIISEMMEKNI 240

Query: 241 RLDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAE 300
           +LDIYSYNIW+SSRG QGS E+ME+V+E+MK DRT+N NWTTFSTMATMYIK G +EKAE
Sbjct: 241 QLDIYSYNIWLSSRGSQGSEERMEQVFEQMKLDRTVNPNWTTFSTMATMYIKMGQLEKAE 300

Query: 301 ECLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISAL 360
            CL++VESRI GRDRIP+HYL+SLYG+VGNKEE YRVW +YK+ FP+IPNLGYHAI+S+L
Sbjct: 301 ACLKKVESRITGRDRIPYHYLLSLYGNVGNKEELYRVWNIYKSSFPSIPNLGYHAIMSSL 360

Query: 361 IRAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNS 420
           +R GD+EGAEKIYEEWL+VK+ YDPRIAN+F+ +Y+K+G   KA+SF++HMV+ GGKPNS
Sbjct: 361 LRVGDVEGAEKIYEEWLTVKSTYDPRIANVFIAYYIKDGDFEKAQSFYDHMVDVGGKPNS 420

Query: 421 SSWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEV 480
           ++WE LA+G  +E R+S+AL+ WKEAFSAEGS++WRPK   V AF  LCE+E +  SKE 
Sbjct: 421 TTWETLAEGHIEEQRISEALSCWKEAFSAEGSKSWRPKPVNVSAFLELCEQEANSVSKEF 480

Query: 481 LVGLLRQSKCLQDKAYVSLIGLSDETIDNDVVS--GEGSNIDDESDKTVYESDDSEMLL 536
            +GLL+QS  L++K+Y SLIGL+DE + +D +S   + +NI  + D      D SE+LL
Sbjct: 481 FMGLLKQSGQLKNKSYASLIGLADEDVSDDDLSLKKDRTNITKDDDDEKEAGDGSELLL 538

BLAST of Cp4.1LG08g02150 vs. TrEMBL
Match: A0A0D2MUP8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G069400 PE=4 SV=1)

HSP 1 Score: 722.2 bits (1863), Expect = 4.5e-205
Identity = 358/536 (66.79%), Postives = 446/536 (83.21%), Query Frame = 1

Query: 1   MLLQ-TSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGTV 60
           MLLQ +S+ +   S+SS+ SYS +LP  I      R  ++++LP  + CS+SQ HSYGTV
Sbjct: 1   MLLQPSSLLNHRFSISSADSYSRQLPCRIPKFILSRTPSFQKLP--VMCSISQVHSYGTV 60

Query: 61  DFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYE 120
           DFERRPMVKWNA+Y++ISLMENPELGSASVLN+WE  G+K+TKWE+ RV+KELRKY+R++
Sbjct: 61  DFERRPMVKWNALYKKISLMENPELGSASVLNEWEKGGRKLTKWELCRVVKELRKYKRFK 120

Query: 121 RALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALL 180
           +ALE+Y+WM+NRGE F+ +ASDAAIQLDLISKV G++SAED+F++L   LKD+R YGALL
Sbjct: 121 QALEVYEWMNNRGERFRFSASDAAIQLDLISKVRGVSSAEDFFLQLSDTLKDKRIYGALL 180

Query: 181 NAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIR 240
           NAYV+AR +EKAESL+  MR KGYA+H L FNVMMTLY+N KEY+KVES++SEM+EK IR
Sbjct: 181 NAYVRARMQEKAESLIDNMRNKGYALHPLPFNVMMTLYMNLKEYDKVESMISEMMEKNIR 240

Query: 241 LDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEE 300
           LDIYSYNIW+SS G QGS+E+ME+VYE+MK+DR+IN NWTTFSTMATMYIK G+ EKAEE
Sbjct: 241 LDIYSYNIWLSSCGSQGSVERMEQVYEQMKEDRSINPNWTTFSTMATMYIKMGLSEKAEE 300

Query: 301 CLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALI 360
           CLR VESRI GRDRIP+HYL++LYG+VGNKEE YR+WKVYK+IFP+IPNLGYHA+IS+L+
Sbjct: 301 CLRNVESRITGRDRIPYHYLITLYGTVGNKEEVYRIWKVYKSIFPSIPNLGYHAMISSLV 360

Query: 361 RAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSS 420
           RA DIEGAEKIYEEWLSVKT+YDPRIANL MG YVKEG   KA+SFFNH+ + GGKPNSS
Sbjct: 361 RASDIEGAEKIYEEWLSVKTSYDPRIANLLMGLYVKEGNLGKAQSFFNHIADVGGKPNSS 420

Query: 421 SWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVL 480
           SWEILA+G  +E R+ +AL+  KEAF+ EGSR+WRPK   V AFFNLC+++ D  S+EV+
Sbjct: 421 SWEILAEGNIQEERIDEALSCLKEAFATEGSRSWRPKPTNVSAFFNLCDEKEDTESREVV 480

Query: 481 VGLLRQSKCLQDKAYVSLIGLSDETIDNDVVSGEGSNIDDESDKTVYESDDSEMLL 536
           VGLL+QS  L+++AY S IGLSD  +++  V    S+ D+  D      DDSE+LL
Sbjct: 481 VGLLQQSGYLKNEAYASQIGLSDSAVES--VLPTYSSGDENQD------DDSEVLL 526

BLAST of Cp4.1LG08g02150 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 647.1 bits (1668), Expect = 9.4e-186
Identity = 317/514 (61.67%), Postives = 404/514 (78.60%), Query Frame = 1

Query: 1   MLLQTSVHHCNVSLSSSTSYSHKL---PPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYG 60
           MLLQ +V + NV L+SS SYS  L    PV+   A  +         +I CS+SQ + YG
Sbjct: 1   MLLQAAVQNRNVPLASSASYSRLLRCRSPVVSVAALSKKT------AAIVCSISQVYGYG 60

Query: 61  TVDFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRR 120
           TVD+ERRP+V+WNAIY++ISLME PELG+ASVLNQWE  G+K+TKWE+ RV+KELRKY+R
Sbjct: 61  TVDYERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKR 120

Query: 121 YERALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGA 180
             +ALE+YDWM+NRGE F+L+ASDAAIQLDLI KV GI  AE++F++LP+N KDRR YG+
Sbjct: 121 ANQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGS 180

Query: 181 LLNAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKG 240
           LLNAYV+A+ REKAE+LL  MR KGYA+H L FNVMMTLY+N +EY+KV+++V EM +K 
Sbjct: 181 LLNAYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKD 240

Query: 241 IRLDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKA 300
           IRLDIYSYNIW+SS G  GS+EKME VY++MK D +I  NWTTFSTMATMYIK G  EKA
Sbjct: 241 IRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKA 300

Query: 301 EECLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISA 360
           E+ LR+VE+RI GR+RIP+HYL+SLYGS+GNK+E YRVW VYK++ P+IPNLGYHA++S+
Sbjct: 301 EDALRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSS 360

Query: 361 LIRAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPN 420
           L+R GDIEGAEK+YEEWL VK++YDPRI NL M  YVK      AE  F+HMVE GGKP+
Sbjct: 361 LVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPS 420

Query: 421 SSSWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKE 480
           SS+WEILA G +++  +S+AL   + AFSAEGS  WRPK   +  FF LCE+E D+ SKE
Sbjct: 421 SSTWEILAVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKE 480

Query: 481 VLVGLLRQSKCLQDKAYVSLIGLSD-ETIDNDVV 511
            ++ LLRQS  L+DK+Y++LI + +  T++N  +
Sbjct: 481 AVLELLRQSGDLEDKSYLALIDVDENRTVNNSEI 508

BLAST of Cp4.1LG08g02150 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 264.2 bits (674), Expect = 1.7e-70
Identity = 149/440 (33.86%), Postives = 235/440 (53.41%), Query Frame = 1

Query: 50  VSQTHSYGTVDFERRPMVKW--NAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFR 109
           ++  H   + D  +R   K+    +Y R+      E+     LNQ+    K + KWE+  
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 110 VIKELRKYRRYERALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPK 169
            IK+LR    Y  AL++ + M  RG     T SD AI LDL++K   I + E+YFV LP+
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERG--MNKTVSDQAIHLDLVAKAREITAGENYFVDLPE 120

Query: 170 NLKDRRTYGALLNAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVE 229
             K   TYG+LLN Y K    EKAE LL KM+       ++S+N +MTLY    E EKV 
Sbjct: 121 TSKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVP 180

Query: 230 SLVSEMIEKGIRLDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATM 289
           +++ E+  + +  D Y+YN+W+ +      I  +E V E M +D  +  +WTT+S MA++
Sbjct: 181 AMIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASI 240

Query: 290 YIKTGMIEKAEECLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIP 349
           Y+  G+ +KAE+ L+ +E +   RD   + +L++LYG +G   E YR+W+  +   P   
Sbjct: 241 YVDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTS 300

Query: 350 NLGYHAIISALIRAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFN 409
           N+ Y  +I  L++  D+ GAE +++EW +  + YD RI N+ +G Y +EG+  KA     
Sbjct: 301 NVAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKE 360

Query: 410 HMVEAGGKPNSSSWEILADGLSKEGRVSDALASWKEAFS---AEGSRTWRPKHFKVLAFF 469
                GGK N+ +WEI  D   K G ++ AL    +A S    +G + W P    V A  
Sbjct: 361 KAPRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGK-WLPSPETVRALM 420

Query: 470 NLCEKEGDIASKEVLVGLLR 485
           +  E++ D+   E L+ +L+
Sbjct: 421 SYFEQKKDVNGAENLLEILK 437

BLAST of Cp4.1LG08g02150 vs. TAIR10
Match: AT2G20710.1 (AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 262.3 bits (669), Expect = 6.5e-70
Identity = 145/422 (34.36%), Postives = 234/422 (55.45%), Query Frame = 1

Query: 65  PMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYERALEI 124
           P+  ++ + RR++   +P      VL+ W ++G  +   E+  +IK LRK+ R+  AL+I
Sbjct: 33  PLDPYDTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQI 92

Query: 125 YDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALLNAYVK 184
            DWMS      +++  D AI+LDLI+KV G+  AE +F  +P   ++   YGALLN Y  
Sbjct: 93  SDWMSEH-RVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYAS 152

Query: 185 ARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRLDIYS 244
            +   KAE +  +M+  G+    L +NVM+ LYV   +Y  VE L+ EM ++ ++ DI++
Sbjct: 153 KKVLHKAEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFT 212

Query: 245 YNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEECLRRV 304
            N  + +  +   +E ME+   R + D+ ++ +W T++  A  YIK G+ EKA E LR+ 
Sbjct: 213 VNTRLHAYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKS 272

Query: 305 ESRIVGRDR-IPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIRAGD 364
           E  +  + R   +  LMS YG+ G KEE YR+W +YK +     N GY ++ISAL++  D
Sbjct: 273 EQMVNAQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKEL-DGFYNTGYISVISALLKMDD 332

Query: 365 IEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSSWEI 424
           IE  EKI EEW +  + +D RI +L +  Y K+GM  KAE   N +V+     ++S+WE 
Sbjct: 333 IEEVEKIMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWER 392

Query: 425 LADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVLVGLL 484
           LA G    G++  A+  WK A        WRP    +++  +  E + D+     ++ LL
Sbjct: 393 LALGYKMAGKMEKAVEKWKRAIEV-SKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL 451

Query: 485 RQ 486
            +
Sbjct: 453 SE 451

BLAST of Cp4.1LG08g02150 vs. TAIR10
Match: AT1G02370.1 (AT1G02370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 246.1 bits (627), Expect = 4.8e-65
Identity = 147/430 (34.19%), Postives = 229/430 (53.26%), Query Frame = 1

Query: 72  IYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYERALEIYDWMSNR 131
           +Y+++S++       A  LNQ+  EG  + K ++FR  K LRK+RR + A EI+DWM  R
Sbjct: 73  LYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWMEKR 132

Query: 132 GETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRR-TYGALLNAYVKARQREK 191
             TF +  SD AI LDLI K  G+ +AE+YF  L  + K+ + TYGAL+N Y    + EK
Sbjct: 133 KMTFSV--SDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEEK 192

Query: 192 AESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRLDIYSYNIWIS 251
           A++    M    +  ++L FN MM++Y+   + EKV  LV  M ++GI     +Y+IW+ 
Sbjct: 193 AKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWMQ 252

Query: 252 SRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEECLRRVESRIVG 311
           S G    ++ +E++ + M +D      W TFS +A +Y K G+ EKA+  L+ +E ++  
Sbjct: 253 SCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKMNP 312

Query: 312 RDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIRAGDIEGAEKI 371
            +R   H+LMSLY  +    E YRVW+  K   P + NL Y  ++ A+ + GD++G +KI
Sbjct: 313 NNRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDGIKKI 372

Query: 372 YEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSSWEILADGLSK 431
           + EW S   AYD R+AN+ +  Y+K  M  +AE   +  ++    P S + ++L   L +
Sbjct: 373 FTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMIHLLE 432

Query: 432 EGRVSDALASWKEAF--SAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVLVGLLRQSKC 491
             +   A+   + A   SAE    W      V  FF   EK  D+   E    +L   K 
Sbjct: 433 NDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFCKILSNWKP 492

Query: 492 LQDKAYVSLI 499
           L  +    LI
Sbjct: 493 LDSETMTFLI 500

BLAST of Cp4.1LG08g02150 vs. TAIR10
Match: AT4G21705.1 (AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 245.0 bits (624), Expect = 1.1e-64
Identity = 135/433 (31.18%), Postives = 230/433 (53.12%), Query Frame = 1

Query: 67  VKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYERALEIYD 126
           VK   +Y +IS + +P+      L  W   GKK++  E+ R++ +LR+ +R+  ALE+  
Sbjct: 22  VKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRKRFLHALEVSK 81

Query: 127 WMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALLNAYVKAR 186
           WM+  G     + ++ A+ LDLI +VYG  +AE+YF  L +  K+ +TYGALLN YV+ +
Sbjct: 82  WMNETGVCV-FSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGALLNCYVRQQ 141

Query: 187 QREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRLDIYSYN 246
             EK+     KM+  G+   +L++N +M LY N  ++EKV  ++ EM E+ +  D YSY 
Sbjct: 142 NVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEENVAPDNYSYR 201

Query: 247 IWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEECLRRVES 306
           I I++ G    +E++      M++ + I  +W T++  A  YI  G  ++A E L+  E+
Sbjct: 202 ICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRAVELLKMSEN 261

Query: 307 RIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIRAGDIEG 366
           R+  +D   +++L++LY  +G K E  R+W + K +     N  Y  ++ +L++   +  
Sbjct: 262 RLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQSLVKIDALVE 321

Query: 367 AEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSSWEILAD 426
           AE++  EW S    YD R+ N  +  Y+ + M  KAE+    +   G      SWE++A 
Sbjct: 322 AEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATTPESWELVAT 381

Query: 427 GLSKEGRVSDALASWKEAFSAE-GSRTWRPKHFKVLAFFNLCEKEGDIASKEVLVGLLRQ 486
             +++G + +A    K A   E GSR WRP    V +  +    EG +   E  V  LR 
Sbjct: 382 AYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKEVESFVASLRN 441

Query: 487 SKCLQDKAYVSLI 499
              +  + Y +L+
Sbjct: 442 CIGVNKQMYHALV 453

BLAST of Cp4.1LG08g02150 vs. NCBI nr
Match: gi|659075585|ref|XP_008438222.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02150 [Cucumis melo])

HSP 1 Score: 893.6 bits (2308), Expect = 1.6e-256
Identity = 442/537 (82.31%), Postives = 483/537 (89.94%), Query Frame = 1

Query: 1   MLLQTSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGTVD 60
           MLLQTSVHH  VSL SS SYS K  P IQSQ FF NVNYRRL LS+TCS+SQ HSYGTVD
Sbjct: 1   MLLQTSVHHRTVSLPSSVSYSRKFLPPIQSQPFFHNVNYRRLRLSVTCSISQVHSYGTVD 60

Query: 61  FERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYER 120
           FERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGK +TKWE+ RV+KELRKY+R+ER
Sbjct: 61  FERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKNLTKWELSRVVKELRKYKRFER 120

Query: 121 ALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALLN 180
           ALEIYDWMSNR E F+LT SDAAIQLDLISKV GI SAEDYF+RLP NLKDRR YGALLN
Sbjct: 121 ALEIYDWMSNREERFRLTTSDAAIQLDLISKVRGIKSAEDYFLRLPDNLKDRRIYGALLN 180

Query: 181 AYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRL 240
           AY KARQREKAE+LLAKMRTKGYA H L FNVMMTLY+N KEYEKV+SLVSEM EK I+L
Sbjct: 181 AYAKARQREKAENLLAKMRTKGYATHPLPFNVMMTLYMNVKEYEKVDSLVSEMTEKSIQL 240

Query: 241 DIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEEC 300
           DIYSYNIW+SS GLQGS +KMEEVYE+MKQD+TINANWTTFSTMATMYIK G+IEKAEEC
Sbjct: 241 DIYSYNIWLSSCGLQGSTDKMEEVYEQMKQDKTINANWTTFSTMATMYIKMGLIEKAEEC 300

Query: 301 LRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIR 360
           LR+VESRIVGRDRIP+HYL+SLYGSVGNKEE YRVW +YK +FPTIPNLGYHAIISALIR
Sbjct: 301 LRKVESRIVGRDRIPYHYLISLYGSVGNKEEMYRVWNIYKNVFPTIPNLGYHAIISALIR 360

Query: 361 AGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSS 420
            GD+EGAEKIYEEWL+VK  YDPRIANLF+GWYVKE   SKAE FF+HMVE GGKPNSS+
Sbjct: 361 VGDVEGAEKIYEEWLTVKATYDPRIANLFIGWYVKEANMSKAEGFFDHMVEVGGKPNSST 420

Query: 421 WEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVLV 480
           WEILADG +KEGRVSDALASWKEAFS+EGS++WRPK + VLA+F+LCEKEGDIASKEVLV
Sbjct: 421 WEILADGHTKEGRVSDALASWKEAFSSEGSKSWRPKPYNVLAYFDLCEKEGDIASKEVLV 480

Query: 481 GLLRQSKCLQDKAYVSLIGLSDETIDNDVVSGEGSNIDDESDKTVYESDDSEMLLEM 538
           G LRQ K LQDK+Y SLIGL DETIDND VS +GS+I+DE DKT YESDDSEMLL++
Sbjct: 481 GFLRQPKYLQDKSYASLIGLLDETIDNDEVSEKGSSINDEIDKTEYESDDSEMLLKL 537

BLAST of Cp4.1LG08g02150 vs. NCBI nr
Match: gi|449432307|ref|XP_004133941.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02150 [Cucumis sativus])

HSP 1 Score: 887.9 bits (2293), Expect = 9.0e-255
Identity = 439/537 (81.75%), Postives = 480/537 (89.39%), Query Frame = 1

Query: 1   MLLQTSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGTVD 60
           MLLQTSVHH  VSL S  SYS K  P+IQSQ FF NVNYRRLPLS+TCS+SQ HSYGTVD
Sbjct: 1   MLLQTSVHHRTVSLPSPISYSRKFLPLIQSQPFFHNVNYRRLPLSVTCSISQVHSYGTVD 60

Query: 61  FERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYER 120
           FERRPM KWNAIYRRISLMENPELGSASVLNQWENEGK ITKWE+ RV+KELRKY+R+ER
Sbjct: 61  FERRPMFKWNAIYRRISLMENPELGSASVLNQWENEGKNITKWELSRVVKELRKYKRFER 120

Query: 121 ALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALLN 180
           ALEIYDWMSNR E F+LT SDAAIQLDLISKV GI SAE+YF+RLP +LKDRR YGALLN
Sbjct: 121 ALEIYDWMSNREERFRLTTSDAAIQLDLISKVRGIKSAEEYFLRLPNHLKDRRIYGALLN 180

Query: 181 AYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIRL 240
           AY K RQREKAE+LL KMRTKG+  H L FNVMMTLY+N KEYEKVESLVSEM E  I+L
Sbjct: 181 AYAKGRQREKAENLLEKMRTKGFTTHPLPFNVMMTLYMNVKEYEKVESLVSEMTENSIQL 240

Query: 241 DIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEEC 300
           DIYSYNIW+SS GLQGS EKMEEVYE+MKQDRTINANWTTFSTMATMYIK G++EKAEEC
Sbjct: 241 DIYSYNIWLSSCGLQGSTEKMEEVYEQMKQDRTINANWTTFSTMATMYIKMGLMEKAEEC 300

Query: 301 LRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALIR 360
           LRRVESRIVGRDRIP+HYL+SLYGSVGNKEE YRVW +YK +FPTIPNLGYHAIISALIR
Sbjct: 301 LRRVESRIVGRDRIPYHYLISLYGSVGNKEEMYRVWNIYKNVFPTIPNLGYHAIISALIR 360

Query: 361 AGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSSS 420
            GD+EGAEKIYEEWL+VK+ YDPRIANLF+GWYVKEG +SKAESFF+HMVE GGKPNSS+
Sbjct: 361 VGDVEGAEKIYEEWLTVKSTYDPRIANLFIGWYVKEGNTSKAESFFDHMVEVGGKPNSST 420

Query: 421 WEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVLV 480
           WEIL D  +KEGRVSDALASWKEAFSAEGS++WRPK + VLA+F+LCEKEGDIASKEVLV
Sbjct: 421 WEILVDRHTKEGRVSDALASWKEAFSAEGSKSWRPKPYNVLAYFDLCEKEGDIASKEVLV 480

Query: 481 GLLRQSKCLQDKAYVSLIGLSDETIDNDVVSGEGSNIDDESDKTVYESDDSEMLLEM 538
           GLLRQ K LQDK Y SLIGL DETIDN+ VS +GSNI+DE DKT YESDDSEM L++
Sbjct: 481 GLLRQPKYLQDKTYASLIGLLDETIDNNEVSEKGSNINDEIDKTEYESDDSEMFLKL 537

BLAST of Cp4.1LG08g02150 vs. NCBI nr
Match: gi|590724054|ref|XP_007052357.1| (Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 740.7 bits (1911), Expect = 1.8e-210
Identity = 361/534 (67.60%), Postives = 447/534 (83.71%), Query Frame = 1

Query: 1   MLLQ-TSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGTV 60
           MLLQ +S+ +  VSLSS++SYS +LP  +      +  +Y++LP  +TCS+SQ HSYGTV
Sbjct: 1   MLLQPSSLLNHRVSLSSTSSYSRQLPCQVPQLILSQTQSYQKLP--VTCSISQIHSYGTV 60

Query: 61  DFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYE 120
           D+ERRPM+KWNAIY++ISLMENPELGSASVLN+WE  G+K+TKWE+ RV+KELRKY+RY+
Sbjct: 61  DYERRPMIKWNAIYKKISLMENPELGSASVLNEWEKGGRKLTKWELCRVVKELRKYKRYK 120

Query: 121 RALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALL 180
           +ALE+YDWM+NRGE F+L+ASDAAIQLDLI+KV G++SAED+FV+LP  +KD+R YGALL
Sbjct: 121 QALEVYDWMNNRGERFRLSASDAAIQLDLIAKVRGVSSAEDFFVQLPDTMKDKRIYGALL 180

Query: 181 NAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIR 240
           NAYV+A+ R+KAE+L+  MR KGYA+H L FNVMMTLY+N KEY+KVES+VSEM+EK IR
Sbjct: 181 NAYVRAKMRDKAETLIDNMRGKGYAMHPLPFNVMMTLYMNLKEYDKVESMVSEMMEKNIR 240

Query: 241 LDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEE 300
           LDIYSYNIW+SS G QGS+EKMEEVYE+MKQD++IN NWTTFSTMATMYIK G+ EKAEE
Sbjct: 241 LDIYSYNIWLSSCGSQGSVEKMEEVYEQMKQDQSINPNWTTFSTMATMYIKMGLTEKAEE 300

Query: 301 CLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALI 360
           CLR VESRI GRDRIP+HYL+SLYG VGN+EE YRVWKVYK+IFP+IPNLG+HA+IS+L+
Sbjct: 301 CLRNVESRITGRDRIPYHYLISLYGGVGNREEVYRVWKVYKSIFPSIPNLGFHAVISSLV 360

Query: 361 RAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSS 420
           RAGDI+GAE+IYEEWL+VKT+YDPRIANL MGWYVKEG   KAES F+H+ E GGKPNSS
Sbjct: 361 RAGDIQGAERIYEEWLTVKTSYDPRIANLLMGWYVKEGNLDKAESLFSHIAEVGGKPNSS 420

Query: 421 SWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVL 480
           SWEILA+G   E R+ DAL+  K+AF+ EGSR WRPK   V AFFNLCE++ D+AS+EV 
Sbjct: 421 SWEILAEGHILEKRIPDALSCLKDAFATEGSRGWRPKPTSVSAFFNLCEEKVDMASREVF 480

Query: 481 VGLLRQSKCLQDKAYVSLIGLSDETIDNDVVSGEGSNIDDESDKTVYESDDSEM 534
           VGLLRQS CL+++AY SLIGLS+E +    +  + +     S     + D SE+
Sbjct: 481 VGLLRQSGCLKNEAYASLIGLSEEALSESELPRDKNRKSSYSSSDENQDDGSEV 532

BLAST of Cp4.1LG08g02150 vs. NCBI nr
Match: gi|1009138398|ref|XP_015886567.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02150 [Ziziphus jujuba])

HSP 1 Score: 734.2 bits (1894), Expect = 1.7e-208
Identity = 358/539 (66.42%), Postives = 447/539 (82.93%), Query Frame = 1

Query: 1   MLLQTSVHHCNVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCS-VSQTHSYGTV 60
           MLLQ+S+HH  V+LSSS S S  L   + +    +++NY RL  SITCS +SQ H+YGTV
Sbjct: 1   MLLQSSMHHHKVALSSSLSSSPPLAWKVPTFTLHQSINYNRL--SITCSSISQVHNYGTV 60

Query: 61  DFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRYE 120
           D+ERRP++KWN IY+RISLMENPELGSA+VLNQWE EG+K+TKWE+ RV+KELRKY+RYE
Sbjct: 61  DYERRPLIKWNIIYKRISLMENPELGSATVLNQWEKEGRKLTKWELCRVVKELRKYKRYE 120

Query: 121 RALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGALL 180
           RALE+Y+WM+NRGE F+L+ SDAAIQLDLI+KV G++SAE YF++LP +LKDRR YGALL
Sbjct: 121 RALEVYEWMNNRGERFRLSTSDAAIQLDLIAKVRGVSSAEGYFMKLPDSLKDRRIYGALL 180

Query: 181 NAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGIR 240
           NAYV+A+ +EKAESLL +MR+KG+A+H+L +NVMMTLY+N KEYEKV+ LVSEM+EK I+
Sbjct: 181 NAYVRAKMKEKAESLLDRMRSKGHALHSLPYNVMMTLYMNLKEYEKVDLLVSEMMEKNIK 240

Query: 241 LDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAEE 300
           LDIYSYNIW+SSRG QGS EKMEEV+++MK DRTIN NWTTFST+ATMYIK   IEKAEE
Sbjct: 241 LDIYSYNIWLSSRGSQGSAEKMEEVFQQMKLDRTINPNWTTFSTLATMYIKMEQIEKAEE 300

Query: 301 CLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISALI 360
           CL++VESRI GRDRIP+HYL+SLYGSVGNKEE YR+W VYK +FP IPNLGYHAII +L+
Sbjct: 301 CLKKVESRITGRDRIPYHYLLSLYGSVGNKEEIYRIWNVYKAVFPNIPNLGYHAIICSLL 360

Query: 361 RAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNSS 420
           R GDIEGAEKIY+EWLSV+++YDPRIANLF+  YVKEG   KA+ FF+HM+E GGKPNSS
Sbjct: 361 RIGDIEGAEKIYDEWLSVRSSYDPRIANLFITCYVKEGNLEKAKGFFDHMIEVGGKPNSS 420

Query: 421 SWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEVL 480
           +WE LA+G + E    +AL+ WKEAF+ EGS++WRPK   V  F +LCE+E D+ASKEVL
Sbjct: 421 TWETLAEGHTAEKNTFEALSCWKEAFAEEGSKSWRPKPINVTLFLDLCEQEADLASKEVL 480

Query: 481 VGLLRQSKCLQDKAYVSLIGLSDETIDND---VVSGEGSNIDDESDKTVYESDDSEMLL 536
           VGLLRQ+  ++DK+Y SL+GLSDE  ++D    ++ E  N DD+ +    + D S ML+
Sbjct: 481 VGLLRQAGYIKDKSYASLVGLSDEANNDDNGNRMTSERGNPDDDEEDNENDEDGSGMLI 537

BLAST of Cp4.1LG08g02150 vs. NCBI nr
Match: gi|645253007|ref|XP_008232381.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02150 [Prunus mume])

HSP 1 Score: 731.1 bits (1886), Expect = 1.4e-207
Identity = 354/539 (65.68%), Postives = 451/539 (83.67%), Query Frame = 1

Query: 1   MLLQTSVHHC--NVSLSSSTSYSHKLPPVIQSQAFFRNVNYRRLPLSITCSVSQTHSYGT 60
           M LQTSVHH   N+ LSSS SYS  LP  I +     ++N++RLP SI+CS+SQ H+YGT
Sbjct: 1   MPLQTSVHHRHHNLPLSSSLSYSTILPCKIPTLPLPSSINFQRLP-SISCSISQVHNYGT 60

Query: 61  VDFERRPMVKWNAIYRRISLMENPELGSASVLNQWENEGKKITKWEIFRVIKELRKYRRY 120
           VD+ERRPMVKWNAIYR+ISL ++PE+ SA VLNQWE EG+K+TKWE+ RV+KELRKY+RY
Sbjct: 61  VDYERRPMVKWNAIYRKISLTDDPEVRSADVLNQWEKEGRKLTKWELCRVVKELRKYKRY 120

Query: 121 ERALEIYDWMSNRGETFKLTASDAAIQLDLISKVYGIASAEDYFVRLPKNLKDRRTYGAL 180
           +RALE+YDWMSNRGE F+++ SDAAIQLDL++KV G+ASAE+YF+ LP  LKDRR YGAL
Sbjct: 121 DRALEVYDWMSNRGERFRISTSDAAIQLDLVAKVRGVASAENYFLSLPDTLKDRRIYGAL 180

Query: 181 LNAYVKARQREKAESLLAKMRTKGYAIHTLSFNVMMTLYVNFKEYEKVESLVSEMIEKGI 240
           LNAYV+ R +EKAESLL KMR+KG+A+ +L FNVMMTLY+N KEY+KV+S++SEM+EK I
Sbjct: 181 LNAYVRTRMKEKAESLLDKMRSKGHALQSLPFNVMMTLYMNLKEYDKVDSIISEMMEKNI 240

Query: 241 RLDIYSYNIWISSRGLQGSIEKMEEVYERMKQDRTINANWTTFSTMATMYIKTGMIEKAE 300
           +LDIYSYNIW+SSRG QGS E+ME+V+E+MK DRT+N NWTTFSTMATMYIK G +EKAE
Sbjct: 241 QLDIYSYNIWLSSRGSQGSEERMEQVFEQMKLDRTVNPNWTTFSTMATMYIKMGQLEKAE 300

Query: 301 ECLRRVESRIVGRDRIPFHYLMSLYGSVGNKEETYRVWKVYKTIFPTIPNLGYHAIISAL 360
            CL++VESRI GRDRIP+HYL+SLYG+VGNKEE YRVW +YK+ FP+IPNLGYHAI+S+L
Sbjct: 301 ACLKKVESRITGRDRIPYHYLLSLYGNVGNKEELYRVWNIYKSSFPSIPNLGYHAIMSSL 360

Query: 361 IRAGDIEGAEKIYEEWLSVKTAYDPRIANLFMGWYVKEGMSSKAESFFNHMVEAGGKPNS 420
           +R GD+EGAEKIYEEWL+VK+ YDPRIAN+F+ +Y+K+G   KA+SF++HMV+ GGKPNS
Sbjct: 361 LRVGDVEGAEKIYEEWLTVKSTYDPRIANVFIAYYIKDGDFEKAQSFYDHMVDVGGKPNS 420

Query: 421 SSWEILADGLSKEGRVSDALASWKEAFSAEGSRTWRPKHFKVLAFFNLCEKEGDIASKEV 480
           S+WE LA+G ++E R+S+AL+ WKEAFSAEGS++WRPK   V AF  LCE+E +  SKEV
Sbjct: 421 STWETLAEGHTEEQRISEALSCWKEAFSAEGSKSWRPKPVNVSAFLELCEQEANSVSKEV 480

Query: 481 LVGLLRQSKCLQDKAYVSLIGLSDETIDNDVVS--GEGSNIDDESDKTVYESDDSEMLL 536
            +GLL+QS  L++K+Y SLIGL+DE + +D +S   + +NI ++ D      D SE+LL
Sbjct: 481 FMGLLKQSGQLKNKSYASLIGLADEDVSDDDLSLKKDRTNITNDDDDEKEAGDGSELLL 538

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR3_ARATH1.7e-18461.67Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN... [more]
PPR86_ARATH3.0e-6933.86Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN... [more]
PP166_ARATH1.2e-6834.36Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop... [more]
PPR4_ARATH8.6e-6434.19Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidop... [more]
PP334_ARATH1.9e-6331.18Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0L6I4_CUCSA6.2e-25581.75Uncharacterized protein OS=Cucumis sativus GN=Csa_3G127020 PE=4 SV=1[more]
A0A061DUM9_THECC1.2e-21067.60Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 OS=Theobroma c... [more]
A0A0B0NQQ4_GOSAR1.8e-20667.16Uncharacterized protein OS=Gossypium arboreum GN=F383_17735 PE=4 SV=1[more]
M5XM86_PRUPE2.4e-20665.31Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003822mg PE=4 SV=1[more]
A0A0D2MUP8_GOSRA4.5e-20566.79Uncharacterized protein OS=Gossypium raimondii GN=B456_004G069400 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G02150.19.4e-18661.67 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G60770.11.7e-7033.86 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G20710.16.5e-7034.36 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G02370.14.8e-6534.19 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G21705.11.1e-6431.18 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659075585|ref|XP_008438222.1|1.6e-25682.31PREDICTED: pentatricopeptide repeat-containing protein At1g02150 [Cucumis melo][more]
gi|449432307|ref|XP_004133941.1|9.0e-25581.75PREDICTED: pentatricopeptide repeat-containing protein At1g02150 [Cucumis sativu... [more]
gi|590724054|ref|XP_007052357.1|1.8e-21067.60Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cac... [more]
gi|1009138398|ref|XP_015886567.1|1.7e-20866.42PREDICTED: pentatricopeptide repeat-containing protein At1g02150 [Ziziphus jujub... [more]
gi|645253007|ref|XP_008232381.1|1.4e-20765.68PREDICTED: pentatricopeptide repeat-containing protein At1g02150 [Prunus mume][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071840 cellular component organization or biogenesis
biological_process GO:0034660 ncRNA metabolic process
biological_process GO:0044699 single-organism process
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003729 mRNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG08g02150.1Cp4.1LG08g02150.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 351..375
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 387..430
score: 2.
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 174..219
score: 1.0E-4coord: 228..286
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 351..376
score: 0.0021coord: 209..241
score: 4.9E-5coord: 387..417
score: 4.4E-5coord: 174..203
score: 1.1E-5coord: 243..271
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 241..271
score: 8.068coord: 206..240
score: 9.175coord: 347..377
score: 6.796coord: 101..135
score: 5.492coord: 277..307
score: 7.618coord: 382..416
score: 9.142coord: 171..205
score: 10.567coord: 417..452
score: 7
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 170..307
score: 3.2E-11coord: 343..447
score: 3.2
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 159..374
score: 3.4
NoneNo IPR availableunknownCoilCoilcoord: 251..271
scor
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..531
score: 8.1E
NoneNo IPR availablePANTHERPTHR24015:SF27SUBFAMILY NOT NAMEDcoord: 1..531
score: 8.1E

The following gene(s) are paralogous to this gene:

None