Cp4.1LG01g04330 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g04330
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG01 : 1353506 .. 1358571 (+)
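The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention). The names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cp4.1LG01 : 1353506 .. 1358571 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG01 : 1353506 .. 1358571 (+)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the feature spans 5066 positions on the plus strand of Cp4.1LG01.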
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTCTCGTGCTAAAGACGAAAAAGTTCTAATTTGTATGCGTTTTGGGGAAGAATGCTTGCAACGCTCATTTGAGATGTTATAGTACAACGTATAGATGCTTGTGTATGTACGTGAATTAGCTTGATTAACTATAGTTTGAGCGTCTAAATGTTAGTGAAGTGTCTCAATGTTGAAACCTTGAGAAATTTTGACTTGGACACCGTCTTCTCTTTCACTTGGTGATGATATTTGGTGATAAGGATGAAGCCCCTATTAATTATAGACCTTCATTCCGAGTTGTGGAACTAAGTTCTTTCGTTTTAAGTGAGCTAGACAAAAATACCAGATTAACCCAATATTATTGAAAATACCGTTAAAGCTACGAGCTAATAAACCAATTTGAACACTGCTCTACTGGAATTAGNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACTGGATTTAAGAAGCTAGAGCTTAGTGGAAGGATGAATTACTCATACTATTATTGATATTTTTATTTTGAAAGTTTGAGATATTTTTGAATGTTTTTATAATTAATTATATATATATATATAAATAGTTTAACCTAATTGATTCATGGAGGAAGGAGGTGAAAGATGAATTTGATCCCATATAAATTATTTGAAATTATAATTACCAAACCAAAATTAGGGGAACAAAAAAGAATGTTAATTCGAAGACGAATTGGGATTGAATTGTTTTTCAGTGGGTTACAAGAAACAGAACAGTTTTCTTTCGAATCGTAAGCACGATGGCGATTCTCCTCAGTTTCAACAATTATGGGGTCCATCTTCGCCGGCCGCCGCCATATCCCCTCCGTCTCTCAAGCTGCTGCAATCGTACAGCTTCGTCGGTACTTGCTTCTCCGGAGAACTTCATCAGCAGGAAGATAGGGATTTGACGGAGCTGGCCTTTGCAGCTCCTCGGACTTCCAGCCATGTCTTTGCATTCATTTTCGCTCTCTCTCTCCTTGTCGTCGCTATCTACAGCTCTCTCAAAGGCGGCGGCAACCTCACAGGAGGCTTTATTGAGGAGGTATTTCACTTTCAGTTTCTATTTTCAAGGCCATTTTCTGTGATTGGAAGTATCAGCGTTTGTGGTTTGAATTCGCTCGCCTAGCTTTGGCTCGTGTTTAGGTTATAGAGAAAGGGTAATATATATTGAATATTAGTTATCTTATTAGCTTCGTTGGATTGAGATTTTCAGCTAACTTTATACCATTTTAGGTTAGAAATCAGCTGTATATTTAGTATTTTTAGTAGATGACCATCTTGTGATTTCATATATGACTTCGAAACACTGTTTTGGTGAGATTCCACGTTGATTGGGGAGGAGAACGAAACATTTCTTTTATAAGGATGTGGAAACCTCTCCATGCGTTTTAAAAACTTTGAGGGTAAGCCCAAATAGGACAATATCTACTAGTGGTGAGCTTGAGCCGTTACAAATGGTATTAGAGCCAAATACCGGACGGTGTGTCAGCAAACACGTTGGGCCTCGAGGGGTGGATTGAGGGGTCTCACATTGATTGGAGAAGGAAACGAGTGCCAGCAAGGATGCTGGGCCTCGAAGGGGGTGGATTCTGAGATCCCACATCGATTGGGGAGGAAAACAAAACATTCTTTATCAAGGTGTTGAAACCTCTCCCTAACAGGCGTTTTAAAAACCTTGAGGAGAAGTCCGAAAGGAAAATCCTAAAGAGAACAATATTTGCTAGCGATGGGTTTGGGCCGTTACAGTTTTTCTAGAGCTATTGCCTTCAACAATAGTGGTATAGCTCGTTTGTATCAGAAATTTTTACCCTGTCATCGTGTTTCCAAACCCCACATGAATGATGACGATCTTCACTTCAACTTCAATTCCCTCGTCTTCTTTTTGCACTTCTAAGCTGTAACTCATCCTGTAGGAAGCATTTGGATCAATTATACGTCCAGTTAACTGTGTCTGGGCTATACAAGTGTGGTTTCTTGGTTATCAAATTTA
TCAATGCATGTTTGCATCTCAGAGATGTTAACTACGCACATAAGGTTTTTCGTGAAGTCTTAGAACCAGATATCTTGTTGTGGAATGGCATCATAAAGGGCTACACTCAGAACAATATTTTTGCTGGTGCTATCAGAATGTATAAGGATATGCAAGTGTCAGGGGTGAACCCAGATTGCTTCACATTTTTGTATGTGCTTAAAGCGTGCAGTGGAATGTCGGTCGAAGGAATAGGTAAACAGATGCATAGCCAGACGTTTAAATATGGCCTTGGATCAAATGTGTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAGATTTGGCCAAACCTCATCTGCTAGGCTCGTCTTTGATAAGTTACATAATAGAACTGTTGTTTCGTGGACGTCCATCATTTCTGGGTATGTTCAAAATGGCGATCCCGTGGACGCGTTGAGAGTTTTCAAAGATATGAGGCGAAGTACTGTGAAACTTGATTGGATTGTCCTTGTTAGTGTTGTGACAGCCTACACAGACATGGAGGATTTGGGGCAAGGAAAAGCCATTCATAGCTTAGTGACTAAATTAGGTCTAGAATTCGAACCCGACATAGTGGTCTCGCTCACTAACATGTATGCTAAATGTGGACGGGTGGAAGTTGCTAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTACTTTTGTGGAATGCTATGATTTCTGGTTATGCAAAAAATGGATATGGTGAAGAAGCAATCGAGCTATTCCGTAAGATGATTTCAAAGAATATCGGGGTCGATTCTGTTACTGTGAGGTCTGCTATTCTAGCCGTTGCCCAAGCGGGGTCTCTTGAACTAGCAAGATGGTTGGATGGTTATATCTCTAAGAGTGAGTACCGAGATGATGTTTTTGTGAACACAGCCCTTATAGATATGCATGCAAAATGTGGAAGCATATGTTTTGCTCGTAGTGTTTTCGATAGAATGGTCGATAAAGACGTTGTCTTATGGAGTGCTATGATTATGGGGTATGGATTACACGGTCATGGACAAGAAGCCATCGACCTTTACAACAGAATGAAGCAATCAGGAGTTCGTCCGAACGACGTTACTTTTGTTGGCCTTCTCACAGCATGTAAAAACTCGGGTCTTGTAAAAGAGGGATGGGAGCTTTTCCACCAGATGCGAGACTACGGGATTGAACCGCATCACCAGCATTACTCTTGCGTGGTCGATCTTCTGGGACGTGCAGGCTATTTGAATCGAGCTTATGATTTTATTATGAGCATGCCCATTAAACCTGGAGTTAGTGTTTGGGGGGCACTTTTAAGTGGATGTAAGATCCATCGTCAAGTGAGGTTGGGAGAGATAGCTGCAGAACAGCTTTTCTTATTAGATCCATATAATACAGGTCATTATGTACAACTCTCAAACTTATATGCTTCTGCCCATTTATGGAACCACGTGGGGAACGTTCGATTAATGATGACACAGAAAGGATTGAACAAGGACCTCGGACATAGTTCGATTGAGATCAATGGAAATCTCGAAACGTTCCATGTTGGAGATAGATCACATCCGAGATCGAAGGAAATCTTTGAAGAACTTGATAGATTGGAGAGGAGATTAAAGGCAGCTGGTTATGTTGCTCATATGGAATCTGTTCTACATGACTTGAATGATGAGGAGATTGAGGAAACTCTTTGTAACCATAGTGAGAGGTTAGCAGTTGCTTATGGCATCATCAGTACTGCTCCTGGAACTACACTTAGAATAACGAAAAATCTCCGTGCATGCGTTAATTGTCATTCGGCGATAAAGCTAATATCGAAGCTTGTCGATAGGGAAATAATTGTTCGAGATGCGAAACGCTTTCATCATTTCAAAGATGGAGTTTGTTCGTGCGGAGATTTTTGGTGAAGCTTGGTTAGTATTCTTTACTTACATCAACCTTACACGTAATGAATTTGCTGATTTTCTTTTGGCCTATACTAACCATCATTGATTCTGATGTGAAAT
TGAAGAAAGTCCAATTTTTCTGTGAAATGGTAACCAATCCATTGCCCCTTCATGTGCCTTTTTTTCAATGTTCAAGTTTGATCTATTATAATTTTCTGTTTGAGATGCTTCCGAACATTCTATTTTAGTTATATTGGAGAAAAACATGTCAAACTTCTGTTATGCATTTTAATAAGAATTCTACGTGGCATCACATGAATTTATGTGATAATTTTTCTTGAAGTTAATGATTATAAATCAATAAAGATGGTTGAAGTTAATTGATTCACTCGCGAAGTTTAGGATTTAGAACGATAGAACTCTGTGGTGATTTAGAACGATAGAACTCTTAAAGTTTAGGATTTAGATTGATAGAACTCTGTGGTGGTGCCTAACGGCGCCCTAACCTAGATCTCAAAAAGGGCCCTAACCTAGATCCCACAAAGGGCCCTAACCTAGATCCCACAAAGGGTTTTGGTCAATTAAGACAACAGACCAGTGGTTATGGAAGTTGAAATTTGCTAAAAAGTGTGTAACAACTCACCTACCGAATCAACTAGCTCAAAAAATGGATGGTCTCGCGATCTATACCCGACTCTTTGTACAAATTAGTTTATCAATCTATTGAGCCATCCAACCATCTATCTTTTGATTTTACAATAACATAAGAAGTAGTCTCTCTAAACTTTTCTTAAATTTTTGGTTTGCAAGAATTAATGTCATTATTTCTTTAAATTTAATCACAAGTTTTACAATTTACAACCTCTTAGTATTAAAAATCATCCAGCGAAATAAGTTGCAAACAACTTTTCAAGTTCTTAAAATCTTAAAAGTGGAAGGTGGTGGGATTGTCAACATTTAGCAATGGCTAATGGATACTCATTGACCAATTGAGATTGAATGGCTCACAACATGAAAGCTGGTGTTGGGTAATATGATTGAACTCTTCCAAAGATTCAAAGGGAAAATAATGGGTTTAGTTAATGATGAAGAAGATTGTGGTTTTGTGGTTAAAGGAGCTGCAACTCAAACACTGTTCACTGAAATTTCAAGTGTTCAAAACCACTATGTCCTAATCAAAGCGTAG

mRNA sequence

ATGGTGGGTTACAAGAAACAGAACAGTTTTCTTTCGAATCGTAAGCACGATGGCGATTCTCCTCAGTTTCAACAATTATGGGGTCCATCTTCGCCGGCCGCCGCCATATCCCCTCCGTCTCTCAAGCTGCTGCAATCGTACAGCTTCGTCGGTACTTGCTTCTCCGGAGAACTTCATCAGCAGGAAGATAGGGATTTGACGGAGCTGGCCTTTGCAGCTCCTCGGACTTCCAGCCATGTCTTTGCATTCATTTTCGCTCTCTCTCTCCTTGTCGTCGCTATCTACAGCTCTCTCAAAGGCGGCGGCAACCTCACAGGAGGCTTTATTGAGGAGTTAACTGTGTCTGGGCTATACAAGTGTGGTTTCTTGGTTATCAAATTTATCAATGCATGTTTGCATCTCAGAGATGTTAACTACGCACATAAGGTTTTTCGTGAAGTCTTAGAACCAGATATCTTGTTGTGGAATGGCATCATAAAGGGCTACACTCAGAACAATATTTTTGCTGGTGCTATCAGAATGTATAAGGATATGCAAGTGTCAGGGGTGAACCCAGATTGCTTCACATTTTTGTATGTGCTTAAAGCGTGCAGTGGAATGTCGGTCGAAGGAATAGGTAAACAGATGCATAGCCAGACGTTTAAATATGGCCTTGGATCAAATGTGTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAGATTTGGCCAAACCTCATCTGCTAGGCTCGTCTTTGATAAGTTACATAATAGAACTGTTGTTTCGTGGACGTCCATCATTTCTGGGTATGTTCAAAATGGCGATCCCGTGGACGCGTTGAGAGTTTTCAAAGATATGAGGCGAAGAGCTGCAACTCAAACACTGTTCACTGAAATTTCAAGTGTTCAAAACCACTATGTCCTAATCAAAGCGTAG

Coding sequence (CDS)

ATGGTGGGTTACAAGAAACAGAACAGTTTTCTTTCGAATCGTAAGCACGATGGCGATTCTCCTCAGTTTCAACAATTATGGGGTCCATCTTCGCCGGCCGCCGCCATATCCCCTCCGTCTCTCAAGCTGCTGCAATCGTACAGCTTCGTCGGTACTTGCTTCTCCGGAGAACTTCATCAGCAGGAAGATAGGGATTTGACGGAGCTGGCCTTTGCAGCTCCTCGGACTTCCAGCCATGTCTTTGCATTCATTTTCGCTCTCTCTCTCCTTGTCGTCGCTATCTACAGCTCTCTCAAAGGCGGCGGCAACCTCACAGGAGGCTTTATTGAGGAGTTAACTGTGTCTGGGCTATACAAGTGTGGTTTCTTGGTTATCAAATTTATCAATGCATGTTTGCATCTCAGAGATGTTAACTACGCACATAAGGTTTTTCGTGAAGTCTTAGAACCAGATATCTTGTTGTGGAATGGCATCATAAAGGGCTACACTCAGAACAATATTTTTGCTGGTGCTATCAGAATGTATAAGGATATGCAAGTGTCAGGGGTGAACCCAGATTGCTTCACATTTTTGTATGTGCTTAAAGCGTGCAGTGGAATGTCGGTCGAAGGAATAGGTAAACAGATGCATAGCCAGACGTTTAAATATGGCCTTGGATCAAATGTGTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAGATTTGGCCAAACCTCATCTGCTAGGCTCGTCTTTGATAAGTTACATAATAGAACTGTTGTTTCGTGGACGTCCATCATTTCTGGGTATGTTCAAAATGGCGATCCCGTGGACGCGTTGAGAGTTTTCAAAGATATGAGGCGAAGAGCTGCAACTCAAACACTGTTCACTGAAATTTCAAGTGTTCAAAACCACTATGTCCTAATCAAAGCGTAG

Protein sequence

MVGYKKQNSFLSNRKHDGDSPQFQQLWGPSSPAAAISPPSLKLLQSYSFVGTCFSGELHQQEDRDLTELAFAAPRTSSHVFAFIFALSLLVVAIYSSLKGGGNLTGGFIEELTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSMYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRRRAATQTLFTEISSVQNHYVLIKA
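The protein sequence follows from the CDS by codon-wise translation. The sketch below assumes the standard nuclear genetic code and uses a hand-built codon table; `translate` is a hypothetical helper written for illustration, not the pipeline that produced this record. It is applied here only to the first and last few codons of the CDS.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


head = translate("ATGGTGGGTTACAAGAAACAG")  # first seven codons of the CDS
tail = translate("ATCAAAGCGTAG")           # last four codons, including the stop
print(head, tail)
```

The two fragments translate to the start (MVGYKKQ) and end (IKA) of the protein sequence listed above.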
BLAST of Cp4.1LG01g04330 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 4.4e-41
Identity = 80/172 (46.51%), Positives = 119/172 (69.19%), Query Frame = 1

Query: 112 LTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGA 171
           L V GL   GFL+ K I+A     D+ +A +VF ++  P I  WN II+GY++NN F  A
Sbjct: 44  LLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDA 103

Query: 172 IRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSM 231
           + MY +MQ++ V+PD FTF ++LKACSG+S   +G+ +H+Q F+ G  ++VFVQN L+++
Sbjct: 104 LLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIAL 163

Query: 232 YARFGQTSSARLVFD--KLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRR 282
           YA+  +  SAR VF+   L  RT+VSWT+I+S Y QNG+P++AL +F  MR+
Sbjct: 164 YAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRK 215
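The identity and positives percentages in each HSP header are the match counts divided by the alignment length. A minimal sketch of that arithmetic, assuming the header line format shown in this report; `parse_hsp_stats` is a hypothetical helper, and the pattern also tolerates the "Postives" spelling that some report pages use.

```python
import re

# Matches e.g. "Identity = 80/172 (46.51%), Positives = 119/172 (69.19%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positive, positive_length = map(int, match.groups())
    return {
        "aligned_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / positive_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 80/172 (46.51%), Positives = 119/172 (69.19%), Query Frame = 1"
)
print(stats)
```

For the PP224_ARATH hit, 80 identical residues over 172 aligned positions gives 46.51% identity, and 119 conservative-or-identical positions gives 69.19% positives.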

BLAST of Cp4.1LG01g04330 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 128.3 bits (321), Expect = 1.5e-28
Identity = 59/159 (37.11%), Positives = 103/159 (64.78%), Query Frame = 1

Query: 137 VNYAHKVFREVLEP-DILLWNGIIKGYTQNNIFAGAIRMYKDMQVSG-VNPDCFTFLYVL 196
           ++YAHKVF ++ +P ++ +WN +I+GY +      A  +Y++M+VSG V PD  T+ +++
Sbjct: 69  MSYAHKVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLI 128

Query: 197 KACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSMYARFGQTSSARLVFDKLHNRTVV 256
           KA + M+   +G+ +HS   + G GS ++VQNSL+ +YA  G  +SA  VFDK+  + +V
Sbjct: 129 KAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLV 188

Query: 257 SWTSIISGYVQNGDPVDALRVFKDMRRRAATQTLFTEIS 294
           +W S+I+G+ +NG P +AL ++ +M  +      FT +S
Sbjct: 189 AWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVS 227

BLAST of Cp4.1LG01g04330 vs. Swiss-Prot
Match: PP165_ARATH (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 1.9e-28
Identity = 58/166 (34.94%), Positives = 101/166 (60.84%), Query Frame = 1

Query: 112 LTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGA 171
           + + GL +  F+V K ++ C  + D++YA ++F +V  P++ L+N II+ YT N+++   
Sbjct: 33  IIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQVSNPNVFLYNSIIRAYTHNSLYCDV 92

Query: 172 IRMYKDM-QVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 231
           IR+YK + + S   PD FTF ++ K+C+ +    +GKQ+H    K+G   +V  +N+L+ 
Sbjct: 93  IRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYLGKQVHGHLCKFGPRFHVVTENALID 152

Query: 232 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVF 277
           MY +F     A  VFD+++ R V+SW S++SGY + G    A  +F
Sbjct: 153 MYMKFDDLVDAHKVFDEMYERDVISWNSLLSGYARLGQMKKAKGLF 198

BLAST of Cp4.1LG01g04330 vs. Swiss-Prot
Match: PP271_ARATH (Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis thaliana GN=PCMP-H77 PE=3 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 1.6e-27
Identity = 65/157 (41.40%), Positives = 92/157 (58.60%), Query Frame = 1

Query: 123 LVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSG 182
           L +K + A   L+DV  A KVF E+ E ++++ N +I+ Y  N  +   ++++  M    
Sbjct: 76  LGVKLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGFYGEGVKVFGTMCGCN 135

Query: 183 VNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSMYARFGQTSSAR 242
           V PD +TF  VLKACS      IG+++H    K GL S +FV N LVSMY + G  S AR
Sbjct: 136 VRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNGLVSMYGKCGFLSEAR 195

Query: 243 LVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDM 280
           LV D++  R VVSW S++ GY QN    DAL V ++M
Sbjct: 196 LVLDEMSRRDVVSWNSLVVGYAQNQRFDDALEVCREM 232

BLAST of Cp4.1LG01g04330 vs. Swiss-Prot
Match: PP210_ARATH (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 2.4e-26
Identity = 69/189 (36.51%), Positives = 108/189 (57.14%), Query Frame = 1

Query: 109 IEELTVS-GLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEP-DILLWNGIIKGYTQNN 168
           I  L +S GL    F   K I+   H R+   +  VFR V    ++ LWN II+ +++N 
Sbjct: 26  IHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVSPAKNVYLWNSIIRAFSKNG 85

Query: 169 IFAGAIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQN 228
           +F  A+  Y  ++ S V+PD +TF  V+KAC+G+    +G  ++ Q    G  S++FV N
Sbjct: 86  LFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQILDMGFESDLFVGN 145

Query: 229 SLVSMYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRRRAATQ 288
           +LV MY+R G  + AR VFD++  R +VSW S+ISGY  +G   +AL ++ +++      
Sbjct: 146 ALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEALEIYHELKNSWIVP 205

Query: 289 TLFTEISSV 296
             FT +SSV
Sbjct: 206 DSFT-VSSV 213

BLAST of Cp4.1LG01g04330 vs. TrEMBL
Match: A0A0A0KLB9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G175830 PE=4 SV=1)

HSP 1 Score: 286.6 bits (732), Expect = 3.6e-74
Identity = 144/207 (69.57%), Positives = 166/207 (80.19%), Query Frame = 1

Query: 85  FALSLLVVAIYSSLKGG------GNLTGGFIEE----LTVSGLYKCGFLVIKFINACLHL 144
           F+LSLL+ ++ S+L          +L    +++    L VSGL+KC FL+IKFINACLH 
Sbjct: 6   FSLSLLLSSLSSALSKSTITLHEASLRRKHLDQVYVQLIVSGLHKCRFLMIKFINACLHF 65

Query: 145 RDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVL 204
            DVNYAHK FREV EPDILLWN IIKGYTQ NI    IRMY DMQ+S V+P+CFTFLYVL
Sbjct: 66  GDVNYAHKAFREVSEPDILLWNAIIKGYTQKNIVDAPIRMYMDMQISQVHPNCFTFLYVL 125

Query: 205 KACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSMYARFGQTSSARLVFDKLHNRTVV 264
           KAC G SVEGIGKQ+H QTFKYG GSNVFVQNSLVSMYA+FGQ S AR+VFDKLH+RTVV
Sbjct: 126 KACGGTSVEGIGKQIHGQTFKYGFGSNVFVQNSLVSMYAKFGQISYARIVFDKLHDRTVV 185

Query: 265 SWTSIISGYVQNGDPVDALRVFKDMRR 282
           SWTSIISGYVQNGDP++AL VFK+MR+
Sbjct: 186 SWTSIISGYVQNGDPMEALNVFKEMRQ 212

BLAST of Cp4.1LG01g04330 vs. TrEMBL
Match: D7SQP8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0134g00210 PE=4 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 1.2e-48
Identity = 95/172 (55.23%), Positives = 125/172 (72.67%), Query Frame = 1

Query: 111 ELTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAG 170
           +L VSGL + GFLV KF+NA  ++ ++ YA KVF E  EP + LWN II+GY+ +N F  
Sbjct: 93  QLVVSGLVESGFLVTKFVNASWNIGEIGYARKVFDEFPEPSVFLWNAIIRGYSSHNFFGD 152

Query: 171 AIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 230
           AI MY  MQ SGVNPD FT   VLKACSG+ V  +GK++H Q F+ G  S+VFVQN LV+
Sbjct: 153 AIEMYSRMQASGVNPDGFTLPCVLKACSGVPVLEVGKRVHGQIFRLGFESDVFVQNGLVA 212

Query: 231 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRRR 283
           +YA+ G+   AR+VF+ L +R +VSWTS+ISGY QNG P++ALR+F  MR+R
Sbjct: 213 LYAKCGRVEQARIVFEGLDDRNIVSWTSMISGYGQNGLPMEALRIFGQMRQR 264

BLAST of Cp4.1LG01g04330 vs. TrEMBL
Match: A0A059BTM6_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02828 PE=4 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 6.4e-47
Identity = 84/171 (49.12%), Positives = 129/171 (75.44%), Query Frame = 1

Query: 111 ELTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAG 170
           +L V GL+K  FL+ K +N   +L ++ Y+ KVF E  +PD+ LWN II+GY+++N+F+ 
Sbjct: 95  KLLVLGLHKDSFLITKLVNWSSNLGEIRYSRKVFDEFSDPDVFLWNAIIRGYSRHNMFSD 154

Query: 171 AIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 230
           A+ +Y +M  +GV+PD FTF Y+L+AC+G+   GIG+ +H Q +++G  S+VFVQN +V+
Sbjct: 155 AVELYSEMLGTGVSPDGFTFPYILRACTGLPALGIGRCVHGQVYRHGFESDVFVQNGVVT 214

Query: 231 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRR 282
           +YA+ G+   AR+VFD+L +RTVVSWTS+ISGY QNG P+++LR+F  MR+
Sbjct: 215 LYAKCGKVKHARIVFDQLRDRTVVSWTSMISGYAQNGQPMESLRIFSQMRK 265

BLAST of Cp4.1LG01g04330 vs. TrEMBL
Match: W9S3H1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007616 PE=4 SV=1)

HSP 1 Score: 191.4 bits (485), Expect = 1.6e-45
Identity = 86/171 (50.29%), Positives = 122/171 (71.35%), Query Frame = 1

Query: 111 ELTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAG 170
           +L +SGL + GFL+ K +N    +    YA K+F E  +PD+ LWN I++GY+++N+F  
Sbjct: 85  QLLISGLQQNGFLITKLVNVSSDIGCNFYARKLFDEFTDPDVFLWNAIVRGYSKHNMFGD 144

Query: 171 AIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 230
           A+ MY  MQ  GV+PD FTF +VLKACSG+     G+++H QTF+Y    + FVQNSLV+
Sbjct: 145 ALEMYSRMQAMGVSPDAFTFPHVLKACSGLQALEFGRRVHGQTFRYRSACDAFVQNSLVA 204

Query: 231 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRR 282
            YA+  Q   AR+VF++L +R++VSWTSIISGY QNG+P++ALR+F  MRR
Sbjct: 205 FYAKCCQIGRARMVFERLCDRSIVSWTSIISGYAQNGEPMEALRIFSQMRR 255

BLAST of Cp4.1LG01g04330 vs. TrEMBL
Match: A0A0D2TXW8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G055100 PE=4 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 1.0e-44
Identity = 87/188 (46.28%), Positives = 129/188 (68.62%), Query Frame = 1

Query: 111 ELTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAG 170
           +L + G+ + GFLV K +NA ++L +++YA KVF +  +PD+ LWN II+GY++ N+FA 
Sbjct: 83  KLLLLGIQQNGFLVSKLVNAAVNLGEISYARKVFDKFPDPDVFLWNAIIRGYSKYNLFAS 142

Query: 171 AIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 230
           A+ MY  MQV  V+PD +T  +VLKAC G+    +G+Q+H Q F+ G   +VFVQN +V+
Sbjct: 143 AVEMYSRMQVLWVSPDGYTLPHVLKACGGIPSFRMGQQVHGQIFRLGFEKDVFVQNGVVA 202

Query: 231 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRRRAATQTLFT 290
            YA+ G+ +SA++VFD+L  R VVSWTS+ISGY QNG P++ALR F +MR          
Sbjct: 203 FYAKCGKIASAKVVFDRLEIRNVVSWTSMISGYAQNGQPIEALRFFDEMRSTGVMPDWIA 262

Query: 291 EISSVQNH 299
            +S ++ H
Sbjct: 263 LVSVIRAH 270

BLAST of Cp4.1LG01g04330 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 169.9 bits (429), Expect = 2.5e-42
Identity = 80/172 (46.51%), Positives = 119/172 (69.19%), Query Frame = 1

Query: 112 LTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGA 171
           L V GL   GFL+ K I+A     D+ +A +VF ++  P I  WN II+GY++NN F  A
Sbjct: 44  LLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDA 103

Query: 172 IRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSM 231
           + MY +MQ++ V+PD FTF ++LKACSG+S   +G+ +H+Q F+ G  ++VFVQN L+++
Sbjct: 104 LLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIAL 163

Query: 232 YARFGQTSSARLVFD--KLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRR 282
           YA+  +  SAR VF+   L  RT+VSWT+I+S Y QNG+P++AL +F  MR+
Sbjct: 164 YAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRK 215

BLAST of Cp4.1LG01g04330 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 128.3 bits (321), Expect = 8.3e-30
Identity = 59/159 (37.11%), Positives = 103/159 (64.78%), Query Frame = 1

Query: 137 VNYAHKVFREVLEP-DILLWNGIIKGYTQNNIFAGAIRMYKDMQVSG-VNPDCFTFLYVL 196
           ++YAHKVF ++ +P ++ +WN +I+GY +      A  +Y++M+VSG V PD  T+ +++
Sbjct: 69  MSYAHKVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLI 128

Query: 197 KACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSMYARFGQTSSARLVFDKLHNRTVV 256
           KA + M+   +G+ +HS   + G GS ++VQNSL+ +YA  G  +SA  VFDK+  + +V
Sbjct: 129 KAVTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLV 188

Query: 257 SWTSIISGYVQNGDPVDALRVFKDMRRRAATQTLFTEIS 294
           +W S+I+G+ +NG P +AL ++ +M  +      FT +S
Sbjct: 189 AWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVS 227

BLAST of Cp4.1LG01g04330 vs. TAIR10
Match: AT2G20540.1 (AT2G20540.1 mitochondrial editing factor 21)

HSP 1 Score: 127.9 bits (320), Expect = 1.1e-29
Identity = 58/166 (34.94%), Positives = 101/166 (60.84%), Query Frame = 1

Query: 112 LTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGA 171
           + + GL +  F+V K ++ C  + D++YA ++F +V  P++ L+N II+ YT N+++   
Sbjct: 33  IIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQVSNPNVFLYNSIIRAYTHNSLYCDV 92

Query: 172 IRMYKDM-QVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 231
           IR+YK + + S   PD FTF ++ K+C+ +    +GKQ+H    K+G   +V  +N+L+ 
Sbjct: 93  IRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYLGKQVHGHLCKFGPRFHVVTENALID 152

Query: 232 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVF 277
           MY +F     A  VFD+++ R V+SW S++SGY + G    A  +F
Sbjct: 153 MYMKFDDLVDAHKVFDEMYERDVISWNSLLSGYARLGQMKKAKGLF 198

BLAST of Cp4.1LG01g04330 vs. TAIR10
Match: AT3G49142.1 (AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 124.8 bits (312), Expect = 9.2e-29
Identity = 65/157 (41.40%), Positives = 92/157 (58.60%), Query Frame = 1

Query: 123 LVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSG 182
           L +K + A   L+DV  A KVF E+ E ++++ N +I+ Y  N  +   ++++  M    
Sbjct: 76  LGVKLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGFYGEGVKVFGTMCGCN 135

Query: 183 VNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSMYARFGQTSSAR 242
           V PD +TF  VLKACS      IG+++H    K GL S +FV N LVSMY + G  S AR
Sbjct: 136 VRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNGLVSMYGKCGFLSEAR 195

Query: 243 LVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDM 280
           LV D++  R VVSW S++ GY QN    DAL V ++M
Sbjct: 196 LVLDEMSRRDVVSWNSLVVGYAQNQRFDDALEVCREM 232

BLAST of Cp4.1LG01g04330 vs. TAIR10
Match: AT3G03580.1 (AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 120.9 bits (302), Expect = 1.3e-27
Identity = 69/189 (36.51%), Positives = 108/189 (57.14%), Query Frame = 1

Query: 109 IEELTVS-GLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEP-DILLWNGIIKGYTQNN 168
           I  L +S GL    F   K I+   H R+   +  VFR V    ++ LWN II+ +++N 
Sbjct: 26  IHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVSPAKNVYLWNSIIRAFSKNG 85

Query: 169 IFAGAIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQN 228
           +F  A+  Y  ++ S V+PD +TF  V+KAC+G+    +G  ++ Q    G  S++FV N
Sbjct: 86  LFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQILDMGFESDLFVGN 145

Query: 229 SLVSMYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRRRAATQ 288
           +LV MY+R G  + AR VFD++  R +VSW S+ISGY  +G   +AL ++ +++      
Sbjct: 146 ALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEALEIYHELKNSWIVP 205

Query: 289 TLFTEISSV 296
             FT +SSV
Sbjct: 206 DSFT-VSSV 213

BLAST of Cp4.1LG01g04330 vs. NCBI nr
Match: gi|778700750|ref|XP_011654911.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativus])

HSP 1 Score: 286.6 bits (732), Expect = 5.2e-74
Identity = 144/207 (69.57%), Positives = 166/207 (80.19%), Query Frame = 1

Query: 85  FALSLLVVAIYSSLKGG------GNLTGGFIEE----LTVSGLYKCGFLVIKFINACLHL 144
           F+LSLL+ ++ S+L          +L    +++    L VSGL+KC FL+IKFINACLH 
Sbjct: 6   FSLSLLLSSLSSALSKSTITLHEASLRRKHLDQVYVQLIVSGLHKCRFLMIKFINACLHF 65

Query: 145 RDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVL 204
            DVNYAHK FREV EPDILLWN IIKGYTQ NI    IRMY DMQ+S V+P+CFTFLYVL
Sbjct: 66  GDVNYAHKAFREVSEPDILLWNAIIKGYTQKNIVDAPIRMYMDMQISQVHPNCFTFLYVL 125

Query: 205 KACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSMYARFGQTSSARLVFDKLHNRTVV 264
           KAC G SVEGIGKQ+H QTFKYG GSNVFVQNSLVSMYA+FGQ S AR+VFDKLH+RTVV
Sbjct: 126 KACGGTSVEGIGKQIHGQTFKYGFGSNVFVQNSLVSMYAKFGQISYARIVFDKLHDRTVV 185

Query: 265 SWTSIISGYVQNGDPVDALRVFKDMRR 282
           SWTSIISGYVQNGDP++AL VFK+MR+
Sbjct: 186 SWTSIISGYVQNGDPMEALNVFKEMRQ 212

BLAST of Cp4.1LG01g04330 vs. NCBI nr
Match: gi|659090152|ref|XP_008445864.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis melo])

HSP 1 Score: 277.3 bits (708), Expect = 3.1e-71
Identity = 140/207 (67.63%), Positives = 165/207 (79.71%), Query Frame = 1

Query: 85  FALSLLVVAIYSSLKGG------GNLTGGFIEE----LTVSGLYKCGFLVIKFINACLHL 144
           F+LSLL+ ++ S+L          +L    +++    L VSGL+KC +LVIKF+NACLH 
Sbjct: 6   FSLSLLLSSLSSALSKSTITSHEASLRRKHLDQVYVQLIVSGLHKCRYLVIKFVNACLHF 65

Query: 145 RDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVL 204
            DVNYAHK F EV EPDI LWN IIKGY Q NI  G IRMY DMQ+S V+P+CFTFLYVL
Sbjct: 66  GDVNYAHKAFCEVSEPDIPLWNAIIKGYAQKNIVGGPIRMYMDMQISQVHPNCFTFLYVL 125

Query: 205 KACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVSMYARFGQTSSARLVFDKLHNRTVV 264
           KAC G SVE +GKQ+H  TFKYG GSNVFVQNSLVSMYA+FGQTSSAR+VFDKLH+RTVV
Sbjct: 126 KACGGTSVE-LGKQIHGHTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVV 185

Query: 265 SWTSIISGYVQNGDPVDALRVFKDMRR 282
           SWTSIISGYVQNGDP++AL+VFK+MR+
Sbjct: 186 SWTSIISGYVQNGDPMEALKVFKEMRQ 211

BLAST of Cp4.1LG01g04330 vs. NCBI nr
Match: gi|225447423|ref|XP_002276196.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Vitis vinifera])

HSP 1 Score: 201.8 bits (512), Expect = 1.7e-48
Identity = 95/172 (55.23%), Positives = 125/172 (72.67%), Query Frame = 1

Query: 111 ELTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAG 170
           +L VSGL + GFLV KF+NA  ++ ++ YA KVF E  EP + LWN II+GY+ +N F  
Sbjct: 93  QLVVSGLVESGFLVTKFVNASWNIGEIGYARKVFDEFPEPSVFLWNAIIRGYSSHNFFGD 152

Query: 171 AIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 230
           AI MY  MQ SGVNPD FT   VLKACSG+ V  +GK++H Q F+ G  S+VFVQN LV+
Sbjct: 153 AIEMYSRMQASGVNPDGFTLPCVLKACSGVPVLEVGKRVHGQIFRLGFESDVFVQNGLVA 212

Query: 231 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRRR 283
           +YA+ G+   AR+VF+ L +R +VSWTS+ISGY QNG P++ALR+F  MR+R
Sbjct: 213 LYAKCGRVEQARIVFEGLDDRNIVSWTSMISGYGQNGLPMEALRIFGQMRQR 264

BLAST of Cp4.1LG01g04330 vs. NCBI nr
Match: gi|720093617|ref|XP_010246108.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Nelumbo nucifera])

HSP 1 Score: 197.6 bits (501), Expect = 3.2e-47
Identity = 87/171 (50.88%), Positives = 128/171 (74.85%), Query Frame = 1

Query: 111 ELTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAG 170
           +L V+G     +L  KF++A  +  +++YA  +F E+ EP++ LWN I++GY+QNN+F+ 
Sbjct: 88  QLIVAGFQNSNYLATKFVHASSNAGEIHYARSLFEEIPEPNVFLWNAIVRGYSQNNLFSD 147

Query: 171 AIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 230
           A+ MY  MQV  +NPD FTF YVLKACS +S   +G ++H+Q F++G  S+VFVQN LV+
Sbjct: 148 ALEMYSRMQVERMNPDRFTFPYVLKACSSLSDLRMGFRIHAQIFRHGFESDVFVQNGLVA 207

Query: 231 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRR 282
           +YA+ G+ S AR VFD+L +RT+VSWTSIISGY QN  P++ALR+F++MR+
Sbjct: 208 LYAKCGEISRARAVFDRLSDRTIVSWTSIISGYAQNSQPLEALRIFREMRQ 258

BLAST of Cp4.1LG01g04330 vs. NCBI nr
Match: gi|629103875|gb|KCW69344.1| (hypothetical protein EUGRSUZ_F02828 [Eucalyptus grandis])

HSP 1 Score: 196.1 bits (497), Expect = 9.2e-47
Identity = 84/171 (49.12%), Positives = 129/171 (75.44%), Query Frame = 1

Query: 111 ELTVSGLYKCGFLVIKFINACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAG 170
           +L V GL+K  FL+ K +N   +L ++ Y+ KVF E  +PD+ LWN II+GY+++N+F+ 
Sbjct: 95  KLLVLGLHKDSFLITKLVNWSSNLGEIRYSRKVFDEFSDPDVFLWNAIIRGYSRHNMFSD 154

Query: 171 AIRMYKDMQVSGVNPDCFTFLYVLKACSGMSVEGIGKQMHSQTFKYGLGSNVFVQNSLVS 230
           A+ +Y +M  +GV+PD FTF Y+L+AC+G+   GIG+ +H Q +++G  S+VFVQN +V+
Sbjct: 155 AVELYSEMLGTGVSPDGFTFPYILRACTGLPALGIGRCVHGQVYRHGFESDVFVQNGVVT 214

Query: 231 MYARFGQTSSARLVFDKLHNRTVVSWTSIISGYVQNGDPVDALRVFKDMRR 282
           +YA+ G+   AR+VFD+L +RTVVSWTS+ISGY QNG P+++LR+F  MR+
Sbjct: 215 LYAKCGKVKHARIVFDQLRDRTVVSWTSMISGYAQNGQPMESLRIFSQMRK 265

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name   E-value  Identity (%)  Description
PP224_ARATH  4.4e-41  46.51         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP330_ARATH  1.5e-28  37.11         Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PP165_ARATH  1.9e-28  34.94         Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana GN...
PP271_ARATH  1.6e-27  41.40         Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis th...
PP210_ARATH  2.4e-26  36.51         Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana GN...

vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KLB9_CUCSA  3.6e-74  69.57         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G175830 PE=4 SV=1
D7SQP8_VITVI      1.2e-48  55.23         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0134g00210 PE=4 SV=...
A0A059BTM6_EUCGR  6.4e-47  49.12         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F02828 PE=4 SV=1
W9S3H1_9ROSA      1.6e-45  50.29         Uncharacterized protein OS=Morus notabilis GN=L484_007616 PE=4 SV=1
A0A0D2TXW8_GOSRA  1.0e-44  46.28         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G055100 PE=4 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT3G12770.1  2.5e-42  46.51         mitochondrial editing factor 22
AT4G21065.1  8.3e-30  37.11         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G20540.1  1.1e-29  34.94         mitochondrial editing factor 21
AT3G49142.1  9.2e-29  41.40         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G03580.1  1.3e-27  36.51         Tetratricopeptide repeat (TPR)-like superfamily protein

vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|778700750|ref|XP_011654911.1|  5.2e-74  69.57         PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativu...
gi|659090152|ref|XP_008445864.1|  3.1e-71  67.63         PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis melo]
gi|225447423|ref|XP_002276196.1|  1.7e-48  55.23         PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Vitis vinifera...
gi|720093617|ref|XP_010246108.1|  3.2e-47  50.88         PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Nelumbo nucife...
gi|629103875|gb|KCW69344.1|       9.2e-47  49.12         hypothetical protein EUGRSUZ_F02828 [Eucalyptus grandis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g04330.1  Cp4.1LG01g04330.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                  coord: 226..250, score: 0.22; coord: 254..282, score: 1.
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 150..197, score: 1.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 155..187, score: 4.5E-6; coord: 254..282, score: 6.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 151..185, score: 10.282; coord: 221..251, score: 7.454; coord: 252..286, score: 10
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 121..282, score: 2.0
None       No IPR available          PANTHER   PTHR24015:SF477  SUBFAMILY NOT NAMED  coord: 121..282, score: 2.0
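The `coord: start..end` entries in the alignment column give the residue range of each domain hit on the protein. A minimal sketch for extracting those ranges and their lengths, assuming the ranges are 1-based and inclusive (an assumption; the page does not say). `parse_coords` and `span_lengths` are hypothetical helpers.

```python
import re

# Matches entries such as "coord: 151..185".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text: str) -> list:
    """Return every (start, end) residue range found in an alignment cell."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_lengths(spans: list) -> list:
    """Length of each range, assuming inclusive coordinates."""
    return [end - start + 1 for start, end in spans]


# The three PROFILE (PS51375) hits listed for this protein.
profile_hits = parse_coords("coord: 151..185 coord: 221..251 coord: 252..286")
profile_lengths = span_lengths(profile_hits)
print(profile_hits, profile_lengths)
```

Under that assumption the three PROFILE hits cover 35, 31 and 35 residues, which is consistent with the roughly 35-residue length of a pentatricopeptide repeat.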

The following gene(s) are paralogous to this gene:

None