Cla97C01G011970 (gene) Watermelon (97103) v2

Name: Cla97C01G011970
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr01 : 24772485 .. 24773972 (+)
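
As a quick consistency check on the Location field, the span length can be computed from the coordinates. This is a minimal sketch: `parse_location` and `feature_length` are hypothetical helper names, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Pattern for a location string such as "Cla97Chr01 : 24772485 .. 24773972 (+)".
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cla97Chr01 : 24772485 .. 24773972 (+)")
print(seqid, strand, feature_length(start, end))  # Cla97Chr01 + 1488
```

Under that assumption the span is 1488 bp, a multiple of three, as would be expected if the whole span is coding.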
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTCCGTTTTGTTAGCAACTTCAGTCTCTTCCATTCTGGTGAAAGGAAATGGAGAACTTGGTTGCCAGACTACGATGGCTCATTTCAATACCAACTCCAGAAGACACCCACCCAAAAACCTCCTCTATCCACGACGGGCCAAGCTTCCTCCTGATCCTATCGTCGTCAACCAATTCTTGAACAACAAAATCTCTGCCCCTTCCCCATCCTTCACTGATTTAACTTCCTCTGAGATTTCCCAACTCCCCGAAGGTGAAGAGGATGAGCATGAAGAAATCTATGCTTATGACTATAAAGATACTGATGTTGTTTGGGATTCGGATGAAATTGAAGCTATTTCATCACTCTTTCAAGGGAGAATTCCTCAGAAACCTGGTACATTGAACAGGGACAGACCTCTTCCTCTTCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCTAAAAATCCGCCCAAGAACAGTGGTCTCTTCGCGTGCTTTGATGTCTAAGCAAGTCTACAAGTGTCCTGATTTTCTTATTGGCCTTGCCAGGGCGATTAGAGATTTGTCCCCGGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCTTTTTTGCAAAAGGGCTCTCTATCATTGACGATCAAGGAACTAGGTCATATGGGTCTTCCTGATAGAGCTTTAAAGACGTTCTGTTGGGCACAAGAACAACCTCGCCTCTTCCCGGATGATCGTGTTTTGGCCACAACGGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTGAACTTGCTAGTCGTGGTGTGCTTGAGGCAATGGTGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTGCGAAGAAGCGTAAGAGAATGTTGGATCCCAGCGTTTATGTGAAGTTAATACTGGAGCTTGGGAAGAACCCTGATAAAAACATGTTGGTTCTTACCTTAATGGATGAGCTAGGACAAAGGGAAGCCTTGAAGTTAAACCAGCAAGATACTACAGCTATAATTAAGGTATGCACAAGGCTTGGTAAATCTGAAATTGCTGAGGAACTTCATAGCTGGTATGTTGAATCTGGACATGAACCTAGTGTGGTTATGTATACTGCCTTAGTTCATAATCGCTACTCAGACAAGAAATACAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCATATAGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCTACATATGATGTCTATAGGAATCTGATCACCATTTATTTAGTTTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTATAAGGAAGCAGAGAATGCTGGATTTATCATGGATAAACAAATTACTTCAATGCTGTTGCAAGCAAAAAGATGA

mRNA sequence

ATGGATTCCGTTTTGTTAGCAACTTCAGTCTCTTCCATTCTGGTGAAAGGAAATGGAGAACTTGGTTGCCAGACTACGATGGCTCATTTCAATACCAACTCCAGAAGACACCCACCCAAAAACCTCCTCTATCCACGACGGGCCAAGCTTCCTCCTGATCCTATCGTCGTCAACCAATTCTTGAACAACAAAATCTCTGCCCCTTCCCCATCCTTCACTGATTTAACTTCCTCTGAGATTTCCCAACTCCCCGAAGGTGAAGAGGATGAGCATGAAGAAATCTATGCTTATGACTATAAAGATACTGATGTTGTTTGGGATTCGGATGAAATTGAAGCTATTTCATCACTCTTTCAAGGGAGAATTCCTCAGAAACCTGGTACATTGAACAGGGACAGACCTCTTCCTCTTCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCTAAAAATCCGCCCAAGAACAGTGGTCTCTTCGCGTGCTTTGATGTCTAAGCAAGTCTACAAGTGTCCTGATTTTCTTATTGGCCTTGCCAGGGCGATTAGAGATTTGTCCCCGGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCTTTTTTGCAAAAGGGCTCTCTATCATTGACGATCAAGGAACTAGGTCATATGGGTCTTCCTGATAGAGCTTTAAAGACGTTCTGTTGGGCACAAGAACAACCTCGCCTCTTCCCGGATGATCGTGTTTTGGCCACAACGGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTGAACTTGCTAGTCGTGGTGTGCTTGAGGCAATGGTGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTGCGAAGAAGCGTAAGAGAATGTTGGATCCCAGCGTTTATGTGAAGTTAATACTGGAGCTTGGGAAGAACCCTGATAAAAACATGTTGGTTCTTACCTTAATGGATGAGCTAGGACAAAGGGAAGCCTTGAAGTTAAACCAGCAAGATACTACAGCTATAATTAAGGTATGCACAAGGCTTGGTAAATCTGAAATTGCTGAGGAACTTCATAGCTGGTATGTTGAATCTGGACATGAACCTAGTGTGGTTATGTATACTGCCTTAGTTCATAATCGCTACTCAGACAAGAAATACAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCATATAGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCTACATATGATGTCTATAGGAATCTGATCACCATTTATTTAGTTTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTATAAGGAAGCAGAGAATGCTGGATTTATCATGGATAAACAAATTACTTCAATGCTGTTGCAAGCAAAAAGATGA

Coding sequence (CDS)

ATGGATTCCGTTTTGTTAGCAACTTCAGTCTCTTCCATTCTGGTGAAAGGAAATGGAGAACTTGGTTGCCAGACTACGATGGCTCATTTCAATACCAACTCCAGAAGACACCCACCCAAAAACCTCCTCTATCCACGACGGGCCAAGCTTCCTCCTGATCCTATCGTCGTCAACCAATTCTTGAACAACAAAATCTCTGCCCCTTCCCCATCCTTCACTGATTTAACTTCCTCTGAGATTTCCCAACTCCCCGAAGGTGAAGAGGATGAGCATGAAGAAATCTATGCTTATGACTATAAAGATACTGATGTTGTTTGGGATTCGGATGAAATTGAAGCTATTTCATCACTCTTTCAAGGGAGAATTCCTCAGAAACCTGGTACATTGAACAGGGACAGACCTCTTCCTCTTCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCTAAAAATCCGCCCAAGAACAGTGGTCTCTTCGCGTGCTTTGATGTCTAAGCAAGTCTACAAGTGTCCTGATTTTCTTATTGGCCTTGCCAGGGCGATTAGAGATTTGTCCCCGGAGGAAAATGTGTCCAAGGTTCTCAATCGGTGGGGTCCTTTTTTGCAAAAGGGCTCTCTATCATTGACGATCAAGGAACTAGGTCATATGGGTCTTCCTGATAGAGCTTTAAAGACGTTCTGTTGGGCACAAGAACAACCTCGCCTCTTCCCGGATGATCGTGTTTTGGCCACAACGGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTGAACTTGCTAGTCGTGGTGTGCTTGAGGCAATGGTGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTGCGAAGAAGCGTAAGAGAATGTTGGATCCCAGCGTTTATGTGAAGTTAATACTGGAGCTTGGGAAGAACCCTGATAAAAACATGTTGGTTCTTACCTTAATGGATGAGCTAGGACAAAGGGAAGCCTTGAAGTTAAACCAGCAAGATACTACAGCTATAATTAAGGTATGCACAAGGCTTGGTAAATCTGAAATTGCTGAGGAACTTCATAGCTGGTATGTTGAATCTGGACATGAACCTAGTGTGGTTATGTATACTGCCTTAGTTCATAATCGCTACTCAGACAAGAAATACAGGGAGGCATTATCTTTAGTGTGGGAAATGGAGGCTGCAAACTGTCCTTTTGATCTTCCTGCATATAGTGTAGTGATAAAGCTTTTTGTTGCTCTTGGTGATCTTTCAAGGGCTGTTAGATACTTTGCAAAGCTTAAGGAAGCTGGTTTTGCCCCTACATATGATGTCTATAGGAATCTGATCACCATTTATTTAGTTTCAGGGAGGTTAGCCAAGTGTAAGGAAATTTATAAGGAAGCAGAGAATGCTGGATTTATCATGGATAAACAAATTACTTCAATGCTGTTGCAAGCAAAAAGATGA

Protein sequence

MDSVLLATSVSSILVKGNGELGCQTTMAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQFLNNKISAPSPSFTDLTSSEISQLPEGEEDEHEEIYAYDYKDTDVVWDSDEIEAISSLFQGRIPQKPGTLNRDRPLPLPLPHKLRPPRLPNLKIRPRTVVSSRALMSKQVYKCPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQEQPRLFPDDRVLATTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWKLLVAAKKRKRMLDPSVYVKLILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFIMDKQITSMLLQAKR
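
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first 30 nucleotides of the CDS, assuming the standard genetic code; `translate` is a hypothetical helper written for this example, not a function of the database.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGATTCCGTTTTGTTAGCAACTTCAGTC"))  # MDSVLLATSV
```

Passing the full CDS string to the same function gives the complete translation for comparison with the protein sequence above.
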
BLAST of Cla97C01G011970 vs. NCBI nr
Match: XP_008462173.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462181.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462189.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902994.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902996.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo])

HSP 1 Score: 852.4 bits (2201), Expect = 7.5e-244
Identity = 436/496 (87.90%), Positives = 462/496 (93.15%), Query Frame = 0

Query: 1   MDSVLLATSVSSILVKGNGELGCQTTMAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQF 60
           M S++ ATSVSSILVKGNG +GCQ TM HF  NSRR PPKNLL PRRAKLPPDP  VNQF
Sbjct: 1   MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDP-AVNQF 60

Query: 61  LNNKISAPSPSFTDLTSSEISQLPEGEEDEHEEIYAYDY-KDTDVVWDSDEIEAISSLFQ 120
           LNNK SAPSPSFTDL SS+I Q      DEHEEI+AYDY KDTDVVWDSDEIEAISSLFQ
Sbjct: 61  LNNKTSAPSPSFTDLISSKIFQ------DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQ 120

Query: 121 GRIPQKPGTLNRDRPLPLPLPHKLRPPRLPNLKIRPRTVVSSRALMSKQVYKCPDFLIGL 180
           GRIPQKPG LNR+RPLPLPLPHKLRPPRLPN KIRP T VSSRAL+SK+VYK PDFLIGL
Sbjct: 121 GRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLIGL 180

Query: 181 ARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQEQPRLFP 240
           ARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCW QEQ RLFP
Sbjct: 181 ARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRLFP 240

Query: 241 DDRVLATTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWKLLVAAK 300
           DDRVLA+TVEVL+RNHELKVP+NLEEFT+LASRGVLEAM+RGFIKGGSLNLAWKLLVAAK
Sbjct: 241 DDRVLASTVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAAK 300

Query: 301 KRKRMLDPSVYVKLILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKVCTRLGK 360
           K KRMLDPSVYVKLILELGKNPDKN+LVLTL++ELGQREALKLNQQD+T IIKVCTRL K
Sbjct: 301 KGKRMLDPSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRLRK 360

Query: 361 SEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFDLPAYSV 420
            EIAE+L+ WYVESGHEPS+VMYTALVH+RYSD+KYREALSLVWEME+ANCPFDLPAY+V
Sbjct: 361 FEIAEKLYCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYNV 420

Query: 421 VIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAG 480
           VIKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRN+ITIYLVSGRLAK KEIYKEAENAG
Sbjct: 421 VIKLFVALGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAENAG 480

Query: 481 FIMDKQITSMLLQAKR 496
           FIMDKQITSMLLQAKR
Sbjct: 481 FIMDKQITSMLLQAKR 489
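
The percentages in each HSP summary line are the match counts divided by the alignment length, which includes gap columns. A minimal sketch of extracting and re-deriving them follows; `parse_hsp_stats` is a hypothetical helper, and the pattern also tolerates the misspelled label "Postives" that some exports of this report contain.

```python
import re

# Matches e.g. "Identity = 436/496 (87.90%), Positives = 462/496 (93.15%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Format count/length as a percentage with two decimals."""
    return f"{100 * count / length:.2f}"


def parse_hsp_stats(line):
    """Return identity and positive counts plus recomputed percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return {
        "identity": (int(ident), int(ident_len), percent(int(ident), int(ident_len))),
        "positives": (int(pos), int(pos_len), percent(int(pos), int(pos_len))),
    }


stats = parse_hsp_stats(
    "Identity = 436/496 (87.90%), Positives = 462/496 (93.15%), Query Frame = 0"
)
print(stats["identity"])   # (436, 496, '87.90')
print(stats["positives"])  # (462, 496, '93.15')
```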

BLAST of Cla97C01G011970 vs. NCBI nr
Match: XP_004139567.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_011654198.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_011654204.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >KGN64877.1 hypothetical protein Csa_1G144300 [Cucumis sativus])

HSP 1 Score: 851.3 bits (2198), Expect = 1.7e-243
Identity = 433/496 (87.30%), Positives = 461/496 (92.94%), Query Frame = 0

Query: 1   MDSVLLATSVSSILVKGNGELGCQTTMAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQF 60
           MDS++ ATSVSSILVKGNG +GCQ TM HF  NSRR PPKNLL PRRAKLPP+P  VNQF
Sbjct: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNP-AVNQF 60

Query: 61  LNNKISAPSPSFTDLTSSEISQLPEGEEDEHEEIYAYDY-KDTDVVWDSDEIEAISSLFQ 120
            NNK SAPSP FTDL SS+I Q      DEHEEI+A+DY KDTDVVWDSDEIEAISSLFQ
Sbjct: 61  FNNKTSAPSPPFTDLISSKIFQ------DEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQ 120

Query: 121 GRIPQKPGTLNRDRPLPLPLPHKLRPPRLPNLKIRPRTVVSSRALMSKQVYKCPDFLIGL 180
           GRIPQKPG LNR+RPLPLPLPHKLRPPRLPN KIRP TVVSSRAL+SKQVYK PDFLIGL
Sbjct: 121 GRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGL 180

Query: 181 ARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQEQPRLFP 240
           AR IRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRAL TFCWAQEQ RLFP
Sbjct: 181 AREIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFP 240

Query: 241 DDRVLATTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWKLLVAAK 300
           DDRVLA+TVEVL+RNHELKV +NLEEFT+LASRGVLEAM+RGFI+GGSLNLAWKLLVAAK
Sbjct: 241 DDRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAK 300

Query: 301 KRKRMLDPSVYVKLILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKVCTRLGK 360
           K KRMLDPSVYVKLILELGKNPDKNMLVLTL++ELGQREALKLNQQD T I+KVCTRLGK
Sbjct: 301 KGKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGK 360

Query: 361 SEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFDLPAYSV 420
            EIAE+L+SWYVESGHEPS+VMYTALVH+RYSD+KYREALSLVWEME+ NCPFDLPAYSV
Sbjct: 361 FEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSV 420

Query: 421 VIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAG 480
           VIKLFVALGDLSRAVRYFAKLKEAGF+PTY+VYRN+ITIYLVSGRLAKCKEIYKEAENAG
Sbjct: 421 VIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAG 480

Query: 481 FIMDKQITSMLLQAKR 496
           F+MDKQITSMLLQAKR
Sbjct: 481 FMMDKQITSMLLQAKR 489

BLAST of Cla97C01G011970 vs. NCBI nr
Match: XP_022951807.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita moschata])

HSP 1 Score: 801.2 bits (2068), Expect = 2.0e-228
Identity = 412/502 (82.07%), Positives = 443/502 (88.25%), Query Frame = 0

Query: 1   MDSVLLATSVSSILVKGNGELGCQTTMAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQF 60
           MDS+   T++SSILVK NG + CQ  +AHF TNSRR PPKNLLYPRR KLPPDP  VNQF
Sbjct: 1   MDSLFSTTTISSILVKRNGGISCQIPVAHFQTNSRRRPPKNLLYPRRTKLPPDP-GVNQF 60

Query: 61  LNNKISAPSP--SFTDLTSSEISQLPEGEEDEHEE-----IYAYDYKDTDVVWDSDEIEA 120
           L  + S P P  SF DL SSE   LPE E DE EE      +A D  D+DVVWDS+EIEA
Sbjct: 61  LKKRTSGPQPDTSFPDLISSEKIGLPEEELDEIEETAADNYFANDDNDSDVVWDSEEIEA 120

Query: 121 ISSLFQGRIPQKPGTLNRDRPLPLPLPHKLRPPRLPNLKIRPRTVVSSRALMSKQVYKCP 180
           I+SLF+GRIPQKPG LNR+RPLPLPLPHKLRPP LPN KIRPRT VSSRALMSKQVYK P
Sbjct: 121 ITSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRP 180

Query: 181 DFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQE 240
           DFLIGLARAIRDL PEENVSKVLNRW PFLQKGSLSLTIKELGHMGL DRALKTFCW QE
Sbjct: 181 DFLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQE 240

Query: 241 QPRLFPDDRVLATTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWK 300
           QPRL+PDDRVLA+TVEVLARNHELK+P NL+EFT+LASRGVLEAM+RGFIKGG L+LAWK
Sbjct: 241 QPRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWK 300

Query: 301 LLVAAKKRKRMLDPSVYVKLILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKV 360
           LLVAAK  KRMLDPSVYVKLILE+GKNPDKNMLVL L+DELGQREAL LNQQDT+AIIKV
Sbjct: 301 LLVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKV 360

Query: 361 CTRLGKSEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFD 420
            TRLGK EIAE L+SWYVESGHEPSVVMYTALVHNRYS++KYREALS+VWEMEAAN PFD
Sbjct: 361 STRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANSPFD 420

Query: 421 LPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYK 480
           LPAYSVV+KLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYK
Sbjct: 421 LPAYSVVMKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYK 480

Query: 481 EAENAGFIMDKQITSMLLQAKR 496
           EAENAG++MDKQITSMLLQAKR
Sbjct: 481 EAENAGYVMDKQITSMLLQAKR 501

BLAST of Cla97C01G011970 vs. NCBI nr
Match: XP_023537574.1 (pentatricopeptide repeat-containing protein At2g01860 [Cucurbita pepo subsp. pepo] >XP_023537576.1 pentatricopeptide repeat-containing protein At2g01860 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 797.3 bits (2058), Expect = 2.9e-227
Identity = 410/502 (81.67%), Positives = 440/502 (87.65%), Query Frame = 0

Query: 1   MDSVLLATSVSSILVKGNGELGCQTTMAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQF 60
           MDS+   T++SSILVK NG + CQ  MAHF TNSRR PPKNLLYPRR KLPPDP  VNQF
Sbjct: 1   MDSLFSTTTISSILVKRNGGVSCQIPMAHFQTNSRRRPPKNLLYPRRTKLPPDP-GVNQF 60

Query: 61  LNNKISAPSP--SFTDLTSSEISQLPEGEEDEHEE-----IYAYDYKDTDVVWDSDEIEA 120
           L  + S P P  S  DL  SE    PE E DE EE      +A D  D+D+VWDS+EIEA
Sbjct: 61  LKKRTSGPHPDTSLPDLIPSEKIGPPEEELDELEETAADNYFANDDNDSDIVWDSEEIEA 120

Query: 121 ISSLFQGRIPQKPGTLNRDRPLPLPLPHKLRPPRLPNLKIRPRTVVSSRALMSKQVYKCP 180
           I+SLF+GRIPQKPG LNR+RPLPLPLPHKLRPP LPN KIRPRT VSSRALMSKQVYK P
Sbjct: 121 ITSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRP 180

Query: 181 DFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQE 240
           DFLIGLARAIRDL PEENVSKVLNRW PFLQKGSLSLTIKELGHMGL DRALKTFCW QE
Sbjct: 181 DFLIGLARAIRDLQPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQE 240

Query: 241 QPRLFPDDRVLATTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWK 300
           QPRL+PDDRVLA+TVEVLARNHELK+P NL+EFT+LASRGVLEAM+RGFIKGG L+LAWK
Sbjct: 241 QPRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWK 300

Query: 301 LLVAAKKRKRMLDPSVYVKLILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKV 360
           LLVAAK  KRMLDPSVYVKLILE+GKNPDKNMLVL L+DELGQREAL LNQQDT+AIIKV
Sbjct: 301 LLVAAKNGKRMLDPSVYVKLILEIGKNPDKNMLVLALLDELGQREALNLNQQDTSAIIKV 360

Query: 361 CTRLGKSEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFD 420
            TRLGK EIAE L+SWYVESGHEPSVVMYTALVHNRYS++KYREALS+VWEMEAA CPFD
Sbjct: 361 STRLGKFEIAERLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAAKCPFD 420

Query: 421 LPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYK 480
           LPAYSVVIKLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYK
Sbjct: 421 LPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYK 480

Query: 481 EAENAGFIMDKQITSMLLQAKR 496
           EAENAG++MDKQITSMLLQAKR
Sbjct: 481 EAENAGYVMDKQITSMLLQAKR 501

BLAST of Cla97C01G011970 vs. NCBI nr
Match: XP_023001961.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita maxima])

HSP 1 Score: 795.8 bits (2054), Expect = 8.3e-227
Identity = 409/502 (81.47%), Positives = 443/502 (88.25%), Query Frame = 0

Query: 1   MDSVLLATSVSSILVKGNGELGCQTTMAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQF 60
           MDS+   T+VSSILVK NG + CQ  MAHF TNS+R PPKNLLYPRR KLPPDP  VNQF
Sbjct: 1   MDSLFSTTAVSSILVKRNGGISCQIPMAHFLTNSKRRPPKNLLYPRRTKLPPDP-GVNQF 60

Query: 61  LNNKISAPSP--SFTDLTSSEISQLPEGEEDEHEE-----IYAYDYKDTDVVWDSDEIEA 120
           L  + S P P  S+ DL  SE   LPE E DE EE      +A D  D+D+VWD +EIEA
Sbjct: 61  LKKRTSDPHPDTSYPDLIPSEKIGLPEEELDELEETAADNYFANDDNDSDIVWDPEEIEA 120

Query: 121 ISSLFQGRIPQKPGTLNRDRPLPLPLPHKLRPPRLPNLKIRPRTVVSSRALMSKQVYKCP 180
           I+SLF+GRIPQKPG LNR+RPLPLPLPHKLRPP LPN KIRPRT VSSRALMSKQVYK P
Sbjct: 121 ITSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNPKIRPRTAVSSRALMSKQVYKRP 180

Query: 181 DFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQE 240
           DFLIGLARAIRDL PEENVSKVLNRW PFLQKGSLSLTIKELGHMGL DRALKTFCW QE
Sbjct: 181 DFLIGLARAIRDLQPEENVSKVLNRWAPFLQKGSLSLTIKELGHMGLADRALKTFCWVQE 240

Query: 241 QPRLFPDDRVLATTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWK 300
           QPRL+PDDRVLA+TVEVLARNHELK+P NL+EFT+LASRGVLEAM+RGFIKGG L+LAWK
Sbjct: 241 QPRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLASRGVLEAMMRGFIKGGRLSLAWK 300

Query: 301 LLVAAKKRKRMLDPSVYVKLILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKV 360
           LLVAAK  KRMLDPSV+VKLILE+GKNPDKNMLVL L+DELGQREAL L+QQDT+AIIKV
Sbjct: 301 LLVAAKNGKRMLDPSVHVKLILEIGKNPDKNMLVLALLDELGQREALNLSQQDTSAIIKV 360

Query: 361 CTRLGKSEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFD 420
            TRLGK EIAE+L+SWYVESGHEPSVVMYTALVHNRYS++KYREALS+VWEMEAANCPFD
Sbjct: 361 STRLGKFEIAEKLYSWYVESGHEPSVVMYTALVHNRYSERKYREALSVVWEMEAANCPFD 420

Query: 421 LPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYK 480
           LPAYSVVIKLFVALGDLSRAVRYFAKLKEAGF PTY +YRNLITIYL +GRLAKCKEIYK
Sbjct: 421 LPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFTPTYCIYRNLITIYLAAGRLAKCKEIYK 480

Query: 481 EAENAGFIMDKQITSMLLQAKR 496
           EAENAG++MDKQITSMLLQAKR
Sbjct: 481 EAENAGYVMDKQITSMLLQAKR 501

BLAST of Cla97C01G011970 vs. TrEMBL
Match: tr|A0A1S3CGD0|A0A1S3CGD0_CUCME (pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN=LOC103500594 PE=4 SV=1)

HSP 1 Score: 852.4 bits (2201), Expect = 4.9e-244
Identity = 436/496 (87.90%), Positives = 462/496 (93.15%), Query Frame = 0

Query: 1   MDSVLLATSVSSILVKGNGELGCQTTMAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQF 60
           M S++ ATSVSSILVKGNG +GCQ TM HF  NSRR PPKNLL PRRAKLPPDP  VNQF
Sbjct: 1   MHSIVSATSVSSILVKGNGGIGCQITMVHFKANSRRRPPKNLLCPRRAKLPPDP-AVNQF 60

Query: 61  LNNKISAPSPSFTDLTSSEISQLPEGEEDEHEEIYAYDY-KDTDVVWDSDEIEAISSLFQ 120
           LNNK SAPSPSFTDL SS+I Q      DEHEEI+AYDY KDTDVVWDSDEIEAISSLFQ
Sbjct: 61  LNNKTSAPSPSFTDLISSKIFQ------DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQ 120

Query: 121 GRIPQKPGTLNRDRPLPLPLPHKLRPPRLPNLKIRPRTVVSSRALMSKQVYKCPDFLIGL 180
           GRIPQKPG LNR+RPLPLPLPHKLRPPRLPN KIRP T VSSRAL+SK+VYK PDFLIGL
Sbjct: 121 GRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTTVSSRALLSKKVYKRPDFLIGL 180

Query: 181 ARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQEQPRLFP 240
           ARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCW QEQ RLFP
Sbjct: 181 ARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWVQEQRRLFP 240

Query: 241 DDRVLATTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWKLLVAAK 300
           DDRVLA+TVEVL+RNHELKVP+NLEEFT+LASRGVLEAM+RGFIKGGSLNLAWKLLVAAK
Sbjct: 241 DDRVLASTVEVLSRNHELKVPVNLEEFTKLASRGVLEAMMRGFIKGGSLNLAWKLLVAAK 300

Query: 301 KRKRMLDPSVYVKLILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKVCTRLGK 360
           K KRMLDPSVYVKLILELGKNPDKN+LVLTL++ELGQREALKLNQQD+T IIKVCTRL K
Sbjct: 301 KGKRMLDPSVYVKLILELGKNPDKNVLVLTLLEELGQREALKLNQQDSTTIIKVCTRLRK 360

Query: 361 SEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFDLPAYSV 420
            EIAE+L+ WYVESGHEPS+VMYTALVH+RYSD+KYREALSLVWEME+ANCPFDLPAY+V
Sbjct: 361 FEIAEKLYCWYVESGHEPSMVMYTALVHSRYSDRKYREALSLVWEMESANCPFDLPAYNV 420

Query: 421 VIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAG 480
           VIKLFVALGDLSRAVRYFAKLKEAGF+PTYDVYRN+ITIYLVSGRLAK KEIYKEAENAG
Sbjct: 421 VIKLFVALGDLSRAVRYFAKLKEAGFSPTYDVYRNMITIYLVSGRLAKSKEIYKEAENAG 480

Query: 481 FIMDKQITSMLLQAKR 496
           FIMDKQITSMLLQAKR
Sbjct: 481 FIMDKQITSMLLQAKR 489

BLAST of Cla97C01G011970 vs. TrEMBL
Match: tr|A0A0A0LVM0|A0A0A0LVM0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1)

HSP 1 Score: 851.3 bits (2198), Expect = 1.1e-243
Identity = 433/496 (87.30%), Positives = 461/496 (92.94%), Query Frame = 0

Query: 1   MDSVLLATSVSSILVKGNGELGCQTTMAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQF 60
           MDS++ ATSVSSILVKGNG +GCQ TM HF  NSRR PPKNLL PRRAKLPP+P  VNQF
Sbjct: 1   MDSIVSATSVSSILVKGNGGIGCQNTMVHFKANSRRRPPKNLLCPRRAKLPPNP-AVNQF 60

Query: 61  LNNKISAPSPSFTDLTSSEISQLPEGEEDEHEEIYAYDY-KDTDVVWDSDEIEAISSLFQ 120
            NNK SAPSP FTDL SS+I Q      DEHEEI+A+DY KDTDVVWDSDEIEAISSLFQ
Sbjct: 61  FNNKTSAPSPPFTDLISSKIFQ------DEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQ 120

Query: 121 GRIPQKPGTLNRDRPLPLPLPHKLRPPRLPNLKIRPRTVVSSRALMSKQVYKCPDFLIGL 180
           GRIPQKPG LNR+RPLPLPLPHKLRPPRLPN KIRP TVVSSRAL+SKQVYK PDFLIGL
Sbjct: 121 GRIPQKPGKLNRERPLPLPLPHKLRPPRLPNPKIRPTTVVSSRALLSKQVYKRPDFLIGL 180

Query: 181 ARAIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQEQPRLFP 240
           AR IRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRAL TFCWAQEQ RLFP
Sbjct: 181 AREIRDLSPEENVSKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALNTFCWAQEQHRLFP 240

Query: 241 DDRVLATTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWKLLVAAK 300
           DDRVLA+TVEVL+RNHELKV +NLEEFT+LASRGVLEAM+RGFI+GGSLNLAWKLLVAAK
Sbjct: 241 DDRVLASTVEVLSRNHELKVAVNLEEFTKLASRGVLEAMMRGFIRGGSLNLAWKLLVAAK 300

Query: 301 KRKRMLDPSVYVKLILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKVCTRLGK 360
           K KRMLDPSVYVKLILELGKNPDKNMLVLTL++ELGQREALKLNQQD T I+KVCTRLGK
Sbjct: 301 KGKRMLDPSVYVKLILELGKNPDKNMLVLTLLEELGQREALKLNQQDCTTIVKVCTRLGK 360

Query: 361 SEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFDLPAYSV 420
            EIAE+L+SWYVESGHEPS+VMYTALVH+RYSD+KYREALSLVWEME+ NCPFDLPAYSV
Sbjct: 361 FEIAEKLYSWYVESGHEPSIVMYTALVHSRYSDRKYREALSLVWEMESGNCPFDLPAYSV 420

Query: 421 VIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAG 480
           VIKLFVALGDLSRAVRYFAKLKEAGF+PTY+VYRN+ITIYLVSGRLAKCKEIYKEAENAG
Sbjct: 421 VIKLFVALGDLSRAVRYFAKLKEAGFSPTYNVYRNMITIYLVSGRLAKCKEIYKEAENAG 480

Query: 481 FIMDKQITSMLLQAKR 496
           F+MDKQITSMLLQAKR
Sbjct: 481 FMMDKQITSMLLQAKR 489

BLAST of Cla97C01G011970 vs. TrEMBL
Match: tr|A0A2N9HFH8|A0A2N9HFH8_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS38191 PE=4 SV=1)

HSP 1 Score: 604.0 bits (1556), Expect = 3.1e-169
Identity = 314/469 (66.95%), Positives = 377/469 (80.38%), Query Frame = 0

Query: 31  NTNSRRHPPKNLLYPRRAKLPPDPIVVNQFLNNKISAPSPSFTDLTSSEISQLPEGEEDE 90
           N++++R  PKNL YPR  KLPPD   VN FL  K +   PS TDL +S +++  EGEED 
Sbjct: 32  NSSTKRRLPKNLRYPRSTKLPPD-FGVNLFLKKKTT--DPSLTDLINSHLAE--EGEEDT 91

Query: 91  HEEIYAYDYKDTDVVWDSDEIEAISSLFQGRIPQKPGTLNRDRPLPLPLPHKLRPPRLPN 150
            EE       DT +VWDSDEIEAISSLF+GRIPQKPG LNR RPL     +KLRP  LP 
Sbjct: 92  QEE-------DTGIVWDSDEIEAISSLFRGRIPQKPGKLNRQRPLXXXXXYKLRPAGLPA 151

Query: 151 LKIRPRTV----VSSRALMSKQVYKCPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG 210
            K   ++V    +SSRA +SKQ+YK P  LIG+AR I+ LS EE+VS +LN+W  FL+KG
Sbjct: 152 PKKHVKSVSPSALSSRASLSKQLYKNPGVLIGIAREIKSLSSEEDVSVILNKWASFLRKG 211

Query: 211 SLSLTIKELGHMGLPDRALKTFCWAQEQPRLFPDDRVLATTVEVLARNHELKVPLNLEEF 270
           SLSLTI+ELGHMGLP+RALKTFCWAQ+QP+LFPDDR+LA+TVEVLARNHELKVP NLE+F
Sbjct: 212 SLSLTIRELGHMGLPERALKTFCWAQKQPQLFPDDRILASTVEVLARNHELKVPFNLEKF 271

Query: 271 TELASRGVLEAMVRGFIKGGSLNLAWKLLVAAKKRKRMLDPSVYVKLILELGKNPDKNML 330
           T LASRGV+EAMVRGFI+GGSL+LA K+L+ AK  KRMLD SVY KLILELGKNPDK +L
Sbjct: 272 TALASRGVIEAMVRGFIRGGSLHLARKVLLIAKHGKRMLDSSVYAKLILELGKNPDKQLL 331

Query: 331 VLTLMDELGQREALKLNQQDTTAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVMYTALV 390
           V+ L+DELG+R+   L+QQD TAI+KVC RL K +I E L +W+ +SGH+PSVVMYT L+
Sbjct: 332 VVALLDELGERDDFNLSQQDCTAIMKVCIRLRKFDIVESLFNWFKQSGHDPSVVMYTTLI 391

Query: 391 HNRYSDKKYREALSLVWEMEAANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFA 450
           H+RYS+KKYREAL++VWEMEA+NC FDLPAY VVI+LFVAL DLSRAVRYF+KLKEAGF 
Sbjct: 392 HSRYSEKKYREALAVVWEMEASNCLFDLPAYRVVIRLFVALSDLSRAVRYFSKLKEAGFC 451

Query: 451 PTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFIMDKQITSMLLQAKR 496
           PTYD+YR+LI IY++SGRLAKCKE+ KEA  AGF +DK+ TS LLQ +R
Sbjct: 452 PTYDLYRDLIKIYMISGRLAKCKEVCKEAGQAGFKLDKETTSWLLQFER 488

BLAST of Cla97C01G011970 vs. TrEMBL
Match: tr|A0A2P6QJA8|A0A2P6QJA8_ROSCH (Putative pentatricopeptide OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr5g0066911 PE=4 SV=1)

HSP 1 Score: 563.9 bits (1452), Expect = 3.5e-157
Identity = 291/469 (62.05%), Positives = 364/469 (77.61%), Query Frame = 0

Query: 27  MAHFNTNSRRHPPKNLLYPRRAKLPPDPIVVNQFLNNKISAPSPSFTDLTSSEISQLPEG 86
           M   ++  RR  PKNL YPR+AKLPPD + VNQFLN             TS++ S   E 
Sbjct: 26  MVRPSSTRRRSLPKNLRYPRQAKLPPD-LGVNQFLNK------------TSNDAS---EP 85

Query: 87  EEDEHEEIYAYDYKDTDVVWDSDEIEAISSLFQGRIPQKPGTLNRDRPLPLPLPHKLRPP 146
           E  E EE Y    +D D+VW+SDEIEAI SLFQGRIPQKPG+LNR RP      +K+RP 
Sbjct: 86  EFPEEEEKYT---EDGDIVWESDEIEAIQSLFQGRIPQKPGSLNRQRPXXXXXXYKVRPS 145

Query: 147 RLPNLKIRPRTVVSSRALMSKQVYKCPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKG 206
           RLP+    P+        MSKQVYK P+ L+GLAR IR L+ +++V  VLN+   FL+KG
Sbjct: 146 RLPS----PKKNAVKIGPMSKQVYKNPNALVGLAREIRSLAADKDVGIVLNKRVHFLRKG 205

Query: 207 SLSLTIKELGHMGLPDRALKTFCWAQEQPRLFPDDRVLATTVEVLARNHELKVPLNLEEF 266
           SLS+TI+ELGHMGLP+RAL+TFCWAQ+QP+L+PDDR+L++TVEVLARNHELK+P NL++F
Sbjct: 206 SLSMTIRELGHMGLPERALQTFCWAQKQPQLYPDDRILSSTVEVLARNHELKLPFNLDKF 265

Query: 267 TELASRGVLEAMVRGFIKGGSLNLAWKLLVAAKKRKRMLDPSVYVKLILELGKNPDKNML 326
           T  ASRGV+EAMVRGFIKGGSL+LAWKL+  AK  +R LDPS+Y KLI+E GKNPDK+M+
Sbjct: 266 TSSASRGVIEAMVRGFIKGGSLHLAWKLVSVAKDNQRKLDPSLYAKLIVEFGKNPDKHMV 325

Query: 327 VLTLMDELGQREALKLNQQDTTAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVMYTALV 386
           V+TL++ELG+RE L L+QQD TAI+KVC RLGK E+ E ++ W+ +SGH+P+VV+YT L+
Sbjct: 326 VMTLLEELGEREDLSLSQQDCTAIMKVCIRLGKFEVVESVYDWFRQSGHDPTVVIYTTLI 385

Query: 387 HNRYSDKKYREALSLVWEMEAANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFA 446
           H+RYS+K+YREAL++VWEMEA+NC FD PAY VVIKLFVAL DL+RA RYF+KLKEAGF 
Sbjct: 386 HSRYSEKRYREALAVVWEMEASNCLFDFPAYRVVIKLFVALNDLARAARYFSKLKEAGFT 445

Query: 447 PTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFIMDKQITSMLLQAKR 496
           PTYDVYR+LI IY+VSGRLAKC+EI KE E  G  +D++  S LLQ +R
Sbjct: 446 PTYDVYRDLIRIYMVSGRLAKCREICKEVEMGGLKLDQETMSHLLQLER 471

BLAST of Cla97C01G011970 vs. TrEMBL
Match: tr|A0A2I4GWH4|A0A2I4GWH4_9ROSI (pentatricopeptide repeat-containing protein At2g01860 isoform X2 OS=Juglans regia OX=51240 GN=LOC109011476 PE=4 SV=1)

HSP 1 Score: 563.5 bits (1451), Expect = 4.6e-157
Identity = 294/472 (62.29%), Positives = 360/472 (76.27%), Query Frame = 0

Query: 32  TNSRRHPPKNLLYPRRAKLPPDPIVVNQFLNNKISAPSPSFTDLTSSEISQLPEGEEDE- 91
           + +RR PPKNL YPR  K PP+   VN FL         + T+ T   ++ L +G++   
Sbjct: 31  SKTRRRPPKNLRYPRHPKSPPN-FGVNLFLKK-------TSTNSTDISLAYLIDGKKPRL 90

Query: 92  --------------HEEIYAYDYKDTDVVWDSDEIEAISSLFQGRIPQKPGTLNRDRPLP 151
                                  ++T + WDSDEIEAISSLFQGR+PQKPG LNR+RPL 
Sbjct: 91  AGKKGXXXXXXXXXXXXXXXXXRQETGICWDSDEIEAISSLFQGRVPQKPGKLNRERPLX 150

Query: 152 LPLPHKLRPPRLP----NLKIRPRTVVSSRALMSKQVYKCPDFLIGLARAIRDLSPEENV 211
               +KL P  LP    ++K     VVSSRA +SKQVYK P  LIG+AR I+ +S EE+V
Sbjct: 151 XXXXYKLXPLGLPTPKKHVKSASPLVVSSRASLSKQVYKNPGVLIGIAREIKMISSEEDV 210

Query: 212 SKVLNRWGPFLQKGSLSLTIKELGHMGLPDRALKTFCWAQEQPRLFPDDRVLATTVEVLA 271
           S VLN+W  FL+KGSLSLTI+ELGHMGLP+RAL+TFCWAQ+Q +LFPDDR+LA+TVEVLA
Sbjct: 211 SVVLNKWARFLRKGSLSLTIRELGHMGLPERALQTFCWAQKQTQLFPDDRILASTVEVLA 270

Query: 272 RNHELKVPLNLEEFTELASRGVLEAMVRGFIKGGSLNLAWKLLVAAKKRKRMLDPSVYVK 331
           RNHELKVP  L +FT LASRGV+EAMVRGFI+GGSL+LAWKLL  A+  KRMLDPS+Y K
Sbjct: 271 RNHELKVPFKLGKFTSLASRGVMEAMVRGFIRGGSLHLAWKLLSVARDGKRMLDPSIYAK 330

Query: 332 LILELGKNPDKNMLVLTLMDELGQREALKLNQQDTTAIIKVCTRLGKSEIAEELHSWYVE 391
           LILELGKNPDK+MLV++L+DELG+RE L L+QQD TAI+K+C RLGK ++ + L +W+ +
Sbjct: 331 LILELGKNPDKHMLVVSLLDELGEREDLNLSQQDCTAIMKICIRLGKFDVVDGLFNWFKQ 390

Query: 392 SGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEAANCPFDLPAYSVVIKLFVALGDLSR 451
           SG+EPSVVMYT L+H+ YS++KYREAL+LVWEMEA+NC  DLPAY VVIKLFVAL D+SR
Sbjct: 391 SGYEPSVVMYTTLIHSHYSERKYREALALVWEMEASNCLLDLPAYRVVIKLFVALNDISR 450

Query: 452 AVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFIMDK 485
           AVRYF+KLKEAGF+PTYD+YR LI IY+VSGRLAKCKE+ KEAE AGF +DK
Sbjct: 451 AVRYFSKLKEAGFSPTYDMYRELIKIYMVSGRLAKCKEVCKEAEIAGFKLDK 494

BLAST of Cla97C01G011970 vs. Swiss-Prot
Match: sp|Q5XET4|PP142_ARATH (Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX=3702 GN=EMB975 PE=2 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 3.2e-121
Identity = 244/471 (51.80%), Positives = 316/471 (67.09%), Query Frame = 0

Query: 31  NTNSRRHP---PKNLLYPRRAKLPPDPIVVNQFLNNKISAPSPSFTDLTSSEISQLPEGE 90
           N + R H     KNL  PRR KLPPD   VN FL        P    L            
Sbjct: 31  NASQRNHSKKLTKNLRNPRRTKLPPD-FGVNLFLR------KPKIEPLVI---------X 90

Query: 91  EDEHEEIYAYDYKDTDVVWDSDEIEAISSLFQGRIPQKPGTLNRDRPLPLPLPHKLRPPR 150
                             W+ +EIEAISSLFQ RIPQKP   +R RPLPL          
Sbjct: 91  XXXXXXXXXXXXXXXXXXWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPL---XXXXXXX 150

Query: 151 LPNLKIRPRTVVSSRAL--MSKQVYKCPDFLIGLARAIRDL-SPEENVSKVLNRWGPFLQ 210
                     ++ S AL  +SKQVYK P FLIGLAR I+ L S + +VS VLN+W  FL+
Sbjct: 151 XXXXXXXXXNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLNKWVSFLR 210

Query: 211 KGSLSLTIKELGHMGLPDRALKTFCWAQEQPRLFPDDRVLATTVEVLARNHELKVPLNLE 270
           KGSLS TI+ELGHMGLP+RAL+T+ WA++   L PD+R+LA+T++VLA++HELK+   L+
Sbjct: 211 KGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHELKL---LK 270

Query: 271 EFTELASRGVLEAMVRGFIKGGSLNLAWKLLVAAKKRKRMLDPSVYVKLILELGKNPDKN 330
               LAS+ V+EAM++G I+GG LNLA KL++ +K   R+LD SVYVK+ILE+ KNPDK 
Sbjct: 271 FDNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEIAKNPDKY 330

Query: 331 MLVLTLMDELGQREALKLNQQDTTAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVMYTA 390
            LV+ L++EL +RE LKL+QQD T+I+K+C +LG+ E+ E L  W+  S  EPSVVMYT 
Sbjct: 331 HLVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREPSVVMYTT 390

Query: 391 LVHNRYSDKKYREALSLVWEMEAANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAG 450
           ++H+RYS++KYREA+S+VWEME +NC  DLPAY VVIKLFVAL DL RA+RY++KLKEAG
Sbjct: 391 MIHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYYSKLKEAG 450

Query: 451 FAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFIMDKQITSMLLQAKR 496
           F+PTYD+YR++I++Y  SGRL KCKEI KE E+AG  +DK  +  LLQ ++
Sbjct: 451 FSPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEK 479

BLAST of Cla97C01G011970 vs. Swiss-Prot
Match: sp|Q8GZ63|PP397_ARATH (Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX=3702 GN=At5g25630 PE=2 SV=2)

HSP 1 Score: 68.2 bits (165), Expect = 3.0e-10
Identity = 37/124 (29.84%), Positives = 61/124 (49.19%), Query Frame = 0

Query: 348 TAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEA 407
           T ++ V    G+   A+ +     E+GH PS++ YT L+      K+Y    S+V E+E 
Sbjct: 49  TKLMNVLIERGRPHEAQTVFKTLAETGHRPSLISYTTLLAAMTVQKQYGSISSIVSEVEQ 108

Query: 408 ANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAK 467
           +    D   ++ VI  F   G++  AV+   K+KE G  PT   Y  LI  Y ++G+  +
Sbjct: 109 SGTKLDSIFFNAVINAFSESGNMEDAVQALLKMKELGLNPTTSTYNTLIKGYGIAGKPER 168

Query: 468 CKEI 472
             E+
Sbjct: 169 SSEL 172

BLAST of Cla97C01G011970 vs. TAIR10
Match: AT2G01860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 436.8 bits (1122), Expect = 1.8e-122
Identity = 244/471 (51.80%), Positives = 316/471 (67.09%), Query Frame = 0

Query: 31  NTNSRRHP---PKNLLYPRRAKLPPDPIVVNQFLNNKISAPSPSFTDLTSSEISQLPEGE 90
           N + R H     KNL  PRR KLPPD   VN FL        P    L            
Sbjct: 31  NASQRNHSKKLTKNLRNPRRTKLPPD-FGVNLFLR------KPKIEPLVI---------X 90

Query: 91  EDEHEEIYAYDYKDTDVVWDSDEIEAISSLFQGRIPQKPGTLNRDRPLPLPLPHKLRPPR 150
                             W+ +EIEAISSLFQ RIPQKP   +R RPLPL          
Sbjct: 91  XXXXXXXXXXXXXXXXXXWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPL---XXXXXXX 150

Query: 151 LPNLKIRPRTVVSSRAL--MSKQVYKCPDFLIGLARAIRDL-SPEENVSKVLNRWGPFLQ 210
                     ++ S AL  +SKQVYK P FLIGLAR I+ L S + +VS VLN+W  FL+
Sbjct: 151 XXXXXXXXXNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLNKWVSFLR 210

Query: 211 KGSLSLTIKELGHMGLPDRALKTFCWAQEQPRLFPDDRVLATTVEVLARNHELKVPLNLE 270
           KGSLS TI+ELGHMGLP+RAL+T+ WA++   L PD+R+LA+T++VLA++HELK+   L+
Sbjct: 211 KGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHELKL---LK 270

Query: 271 EFTELASRGVLEAMVRGFIKGGSLNLAWKLLVAAKKRKRMLDPSVYVKLILELGKNPDKN 330
               LAS+ V+EAM++G I+GG LNLA KL++ +K   R+LD SVYVK+ILE+ KNPDK 
Sbjct: 271 FDNSLASKNVIEAMIKGCIEGGWLNLARKLILISKSNNRILDSSVYVKMILEIAKNPDKY 330

Query: 331 MLVLTLMDELGQREALKLNQQDTTAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVMYTA 390
            LV+ L++EL +RE LKL+QQD T+I+K+C +LG+ E+ E L  W+  S  EPSVVMYT 
Sbjct: 331 HLVVALLEELKKREDLKLSQQDCTSIMKICVKLGEFELVESLFDWFKASNREPSVVMYTT 390

Query: 391 LVHNRYSDKKYREALSLVWEMEAANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAG 450
           ++H+RYS++KYREA+S+VWEME +NC  DLPAY VVIKLFVAL DL RA+RY++KLKEAG
Sbjct: 391 MIHSRYSEQKYREAMSVVWEMEESNCLLDLPAYRVVIKLFVALDDLGRAMRYYSKLKEAG 450

Query: 451 FAPTYDVYRNLITIYLVSGRLAKCKEIYKEAENAGFIMDKQITSMLLQAKR 496
           F+PTYD+YR++I++Y  SGRL KCKEI KE E+AG  +DK  +  LLQ ++
Sbjct: 451 FSPTYDIYRDMISVYTASGRLTKCKEICKEVEDAGLRLDKDTSFRLLQLEK 479

BLAST of Cla97C01G011970 vs. TAIR10
Match: AT5G25630.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 68.2 bits (165), Expect = 1.7e-11
Identity = 37/124 (29.84%), Positives = 61/124 (49.19%), Query Frame = 0

Query: 348 TAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYREALSLVWEMEA 407
           T ++ V    G+   A+ +     E+GH PS++ YT L+      K+Y    S+V E+E 
Sbjct: 49  TKLMNVLIERGRPHEAQTVFKTLAETGHRPSLISYTTLLAAMTVQKQYGSISSIVSEVEQ 108

Query: 408 ANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLITIYLVSGRLAK 467
           +    D   ++ VI  F   G++  AV+   K+KE G  PT   Y  LI  Y ++G+  +
Sbjct: 109 SGTKLDSIFFNAVINAFSESGNMEDAVQALLKMKELGLNPTTSTYNTLIKGYGIAGKPER 168

Query: 468 CKEI 472
             E+
Sbjct: 169 SSEL 172

BLAST of Cla97C01G011970 vs. TAIR10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 42.7 bits (99), Expect = 7.5e-04
Identity = 35/158 (22.15%), Positives = 67/158 (42.41%), Query Frame = 0

Query: 341 KLNQQDTTAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVM----YTALVHNRYSDKKYR 400
           +L + D  +++K     G  E A  L  W V S +  ++ +        V     + +Y 
Sbjct: 133 ELLRTDLVSLVKGLDDSGHWERAVFLFEWLVLSSNSGALKLDHQVIEIFVRILGRESQYS 192

Query: 401 EALSLVWEMEAANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLI 460
            A  L+ ++       D+ AY+ ++  +   G   +A+  F ++KE G +PT   Y  ++
Sbjct: 193 VAAKLLDKIPLQEYLLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMGPSPTLVTYNVIL 252

Query: 461 TIYLVSGR-LAKCKEIYKEAENAGFIMDKQITSMLLQA 494
            ++   GR   K   +  E  + G   D+   S +L A
Sbjct: 253 DVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSA 290

BLAST of Cla97C01G011970 vs. TAIR10
Match: AT3G02330.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 42.4 bits (98), Expect = 9.8e-04
Identity = 30/147 (20.41%), Positives = 67/147 (45.58%), Query Frame = 0

Query: 337 REALKLNQQDTTAIIKVCTRLGKSEIAEELHSWYVESGHEPSVVMYTALVHNRYSDKKYR 396
           RE ++ + +    I+KVC+ L  + +  ++H   V  G +  VV  +AL+      K++ 
Sbjct: 173 REGIEFDGRTFAIILKVCSFLEDTSLGMQIHGIVVRVGCDTDVVAASALLDMYAKGKRFV 232

Query: 397 EALSLVWEMEAANCPFDLPAYSVVIKLFVALGDLSRAVRYFAKLKEAGFAPTYDVYRNLI 456
           E+L +   +   N      ++S +I   V    LS A+++F ++++     +  +Y +++
Sbjct: 233 ESLRVFQGIPEKNS----VSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVL 292

Query: 457 TIYLVSGRLAKCKEIYKEAENAGFIMD 484
                   L    +++  A  + F  D
Sbjct: 293 RSCAALSELRLGGQLHAHALKSDFAAD 315
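
The percentages in the HSP headers above follow from the raw counts: each is the number of identical (or positively scoring) positions divided by the alignment length. The following is a minimal Python sketch that recomputes them for the four TAIR10 hits. The names `TAIR10_HSPS` and `percent` are hypothetical and introduced only for illustration; the counts are copied from the headers above.

```python
# Counts copied from the TAIR10 HSP headers:
# (identical positions, positive positions, alignment length)
TAIR10_HSPS = {
    "AT2G01860.1": (244, 316, 471),
    "AT5G25630.2": (37, 61, 124),
    "AT2G18940.1": (35, 67, 158),
    "AT3G02330.1": (30, 67, 147),
}


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100 * count / length, 2)


for hit, (identical, positive, length) in TAIR10_HSPS.items():
    print(f"{hit}: identity {percent(identical, length):.2f}%, "
          f"positives {percent(positive, length):.2f}%")
```

Identity counts only exact residue matches, while positives also include substitutions that score above zero in the scoring matrix, so the second value is never smaller than the first.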

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_008462173.1  7.5e-244  87.90         PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] ...
XP_004139567.1  1.7e-243  87.30         PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativu...
XP_022951807.1  2.0e-228  82.07         pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita mosc...
XP_023537574.1  2.9e-227  81.67         pentatricopeptide repeat-containing protein At2g01860 [Cucurbita pepo subsp. pep...
XP_023001961.1  8.3e-227  81.47         pentatricopeptide repeat-containing protein At2g01860 isoform X1 [Cucurbita maxi...
Match Name                      E-value   Identity (%)  Description
tr|A0A1S3CGD0|A0A1S3CGD0_CUCME  4.9e-244  87.90         pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN...
tr|A0A0A0LVM0|A0A0A0LVM0_CUCSA  1.1e-243  87.30         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1
tr|A0A2N9HFH8|A0A2N9HFH8_FAGSY  3.1e-169  66.95         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS38191 PE=4 SV=1
tr|A0A2P6QJA8|A0A2P6QJA8_ROSCH  3.5e-157  62.05         Putative pentatricopeptide OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr5g0066911 P...
tr|A0A2I4GWH4|A0A2I4GWH4_9ROSI  4.6e-157  62.29         pentatricopeptide repeat-containing protein At2g01860 isoform X2 OS=Juglans regi...
Match Name             E-value   Identity (%)  Description
sp|Q5XET4|PP142_ARATH  3.2e-121  51.80         Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX...
sp|Q8GZ63|PP397_ARATH  3.0e-10   29.84         Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX...
Match Name   E-value   Identity (%)  Description
AT2G01860.1  1.8e-122  51.80         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G25630.2  1.7e-11   29.84         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G18940.1  7.5e-04   22.15         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G02330.1  9.8e-04   20.41         Pentatricopeptide repeat (PPR) superfamily protein
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C01G011970.1  Cla97C01G011970.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 416..448; e-value: 0.0027; score: 15.7
IPR002885  Pentatricopeptide repeat                           PFAM     PF13812           PPR_3                coord: 401..459; e-value: 5.1E-5; score: 23.2
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 343..377; score: 7.706
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 413..447; score: 9.887
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 271..305; score: 5.722
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 378..412; score: 7.224
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 448..482; score: 7.048
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 316..495; e-value: 1.1E-25; score: 92.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 177..303; e-value: 6.0E-6; score: 27.8
None       No IPR available                                   PANTHER  PTHR24015:SF642   SUBFAMILY NOT NAMED  coord: 22..494
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 22..494
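
The five PROSITE PS51375 hits above can be checked against the usual description of a pentatricopeptide repeat: a degenerate motif of about 35 residues that typically occurs in tandem. The Python sketch below computes the length of each hit and groups back-to-back hits into runs. The names `PPR_HITS`, `span_length` and `tandem_runs` are hypothetical, and treating the listed coordinates as inclusive residue positions is an assumption about the table's convention.

```python
# PROSITE PS51375 (PPR) hits copied from the InterPro table, as (start, end).
# Assumption: both coordinates are inclusive residue positions.
PPR_HITS = [(343, 377), (413, 447), (271, 305), (378, 412), (448, 482)]


def span_length(start, end):
    """Length of an inclusive residue range."""
    return end - start + 1


def tandem_runs(hits):
    """Group hits into runs in which each hit starts right after the previous one ends."""
    runs = []
    for start, end in sorted(hits):
        if runs and runs[-1][-1][1] + 1 == start:
            runs[-1].append((start, end))  # directly adjacent: extend the current run
        else:
            runs.append([(start, end)])    # gap before this hit: start a new run
    return runs


lengths = [span_length(start, end) for start, end in PPR_HITS]
runs = tandem_runs(PPR_HITS)

print("hit lengths:", lengths)
for run in runs:
    print(f"run {run[0][0]}..{run[-1][1]}: {len(run)} repeat(s)")
```

Under that assumption, every hit spans 35 residues, and the hits from 343 to 482 form one uninterrupted block of four repeats, with the hit at 271..305 standing apart.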

The following gene(s) are orthologous to this gene:
Gene             Orthologue     Organism                      Block
Cla97C01G011970  Csa1G144300    Cucumber (Chinese Long) v2    cuwmbB009
Cla97C01G011970  ClCG01G012950  Watermelon (Charleston Gray)  wcgwmbB089
Cla97C01G011970  Lsi03G000760   Bottle gourd (USVL1VR-Ls)     lsiwmbB217
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                    Block
Cla97C01G011970  Wax gourd                   wgowmbB623
Cla97C01G011970  Watermelon (97103) v2       wmbwmbB028
Cla97C01G011970  Cucumber (Gy14) v2          cgybwmbB009
Cla97C01G011970  Cucumber (Gy14) v1          cgywmbB373
Cla97C01G011970  Wild cucumber (PI 183967)   cpiwmbB009
Cla97C01G011970  Cucumber (Chinese Long) v3  cucwmbB008
Cla97C01G011970  Melon (DHL92) v3.6.1        medwmbB132
Cla97C01G011970  Melon (DHL92) v3.5.1        mewmbB143