HG10004365 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004365
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat
Location: Chr08: 16350095 .. 16351600 (-)
RNA-Seq Expression: HG10004365
Synteny: HG10004365
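
The span given in the Location field can be turned into a feature length with a short calculation. The sketch below is illustrative only and assumes the coordinates are 1-based and inclusive, as in GFF-style annotation; the function name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


# Coordinates copied from the Location field (Chr08, minus strand).
start, end = 16350095, 16351600

length_bp = span_length(start, end)
# Split the span into whole codons; the remainder shows any frame leftover.
codons, remainder = divmod(length_bp, 3)

print(length_bp, codons, remainder)
```

Under that assumption the span is a whole number of codons, which is what one would expect if the gene consists of a single coding exon that includes the stop codon.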
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGCCATTACACATCGACGTATTATCATCAATAAGTTCTCATTTTCCTTCATCTTTCATCAACGTTTTACTCCCTTTACTTCCGATTCCACGACGGCCGCCACTCCCCACAACAACATTCCTCCGGCAATCAACCCGACCCACCTCCGCCGTGTCTGTACCGTTCTATATCAGCAACAGAACTCCCCCCATCTCAAGCTTCACTCCAAACTTCTCGCCTGTAATTTCAATCTCTCACACGAATTCTTCCTCCAGGTTTGCAACACCTTCCCTCTCTCATGGCGTCCCGTTTATCGCTTCTTCCAATTCACTGAAACCGACCCTAATTTCACTCACACGGCGGTATCTTTCAATAAGTTGATTGATGTTGTTGGGAAATCACGAAATATCGATCTCTTATGGGGTTTGGTTCAGGAAATGGGGCGGCGGCGGTTGGTTACTGATAAAACCTTTGTTGTTGCTCTGAGAACTTTGGCGGCGGCCAGAGAGTTGAAGAAGTGTGTGGAGTTTTTCCATTTGATGGATGGATATGGGTTTGGTTATAGTTTAATGACTTTGAATAAAGTAGTTGAGAAATTGTGTGTATGTAAATTAGTGGATGAGGCTAAGTTTTTGGTTATGAAATTGAATGAATGGATCAAAGCTGATGGGGTTACTTACAAATGGTTGATTAAGGGGTTTTGCAATGTTGGGGATTTGATTGAAGCTTCAAAGATGTGGAACTTAATGGTGGATGAAGGGTTTGAGCCTGAAATGGAAGCTGTGGAGGAGATGATGAATGTTCTTTTCAAGACCAATAAATTTGATGAAGCTTTGAAACTTTTCCAGGCAGTTAGATCAAACAGGATGAATGATTTGATTCCTTCAACATACAGTCTTATGATAAGATGGTTGTGTAACAAAGCTAAGGTGAGGCAAGCGTATGTCGTGTTCGACGAAATGCATAAGAGAGGACTTGAAGCTGATAATTCAGCACATTCTTCACTAATTTATGGGCTTTTAGCAAGAGGGAGAAGGAGAGAAGCTTATAATATAATGAGAAGAATTGAGAATCCTGATTTGGGTGTGTACCATGCTTTAATTAAGGGACTTTTGAGGTTAAAAAGGGCAAATGAAGCAACCCAAGTTTTCAGGGAAATGATAGAAACGGGGTGTGAGCCTATAATGCATACATATATAATGTTGTTGCAGGGACATTTAGGGAAAAGGGGGAGGAAGGGATTGGATCCACTTGTGAATTTTGATACTATTTTTGTTGGAGGATTGGTGAAGAATGAGAAGTCATTGGAGGCCACAAAGTATGTGGAAAGACTTATGAAAAGAGGGCTTGAAGTGCCAAGGTTTGATTACAACAAGTTTTTGCATTACTATTCAAATGAGGAGGGAGTGGTAATGTTTAGAGAGGTGGGGAATAGGTTGAGAGAAGTTGGATTGGTTGATTTGGCTGATATATTTCAGAGATATGGGGAGAAAATGACCACTAGAGATAGGAGGAGAAATTGA

mRNA sequence

ATGGCAGCCATTACACATCGACGTATTATCATCAATAAGTTCTCATTTTCCTTCATCTTTCATCAACGTTTTACTCCCTTTACTTCCGATTCCACGACGGCCGCCACTCCCCACAACAACATTCCTCCGGCAATCAACCCGACCCACCTCCGCCGTGTCTGTACCGTTCTATATCAGCAACAGAACTCCCCCCATCTCAAGCTTCACTCCAAACTTCTCGCCTGTAATTTCAATCTCTCACACGAATTCTTCCTCCAGGTTTGCAACACCTTCCCTCTCTCATGGCGTCCCGTTTATCGCTTCTTCCAATTCACTGAAACCGACCCTAATTTCACTCACACGGCGGTATCTTTCAATAAGTTGATTGATGTTGTTGGGAAATCACGAAATATCGATCTCTTATGGGGTTTGGTTCAGGAAATGGGGCGGCGGCGGTTGGTTACTGATAAAACCTTTGTTGTTGCTCTGAGAACTTTGGCGGCGGCCAGAGAGTTGAAGAAGTGTGTGGAGTTTTTCCATTTGATGGATGGATATGGGTTTGGTTATAGTTTAATGACTTTGAATAAAGTAGTTGAGAAATTGTGTGTATGTAAATTAGTGGATGAGGCTAAGTTTTTGGTTATGAAATTGAATGAATGGATCAAAGCTGATGGGGTTACTTACAAATGGTTGATTAAGGGGTTTTGCAATGTTGGGGATTTGATTGAAGCTTCAAAGATGTGGAACTTAATGGTGGATGAAGGGTTTGAGCCTGAAATGGAAGCTGTGGAGGAGATGATGAATGTTCTTTTCAAGACCAATAAATTTGATGAAGCTTTGAAACTTTTCCAGGCAGTTAGATCAAACAGGATGAATGATTTGATTCCTTCAACATACAGTCTTATGATAAGATGGTTGTGTAACAAAGCTAAGGTGAGGCAAGCGTATGTCGTGTTCGACGAAATGCATAAGAGAGGACTTGAAGCTGATAATTCAGCACATTCTTCACTAATTTATGGGCTTTTAGCAAGAGGGAGAAGGAGAGAAGCTTATAATATAATGAGAAGAATTGAGAATCCTGATTTGGGTGTGTACCATGCTTTAATTAAGGGACTTTTGAGGTTAAAAAGGGCAAATGAAGCAACCCAAGTTTTCAGGGAAATGATAGAAACGGGGTGTGAGCCTATAATGCATACATATATAATGTTGTTGCAGGGACATTTAGGGAAAAGGGGGAGGAAGGGATTGGATCCACTTGTGAATTTTGATACTATTTTTGTTGGAGGATTGGTGAAGAATGAGAAGTCATTGGAGGCCACAAAGTATGTGGAAAGACTTATGAAAAGAGGGCTTGAAGTGCCAAGGTTTGATTACAACAAGTTTTTGCATTACTATTCAAATGAGGAGGGAGTGGTAATGTTTAGAGAGGTGGGGAATAGGTTGAGAGAAGTTGGATTGGTTGATTTGGCTGATATATTTCAGAGATATGGGGAGAAAATGACCACTAGAGATAGGAGGAGAAATTGA

Coding sequence (CDS)

ATGGCAGCCATTACACATCGACGTATTATCATCAATAAGTTCTCATTTTCCTTCATCTTTCATCAACGTTTTACTCCCTTTACTTCCGATTCCACGACGGCCGCCACTCCCCACAACAACATTCCTCCGGCAATCAACCCGACCCACCTCCGCCGTGTCTGTACCGTTCTATATCAGCAACAGAACTCCCCCCATCTCAAGCTTCACTCCAAACTTCTCGCCTGTAATTTCAATCTCTCACACGAATTCTTCCTCCAGGTTTGCAACACCTTCCCTCTCTCATGGCGTCCCGTTTATCGCTTCTTCCAATTCACTGAAACCGACCCTAATTTCACTCACACGGCGGTATCTTTCAATAAGTTGATTGATGTTGTTGGGAAATCACGAAATATCGATCTCTTATGGGGTTTGGTTCAGGAAATGGGGCGGCGGCGGTTGGTTACTGATAAAACCTTTGTTGTTGCTCTGAGAACTTTGGCGGCGGCCAGAGAGTTGAAGAAGTGTGTGGAGTTTTTCCATTTGATGGATGGATATGGGTTTGGTTATAGTTTAATGACTTTGAATAAAGTAGTTGAGAAATTGTGTGTATGTAAATTAGTGGATGAGGCTAAGTTTTTGGTTATGAAATTGAATGAATGGATCAAAGCTGATGGGGTTACTTACAAATGGTTGATTAAGGGGTTTTGCAATGTTGGGGATTTGATTGAAGCTTCAAAGATGTGGAACTTAATGGTGGATGAAGGGTTTGAGCCTGAAATGGAAGCTGTGGAGGAGATGATGAATGTTCTTTTCAAGACCAATAAATTTGATGAAGCTTTGAAACTTTTCCAGGCAGTTAGATCAAACAGGATGAATGATTTGATTCCTTCAACATACAGTCTTATGATAAGATGGTTGTGTAACAAAGCTAAGGTGAGGCAAGCGTATGTCGTGTTCGACGAAATGCATAAGAGAGGACTTGAAGCTGATAATTCAGCACATTCTTCACTAATTTATGGGCTTTTAGCAAGAGGGAGAAGGAGAGAAGCTTATAATATAATGAGAAGAATTGAGAATCCTGATTTGGGTGTGTACCATGCTTTAATTAAGGGACTTTTGAGGTTAAAAAGGGCAAATGAAGCAACCCAAGTTTTCAGGGAAATGATAGAAACGGGGTGTGAGCCTATAATGCATACATATATAATGTTGTTGCAGGGACATTTAGGGAAAAGGGGGAGGAAGGGATTGGATCCACTTGTGAATTTTGATACTATTTTTGTTGGAGGATTGGTGAAGAATGAGAAGTCATTGGAGGCCACAAAGTATGTGGAAAGACTTATGAAAAGAGGGCTTGAAGTGCCAAGGTTTGATTACAACAAGTTTTTGCATTACTATTCAAATGAGGAGGGAGTGGTAATGTTTAGAGAGGTGGGGAATAGGTTGAGAGAAGTTGGATTGGTTGATTTGGCTGATATATTTCAGAGATATGGGGAGAAAATGACCACTAGAGATAGGAGGAGAAATTGA

Protein sequence

MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRRN
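
As a quick consistency check between the coding sequence and the protein sequence listed above, the first and last codons of the CDS can be translated by hand. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the helper `translate` and the short excerpts are illustrative, not the pipeline used to produce this annotation.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; trailing partial codons are ignored."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 39 nt and last 18 nt of the CDS shown above.
cds_start = "ATGGCAGCCATTACACATCGACGTATTATCATCAATAAG"
cds_end = "GATAGGAGGAGAAATTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two translated excerpts should correspond to the beginning and end of the protein sequence, with `*` marking the stop codon.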
Homology
BLAST of HG10004365 vs. NCBI nr
Match: XP_038884840.1 (putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispida] >XP_038884842.1 putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispida] >XP_038884843.1 putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispida] >XP_038884844.1 putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispida] >XP_038884845.1 putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispida] >XP_038884846.1 putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispida] >XP_038884847.1 putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispida] >XP_038884848.1 putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispida])

HSP 1 Score: 969.5 bits (2505), Expect = 1.1e-278
Identity = 477/501 (95.21%), Positives = 489/501 (97.60%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MAAI +RRIIINKFSF+FIFHQRFTPFTSDST AATPHNNIPP INPTHLRRVCTVLYQQ
Sbjct: 1   MAAIINRRIIINKFSFTFIFHQRFTPFTSDSTVAATPHNNIPPPINPTHLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP LKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK
Sbjct: 61  QNSPDLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
           GYSL+TLNKVVEKLC CKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASK+
Sbjct: 181 GYSLVTLNKVVEKLCGCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKI 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WNLMVDEGFEPEMEAVEEMMNVLFKTNKF EALKLFQAVRS+RMNDLIPSTYSL IRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFVEALKLFQAVRSDRMNDLIPSTYSLTIRWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NKAKV+QAYVVF++MHKRGLEADNS HSSLIYGLLARGRRREAY+IM+RIE PDLGVYHA
Sbjct: 301 NKAKVKQAYVVFNKMHKRGLEADNSVHSSLIYGLLARGRRREAYDIMKRIEKPDLGVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
           LIKGLLRLKRANEATQVFREMIE GCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFD+IFV
Sbjct: 361 LIKGLLRLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDSIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVK+ KSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLRE GLV
Sbjct: 421 GGLVKSGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREAGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKMTTRDRRRN
Sbjct: 481 DLADIFQRYGEKMTTRDRRRN 501
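
The identity and positive-match percentages reported for each HSP are the matched positions divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the hit above (the function name `percent` is hypothetical):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned positions that match, as a percentage with 2 decimals."""
    return round(100 * matches / aligned_length, 2)


# Counts taken from the HSP statistics of the Benincasa hispida hit.
identity = percent(477, 501)   # identical residues / alignment length
positives = percent(489, 501)  # identical or conservatively substituted residues

print(identity, positives)
```

The same calculation applies to every hit in this section; only the counts and the alignment length change.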

BLAST of HG10004365 vs. NCBI nr
Match: KAA0064759.1 (putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 929.5 bits (2401), Expect = 1.2e-266
Identity = 459/501 (91.62%), Positives = 476/501 (95.01%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MAAIT+RRIIIN FSFSFIFHQRF+PFTSDST A    +NIP AI+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSTAA----DNIPSAIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP +KLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHT VSFNK
Sbjct: 61  QNSPDIKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTVVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
            YSL+TLNKVVEKLC CKLVDEAKFLVMKLNEWIKADGVTYK LIKGFCNVGDLIEASKM
Sbjct: 181 CYSLVTLNKVVEKLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WNLMVDEGFEPEMEAVEEM+NVLFKTNK DEALKLFQAVRSNRMNDLIPSTY L+IRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMVNVLFKTNKLDEALKLFQAVRSNRMNDLIPSTYRLVIRWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NK KVRQA++VFDEMHKRGLE DNS HSSLIYGLLARGRRREAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVRQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
           LIKGLL+LKRANEATQVFREMIE GCEPIMHTYIMLLQGHLGKRGRKGLDP VNFD+IFV
Sbjct: 361 LIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGLDPFVNFDSIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKN KSLEATKYVER+MKRGLEVPRFDYNKFLHYYSN+EGVVMFREVGNRLREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERIMKRGLEVPRFDYNKFLHYYSNDEGVVMFREVGNRLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKMTTRDRRRN
Sbjct: 481 DLADIFQRYGEKMTTRDRRRN 497

BLAST of HG10004365 vs. NCBI nr
Match: XP_008445460.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis melo] >XP_008445461.1 PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis melo] >XP_008445462.1 PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis melo] >XP_008445463.1 PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis melo] >XP_016900083.1 PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis melo])

HSP 1 Score: 927.5 bits (2396), Expect = 4.7e-266
Identity = 458/501 (91.42%), Positives = 475/501 (94.81%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MAAIT+RRIIIN FSFSFIFHQRF+PFTSDST A    +NIP AI+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSTAA----DNIPSAIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP +KLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHT VSFNK
Sbjct: 61  QNSPDIKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTVVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
            YSL+TLNKVVE LC CKLVDEAKFLVMKLNEWIKADGVTYK LIKGFCNVGDLIEASKM
Sbjct: 181 CYSLVTLNKVVENLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WNLMVDEGFEPEMEAVEEM+NVLFKTNK DEALKLFQAVRSNRMNDLIPSTY L+IRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMVNVLFKTNKLDEALKLFQAVRSNRMNDLIPSTYRLVIRWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NK KVRQA++VFDEMHKRGLE DNS HSSLIYGLLARGRRREAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVRQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
           LIKGLL+LKRANEATQVFREMIE GCEPIMHTYIMLLQGHLGKRGRKGLDP VNFD+IFV
Sbjct: 361 LIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGLDPFVNFDSIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKN KSLEATKYVER+MKRGLEVPRFDYNKFLHYYSN+EGVVMFREVGNRLREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERIMKRGLEVPRFDYNKFLHYYSNDEGVVMFREVGNRLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKMTTRDRRRN
Sbjct: 481 DLADIFQRYGEKMTTRDRRRN 497

BLAST of HG10004365 vs. NCBI nr
Match: XP_004144262.1 (putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis sativus] >XP_031743882.1 putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis sativus] >XP_031743883.1 putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis sativus] >XP_031743916.1 LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis sativus] >KGN47645.1 hypothetical protein Csa_018968 [Cucumis sativus])

HSP 1 Score: 927.2 bits (2395), Expect = 6.1e-266
Identity = 459/501 (91.62%), Positives = 477/501 (95.21%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MAAIT+RRIIIN FSFSFIFHQRF+PFTSDS+TAA   +NIP  I+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSSTAA---DNIPQPIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP LKLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHTAVSFNK
Sbjct: 61  QNSPDLKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTAVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
            YSL+TLN+VVEKLC CKLVDEAKFLVMKLNEWIKADGVTYK LIKGFCNVGDLIEASKM
Sbjct: 181 CYSLVTLNRVVEKLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WNLMVDEGFEPEMEAVEEMMNVLFKTNK DEALKLFQA+RS+RMNDLIPSTYSL+IRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKLDEALKLFQALRSDRMNDLIPSTYSLVIRWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NK KV QA++VFDEMHKRGLE DNS HSSLIYGLLARGRRREAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVGQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
            IKGLL+LKRANEATQVFREMIE GCEPIMHTYIMLLQGHLGKRGRKG DPLVNFD+IFV
Sbjct: 361 FIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGSDPLVNFDSIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKN KSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGN+LREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNKLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKMTTRDRRRN
Sbjct: 481 DLADIFQRYGEKMTTRDRRRN 498

BLAST of HG10004365 vs. NCBI nr
Match: XP_022131400.1 (putative pentatricopeptide repeat-containing protein At1g26500 [Momordica charantia])

HSP 1 Score: 890.6 bits (2300), Expect = 6.4e-255
Identity = 437/501 (87.23%), Positives = 459/501 (91.62%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MA + +RRIII+KFS +FIFHQRF+ FTS+ST   T     P AINP  LRRVCTVLYQQ
Sbjct: 1   MAVVRYRRIIIDKFSLTFIFHQRFSSFTSNSTAHPTA---APAAINPARLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP  +LHSKL ACNFNLSHEFFL+VCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK
Sbjct: 61  QNSPDPRLHSKLRACNFNLSHEFFLRVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           +IDV G SRNI LLW L+QE GRRRLVTDKTFV+ALRTLAAARELKKCVEFFHLMDGYGF
Sbjct: 121 MIDVAGNSRNIGLLWVLIQEAGRRRLVTDKTFVIALRTLAAARELKKCVEFFHLMDGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
           GYSL+TLNKVVEKLC CKLVDEAKFLV KL EWIK DGVTYKWLIKGFC VGDLIEASK+
Sbjct: 181 GYSLVTLNKVVEKLCSCKLVDEAKFLVFKLKEWIKPDGVTYKWLIKGFCEVGDLIEASKL 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WN MVDEGFEP+MEAVEEMMN LFKTNK DEALKLFQAVRSNRMNDLIPSTYSL+ +WLC
Sbjct: 241 WNFMVDEGFEPDMEAVEEMMNALFKTNKPDEALKLFQAVRSNRMNDLIPSTYSLVNKWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NK KVRQAYVVFDEMHKRGLE DNSAHSSLIYGLLARGRRRE Y IM RIENPDLGVYHA
Sbjct: 301 NKGKVRQAYVVFDEMHKRGLETDNSAHSSLIYGLLARGRRREGYRIMERIENPDLGVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
           LIKGLLRL+RANEATQVFREMIE GCEPIMHTY+MLLQGHLGKRGR+G DPLVNFDTIFV
Sbjct: 361 LIKGLLRLRRANEATQVFREMIERGCEPIMHTYVMLLQGHLGKRGRRGFDPLVNFDTIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKN +SLEATKYVER+MKRG+EVPRFDYNKFLHYYSNEEGVVMFREVGNRLRE GLV
Sbjct: 421 GGLVKNGQSLEATKYVERMMKRGVEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREAGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKM TRDRRRN
Sbjct: 481 DLADIFQRYGEKMATRDRRRN 498

BLAST of HG10004365 vs. ExPASy Swiss-Prot
Match: Q9FZD4 (Putative pentatricopeptide repeat-containing protein At1g26500 OS=Arabidopsis thaliana OX=3702 GN=At1g26500 PE=3 SV=1)

HSP 1 Score: 614.8 bits (1584), Expect = 8.8e-175
Identity = 303/505 (60.00%), Positives = 390/505 (77.23%), Query Frame = 0

Query: 1   MAAITHRRIII--NKFSFSFIFHQRFTPFTSDST-TAATPHNNIPPAINPTHLRRVCTVL 60
           +A +T RR+I   N     FI + RF  F+++ T T  TP       IN  HL RVCT+L
Sbjct: 3   VAVVTSRRMINIGNSIRRCFILNHRF--FSTELTPTTITP-------INQDHLLRVCTIL 62

Query: 61  YQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET-DPNFTHTAV 120
           YQQQNSP  +L SKL +  F L+HEFFLQVCN FPLSWRPV+RFF +++T  P+FTHT+ 
Sbjct: 63  YQQQNSPDSRLVSKLSSTKFQLTHEFFLQVCNNFPLSWRPVHRFFLYSQTHHPDFTHTST 122

Query: 121 SFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMD 180
           + NK++ ++G SRN+DL W L QE+G+R LV DKTF + L+TLA+ARELKKCV +FHLM+
Sbjct: 123 TSNKMLAIIGNSRNMDLFWELAQEIGKRGLVNDKTFRIVLKTLASARELKKCVNYFHLMN 182

Query: 181 GYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIE 240
           G+G+ Y++ T+N+ VE LC  KLV+EAKF+ +KL E+IK D +TY+ +I+GFC+VGDLIE
Sbjct: 183 GFGYLYNVETMNRGVETLCKEKLVEEAKFVFIKLKEFIKPDEITYRTMIQGFCDVGDLIE 242

Query: 241 ASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMI 300
           A+K+WNLM+DEGF+ ++EA +++M  L K N+FDEA K+F  + S R  DL    Y +MI
Sbjct: 243 AAKLWNLMMDEGFDVDIEAGKKIMETLLKKNQFDEASKVFYVMVSKRGGDLDGGFYRVMI 302

Query: 301 RWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLG 360
            WLC   ++  A  VFDEM +RG+  DN   +SLIYGLL + R  EAY ++  +ENPD+ 
Sbjct: 303 DWLCKNGRIDMARKVFDEMRERGVYVDNLTWASLIYGLLVKRRVVEAYGLVEGVENPDIS 362

Query: 361 VYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFD 420
           +YH LIKGL+++KRA+EAT+VFR+MI+ GCEPIMHTY+MLLQGHLG+RGRKG DPLVNFD
Sbjct: 363 IYHGLIKGLVKIKRASEATEVFRKMIQRGCEPIMHTYLMLLQGHLGRRGRKGPDPLVNFD 422

Query: 421 TIFVGGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLRE 480
           TIFVGG++K  K LE TKY+ER +KRGLEVPRFDY+KFLHYYSNEEGVVMF E+  +LRE
Sbjct: 423 TIFVGGMIKAGKRLETTKYIERTLKRGLEVPRFDYSKFLHYYSNEEGVVMFEEMAKKLRE 482

Query: 481 VGLVDLADIFQRYGEKMTTRDRRRN 502
           V L DLADIFQRYGEKMTTR+RRR+
Sbjct: 483 VSLFDLADIFQRYGEKMTTRERRRD 498

BLAST of HG10004365 vs. ExPASy Swiss-Prot
Match: Q9LZP3 (Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g62470 PE=2 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 9.8e-65
Identity = 142/476 (29.83%), Positives = 256/476 (53.78%), Query Frame = 0

Query: 41  IPPAINPTHLRRVCTVLYQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           +  + NP  + RVC V+  +  +    + + L     +LSH+  ++V   F  + +P +R
Sbjct: 122 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 181

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 182 FFCWAAERQGFAHDSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 241

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KL E    + +T
Sbjct: 242 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 301

Query: 221 YKWLIKGFCNVGDLIEASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +LIEA+++WN M+D+G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 302 YTVLLNGWCRVRNLIEAARIWNDMIDQGLKPDIVAHNVMLEGLLRSRKKSDAIKLFHVMK 361

Query: 281 SNRMNDLIPSTYSLMIRWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y++MIR  C ++ +  A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 362 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 421

Query: 341 REAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K    AT+++ +MI+   EP +HT+ M+
Sbjct: 422 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHATRIYNKMIQNEIEPSIHTFNMI 481

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+   KS EA +Y+E ++ +G+
Sbjct: 482 MKSYFMARNYEMGRAVWEEMIKKGICPDDNSYTVLIRGLIGEGKSREACRYLEEMLDKGM 541

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 542 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRFKQR 595

BLAST of HG10004365 vs. ExPASy Swiss-Prot
Match: Q3EAF8 (Pentatricopeptide repeat-containing protein At3g62540, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g62540 PE=2 SV=1)

HSP 1 Score: 247.3 bits (630), Expect = 3.7e-64
Identity = 141/476 (29.62%), Positives = 254/476 (53.36%), Query Frame = 0

Query: 41  IPPAINPTHLRRVCTVLYQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           +  + NP  + RVC V+  +  +    + + L     +LSH+  ++V   F  + +P +R
Sbjct: 122 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 181

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 182 FFCWAAERQGFAHASRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 241

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KL E    + +T
Sbjct: 242 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 301

Query: 221 YKWLIKGFCNVGDLIEASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +LIEA+++WN M+D G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 302 YTVLLNGWCRVRNLIEAARIWNDMIDHGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMK 361

Query: 281 SNRMNDLIPSTYSLMIRWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y++MIR  C ++ +  A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 362 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 421

Query: 341 REAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K     T+++ +MI+   EP +HT+ M+
Sbjct: 422 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMI 481

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+   KS EA +Y+E ++ +G+
Sbjct: 482 MKSYFVARNYEMGRAVWDEMIKKGICPDDNSYTVLIRGLISEGKSREACRYLEEMLDKGM 541

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 542 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRCKQR 595

BLAST of HG10004365 vs. ExPASy Swiss-Prot
Match: Q9LEQ7 (Pentatricopeptide repeat-containing protein At5g14820, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g14820 PE=2 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 4.8e-64
Identity = 141/476 (29.62%), Positives = 254/476 (53.36%), Query Frame = 0

Query: 41  IPPAINPTHLRRVCTVLYQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           +  + NP  + RVC V+  +  +    + + L     +LSH+  ++V   F  + +P +R
Sbjct: 121 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 180

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 181 FFCWAAERQGFAHDSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 240

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KL E    + +T
Sbjct: 241 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 300

Query: 221 YKWLIKGFCNVGDLIEASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +LIEA+++WN M+D G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 301 YTVLLNGWCRVRNLIEAARIWNDMIDHGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMK 360

Query: 281 SNRMNDLIPSTYSLMIRWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y++MIR  C ++ +  A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 361 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 420

Query: 341 REAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K     T+++ +MI+   EP +HT+ M+
Sbjct: 421 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMI 480

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+   KS EA +Y+E ++ +G+
Sbjct: 481 MKSYFVARNYEMGRAVWDEMIKKGICPDDNSYTVLIRGLISEGKSREACRYLEEMLDKGM 540

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 541 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRCKQR 594

BLAST of HG10004365 vs. ExPASy Swiss-Prot
Match: Q9FVX2 (Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g77360 PE=2 SV=2)

HSP 1 Score: 160.6 bits (405), Expect = 4.6e-38
Identity = 103/419 (24.58%), Positives = 210/419 (50.12%), Query Frame = 0

Query: 62  NSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNKL 121
           +SP L L S L      +S E    V N F  +    YRFFQ++E   ++ H+  +++ +
Sbjct: 81  SSPQLVLDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEKQRHYEHSVRAYHMM 140

Query: 122 IDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGFG 181
           I+   K R   L+W L+  M +++++  +TF + +R  A A+++ + +  F++M+ Y   
Sbjct: 141 IESTAKIRQYKLMWDLINAMRKKKMLNVETFCIVMRKYARAQKVDEAIYAFNVMEKYDLP 200

Query: 182 YSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKMW 241
            +L+  N ++  LC  K V +A+ +   + +    D  TY  L++G+    +L +A +++
Sbjct: 201 PNLVAFNGLLSALCKSKNVRKAQEVFENMRDRFTPDSKTYSILLEGWGKEPNLPKAREVF 260

Query: 242 NLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPST--YSLMIRWL 301
             M+D G  P++     M+++L K  + DEAL +   VRS   +   P+T  YS+++   
Sbjct: 261 REMIDAGCHPDIVTYSIMVDILCKAGRVDEALGI---VRSMDPSICKPTTFIYSVLVHTY 320

Query: 302 CNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIEN----PDL 361
             + ++ +A   F EM + G++AD +  +SLI       R +  Y +++ +++    P+ 
Sbjct: 321 GTENRLEEAVDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYRVLKEMKSKGVTPNS 380

Query: 362 GVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKR----------- 421
              + +++ L+     +EA  VFR+MI+  CEP   TY M+++    K+           
Sbjct: 381 KSCNIILRHLIERGEKDEAFDVFRKMIKV-CEPDADTYTMVIKMFCEKKEMETADKVWKY 440

Query: 422 -GRKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEE 463
             +KG+ P ++  ++ + GL +   + +A   +E +++ G+      + +       EE
Sbjct: 441 MRKKGVFPSMHTFSVLINGLCEERTTQKACVLLEEMIEMGIRPSGVTFGRLRQLLIKEE 495

BLAST of HG10004365 vs. ExPASy TrEMBL
Match: A0A5A7VH10 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G00640 PE=4 SV=1)

HSP 1 Score: 929.5 bits (2401), Expect = 6.0e-267
Identity = 459/501 (91.62%), Positives = 476/501 (95.01%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MAAIT+RRIIIN FSFSFIFHQRF+PFTSDST A    +NIP AI+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSTAA----DNIPSAIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP +KLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHT VSFNK
Sbjct: 61  QNSPDIKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTVVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
            YSL+TLNKVVEKLC CKLVDEAKFLVMKLNEWIKADGVTYK LIKGFCNVGDLIEASKM
Sbjct: 181 CYSLVTLNKVVEKLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WNLMVDEGFEPEMEAVEEM+NVLFKTNK DEALKLFQAVRSNRMNDLIPSTY L+IRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMVNVLFKTNKLDEALKLFQAVRSNRMNDLIPSTYRLVIRWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NK KVRQA++VFDEMHKRGLE DNS HSSLIYGLLARGRRREAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVRQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
           LIKGLL+LKRANEATQVFREMIE GCEPIMHTYIMLLQGHLGKRGRKGLDP VNFD+IFV
Sbjct: 361 LIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGLDPFVNFDSIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKN KSLEATKYVER+MKRGLEVPRFDYNKFLHYYSN+EGVVMFREVGNRLREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERIMKRGLEVPRFDYNKFLHYYSNDEGVVMFREVGNRLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKMTTRDRRRN
Sbjct: 481 DLADIFQRYGEKMTTRDRRRN 497

BLAST of HG10004365 vs. ExPASy TrEMBL
Match: A0A1S3BDH4 (putative pentatricopeptide repeat-containing protein At1g26500 OS=Cucumis melo OX=3656 GN=LOC103488475 PE=4 SV=1)

HSP 1 Score: 927.5 bits (2396), Expect = 2.3e-266
Identity = 458/501 (91.42%), Positives = 475/501 (94.81%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MAAIT+RRIIIN FSFSFIFHQRF+PFTSDST A    +NIP AI+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSTAA----DNIPSAIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP +KLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHT VSFNK
Sbjct: 61  QNSPDIKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTVVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
            YSL+TLNKVVE LC CKLVDEAKFLVMKLNEWIKADGVTYK LIKGFCNVGDLIEASKM
Sbjct: 181 CYSLVTLNKVVENLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WNLMVDEGFEPEMEAVEEM+NVLFKTNK DEALKLFQAVRSNRMNDLIPSTY L+IRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMVNVLFKTNKLDEALKLFQAVRSNRMNDLIPSTYRLVIRWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NK KVRQA++VFDEMHKRGLE DNS HSSLIYGLLARGRRREAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVRQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
           LIKGLL+LKRANEATQVFREMIE GCEPIMHTYIMLLQGHLGKRGRKGLDP VNFD+IFV
Sbjct: 361 LIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGLDPFVNFDSIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKN KSLEATKYVER+MKRGLEVPRFDYNKFLHYYSN+EGVVMFREVGNRLREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERIMKRGLEVPRFDYNKFLHYYSNDEGVVMFREVGNRLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKMTTRDRRRN
Sbjct: 481 DLADIFQRYGEKMTTRDRRRN 497
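The Identity and Positives figures in each HSP header can be recomputed from the alignment itself. The following is a minimal sketch, assuming the usual BLAST midline convention seen in the alignments above (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap). The function names `count_midline` and `percent` are illustrative and are not part of any BLAST tool.

```python
def count_midline(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one midline segment."""
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return identities, positives, len(midline)


def percent(count: int, length: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100 * count / length, 2)


# Midline of the Query 421..480 segment of the A0A1S3BDH4 alignment.
midline = "GGLVKN KSLEATKYVER+MKRGLEVPRFDYNKFLHYYSN+EGVVMFREVGNRLREVGLV"
print(count_midline(midline))                 # (57, 59, 60)

# Whole-HSP counts taken from the header line: 458/501 and 475/501.
print(percent(458, 501), percent(475, 501))   # 91.42 94.81
```

Summing the per-segment counts over all segments of an HSP should reproduce the header totals, which gives a quick consistency check on a copied alignment.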

BLAST of HG10004365 vs. ExPASy TrEMBL
Match: A0A0A0KD74 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G366430 PE=4 SV=1)

HSP 1 Score: 927.2 bits (2395), Expect = 3.0e-266
Identity = 459/501 (91.62%), Positives = 477/501 (95.21%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MAAIT+RRIIIN FSFSFIFHQRF+PFTSDS+TAA   +NIP  I+ THLRRVCTVLYQQ
Sbjct: 1   MAAITYRRIIINNFSFSFIFHQRFSPFTSDSSTAA---DNIPQPIDSTHLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP LKLH+KLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET PNFTHTAVSFNK
Sbjct: 61  QNSPDLKLHTKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETHPNFTHTAVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           LIDVVGKSRNIDLLWGL+QEMGRRRLV DKTFVVALRTLA ARELKKCVEFFHLM+GYGF
Sbjct: 121 LIDVVGKSRNIDLLWGLIQEMGRRRLVNDKTFVVALRTLATARELKKCVEFFHLMNGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
            YSL+TLN+VVEKLC CKLVDEAKFLVMKLNEWIKADGVTYK LIKGFCNVGDLIEASKM
Sbjct: 181 CYSLVTLNRVVEKLCGCKLVDEAKFLVMKLNEWIKADGVTYKLLIKGFCNVGDLIEASKM 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WNLMVDEGFEPEMEAVEEMMNVLFKTNK DEALKLFQA+RS+RMNDLIPSTYSL+IRWLC
Sbjct: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKLDEALKLFQALRSDRMNDLIPSTYSLVIRWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NK KV QA++VFDEMHKRGLE DNS HSSLIYGLLARGRRREAYNIMRRIENPDL VYHA
Sbjct: 301 NKGKVGQAFIVFDEMHKRGLEVDNSVHSSLIYGLLARGRRREAYNIMRRIENPDLDVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
            IKGLL+LKRANEATQVFREMIE GCEPIMHTYIMLLQGHLGKRGRKG DPLVNFD+IFV
Sbjct: 361 FIKGLLKLKRANEATQVFREMIERGCEPIMHTYIMLLQGHLGKRGRKGSDPLVNFDSIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKN KSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGN+LREVGLV
Sbjct: 421 GGLVKNGKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNKLREVGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKMTTRDRRRN
Sbjct: 481 DLADIFQRYGEKMTTRDRRRN 498

BLAST of HG10004365 vs. ExPASy TrEMBL
Match: A0A6J1BPL8 (putative pentatricopeptide repeat-containing protein At1g26500 OS=Momordica charantia OX=3673 GN=LOC111004625 PE=4 SV=1)

HSP 1 Score: 890.6 bits (2300), Expect = 3.1e-255
Identity = 437/501 (87.23%), Positives = 459/501 (91.62%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTTAATPHNNIPPAINPTHLRRVCTVLYQQ 60
           MA + +RRIII+KFS +FIFHQRF+ FTS+ST   T     P AINP  LRRVCTVLYQQ
Sbjct: 1   MAVVRYRRIIIDKFSLTFIFHQRFSSFTSNSTAHPTA---APAAINPARLRRVCTVLYQQ 60

Query: 61  QNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120
           QNSP  +LHSKL ACNFNLSHEFFL+VCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK
Sbjct: 61  QNSPDPRLHSKLRACNFNLSHEFFLRVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNK 120

Query: 121 LIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGF 180
           +IDV G SRNI LLW L+QE GRRRLVTDKTFV+ALRTLAAARELKKCVEFFHLMDGYGF
Sbjct: 121 MIDVAGNSRNIGLLWVLIQEAGRRRLVTDKTFVIALRTLAAARELKKCVEFFHLMDGYGF 180

Query: 181 GYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKM 240
           GYSL+TLNKVVEKLC CKLVDEAKFLV KL EWIK DGVTYKWLIKGFC VGDLIEASK+
Sbjct: 181 GYSLVTLNKVVEKLCSCKLVDEAKFLVFKLKEWIKPDGVTYKWLIKGFCEVGDLIEASKL 240

Query: 241 WNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMIRWLC 300
           WN MVDEGFEP+MEAVEEMMN LFKTNK DEALKLFQAVRSNRMNDLIPSTYSL+ +WLC
Sbjct: 241 WNFMVDEGFEPDMEAVEEMMNALFKTNKPDEALKLFQAVRSNRMNDLIPSTYSLVNKWLC 300

Query: 301 NKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLGVYHA 360
           NK KVRQAYVVFDEMHKRGLE DNSAHSSLIYGLLARGRRRE Y IM RIENPDLGVYHA
Sbjct: 301 NKGKVRQAYVVFDEMHKRGLETDNSAHSSLIYGLLARGRRREGYRIMERIENPDLGVYHA 360

Query: 361 LIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFDTIFV 420
           LIKGLLRL+RANEATQVFREMIE GCEPIMHTY+MLLQGHLGKRGR+G DPLVNFDTIFV
Sbjct: 361 LIKGLLRLRRANEATQVFREMIERGCEPIMHTYVMLLQGHLGKRGRRGFDPLVNFDTIFV 420

Query: 421 GGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLV 480
           GGLVKN +SLEATKYVER+MKRG+EVPRFDYNKFLHYYSNEEGVVMFREVGNRLRE GLV
Sbjct: 421 GGLVKNGQSLEATKYVERMMKRGVEVPRFDYNKFLHYYSNEEGVVMFREVGNRLREAGLV 480

Query: 481 DLADIFQRYGEKMTTRDRRRN 502
           DLADIFQRYGEKM TRDRRRN
Sbjct: 481 DLADIFQRYGEKMATRDRRRN 498

BLAST of HG10004365 vs. ExPASy TrEMBL
Match: A0A6J1GIN7 (putative pentatricopeptide repeat-containing protein At1g26500 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454535 PE=4 SV=1)

HSP 1 Score: 881.3 bits (2276), Expect = 1.9e-252
Identity = 434/507 (85.60%), Positives = 466/507 (91.91%), Query Frame = 0

Query: 1   MAAITHRRIIINKFSFSFIFHQRFTPFTSDSTT------AATPHNNIPPAINPTHLRRVC 60
           MA + +R III KF FSF+FHQRF+  TSDS         A PHN+I  AI+PTHLRRVC
Sbjct: 1   MAVVRYRHIIITKFPFSFVFHQRFSSLTSDSINPTAAPPTAAPHNDIRLAIDPTHLRRVC 60

Query: 61  TVLYQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHT 120
           TVLYQQQNS  L+LHSKL AC+FNLSHEFFLQVCNTFP SWRPVYRFFQFTETDPNFTHT
Sbjct: 61  TVLYQQQNSSDLRLHSKLRACSFNLSHEFFLQVCNTFPFSWRPVYRFFQFTETDPNFTHT 120

Query: 121 AVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHL 180
           AVSFNK+ID+VGKSRNIDLLW L+QEMGRRRLVTDKTFVVALRTLAAARELKKCVE FHL
Sbjct: 121 AVSFNKMIDIVGKSRNIDLLWDLIQEMGRRRLVTDKTFVVALRTLAAARELKKCVELFHL 180

Query: 181 MDGYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDL 240
           M+GYGFGYSL+TLNKVVEKLC CKLVDEAKFLV+KL EWIKADGVTYK LIKGFC+VGD+
Sbjct: 181 MNGYGFGYSLVTLNKVVEKLCGCKLVDEAKFLVLKLKEWIKADGVTYKLLIKGFCDVGDV 240

Query: 241 IEASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSL 300
           IEASK+WNLMV++GFE EMEAVEEMMNVLFK NKF EALKLFQA+RSNRMNDLIPSTYSL
Sbjct: 241 IEASKIWNLMVEDGFEAEMEAVEEMMNVLFKANKFGEALKLFQAMRSNRMNDLIPSTYSL 300

Query: 301 MIRWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPD 360
           +I+WLCNK K+RQAY VFDEM KRG EADNSAHSSLIYG+LARGRRREAYNIM RIENPD
Sbjct: 301 VIKWLCNKGKMRQAYNVFDEMLKRGFEADNSAHSSLIYGVLARGRRREAYNIMGRIENPD 360

Query: 361 LGVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVN 420
           L VYHA+IKGLLRL+RA+EATQVFR MIE GCEPIMHTY+MLLQGHLG+RGRKG+DPLVN
Sbjct: 361 LSVYHAMIKGLLRLRRASEATQVFRVMIERGCEPIMHTYVMLLQGHLGRRGRKGMDPLVN 420

Query: 421 FDTIFVGGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRL 480
           FDTIFVGGLVK  KSLEATKYVER+MKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRL
Sbjct: 421 FDTIFVGGLVKFGKSLEATKYVERVMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRL 480

Query: 481 REVGLVDLADIFQRYGEKMTTRDRRRN 502
           REVGLVDLADIFQRYGEKMTTRDRRR+
Sbjct: 481 REVGLVDLADIFQRYGEKMTTRDRRRD 507

BLAST of HG10004365 vs. TAIR 10
Match: AT1G26500.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 614.8 bits (1584), Expect = 6.3e-176
Identity = 303/505 (60.00%), Positives = 390/505 (77.23%), Query Frame = 0

Query: 1   MAAITHRRIII--NKFSFSFIFHQRFTPFTSDST-TAATPHNNIPPAINPTHLRRVCTVL 60
           +A +T RR+I   N     FI + RF  F+++ T T  TP       IN  HL RVCT+L
Sbjct: 3   VAVVTSRRMINIGNSIRRCFILNHRF--FSTELTPTTITP-------INQDHLLRVCTIL 62

Query: 61  YQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTET-DPNFTHTAV 120
           YQQQNSP  +L SKL +  F L+HEFFLQVCN FPLSWRPV+RFF +++T  P+FTHT+ 
Sbjct: 63  YQQQNSPDSRLVSKLSSTKFQLTHEFFLQVCNNFPLSWRPVHRFFLYSQTHHPDFTHTST 122

Query: 121 SFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMD 180
           + NK++ ++G SRN+DL W L QE+G+R LV DKTF + L+TLA+ARELKKCV +FHLM+
Sbjct: 123 TSNKMLAIIGNSRNMDLFWELAQEIGKRGLVNDKTFRIVLKTLASARELKKCVNYFHLMN 182

Query: 181 GYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIE 240
           G+G+ Y++ T+N+ VE LC  KLV+EAKF+ +KL E+IK D +TY+ +I+GFC+VGDLIE
Sbjct: 183 GFGYLYNVETMNRGVETLCKEKLVEEAKFVFIKLKEFIKPDEITYRTMIQGFCDVGDLIE 242

Query: 241 ASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPSTYSLMI 300
           A+K+WNLM+DEGF+ ++EA +++M  L K N+FDEA K+F  + S R  DL    Y +MI
Sbjct: 243 AAKLWNLMMDEGFDVDIEAGKKIMETLLKKNQFDEASKVFYVMVSKRGGDLDGGFYRVMI 302

Query: 301 RWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIENPDLG 360
            WLC   ++  A  VFDEM +RG+  DN   +SLIYGLL + R  EAY ++  +ENPD+ 
Sbjct: 303 DWLCKNGRIDMARKVFDEMRERGVYVDNLTWASLIYGLLVKRRVVEAYGLVEGVENPDIS 362

Query: 361 VYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKRGRKGLDPLVNFD 420
           +YH LIKGL+++KRA+EAT+VFR+MI+ GCEPIMHTY+MLLQGHLG+RGRKG DPLVNFD
Sbjct: 363 IYHGLIKGLVKIKRASEATEVFRKMIQRGCEPIMHTYLMLLQGHLGRRGRKGPDPLVNFD 422

Query: 421 TIFVGGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEEGVVMFREVGNRLRE 480
           TIFVGG++K  K LE TKY+ER +KRGLEVPRFDY+KFLHYYSNEEGVVMF E+  +LRE
Sbjct: 423 TIFVGGMIKAGKRLETTKYIERTLKRGLEVPRFDYSKFLHYYSNEEGVVMFEEMAKKLRE 482

Query: 481 VGLVDLADIFQRYGEKMTTRDRRRN 502
           V L DLADIFQRYGEKMTTR+RRR+
Sbjct: 483 VSLFDLADIFQRYGEKMTTRERRRD 498

BLAST of HG10004365 vs. TAIR 10
Match: AT3G62470.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 249.2 bits (635), Expect = 6.9e-66
Identity = 142/476 (29.83%), Positives = 256/476 (53.78%), Query Frame = 0

Query: 41  IPPAINPTHLRRVCTVLYQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           +  + NP  + RVC V+  +  +    + + L     +LSH+  ++V   F  + +P +R
Sbjct: 122 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 181

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 182 FFCWAAERQGFAHDSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 241

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KL E    + +T
Sbjct: 242 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 301

Query: 221 YKWLIKGFCNVGDLIEASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +LIEA+++WN M+D+G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 302 YTVLLNGWCRVRNLIEAARIWNDMIDQGLKPDIVAHNVMLEGLLRSRKKSDAIKLFHVMK 361

Query: 281 SNRMNDLIPSTYSLMIRWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y++MIR  C ++ +  A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 362 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 421

Query: 341 REAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K    AT+++ +MI+   EP +HT+ M+
Sbjct: 422 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHATRIYNKMIQNEIEPSIHTFNMI 481

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+   KS EA +Y+E ++ +G+
Sbjct: 482 MKSYFMARNYEMGRAVWEEMIKKGICPDDNSYTVLIRGLIGEGKSREACRYLEEMLDKGM 541

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 542 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRFKQR 595

BLAST of HG10004365 vs. TAIR 10
Match: AT3G62540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 247.3 bits (630), Expect = 2.6e-65
Identity = 141/476 (29.62%), Positives = 254/476 (53.36%), Query Frame = 0

Query: 41  IPPAINPTHLRRVCTVLYQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           +  + NP  + RVC V+  +  +    + + L     +LSH+  ++V   F  + +P +R
Sbjct: 122 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 181

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 182 FFCWAAERQGFAHASRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 241

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KL E    + +T
Sbjct: 242 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 301

Query: 221 YKWLIKGFCNVGDLIEASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +LIEA+++WN M+D G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 302 YTVLLNGWCRVRNLIEAARIWNDMIDHGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMK 361

Query: 281 SNRMNDLIPSTYSLMIRWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y++MIR  C ++ +  A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 362 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 421

Query: 341 REAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K     T+++ +MI+   EP +HT+ M+
Sbjct: 422 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMI 481

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+   KS EA +Y+E ++ +G+
Sbjct: 482 MKSYFVARNYEMGRAVWDEMIKKGICPDDNSYTVLIRGLISEGKSREACRYLEEMLDKGM 541

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 542 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRCKQR 595

BLAST of HG10004365 vs. TAIR 10
Match: AT5G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 246.9 bits (629), Expect = 3.4e-65
Identity = 141/476 (29.62%), Positives = 254/476 (53.36%), Query Frame = 0

Query: 41  IPPAINPTHLRRVCTVLYQQQNSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYR 100
           +  + NP  + RVC V+  +  +    + + L     +LSH+  ++V   F  + +P +R
Sbjct: 121 VESSTNPEEVERVCKVI-DELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFR 180

Query: 101 FFQFTETDPNFTHTAVSFNKLIDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLA 160
           FF +      F H + ++N ++ ++ K+R  + +  +++EMG + L+T +TF +A++  A
Sbjct: 181 FFCWAAERQGFAHDSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFA 240

Query: 161 AARELKKCVEFFHLMDGYGFGYSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVT 220
           AA+E KK V  F LM  Y F   + T+N +++ L   KL  EA+ L  KL E    + +T
Sbjct: 241 AAKERKKAVGIFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMT 300

Query: 221 YKWLIKGFCNVGDLIEASKMWNLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVR 280
           Y  L+ G+C V +LIEA+++WN M+D G +P++ A   M+  L ++ K  +A+KLF  ++
Sbjct: 301 YTVLLNGWCRVRNLIEAARIWNDMIDHGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMK 360

Query: 281 SNRMNDLIPSTYSLMIRWLCNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRR 340
           S      + S Y++MIR  C ++ +  A   FD+M   GL+ D + ++ LI G   + + 
Sbjct: 361 SKGPCPNVRS-YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKL 420

Query: 341 REAYNIMRRIEN----PDLGVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIML 400
              Y +++ ++     PD   Y+ALIK +   K     T+++ +MI+   EP +HT+ M+
Sbjct: 421 DTVYELLKEMQEKGHPPDGKTYNALIKLMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMI 480

Query: 401 LQGHLGKRG------------RKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGL 460
           ++ +   R             +KG+ P  N  T+ + GL+   KS EA +Y+E ++ +G+
Sbjct: 481 MKSYFVARNYEMGRAVWDEMIKKGICPDDNSYTVLIRGLISEGKSREACRYLEEMLDKGM 540

Query: 461 EVPRFDYNKFLHYYSNEEGVVMFREVGNRLREVGLVDLADIFQRYGEKMTTRDRRR 501
           + P  DYNKF   +       +F E+  R +  G    A+IF R+ +    R ++R
Sbjct: 541 KTPLIDYNKFAADFHRGGQPEIFEELAQRAKFSGKFAAAEIFARWAQMTRRRCKQR 594

BLAST of HG10004365 vs. TAIR 10
Match: AT1G77360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 160.6 bits (405), Expect = 3.2e-39
Identity = 103/419 (24.58%), Positives = 210/419 (50.12%), Query Frame = 0

Query: 62  NSPHLKLHSKLLACNFNLSHEFFLQVCNTFPLSWRPVYRFFQFTETDPNFTHTAVSFNKL 121
           +SP L L S L      +S E    V N F  +    YRFFQ++E   ++ H+  +++ +
Sbjct: 81  SSPQLVLDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEKQRHYEHSVRAYHMM 140

Query: 122 IDVVGKSRNIDLLWGLVQEMGRRRLVTDKTFVVALRTLAAARELKKCVEFFHLMDGYGFG 181
           I+   K R   L+W L+  M +++++  +TF + +R  A A+++ + +  F++M+ Y   
Sbjct: 141 IESTAKIRQYKLMWDLINAMRKKKMLNVETFCIVMRKYARAQKVDEAIYAFNVMEKYDLP 200

Query: 182 YSLMTLNKVVEKLCVCKLVDEAKFLVMKLNEWIKADGVTYKWLIKGFCNVGDLIEASKMW 241
            +L+  N ++  LC  K V +A+ +   + +    D  TY  L++G+    +L +A +++
Sbjct: 201 PNLVAFNGLLSALCKSKNVRKAQEVFENMRDRFTPDSKTYSILLEGWGKEPNLPKAREVF 260

Query: 242 NLMVDEGFEPEMEAVEEMMNVLFKTNKFDEALKLFQAVRSNRMNDLIPST--YSLMIRWL 301
             M+D G  P++     M+++L K  + DEAL +   VRS   +   P+T  YS+++   
Sbjct: 261 REMIDAGCHPDIVTYSIMVDILCKAGRVDEALGI---VRSMDPSICKPTTFIYSVLVHTY 320

Query: 302 CNKAKVRQAYVVFDEMHKRGLEADNSAHSSLIYGLLARGRRREAYNIMRRIEN----PDL 361
             + ++ +A   F EM + G++AD +  +SLI       R +  Y +++ +++    P+ 
Sbjct: 321 GTENRLEEAVDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYRVLKEMKSKGVTPNS 380

Query: 362 GVYHALIKGLLRLKRANEATQVFREMIETGCEPIMHTYIMLLQGHLGKR----------- 421
              + +++ L+     +EA  VFR+MI+  CEP   TY M+++    K+           
Sbjct: 381 KSCNIILRHLIERGEKDEAFDVFRKMIKV-CEPDADTYTMVIKMFCEKKEMETADKVWKY 440

Query: 422 -GRKGLDPLVNFDTIFVGGLVKNEKSLEATKYVERLMKRGLEVPRFDYNKFLHYYSNEE 463
             +KG+ P ++  ++ + GL +   + +A   +E +++ G+      + +       EE
Sbjct: 441 MRKKGVFPSMHTFSVLINGLCEERTTQKACVLLEEMIEMGIRPSGVTFGRLRQLLIKEE 495

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038884840.1 | 1.1e-278 | 95.21 | putative pentatricopeptide repeat-containing protein At1g26500 [Benincasa hispid...
KAA0064759.1 | 1.2e-266 | 91.62 | putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_008445460.1 | 4.7e-266 | 91.42 | PREDICTED: putative pentatricopeptide repeat-containing protein At1g26500 [Cucum...
XP_004144262.1 | 6.1e-266 | 91.62 | putative pentatricopeptide repeat-containing protein At1g26500 [Cucumis sativus]...
XP_022131400.1 | 6.4e-255 | 87.23 | putative pentatricopeptide repeat-containing protein At1g26500 [Momordica charan...
Match Name | E-value | Identity (%) | Description
Q9FZD4 | 8.8e-175 | 60.00 | Putative pentatricopeptide repeat-containing protein At1g26500 OS=Arabidopsis th...
Q9LZP3 | 9.8e-65 | 29.83 | Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidop...
Q3EAF8 | 3.7e-64 | 29.62 | Pentatricopeptide repeat-containing protein At3g62540, mitochondrial OS=Arabidop...
Q9LEQ7 | 4.8e-64 | 29.62 | Pentatricopeptide repeat-containing protein At5g14820, mitochondrial OS=Arabidop...
Q9FVX2 | 4.6e-38 | 24.58 | Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidop...
Match Name | E-value | Identity (%) | Description
A0A5A7VH10 | 6.0e-267 | 91.62 | Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa...
A0A1S3BDH4 | 2.3e-266 | 91.42 | putative pentatricopeptide repeat-containing protein At1g26500 OS=Cucumis melo O...
A0A0A0KD74 | 3.0e-266 | 91.62 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G366430 PE=4 SV=1
A0A6J1BPL8 | 3.1e-255 | 87.23 | putative pentatricopeptide repeat-containing protein At1g26500 OS=Momordica char...
A0A6J1GIN7 | 1.9e-252 | 85.60 | putative pentatricopeptide repeat-containing protein At1g26500 isoform X1 OS=Cuc...
Match Name | E-value | Identity (%) | Description
AT1G26500.1 | 6.3e-176 | 60.00 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G62470.1 | 6.9e-66 | 29.83 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G62540.1 | 2.6e-65 | 29.62 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G14820.1 | 3.4e-65 | 29.62 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G77360.1 | 3.2e-39 | 24.58 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 307..498; e-value: 2.7E-18; score: 68.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 100..306; e-value: 3.7E-26; score: 94.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 353..399; e-value: 2.0E-7; score: 31.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 291..334; e-value: 5.6E-10; score: 39.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 220..251; e-value: 1.1E-5; score: 23.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 357..388; e-value: 4.8E-7; score: 27.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 291..323; e-value: 1.5E-6; score: 26.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 259..281; e-value: 0.23; score: 11.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 219..249; e-value: 6.2E-4; score: 19.8
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 217..251; score: 11.651919
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 354..388; score: 12.035565
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 288..322; score: 9.941957
None | No IPR available | PANTHER | PTHR47942 | TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATED | coord: 28..501
None | No IPR available | PANTHER | PTHR47942:SF45 | OS07G0599201 PROTEIN | coord: 28..501
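Several of the IPR002885 matches listed above come from different member databases but cover nearly the same residues. One way to see how many distinct repeat regions they imply is to merge overlapping coordinate ranges. The sketch below is illustrative only: `merge_intervals` is a hypothetical helper, and treating overlapping matches as a single region is an assumption of this example, not something InterPro reports.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges; coordinates are inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this match reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# All IPR002885 (pentatricopeptide repeat) coordinates from the table above.
ppr_hits = [
    (353, 399), (291, 334),               # PFAM PF13041
    (220, 251), (357, 388), (291, 323),   # TIGRFAM TIGR00756
    (259, 281), (219, 249),               # PFAM PF01535
    (217, 251), (354, 388), (288, 322),   # PROSITE PS51375
]

regions = merge_intervals(ppr_hits)
print(regions)   # [(217, 251), (259, 281), (288, 334), (353, 399)]
```

Under this assumption the ten matches collapse into four non-overlapping regions, all of which fall inside the two GENE3D domain ranges (100..306 and 307..498).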

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10004365.1 | HG10004365.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding