HG10011190.1 (mRNA) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10011190.1
TypemRNA
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat
LocationChr01: 3339226 .. 3341007 (-)
Sequence length1782
RNA-Seq ExpressionHG10011190.1
SyntenyHG10011190.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCAACAATTCCTCATTCAAGCTCTCCAAATTCTCCATTTCCCTCTCCAAACCTTCCTTCCGCTACTCCACATGGCACTCCCCACCACCGCCGGCGGCGGCAGCTGACCCTGTACTCGCCGCCGTTTCCACAGCCATCAATAACGCCCAGACAAAGCCTCTCGCCTCTTCTCTCCGCCGGCTCCTCCCTTCCTTCAAACCCCACCATTTCATCGACCTCATCAATCATAACCCTTTCTCTCTCTCCCCTCTCTCTCTCTTCTCCTTCTTCAATTGGCTCTCTTCTATCCCCACCTTTCGCCACACCCTCCAATCCTACTGCGCTATGGCTGATTTCCTCTCCGCCCATCAAATGTTCGAAGAATCTCAATCGGTTGTTCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGGTCTTTGCAGCGATTCTTGAAATTGCAGGTACGCGTTGTTCGAATTTTGTATTTGACGCTTTGATGATTGCGTATTCGGATTCCGGGTTCATCTCCGATGCGATTCAGTGTTTTAGGCTGGTTAGGAAGAGCAATTTTCAAATCCCGTTTCATGGTTGTGGGTACTTGCTTGATAAAATGATGAATTCGAACTCCCCTGTTACGATCTGGACGTTTTATTCGGAAATTTTGGATTCTGGGTTCCCGCCTAATGTGAAGTATTTCAATATTTTGATCAATAAGTTCTGTAAAGAGGGAAGTATTAGAGATGCCAGGTTGATTTTCGATGAAATTGGGAAGAGGGGTTTGCGTCCTACAACTGTTAGTTTCAATACCTTGATTAATGGGTTCTGTAAATCTAGAAATTTAGATGAGGGGTTTAGGTTGAAGAAGATCATGGAAGAGAATAGAATATATCCTGATGTCTTCACTTACAGTGTTCTGATTCATGGGTTATGTAAGGAAGGTAGGTTAGATGATGCAGAACAACTGTTTGATGAAATGCAGCAAAGAGGATTGAGACCAAACGACGTTACGTTCACCGCATTGATTGACGGGCAATGTAGGAGCCAACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAACCATGGGAGTGAAACCAGATTTAGTTATGTACAATACACTCTTAAATGGCCTTTGCAAGGTTGGGGATGTTAATAAAGCTAGGAAGCTAATTGATGAAATGAAAATGGTGGGGATGAAACCTGATAAAATCACTTACACTACACTCATAGATGGTTACTGCAAAGAGGGAGATCTAGAATCAGCCATGGAGATTAGGAAAGGAATGAATGAAGAAGGGATTGTGCTTGATAATGTAGCATTCACAGCCCTTATTTCAGGTTTGTGTAGAGATGGAAGGGTGGTGGATGCAGAGAGGACCTTGAGGGAGATGATGGGAGCTGGGATGAAACCTGACGATGCCACGTATACTATGGTGATCGACGGGTTTTGCAAGAAAGGCAATGTTAAGACGGGGTTTAAGCTGCTGAAGGAGATGCAGACCAATGGTCATAACCCTGGTGTCATAACTTACAATGTGCTTATGAATGGACTTTGCAAGCAAGGACAGATGAAGAATGCAAATATGCTATTGGAAGCAATGCTTAATTTAGGAGTAACTCCAGATGACATTACATACAACATTCTTTTGGAAGGGCACTGTAAGAATGGAAGAGCAGAAGATTTCCTTAAACTCAGAAATGAGAAAGGACTCGTAGTAGATTATGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCCGTGAAGGATCGTCGAAAGAGGTGA

mRNA sequence

ATGGCCAACAATTCCTCATTCAAGCTCTCCAAATTCTCCATTTCCCTCTCCAAACCTTCCTTCCGCTACTCCACATGGCACTCCCCACCACCGCCGGCGGCGGCAGCTGACCCTGTACTCGCCGCCGTTTCCACAGCCATCAATAACGCCCAGACAAAGCCTCTCGCCTCTTCTCTCCGCCGGCTCCTCCCTTCCTTCAAACCCCACCATTTCATCGACCTCATCAATCATAACCCTTTCTCTCTCTCCCCTCTCTCTCTCTTCTCCTTCTTCAATTGGCTCTCTTCTATCCCCACCTTTCGCCACACCCTCCAATCCTACTGCGCTATGGCTGATTTCCTCTCCGCCCATCAAATGTTCGAAGAATCTCAATCGGTTGTTCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGGTCTTTGCAGCGATTCTTGAAATTGCAGGTACGCGTTGTTCGAATTTTGTATTTGACGCTTTGATGATTGCGTATTCGGATTCCGGGTTCATCTCCGATGCGATTCAGTGTTTTAGGCTGGTTAGGAAGAGCAATTTTCAAATCCCGTTTCATGGTTGTGGGTACTTGCTTGATAAAATGATGAATTCGAACTCCCCTGTTACGATCTGGACGTTTTATTCGGAAATTTTGGATTCTGGGTTCCCGCCTAATGTGAAGTATTTCAATATTTTGATCAATAAGTTCTGTAAAGAGGGAAGTATTAGAGATGCCAGGTTGATTTTCGATGAAATTGGGAAGAGGGGTTTGCGTCCTACAACTGTTAGTTTCAATACCTTGATTAATGGGTTCTGTAAATCTAGAAATTTAGATGAGGGGTTTAGGTTGAAGAAGATCATGGAAGAGAATAGAATATATCCTGATGTCTTCACTTACAGTGTTCTGATTCATGGGTTATGTAAGGAAGGTAGGTTAGATGATGCAGAACAACTGTTTGATGAAATGCAGCAAAGAGGATTGAGACCAAACGACGTTACGTTCACCGCATTGATTGACGGGCAATGTAGGAGCCAACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAACCATGGGAGTGAAACCAGATTTAGTTATGTACAATACACTCTTAAATGGCCTTTGCAAGGTTGGGGATGTTAATAAAGCTAGGAAGCTAATTGATGAAATGAAAATGGTGGGGATGAAACCTGATAAAATCACTTACACTACACTCATAGATGGTTACTGCAAAGAGGGAGATCTAGAATCAGCCATGGAGATTAGGAAAGGAATGAATGAAGAAGGGATTGTGCTTGATAATGTAGCATTCACAGCCCTTATTTCAGGTTTGTGTAGAGATGGAAGGGTGGTGGATGCAGAGAGGACCTTGAGGGAGATGATGGGAGCTGGGATGAAACCTGACGATGCCACGTATACTATGGTGATCGACGGGTTTTGCAAGAAAGGCAATGTTAAGACGGGGTTTAAGCTGCTGAAGGAGATGCAGACCAATGGTCATAACCCTGGTGTCATAACTTACAATGTGCTTATGAATGGACTTTGCAAGCAAGGACAGATGAAGAATGCAAATATGCTATTGGAAGCAATGCTTAATTTAGGAGTAACTCCAGATGACATTACATACAACATTCTTTTGGAAGGGCACTGTAAGAATGGAAGAGCAGAAGATTTCCTTAAACTCAGAAATGAGAAAGGACTCGTAGTAGATTATGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCCGTGAAGGATCGTCGAAAGAGGTGA

Coding sequence (CDS)

ATGGCCAACAATTCCTCATTCAAGCTCTCCAAATTCTCCATTTCCCTCTCCAAACCTTCCTTCCGCTACTCCACATGGCACTCCCCACCACCGCCGGCGGCGGCAGCTGACCCTGTACTCGCCGCCGTTTCCACAGCCATCAATAACGCCCAGACAAAGCCTCTCGCCTCTTCTCTCCGCCGGCTCCTCCCTTCCTTCAAACCCCACCATTTCATCGACCTCATCAATCATAACCCTTTCTCTCTCTCCCCTCTCTCTCTCTTCTCCTTCTTCAATTGGCTCTCTTCTATCCCCACCTTTCGCCACACCCTCCAATCCTACTGCGCTATGGCTGATTTCCTCTCCGCCCATCAAATGTTCGAAGAATCTCAATCGGTTGTTCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGGTCTTTGCAGCGATTCTTGAAATTGCAGGTACGCGTTGTTCGAATTTTGTATTTGACGCTTTGATGATTGCGTATTCGGATTCCGGGTTCATCTCCGATGCGATTCAGTGTTTTAGGCTGGTTAGGAAGAGCAATTTTCAAATCCCGTTTCATGGTTGTGGGTACTTGCTTGATAAAATGATGAATTCGAACTCCCCTGTTACGATCTGGACGTTTTATTCGGAAATTTTGGATTCTGGGTTCCCGCCTAATGTGAAGTATTTCAATATTTTGATCAATAAGTTCTGTAAAGAGGGAAGTATTAGAGATGCCAGGTTGATTTTCGATGAAATTGGGAAGAGGGGTTTGCGTCCTACAACTGTTAGTTTCAATACCTTGATTAATGGGTTCTGTAAATCTAGAAATTTAGATGAGGGGTTTAGGTTGAAGAAGATCATGGAAGAGAATAGAATATATCCTGATGTCTTCACTTACAGTGTTCTGATTCATGGGTTATGTAAGGAAGGTAGGTTAGATGATGCAGAACAACTGTTTGATGAAATGCAGCAAAGAGGATTGAGACCAAACGACGTTACGTTCACCGCATTGATTGACGGGCAATGTAGGAGCCAACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAACCATGGGAGTGAAACCAGATTTAGTTATGTACAATACACTCTTAAATGGCCTTTGCAAGGTTGGGGATGTTAATAAAGCTAGGAAGCTAATTGATGAAATGAAAATGGTGGGGATGAAACCTGATAAAATCACTTACACTACACTCATAGATGGTTACTGCAAAGAGGGAGATCTAGAATCAGCCATGGAGATTAGGAAAGGAATGAATGAAGAAGGGATTGTGCTTGATAATGTAGCATTCACAGCCCTTATTTCAGGTTTGTGTAGAGATGGAAGGGTGGTGGATGCAGAGAGGACCTTGAGGGAGATGATGGGAGCTGGGATGAAACCTGACGATGCCACGTATACTATGGTGATCGACGGGTTTTGCAAGAAAGGCAATGTTAAGACGGGGTTTAAGCTGCTGAAGGAGATGCAGACCAATGGTCATAACCCTGGTGTCATAACTTACAATGTGCTTATGAATGGACTTTGCAAGCAAGGACAGATGAAGAATGCAAATATGCTATTGGAAGCAATGCTTAATTTAGGAGTAACTCCAGATGACATTACATACAACATTCTTTTGGAAGGGCACTGTAAGAATGGAAGAGCAGAAGATTTCCTTAAACTCAGAAATGAGAAAGGACTCGTAGTAGATTATGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCCGTGAAGGATCGTCGAAAGAGGTGA

Protein sequence

MANNSSFKLSKFSISLSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSLRRLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR
Homology
BLAST of HG10011190.1 vs. NCBI nr
Match: XP_038878138.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g09680 [Benincasa hispida])

HSP 1 Score: 1128.6 bits (2918), Expect = 0.0e+00
Identity = 557/595 (93.61%), Postives = 576/595 (96.81%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISLSKPSFRYSTWHSPPPP--AAAADPVLAAVSTAINNAQTKPLASS 60
           MANNS F LS+FSI LSKPSFRYSTWHSPPPP  AAAADPVLAAVST+I NAQTKPLASS
Sbjct: 1   MANNSLFSLSRFSIPLSKPSFRYSTWHSPPPPPAAAAADPVLAAVSTSITNAQTKPLASS 60

Query: 61  LRRLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQ 120
           LRRLLPSFKPHH IDLINHNPFSLSPLSLFSFFNWLSS+PTFRHT Q+YCAMA+FLSAHQ
Sbjct: 61  LRRLLPSFKPHHLIDLINHNPFSLSPLSLFSFFNWLSSVPTFRHTPQAYCAMANFLSAHQ 120

Query: 121 MFEESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQC 180
           MFE+SQSVVRFLVSRKGKDSAASVFAA+LEI+GTRCSNFVFDALMIAYSDSGF+SDAIQC
Sbjct: 121 MFEDSQSVVRFLVSRKGKDSAASVFAAVLEISGTRCSNFVFDALMIAYSDSGFVSDAIQC 180

Query: 181 FRLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCK 240
           FRLVRKSNFQIPFHGCGYLLDKM NSNSPV IWTFYSEILDSGFPP VKYFNILINKFCK
Sbjct: 181 FRLVRKSNFQIPFHGCGYLLDKMTNSNSPVMIWTFYSEILDSGFPPKVKYFNILINKFCK 240

Query: 241 EGSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFT 300
           EGSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFT
Sbjct: 241 EGSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFT 300

Query: 301 YSVLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQML 360
           YSVLIHGLCKEGRLDDAE+LFDEMQQRGLRPNDVTFTALIDGQCRS R+DSAMNTYQQML
Sbjct: 301 YSVLIHGLCKEGRLDDAERLFDEMQQRGLRPNDVTFTALIDGQCRSGRVDSAMNTYQQML 360

Query: 361 TMGVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLE 420
           TMGVKPDLVMYNTLLNGLCKVG+VN+ARKLIDEMKMVG+KPDKITYTTLIDGYCKEGDLE
Sbjct: 361 TMGVKPDLVMYNTLLNGLCKVGEVNEARKLIDEMKMVGLKPDKITYTTLIDGYCKEGDLE 420

Query: 421 SAMEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVI 480
           SAMEIRKGMNEEG+VLDNVAFTALISGLCRDG+V DAERTLREM  AGMKPDDATYTMVI
Sbjct: 421 SAMEIRKGMNEEGVVLDNVAFTALISGLCRDGKVKDAERTLREMTEAGMKPDDATYTMVI 480

Query: 481 DGFCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVT 540
           DG+CKKG+VKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVT
Sbjct: 481 DGYCKKGDVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVT 540

Query: 541 PDDITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           PDDITYNILLEGHCKNGR ED LKLRNEKGLVVDYA YTSLVGEYDK +KDRRKR
Sbjct: 541 PDDITYNILLEGHCKNGRPEDLLKLRNEKGLVVDYACYTSLVGEYDKCIKDRRKR 595

BLAST of HG10011190.1 vs. NCBI nr
Match: XP_008457121.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucumis melo] >KAA0047378.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK14055.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1107.4 bits (2863), Expect = 0.0e+00
Identity = 545/593 (91.91%), Postives = 571/593 (96.29%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISLSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSLR 60
           MANNSSFKL   SISLSKPSFRYSTWHSPPPPAA ADP+LAAVSTAINNAQTKPLASSLR
Sbjct: 1   MANNSSFKL---SISLSKPSFRYSTWHSPPPPAAVADPLLAAVSTAINNAQTKPLASSLR 60

Query: 61  RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMF 120
           RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSS+PTFRHT QSYCAMA+FLSAHQMF
Sbjct: 61  RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSMPTFRHTSQSYCAMANFLSAHQMF 120

Query: 121 EESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCFR 180
           EE QS++RFLVSRKGKDSAASVFAAIL+IAGTRCSNFVFDALMIAY DSGF+SDAIQCFR
Sbjct: 121 EECQSIIRFLVSRKGKDSAASVFAAILDIAGTRCSNFVFDALMIAYWDSGFVSDAIQCFR 180

Query: 181 LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG 240
           LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG
Sbjct: 181 LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG 240

Query: 241 SIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYS 300
           S RDA+LIFDEI K GLRPTTVSFNTLING CKSRNLDEGFRLKKIMEENRIYPDVFTYS
Sbjct: 241 STRDAKLIFDEIRKWGLRPTTVSFNTLINGLCKSRNLDEGFRLKKIMEENRIYPDVFTYS 300

Query: 301 VLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTM 360
           VLIH LCKEGRLD+AEQLFDEMQ+RGLRPN VTFTALIDGQC+ ++IDSAMNTY QMLTM
Sbjct: 301 VLIHRLCKEGRLDNAEQLFDEMQKRGLRPNGVTFTALIDGQCKRRQIDSAMNTYHQMLTM 360

Query: 361 GVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESA 420
           GVKPDLVMYNTLL GLCKVGDVNKARKLIDEM+MVGMKPDKI+YTTLIDGYCKEGDLESA
Sbjct: 361 GVKPDLVMYNTLLYGLCKVGDVNKARKLIDEMRMVGMKPDKISYTTLIDGYCKEGDLESA 420

Query: 421 MEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDG 480
           +EIRKGMNEEG+VLDNVAFTALISG CRDGRV DAERTLREMM AGMKPDDATYTMVIDG
Sbjct: 421 LEIRKGMNEEGVVLDNVAFTALISGFCRDGRVRDAERTLREMMEAGMKPDDATYTMVIDG 480

Query: 481 FCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPD 540
           +CKKG+VKTGFK+LKEMQ NGH PGVITYNVLMNGLCKQGQMKNA+MLLEAMLNLGVTPD
Sbjct: 481 YCKKGDVKTGFKMLKEMQINGHKPGVITYNVLMNGLCKQGQMKNAHMLLEAMLNLGVTPD 540

Query: 541 DITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           DITYNILLEGHCKNG+AED LKLRNEKGL++DYAYYTSLVGEYDKS+KDR+KR
Sbjct: 541 DITYNILLEGHCKNGKAEDLLKLRNEKGLIIDYAYYTSLVGEYDKSLKDRQKR 590

BLAST of HG10011190.1 vs. NCBI nr
Match: KAG7017964.1 (putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 545/594 (91.75%), Postives = 568/594 (95.62%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISL-SKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSL 60
           MA NS+FKLS FS SL SKPSFRYSTWHSP PPAAAADPVLAAVSTAINN +TKPLASSL
Sbjct: 1   MAANSTFKLSNFSNSLPSKPSFRYSTWHSPLPPAAAADPVLAAVSTAINNVETKPLASSL 60

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQM 120
           RRLLPSFKPHHFIDLINHNPFSLSP+SLFSFFNWLSS+PTFRHTLQSYCAMA+FL AHQM
Sbjct: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCAHQM 120

Query: 121 FEESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           FEESQS+VRFLVSRKGKDSAAS+FAAILEI  TRCSNFVFDALMIAYSDSGFISDAIQCF
Sbjct: 121 FEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180

Query: 181 RLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKE 240
           RLVRK NFQIPF GC YLLDKMMNSNSPVTIWTFY EILDSGFPP VKYFNILINKFCK+
Sbjct: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240

Query: 241 GSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTY 300
           GSIRDARLIFDEIGKRG RPT VSFNTLING CKSRNLDE FRLKK MEENRI+PDV+TY
Sbjct: 241 GSIRDARLIFDEIGKRGFRPTAVSFNTLINGLCKSRNLDECFRLKKAMEENRIHPDVYTY 300

Query: 301 SVLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLT 360
           SVLIHGLCKEG++DDAEQLFDEM+QRGLRPNDVTFTALIDGQCRS RIDSAMNTYQQML 
Sbjct: 301 SVLIHGLCKEGKVDDAEQLFDEMRQRGLRPNDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTL+NGLCKVGDV+KARKL+DEMKMVGMKPDKITYTTLIDGYCKEGDLES
Sbjct: 361 MGVKPDLVMYNTLINGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420

Query: 421 AMEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVID 480
           AMEIRKGMNEEG+VLDNVAFTA+ISGLCRDGRV+DAERTLREM  AGMKPDDATYTMVID
Sbjct: 421 AMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKPDDATYTMVID 480

Query: 481 GFCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540
           G+CK G+VKTGFKLLKEMQ NGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP
Sbjct: 481 GYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540

Query: 541 DDITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           DDITYNILLEGHCK+GRAEDFL LRNEKGLVVDYAYYTSLVGEYDKS+KDRRKR
Sbjct: 541 DDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 594

BLAST of HG10011190.1 vs. NCBI nr
Match: XP_023526542.1 (putative pentatricopeptide repeat-containing protein At1g09680 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1102.4 bits (2850), Expect = 0.0e+00
Identity = 544/594 (91.58%), Postives = 568/594 (95.62%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISL-SKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSL 60
           MA NS+FKLS FS SL SKPSFRYSTWHSP PPAAA DPVLAAVSTAINN +TKPLASSL
Sbjct: 28  MAANSTFKLSNFSNSLPSKPSFRYSTWHSPLPPAAAGDPVLAAVSTAINNVETKPLASSL 87

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQM 120
           RRLLPSFKPHHFIDLINHNPFSLSP+SLFSFFNWLSS+PTFRHTLQSYCAMA+FL AHQM
Sbjct: 88  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCAHQM 147

Query: 121 FEESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           FEESQS+VRFLVSRKGKDSAAS+FAAILEI  TRCSNFVFDALMIAYSDSGFISDAIQCF
Sbjct: 148 FEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 207

Query: 181 RLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKE 240
           RLVRK NFQIPF GC YLLDKMMNSNSPVTIWTFY EILDSGFPP VKYFNILINKFCK+
Sbjct: 208 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 267

Query: 241 GSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTY 300
           GSIRDARLIFDEIGKRG RPT VSFNTLINGFCKSRNLDE FRLKK+MEE+RIYPDV+TY
Sbjct: 268 GSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEESRIYPDVYTY 327

Query: 301 SVLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLT 360
           SVLIHGLCKEG++DDAEQLFDEM+QRGLR NDVTFTALIDGQCRS RIDSAMNTYQQML 
Sbjct: 328 SVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 387

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTLLNGLCKVGDV+KARKL+DEMKMVGMKPDKITYTTLIDGYCKEGDLES
Sbjct: 388 MGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 447

Query: 421 AMEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVID 480
           AMEIRKGMNEEG+VLDNVAFTA+ISGLCRDGRV+DAERTLREM  AGMKPDDATYTMVID
Sbjct: 448 AMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKPDDATYTMVID 507

Query: 481 GFCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540
           G+CK G+VKTGFKLLKEMQ NGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP
Sbjct: 508 GYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 567

Query: 541 DDITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           DDITYNILLEGHCK+GRAEDFL LRNEKG+VVDYAYYTSLVGEYDKS+KDRRKR
Sbjct: 568 DDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKDRRKR 621

BLAST of HG10011190.1 vs. NCBI nr
Match: XP_022983327.1 (putative pentatricopeptide repeat-containing protein At1g09680 [Cucurbita maxima])

HSP 1 Score: 1102.0 bits (2849), Expect = 0.0e+00
Identity = 544/594 (91.58%), Postives = 565/594 (95.12%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISL-SKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSL 60
           MA NS+FKLS FS SL SKPSFRYSTWHSPPPPAAAADPVLAAVSTAINN +TKPLASSL
Sbjct: 1   MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSL 60

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQM 120
           RRLLPSFKPHHFIDLINHNPFSLSP+SLFSFFNWLSS+PTFRHTLQSYCAMA+FL  HQM
Sbjct: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQM 120

Query: 121 FEESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           FEESQS++RFLVSRKGKDSAAS+FAAILEI  TRCSNFVFDALMIAYSDSGFISDAIQCF
Sbjct: 121 FEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180

Query: 181 RLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKE 240
           RLVRK NFQIPF GC YLLDKMMNSNSPVTIWTFY EILDSGFPP VKYFNILINKFCK+
Sbjct: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240

Query: 241 GSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTY 300
           GSIRDARLIFDEIGKRG RPTTVSFNTLING CKSRNLDE FRLKK MEENRIYPDV+TY
Sbjct: 241 GSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTY 300

Query: 301 SVLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLT 360
           SVLIHGLCKEGR+DDAEQLFDEM+QRGLR NDVTFTALIDGQCRS RIDSAMNTYQQML 
Sbjct: 301 SVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTLLNGLCKVGDV+KARKL+DEMKMVGMKPDKITYTTLIDGYCKEGDLES
Sbjct: 361 MGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420

Query: 421 AMEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVID 480
           AMEIRKGMN EG+VLDNVAFTA+ISGLCRDGRV+DAE TLREM  AGMKPDDATYTMVID
Sbjct: 421 AMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVID 480

Query: 481 GFCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540
           G+CK G+VK GFKLLKEMQ NGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP
Sbjct: 481 GYCKNGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540

Query: 541 DDITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           DDITYNILLEGHCK+GRAEDFL LRNEKGLVVDYAYYTSLVGEYDKS+KDRRKR
Sbjct: 541 DDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 594

BLAST of HG10011190.1 vs. ExPASy Swiss-Prot
Match: O04491 (Putative pentatricopeptide repeat-containing protein At1g09680 OS=Arabidopsis thaliana OX=3702 GN=At1g09680 PE=3 SV=1)

HSP 1 Score: 685.6 bits (1768), Expect = 4.8e-196
Identity = 340/588 (57.82%), Postives = 435/588 (73.98%), Query Frame = 0

Query: 17  SKPSFRYSTWHSPPPPAAA---ADPVLAAVSTAINNAQTKPL--------ASSLRRLLPS 76
           S+ SF  STW+S    +AA    DPVL  +S AI ++   P           S+R++LPS
Sbjct: 20  SRASFLLSTWYSQESVSAADNDDDPVLVKLSVAIRDSYKDPPLEFSSFTDCPSIRKVLPS 79

Query: 77  FKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQS 136
              HH +DLINHNP SL   S+F+FF ++SS P FR T+++Y  +A FL+ H+MF E+QS
Sbjct: 80  LSVHHVVDLINHNPLSLPQRSIFAFFKFISSQPGFRFTVETYFVLARFLAVHEMFTEAQS 139

Query: 137 VVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKS 196
           ++  +VSRKGK+SA+SVF +++E+  T    F+ DALMI Y+D GFI DAIQCFRL RK 
Sbjct: 140 LIELVVSRKGKNSASSVFISLVEMRVTPMCGFLVDALMITYTDLGFIPDAIQCFRLSRKH 199

Query: 197 NFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDA 256
            F +P  GCG LLD+MM  N   TIW FY EILD+GFP NV  FNIL+NKFCKEG+I DA
Sbjct: 200 RFDVPIRGCGNLLDRMMKLNPTGTIWGFYMEILDAGFPLNVYVFNILMNKFCKEGNISDA 259

Query: 257 RLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHG 316
           + +FDEI KR L+PT VSFNTLING+CK  NLDEGFRLK  ME++R  PDVFTYS LI+ 
Sbjct: 260 QKVFDEITKRSLQPTVVSFNTLINGYCKVGNLDEGFRLKHQMEKSRTRPDVFTYSALINA 319

Query: 317 LCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPD 376
           LCKE ++D A  LFDEM +RGL PNDV FT LI G  R+  ID    +YQ+ML+ G++PD
Sbjct: 320 LCKENKMDGAHGLFDEMCKRGLIPNDVIFTTLIHGHSRNGEIDLMKESYQKMLSKGLQPD 379

Query: 377 LVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK 436
           +V+YNTL+NG CK GD+  AR ++D M   G++PDKITYTTLIDG+C+ GD+E+A+EIRK
Sbjct: 380 IVLYNTLVNGFCKNGDLVAARNIVDGMIRRGLRPDKITYTTLIDGFCRGGDVETALEIRK 439

Query: 437 GMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKG 496
            M++ GI LD V F+AL+ G+C++GRV+DAER LREM+ AG+KPDD TYTM++D FCKKG
Sbjct: 440 EMDQNGIELDRVGFSALVCGMCKEGRVIDAERALREMLRAGIKPDDVTYTMMMDAFCKKG 499

Query: 497 NVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN 556
           + +TGFKLLKEMQ++GH P V+TYNVL+NGLCK GQMKNA+MLL+AMLN+GV PDDITYN
Sbjct: 500 DAQTGFKLLKEMQSDGHVPSVVTYNVLLNGLCKLGQMKNADMLLDAMLNIGVVPDDITYN 559

Query: 557 ILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
            LLEGH ++  +      + E G+V D A Y S+V E D++ KD R R
Sbjct: 560 TLLEGHHRHANSSKRYIQKPEIGIVADLASYKSIVNELDRASKDHRNR 607

BLAST of HG10011190.1 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 3.6e-82
Identity = 179/578 (30.97%), Postives = 306/578 (52.94%), Query Frame = 0

Query: 24  STWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSLRRLLPSFKPHHFIDLI--NHNPFS 83
           ST+ S P  +  AD  L  +         K     L  L  +F P    +L+  + N  +
Sbjct: 13  STFASSPSDSLLADKALTFL---------KRHPYQLHHLSANFTPEAASNLLLKSQNDQA 72

Query: 84  LSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQSVVRFLVSRKGKDSAAS 143
           L    +  F NW +    F  TL+  C     L+  ++++ +Q +   + ++   D  AS
Sbjct: 73  L----ILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTAQILAEDVAAKTLDDEYAS 132

Query: 144 VFAAILEIAGTRC--SNFVFDALMIAYSDSGFISDAIQCFRLVRKSNFQIPFHGCGYLLD 203
           +    L+     C  ++ VFD ++ +YS    I  A+    L +   F         +LD
Sbjct: 133 LVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLD 192

Query: 204 KMMNSNSPVTI-WTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDARLIFDEIGKRGLR 263
             + S   ++     + E+L+S   PNV  +NILI  FC  G+I  A  +FD++  +G  
Sbjct: 193 ATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCL 252

Query: 264 PTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHGLCKEGRLDDAEQL 323
           P  V++NTLI+G+CK R +D+GF+L + M    + P++ +Y+V+I+GLC+EGR+ +   +
Sbjct: 253 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFV 312

Query: 324 FDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPDLVMYNTLLNGLCK 383
             EM +RG   ++VT+  LI G C+      A+  + +ML  G+ P ++ Y +L++ +CK
Sbjct: 313 LTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCK 372

Query: 384 VGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGIVLDNVA 443
            G++N+A + +D+M++ G+ P++ TYTTL+DG+ ++G +  A  + + MN+ G     V 
Sbjct: 373 AGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVT 432

Query: 444 FTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKGNVKTGFKLLKEMQ 503
           + ALI+G C  G++ DA   L +M   G+ PD  +Y+ V+ GFC+  +V    ++ +EM 
Sbjct: 433 YNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMV 492

Query: 504 TNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKNGRAE 563
             G  P  ITY+ L+ G C+Q + K A  L E ML +G+ PD+ TY  L+  +C  G  E
Sbjct: 493 EKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLE 552

Query: 564 DFLKLRN---EKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
             L+L N   EKG++ D   Y+ L+   +K  + R  +
Sbjct: 553 KALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAK 575

BLAST of HG10011190.1 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 281.6 bits (719), Expect = 2.1e-74
Identity = 158/524 (30.15%), Postives = 267/524 (50.95%), Query Frame = 0

Query: 75  INHNPFSLSPLSL------FSFFNWLSSIPTFR--HTLQSYCAMADFLSAHQMFEESQSV 134
           +NH  +  + L L        F  W+   P     H +Q  C     L   +M++ ++ +
Sbjct: 35  LNHMDYRQARLRLVHGKLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHI 94

Query: 135 VRFLVSRKGKDSAASVFAAILEIAGTRCSN-FVFDALMIAYSDSGFISDAIQCFRLVRKS 194
           ++ L    GK S   VF A++       SN  V+D L+  Y   G I D+++ FRL+   
Sbjct: 95  LKELSLMSGKSS--FVFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLY 154

Query: 195 NFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDA 254
            F    + C  +L  ++ S   V++W+F  E+L     P+V  FNILIN  C EGS   +
Sbjct: 155 GFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKS 214

Query: 255 RLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHG 314
             +  ++ K G  PT V++NT+++ +CK         L   M+   +  DV TY++LIH 
Sbjct: 215 SYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHD 274

Query: 315 LCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPD 374
           LC+  R+     L  +M++R + PN+VT+  LI+G     ++  A     +ML+ G+ P+
Sbjct: 275 LCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPN 334

Query: 375 LVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK 434
            V +N L++G    G+  +A K+   M+  G+ P +++Y  L+DG CK  + + A     
Sbjct: 335 HVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYM 394

Query: 435 GMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKG 494
            M   G+ +  + +T +I GLC++G + +A   L EM   G+ PD  TY+ +I+GFCK G
Sbjct: 395 RMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVG 454

Query: 495 NVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN 554
             KT  +++  +   G +P  I Y+ L+   C+ G +K A  + EAM+  G T D  T+N
Sbjct: 455 RFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFN 514

Query: 555 ILLEGHCKNGR---AEDFLKLRNEKGLVVDYAYYTSLVGEYDKS 587
           +L+   CK G+   AE+F++     G++ +   +  L+  Y  S
Sbjct: 515 VLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNS 556

BLAST of HG10011190.1 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 2.3e-73
Identity = 164/566 (28.98%), Postives = 285/566 (50.35%), Query Frame = 0

Query: 27  HSPPPPAAAADPVLAAVSTAINNAQTKPLASSLRRLLPSFKPHHFIDLINHNPFSLSPLS 86
           +SP   +      +  ++  I   + +PL  SL+     FK  H I ++           
Sbjct: 46  YSPKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKTDHLIWVL--MKIKCDYRL 105

Query: 87  LFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQSVVRFLVSRKGKDSAASVFAAI 146
           +  FF+W  S       L+S C +     A +  + +QS++     R  K +    F   
Sbjct: 106 VLDFFDWARS--RRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERP-KLNVTDSFVQF 165

Query: 147 LEIAGTRCSNF-----VFDALMIAYSDSGFISDAIQCFRLVRKSNFQIPFHGCG-YLLDK 206
            ++      ++     VFD       D G + +A + F  +      +    C  YL   
Sbjct: 166 FDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRL 225

Query: 207 MMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDARLIFDEIGKRGLRPT 266
             +     T    + E  + G   NV  +NI+I+  C+ G I++A  +   +  +G  P 
Sbjct: 226 SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPD 285

Query: 267 TVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHGLCKEGRLDDAEQLFD 326
            +S++T++NG+C+   LD+ ++L ++M+   + P+ + Y  +I  LC+  +L +AE+ F 
Sbjct: 286 VISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFS 345

Query: 327 EMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPDLVMYNTLLNGLCKVG 386
           EM ++G+ P+ V +T LIDG C+   I +A   + +M +  + PD++ Y  +++G C++G
Sbjct: 346 EMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIG 405

Query: 387 DVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGIVLDNVAFT 446
           D+ +A KL  EM   G++PD +T+T LI+GYCK G ++ A  +   M + G   + V +T
Sbjct: 406 DMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYT 465

Query: 447 ALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKGNVKTGFKLLKEMQTN 506
            LI GLC++G +  A   L EM   G++P+  TY  +++G CK GN++   KL+ E +  
Sbjct: 466 TLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAA 525

Query: 507 GHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKNGRAEDF 566
           G N   +TY  LM+  CK G+M  A  +L+ ML  G+ P  +T+N+L+ G C +G  ED 
Sbjct: 526 GLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDG 585

Query: 567 LKLRN---EKGLVVDYAYYTSLVGEY 584
            KL N    KG+  +   + SLV +Y
Sbjct: 586 EKLLNWMLAKGIAPNATTFNSLVKQY 606

BLAST of HG10011190.1 vs. ExPASy Swiss-Prot
Match: Q9FKR3 (Pentatricopeptide repeat-containing protein At5g38730 OS=Arabidopsis thaliana OX=3702 GN=At5g38730 PE=2 SV=1)

HSP 1 Score: 270.8 bits (691), Expect = 3.7e-71
Identity = 153/504 (30.36%), Postives = 264/504 (52.38%), Query Frame = 0

Query: 84  PLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQSVVRFLVSRKGKDSAASVF 143
           P   +SFF W  S+P+ +H+LQS   M   L+ H+ F+ +  ++  L  R+   S   + 
Sbjct: 60  PSLSWSFFIWTDSLPSSKHSLQSSWKMILILTKHKHFKTAHQLLDKLAQRELLSSPLVLR 119

Query: 144 AAILEIA-GTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKSNFQIPFHGCGYLLDKMM 203
           + +  ++      + VF  LMI Y+ +G I+D+I  F  +R    +     C  LL+ ++
Sbjct: 120 SLVGGVSEDPEDVSHVFSWLMIYYAKAGMINDSIVVFEQIRSCGLKPHLQACTVLLNSLV 179

Query: 204 NSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDARLIFDEIGKRGLRPTTV 263
                 T+W  + +++  G   N+  +N+L++   K G    A  +  E+ ++G+ P   
Sbjct: 180 KQRLTDTVWKIFKKMVKLGVVANIHVYNVLVHACSKSGDPEKAEKLLSEMEEKGVFPDIF 239

Query: 264 SFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHGLCKEGRLDDAEQLFDEM 323
           ++NTLI+ +CK     E   ++  ME + + P++ TY+  IHG  +EGR+ +A +LF E+
Sbjct: 240 TYNTLISVYCKKSMHFEALSVQDRMERSGVAPNIVTYNSFIHGFSREGRMREATRLFREI 299

Query: 324 QQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPDLVMYNTLLNGLCKVGDV 383
           +   +  N VT+T LIDG CR   ID A+   + M + G  P +V YN++L  LC+ G +
Sbjct: 300 KD-DVTANHVTYTTLIDGYCRMNDIDEALRLREVMESRGFSPGVVTYNSILRKLCEDGRI 359

Query: 384 NKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGIVLDNVAFTAL 443
            +A +L+ EM    ++PD IT  TLI+ YCK  D+ SA++++K M E G+ LD  ++ AL
Sbjct: 360 REANRLLTEMSGKKIEPDNITCNTLINAYCKIEDMVSAVKVKKKMIESGLKLDMYSYKAL 419

Query: 444 ISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKGNVKTGFKLLKEMQTNGH 503
           I G C+   + +A+  L  M+  G  P  ATY+ ++DGF  +       KLL+E +  G 
Sbjct: 420 IHGFCKVLELENAKEELFSMIEKGFSPGYATYSWLVDGFYNQNKQDEITKLLEEFEKRGL 479

Query: 504 NPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKNGR---AED 563
              V  Y  L+  +CK  Q+  A +L E+M   G+  D + +  +   + + G+   A  
Sbjct: 480 CADVALYRGLIRRICKLEQVDYAKVLFESMEKKGLVGDSVIFTTMAYAYWRTGKVTEASA 539

Query: 564 FLKLRNEKGLVVDYAYYTSLVGEY 584
              +   + L+V+   Y S+   Y
Sbjct: 540 LFDVMYNRRLMVNLKLYKSISASY 562

BLAST of HG10011190.1 vs. ExPASy TrEMBL
Match: A0A5A7TWU4 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold363G00320 PE=4 SV=1)

HSP 1 Score: 1107.4 bits (2863), Expect = 0.0e+00
Identity = 545/593 (91.91%), Postives = 571/593 (96.29%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISLSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSLR 60
           MANNSSFKL   SISLSKPSFRYSTWHSPPPPAA ADP+LAAVSTAINNAQTKPLASSLR
Sbjct: 1   MANNSSFKL---SISLSKPSFRYSTWHSPPPPAAVADPLLAAVSTAINNAQTKPLASSLR 60

Query: 61  RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMF 120
           RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSS+PTFRHT QSYCAMA+FLSAHQMF
Sbjct: 61  RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSMPTFRHTSQSYCAMANFLSAHQMF 120

Query: 121 EESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCFR 180
           EE QS++RFLVSRKGKDSAASVFAAIL+IAGTRCSNFVFDALMIAY DSGF+SDAIQCFR
Sbjct: 121 EECQSIIRFLVSRKGKDSAASVFAAILDIAGTRCSNFVFDALMIAYWDSGFVSDAIQCFR 180

Query: 181 LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG 240
           LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG
Sbjct: 181 LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG 240

Query: 241 SIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYS 300
           S RDA+LIFDEI K GLRPTTVSFNTLING CKSRNLDEGFRLKKIMEENRIYPDVFTYS
Sbjct: 241 STRDAKLIFDEIRKWGLRPTTVSFNTLINGLCKSRNLDEGFRLKKIMEENRIYPDVFTYS 300

Query: 301 VLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTM 360
           VLIH LCKEGRLD+AEQLFDEMQ+RGLRPN VTFTALIDGQC+ ++IDSAMNTY QMLTM
Sbjct: 301 VLIHRLCKEGRLDNAEQLFDEMQKRGLRPNGVTFTALIDGQCKRRQIDSAMNTYHQMLTM 360

Query: 361 GVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESA 420
           GVKPDLVMYNTLL GLCKVGDVNKARKLIDEM+MVGMKPDKI+YTTLIDGYCKEGDLESA
Sbjct: 361 GVKPDLVMYNTLLYGLCKVGDVNKARKLIDEMRMVGMKPDKISYTTLIDGYCKEGDLESA 420

Query: 421 MEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDG 480
           +EIRKGMNEEG+VLDNVAFTALISG CRDGRV DAERTLREMM AGMKPDDATYTMVIDG
Sbjct: 421 LEIRKGMNEEGVVLDNVAFTALISGFCRDGRVRDAERTLREMMEAGMKPDDATYTMVIDG 480

Query: 481 FCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPD 540
           +CKKG+VKTGFK+LKEMQ NGH PGVITYNVLMNGLCKQGQMKNA+MLLEAMLNLGVTPD
Sbjct: 481 YCKKGDVKTGFKMLKEMQINGHKPGVITYNVLMNGLCKQGQMKNAHMLLEAMLNLGVTPD 540

Query: 541 DITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           DITYNILLEGHCKNG+AED LKLRNEKGL++DYAYYTSLVGEYDKS+KDR+KR
Sbjct: 541 DITYNILLEGHCKNGKAEDLLKLRNEKGLIIDYAYYTSLVGEYDKSLKDRQKR 590

BLAST of HG10011190.1 vs. ExPASy TrEMBL
Match: A0A1S3C629 (putative pentatricopeptide repeat-containing protein At1g09680 OS=Cucumis melo OX=3656 GN=LOC103496868 PE=4 SV=1)

HSP 1 Score: 1107.4 bits (2863), Expect = 0.0e+00
Identity = 545/593 (91.91%), Postives = 571/593 (96.29%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISLSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSLR 60
           MANNSSFKL   SISLSKPSFRYSTWHSPPPPAA ADP+LAAVSTAINNAQTKPLASSLR
Sbjct: 1   MANNSSFKL---SISLSKPSFRYSTWHSPPPPAAVADPLLAAVSTAINNAQTKPLASSLR 60

Query: 61  RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMF 120
           RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSS+PTFRHT QSYCAMA+FLSAHQMF
Sbjct: 61  RLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSMPTFRHTSQSYCAMANFLSAHQMF 120

Query: 121 EESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCFR 180
           EE QS++RFLVSRKGKDSAASVFAAIL+IAGTRCSNFVFDALMIAY DSGF+SDAIQCFR
Sbjct: 121 EECQSIIRFLVSRKGKDSAASVFAAILDIAGTRCSNFVFDALMIAYWDSGFVSDAIQCFR 180

Query: 181 LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG 240
           LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG
Sbjct: 181 LVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEG 240

Query: 241 SIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYS 300
           S RDA+LIFDEI K GLRPTTVSFNTLING CKSRNLDEGFRLKKIMEENRIYPDVFTYS
Sbjct: 241 STRDAKLIFDEIRKWGLRPTTVSFNTLINGLCKSRNLDEGFRLKKIMEENRIYPDVFTYS 300

Query: 301 VLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTM 360
           VLIH LCKEGRLD+AEQLFDEMQ+RGLRPN VTFTALIDGQC+ ++IDSAMNTY QMLTM
Sbjct: 301 VLIHRLCKEGRLDNAEQLFDEMQKRGLRPNGVTFTALIDGQCKRRQIDSAMNTYHQMLTM 360

Query: 361 GVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESA 420
           GVKPDLVMYNTLL GLCKVGDVNKARKLIDEM+MVGMKPDKI+YTTLIDGYCKEGDLESA
Sbjct: 361 GVKPDLVMYNTLLYGLCKVGDVNKARKLIDEMRMVGMKPDKISYTTLIDGYCKEGDLESA 420

Query: 421 MEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDG 480
           +EIRKGMNEEG+VLDNVAFTALISG CRDGRV DAERTLREMM AGMKPDDATYTMVIDG
Sbjct: 421 LEIRKGMNEEGVVLDNVAFTALISGFCRDGRVRDAERTLREMMEAGMKPDDATYTMVIDG 480

Query: 481 FCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPD 540
           +CKKG+VKTGFK+LKEMQ NGH PGVITYNVLMNGLCKQGQMKNA+MLLEAMLNLGVTPD
Sbjct: 481 YCKKGDVKTGFKMLKEMQINGHKPGVITYNVLMNGLCKQGQMKNAHMLLEAMLNLGVTPD 540

Query: 541 DITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           DITYNILLEGHCKNG+AED LKLRNEKGL++DYAYYTSLVGEYDKS+KDR+KR
Sbjct: 541 DITYNILLEGHCKNGKAEDLLKLRNEKGLIIDYAYYTSLVGEYDKSLKDRQKR 590

BLAST of HG10011190.1 vs. ExPASy TrEMBL
Match: A0A6J1IYZ9 (putative pentatricopeptide repeat-containing protein At1g09680 OS=Cucurbita maxima OX=3661 GN=LOC111481940 PE=4 SV=1)

HSP 1 Score: 1102.0 bits (2849), Expect = 0.0e+00
Identity = 544/594 (91.58%), Postives = 565/594 (95.12%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISL-SKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSL 60
           MA NS+FKLS FS SL SKPSFRYSTWHSPPPPAAAADPVLAAVSTAINN +TKPLASSL
Sbjct: 1   MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSL 60

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQM 120
           RRLLPSFKPHHFIDLINHNPFSLSP+SLFSFFNWLSS+PTFRHTLQSYCAMA+FL  HQM
Sbjct: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQM 120

Query: 121 FEESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           FEESQS++RFLVSRKGKDSAAS+FAAILEI  TRCSNFVFDALMIAYSDSGFISDAIQCF
Sbjct: 121 FEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180

Query: 181 RLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKE 240
           RLVRK NFQIPF GC YLLDKMMNSNSPVTIWTFY EILDSGFPP VKYFNILINKFCK+
Sbjct: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240

Query: 241 GSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTY 300
           GSIRDARLIFDEIGKRG RPTTVSFNTLING CKSRNLDE FRLKK MEENRIYPDV+TY
Sbjct: 241 GSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTY 300

Query: 301 SVLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLT 360
           SVLIHGLCKEGR+DDAEQLFDEM+QRGLR NDVTFTALIDGQCRS RIDSAMNTYQQML 
Sbjct: 301 SVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTLLNGLCKVGDV+KARKL+DEMKMVGMKPDKITYTTLIDGYCKEGDLES
Sbjct: 361 MGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420

Query: 421 AMEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVID 480
           AMEIRKGMN EG+VLDNVAFTA+ISGLCRDGRV+DAE TLREM  AGMKPDDATYTMVID
Sbjct: 421 AMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVID 480

Query: 481 GFCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540
           G+CK G+VK GFKLLKEMQ NGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP
Sbjct: 481 GYCKNGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540

Query: 541 DDITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           DDITYNILLEGHCK+GRAEDFL LRNEKGLVVDYAYYTSLVGEYDKS+KDRRKR
Sbjct: 541 DDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 594

BLAST of HG10011190.1 vs. ExPASy TrEMBL
Match: A0A6J1F410 (putative pentatricopeptide repeat-containing protein At1g09680 OS=Cucurbita moschata OX=3662 GN=LOC111441918 PE=4 SV=1)

HSP 1 Score: 1094.0 bits (2828), Expect = 0.0e+00
Identity = 540/594 (90.91%), Postives = 565/594 (95.12%), Query Frame = 0

Query: 1   MANNSSFKLSKFSISL-SKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSL 60
           MA NS+FKLS FS SL SKPSFRYSTWHSP PPAAAADPVLAAVSTAINN +TKPLASSL
Sbjct: 1   MAANSTFKLSNFSNSLPSKPSFRYSTWHSPLPPAAAADPVLAAVSTAINNVETKPLASSL 60

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQM 120
           RRLLPSFKPHHFIDLINHNPFSLSP+SLFSFFNWLSS+PTFRHTLQSYCAMA+FL  HQM
Sbjct: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCVHQM 120

Query: 121 FEESQSVVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           FEESQS+VRFLVSRKGKDSAAS+FAAILEI  TRCSNFVFDALMIAYSDSGFISDAIQCF
Sbjct: 121 FEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180

Query: 181 RLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKE 240
           RLVRK NFQIPF GC YLLDKMMNSNSPVTIWTFY EILDSGFPP VKYFNILINKFCK+
Sbjct: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240

Query: 241 GSIRDARLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTY 300
           GSIRDARLIFDEIGKRG RPT VSFNTLING CKSRNLDE FRLKK MEENRI+PDV+TY
Sbjct: 241 GSIRDARLIFDEIGKRGFRPTAVSFNTLINGLCKSRNLDECFRLKKAMEENRIHPDVYTY 300

Query: 301 SVLIHGLCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLT 360
           SVLIHGLCK+G++DDAEQLFDEM+QRGLR NDVTFTALIDGQCRS RIDSAMNTYQQML 
Sbjct: 301 SVLIHGLCKQGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTL+NGLCKVGDV+KARKL+DEMKMV MKPDKITYTTLIDGYCKEGD+ES
Sbjct: 361 MGVKPDLVMYNTLINGLCKVGDVSKARKLVDEMKMVRMKPDKITYTTLIDGYCKEGDIES 420

Query: 421 AMEIRKGMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVID 480
           AMEIRKGMNEEG+VLDNVAFTA+ISGLCRDGRV+DAERTLREM  AGMKPDDATYTMVID
Sbjct: 421 AMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKPDDATYTMVID 480

Query: 481 GFCKKGNVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540
           G+CK G+VKTGFKLLKEMQ NGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP
Sbjct: 481 GYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540

Query: 541 DDITYNILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           DDITYNILLEGHCK+GRAEDFL LRNEKGLVVDYAYYTSLVGEYDKS+KDRRKR
Sbjct: 541 DDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 594

BLAST of HG10011190.1 vs. ExPASy TrEMBL
Match: A0A6J1CQA5 (putative pentatricopeptide repeat-containing protein At1g09680 OS=Momordica charantia OX=3673 GN=LOC111013174 PE=4 SV=1)

HSP 1 Score: 1043.1 bits (2696), Expect = 4.4e-301
Identity = 518/588 (88.10%), Postives = 550/588 (93.54%), Query Frame = 0

Query: 11  KFSISLS----KPSFRYSTWHSPPPPAAA-ADPVLAAVSTAINNAQTKPLASSLRRLLPS 70
           K S SLS    K SFRYSTW+SPPPP+    D +LAAVSTAINNAQTKPLASSLRRLLPS
Sbjct: 7   KLSDSLSRLPPKSSFRYSTWYSPPPPSPPHTDDLLAAVSTAINNAQTKPLASSLRRLLPS 66

Query: 71  FKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQS 130
            + HH ++L+NHNPFSL PLSLFSFFNWLSS PTFRHT+QSYCAMA FL AH+MF ES S
Sbjct: 67  LRAHHVVNLLNHNPFSLHPLSLFSFFNWLSSQPTFRHTVQSYCAMAHFLCAHRMFAESHS 126

Query: 131 VVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKS 190
           +VRFLVSRKGKDSA+S+FAAILE+AGTR SN VFDALMIAY+DSGF+SDAIQCFRLVRK 
Sbjct: 127 IVRFLVSRKGKDSASSIFAAILEVAGTRSSNLVFDALMIAYADSGFVSDAIQCFRLVRKY 186

Query: 191 NFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDA 250
           NFQIP  GCGYLLDKMMNSNS V+IWTFYSEI+DSGFPPNVKYFNILINKFCKEG+IRDA
Sbjct: 187 NFQIPSRGCGYLLDKMMNSNSTVSIWTFYSEIMDSGFPPNVKYFNILINKFCKEGNIRDA 246

Query: 251 RLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHG 310
           RLIF EIGKRGLRPTTVSFNTLING CKS+NLDEGFRLKKIMEENRIYPDVFTYSVLIHG
Sbjct: 247 RLIFFEIGKRGLRPTTVSFNTLINGLCKSQNLDEGFRLKKIMEENRIYPDVFTYSVLIHG 306

Query: 311 LCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPD 370
           LCKEGRL DAEQLFDEMQQRGL+PNDVTFT LI+GQCRS +IDSAMNTYQQMLTMGVKPD
Sbjct: 307 LCKEGRLYDAEQLFDEMQQRGLKPNDVTFTTLINGQCRSGQIDSAMNTYQQMLTMGVKPD 366

Query: 371 LVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK 430
           LVMYNTLLNGLCKVGDV+KARKLI EMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK
Sbjct: 367 LVMYNTLLNGLCKVGDVSKARKLIGEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK 426

Query: 431 GMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKG 490
           GMNEEG+VLD+VAFTALISGLCRDGRV+DAERTLREMM AGMKPDDATYTMVID +CKKG
Sbjct: 427 GMNEEGVVLDDVAFTALISGLCRDGRVIDAERTLREMMEAGMKPDDATYTMVIDEYCKKG 486

Query: 491 NVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN 550
           +VK GFKLLKEMQ NGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN
Sbjct: 487 DVKMGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN 546

Query: 551 ILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
           ILLEGHCKNG+AED +KLRNEKGLVVDYAYYTSLVGEYDKS+KDR+KR
Sbjct: 547 ILLEGHCKNGKAEDSIKLRNEKGLVVDYAYYTSLVGEYDKSLKDRQKR 594

BLAST of HG10011190.1 vs. TAIR 10
Match: AT1G09680.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 685.6 bits (1768), Expect = 3.4e-197
Identity = 340/588 (57.82%), Postives = 435/588 (73.98%), Query Frame = 0

Query: 17  SKPSFRYSTWHSPPPPAAA---ADPVLAAVSTAINNAQTKPL--------ASSLRRLLPS 76
           S+ SF  STW+S    +AA    DPVL  +S AI ++   P           S+R++LPS
Sbjct: 20  SRASFLLSTWYSQESVSAADNDDDPVLVKLSVAIRDSYKDPPLEFSSFTDCPSIRKVLPS 79

Query: 77  FKPHHFIDLINHNPFSLSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQS 136
              HH +DLINHNP SL   S+F+FF ++SS P FR T+++Y  +A FL+ H+MF E+QS
Sbjct: 80  LSVHHVVDLINHNPLSLPQRSIFAFFKFISSQPGFRFTVETYFVLARFLAVHEMFTEAQS 139

Query: 137 VVRFLVSRKGKDSAASVFAAILEIAGTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKS 196
           ++  +VSRKGK+SA+SVF +++E+  T    F+ DALMI Y+D GFI DAIQCFRL RK 
Sbjct: 140 LIELVVSRKGKNSASSVFISLVEMRVTPMCGFLVDALMITYTDLGFIPDAIQCFRLSRKH 199

Query: 197 NFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDA 256
            F +P  GCG LLD+MM  N   TIW FY EILD+GFP NV  FNIL+NKFCKEG+I DA
Sbjct: 200 RFDVPIRGCGNLLDRMMKLNPTGTIWGFYMEILDAGFPLNVYVFNILMNKFCKEGNISDA 259

Query: 257 RLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHG 316
           + +FDEI KR L+PT VSFNTLING+CK  NLDEGFRLK  ME++R  PDVFTYS LI+ 
Sbjct: 260 QKVFDEITKRSLQPTVVSFNTLINGYCKVGNLDEGFRLKHQMEKSRTRPDVFTYSALINA 319

Query: 317 LCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPD 376
           LCKE ++D A  LFDEM +RGL PNDV FT LI G  R+  ID    +YQ+ML+ G++PD
Sbjct: 320 LCKENKMDGAHGLFDEMCKRGLIPNDVIFTTLIHGHSRNGEIDLMKESYQKMLSKGLQPD 379

Query: 377 LVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK 436
           +V+YNTL+NG CK GD+  AR ++D M   G++PDKITYTTLIDG+C+ GD+E+A+EIRK
Sbjct: 380 IVLYNTLVNGFCKNGDLVAARNIVDGMIRRGLRPDKITYTTLIDGFCRGGDVETALEIRK 439

Query: 437 GMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKG 496
            M++ GI LD V F+AL+ G+C++GRV+DAER LREM+ AG+KPDD TYTM++D FCKKG
Sbjct: 440 EMDQNGIELDRVGFSALVCGMCKEGRVIDAERALREMLRAGIKPDDVTYTMMMDAFCKKG 499

Query: 497 NVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN 556
           + +TGFKLLKEMQ++GH P V+TYNVL+NGLCK GQMKNA+MLL+AMLN+GV PDDITYN
Sbjct: 500 DAQTGFKLLKEMQSDGHVPSVVTYNVLLNGLCKLGQMKNADMLLDAMLNIGVVPDDITYN 559

Query: 557 ILLEGHCKNGRAEDFLKLRNEKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
            LLEGH ++  +      + E G+V D A Y S+V E D++ KD R R
Sbjct: 560 TLLEGHHRHANSSKRYIQKPEIGIVADLASYKSIVNELDRASKDHRNR 607

BLAST of HG10011190.1 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 307.4 bits (786), Expect = 2.5e-83
Identity = 179/578 (30.97%), Postives = 306/578 (52.94%), Query Frame = 0

Query: 24  STWHSPPPPAAAADPVLAAVSTAINNAQTKPLASSLRRLLPSFKPHHFIDLI--NHNPFS 83
           ST+ S P  +  AD  L  +         K     L  L  +F P    +L+  + N  +
Sbjct: 13  STFASSPSDSLLADKALTFL---------KRHPYQLHHLSANFTPEAASNLLLKSQNDQA 72

Query: 84  LSPLSLFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQSVVRFLVSRKGKDSAAS 143
           L    +  F NW +    F  TL+  C     L+  ++++ +Q +   + ++   D  AS
Sbjct: 73  L----ILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTAQILAEDVAAKTLDDEYAS 132

Query: 144 VFAAILEIAGTRC--SNFVFDALMIAYSDSGFISDAIQCFRLVRKSNFQIPFHGCGYLLD 203
           +    L+     C  ++ VFD ++ +YS    I  A+    L +   F         +LD
Sbjct: 133 LVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLD 192

Query: 204 KMMNSNSPVTI-WTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDARLIFDEIGKRGLR 263
             + S   ++     + E+L+S   PNV  +NILI  FC  G+I  A  +FD++  +G  
Sbjct: 193 ATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCL 252

Query: 264 PTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHGLCKEGRLDDAEQL 323
           P  V++NTLI+G+CK R +D+GF+L + M    + P++ +Y+V+I+GLC+EGR+ +   +
Sbjct: 253 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFV 312

Query: 324 FDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPDLVMYNTLLNGLCK 383
             EM +RG   ++VT+  LI G C+      A+  + +ML  G+ P ++ Y +L++ +CK
Sbjct: 313 LTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCK 372

Query: 384 VGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGIVLDNVA 443
            G++N+A + +D+M++ G+ P++ TYTTL+DG+ ++G +  A  + + MN+ G     V 
Sbjct: 373 AGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVT 432

Query: 444 FTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKGNVKTGFKLLKEMQ 503
           + ALI+G C  G++ DA   L +M   G+ PD  +Y+ V+ GFC+  +V    ++ +EM 
Sbjct: 433 YNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMV 492

Query: 504 TNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKNGRAE 563
             G  P  ITY+ L+ G C+Q + K A  L E ML +G+ PD+ TY  L+  +C  G  E
Sbjct: 493 EKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLE 552

Query: 564 DFLKLRN---EKGLVVDYAYYTSLVGEYDKSVKDRRKR 594
             L+L N   EKG++ D   Y+ L+   +K  + R  +
Sbjct: 553 KALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAK 575

BLAST of HG10011190.1 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 281.6 bits (719), Expect = 1.5e-75
Identity = 158/524 (30.15%), Postives = 267/524 (50.95%), Query Frame = 0

Query: 75  INHNPFSLSPLSL------FSFFNWLSSIPTFR--HTLQSYCAMADFLSAHQMFEESQSV 134
           +NH  +  + L L        F  W+   P     H +Q  C     L   +M++ ++ +
Sbjct: 75  LNHMDYRQARLRLVHGKLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHI 134

Query: 135 VRFLVSRKGKDSAASVFAAILEIAGTRCSN-FVFDALMIAYSDSGFISDAIQCFRLVRKS 194
           ++ L    GK S   VF A++       SN  V+D L+  Y   G I D+++ FRL+   
Sbjct: 135 LKELSLMSGKSS--FVFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLY 194

Query: 195 NFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDA 254
            F    + C  +L  ++ S   V++W+F  E+L     P+V  FNILIN  C EGS   +
Sbjct: 195 GFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKS 254

Query: 255 RLIFDEIGKRGLRPTTVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHG 314
             +  ++ K G  PT V++NT+++ +CK         L   M+   +  DV TY++LIH 
Sbjct: 255 SYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHD 314

Query: 315 LCKEGRLDDAEQLFDEMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPD 374
           LC+  R+     L  +M++R + PN+VT+  LI+G     ++  A     +ML+ G+ P+
Sbjct: 315 LCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPN 374

Query: 375 LVMYNTLLNGLCKVGDVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK 434
            V +N L++G    G+  +A K+   M+  G+ P +++Y  L+DG CK  + + A     
Sbjct: 375 HVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYM 434

Query: 435 GMNEEGIVLDNVAFTALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKG 494
            M   G+ +  + +T +I GLC++G + +A   L EM   G+ PD  TY+ +I+GFCK G
Sbjct: 435 RMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVG 494

Query: 495 NVKTGFKLLKEMQTNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN 554
             KT  +++  +   G +P  I Y+ L+   C+ G +K A  + EAM+  G T D  T+N
Sbjct: 495 RFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFN 554

Query: 555 ILLEGHCKNGR---AEDFLKLRNEKGLVVDYAYYTSLVGEYDKS 587
           +L+   CK G+   AE+F++     G++ +   +  L+  Y  S
Sbjct: 555 VLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNS 596

BLAST of HG10011190.1 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 278.1 bits (710), Expect = 1.7e-74
Identity = 164/566 (28.98%), Postives = 285/566 (50.35%), Query Frame = 0

Query: 27  HSPPPPAAAADPVLAAVSTAINNAQTKPLASSLRRLLPSFKPHHFIDLINHNPFSLSPLS 86
           +SP   +      +  ++  I   + +PL  SL+     FK  H I ++           
Sbjct: 46  YSPKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKTDHLIWVL--MKIKCDYRL 105

Query: 87  LFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQSVVRFLVSRKGKDSAASVFAAI 146
           +  FF+W  S       L+S C +     A +  + +QS++     R  K +    F   
Sbjct: 106 VLDFFDWARS--RRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERP-KLNVTDSFVQF 165

Query: 147 LEIAGTRCSNF-----VFDALMIAYSDSGFISDAIQCFRLVRKSNFQIPFHGCG-YLLDK 206
            ++      ++     VFD       D G + +A + F  +      +    C  YL   
Sbjct: 166 FDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRL 225

Query: 207 MMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDARLIFDEIGKRGLRPT 266
             +     T    + E  + G   NV  +NI+I+  C+ G I++A  +   +  +G  P 
Sbjct: 226 SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPD 285

Query: 267 TVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHGLCKEGRLDDAEQLFD 326
            +S++T++NG+C+   LD+ ++L ++M+   + P+ + Y  +I  LC+  +L +AE+ F 
Sbjct: 286 VISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFS 345

Query: 327 EMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPDLVMYNTLLNGLCKVG 386
           EM ++G+ P+ V +T LIDG C+   I +A   + +M +  + PD++ Y  +++G C++G
Sbjct: 346 EMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIG 405

Query: 387 DVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGIVLDNVAFT 446
           D+ +A KL  EM   G++PD +T+T LI+GYCK G ++ A  +   M + G   + V +T
Sbjct: 406 DMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYT 465

Query: 447 ALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKGNVKTGFKLLKEMQTN 506
            LI GLC++G +  A   L EM   G++P+  TY  +++G CK GN++   KL+ E +  
Sbjct: 466 TLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAA 525

Query: 507 GHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKNGRAEDF 566
           G N   +TY  LM+  CK G+M  A  +L+ ML  G+ P  +T+N+L+ G C +G  ED 
Sbjct: 526 GLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDG 585

Query: 567 LKLRN---EKGLVVDYAYYTSLVGEY 584
            KL N    KG+  +   + SLV +Y
Sbjct: 586 EKLLNWMLAKGIAPNATTFNSLVKQY 606

BLAST of HG10011190.1 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 278.1 bits (710), Expect = 1.7e-74
Identity = 164/566 (28.98%), Postives = 285/566 (50.35%), Query Frame = 0

Query: 27  HSPPPPAAAADPVLAAVSTAINNAQTKPLASSLRRLLPSFKPHHFIDLINHNPFSLSPLS 86
           +SP   +      +  ++  I   + +PL  SL+     FK  H I ++           
Sbjct: 46  YSPKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKTDHLIWVL--MKIKCDYRL 105

Query: 87  LFSFFNWLSSIPTFRHTLQSYCAMADFLSAHQMFEESQSVVRFLVSRKGKDSAASVFAAI 146
           +  FF+W  S       L+S C +     A +  + +QS++     R  K +    F   
Sbjct: 106 VLDFFDWARS--RRDSNLESLCIVIHLAVASKDLKVAQSLISSFWERP-KLNVTDSFVQF 165

Query: 147 LEIAGTRCSNF-----VFDALMIAYSDSGFISDAIQCFRLVRKSNFQIPFHGCG-YLLDK 206
            ++      ++     VFD       D G + +A + F  +      +    C  YL   
Sbjct: 166 FDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRL 225

Query: 207 MMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKEGSIRDARLIFDEIGKRGLRPT 266
             +     T    + E  + G   NV  +NI+I+  C+ G I++A  +   +  +G  P 
Sbjct: 226 SKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPD 285

Query: 267 TVSFNTLINGFCKSRNLDEGFRLKKIMEENRIYPDVFTYSVLIHGLCKEGRLDDAEQLFD 326
            +S++T++NG+C+   LD+ ++L ++M+   + P+ + Y  +I  LC+  +L +AE+ F 
Sbjct: 286 VISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFS 345

Query: 327 EMQQRGLRPNDVTFTALIDGQCRSQRIDSAMNTYQQMLTMGVKPDLVMYNTLLNGLCKVG 386
           EM ++G+ P+ V +T LIDG C+   I +A   + +M +  + PD++ Y  +++G C++G
Sbjct: 346 EMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIG 405

Query: 387 DVNKARKLIDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGIVLDNVAFT 446
           D+ +A KL  EM   G++PD +T+T LI+GYCK G ++ A  +   M + G   + V +T
Sbjct: 406 DMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYT 465

Query: 447 ALISGLCRDGRVVDAERTLREMMGAGMKPDDATYTMVIDGFCKKGNVKTGFKLLKEMQTN 506
            LI GLC++G +  A   L EM   G++P+  TY  +++G CK GN++   KL+ E +  
Sbjct: 466 TLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAA 525

Query: 507 GHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKNGRAEDF 566
           G N   +TY  LM+  CK G+M  A  +L+ ML  G+ P  +T+N+L+ G C +G  ED 
Sbjct: 526 GLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDG 585

Query: 567 LKLRN---EKGLVVDYAYYTSLVGEY 584
            KL N    KG+  +   + SLV +Y
Sbjct: 586 EKLLNWMLAKGIAPNATTFNSLVKQY 606

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038878138.10.0e+0093.61LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g09... [more]
XP_008457121.10.0e+0091.91PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucum... [more]
KAG7017964.10.0e+0091.75putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyros... [more]
XP_023526542.10.0e+0091.58putative pentatricopeptide repeat-containing protein At1g09680 [Cucurbita pepo s... [more]
XP_022983327.10.0e+0091.58putative pentatricopeptide repeat-containing protein At1g09680 [Cucurbita maxima... [more]
Match NameE-valueIdentityDescription
O044914.8e-19657.82Putative pentatricopeptide repeat-containing protein At1g09680 OS=Arabidopsis th... [more]
Q9FIX33.6e-8230.97Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Q9LVQ52.1e-7430.15Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX... [more]
Q0WVK72.3e-7328.98Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
Q9FKR33.7e-7130.36Pentatricopeptide repeat-containing protein At5g38730 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A5A7TWU40.0e+0091.91Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa... [more]
A0A1S3C6290.0e+0091.91putative pentatricopeptide repeat-containing protein At1g09680 OS=Cucumis melo O... [more]
A0A6J1IYZ90.0e+0091.58putative pentatricopeptide repeat-containing protein At1g09680 OS=Cucurbita maxi... [more]
A0A6J1F4100.0e+0090.91putative pentatricopeptide repeat-containing protein At1g09680 OS=Cucurbita mosc... [more]
A0A6J1CQA54.4e-30188.10putative pentatricopeptide repeat-containing protein At1g09680 OS=Momordica char... [more]
Match NameE-valueIdentityDescription
AT1G09680.13.4e-19757.82Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G39710.12.5e-8330.97Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G55840.11.5e-7530.15Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G05670.11.7e-7428.98Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G05670.21.7e-7428.98Pentatricopeptide repeat (PPR-like) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 221..251
e-value: 1.8E-6
score: 27.5
coord: 325..357
e-value: 5.2E-7
score: 29.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 435..483
e-value: 1.0E-13
score: 51.2
coord: 504..553
e-value: 2.1E-16
score: 59.8
coord: 259..308
e-value: 7.3E-17
score: 61.3
coord: 364..413
e-value: 2.0E-18
score: 66.3
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 262..296
e-value: 5.0E-8
score: 30.6
coord: 437..470
e-value: 1.1E-8
score: 32.7
coord: 229..260
e-value: 8.0E-6
score: 23.7
coord: 473..505
e-value: 1.0E-8
score: 32.8
coord: 367..400
e-value: 2.9E-10
score: 37.7
coord: 297..330
e-value: 1.8E-12
score: 44.6
coord: 542..565
e-value: 5.7E-4
score: 17.9
coord: 332..365
e-value: 7.0E-7
score: 27.0
coord: 507..540
e-value: 1.8E-7
score: 28.9
coord: 402..435
e-value: 6.8E-9
score: 33.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 400..434
score: 12.528824
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 470..504
score: 12.199985
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 505..539
score: 11.904029
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 295..329
score: 15.795291
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 365..399
score: 13.372844
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 260..294
score: 11.301158
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 435..469
score: 12.287675
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 225..259
score: 11.32308
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 330..364
score: 11.608074
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 87..241
e-value: 8.1E-11
score: 43.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 242..358
e-value: 5.3E-39
score: 136.5
coord: 452..593
e-value: 5.2E-36
score: 126.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 359..451
e-value: 1.4E-31
score: 111.2
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 110..422
NoneNo IPR availablePANTHERPTHR47932ATPASE EXPRESSION PROTEIN 3coord: 32..592
NoneNo IPR availablePANTHERPTHR47932:SF51PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN-RELATEDcoord: 32..592

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
HG10011190HG10011190gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
HG10011190.1-cdsHG10011190.1-cds-Chr01:3339226..3341007CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
HG10011190.1HG10011190.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding