HG10017615 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10017615
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr03: 16918639 .. 16921448 (-)
RNA-Seq Expression: HG10017615
Synteny: HG10017615
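The Location field packs chromosome, start, end and strand into one string. A minimal sketch of unpacking it is given here, assuming the coordinates are 1-based and inclusive (the record does not state this); the function names and the regular expression are illustrative and not part of any database API.

```python
import re

# Location string copied from the Overview above.
LOCATION = "Chr03: 16918639 .. 16921448 (-)"

def parse_location(text):
    """Split 'Chr: start .. end (strand)' into chromosome, start, end, strand."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location(LOCATION)
print(chrom, start, end, strand, span_length(start, end))
```

Under that assumption the feature spans 2810 bases on the minus strand of Chr03.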
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTTGGAACTCCATCATCAAGTCCCACTTTGACTCGGGTTTGTTCCTTTCTGCCCTTTTGTTGTATAAAAGCATGAGGGAGGTGGGAGTTGAGCATGATGGTTTCACGTTTCCGATCGTTAATCATGTCATTATGTCAATTTGGGTTGATGTAGTCTATGCGGGAATGGCCCATTGTGTTGGAATTCGAATGGGGTTTAGTGCTGATTTGTATTTCTGTAATACCATGATGCAGGTTTATGGGAAATGTGAGTGTTTGGTTTATGCTCGTAATGTGTTTGATGAAATGCCTAACAGAGACTTGGTTTCTTGGACCTCGATGATTTCGGCGTATGTTAATGGTGGTGATGTTGTTTCTGCTTTGGATCTTTTTGAGGGAATGAGGAGGGAGTTGGAGCCGAACTCGGTGACAGTAATGGTCATGTTGCAAGCTTGTTGTGCGACTCAAAATTTGGTTCTGGGAAGGCTGCTTCAATGTCATGTAGTTAAGAGTGGTTTATTGTTTGATACAGGTCTGCAGAATTCGTTCTTGCAAATGTATAGTCGACTGGGAGAGGAGGATGAAGTTGGAATTTTTTTCTCTGAAATTCATTGCAAGAATGTGGTTTCTTGGAATATTTTGATGTCTTTTTATTCCTCCATGGGGGATATTTTGAAAGTTGTAGATACCTTCAACAAAATCATGGGTGAAGTTCCACTCAGCATTGAGACATTAACCATACTTATATCAGCAACTGCGACATCTGATTCCGGGTGTCTGATCCTAGGTGAAAATCTACATTCCTTGGCAATTAAAAGTGGCCTTTATGATGGAATTCTGCGGACTTCTTTGTTGGATATGTATGCCAAGTTTGGGGAGTTGGAAAATTCAACTAGGTTGTTTAAGGGAATCCCAAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAGAATGGTCATTTTGATGATGCAGTTGAGATCTTCAAGCAAATGCAAACTGCTGGCTTGAAACCCAGTGTTGGAATTATTAAACACTTAATTGATGCTTACGCCTATTTGGGTGCTCTGCAGTTGGGGAAAGCAATACATTGTTACCTCATCCGAATCTATGGATTGGAGATATGTAATACACACTTAGAAACATCTCTCCTAAACATGTATGGAAGATGTGGAAGCGTTGCTTCTGCTAGAAAATGTTTTGACTTGATCTTAATCAAAGATGTTGTGGCGTGGACTTCCATGATTGACGTATATGGTGCTCATGGATTAGGTATTGATGCCCTCAATCTGTTCCGTCAGATGATGAGTGAAGAAGTGGCCCCAAATAATGTCACGTTCTTAAGTTTGTTATCTGCCTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATCTTTTATTCAATGAGATCAAGGTTCCATATTAAGCCTAATTTAGAGCACTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCAATTACATTGAGAATGACAAATCTCTGTGATGGCAGGATTTGGGGTGCTCTTATGGGCGCCTGCCGGGTGTATGGAGACAATAAAATCGCTAACTATGCTGCAGACAGGCTTCTTGAATTAGAACCTGACAATGTAGGCTATTATACTTTGTTGAGCAATTCACAGGCCAGTGTTGGGCAGTGGCATGAAGTTGAAAAATTACGTAGTGTTGTGTATGAGAAAGACCTTGTCAAGAAACCAGGTTGGAGCTTCATTGAGTTAAATGGAACAATTCATGGGTTTGTTTCAGGAGATAGATCACACAACAAGACCAATGAGATTTATGATTTATTGGTATATATTAATAGGATAAAATAGGACAGGGATTCTGTTTATCTAAGCTACAATAAGGGTGTCTGATTAAACCTGGAATTCTTTTTGGCATCTTCACTGTGCCATATGAAAGATGGGAATATTGGTTAACCTGCCACATAACAATATAGCATTTGATGAGATCGTTTGAAGTG
ATTGAAGGACTAGTTTGAAGTTTGAAGCATGAACATATTAAAGGGAGAAGTCTAATTAGTCATTATGCCTGATGGTATGGTAATACTTTTCATCCTCTCTAAATTTCAGCTTACGCTCATTGCTGTTTCACTATTCTCTAGTAATGTGAATCGTTTAGACTTACAAATGATATATCAGTTCTCATGCACAAAATCATGAATCTGTTGAATGCCCCTTTCCCCCTAGACTAATCTTACAAGCTTCGTGTTCCTGGTAGTATCTATGCTATGCAAATGATTAGAGAAATTTTTTTAATTGTAGAAGTGAAAAGAAAACCATGTTTAGTAGACAGAAAGGATCCCTCACAAATCAAAACATATGGTTATCTGGAGAATGGCCTGCCGCATTATGGTGCCATCTCTCTGCTTGGCAGCTAAACATTCCACTCTAAATAAACTACCATGGATTGACCTGATAATCATTGCAACAAGTGAAAAAATGTATAAGCGACTTTGGGAGAATGAATTCAAATTTAATTGTACCTACCTACCTAAGGTACTAATATCTTACAAGTTTCTAAGCAACTAGATGTTGTATGGTCGAATGGTTAACCTGTAAGATTGGTCAAGTACTTTTGAGCTTACTCAAATGCTGACAAATATATGAGTATATTACTGCTAATCTGTTACTGATTGTTGATTTTGTTTCTTCAGGCGATGGAGCTTATATCTAGCACAGCTCTGAAGTTCGGTTTTGGGCATAGAGCAAATAGTATAAGGTTCTTGGCCTTGAGGCATAGCTTAGGAAGAAACTGGGTCAGGCAGAGCTGA

mRNA sequence

ATGCTTTGGAACTCCATCATCAAGTCCCACTTTGACTCGGGTTTGTTCCTTTCTGCCCTTTTGTTGTATAAAAGCATGAGGGAGGTGGGAGTTGAGCATGATGGTTTCACGTTTCCGATCGTTAATCATGTCATTATGTCAATTTGGGTTGATGTAGTCTATGCGGGAATGGCCCATTGTGTTGGAATTCGAATGGGGTTTAGTGCTGATTTGTATTTCTGTAATACCATGATGCAGGTTTATGGGAAATGTGAGTGTTTGGTTTATGCTCGTAATGTGTTTGATGAAATGCCTAACAGAGACTTGGTTTCTTGGACCTCGATGATTTCGGCGTATGTTAATGGTGGTGATGTTGTTTCTGCTTTGGATCTTTTTGAGGGAATGAGGAGGGAGTTGGAGCCGAACTCGGTGACAGTAATGGTCATGTTGCAAGCTTGTTGTGCGACTCAAAATTTGGTTCTGGGAAGGCTGCTTCAATGTCATGTAGTTAAGAGTGGTTTATTGTTTGATACAGGTCTGCAGAATTCGTTCTTGCAAATGTATAGTCGACTGGGAGAGGAGGATGAAGTTGGAATTTTTTTCTCTGAAATTCATTGCAAGAATGTGGTTTCTTGGAATATTTTGATGTCTTTTTATTCCTCCATGGGGGATATTTTGAAAGTTGTAGATACCTTCAACAAAATCATGGGTGAAGTTCCACTCAGCATTGAGACATTAACCATACTTATATCAGCAACTGCGACATCTGATTCCGGGTGTCTGATCCTAGGTGAAAATCTACATTCCTTGGCAATTAAAAGTGGCCTTTATGATGGAATTCTGCGGACTTCTTTGTTGGATATGTATGCCAAGTTTGGGGAGTTGGAAAATTCAACTAGGTTGTTTAAGGGAATCCCAAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAGAATGGTCATTTTGATGATGCAGTTGAGATCTTCAAGCAAATGCAAACTGCTGGCTTGAAACCCAGTGTTGGAATTATTAAACACTTAATTGATGCTTACGCCTATTTGGGTGCTCTGCAGTTGGGGAAAGCAATACATTGTTACCTCATCCGAATCTATGGATTGGAGATATGTAATACACACTTAGAAACATCTCTCCTAAACATGTATGGAAGATGTGGAAGCGTTGCTTCTGCTAGAAAATGTTTTGACTTGATCTTAATCAAAGATGTTGTGGCGTGGACTTCCATGATTGACGTATATGGTGCTCATGGATTAGGTATTGATGCCCTCAATCTGTTCCGTCAGATGATGAGTGAAGAAGTGGCCCCAAATAATGTCACGTTCTTAAGTTTGTTATCTGCCTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATCTTTTATTCAATGAGATCAAGGTTCCATATTAAGCCTAATTTAGAGCACTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCAATTACATTGAGAATGACAAATCTCTGTGATGGCAGGATTTGGGGTGCTCTTATGGGCGCCTGCCGGGTGTATGGAGACAATAAAATCGCTAACTATGCTGCAGACAGGCTTCTTGAATTAGAACCTGACAATGTAGGCTATTATACTTTGTTGAGCAATTCACAGGCCAGTGTTGGGCAGTGGCATGAAGTTGAAAAATTACGTAGTGTTGTGTATGAGAAAGACCTTGTCAAGAAACCAGGTTGGAGCTTCATTGAGTTAAATGGAACAATTCATGGGTTTGTTTCAGGAGATAGATCACACAACAAGACCAATGAGATTTATGATTTATTGGCGATGGAGCTTATATCTAGCACAGCTCTGAAGTTCGGTTTTGGGCATAGAGCAAATAGTATAAGGTTCTTGGCCTTGAGGCATAGCTTAGGAAGAAACTGGGTCAGGCAGAGCTGA
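Read side by side, the mRNA sequence appears to be the gene sequence with a single internal stretch (the intron) removed; that stretch begins with GT and ends with AG. The sketch below shows the general operation of joining exon intervals. The toy sequence and exon coordinates are hypothetical and chosen only for illustration; they are not the real exon boundaries of HG10017615, and 1-based inclusive coordinates are assumed.

```python
# Hypothetical toy gene: exon 1 + intron (GT...AG) + exon 2.
TOY_EXON_1 = "ATGGCC"
TOY_INTRON = "GTAAGTTTTCAG"
TOY_EXON_2 = "GATTGA"
TOY_GENOMIC = TOY_EXON_1 + TOY_INTRON + TOY_EXON_2

# Exon intervals on the toy gene, 1-based and inclusive (an assumption).
TOY_EXONS = [(1, 6), (19, 24)]

def splice(genomic, exons):
    """Concatenate the exon intervals of a sequence in ascending order."""
    return "".join(genomic[start - 1:end] for start, end in sorted(exons))

def introns(genomic, exons):
    """Return the stretches that lie between consecutive exons."""
    ordered = sorted(exons)
    return [
        genomic[prev_end:next_start - 1]
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
    ]

print(splice(TOY_GENOMIC, TOY_EXONS))
print(introns(TOY_GENOMIC, TOY_EXONS))
```

The sequences on this page are already given in the gene's own orientation, so no reverse-complement step is included in the sketch.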

Coding sequence (CDS)

ATGCTTTGGAACTCCATCATCAAGTCCCACTTTGACTCGGGTTTGTTCCTTTCTGCCCTTTTGTTGTATAAAAGCATGAGGGAGGTGGGAGTTGAGCATGATGGTTTCACGTTTCCGATCGTTAATCATGTCATTATGTCAATTTGGGTTGATGTAGTCTATGCGGGAATGGCCCATTGTGTTGGAATTCGAATGGGGTTTAGTGCTGATTTGTATTTCTGTAATACCATGATGCAGGTTTATGGGAAATGTGAGTGTTTGGTTTATGCTCGTAATGTGTTTGATGAAATGCCTAACAGAGACTTGGTTTCTTGGACCTCGATGATTTCGGCGTATGTTAATGGTGGTGATGTTGTTTCTGCTTTGGATCTTTTTGAGGGAATGAGGAGGGAGTTGGAGCCGAACTCGGTGACAGTAATGGTCATGTTGCAAGCTTGTTGTGCGACTCAAAATTTGGTTCTGGGAAGGCTGCTTCAATGTCATGTAGTTAAGAGTGGTTTATTGTTTGATACAGGTCTGCAGAATTCGTTCTTGCAAATGTATAGTCGACTGGGAGAGGAGGATGAAGTTGGAATTTTTTTCTCTGAAATTCATTGCAAGAATGTGGTTTCTTGGAATATTTTGATGTCTTTTTATTCCTCCATGGGGGATATTTTGAAAGTTGTAGATACCTTCAACAAAATCATGGGTGAAGTTCCACTCAGCATTGAGACATTAACCATACTTATATCAGCAACTGCGACATCTGATTCCGGGTGTCTGATCCTAGGTGAAAATCTACATTCCTTGGCAATTAAAAGTGGCCTTTATGATGGAATTCTGCGGACTTCTTTGTTGGATATGTATGCCAAGTTTGGGGAGTTGGAAAATTCAACTAGGTTGTTTAAGGGAATCCCAAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAGAATGGTCATTTTGATGATGCAGTTGAGATCTTCAAGCAAATGCAAACTGCTGGCTTGAAACCCAGTGTTGGAATTATTAAACACTTAATTGATGCTTACGCCTATTTGGGTGCTCTGCAGTTGGGGAAAGCAATACATTGTTACCTCATCCGAATCTATGGATTGGAGATATGTAATACACACTTAGAAACATCTCTCCTAAACATGTATGGAAGATGTGGAAGCGTTGCTTCTGCTAGAAAATGTTTTGACTTGATCTTAATCAAAGATGTTGTGGCGTGGACTTCCATGATTGACGTATATGGTGCTCATGGATTAGGTATTGATGCCCTCAATCTGTTCCGTCAGATGATGAGTGAAGAAGTGGCCCCAAATAATGTCACGTTCTTAAGTTTGTTATCTGCCTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATCTTTTATTCAATGAGATCAAGGTTCCATATTAAGCCTAATTTAGAGCACTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCAATTACATTGAGAATGACAAATCTCTGTGATGGCAGGATTTGGGGTGCTCTTATGGGCGCCTGCCGGGTGTATGGAGACAATAAAATCGCTAACTATGCTGCAGACAGGCTTCTTGAATTAGAACCTGACAATGTAGGCTATTATACTTTGTTGAGCAATTCACAGGCCAGTGTTGGGCAGTGGCATGAAGTTGAAAAATTACGTAGTGTTGTGTATGAGAAAGACCTTGTCAAGAAACCAGGTTGGAGCTTCATTGAGTTAAATGGAACAATTCATGGGTTTGTTTCAGGAGATAGATCACACAACAAGACCAATGAGATTTATGATTTATTGGCGATGGAGCTTATATCTAGCACAGCTCTGAAGTTCGGTTTTGGGCATAGAGCAAATAGTATAAGGTTCTTGGCCTTGAGGCATAGCTTAGGAAGAAACTGGGTCAGGCAGAGCTGA

Protein sequence

MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHCVGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQMYSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLTILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPNRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAIHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHGLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEPDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHNKTNEIYDLLAMELISSTALKFGFGHRANSIRFLALRHSLGRNWVRQS
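The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the record does not name the translation table), the first 15 and last 9 nucleotides of the CDS can be translated as follows; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

CDS_START = "ATGCTTTGGAACTCC"  # first 15 nt of the CDS above
CDS_END = "CAGAGCTGA"          # last 9 nt of the CDS above

print(translate(CDS_START))  # MLWNS
print(translate(CDS_END))    # QS*
```

The two fragments reproduce the first five residues (MLWNS) and the last two residues (QS) of the protein sequence, followed by the stop codon.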
Homology
BLAST of HG10017615 vs. NCBI nr
Match: XP_038883286.1 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 1142.1 bits (2953), Expect = 0.0e+00
Identity = 561/609 (92.12%), Positives = 582/609 (95.57%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWNSIIKSHFDSGLFLSALLLYK+MREVGVEHDGFTFPI+NHVIMSIWVDV+YA M HC
Sbjct: 1   MLWNSIIKSHFDSGLFLSALLLYKNMREVGVEHDGFTFPILNHVIMSIWVDVLYAEMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGF ADLYFCNTMM+VYGKC CLVYAR++FDEMPNRDLVSWTSMISAYVNGGDVV 
Sbjct: 61  VGIRMGFIADLYFCNTMMEVYGKCGCLVYARHMFDEMPNRDLVSWTSMISAYVNGGDVVC 120

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
           ALDLFE MRRELEPNSVT MVMLQACCATQN VLGR LQCHVVK+GLL D GL+NSFL+M
Sbjct: 121 ALDLFEAMRRELEPNSVTAMVMLQACCATQNFVLGRQLQCHVVKNGLLLDIGLRNSFLRM 180

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLT 240
           YSRLG EDEVG+FFSEI CKNVVSWNILMSFYSS+G+ILKVVD FNKIMGEV LSIETLT
Sbjct: 181 YSRLGGEDEVGVFFSEIDCKNVVSWNILMSFYSSVGNILKVVDIFNKIMGEVTLSIETLT 240

Query: 241 ILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPN 300
           ILISATATSDSGCLILGENLHSLAIKSGLYD IL+TSLLDMYAKFGELENS +LFK IPN
Sbjct: 241 ILISATATSDSGCLILGENLHSLAIKSGLYDSILQTSLLDMYAKFGELENSAKLFKEIPN 300

Query: 301 RSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAI 360
           RSIITWGAMMSSFIQNGHFD+AVEIFKQMQ AGLKPSVG++KHLIDAYAYLGALQLGKAI
Sbjct: 301 RSIITWGAMMSSFIQNGHFDEAVEIFKQMQAAGLKPSVGVLKHLIDAYAYLGALQLGKAI 360

Query: 361 HCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHG 420
           HCYLIRIYGLEICNTHLETSLLNMYGRCGS+ASARKCFDLIL KDVV WTSMIDVYGAHG
Sbjct: 361 HCYLIRIYGLEICNTHLETSLLNMYGRCGSIASARKCFDLILTKDVVVWTSMIDVYGAHG 420

Query: 421 LGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHY 480
           LGIDALNLF QMMSEEVAPN+VTFLSLLSACSHSGLVSEGCEIFYSMRS F IKP+LEHY
Sbjct: 421 LGIDALNLFHQMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSSFDIKPDLEHY 480

Query: 481 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEP 540
           TCFVDLLSRSTRVREAFAI LRMTNL DGRIWGALMGACRVYGDNKIANYAA RLLELEP
Sbjct: 481 TCFVDLLSRSTRVREAFAIILRMTNLRDGRIWGALMGACRVYGDNKIANYAAHRLLELEP 540

Query: 541 DNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 600
           DNVGYYTLLSN+QASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN
Sbjct: 541 DNVGYYTLLSNAQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 600

Query: 601 KTNEIYDLL 610
           KTNEIYDLL
Sbjct: 601 KTNEIYDLL 609
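The percentages in each HSP summary line are the matching-column counts divided by the alignment length. The sketch below recomputes them from the summary line of the hit above. The parser is illustrative and not part of any BLAST tool, and it matches the label before the second ratio loosely, so minor spelling variants of that label are accepted.

```python
import re

# Summary line of the first NCBI nr hit above.
STATS = "Identity = 561/609 (92.12%), Positives = 582/609 (95.57%), Query Frame = 0"

def parse_hsp_stats(line):
    """Extract identity/positive counts and recompute their percentages."""
    match = re.search(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)", line)
    if match is None:
        raise ValueError(f"unrecognised HSP summary: {line!r}")
    identical, length, positive, positive_length = map(int, match.groups())
    return {
        "aligned_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positive_pct": round(100 * positive / positive_length, 2),
    }

print(parse_hsp_stats(STATS))
```

For this hit, 561 of 609 aligned columns are identical (92.12%) and 582 of 609 are identical or conservative substitutions (95.57%).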

BLAST of HG10017615 vs. NCBI nr
Match: XP_008457591.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_008457593.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_016902177.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_016902178.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_016902179.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo] >XP_016902180.1 PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic [Cucumis melo])

HSP 1 Score: 1107.4 bits (2863), Expect = 0.0e+00
Identity = 547/609 (89.82%), Positives = 571/609 (93.76%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWN++IKSHFDSGLF SALLLYK+MREV VEHDGFT PIVN VI+SIWVDVVY GM HC
Sbjct: 1   MLWNNVIKSHFDSGLFHSALLLYKNMREVRVEHDGFTLPIVNQVILSIWVDVVYGGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGFS+DLYFCNTMM+VYGKC CLV AR+VFDEMPNRDLVSWTSMISAYV GGDV  
Sbjct: 61  VGIRMGFSSDLYFCNTMMEVYGKCGCLVSARDVFDEMPNRDLVSWTSMISAYVKGGDVFC 120

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
           ALD+FEGMRRELEPNSVTV+VMLQACCATQNLVLGRLLQC+VVK+GLLFDTGLQNSFL+M
Sbjct: 121 ALDIFEGMRRELEPNSVTVIVMLQACCATQNLVLGRLLQCYVVKNGLLFDTGLQNSFLRM 180

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLT 240
           YSRLG EDEV  FFSEI  KNVVSWNILMSFYSSMGDI+KVVD  NKIMGEVPLSIETLT
Sbjct: 181 YSRLGGEDEVVAFFSEIDFKNVVSWNILMSFYSSMGDIVKVVDILNKIMGEVPLSIETLT 240

Query: 241 ILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPN 300
           ILIS  ATSDSGCLILGENLHSLAIKSGLYD IL TSLLDMYAKFGELENSTRLFK IPN
Sbjct: 241 ILISGIATSDSGCLILGENLHSLAIKSGLYDDILCTSLLDMYAKFGELENSTRLFKEIPN 300

Query: 301 RSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAI 360
           RSIITWGAMMSSFIQNGHFDDAV+IFKQMQ AGLKPSVGI+KHLIDAYAYLGALQLGKAI
Sbjct: 301 RSIITWGAMMSSFIQNGHFDDAVDIFKQMQVAGLKPSVGILKHLIDAYAYLGALQLGKAI 360

Query: 361 HCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHG 420
           HC+LIRIYGL +CNT LETS+LNMY RCGS+ASARKCFDLILIKDVVAWTSMI+ YGAHG
Sbjct: 361 HCHLIRIYGLVVCNTRLETSVLNMYVRCGSIASARKCFDLILIKDVVAWTSMIEGYGAHG 420

Query: 421 LGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHY 480
           LGIDALNLF QM SEEV PNNVTFLSLLSACSHSGLVSEGC IFYSMRSRF+IKP+LEHY
Sbjct: 421 LGIDALNLFHQMTSEEVTPNNVTFLSLLSACSHSGLVSEGCGIFYSMRSRFNIKPDLEHY 480

Query: 481 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEP 540
           TCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIANYAA RLLELEP
Sbjct: 481 TCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIANYAAHRLLELEP 540

Query: 541 DNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 600
           DNVGYYTLLSNSQASVGQWHE EKLRS+VYEK+L KKPGWSFIELNGTIHGFVSGDRSH 
Sbjct: 541 DNVGYYTLLSNSQASVGQWHEAEKLRSLVYEKNLAKKPGWSFIELNGTIHGFVSGDRSHY 600

Query: 601 KTNEIYDLL 610
           K NEIYDLL
Sbjct: 601 KANEIYDLL 609

BLAST of HG10017615 vs. NCBI nr
Match: XP_031739775.1 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739776.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739777.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739778.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739779.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739780.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739781.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739782.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739783.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739784.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739785.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739786.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739787.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >XP_031739788.1 pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis sativus] >KGN54049.2 hypothetical protein Csa_019111 [Cucumis sativus])

HSP 1 Score: 1098.2 bits (2839), Expect = 0.0e+00
Identity = 539/609 (88.51%), Positives = 569/609 (93.43%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWNSIIKS+FDSGLF SALLLYK+MREV VEHDGFT PIVN V MSIWVDV YAGM HC
Sbjct: 1   MLWNSIIKSYFDSGLFHSALLLYKNMREVRVEHDGFTLPIVNQVTMSIWVDVAYAGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGFS+DLYFCNTMM+VYGKC C+V AR+VFDEMPNRDLVSWTSMISAYVNGGDV  
Sbjct: 61  VGIRMGFSSDLYFCNTMMEVYGKCGCMVSARDVFDEMPNRDLVSWTSMISAYVNGGDVFC 120

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
           ALD+FEGMRRELEPNSVTVMVMLQACCATQ LVLGRLLQC+VVK+GLLFDT LQNSFL+M
Sbjct: 121 ALDIFEGMRRELEPNSVTVMVMLQACCATQYLVLGRLLQCYVVKNGLLFDTHLQNSFLRM 180

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLT 240
           YSRLG +DE G+FFSEI  KN VSWNILMSFYSSMGDI+KVVD  NKIMGEVPLSIETLT
Sbjct: 181 YSRLGRQDEFGVFFSEIDFKNAVSWNILMSFYSSMGDIVKVVDILNKIMGEVPLSIETLT 240

Query: 241 ILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPN 300
           ILIS  ATSDS CL+LGENLHSLAIKSGLYDGIL TSLL MYAKFGELENST LFK IPN
Sbjct: 241 ILISGIATSDSRCLMLGENLHSLAIKSGLYDGILCTSLLGMYAKFGELENSTSLFKEIPN 300

Query: 301 RSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAI 360
           RSI+TWGAMMSSFIQNGHFDDAV+IFKQMQ AGLKPSVGI+KHLIDAYA+LGALQLGKAI
Sbjct: 301 RSIVTWGAMMSSFIQNGHFDDAVDIFKQMQVAGLKPSVGILKHLIDAYAFLGALQLGKAI 360

Query: 361 HCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHG 420
           HC LIRIYG  +CNT LETS+LNMY RCGS+ASARKCFDLI+IKDVVAWTSMI+ YGAHG
Sbjct: 361 HCCLIRIYGFVVCNTRLETSVLNMYVRCGSIASARKCFDLIIIKDVVAWTSMIEGYGAHG 420

Query: 421 LGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHY 480
           LG+DALNLF QM SEEV PNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRF+IKP+LEHY
Sbjct: 421 LGVDALNLFHQMTSEEVTPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEHY 480

Query: 481 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEP 540
           TCFVDLLSRSTRVREAFAI LRMT+LCDGRIWGALMGACRVYGDNKIANYAA RLLELEP
Sbjct: 481 TCFVDLLSRSTRVREAFAIILRMTSLCDGRIWGALMGACRVYGDNKIANYAAHRLLELEP 540

Query: 541 DNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 600
           DNVGYYTLLSNSQASVGQWHE EKLRS+VYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN
Sbjct: 541 DNVGYYTLLSNSQASVGQWHEAEKLRSLVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 600

Query: 601 KTNEIYDLL 610
           +TNEIYD+L
Sbjct: 601 RTNEIYDVL 609

BLAST of HG10017615 vs. NCBI nr
Match: XP_023526509.1 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1079.3 bits (2790), Expect = 0.0e+00
Identity = 527/610 (86.39%), Positives = 568/610 (93.11%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWNSIIKS FDSGLFLSA++LYK+MREVGVEHDGFTFPI+NHV+MSIWVDVVYAGM HC
Sbjct: 65  MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 124

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGF +DLYFCNTMM+VY KCECL +AR VFDEMPNRDLVSWTSMISAYVN GD+V 
Sbjct: 125 VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNSGDIVC 184

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
           AL+LFEGMRR LEPNSVT+M MLQACC T++LVLGRL+QC VVK+GLLFD GLQN FL+M
Sbjct: 185 ALNLFEGMRRVLEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 244

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIM-GEVPLSIETL 240
           YSRLG EDE   FFSEI CKNVVSWNIL+SFYSS+GDI+K VD F +IM GEVPL IETL
Sbjct: 245 YSRLGGEDEFVCFFSEIDCKNVVSWNILISFYSSVGDIVKAVDIFKQIMGGEVPLIIETL 304

Query: 241 TILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIP 300
           TILISAT TS+S CLILGENLHSLAIK+GLYD ILRTSLLDMYAKFGEL+NSTRLF  IP
Sbjct: 305 TILISATKTSESMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 364

Query: 301 NRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKA 360
           NRSIITWGAMMSSFIQNGHFD+AVEIF QMQ AGLKPS+GI+KHLIDAYA+LGALQLG+ 
Sbjct: 365 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 424

Query: 361 IHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAH 420
           IHCYLIRIYGLEICNTHLETSL+NMY RCGS+ASARKCFDLI++KDVVAWTSMI+ YGAH
Sbjct: 425 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGAH 484

Query: 421 GLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEH 480
           G GI+ALNL+  MMSEEVAPN+VTFLSLLSACSHSGLVSEGCEIFYSMRSRF+IKP+LEH
Sbjct: 485 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 544

Query: 481 YTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELE 540
           YTCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGD KIA YAA RLLELE
Sbjct: 545 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDTKIAIYAAHRLLELE 604

Query: 541 PDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH 600
           PDNVGYYTLLSN+QASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH
Sbjct: 605 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH 664

Query: 601 NKTNEIYDLL 610
            KT++IYDLL
Sbjct: 665 GKTDQIYDLL 674

BLAST of HG10017615 vs. NCBI nr
Match: KAG6603417.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 527/610 (86.39%), Positives = 570/610 (93.44%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWNSIIKS FDSGLFLSA++LYK+MREVGVEHDGFTFPI+NHV+MSIWVDVVYAGM HC
Sbjct: 1   MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGF +DLYFCNTMM+VY KCECL +ARNVFDEMPNRDLVSWTSMISAYV+ GD+V 
Sbjct: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARNVFDEMPNRDLVSWTSMISAYVDVGDIVC 120

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
           AL+LFEGMRR LEPNSVT+M MLQACC T++LVLGRL+QC VVK+GLLFD GLQN FL+M
Sbjct: 121 ALNLFEGMRRVLEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIM-GEVPLSIETL 240
           YSRLG EDE   FFSEI CKNVVSW+IL+SFYSS+GDI+K VD F +IM GEVPL IETL
Sbjct: 181 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 240

Query: 241 TILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIP 300
           TILISAT TSDS CLILGENLHSLAIK+GLYD ILRTSLLDMYAKFGEL+NSTRLF  IP
Sbjct: 241 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 300

Query: 301 NRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKA 360
           NRSIITWGAMMSSFIQNGHFD+AVEIF QMQ AGLKPS+GI+KHLIDAYA+LGALQLG+ 
Sbjct: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360

Query: 361 IHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAH 420
           IHCYLIRI GLEICNTHLETSL+NMY RCGS+ASARKCFDLI++KDVVAWTSMI+ YGAH
Sbjct: 361 IHCYLIRICGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGAH 420

Query: 421 GLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEH 480
           G GI+ALNL+  MMSEEVAPN+VTFLSLLSACSHSGLVSEGCEIFYSMRSRF+IKP+LEH
Sbjct: 421 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480

Query: 481 YTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELE 540
           YTCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIA YAA RLLELE
Sbjct: 481 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 540

Query: 541 PDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH 600
           PDNVGYYTLLSN+QASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH
Sbjct: 541 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH 600

Query: 601 NKTNEIYDLL 610
           +KT++IYDLL
Sbjct: 601 SKTDQIYDLL 610

BLAST of HG10017615 vs. ExPASy Swiss-Prot
Match: O49619 (Pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H27 PE=3 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 2.2e-109
Identity = 215/611 (35.19%), Positives = 350/611 (57.28%), Query Frame = 0

Query: 2   LWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHCV 61
           LWN +IK     GL++ A+  Y  M   GV+ D FT+P V   +  I   +      H +
Sbjct: 97  LWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVIKSVAGI-SSLEEGKKIHAM 156

Query: 62  GIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSA 121
            I++GF +D+Y CN+++ +Y K  C   A  VF+EMP RD+VSW SMIS Y+  GD  S+
Sbjct: 157 VIKLGFVSDVYVCNSLISLYMKLGCAWDAEKVFEEMPERDIVSWNSMISGYLALGDGFSS 216

Query: 122 LDLFEGMRR-ELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGL-LFDTGLQNSFLQ 181
           L LF+ M +   +P+  + M  L AC    +  +G+ + CH V+S +   D  +  S L 
Sbjct: 217 LMLFKEMLKCGFKPDRFSTMSALGACSHVYSPKMGKEIHCHAVRSRIETGDVMVMTSILD 276

Query: 182 MYSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETL 241
           MYS+ GE       F+ +  +N+V+WN+++  Y+  G +      F K+  +  L  + +
Sbjct: 277 MYSKYGEVSYAERIFNGMIQRNIVAWNVMIGCYARNGRVTDAFLCFQKMSEQNGLQPDVI 336

Query: 242 TILISATATSDSGCLILGENLHSLAIKSG-LYDGILRTSLLDMYAKFGELENSTRLFKGI 301
           T +    A++    ++ G  +H  A++ G L   +L T+L+DMY + G+L+++  +F  +
Sbjct: 337 TSINLLPASA----ILEGRTIHGYAMRRGFLPHMVLETALIDMYGECGQLKSAEVIFDRM 396

Query: 302 PNRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGK 361
             +++I+W +++++++QNG    A+E+F+++  + L P    I  ++ AYA   +L  G+
Sbjct: 397 AEKNVISWNSIIAAYVQNGKNYSALELFQELWDSSLVPDSTTIASILPAYAESLSLSEGR 456

Query: 362 AIHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGA 421
            IH Y+++       NT +  SL++MY  CG +  ARKCF+ IL+KDVV+W S+I  Y  
Sbjct: 457 EIHAYIVK--SRYWSNTIILNSLVHMYAMCGDLEDARKCFNHILLKDVVSWNSIIMAYAV 516

Query: 422 HGLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLE 481
           HG G  ++ LF +M++  V PN  TF SLL+ACS SG+V EG E F SM+  + I P +E
Sbjct: 517 HGFGRISVWLFSEMIASRVNPNKSTFASLLAACSISGMVDEGWEYFESMKREYGIDPGIE 576

Query: 482 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLEL 541
           HY C +DL+ R+     A      M  +   RIWG+L+ A R + D  IA +AA+++ ++
Sbjct: 577 HYGCMLDLIGRTGNFSAAKRFLEEMPFVPTARIWGSLLNASRNHKDITIAEFAAEQIFKM 636

Query: 542 EPDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRS 601
           E DN G Y LL N  A  G+W +V +++ ++  K + +    S +E  G  H F +GDRS
Sbjct: 637 EHDNTGCYVLLLNMYAEAGRWEDVNRIKLLMESKGISRTSSRSTVEAKGKSHVFTNGDRS 696

Query: 602 HNKTNEIYDLL 610
           H  TN+IY++L
Sbjct: 697 HVATNKIYEVL 700

BLAST of HG10017615 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 352.1 bits (902), Expect = 1.4e-95
Identity = 198/622 (31.83%), Positives = 344/622 (55.31%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHCVG 62
           WNS+I  +   G +  AL +Y  ++   +  D FT   V     ++ V     G+ H   
Sbjct: 175 WNSLISGYSSHGYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGL-HGFA 234

Query: 63  IRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSAL 122
           ++ G ++ +   N ++ +Y K      AR VFDEM  RD VS+ +MI  Y+    V  ++
Sbjct: 235 LKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESV 294

Query: 123 DLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQMYS 182
            +F     + +P+ +TV  +L+AC   ++L L + +  +++K+G + ++ ++N  + +Y+
Sbjct: 295 RMFLENLDQFKPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYA 354

Query: 183 RLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLTIL 242
           + G+       F+ + CK+ VSWN ++S Y   GD+++ +  F K+M  +    + +T L
Sbjct: 355 KCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDLMEAMKLF-KMMMIMEEQADHITYL 414

Query: 243 ISATATSDSGCLILGENLHSLAIKSGL-YDGILRTSLLDMYAKFGELENSTRLFKGIPNR 302
           +  + ++    L  G+ LHS  IKSG+  D  +  +L+DMYAK GE+ +S ++F  +   
Sbjct: 415 MLISVSTRLADLKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTG 474

Query: 303 SIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAIH 362
             +TW  ++S+ ++ G F   +++  QM+ + + P +      +   A L A +LGK IH
Sbjct: 475 DTVTWNTVISACVRFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIH 534

Query: 363 CYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHGL 422
           C L+R +G E     +  +L+ MY +CG + ++ + F+ +  +DVV WT MI  YG +G 
Sbjct: 535 CCLLR-FGYE-SELQIGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGE 594

Query: 423 GIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHYT 482
           G  AL  F  M    + P++V F++++ ACSHSGLV EG   F  M++ + I P +EHY 
Sbjct: 595 GEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYA 654

Query: 483 CFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEPD 542
           C VDLLSRS ++ +A      M    D  IW +++ ACR  GD + A   + R++EL PD
Sbjct: 655 CVVDLLSRSQKISKAEEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPD 714

Query: 543 NVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHNK 602
           + GY  L SN+ A++ +W +V  +R  + +K + K PG+S+IE+   +H F SGD S  +
Sbjct: 715 DPGYSILASNAYAALRKWDKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQ 774

Query: 603 TNEIYDLLAMELISSTALKFGF 624
           +  IY   ++E++ S   K G+
Sbjct: 775 SEAIYK--SLEILYSLMAKEGY 790

BLAST of HG10017615 vs. ExPASy Swiss-Prot
Match: Q9C507 (Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E66 PE=3 SV=1)

HSP 1 Score: 350.5 bits (898), Expect = 4.0e-95
Identity = 193/609 (31.69%), Positives = 345/609 (56.65%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHCVG 62
           W++++ S  ++G  + AL ++K M + GVE D  T   V      +   +  A   H   
Sbjct: 170 WSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVTMISVVEGCAELGC-LRIARSVHGQI 229

Query: 63  IRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSAL 122
            R  F  D   CN+++ +Y KC  L+ +  +F+++  ++ VSWT+MIS+Y  G     AL
Sbjct: 230 TRKMFDLDETLCNSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYNRGEFSEKAL 289

Query: 123 DLF-EGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFD-TGLQNSFLQM 182
             F E ++  +EPN VT+  +L +C     +  G+ +    V+  L  +   L  + +++
Sbjct: 290 RSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVEL 349

Query: 183 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLT 242
           Y+  G+  +       +  +N+V+WN L+S Y+  G +++ +  F +++ +  +  +  T
Sbjct: 350 YAECGKLSDCETVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQ-RIKPDAFT 409

Query: 243 ILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPN 302
           +  S +A  ++G + LG+ +H   I++ + D  ++ SL+DMY+K G +++++ +F  I +
Sbjct: 410 LASSISACENAGLVPLGKQIHGHVIRTDVSDEFVQNSLIDMYSKSGSVDSASTVFNQIKH 469

Query: 303 RSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAI 362
           RS++TW +M+  F QNG+  +A+ +F  M  + L+ +      +I A + +G+L+ GK +
Sbjct: 470 RSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKWV 529

Query: 363 HCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHG 422
           H  LI I GL+  +   +T+L++MY +CG + +A   F  +  + +V+W+SMI+ YG HG
Sbjct: 530 HHKLI-ISGLK--DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMHG 589

Query: 423 LGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHY 482
               A++ F QM+     PN V F+++LSAC HSG V EG + ++++   F + PN EH+
Sbjct: 590 RIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEG-KYYFNLMKSFGVSPNSEHF 649

Query: 483 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEP 542
            CF+DLLSRS  ++EA+     M  L D  +WG+L+  CR++    I     + L ++  
Sbjct: 650 ACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVT 709

Query: 543 DNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 602
           D+ GYYTLLSN  A  G+W E  +LRS +   +L K PG+S IE++  +  F +G+ +  
Sbjct: 710 DDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGEENRI 769

Query: 603 KTNEIYDLL 610
           +T+EIY  L
Sbjct: 770 QTDEIYRFL 772

BLAST of HG10017615 vs. ExPASy Swiss-Prot
Match: Q9M1V3 (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 350.1 bits (897), Expect = 5.2e-95
Identity = 202/616 (32.79%), Positives = 342/616 (55.52%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGM-AH 60
           +LWNSI+ S+  SG  L  L L++ M   G   + +T  IV+ +           G   H
Sbjct: 250 VLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYT--IVSALTACDGFSYAKLGKEIH 309

Query: 61  CVGIRMG-FSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDV 120
              ++    S++LY CN ++ +Y +C  +  A  +  +M N D+V+W S+I  YV     
Sbjct: 310 ASVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMY 369

Query: 121 VSALDLFEGM-RRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQ--N 180
             AL+ F  M     + + V++  ++ A     NL+ G  L  +V+K G  +D+ LQ  N
Sbjct: 370 KEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHG--WDSNLQVGN 429

Query: 181 SFLQMYSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLS 240
           + + MYS+      +G  F  +H K+++SW  +++ Y+     ++ ++ F  +  +  + 
Sbjct: 430 TLIDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDV-AKKRME 489

Query: 241 IETLTILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLF 300
           I+ + +     A+S    +++ + +H   ++ GL D +++  L+D+Y K   +  +TR+F
Sbjct: 490 IDEMILGSILRASSVLKSMLIVKEIHCHILRKGLLDTVIQNELVDVYGKCRNMGYATRVF 549

Query: 301 KGIPNRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQ 360
           + I  + +++W +M+SS   NG+  +AVE+F++M   GL      +  ++ A A L AL 
Sbjct: 550 ESIKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALN 609

Query: 361 LGKAIHCYLIRI-YGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMID 420
            G+ IHCYL+R  + LE     +  ++++MY  CG + SA+  FD I  K ++ +TSMI+
Sbjct: 610 KGREIHCYLLRKGFCLE---GSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMIN 669

Query: 421 VYGAHGLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIK 480
            YG HG G  A+ LF +M  E V+P++++FL+LL ACSH+GL+ EG      M   + ++
Sbjct: 670 AYGMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELE 729

Query: 481 PNLEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADR 540
           P  EHY C VD+L R+  V EAF     M       +W AL+ ACR + + +I   AA R
Sbjct: 730 PWPEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQR 789

Query: 541 LLELEPDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVS 600
           LLELEP N G   L+SN  A  G+W++VEK+R+ +    + K PG S+IE++G +H F +
Sbjct: 790 LLELEPKNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTA 849

Query: 601 GDRSHNKTNEIYDLLA 611
            D+SH ++ EIY+ L+
Sbjct: 850 RDKSHPESKEIYEKLS 857
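
The percentages in each HSP header are the match counts divided by the alignment length. The sketch below reproduces the two figures reported for the Q9M1V3 hit. The function name `percent` is illustrative only and is not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the Q9M1V3 HSP header:
# 202 identical and 342 positive columns out of 616 aligned columns.
identity = percent(202, 616)
positives = percent(342, 616)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same calculation applies to every other HSP header on this page, for example 547/609 for the Cucumis melo TrEMBL hit.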

BLAST of HG10017615 vs. ExPASy Swiss-Prot
Match: Q9SS97 (Putative pentatricopeptide repeat-containing protein At3g01580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E87 PE=3 SV=2)

HSP 1 Score: 349.7 bits (896), Expect = 6.9e-95
Identity = 213/615 (34.63%), Positives = 326/615 (53.01%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAH-CV 62
           WN+++KS      +   L  +  M     + D FT P+       +  +V Y  M H  V
Sbjct: 28  WNTLLKSLSREKQWEEVLYHFSHMFRDEEKPDNFTLPVALKACGEL-REVNYGEMIHGFV 87

Query: 63  GIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSA 122
              +   +DLY  ++++ +Y KC  ++ A  +FDE+   D+V+W+SM+S +   G    A
Sbjct: 88  KKDVTLGSDLYVGSSLIYMYIKCGRMIEALRMFDELEKPDIVTWSSMVSGFEKNGSPYQA 147

Query: 123 LDLFEG--MRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQ 182
           ++ F    M  ++ P+ VT++ ++ AC    N  LGR +   V++ G   D  L NS L 
Sbjct: 148 VEFFRRMVMASDVTPDRVTLITLVSACTKLSNSRLGRCVHGFVIRRGFSNDLSLVNSLLN 207

Query: 183 MYSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIM--GEVPLSIE 242
            Y++     E    F  I  K+V+SW+ +++ Y   G   + +  FN +M  G  P    
Sbjct: 208 CYAKSRAFKEAVNLFKMIAEKDVISWSTVIACYVQNGAAAEALLVFNDMMDDGTEPNVAT 267

Query: 243 TLTILISATATSDSGCLILGENLHSLAIKSGLYDGI-LRTSLLDMYAKFGELENSTRLFK 302
            L +L +  A  D   L  G   H LAI+ GL   + + T+L+DMY K    E +  +F 
Sbjct: 268 VLCVLQACAAAHD---LEQGRKTHELAIRKGLETEVKVSTALVDMYMKCFSPEEAYAVFS 327

Query: 303 GIPNRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTA-GLKPSVGIIKHLIDAYAYLGALQ 362
            IP + +++W A++S F  NG    ++E F  M      +P   ++  ++ + + LG L+
Sbjct: 328 RIPRKDVVSWVALISGFTLNGMAHRSIEEFSIMLLENNTRPDAILMVKVLGSCSELGFLE 387

Query: 363 LGKAIHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDV 422
             K  H Y+I+ YG +  N  +  SL+ +Y RCGS+ +A K F+ I +KD V WTS+I  
Sbjct: 388 QAKCFHSYVIK-YGFD-SNPFIGASLVELYSRCGSLGNASKVFNGIALKDTVVWTSLITG 447

Query: 423 YGAHGLGIDALNLFRQMM-SEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIK 482
           YG HG G  AL  F  M+ S EV PN VTFLS+LSACSH+GL+ EG  IF  M + + + 
Sbjct: 448 YGIHGKGTKALETFNHMVKSSEVKPNEVTFLSILSACSHAGLIHEGLRIFKLMVNDYRLA 507

Query: 483 PNLEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADR 542
           PNLEHY   VDLL R   +  A  IT RM      +I G L+GACR++ + ++A   A +
Sbjct: 508 PNLEHYAVLVDLLGRVGDLDTAIEITKRMPFSPTPQILGTLLGACRIHQNGEMAETVAKK 567

Query: 543 LLELEPDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVS 602
           L ELE ++ GYY L+SN     G+W  VEKLR+ V ++ + K    S IE+   +H FV+
Sbjct: 568 LFELESNHAGYYMLMSNVYGVKGEWENVEKLRNSVKQRGIKKGLAESLIEIRRKVHRFVA 627

Query: 603 GDRSHNKTNEIYDLL 610
            D  H +   +Y LL
Sbjct: 628 DDELHPEKEPVYGLL 636

BLAST of HG10017615 vs. ExPASy TrEMBL
Match: A0A1S3C6I7 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103497256 PE=4 SV=1)

HSP 1 Score: 1107.4 bits (2863), Expect = 0.0e+00
Identity = 547/609 (89.82%), Positives = 571/609 (93.76%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWN++IKSHFDSGLF SALLLYK+MREV VEHDGFT PIVN VI+SIWVDVVY GM HC
Sbjct: 1   MLWNNVIKSHFDSGLFHSALLLYKNMREVRVEHDGFTLPIVNQVILSIWVDVVYGGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGFS+DLYFCNTMM+VYGKC CLV AR+VFDEMPNRDLVSWTSMISAYV GGDV  
Sbjct: 61  VGIRMGFSSDLYFCNTMMEVYGKCGCLVSARDVFDEMPNRDLVSWTSMISAYVKGGDVFC 120

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
           ALD+FEGMRRELEPNSVTV+VMLQACCATQNLVLGRLLQC+VVK+GLLFDTGLQNSFL+M
Sbjct: 121 ALDIFEGMRRELEPNSVTVIVMLQACCATQNLVLGRLLQCYVVKNGLLFDTGLQNSFLRM 180

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLT 240
           YSRLG EDEV  FFSEI  KNVVSWNILMSFYSSMGDI+KVVD  NKIMGEVPLSIETLT
Sbjct: 181 YSRLGGEDEVVAFFSEIDFKNVVSWNILMSFYSSMGDIVKVVDILNKIMGEVPLSIETLT 240

Query: 241 ILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPN 300
           ILIS  ATSDSGCLILGENLHSLAIKSGLYD IL TSLLDMYAKFGELENSTRLFK IPN
Sbjct: 241 ILISGIATSDSGCLILGENLHSLAIKSGLYDDILCTSLLDMYAKFGELENSTRLFKEIPN 300

Query: 301 RSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAI 360
           RSIITWGAMMSSFIQNGHFDDAV+IFKQMQ AGLKPSVGI+KHLIDAYAYLGALQLGKAI
Sbjct: 301 RSIITWGAMMSSFIQNGHFDDAVDIFKQMQVAGLKPSVGILKHLIDAYAYLGALQLGKAI 360

Query: 361 HCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHG 420
           HC+LIRIYGL +CNT LETS+LNMY RCGS+ASARKCFDLILIKDVVAWTSMI+ YGAHG
Sbjct: 361 HCHLIRIYGLVVCNTRLETSVLNMYVRCGSIASARKCFDLILIKDVVAWTSMIEGYGAHG 420

Query: 421 LGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHY 480
           LGIDALNLF QM SEEV PNNVTFLSLLSACSHSGLVSEGC IFYSMRSRF+IKP+LEHY
Sbjct: 421 LGIDALNLFHQMTSEEVTPNNVTFLSLLSACSHSGLVSEGCGIFYSMRSRFNIKPDLEHY 480

Query: 481 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEP 540
           TCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIANYAA RLLELEP
Sbjct: 481 TCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIANYAAHRLLELEP 540

Query: 541 DNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 600
           DNVGYYTLLSNSQASVGQWHE EKLRS+VYEK+L KKPGWSFIELNGTIHGFVSGDRSH 
Sbjct: 541 DNVGYYTLLSNSQASVGQWHEAEKLRSLVYEKNLAKKPGWSFIELNGTIHGFVSGDRSHY 600

Query: 601 KTNEIYDLL 610
           K NEIYDLL
Sbjct: 601 KANEIYDLL 609

BLAST of HG10017615 vs. ExPASy TrEMBL
Match: A0A6J1EYH5 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111439622 PE=4 SV=1)

HSP 1 Score: 1072.8 bits (2773), Expect = 4.8e-310
Identity = 523/610 (85.74%), Positives = 567/610 (92.95%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWNSIIKS FDSGLFLSA++LYK+MREVGVEHDGFTFPI+NHV+MSIWVDVVYAGM HC
Sbjct: 1   MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGF +DLYFCNTMM+VY KCECL +AR VFDEMPNRDLVSWTSMISAYVN GD+V 
Sbjct: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVC 120

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
           AL+LFEGMRR  EPNSVT+M MLQACC T++LVLGRL+QC VVK+GLLFD GLQN FL+M
Sbjct: 121 ALNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIM-GEVPLSIETL 240
           YSRLG EDE   FFSEI CKNVVSW+IL+SFYSS+GDI+K VD F +IM GEVPL IETL
Sbjct: 181 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 240

Query: 241 TILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIP 300
           TILISAT TSDS CLILGENLHSLAIK+GLYD ILRTSLLDMYAKFGEL+NSTRLF  IP
Sbjct: 241 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 300

Query: 301 NRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKA 360
           NRSIITWGAMMSSFIQNGHFD+AVEIF QMQ AGLKPS+GI+KHLIDAYA+LGALQLG+ 
Sbjct: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360

Query: 361 IHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAH 420
           IHCYLIRIYGLEICNTHLETSL+NMY RCGS+ASARKCFDLI++KDVVAWTSMI+ YG+H
Sbjct: 361 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGSH 420

Query: 421 GLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEH 480
           G GI+ALNL+  MMSEEVAPN+VTFLSLLSACSHSGLVSEGCEIFYSMRSRF+IKP+LEH
Sbjct: 421 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480

Query: 481 YTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELE 540
           YTCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIA YAA RLLELE
Sbjct: 481 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 540

Query: 541 PDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH 600
           PDNVGYYTLLSN+QASVGQWHEVEKLRSVVYEKD VKKPGWSF+ELNGT+HGFVSGDRSH
Sbjct: 541 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDRSH 600

Query: 601 NKTNEIYDLL 610
            KT++IYDLL
Sbjct: 601 CKTDQIYDLL 610

BLAST of HG10017615 vs. ExPASy TrEMBL
Match: A0A6J1HTH3 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111467326 PE=4 SV=1)

HSP 1 Score: 1055.4 bits (2728), Expect = 9.3e-305
Identity = 518/610 (84.92%), Positives = 563/610 (92.30%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWNSIIKS FDSGLF SA++LYK+MREVGVEHDGFTFPI+NHV+MSI VDVVYAGM HC
Sbjct: 1   MLWNSIIKSQFDSGLFQSAIMLYKNMREVGVEHDGFTFPILNHVVMSICVDVVYAGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGF +DLYFCNTMM+VY KCECL +AR VFDEMPNRDLVSWTSMISAYVN G +V 
Sbjct: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNSGVIVC 120

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
           AL+LFEGMRR LEPNSVT+M MLQACC T++LVLGRL+QC VVK+GLLFD GLQN FL+M
Sbjct: 121 ALNLFEGMRRVLEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIM-GEVPLSIETL 240
           YSRLG EDE    FSEI CKNVVSWNIL+SFY S+GDI+K VD F +IM GEVPL I+TL
Sbjct: 181 YSRLGGEDEFVRVFSEIDCKNVVSWNILISFYFSVGDIVKAVDIFKQIMSGEVPLIIDTL 240

Query: 241 TILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIP 300
           TILISAT TS+S CLILGENLHSLAIK+GLYD ILRTSLLDMYAK GEL+NSTRLF  IP
Sbjct: 241 TILISATKTSESMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKIGELDNSTRLFNEIP 300

Query: 301 NRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKA 360
           NRSIITWGAMMSSFIQNGHFD+AV+IF QMQ AGLKPS+GI+KHLIDAYA+LGALQLG+ 
Sbjct: 301 NRSIITWGAMMSSFIQNGHFDEAVDIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360

Query: 361 IHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAH 420
           IHCYLIRI+GLEICNTHLETSL+NMY RCGS+ASARKCFDLI++KDVVAWTSMI+ YGAH
Sbjct: 361 IHCYLIRIHGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGAH 420

Query: 421 GLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEH 480
           G GI+ALNL+  MMSEEVAPN+VTFLSLLSACSHSGLVSEGCEIFYSMRSRF+IKP+LEH
Sbjct: 421 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480

Query: 481 YTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELE 540
           YTCFVDLLSRSTRVREAFAI LRMTNLCDGRIWGALMGACRVYGDNKIA YAA RLLELE
Sbjct: 481 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 540

Query: 541 PDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSH 600
           PDNVGYYTLLSN+QASVGQWHEVEKLRSVVYEK+LVKKPGWSFIELNGTIHGFVSGDRSH
Sbjct: 541 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKNLVKKPGWSFIELNGTIHGFVSGDRSH 600

Query: 601 NKTNEIYDLL 610
            KT++IYDLL
Sbjct: 601 CKTDQIYDLL 610

BLAST of HG10017615 vs. ExPASy TrEMBL
Match: A0A6J1DKA1 (pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111021322 PE=4 SV=1)

HSP 1 Score: 1037.3 bits (2681), Expect = 2.6e-299
Identity = 512/609 (84.07%), Positives = 549/609 (90.15%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHC 60
           MLWNSIIKSH +SGLF+SALLLYK MRE+GVEHDGFTFP+VN +IMSI +DVVYAGM HC
Sbjct: 1   MLWNSIIKSHVESGLFVSALLLYKKMRELGVEHDGFTFPMVNRIIMSIQLDVVYAGMVHC 60

Query: 61  VGIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVS 120
           VGIRMGF ADLYFCNTMM+VYGKC CLV ARNVFDEMP+RDLVSWTSMIS YV  GDVVS
Sbjct: 61  VGIRMGFGADLYFCNTMMEVYGKCGCLVSARNVFDEMPHRDLVSWTSMISVYVCRGDVVS 120

Query: 121 ALDLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQM 180
            LDLFEGMRRELEPNSVT+MVM+QACCAT NL LGR LQ HV K+GLLFD GLQNS L+M
Sbjct: 121 GLDLFEGMRRELEPNSVTIMVMVQACCATGNLSLGRQLQSHVFKNGLLFDIGLQNSLLRM 180

Query: 181 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLT 240
           Y+RLG EDEVG+FFSE+  KNVVSWN+ +SFYSS GD +KVVD FNKIMGEV LS+ETLT
Sbjct: 181 YTRLGGEDEVGVFFSEVDRKNVVSWNVFISFYSSRGDFVKVVDIFNKIMGEVLLSVETLT 240

Query: 241 ILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPN 300
           IL+SATA  DS  LILG+NLHSLAIKSGLYDGIL+TS LDMYAKFGELENSTRLFK IP 
Sbjct: 241 ILVSATAAPDSEHLILGKNLHSLAIKSGLYDGILQTSFLDMYAKFGELENSTRLFKEIPR 300

Query: 301 RSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAI 360
           +SIITWGAMMSSFIQNGHFD AVEIF QMQ AGLKPSVGI+KHLIDAY +LG LQLGKAI
Sbjct: 301 KSIITWGAMMSSFIQNGHFDGAVEIFNQMQAAGLKPSVGILKHLIDAYTHLGVLQLGKAI 360

Query: 361 HCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHG 420
           HCYLIR+ GLEI NT L TS+LNMY RCGS+ SA KCFDLILIKDVVAWTSMI+ YGAHG
Sbjct: 361 HCYLIRLNGLEIYNTQLGTSILNMYVRCGSLVSAIKCFDLILIKDVVAWTSMIEGYGAHG 420

Query: 421 LGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHY 480
           LG DALNLF QMM EEV PNNVTFLSLLSACSHSGLVSEGC+IFYSMRSRF+I P+LEHY
Sbjct: 421 LGFDALNLFLQMMREEVTPNNVTFLSLLSACSHSGLVSEGCQIFYSMRSRFNINPDLEHY 480

Query: 481 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEP 540
           TCFVDLLSRSTRVREAFAI LRMTN  DGRIWGALMGACRVY DNKIANYAA RLLELEP
Sbjct: 481 TCFVDLLSRSTRVREAFAIILRMTNFRDGRIWGALMGACRVYEDNKIANYAAHRLLELEP 540

Query: 541 DNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 600
           DNVGYYTLLSN+QA+VGQWH+VEKLRSVVYEKDLVKKPGWSFIEL G +HGFVSGDRSH+
Sbjct: 541 DNVGYYTLLSNAQATVGQWHDVEKLRSVVYEKDLVKKPGWSFIELKGIVHGFVSGDRSHD 600

Query: 601 KTNEIYDLL 610
           KT EIYDLL
Sbjct: 601 KTKEIYDLL 609

BLAST of HG10017615 vs. ExPASy TrEMBL
Match: A0A2N9FID5 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS14662 PE=4 SV=1)

HSP 1 Score: 718.4 bits (1853), Expect = 2.7e-203
Identity = 361/611 (59.08%), Positives = 450/611 (73.65%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHCVG 62
           WN IIKSH D GLF SALLLYK+MR +GV HD FTFPIVN  ++S+  DV+Y  M HCV 
Sbjct: 28  WNLIIKSHLDLGLFDSALLLYKTMRHLGVAHDSFTFPIVNQAVLSLQSDVIYGEMVHCVS 87

Query: 63  IRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSAL 122
            +MGF  ++YFCNTM++VY KC C+VYAR +FDEM  RDLVSWTSMIS YV  G V SA 
Sbjct: 88  TKMGFGFEVYFCNTMIEVYVKCGCVVYARKLFDEMSQRDLVSWTSMISGYVCEGSVGSAF 147

Query: 123 DLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQMYS 182
            LF  M  + EPNSVT+++MLQACCA ++L+ G  L  + +KSGL  D  LQNS L+MY+
Sbjct: 148 YLFREMMVKSEPNSVTLIIMLQACCAGESLIHGMQLHGYAIKSGLESDGSLQNSVLKMYT 207

Query: 183 RLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLTIL 242
           R G  +EV IFFS+I  K+ VSWNIL+SFYS  GDI K+V+ F+++ G   LSIETLT+L
Sbjct: 208 RTGSVEEVEIFFSKIDRKDDVSWNILISFYSMKGDIEKLVNRFSEMQGIAALSIETLTLL 267

Query: 243 ISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPNRS 302
           ISA A S    L  GE +H LAIKSG  D +L TSLLD YAK G++E S +LF+ I  R+
Sbjct: 268 ISAFAKSRD--LFQGEQIHCLAIKSGFCDDVLLTSLLDFYAKCGKIEISDQLFRKISYRN 327

Query: 303 IITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAIHC 362
            +T+GAMMS F+QNG+  DA+ +F QMQ A ++P   I++ ++DAY  LGALQLGKAIH 
Sbjct: 328 NVTFGAMMSGFVQNGYVKDAINLFHQMQAANVEPGAEILRSILDAYTQLGALQLGKAIHG 387

Query: 363 YLIRIYGLEIC--NTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHG 422
           Y IR          T+LE S+LNMY RCG+++SAR  F  IL+KD+V WT+MI+ +G HG
Sbjct: 388 YFIRHIFCRTMEETTYLEASILNMYIRCGNISSARVSFHNILVKDLVIWTTMIEGFGTHG 447

Query: 423 LGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHY 482
           LG +AL LF  M+ E + PN+VTFLSLLSACSHSGLV EGCE++ SM+  F I+PNL+HY
Sbjct: 448 LGSEALELFGLMLKERIKPNSVTFLSLLSACSHSGLVREGCEVYNSMKWIFGIQPNLDHY 507

Query: 483 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEP 542
           TC VDLL R  +++EA AI ++M    DGRIWGAL+ ACRV+GD K+  Y A RLLELEP
Sbjct: 508 TCMVDLLGRYGKLKEALAIIVKMVIFSDGRIWGALLAACRVHGDIKLGEYTAQRLLELEP 567

Query: 543 DNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 602
           DNVGY+TLLSN QA VG+W EVE++R V+ EKDL KKPGWS  E  G IHGFVS DRSH+
Sbjct: 568 DNVGYHTLLSNVQAGVGRWDEVEEVRRVMNEKDLKKKPGWSCFEAKGMIHGFVSADRSHH 627

Query: 603 KTNEIYDLLAM 612
           +  EIYD+L +
Sbjct: 628 QVEEIYDILGL 636

BLAST of HG10017615 vs. TAIR 10
Match: AT4G35130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 397.9 bits (1021), Expect = 1.6e-110
Identity = 215/611 (35.19%), Positives = 350/611 (57.28%), Query Frame = 0

Query: 2   LWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHCV 61
           LWN +IK     GL++ A+  Y  M   GV+ D FT+P V   +  I   +      H +
Sbjct: 97  LWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVIKSVAGI-SSLEEGKKIHAM 156

Query: 62  GIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSA 121
            I++GF +D+Y CN+++ +Y K  C   A  VF+EMP RD+VSW SMIS Y+  GD  S+
Sbjct: 157 VIKLGFVSDVYVCNSLISLYMKLGCAWDAEKVFEEMPERDIVSWNSMISGYLALGDGFSS 216

Query: 122 LDLFEGMRR-ELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGL-LFDTGLQNSFLQ 181
           L LF+ M +   +P+  + M  L AC    +  +G+ + CH V+S +   D  +  S L 
Sbjct: 217 LMLFKEMLKCGFKPDRFSTMSALGACSHVYSPKMGKEIHCHAVRSRIETGDVMVMTSILD 276

Query: 182 MYSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETL 241
           MYS+ GE       F+ +  +N+V+WN+++  Y+  G +      F K+  +  L  + +
Sbjct: 277 MYSKYGEVSYAERIFNGMIQRNIVAWNVMIGCYARNGRVTDAFLCFQKMSEQNGLQPDVI 336

Query: 242 TILISATATSDSGCLILGENLHSLAIKSG-LYDGILRTSLLDMYAKFGELENSTRLFKGI 301
           T +    A++    ++ G  +H  A++ G L   +L T+L+DMY + G+L+++  +F  +
Sbjct: 337 TSINLLPASA----ILEGRTIHGYAMRRGFLPHMVLETALIDMYGECGQLKSAEVIFDRM 396

Query: 302 PNRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGK 361
             +++I+W +++++++QNG    A+E+F+++  + L P    I  ++ AYA   +L  G+
Sbjct: 397 AEKNVISWNSIIAAYVQNGKNYSALELFQELWDSSLVPDSTTIASILPAYAESLSLSEGR 456

Query: 362 AIHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGA 421
            IH Y+++       NT +  SL++MY  CG +  ARKCF+ IL+KDVV+W S+I  Y  
Sbjct: 457 EIHAYIVK--SRYWSNTIILNSLVHMYAMCGDLEDARKCFNHILLKDVVSWNSIIMAYAV 516

Query: 422 HGLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLE 481
           HG G  ++ LF +M++  V PN  TF SLL+ACS SG+V EG E F SM+  + I P +E
Sbjct: 517 HGFGRISVWLFSEMIASRVNPNKSTFASLLAACSISGMVDEGWEYFESMKREYGIDPGIE 576

Query: 482 HYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLEL 541
           HY C +DL+ R+     A      M  +   RIWG+L+ A R + D  IA +AA+++ ++
Sbjct: 577 HYGCMLDLIGRTGNFSAAKRFLEEMPFVPTARIWGSLLNASRNHKDITIAEFAAEQIFKM 636

Query: 542 EPDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRS 601
           E DN G Y LL N  A  G+W +V +++ ++  K + +    S +E  G  H F +GDRS
Sbjct: 637 EHDNTGCYVLLLNMYAEAGRWEDVNRIKLLMESKGISRTSSRSTVEAKGKSHVFTNGDRS 696

Query: 602 HNKTNEIYDLL 610
           H  TN+IY++L
Sbjct: 697 HVATNKIYEVL 700

BLAST of HG10017615 vs. TAIR 10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 352.1 bits (902), Expect = 9.8e-97
Identity = 198/622 (31.83%), Positives = 344/622 (55.31%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHCVG 62
           WNS+I  +   G +  AL +Y  ++   +  D FT   V     ++ V     G+ H   
Sbjct: 175 WNSLISGYSSHGYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGL-HGFA 234

Query: 63  IRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSAL 122
           ++ G ++ +   N ++ +Y K      AR VFDEM  RD VS+ +MI  Y+    V  ++
Sbjct: 235 LKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESV 294

Query: 123 DLFEGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQMYS 182
            +F     + +P+ +TV  +L+AC   ++L L + +  +++K+G + ++ ++N  + +Y+
Sbjct: 295 RMFLENLDQFKPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYA 354

Query: 183 RLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLTIL 242
           + G+       F+ + CK+ VSWN ++S Y   GD+++ +  F K+M  +    + +T L
Sbjct: 355 KCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDLMEAMKLF-KMMMIMEEQADHITYL 414

Query: 243 ISATATSDSGCLILGENLHSLAIKSGL-YDGILRTSLLDMYAKFGELENSTRLFKGIPNR 302
           +  + ++    L  G+ LHS  IKSG+  D  +  +L+DMYAK GE+ +S ++F  +   
Sbjct: 415 MLISVSTRLADLKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTG 474

Query: 303 SIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAIH 362
             +TW  ++S+ ++ G F   +++  QM+ + + P +      +   A L A +LGK IH
Sbjct: 475 DTVTWNTVISACVRFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIH 534

Query: 363 CYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHGL 422
           C L+R +G E     +  +L+ MY +CG + ++ + F+ +  +DVV WT MI  YG +G 
Sbjct: 535 CCLLR-FGYE-SELQIGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGE 594

Query: 423 GIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHYT 482
           G  AL  F  M    + P++V F++++ ACSHSGLV EG   F  M++ + I P +EHY 
Sbjct: 595 GEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYA 654

Query: 483 CFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEPD 542
           C VDLLSRS ++ +A      M    D  IW +++ ACR  GD + A   + R++EL PD
Sbjct: 655 CVVDLLSRSQKISKAEEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPD 714

Query: 543 NVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHNK 602
           + GY  L SN+ A++ +W +V  +R  + +K + K PG+S+IE+   +H F SGD S  +
Sbjct: 715 DPGYSILASNAYAALRKWDKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQ 774

Query: 603 TNEIYDLLAMELISSTALKFGF 624
           +  IY   ++E++ S   K G+
Sbjct: 775 SEAIYK--SLEILYSLMAKEGY 790

BLAST of HG10017615 vs. TAIR 10
Match: AT1G69350.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 350.5 bits (898), Expect = 2.9e-96
Identity = 193/609 (31.69%), Positives = 345/609 (56.65%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAHCVG 62
           W++++ S  ++G  + AL ++K M + GVE D  T   V      +   +  A   H   
Sbjct: 170 WSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVTMISVVEGCAELGC-LRIARSVHGQI 229

Query: 63  IRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSAL 122
            R  F  D   CN+++ +Y KC  L+ +  +F+++  ++ VSWT+MIS+Y  G     AL
Sbjct: 230 TRKMFDLDETLCNSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYNRGEFSEKAL 289

Query: 123 DLF-EGMRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFD-TGLQNSFLQM 182
             F E ++  +EPN VT+  +L +C     +  G+ +    V+  L  +   L  + +++
Sbjct: 290 RSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVEL 349

Query: 183 YSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLSIETLT 242
           Y+  G+  +       +  +N+V+WN L+S Y+  G +++ +  F +++ +  +  +  T
Sbjct: 350 YAECGKLSDCETVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQ-RIKPDAFT 409

Query: 243 ILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLFKGIPN 302
           +  S +A  ++G + LG+ +H   I++ + D  ++ SL+DMY+K G +++++ +F  I +
Sbjct: 410 LASSISACENAGLVPLGKQIHGHVIRTDVSDEFVQNSLIDMYSKSGSVDSASTVFNQIKH 469

Query: 303 RSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQLGKAI 362
           RS++TW +M+  F QNG+  +A+ +F  M  + L+ +      +I A + +G+L+ GK +
Sbjct: 470 RSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKWV 529

Query: 363 HCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDVYGAHG 422
           H  LI I GL+  +   +T+L++MY +CG + +A   F  +  + +V+W+SMI+ YG HG
Sbjct: 530 HHKLI-ISGLK--DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMHG 589

Query: 423 LGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIKPNLEHY 482
               A++ F QM+     PN V F+++LSAC HSG V EG + ++++   F + PN EH+
Sbjct: 590 RIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEG-KYYFNLMKSFGVSPNSEHF 649

Query: 483 TCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADRLLELEP 542
            CF+DLLSRS  ++EA+     M  L D  +WG+L+  CR++    I     + L ++  
Sbjct: 650 ACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIVT 709

Query: 543 DNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVSGDRSHN 602
           D+ GYYTLLSN  A  G+W E  +LRS +   +L K PG+S IE++  +  F +G+ +  
Sbjct: 710 DDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGEENRI 769

Query: 603 KTNEIYDLL 610
           +T+EIY  L
Sbjct: 770 QTDEIYRFL 772

BLAST of HG10017615 vs. TAIR 10
Match: AT3G63370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 350.1 bits (897), Expect = 3.7e-96
Identity = 202/616 (32.79%), Positives = 342/616 (55.52%), Query Frame = 0

Query: 1   MLWNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGM-AH 60
           +LWNSI+ S+  SG  L  L L++ M   G   + +T  IV+ +           G   H
Sbjct: 250 VLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYT--IVSALTACDGFSYAKLGKEIH 309

Query: 61  CVGIRMG-FSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDV 120
              ++    S++LY CN ++ +Y +C  +  A  +  +M N D+V+W S+I  YV     
Sbjct: 310 ASVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMY 369

Query: 121 VSALDLFEGM-RRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQ--N 180
             AL+ F  M     + + V++  ++ A     NL+ G  L  +V+K G  +D+ LQ  N
Sbjct: 370 KEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHG--WDSNLQVGN 429

Query: 181 SFLQMYSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIMGEVPLS 240
           + + MYS+      +G  F  +H K+++SW  +++ Y+     ++ ++ F  +  +  + 
Sbjct: 430 TLIDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDV-AKKRME 489

Query: 241 IETLTILISATATSDSGCLILGENLHSLAIKSGLYDGILRTSLLDMYAKFGELENSTRLF 300
           I+ + +     A+S    +++ + +H   ++ GL D +++  L+D+Y K   +  +TR+F
Sbjct: 490 IDEMILGSILRASSVLKSMLIVKEIHCHILRKGLLDTVIQNELVDVYGKCRNMGYATRVF 549

Query: 301 KGIPNRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTAGLKPSVGIIKHLIDAYAYLGALQ 360
           + I  + +++W +M+SS   NG+  +AVE+F++M   GL      +  ++ A A L AL 
Sbjct: 550 ESIKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALN 609

Query: 361 LGKAIHCYLIRI-YGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMID 420
            G+ IHCYL+R  + LE     +  ++++MY  CG + SA+  FD I  K ++ +TSMI+
Sbjct: 610 KGREIHCYLLRKGFCLE---GSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMIN 669

Query: 421 VYGAHGLGIDALNLFRQMMSEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIK 480
            YG HG G  A+ LF +M  E V+P++++FL+LL ACSH+GL+ EG      M   + ++
Sbjct: 670 AYGMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELE 729

Query: 481 PNLEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADR 540
           P  EHY C VD+L R+  V EAF     M       +W AL+ ACR + + +I   AA R
Sbjct: 730 PWPEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQR 789

Query: 541 LLELEPDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVS 600
           LLELEP N G   L+SN  A  G+W++VEK+R+ +    + K PG S+IE++G +H F +
Sbjct: 790 LLELEPKNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTA 849

Query: 601 GDRSHNKTNEIYDLLA 611
            D+SH ++ EIY+ L+
Sbjct: 850 RDKSHPESKEIYEKLS 857

BLAST of HG10017615 vs. TAIR 10
Match: AT3G01580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 349.7 bits (896), Expect = 4.9e-96
Identity = 213/615 (34.63%), Positives = 326/615 (53.01%), Query Frame = 0

Query: 3   WNSIIKSHFDSGLFLSALLLYKSMREVGVEHDGFTFPIVNHVIMSIWVDVVYAGMAH-CV 62
           WN+++KS      +   L  +  M     + D FT P+       +  +V Y  M H  V
Sbjct: 28  WNTLLKSLSREKQWEEVLYHFSHMFRDEEKPDNFTLPVALKACGEL-REVNYGEMIHGFV 87

Query: 63  GIRMGFSADLYFCNTMMQVYGKCECLVYARNVFDEMPNRDLVSWTSMISAYVNGGDVVSA 122
              +   +DLY  ++++ +Y KC  ++ A  +FDE+   D+V+W+SM+S +   G    A
Sbjct: 88  KKDVTLGSDLYVGSSLIYMYIKCGRMIEALRMFDELEKPDIVTWSSMVSGFEKNGSPYQA 147

Query: 123 LDLFEG--MRRELEPNSVTVMVMLQACCATQNLVLGRLLQCHVVKSGLLFDTGLQNSFLQ 182
           ++ F    M  ++ P+ VT++ ++ AC    N  LGR +   V++ G   D  L NS L 
Sbjct: 148 VEFFRRMVMASDVTPDRVTLITLVSACTKLSNSRLGRCVHGFVIRRGFSNDLSLVNSLLN 207

Query: 183 MYSRLGEEDEVGIFFSEIHCKNVVSWNILMSFYSSMGDILKVVDTFNKIM--GEVPLSIE 242
            Y++     E    F  I  K+V+SW+ +++ Y   G   + +  FN +M  G  P    
Sbjct: 208 CYAKSRAFKEAVNLFKMIAEKDVISWSTVIACYVQNGAAAEALLVFNDMMDDGTEPNVAT 267

Query: 243 TLTILISATATSDSGCLILGENLHSLAIKSGLYDGI-LRTSLLDMYAKFGELENSTRLFK 302
            L +L +  A  D   L  G   H LAI+ GL   + + T+L+DMY K    E +  +F 
Sbjct: 268 VLCVLQACAAAHD---LEQGRKTHELAIRKGLETEVKVSTALVDMYMKCFSPEEAYAVFS 327

Query: 303 GIPNRSIITWGAMMSSFIQNGHFDDAVEIFKQMQTA-GLKPSVGIIKHLIDAYAYLGALQ 362
            IP + +++W A++S F  NG    ++E F  M      +P   ++  ++ + + LG L+
Sbjct: 328 RIPRKDVVSWVALISGFTLNGMAHRSIEEFSIMLLENNTRPDAILMVKVLGSCSELGFLE 387

Query: 363 LGKAIHCYLIRIYGLEICNTHLETSLLNMYGRCGSVASARKCFDLILIKDVVAWTSMIDV 422
             K  H Y+I+ YG +  N  +  SL+ +Y RCGS+ +A K F+ I +KD V WTS+I  
Sbjct: 388 QAKCFHSYVIK-YGFD-SNPFIGASLVELYSRCGSLGNASKVFNGIALKDTVVWTSLITG 447

Query: 423 YGAHGLGIDALNLFRQMM-SEEVAPNNVTFLSLLSACSHSGLVSEGCEIFYSMRSRFHIK 482
           YG HG G  AL  F  M+ S EV PN VTFLS+LSACSH+GL+ EG  IF  M + + + 
Sbjct: 448 YGIHGKGTKALETFNHMVKSSEVKPNEVTFLSILSACSHAGLIHEGLRIFKLMVNDYRLA 507

Query: 483 PNLEHYTCFVDLLSRSTRVREAFAITLRMTNLCDGRIWGALMGACRVYGDNKIANYAADR 542
           PNLEHY   VDLL R   +  A  IT RM      +I G L+GACR++ + ++A   A +
Sbjct: 508 PNLEHYAVLVDLLGRVGDLDTAIEITKRMPFSPTPQILGTLLGACRIHQNGEMAETVAKK 567

Query: 543 LLELEPDNVGYYTLLSNSQASVGQWHEVEKLRSVVYEKDLVKKPGWSFIELNGTIHGFVS 602
           L ELE ++ GYY L+SN     G+W  VEKLR+ V ++ + K    S IE+   +H FV+
Sbjct: 568 LFELESNHAGYYMLMSNVYGVKGEWENVEKLRNSVKQRGIKKGLAESLIEIRRKVHRFVA 627

Query: 603 GDRSHNKTNEIYDLL 610
            D  H +   +Y LL
Sbjct: 628 DDELHPEKEPVYGLL 636

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038883286.1  0.0e+00   92.12         pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Benin...
XP_008457591.1  0.0e+00   89.82         PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic ...
XP_031739775.1  0.0e+00   88.51         pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucum...
XP_023526509.1  0.0e+00   86.39         pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucur...
KAG6603417.1    0.0e+00   86.39         Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name      E-value   Identity (%)  Description
O49619          2.2e-109  35.19         Pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Arabidop...
Q9SS60          1.4e-95   31.83         Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX...
Q9C507          4.0e-95   31.69         Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS...
Q9M1V3          5.2e-95   32.79         Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...
Q9SS97          6.9e-95   34.63         Putative pentatricopeptide repeat-containing protein At3g01580 OS=Arabidopsis th...

Match Name      E-value   Identity (%)  Description
A0A1S3C6I7      0.0e+00   89.82         pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Cucumis ...
A0A6J1EYH5      4.8e-310  85.74         pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like OS=Cuc...
A0A6J1HTH3      9.3e-305  84.92         pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like OS=Cuc...
A0A6J1DKA1      2.6e-299  84.07         pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like OS=Mom...
A0A2N9FID5      2.7e-203  59.08         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS14662 PE=4 SV=1

Match Name      E-value   Identity (%)  Description
AT4G35130.1     1.6e-110  35.19         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G03580.1     9.8e-97   31.83         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G69350.1     2.9e-96   31.69         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G63370.1     3.7e-96   32.79         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G01580.1     4.9e-96   34.63         Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                    Source   Source Term     Source Description
    Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535         PPR
    coord: 304..333; e-value: 1.0E-6; score: 28.5
    coord: 203..229; e-value: 0.027; score: 14.7
    coord: 74..100; e-value: 0.0061; score: 16.7
    coord: 3..31; e-value: 0.01; score: 16.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041         PPR_2
    coord: 404..452; e-value: 1.6E-9; score: 37.8
    coord: 101..147; e-value: 4.7E-10; score: 39.5
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756       TIGR00756
    coord: 304..338; e-value: 6.8E-9; score: 33.4
    coord: 442..476; e-value: 3.2E-4; score: 18.7
    coord: 2..34; e-value: 0.001; score: 17.1
    coord: 103..130; e-value: 7.6E-6; score: 23.8
    coord: 407..440; e-value: 2.0E-5; score: 22.5
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR
    coord: 405..439; score: 10.676364
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR
    coord: 302..336; score: 12.638436
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR
    coord: 101..131; score: 9.909073
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain
    coord: 2..158; e-value: 3.3E-23; score: 84.5
    coord: 374..599; e-value: 6.2E-32; score: 113.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain
    coord: 159..252; e-value: 5.6E-8; score: 34.3
    coord: 267..359; e-value: 2.0E-15; score: 58.6
None       No IPR available                                   PANTHER  PTHR47928:SF75  REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
    coord: 1..604
None       No IPR available                                   PANTHER  PTHR47928       REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
    coord: 1..604
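
The IPR002885 (pentatricopeptide repeat) hits from PFAM, TIGRFAM and PROSITE overlap heavily, so it can help to collapse them into non-redundant regions. The following is a minimal Python sketch of that bookkeeping, not part of the original analysis. The coordinates are copied from the alignments listed above; the function name `merge_intervals` and the choice to join directly adjacent hits (for example 74..100 and 101..147) are assumptions made for illustration.

```python
def merge_intervals(intervals):
    """Merge inclusive (start, end) intervals that overlap or are directly adjacent."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# IPR002885 alignment coordinates as listed in the InterPro table.
ppr_hits = [
    (304, 333), (203, 229), (74, 100), (3, 31),                 # PFAM PF01535
    (404, 452), (101, 147),                                     # PFAM PF13041
    (304, 338), (442, 476), (2, 34), (103, 130), (407, 440),    # TIGRFAM TIGR00756
    (405, 439), (302, 336), (101, 131),                         # PROSITE PS51375
]

ppr_regions = merge_intervals(ppr_hits)

# Coordinates are inclusive, so each region spans end - start + 1 residues.
covered_residues = sum(end - start + 1 for start, end in ppr_regions)

print(ppr_regions)
print(covered_residues)
```

Under these assumptions the fourteen hits collapse into five regions. Treating adjacent hits as separate regions instead only requires dropping the `+ 1` in the comparison.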

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10017615.1  HG10017615.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding