HG10022627 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022627
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr05: 26421956 .. 26423794 (+)
RNA-Seq ExpressionHG10022627
SyntenyHG10022627
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTTACTCAACAGCCCAGAATGTTGTTCATCTGCTTCGGCGCCTTGGAAGGACTGGCATGGTTGATGAGGCGCTTGCTGCGTTTTCTACACTCGATTCACATGCGAAAAACACAAATGTTCGTAATGTGATCATTGATTTGCTTCTGAAATCTGGGCGAGTTGACAATGCATTGAAAGTGCTCGACGAAATGCTTCTTCCAGGTTCGGAGTTTCGACCCAATTATGTAACTGCTGATATTGTAATCAATAAATTGTTGAAGATAAATGAGTCAGAGGGGAGAGTCAAGGAAGATGATATTGCCGGGTTGGTAGCCAAATTTGGTAAACATAACGTTGTTCCAAATCCCATTACATTGACGCAGTTGATATCGAAGCTTTGTAGGAGTAGAAATACAAATCTTGCATGGAATGTTTTAGATAATGTGATGATGTTGAATGGTATTAAGGATGCTGCACCTTGCAATGCGCTTTTGTCAGGATTGGGAAAGGAGAGGGAATTTGAGAAGATGAATCTGTTGATGCAAAAGATGAAAGACATGAATATTCAGCCTGATGTTATTACTTTTAGTATTCTTATCAACCATTTGTGTAAGTTCAGAAGGATTGAAGATGCATTAAAGGTGTTTGAGAAAATGAAAGAGGAAAAGGGGGAGGCGAAGGTTGTTGTACCTGATGCAATCATGTACAATACTTTGATTGATGGGCTCTGTAAAGTGGGGAGGCATGAAGAAGGATTGCGCTTGATGGAAATGATGAGATCAGATGGCTGTGCACCCAATACTGTTACGTACAATTGTTTGATTGATGGTTTCTGTAAGGCAGGTGAAATTGAGAGAGCTCATGAGCTCTTTAATGAGATGGTAAGTGAGCAAGTTGCGCCGAATGTAATTACCCTCAATACTTTAGTCGATGGAATGTGCAAGCATGACAGAGTAAGCAGCGCAGTTGAGTTCTTCAGGGAAATGCAGCAGAAGGGCTTGAAAGGAAATTCTGTTACTTACACGGTGTTCACTAATGCTTTTTGCAAAGCCAACAATATGGACAAGGCAATGGAATTTTTGGATGAAATGTCCAAAGATGGATGTTTGCCTGATGCCATTGTGTATTATACTTTGATATGTGGTTTAGCCCAAGCTGGAAGGTTGGATGATGCCAGCTCTGTTGTGTCAAAGTTGAAAGAGGCAGGGTTCTGTCTAGATCTTGTTTGCTACAATATTCTTATCAGTGAGTTCTGTAAGAGGAATAAATTAGATAGAGCTAATGAATTGTTGAATGAAATGGAGGTGGTTGGAGTTAAGCCTGACAGTGTCACATACAACACTTTGATTTCCTACTTCAGTAAAATTGGGAATTTCAAACTAGCTCACAAATTTATGAATAAGATGACCAAGGAGGAAGGTCTTTCACCCACTGTCTTCACTTATGGAGCTCTTATTCATGCATATTGCTTGAACAACAACACTGATGAAGCCATGAAGCTTTTCAAGGAAATGAATGCTACATCAAAGGTTCCTCCTAACACAGTAATATACAATATATTGATAGATTCTTTATGCAAAAATAATGAGGTCAATTACGCTCTTTCTCTATTGAATGATATGAAATTTAGAGGGGTGAAGCCAAACACCACAACGTACAATGCTATTTTCAAAGCCCTTAGGGAGAAGAATTGGTTGGACAAAGCATTCGAACTTATGGATAGAATGCGCGAGCAGGCGTGTGATGCCGATTATATAACAATGGAGATTTTAACCGAATGGCTTTCTTCAGTTGGTGAAACTACAAAATTGAAGAAGTTTACGCAGGGATACAGTGTTTCTGATTCTGCAGCCTAA

mRNA sequence

ATGTCTTACTCAACAGCCCAGAATGTTGTTCATCTGCTTCGGCGCCTTGGAAGGACTGGCATGGTTGATGAGGCGCTTGCTGCGTTTTCTACACTCGATTCACATGCGAAAAACACAAATGTTCGTAATGTGATCATTGATTTGCTTCTGAAATCTGGGCGAGTTGACAATGCATTGAAAGTGCTCGACGAAATGCTTCTTCCAGGTTCGGAGTTTCGACCCAATTATGTAACTGCTGATATTGTAATCAATAAATTGTTGAAGATAAATGAGTCAGAGGGGAGAGTCAAGGAAGATGATATTGCCGGGTTGGTAGCCAAATTTGGTAAACATAACGTTGTTCCAAATCCCATTACATTGACGCAGTTGATATCGAAGCTTTGTAGGAGTAGAAATACAAATCTTGCATGGAATGTTTTAGATAATGTGATGATGTTGAATGGTATTAAGGATGCTGCACCTTGCAATGCGCTTTTGTCAGGATTGGGAAAGGAGAGGGAATTTGAGAAGATGAATCTGTTGATGCAAAAGATGAAAGACATGAATATTCAGCCTGATGTTATTACTTTTAGTATTCTTATCAACCATTTGTGTAAGTTCAGAAGGATTGAAGATGCATTAAAGGTGTTTGAGAAAATGAAAGAGGAAAAGGGGGAGGCGAAGGTTGTTGTACCTGATGCAATCATGTACAATACTTTGATTGATGGGCTCTGTAAAGTGGGGAGGCATGAAGAAGGATTGCGCTTGATGGAAATGATGAGATCAGATGGCTGTGCACCCAATACTGTTACGTACAATTGTTTGATTGATGGTTTCTGTAAGGCAGGTGAAATTGAGAGAGCTCATGAGCTCTTTAATGAGATGGTAAGTGAGCAAGTTGCGCCGAATGTAATTACCCTCAATACTTTAGTCGATGGAATGTGCAAGCATGACAGAGTAAGCAGCGCAGTTGAGTTCTTCAGGGAAATGCAGCAGAAGGGCTTGAAAGGAAATTCTGTTACTTACACGGTGTTCACTAATGCTTTTTGCAAAGCCAACAATATGGACAAGGCAATGGAATTTTTGGATGAAATGTCCAAAGATGGATGTTTGCCTGATGCCATTGTGTATTATACTTTGATATGTGGTTTAGCCCAAGCTGGAAGGTTGGATGATGCCAGCTCTGTTGTGTCAAAGTTGAAAGAGGCAGGGTTCTGTCTAGATCTTGTTTGCTACAATATTCTTATCAGTGAGTTCTGTAAGAGGAATAAATTAGATAGAGCTAATGAATTGTTGAATGAAATGGAGGTGGTTGGAGTTAAGCCTGACAGTGTCACATACAACACTTTGATTTCCTACTTCAGTAAAATTGGGAATTTCAAACTAGCTCACAAATTTATGAATAAGATGACCAAGGAGGAAGGTCTTTCACCCACTGTCTTCACTTATGGAGCTCTTATTCATGCATATTGCTTGAACAACAACACTGATGAAGCCATGAAGCTTTTCAAGGAAATGAATGCTACATCAAAGGTTCCTCCTAACACAGTAATATACAATATATTGATAGATTCTTTATGCAAAAATAATGAGGTCAATTACGCTCTTTCTCTATTGAATGATATGAAATTTAGAGGGGTGAAGCCAAACACCACAACGTACAATGCTATTTTCAAAGCCCTTAGGGAGAAGAATTGGTTGGACAAAGCATTCGAACTTATGGATAGAATGCGCGAGCAGGCGTGTGATGCCGATTATATAACAATGGAGATTTTAACCGAATGGCTTTCTTCAGTTGGTGAAACTACAAAATTGAAGAAGTTTACGCAGGGATACAGTGTTTCTGATTCTGCAGCCTAA

Coding sequence (CDS)

ATGTCTTACTCAACAGCCCAGAATGTTGTTCATCTGCTTCGGCGCCTTGGAAGGACTGGCATGGTTGATGAGGCGCTTGCTGCGTTTTCTACACTCGATTCACATGCGAAAAACACAAATGTTCGTAATGTGATCATTGATTTGCTTCTGAAATCTGGGCGAGTTGACAATGCATTGAAAGTGCTCGACGAAATGCTTCTTCCAGGTTCGGAGTTTCGACCCAATTATGTAACTGCTGATATTGTAATCAATAAATTGTTGAAGATAAATGAGTCAGAGGGGAGAGTCAAGGAAGATGATATTGCCGGGTTGGTAGCCAAATTTGGTAAACATAACGTTGTTCCAAATCCCATTACATTGACGCAGTTGATATCGAAGCTTTGTAGGAGTAGAAATACAAATCTTGCATGGAATGTTTTAGATAATGTGATGATGTTGAATGGTATTAAGGATGCTGCACCTTGCAATGCGCTTTTGTCAGGATTGGGAAAGGAGAGGGAATTTGAGAAGATGAATCTGTTGATGCAAAAGATGAAAGACATGAATATTCAGCCTGATGTTATTACTTTTAGTATTCTTATCAACCATTTGTGTAAGTTCAGAAGGATTGAAGATGCATTAAAGGTGTTTGAGAAAATGAAAGAGGAAAAGGGGGAGGCGAAGGTTGTTGTACCTGATGCAATCATGTACAATACTTTGATTGATGGGCTCTGTAAAGTGGGGAGGCATGAAGAAGGATTGCGCTTGATGGAAATGATGAGATCAGATGGCTGTGCACCCAATACTGTTACGTACAATTGTTTGATTGATGGTTTCTGTAAGGCAGGTGAAATTGAGAGAGCTCATGAGCTCTTTAATGAGATGGTAAGTGAGCAAGTTGCGCCGAATGTAATTACCCTCAATACTTTAGTCGATGGAATGTGCAAGCATGACAGAGTAAGCAGCGCAGTTGAGTTCTTCAGGGAAATGCAGCAGAAGGGCTTGAAAGGAAATTCTGTTACTTACACGGTGTTCACTAATGCTTTTTGCAAAGCCAACAATATGGACAAGGCAATGGAATTTTTGGATGAAATGTCCAAAGATGGATGTTTGCCTGATGCCATTGTGTATTATACTTTGATATGTGGTTTAGCCCAAGCTGGAAGGTTGGATGATGCCAGCTCTGTTGTGTCAAAGTTGAAAGAGGCAGGGTTCTGTCTAGATCTTGTTTGCTACAATATTCTTATCAGTGAGTTCTGTAAGAGGAATAAATTAGATAGAGCTAATGAATTGTTGAATGAAATGGAGGTGGTTGGAGTTAAGCCTGACAGTGTCACATACAACACTTTGATTTCCTACTTCAGTAAAATTGGGAATTTCAAACTAGCTCACAAATTTATGAATAAGATGACCAAGGAGGAAGGTCTTTCACCCACTGTCTTCACTTATGGAGCTCTTATTCATGCATATTGCTTGAACAACAACACTGATGAAGCCATGAAGCTTTTCAAGGAAATGAATGCTACATCAAAGGTTCCTCCTAACACAGTAATATACAATATATTGATAGATTCTTTATGCAAAAATAATGAGGTCAATTACGCTCTTTCTCTATTGAATGATATGAAATTTAGAGGGGTGAAGCCAAACACCACAACGTACAATGCTATTTTCAAAGCCCTTAGGGAGAAGAATTGGTTGGACAAAGCATTCGAACTTATGGATAGAATGCGCGAGCAGGCGTGTGATGCCGATTATATAACAATGGAGATTTTAACCGAATGGCTTTCTTCAGTTGGTGAAACTACAAAATTGAAGAAGTTTACGCAGGGATACAGTGTTTCTGATTCTGCAGCCTAA

Protein sequence

MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALKVLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITLTQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDMNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIHAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVKPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKKFTQGYSVSDSAA
Homology
BLAST of HG10022627 vs. NCBI nr
Match: XP_038898803.1 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benincasa hispida] >XP_038898805.1 pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benincasa hispida] >XP_038898806.1 pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benincasa hispida] >XP_038898807.1 pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 1070.1 bits (2766), Expect = 7.1e-309
Identity = 535/613 (87.28%), Postives = 568/613 (92.66%), Query Frame = 0

Query: 1   MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALK 60
           MS  TAQ+ VHLLRRLGRTGMVDEALAAFSTLDSHAKNT VRNVII LLLKSGRVDNAL 
Sbjct: 1   MSLLTAQSAVHLLRRLGRTGMVDEALAAFSTLDSHAKNTKVRNVIIALLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSEFRPNYVTADIVINKLL-KINESEGRVKEDDIAGLVAKFGKHNVVPNPIT 120
           VLDEMLLPGSEFRPN  TADIV NKLL K+N  EG  KED+IAGLVAKFGKHNVVPN IT
Sbjct: 61  VLDEMLLPGSEFRPNDRTADIVFNKLLEKMNGPEGGAKEDEIAGLVAKFGKHNVVPNTIT 120

Query: 121 LTQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMK 180
           LTQLI+KLC +RNTNLAW+VLDNVMMLNG+KDAAPCNALL+GLGK REFEKMNLLM+KMK
Sbjct: 121 LTQLITKLCWNRNTNLAWDVLDNVMMLNGLKDAAPCNALLTGLGKAREFEKMNLLMRKMK 180

Query: 181 DMNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMYNTLIDGLCK 240
           DMNIQP+VITF ILINHLCKFRRI+DAL+VFEK+K EK EAK V PD IMYNTLIDGLCK
Sbjct: 181 DMNIQPNVITFGILINHLCKFRRIDDALEVFEKLKAEKEEAKDVAPDVIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVIT 300
           VGR EEGLRLM MMR D CAPNTVTYNCLIDG+CK+GEIERAHELFN+MVSEQV PN+IT
Sbjct: 241 VGRQEEGLRLMGMMRLDDCAPNTVTYNCLIDGYCKSGEIERAHELFNQMVSEQVVPNIIT 300

Query: 301 LNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           +NTLV+GMC H R+SSAVEFFREMQQKGLKGNSVTYTVF NAFCK NNM+KAMEF DEMS
Sbjct: 301 VNTLVNGMCNHSRISSAVEFFREMQQKGLKGNSVTYTVFINAFCKVNNMNKAMEFWDEMS 360

Query: 361 KDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGCLPD IVYYTLICGLAQAGRLDDA+SVVSKLKEAGFCLDLVCYNILISEFCK+ KLD
Sbjct: 361 KDGCLPDVIVYYTLICGLAQAGRLDDANSVVSKLKEAGFCLDLVCYNILISEFCKKKKLD 420

Query: 421 RANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGAL 480
           RA ELLNEME+VGVKPD VTYNTLIS+FSKIGNFKLAH+FM KMTKE+GL PTVFTYGAL
Sbjct: 421 RAYELLNEMELVGVKPDCVTYNTLISHFSKIGNFKLAHEFMKKMTKEDGLLPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRG 540
           IHAYCLNNNTDEAMKLFKEM+ATSKVPPNTVIYNILIDSLCK NEVNYALSLL+DMKFR 
Sbjct: 481 IHAYCLNNNTDEAMKLFKEMSATSKVPPNTVIYNILIDSLCKRNEVNYALSLLDDMKFRR 540

Query: 541 VKPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLK 600
           VKPNTTTYN IF+ALREKNWLDKAF+LMDRM E+AC+ADYITMEILTEWLSSVGETTKLK
Sbjct: 541 VKPNTTTYNTIFEALREKNWLDKAFKLMDRMVEEACNADYITMEILTEWLSSVGETTKLK 600

Query: 601 KFTQGYSVSDSAA 613
            FTQGYSV DSAA
Sbjct: 601 LFTQGYSVCDSAA 613

BLAST of HG10022627 vs. NCBI nr
Match: XP_022147524.1 (pentatricopeptide repeat-containing protein At5g28460 [Momordica charantia])

HSP 1 Score: 1031.6 bits (2666), Expect = 2.8e-297
Identity = 512/603 (84.91%), Postives = 551/603 (91.38%), Query Frame = 0

Query: 10  VHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALKVLDEMLLPG 69
           V LLRRLGR GM +EALA FS LDSH KNT VRNVIIDLLLK+GRVD+A+KVLDEMLLPG
Sbjct: 162 VLLLRRLGRAGMABEALAVFSELDSHTKNTYVRNVIIDLLLKTGRVDSAMKVLDEMLLPG 221

Query: 70  SEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITLTQLISKLCR 129
           SEFRPN +TADIV  KLLK+N  EGRV+ED+IAGLVAKFGKH+V PN ITLTQLIS+LCR
Sbjct: 222 SEFRPNDITADIVFTKLLKVNGWEGRVREDEIAGLVAKFGKHHVFPNAITLTQLISRLCR 281

Query: 130 SRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDMNIQPDVIT 189
           SRNT+LAWNVLD+VM LNG+KD APCNALL+GLGKEREFEKMNLLM++MKDM+IQP+VIT
Sbjct: 282 SRNTDLAWNVLDDVMKLNGLKDVAPCNALLTGLGKEREFEKMNLLMRRMKDMDIQPNVIT 341

Query: 190 FSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMYNTLIDGLCKVGRHEEGLRL 249
           F ILINHLCKFRRI+DAL+VFE+MK EK EAKVV PDAI YNTLIDGLCKVGR EEG  L
Sbjct: 342 FGILINHLCKFRRIDDALEVFERMKGEK-EAKVVAPDAITYNTLIDGLCKVGRQEEGSSL 401

Query: 250 MEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLNTLVDGMCK 309
           M MMRSDGC PNTVTYNCLIDG+CK+GEIE+AHELFN+M +EQV PNVITLNTLVDGMCK
Sbjct: 402 MRMMRSDGCVPNTVTYNCLIDGYCKSGEIEKAHELFNQMANEQVVPNVITLNTLVDGMCK 461

Query: 310 HDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCLPDAIV 369
           H R+SSAVEF REMQQKGLKGNSVTYTVF NAFCK NN+DKAMEF DEM K GC PD+IV
Sbjct: 462 HGRISSAVEFLREMQQKGLKGNSVTYTVFINAFCKXNNIDKAMEFFDEMFKAGCFPDSIV 521

Query: 370 YYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELLNEME 429
           YYTLICGLAQAGRLDDAS V+SKLKEAGF LDLVC+NILISEFCKRNKLD+ANELLNEME
Sbjct: 522 YYTLICGLAQAGRLDDASFVMSKLKEAGFRLDLVCFNILISEFCKRNKLDKANELLNEME 581

Query: 430 VVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIHAYCLNNNT 489
           +VGVKPDSVTYNTLISYFS+ GNFKLAHKFM +MTK EGL PTVFTYGALIHAYCLNNNT
Sbjct: 582 LVGVKPDSVTYNTLISYFSRTGNFKLAHKFMKRMTKHEGLLPTVFTYGALIHAYCLNNNT 641

Query: 490 DEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVKPNTTTYNA 549
           DEAMKLFKEM+A SKVPPNTVIYNILIDSLCK NEVN ALSL +DMK RGVKPNTTTYNA
Sbjct: 642 DEAMKLFKEMSAASKVPPNTVIYNILIDSLCKKNEVNSALSLFDDMKVRGVKPNTTTYNA 701

Query: 550 IFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKKFTQGYSVSD 609
           +FKAL EKNWLDKAFELMDRM EQAC+ADYITMEILTEWLSSVGETTKLKKFTQGY VS 
Sbjct: 702 VFKALWEKNWLDKAFELMDRMVEQACNADYITMEILTEWLSSVGETTKLKKFTQGYKVST 761

Query: 610 SAA 613
           S A
Sbjct: 762 SPA 763

BLAST of HG10022627 vs. NCBI nr
Match: XP_008447282.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Cucumis melo] >XP_008447283.1 PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Cucumis melo] >KAA0042168.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK26743.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1025.8 bits (2651), Expect = 1.5e-295
Identity = 511/613 (83.36%), Postives = 558/613 (91.03%), Query Frame = 0

Query: 1   MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALK 60
           MS STAQ+ VHLLR LGR GMVDEALAAFSTLDSHAKNTNVRN II+LLLKSGRVDNAL 
Sbjct: 1   MSLSTAQSAVHLLRHLGRVGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITL 120
           VL EMLLP SEFRPN  TA IV NK+LKI+ SEGR KED+IAGLV+KFGK+N+ P+ I L
Sbjct: 61  VLYEMLLPESEFRPNDKTAGIVFNKMLKIDGSEGRAKEDEIAGLVSKFGKYNIFPDTIAL 120

Query: 121 TQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKD 180
           TQLISKLCRS NTNLAWN+LDN+MMLNG+KDAAPCNALL+GLGK REF KMNLLM+KMKD
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNMMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVV-PDAIMYNTLIDGLCK 240
           MNIQP VITF ILIN+LCKFRRI+DAL+VFEKMK EK EA+VVV PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINYLCKFRRIDDALEVFEKMKGEKEEAEVVVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVIT 300
           VGR EEGLRLM  MRS  CAP T TYNCLI+G+C+AGEIE A++LF+EMVSEQ+ PNVIT
Sbjct: 241 VGRQEEGLRLMGTMRSGQCAPTTATYNCLINGYCRAGEIEVANKLFSEMVSEQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AV+FFR+MQQKGLKGN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVKFFRDMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA+VYYTLICGLA+AGRLDD+SSVVSKLKEAGF LD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLARAGRLDDSSSVVSKLKEAGFFLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGAL 480
           RA E LN+ME+ GVKPDSVTYNTLISYFSKIGNFKLAHKFM KMT+E+GLSPTVFTYGAL
Sbjct: 421 RAQEWLNQMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEDGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRG 540
           IHAYCLNNN DEA+KLFKEMN  SKVPPNTVIYNILIDSLCK  EVN+ALSLL+DMKFRG
Sbjct: 481 IHAYCLNNNIDEAIKLFKEMNVASKVPPNTVIYNILIDSLCKQTEVNFALSLLDDMKFRG 540

Query: 541 VKPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLK 600
           V PNTTTYN+IFKALRE NWLDKAF+LMDRM EQAC+ADYITMEILTEWLSSVGETTKLK
Sbjct: 541 VMPNTTTYNSIFKALRENNWLDKAFKLMDRMVEQACNADYITMEILTEWLSSVGETTKLK 600

Query: 601 KFTQGYSVSDSAA 613
           KFTQG  VSDSAA
Sbjct: 601 KFTQGCMVSDSAA 613

BLAST of HG10022627 vs. NCBI nr
Match: XP_011659175.1 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial [Cucumis sativus] >KGN44578.1 hypothetical protein Csa_016553 [Cucumis sativus])

HSP 1 Score: 1020.4 bits (2637), Expect = 6.5e-294
Identity = 509/613 (83.03%), Postives = 552/613 (90.05%), Query Frame = 0

Query: 1   MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALK 60
           MS STAQ+ VHLLRRLGR GMVDEALAAFSTLDSHAKNTNVRN II+LLLKSGRVDNA+ 
Sbjct: 1   MSLSTAQSSVHLLRRLGRIGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNAMN 60

Query: 61  VLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITL 120
           VLDEMLLP SEFRPN  TA IV N LLKI+  EGRVKED+IAGLV+KFGKHN+ P+ I L
Sbjct: 61  VLDEMLLPESEFRPNDKTAGIVFNNLLKIDGLEGRVKEDEIAGLVSKFGKHNIFPDTIAL 120

Query: 121 TQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKD 180
           TQLISKLCRS NTNLAWN+LDN+MMLNG+KDAAPCNALL+GLGK REF KMNLLM+KMKD
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNLMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKV-VVPDAIMYNTLIDGLCK 240
           MNIQP VITF ILINHLCKFRRI+DAL+VFEKMK EK E KV V PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINHLCKFRRIDDALEVFEKMKGEKEETKVFVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVIT 300
           VGR EE L LM  MRSD CAP T T+NCLI+G+C++GEIE AH+LFNEM + Q+ PNVIT
Sbjct: 241 VGRQEEALCLMGKMRSDQCAPTTATFNCLINGYCRSGEIEVAHKLFNEMENAQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AVEFFR MQQKGLKGN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVEFFRVMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA+VYYTLICGLAQAGRLDDASSVVSKLKEAGFCLD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGAL 480
           RA E LNEME+ GVKPDSVTYNTLISYFSKIGNFKLAHKFM KMT+EEGLSPTVFTYGAL
Sbjct: 421 RAQEWLNEMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEEGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMKLFKEM-NATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFR 540
           IHAYCLNNN DEA+K+FKEM N  SKVPPNTVIYNILIDSLCK  +VN+ALSLL+DMKFR
Sbjct: 481 IHAYCLNNNIDEAIKIFKEMNNVASKVPPNTVIYNILIDSLCKQTQVNFALSLLDDMKFR 540

Query: 541 GVKPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKL 600
           GV PNTTTYN+IFKALR+KNWLDKAF+LMDRM EQAC+ DYITMEILTEWLS+VGE TKL
Sbjct: 541 GVMPNTTTYNSIFKALRDKNWLDKAFKLMDRMVEQACNPDYITMEILTEWLSAVGEITKL 600

Query: 601 KKFTQGYSVSDSA 612
           KKFTQG  VSDSA
Sbjct: 601 KKFTQGCMVSDSA 613

BLAST of HG10022627 vs. NCBI nr
Match: XP_023514674.1 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1015.0 bits (2623), Expect = 2.7e-292
Identity = 502/612 (82.03%), Postives = 551/612 (90.03%), Query Frame = 0

Query: 1   MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALK 60
           M++STAQN V+LLRRLGR+GMVD+ALA FS LD H KNT VRNVIIDLLLKSGRVDNAL 
Sbjct: 1   MAHSTAQNAVNLLRRLGRSGMVDKALAVFSELDPHKKNTYVRNVIIDLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITL 120
           VLDEMLLP SEFRPN +TADIV  KLLKIN S+ RVKE++IAGLVAKFGKH+VVPNPITL
Sbjct: 61  VLDEMLLPDSEFRPNDITADIVFTKLLKINGSDWRVKENEIAGLVAKFGKHHVVPNPITL 120

Query: 121 TQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKD 180
           TQLISKLCRSRN +LAW+VLD++M  NG+KDAAPCNALL+GLGKEREFEKMNLLM+KMKD
Sbjct: 121 TQLISKLCRSRNIDLAWSVLDDLMTWNGLKDAAPCNALLTGLGKEREFEKMNLLMRKMKD 180

Query: 181 MNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMYNTLIDGLCKV 240
           MN+QP+VITF ILINH+CKFR I+DAL+VFEKMK EK EAK V PDAI YNTLIDGLCKV
Sbjct: 181 MNVQPNVITFGILINHMCKFRMIDDALEVFEKMKGEKEEAKAVAPDAITYNTLIDGLCKV 240

Query: 241 GRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITL 300
           GR EEGLRL+  MRSDGC PNTVTYNCLIDG+CKAG IE AH L+NEMV+EQV PNVITL
Sbjct: 241 GRQEEGLRLLGTMRSDGCKPNTVTYNCLIDGYCKAGAIEPAHALYNEMVNEQVVPNVITL 300

Query: 301 NTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSK 360
           NTLVDGMCKH R+SSAVEFFREMQQKGLKGNSVTYTVF NAFCK NN+DKA+EF DEMSK
Sbjct: 301 NTLVDGMCKHGRISSAVEFFREMQQKGLKGNSVTYTVFINAFCKVNNIDKAIEFFDEMSK 360

Query: 361 DGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDR 420
            GC PDAIVYYTLICGLAQAGRLDDA+SV S++K AGF LDLVCYN+LISEFCK+NKLD 
Sbjct: 361 AGCFPDAIVYYTLICGLAQAGRLDDATSVASRMKAAGFRLDLVCYNLLISEFCKKNKLDE 420

Query: 421 ANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALI 480
           ANELL+EMEVVG+KPDSVTYNTLISYFSK+GN +LA KFM KMTKE  L P+VFTYGA+I
Sbjct: 421 ANELLDEMEVVGIKPDSVTYNTLISYFSKMGNLELAFKFMEKMTKEACLLPSVFTYGAVI 480

Query: 481 HAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGV 540
           HAYCLNNNTDEAMKLFKEM+ATSKVPPNTVIYNILIDSLCK NEV+ AL+L +DM   GV
Sbjct: 481 HAYCLNNNTDEAMKLFKEMSATSKVPPNTVIYNILIDSLCKRNEVSSALALFDDMNVNGV 540

Query: 541 KPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKK 600
           KPNTTTYNAI K L+EK WL+KA E+MDRM EQAC+ADYITMEILTEWLSSVGET KLK+
Sbjct: 541 KPNTTTYNAILKGLKEKKWLEKALEVMDRMVEQACNADYITMEILTEWLSSVGETAKLKR 600

Query: 601 FTQGYSVSDSAA 613
           FTQGY VSD AA
Sbjct: 601 FTQGYKVSDFAA 612

BLAST of HG10022627 vs. ExPASy Swiss-Prot
Match: Q9M316 (Pentatricopeptide repeat-containing protein At3g61520, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g61520 PE=2 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 5.9e-165
Identity = 298/600 (49.67%), Postives = 414/600 (69.00%), Query Frame = 0

Query: 12  LLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALKVLDEMLLPGSE 71
           L+R  GR GMV++++  +  LDS+ KN+ VRNV++D+LL++G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADIVINKLLKINESEGR-VKEDDIAGLVAKFGKHNVVPNPITLTQLISKLCRS 131
           F PN +TADIV++++ K     GR + E+ I  L+++F  H V PN + LT+ IS LC++
Sbjct: 218 FPPNRITADIVLHEVWK-----GRLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKN 277

Query: 132 RNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDMNIQPDVITF 191
              N AW++L ++M      +A P NALLS LG+  +  +MN L+ KM ++ I+PDV+T 
Sbjct: 278 ARANAAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTL 337

Query: 192 SILINHLCKFRRIEDALKVFEKMKEEK-GEAKVVVPDAIMYNTLIDGLCKVGRHEEGLRL 251
            ILIN LCK RR+++AL+VFEKM+ ++  +  V+  D+I +NTLIDGLCKVGR +E   L
Sbjct: 338 GILINTLCKSRRVDEALEVFEKMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEEL 397

Query: 252 MEMMR-SDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLNTLVDGMC 311
           +  M+  + CAPN VTYNCLIDG+C+AG++E A E+ + M  +++ PNV+T+NT+V GMC
Sbjct: 398 LVRMKLEERCAPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMC 457

Query: 312 KHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCLPDAI 371
           +H  ++ AV FF +M+++G+KGN VTY    +A C  +N++KAM + ++M + GC PDA 
Sbjct: 458 RHHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAK 517

Query: 372 VYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELLNEM 431
           +YY LI GL Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+L +M
Sbjct: 518 IYYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNTEKVYEMLTDM 577

Query: 432 EVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIHAYCLNNN 491
           E  G KPDS+TYNTLIS+F K  +F+   + M +M +E+GL PTV TYGA+I AYC    
Sbjct: 578 EKEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGE 637

Query: 492 TDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVKPNTTTYN 551
            DEA+KLFK+M   SKV PNTVIYNILI++  K      ALSL  +MK + V+PN  TYN
Sbjct: 638 LDEALKLFKDMGLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETYN 697

Query: 552 AIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKKFTQGYSVS 609
           A+FK L EK   +   +LMD M EQ+C+ + ITMEIL E LS   E  KL+KF QGYSV+
Sbjct: 698 ALFKCLNEKTQGETLLKLMDEMVEQSCEPNQITMEILMERLSGSDELVKLRKFMQGYSVA 751

BLAST of HG10022627 vs. ExPASy Swiss-Prot
Match: Q9LKU8 (Pentatricopeptide repeat-containing protein At5g28460 OS=Arabidopsis thaliana OX=3702 GN=At5g28460 PE=2 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 2.3e-164
Identity = 295/599 (49.25%), Postives = 412/599 (68.78%), Query Frame = 0

Query: 12  LLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALKVLDEMLLPGSE 71
           L+R  GR GMV++++  +  LDS+ KN+ VRNV++D+LL++G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITLTQLISKLCRSR 131
           F PN +TADIV++++ K    E  + E+ I  L+++F  H V PN + LT+ IS LC++ 
Sbjct: 218 FPPNRITADIVLHEVWK----ERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 277

Query: 132 NTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDMNIQPDVITFS 191
             N AW++L ++M      +A P NALLS LG+  +  +MN L+ KM ++ I+PDV+T  
Sbjct: 278 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 337

Query: 192 ILINHLCKFRRIEDALKVFEKMKEEK-GEAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLM 251
           ILIN LCK RR+++AL+VFE+M+ ++  +  V+  D+I +NTLIDGLCKVGR +E   L+
Sbjct: 338 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 397

Query: 252 EMMR-SDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLNTLVDGMCK 311
             M+  + C PN VTYNCLIDG+C+AG++E A E+ + M  +++ PNV+T+NT+V GMC+
Sbjct: 398 VRMKLEERCVPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMCR 457

Query: 312 HDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCLPDAIV 371
           H  ++ AV FF +M+++G+KGN VTY    +A C  +N++KAM + ++M + GC PDA +
Sbjct: 458 HHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAKI 517

Query: 372 YYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELLNEME 431
           YY LI GL Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+L +ME
Sbjct: 518 YYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNAEKVYEMLTDME 577

Query: 432 VVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIHAYCLNNNT 491
             G KPDS+TYNTLIS+F K  +F+   + M +M +E+GL PTV TYGA+I AYC     
Sbjct: 578 KEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGEL 637

Query: 492 DEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVKPNTTTYNA 551
           DEA+KLFK+M   SKV PNTVIYNILI++  K      ALSL  +MK + V+PN  TYNA
Sbjct: 638 DEALKLFKDMGLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETYNA 697

Query: 552 IFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKKFTQGYSVS 609
           +FK L EK   +   +LMD M EQ+C+ + ITMEIL E LS   E  KL+KF QGYSV+
Sbjct: 698 LFKCLNEKTQGETLLKLMDEMVEQSCEPNQITMEILMERLSGSDELVKLRKFMQGYSVA 751

BLAST of HG10022627 vs. ExPASy Swiss-Prot
Match: P0C7Q7 (Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12700 PE=3 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 5.2e-68
Identity = 174/596 (29.19%), Postives = 283/596 (47.48%), Query Frame = 0

Query: 11  HLLRRLG-RTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRV----DNALKVLDEM 70
           HLL+    RT ++    + FS+ +    + +  NV     L+SG V    D+A+ +  EM
Sbjct: 20  HLLKTGSLRTDLLCTISSFFSSCERDFSSISNGNVCFRERLRSGIVDIKKDDAIALFQEM 79

Query: 71  LLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGK----HNVVPNPITLT 130
           +      RP        +  L+  +     +       LV  F K    + +  N  TL 
Sbjct: 80  I----RSRP--------LPSLVDFSRFFSAIARTKQFNLVLDFCKQLELNGIAHNIYTLN 139

Query: 131 QLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDM 190
            +I+  CR   T  A++VL  VM L    D    N L+ GL  E +  +  +L+ +M + 
Sbjct: 140 IMINCFCRCCKTCFAYSVLGKVMKLGYEPDTTTFNTLIKGLFLEGKVSEAVVLVDRMVEN 199

Query: 191 NIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMYNTLIDGLCKVG 250
             QPDV+T++ ++N +C+      AL +  KM+E   +A     D   Y+T+ID LC+ G
Sbjct: 200 GCQPDVVTYNSIVNGICRSGDTSLALDLLRKMEERNVKA-----DVFTYSTIIDSLCRDG 259

Query: 251 RHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLN 310
             +  + L + M + G   + VTYN L+ G CKAG+      L  +MVS ++ PNVIT N
Sbjct: 260 CIDAAISLFKEMETKGIKSSVVTYNSLVRGLCKAGKWNDGALLLKDMVSREIVPNVITFN 319

Query: 311 TLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKD 370
            L+D   K  ++  A E ++EM  +G+  N +TY    + +C  N + +A   LD M ++
Sbjct: 320 VLLDVFVKEGKLQEANELYKEMITRGISPNIITYNTLMDGYCMQNRLSEANNMLDLMVRN 379

Query: 371 GCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRA 430
            C PD + + +LI G     R+DD   V   + + G   + V Y+IL+  FC+  K+  A
Sbjct: 380 KCSPDIVTFTSLIKGYCMVKRVDDGMKVFRNISKRGLVANAVTYSILVQGFCQSGKIKLA 439

Query: 431 NELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIH 490
            EL  EM   GV PD +TY  L+      G  + A +    + K + +   +  Y  +I 
Sbjct: 440 EELFQEMVSHGVLPDVMTYGILLDGLCDNGKLEKALEIFEDLQKSK-MDLGIVMYTTIIE 499

Query: 491 AYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVK 550
             C     ++A  LF  +     V PN + Y ++I  LCK   ++ A  LL  M+  G  
Sbjct: 500 GMCKGGKVEDAWNLFCSLPCKG-VKPNVMTYTVMISGLCKKGSLSEANILLRKMEEDGNA 559

Query: 551 PNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTK 598
           PN  TYN + +A      L  + +L++ M+     AD  +++++ + L S GE  K
Sbjct: 560 PNDCTYNTLIRAHLRDGDLTASAKLIEEMKSCGFSADASSIKMVIDMLLS-GELDK 595

BLAST of HG10022627 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 2.0e-67
Identity = 139/460 (30.22%), Postives = 240/460 (52.17%), Query Frame = 0

Query: 111 HNVVPNPITLTQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEK 170
           H  VP+ I  T LI   CR   T  A  +L+ +     + D    N ++SG  K  E   
Sbjct: 131 HGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVPDVITYNVMISGYCKAGE--- 190

Query: 171 MNLLMQKMKDMNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMY 230
           +N  +  +  M++ PDV+T++ ++  LC   +++ A++V ++M +     +   PD I Y
Sbjct: 191 INNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQ-----RDCYPDVITY 250

Query: 231 NTLIDGLCKVGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVS 290
             LI+  C+       ++L++ MR  GC P+ VTYN L++G CK G ++ A +  N+M S
Sbjct: 251 TILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRLDEAIKFLNDMPS 310

Query: 291 EQVAPNVITLNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDK 350
               PNVIT N ++  MC   R   A +   +M +KG   + VT+ +  N  C+   + +
Sbjct: 311 SGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRKGLLGR 370

Query: 351 AMEFLDEMSKDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILIS 410
           A++ L++M + GC P+++ Y  L+ G  +  ++D A   + ++   G   D+V YN +++
Sbjct: 371 AIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGCYPDIVTYNTMLT 430

Query: 411 EFCKRNKLDRANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLS 470
             CK  K++ A E+LN++   G  P  +TYNT+I   +K G    A K +++M + + L 
Sbjct: 431 ALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEM-RAKDLK 490

Query: 471 PTVFTYGALIHAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALS 530
           P   TY +L+         DEA+K F E      + PN V +N ++  LCK+ + + A+ 
Sbjct: 491 PDTITYSSLVGGLSREGKVDEAIKFFHEFERMG-IRPNAVTFNSIMLGLCKSRQTDRAID 550

Query: 531 LLNDMKFRGVKPNTTTYNAIFKALREKNWLDKAFELMDRM 571
            L  M  RG KPN T+Y  + + L  +    +A EL++ +
Sbjct: 551 FLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNEL 580

BLAST of HG10022627 vs. ExPASy Swiss-Prot
Match: Q0WKV3 (Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12300 PE=2 SV=1)

HSP 1 Score: 258.1 bits (658), Expect = 2.6e-67
Identity = 166/549 (30.24%), Postives = 265/549 (48.27%), Query Frame = 0

Query: 45  IIDLLLKSGRVDNALKVLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGL 104
           +   + K+ + D  L +  +M L G     N  T  I+IN   +  +           G 
Sbjct: 94  LFSAIAKTKQYDLVLALCKQMELKG--IAHNLYTLSIMINCFCRCRK---LCLAFSAMGK 153

Query: 105 VAKFGKHNVVPNPITLTQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGL-- 164
           + K G     PN IT + LI+ LC     + A  ++D ++ +    D    N L++GL  
Sbjct: 154 IIKLGYE---PNTITFSTLINGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCL 213

Query: 165 -GKEREFEKMNLLMQKMKDMNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAK 224
            GKE E     LL+ KM +   QP+ +T+  ++N +CK  +   A+++  KM+E      
Sbjct: 214 SGKEAE---AMLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALAMELLRKMEERN---- 273

Query: 225 VVVPDAIMYNTLIDGLCKVGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERA 284
            +  DA+ Y+ +IDGLCK G  +    L   M   G   N +TYN LI GFC AG  +  
Sbjct: 274 -IKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITTNIITYNILIGGFCNAGRWDDG 333

Query: 285 HELFNEMVSEQVAPNVITLNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNA 344
            +L  +M+  ++ PNV+T + L+D   K  ++  A E  +EM  +G+  +++TYT   + 
Sbjct: 334 AKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELHKEMIHRGIAPDTITYTSLIDG 393

Query: 345 FCKANNMDKAMEFLDEMSKDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLD 404
           FCK N++DKA + +D M   GC P+   +  LI G  +A R+DD   +  K+   G   D
Sbjct: 394 FCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVAD 453

Query: 405 LVCYNILISEFCKRNKLDRANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMN 464
            V YN LI  FC+  KL+ A EL  EM    V P+ VTY  L+      G  + A +   
Sbjct: 454 TVTYNTLIQGFCELGKLNVAKELFQEMVSRKVPPNIVTYKILLDGLCDNGESEKALEIFE 513

Query: 465 KMTKEEGLSPTVFTYGALIHAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCK 524
           K+ K + +   +  Y  +IH  C  +  D+A  LF  +     V P    YNI+I  LCK
Sbjct: 514 KIEKSK-MELDIGIYNIIIHGMCNASKVDDAWDLFCSL-PLKGVKPGVKTYNIMIGGLCK 573

Query: 525 NNEVNYALSLLNDMKFRGVKPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYIT 584
              ++ A  L   M+  G  P+  TYN + +A        K+ +L++ ++      D  T
Sbjct: 574 KGPLSEAELLFRKMEEDGHAPDGWTYNILIRAHLGDGDATKSVKLIEELKRCGFSVDAST 624

Query: 585 MEILTEWLS 591
           ++++ + LS
Sbjct: 634 IKMVIDMLS 624

BLAST of HG10022627 vs. ExPASy TrEMBL
Match: A0A6J1D191 (pentatricopeptide repeat-containing protein At5g28460 OS=Momordica charantia OX=3673 GN=LOC111016427 PE=4 SV=1)

HSP 1 Score: 1032.3 bits (2668), Expect = 8.0e-298
Identity = 513/603 (85.07%), Postives = 551/603 (91.38%), Query Frame = 0

Query: 10  VHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALKVLDEMLLPG 69
           V LLRRLGR GM DEALA FS LDSH KNT VRNVIIDLLLK+GRVD+A+KVLDEMLLPG
Sbjct: 162 VLLLRRLGRAGMADEALAVFSELDSHTKNTYVRNVIIDLLLKTGRVDSAMKVLDEMLLPG 221

Query: 70  SEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITLTQLISKLCR 129
           SEFRPN +TADIV  KLLK+N  EGRV+ED+IAGLVAKFGKH+V PN ITLTQLIS+LCR
Sbjct: 222 SEFRPNDITADIVFTKLLKVNGWEGRVREDEIAGLVAKFGKHHVFPNAITLTQLISRLCR 281

Query: 130 SRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDMNIQPDVIT 189
           SRNT+LAWNVLD+VM LNG+KD APCNALL+GLGKEREFEKMNLLM++MKDM+IQP+VIT
Sbjct: 282 SRNTDLAWNVLDDVMKLNGLKDVAPCNALLTGLGKEREFEKMNLLMRRMKDMDIQPNVIT 341

Query: 190 FSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMYNTLIDGLCKVGRHEEGLRL 249
           F ILINHLCKFRRI+DAL+VFE+MK EK EAKVV PDAI YNTLIDGLCKVGR EEG  L
Sbjct: 342 FGILINHLCKFRRIDDALEVFERMKGEK-EAKVVAPDAITYNTLIDGLCKVGRQEEGSSL 401

Query: 250 MEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLNTLVDGMCK 309
           M MMRSDGC PNTVTYNCLIDG+CK+GEIE+AHELFN+M +EQV PNVITLNTLVDGMCK
Sbjct: 402 MRMMRSDGCVPNTVTYNCLIDGYCKSGEIEKAHELFNQMANEQVVPNVITLNTLVDGMCK 461

Query: 310 HDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCLPDAIV 369
           H R+SSAVEF REMQQKGLKGNSVTYTVF NAFCK NN+DKAMEF DEM K GC PD+IV
Sbjct: 462 HGRISSAVEFLREMQQKGLKGNSVTYTVFINAFCKXNNIDKAMEFFDEMFKAGCFPDSIV 521

Query: 370 YYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELLNEME 429
           YYTLICGLAQAGRLDDAS V+SKLKEAGF LDLVC+NILISEFCKRNKLD+ANELLNEME
Sbjct: 522 YYTLICGLAQAGRLDDASFVMSKLKEAGFRLDLVCFNILISEFCKRNKLDKANELLNEME 581

Query: 430 VVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIHAYCLNNNT 489
           +VGVKPDSVTYNTLISYFS+ GNFKLAHKFM +MTK EGL PTVFTYGALIHAYCLNNNT
Sbjct: 582 LVGVKPDSVTYNTLISYFSRTGNFKLAHKFMKRMTKHEGLLPTVFTYGALIHAYCLNNNT 641

Query: 490 DEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVKPNTTTYNA 549
           DEAMKLFKEM+A SKVPPNTVIYNILIDSLCK NEVN ALSL +DMK RGVKPNTTTYNA
Sbjct: 642 DEAMKLFKEMSAASKVPPNTVIYNILIDSLCKKNEVNSALSLFDDMKVRGVKPNTTTYNA 701

Query: 550 IFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKKFTQGYSVSD 609
           +FKAL EKNWLDKAFELMDRM EQAC+ADYITMEILTEWLSSVGETTKLKKFTQGY VS 
Sbjct: 702 VFKALWEKNWLDKAFELMDRMVEQACNADYITMEILTEWLSSVGETTKLKKFTQGYKVST 761

Query: 610 SAA 613
           S A
Sbjct: 762 SPA 763

BLAST of HG10022627 vs. ExPASy TrEMBL
Match: A0A1S3BGI7 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103489753 PE=4 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 7.5e-296
Identity = 511/613 (83.36%), Postives = 558/613 (91.03%), Query Frame = 0

Query: 1   MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALK 60
           MS STAQ+ VHLLR LGR GMVDEALAAFSTLDSHAKNTNVRN II+LLLKSGRVDNAL 
Sbjct: 1   MSLSTAQSAVHLLRHLGRVGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITL 120
           VL EMLLP SEFRPN  TA IV NK+LKI+ SEGR KED+IAGLV+KFGK+N+ P+ I L
Sbjct: 61  VLYEMLLPESEFRPNDKTAGIVFNKMLKIDGSEGRAKEDEIAGLVSKFGKYNIFPDTIAL 120

Query: 121 TQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKD 180
           TQLISKLCRS NTNLAWN+LDN+MMLNG+KDAAPCNALL+GLGK REF KMNLLM+KMKD
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNMMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVV-PDAIMYNTLIDGLCK 240
           MNIQP VITF ILIN+LCKFRRI+DAL+VFEKMK EK EA+VVV PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINYLCKFRRIDDALEVFEKMKGEKEEAEVVVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVIT 300
           VGR EEGLRLM  MRS  CAP T TYNCLI+G+C+AGEIE A++LF+EMVSEQ+ PNVIT
Sbjct: 241 VGRQEEGLRLMGTMRSGQCAPTTATYNCLINGYCRAGEIEVANKLFSEMVSEQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AV+FFR+MQQKGLKGN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVKFFRDMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA+VYYTLICGLA+AGRLDD+SSVVSKLKEAGF LD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLARAGRLDDSSSVVSKLKEAGFFLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGAL 480
           RA E LN+ME+ GVKPDSVTYNTLISYFSKIGNFKLAHKFM KMT+E+GLSPTVFTYGAL
Sbjct: 421 RAQEWLNQMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEDGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRG 540
           IHAYCLNNN DEA+KLFKEMN  SKVPPNTVIYNILIDSLCK  EVN+ALSLL+DMKFRG
Sbjct: 481 IHAYCLNNNIDEAIKLFKEMNVASKVPPNTVIYNILIDSLCKQTEVNFALSLLDDMKFRG 540

Query: 541 VKPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLK 600
           V PNTTTYN+IFKALRE NWLDKAF+LMDRM EQAC+ADYITMEILTEWLSSVGETTKLK
Sbjct: 541 VMPNTTTYNSIFKALRENNWLDKAFKLMDRMVEQACNADYITMEILTEWLSSVGETTKLK 600

Query: 601 KFTQGYSVSDSAA 613
           KFTQG  VSDSAA
Sbjct: 601 KFTQGCMVSDSAA 613

BLAST of HG10022627 vs. ExPASy TrEMBL
Match: A0A5D3DT18 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold124G00300 PE=4 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 7.5e-296
Identity = 511/613 (83.36%), Postives = 558/613 (91.03%), Query Frame = 0

Query: 1   MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALK 60
           MS STAQ+ VHLLR LGR GMVDEALAAFSTLDSHAKNTNVRN II+LLLKSGRVDNAL 
Sbjct: 1   MSLSTAQSAVHLLRHLGRVGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITL 120
           VL EMLLP SEFRPN  TA IV NK+LKI+ SEGR KED+IAGLV+KFGK+N+ P+ I L
Sbjct: 61  VLYEMLLPESEFRPNDKTAGIVFNKMLKIDGSEGRAKEDEIAGLVSKFGKYNIFPDTIAL 120

Query: 121 TQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKD 180
           TQLISKLCRS NTNLAWN+LDN+MMLNG+KDAAPCNALL+GLGK REF KMNLLM+KMKD
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNMMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVV-PDAIMYNTLIDGLCK 240
           MNIQP VITF ILIN+LCKFRRI+DAL+VFEKMK EK EA+VVV PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINYLCKFRRIDDALEVFEKMKGEKEEAEVVVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVIT 300
           VGR EEGLRLM  MRS  CAP T TYNCLI+G+C+AGEIE A++LF+EMVSEQ+ PNVIT
Sbjct: 241 VGRQEEGLRLMGTMRSGQCAPTTATYNCLINGYCRAGEIEVANKLFSEMVSEQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AV+FFR+MQQKGLKGN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVKFFRDMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA+VYYTLICGLA+AGRLDD+SSVVSKLKEAGF LD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLARAGRLDDSSSVVSKLKEAGFFLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGAL 480
           RA E LN+ME+ GVKPDSVTYNTLISYFSKIGNFKLAHKFM KMT+E+GLSPTVFTYGAL
Sbjct: 421 RAQEWLNQMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEDGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRG 540
           IHAYCLNNN DEA+KLFKEMN  SKVPPNTVIYNILIDSLCK  EVN+ALSLL+DMKFRG
Sbjct: 481 IHAYCLNNNIDEAIKLFKEMNVASKVPPNTVIYNILIDSLCKQTEVNFALSLLDDMKFRG 540

Query: 541 VKPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLK 600
           V PNTTTYN+IFKALRE NWLDKAF+LMDRM EQAC+ADYITMEILTEWLSSVGETTKLK
Sbjct: 541 VMPNTTTYNSIFKALRENNWLDKAFKLMDRMVEQACNADYITMEILTEWLSSVGETTKLK 600

Query: 601 KFTQGYSVSDSAA 613
           KFTQG  VSDSAA
Sbjct: 601 KFTQGCMVSDSAA 613

BLAST of HG10022627 vs. ExPASy TrEMBL
Match: A0A0A0K547 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G336570 PE=4 SV=1)

HSP 1 Score: 1020.4 bits (2637), Expect = 3.2e-294
Identity = 509/613 (83.03%), Postives = 552/613 (90.05%), Query Frame = 0

Query: 1   MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALK 60
           MS STAQ+ VHLLRRLGR GMVDEALAAFSTLDSHAKNTNVRN II+LLLKSGRVDNA+ 
Sbjct: 1   MSLSTAQSSVHLLRRLGRIGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNAMN 60

Query: 61  VLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITL 120
           VLDEMLLP SEFRPN  TA IV N LLKI+  EGRVKED+IAGLV+KFGKHN+ P+ I L
Sbjct: 61  VLDEMLLPESEFRPNDKTAGIVFNNLLKIDGLEGRVKEDEIAGLVSKFGKHNIFPDTIAL 120

Query: 121 TQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKD 180
           TQLISKLCRS NTNLAWN+LDN+MMLNG+KDAAPCNALL+GLGK REF KMNLLM+KMKD
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNLMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKV-VVPDAIMYNTLIDGLCK 240
           MNIQP VITF ILINHLCKFRRI+DAL+VFEKMK EK E KV V PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINHLCKFRRIDDALEVFEKMKGEKEETKVFVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVIT 300
           VGR EE L LM  MRSD CAP T T+NCLI+G+C++GEIE AH+LFNEM + Q+ PNVIT
Sbjct: 241 VGRQEEALCLMGKMRSDQCAPTTATFNCLINGYCRSGEIEVAHKLFNEMENAQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AVEFFR MQQKGLKGN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVEFFRVMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA+VYYTLICGLAQAGRLDDASSVVSKLKEAGFCLD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGAL 480
           RA E LNEME+ GVKPDSVTYNTLISYFSKIGNFKLAHKFM KMT+EEGLSPTVFTYGAL
Sbjct: 421 RAQEWLNEMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEEGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMKLFKEM-NATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFR 540
           IHAYCLNNN DEA+K+FKEM N  SKVPPNTVIYNILIDSLCK  +VN+ALSLL+DMKFR
Sbjct: 481 IHAYCLNNNIDEAIKIFKEMNNVASKVPPNTVIYNILIDSLCKQTQVNFALSLLDDMKFR 540

Query: 541 GVKPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKL 600
           GV PNTTTYN+IFKALR+KNWLDKAF+LMDRM EQAC+ DYITMEILTEWLS+VGE TKL
Sbjct: 541 GVMPNTTTYNSIFKALRDKNWLDKAFKLMDRMVEQACNPDYITMEILTEWLSAVGEITKL 600

Query: 601 KKFTQGYSVSDSA 612
           KKFTQG  VSDSA
Sbjct: 601 KKFTQGCMVSDSA 613

BLAST of HG10022627 vs. ExPASy TrEMBL
Match: A0A6J1H7V3 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111460949 PE=4 SV=1)

HSP 1 Score: 1009.2 bits (2608), Expect = 7.3e-291
Identity = 499/612 (81.54%), Postives = 548/612 (89.54%), Query Frame = 0

Query: 1   MSYSTAQNVVHLLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALK 60
           M++ST QN V+LLRRLGR+GMVDEALA F+ LD H KNT VRNVIIDLLLKSGRVDNAL 
Sbjct: 1   MAHSTTQNAVNLLRRLGRSGMVDEALAVFTELDPHKKNTYVRNVIIDLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITL 120
           VLDEMLLPGSEFRPN +TADIV  KLLKIN S+ RVKE++IAGLVAKFGKH+VVPNPITL
Sbjct: 61  VLDEMLLPGSEFRPNDITADIVFTKLLKINGSDWRVKENEIAGLVAKFGKHHVVPNPITL 120

Query: 121 TQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKD 180
           TQLISKLCRSRN +LAW+VLD++M  NG+KDAAPCNALL+GLGKEREFEKMNLLM+KMKD
Sbjct: 121 TQLISKLCRSRNIDLAWSVLDDLMTWNGLKDAAPCNALLTGLGKEREFEKMNLLMRKMKD 180

Query: 181 MNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMYNTLIDGLCKV 240
           MN+QP+VITF ILINH+CKFR I+DAL+VFEKMK EK EAK V PDAI YNTLIDGLCKV
Sbjct: 181 MNVQPNVITFGILINHMCKFRMIDDALEVFEKMKGEKEEAKAVAPDAITYNTLIDGLCKV 240

Query: 241 GRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITL 300
           GR EEGLRL+  MRSDGC PNTVTYNCLIDG+CKAG IE AH L++EMV+EQV PNVITL
Sbjct: 241 GRQEEGLRLLGTMRSDGCKPNTVTYNCLIDGYCKAGAIETAHALYDEMVNEQVVPNVITL 300

Query: 301 NTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSK 360
           NTLVDGMCKH R+SSAVEFFREMQQKGLKGNSVTYTVF NAFCK NN+DKA+EF DEMSK
Sbjct: 301 NTLVDGMCKHGRISSAVEFFREMQQKGLKGNSVTYTVFINAFCKVNNIDKAIEFFDEMSK 360

Query: 361 DGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDR 420
            GC PDAIVYYTLICGLAQAGRLDDA+SV  ++K AGF LDLVCYN+LISEFCK+NKLD 
Sbjct: 361 AGCFPDAIVYYTLICGLAQAGRLDDATSVALRMKAAGFRLDLVCYNLLISEFCKKNKLDE 420

Query: 421 ANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALI 480
           ANELL+EMEVVG+KPDSVTYNTLISYFSK+GN +LA KFM KMTKE  L P+VFTYGA+I
Sbjct: 421 ANELLDEMEVVGIKPDSVTYNTLISYFSKMGNLELALKFMEKMTKEACLLPSVFTYGAVI 480

Query: 481 HAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGV 540
           HAYCL NNTDEAMKLFKEM ATSKVPPNTVIYNILIDSLCK NEV+ AL+L +DM   GV
Sbjct: 481 HAYCLTNNTDEAMKLFKEMTATSKVPPNTVIYNILIDSLCKRNEVSSALALFDDMNVNGV 540

Query: 541 KPNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKK 600
           KPNTTTYNAI K L+EK WL+KA E+MDRM EQAC+ADYITMEILTEWLSSVGET KLK+
Sbjct: 541 KPNTTTYNAILKGLKEKKWLEKALEVMDRMVEQACNADYITMEILTEWLSSVGETAKLKR 600

Query: 601 FTQGYSVSDSAA 613
           FTQGY VSD AA
Sbjct: 601 FTQGYKVSDFAA 612

BLAST of HG10022627 vs. TAIR 10
Match: AT3G61520.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 582.4 bits (1500), Expect = 4.2e-166
Identity = 298/600 (49.67%), Postives = 414/600 (69.00%), Query Frame = 0

Query: 12  LLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALKVLDEMLLPGSE 71
           L+R  GR GMV++++  +  LDS+ KN+ VRNV++D+LL++G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADIVINKLLKINESEGR-VKEDDIAGLVAKFGKHNVVPNPITLTQLISKLCRS 131
           F PN +TADIV++++ K     GR + E+ I  L+++F  H V PN + LT+ IS LC++
Sbjct: 218 FPPNRITADIVLHEVWK-----GRLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKN 277

Query: 132 RNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDMNIQPDVITF 191
              N AW++L ++M      +A P NALLS LG+  +  +MN L+ KM ++ I+PDV+T 
Sbjct: 278 ARANAAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTL 337

Query: 192 SILINHLCKFRRIEDALKVFEKMKEEK-GEAKVVVPDAIMYNTLIDGLCKVGRHEEGLRL 251
            ILIN LCK RR+++AL+VFEKM+ ++  +  V+  D+I +NTLIDGLCKVGR +E   L
Sbjct: 338 GILINTLCKSRRVDEALEVFEKMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEEL 397

Query: 252 MEMMR-SDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLNTLVDGMC 311
           +  M+  + CAPN VTYNCLIDG+C+AG++E A E+ + M  +++ PNV+T+NT+V GMC
Sbjct: 398 LVRMKLEERCAPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMC 457

Query: 312 KHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCLPDAI 371
           +H  ++ AV FF +M+++G+KGN VTY    +A C  +N++KAM + ++M + GC PDA 
Sbjct: 458 RHHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAK 517

Query: 372 VYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELLNEM 431
           +YY LI GL Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+L +M
Sbjct: 518 IYYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNTEKVYEMLTDM 577

Query: 432 EVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIHAYCLNNN 491
           E  G KPDS+TYNTLIS+F K  +F+   + M +M +E+GL PTV TYGA+I AYC    
Sbjct: 578 EKEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGE 637

Query: 492 TDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVKPNTTTYN 551
            DEA+KLFK+M   SKV PNTVIYNILI++  K      ALSL  +MK + V+PN  TYN
Sbjct: 638 LDEALKLFKDMGLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETYN 697

Query: 552 AIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKKFTQGYSVS 609
           A+FK L EK   +   +LMD M EQ+C+ + ITMEIL E LS   E  KL+KF QGYSV+
Sbjct: 698 ALFKCLNEKTQGETLLKLMDEMVEQSCEPNQITMEILMERLSGSDELVKLRKFMQGYSVA 751

BLAST of HG10022627 vs. TAIR 10
Match: AT5G28460.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 580.5 bits (1495), Expect = 1.6e-165
Identity = 295/599 (49.25%), Postives = 412/599 (68.78%), Query Frame = 0

Query: 12  LLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALKVLDEMLLPGSE 71
           L+R  GR GMV++++  +  LDS+ KN+ VRNV++D+LL++G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITLTQLISKLCRSR 131
           F PN +TADIV++++ K    E  + E+ I  L+++F  H V PN + LT+ IS LC++ 
Sbjct: 218 FPPNRITADIVLHEVWK----ERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 277

Query: 132 NTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDMNIQPDVITFS 191
             N AW++L ++M      +A P NALLS LG+  +  +MN L+ KM ++ I+PDV+T  
Sbjct: 278 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 337

Query: 192 ILINHLCKFRRIEDALKVFEKMKEEK-GEAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLM 251
           ILIN LCK RR+++AL+VFE+M+ ++  +  V+  D+I +NTLIDGLCKVGR +E   L+
Sbjct: 338 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 397

Query: 252 EMMR-SDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLNTLVDGMCK 311
             M+  + C PN VTYNCLIDG+C+AG++E A E+ + M  +++ PNV+T+NT+V GMC+
Sbjct: 398 VRMKLEERCVPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMCR 457

Query: 312 HDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCLPDAIV 371
           H  ++ AV FF +M+++G+KGN VTY    +A C  +N++KAM + ++M + GC PDA +
Sbjct: 458 HHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAKI 517

Query: 372 YYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELLNEME 431
           YY LI GL Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+L +ME
Sbjct: 518 YYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNAEKVYEMLTDME 577

Query: 432 VVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIHAYCLNNNT 491
             G KPDS+TYNTLIS+F K  +F+   + M +M +E+GL PTV TYGA+I AYC     
Sbjct: 578 KEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGEL 637

Query: 492 DEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVKPNTTTYNA 551
           DEA+KLFK+M   SKV PNTVIYNILI++  K      ALSL  +MK + V+PN  TYNA
Sbjct: 638 DEALKLFKDMGLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETYNA 697

Query: 552 IFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKKFTQGYSVS 609
           +FK L EK   +   +LMD M EQ+C+ + ITMEIL E LS   E  KL+KF QGYSV+
Sbjct: 698 LFKCLNEKTQGETLLKLMDEMVEQSCEPNQITMEILMERLSGSDELVKLRKFMQGYSVA 751

BLAST of HG10022627 vs. TAIR 10
Match: AT5G28370.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 543.1 bits (1398), Expect = 2.8e-154
Identity = 275/579 (47.50%), Postives = 393/579 (67.88%), Query Frame = 0

Query: 12  LLRRLGRTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRVDNALKVLDEMLLPGSE 71
           L+R  GR GMV++++  +  LDS+ KN+ VRNV++D+LL++G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGKHNVVPNPITLTQLISKLCRSR 131
           F PN +TADIV++++ K    E  + E+ I  L+++F  H V PN + LT+ IS LC++ 
Sbjct: 218 FPPNRITADIVLHEVWK----ERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 277

Query: 132 NTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDMNIQPDVITFS 191
             N AW++L ++M      +A P NALLS LG+  +  +MN L+ KM ++ I+PDV+T  
Sbjct: 278 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 337

Query: 192 ILINHLCKFRRIEDALKVFEKMKEEK-GEAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLM 251
           ILIN LCK RR+++AL+VFE+M+ ++  +  V+  D+I +NTLIDGLCKVGR +E   L+
Sbjct: 338 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 397

Query: 252 EMMR-SDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLNTLVDGMCK 311
             M+  + C PN VTYNCLIDG+C+AG++E A E+ + M  +++ PNV+T+NT+V GMC+
Sbjct: 398 VRMKLEERCVPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMCR 457

Query: 312 HDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCLPDAIV 371
           H  ++ AV FF +M+++G+KGN VTY    +A C  +N++KAM + ++M + GC PDA +
Sbjct: 458 HHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAKI 517

Query: 372 YYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELLNEME 431
           YY LI GL Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+L +ME
Sbjct: 518 YYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNAEKVYEMLTDME 577

Query: 432 VVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIHAYCLNNNT 491
             G KPDS+TYNTLIS+F K  +F+   + M +M +E+GL PTV TYGA+I AYC     
Sbjct: 578 KEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGEL 637

Query: 492 DEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVKPNTTTYNA 551
           DEA+KLFK+M   SKV PNTVIYNILI++  K      ALSL  +MK + V+PN  TYNA
Sbjct: 638 DEALKLFKDMGLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETYNA 697

Query: 552 IFKALREKNWLDKAFELMDRMREQACDADYITMEILTEW 589
           +FK L EK   +   +LMD M       +++  +I ++W
Sbjct: 698 LFKCLNEKTQGETLLKLMDEM------VEHLVNQIRSQW 725

BLAST of HG10022627 vs. TAIR 10
Match: AT1G12700.1 (ATP binding;nucleic acid binding;helicases )

HSP 1 Score: 261.5 bits (667), Expect = 1.6e-69
Identity = 175/607 (28.83%), Postives = 287/607 (47.28%), Query Frame = 0

Query: 11  HLLRRLG-RTGMVDEALAAFSTLDSHAKNTNVRNVIIDLLLKSGRV----DNALKVLDEM 70
           HLL+    RT ++    + FS+ +    + +  NV     L+SG V    D+A+ +  EM
Sbjct: 20  HLLKTGSLRTDLLCTISSFFSSCERDFSSISNGNVCFRERLRSGIVDIKKDDAIALFQEM 79

Query: 71  LLPGSEFRPNYVTADIVINKLLKINESEGRVKEDDIAGLVAKFGK----HNVVPNPITLT 130
           +      RP        +  L+  +     +       LV  F K    + +  N  TL 
Sbjct: 80  I----RSRP--------LPSLVDFSRFFSAIARTKQFNLVLDFCKQLELNGIAHNIYTLN 139

Query: 131 QLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEKMNLLMQKMKDM 190
            +I+  CR   T  A++VL  VM L    D    N L+ GL  E +  +  +L+ +M + 
Sbjct: 140 IMINCFCRCCKTCFAYSVLGKVMKLGYEPDTTTFNTLIKGLFLEGKVSEAVVLVDRMVEN 199

Query: 191 NIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMYNTLIDGLCKVG 250
             QPDV+T++ ++N +C+      AL +  KM+E   +A     D   Y+T+ID LC+ G
Sbjct: 200 GCQPDVVTYNSIVNGICRSGDTSLALDLLRKMEERNVKA-----DVFTYSTIIDSLCRDG 259

Query: 251 RHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVSEQVAPNVITLN 310
             +  + L + M + G   + VTYN L+ G CKAG+      L  +MVS ++ PNVIT N
Sbjct: 260 CIDAAISLFKEMETKGIKSSVVTYNSLVRGLCKAGKWNDGALLLKDMVSREIVPNVITFN 319

Query: 311 TLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKD 370
            L+D   K  ++  A E ++EM  +G+  N +TY    + +C  N + +A   LD M ++
Sbjct: 320 VLLDVFVKEGKLQEANELYKEMITRGISPNIITYNTLMDGYCMQNRLSEANNMLDLMVRN 379

Query: 371 GCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRA 430
            C PD + + +LI G     R+DD   V   + + G   + V Y+IL+  FC+  K+  A
Sbjct: 380 KCSPDIVTFTSLIKGYCMVKRVDDGMKVFRNISKRGLVANAVTYSILVQGFCQSGKIKLA 439

Query: 431 NELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLSPTVFTYGALIH 490
            EL  EM   GV PD +TY  L+      G  + A +    + K + +   +  Y  +I 
Sbjct: 440 EELFQEMVSHGVLPDVMTYGILLDGLCDNGKLEKALEIFEDLQKSK-MDLGIVMYTTIIE 499

Query: 491 AYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALSLLNDMKFRGVK 550
             C     ++A  LF  +     V PN + Y ++I  LCK   ++ A  LL  M+  G  
Sbjct: 500 GMCKGGKVEDAWNLFCSLPCKG-VKPNVMTYTVMISGLCKKGSLSEANILLRKMEEDGNA 559

Query: 551 PNTTTYNAIFKALREKNWLDKAFELMDRMREQACDADYITMEILTEWLSSVGETTKLKKF 609
           PN  TYN + +A      L  + +L++ M+     AD  +++++ + L S      +K+ 
Sbjct: 560 PNDCTYNTLIRAHLRDGDLTASAKLIEEMKSCGFSADASSIKMVIDMLLSA-----MKRL 602

BLAST of HG10022627 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 258.5 bits (659), Expect = 1.4e-68
Identity = 139/460 (30.22%), Postives = 240/460 (52.17%), Query Frame = 0

Query: 111 HNVVPNPITLTQLISKLCRSRNTNLAWNVLDNVMMLNGIKDAAPCNALLSGLGKEREFEK 170
           H  VP+ I  T LI   CR   T  A  +L+ +     + D    N ++SG  K  E   
Sbjct: 131 HGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVPDVITYNVMISGYCKAGE--- 190

Query: 171 MNLLMQKMKDMNIQPDVITFSILINHLCKFRRIEDALKVFEKMKEEKGEAKVVVPDAIMY 230
           +N  +  +  M++ PDV+T++ ++  LC   +++ A++V ++M +     +   PD I Y
Sbjct: 191 INNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQ-----RDCYPDVITY 250

Query: 231 NTLIDGLCKVGRHEEGLRLMEMMRSDGCAPNTVTYNCLIDGFCKAGEIERAHELFNEMVS 290
             LI+  C+       ++L++ MR  GC P+ VTYN L++G CK G ++ A +  N+M S
Sbjct: 251 TILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRLDEAIKFLNDMPS 310

Query: 291 EQVAPNVITLNTLVDGMCKHDRVSSAVEFFREMQQKGLKGNSVTYTVFTNAFCKANNMDK 350
               PNVIT N ++  MC   R   A +   +M +KG   + VT+ +  N  C+   + +
Sbjct: 311 SGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRKGLLGR 370

Query: 351 AMEFLDEMSKDGCLPDAIVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILIS 410
           A++ L++M + GC P+++ Y  L+ G  +  ++D A   + ++   G   D+V YN +++
Sbjct: 371 AIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGCYPDIVTYNTMLT 430

Query: 411 EFCKRNKLDRANELLNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHKFMNKMTKEEGLS 470
             CK  K++ A E+LN++   G  P  +TYNT+I   +K G    A K +++M + + L 
Sbjct: 431 ALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEM-RAKDLK 490

Query: 471 PTVFTYGALIHAYCLNNNTDEAMKLFKEMNATSKVPPNTVIYNILIDSLCKNNEVNYALS 530
           P   TY +L+         DEA+K F E      + PN V +N ++  LCK+ + + A+ 
Sbjct: 491 PDTITYSSLVGGLSREGKVDEAIKFFHEFERMG-IRPNAVTFNSIMLGLCKSRQTDRAID 550

Query: 531 LLNDMKFRGVKPNTTTYNAIFKALREKNWLDKAFELMDRM 571
            L  M  RG KPN T+Y  + + L  +    +A EL++ +
Sbjct: 551 FLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNEL 580

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038898803.17.1e-30987.28pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benin... [more]
XP_022147524.12.8e-29784.91pentatricopeptide repeat-containing protein At5g28460 [Momordica charantia][more]
XP_008447282.11.5e-29583.36PREDICTED: pentatricopeptide repeat-containing protein At3g61520, mitochondrial-... [more]
XP_011659175.16.5e-29483.03pentatricopeptide repeat-containing protein At3g61520, mitochondrial [Cucumis sa... [more]
XP_023514674.12.7e-29282.03pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Cucur... [more]
Match NameE-valueIdentityDescription
Q9M3165.9e-16549.67Pentatricopeptide repeat-containing protein At3g61520, mitochondrial OS=Arabidop... [more]
Q9LKU82.3e-16449.25Pentatricopeptide repeat-containing protein At5g28460 OS=Arabidopsis thaliana OX... [more]
P0C7Q75.2e-6829.19Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS... [more]
Q3EDF82.0e-6730.22Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX... [more]
Q0WKV32.6e-6730.24Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1D1918.0e-29885.07pentatricopeptide repeat-containing protein At5g28460 OS=Momordica charantia OX=... [more]
A0A1S3BGI77.5e-29683.36pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like OS=Cuc... [more]
A0A5D3DT187.5e-29683.36Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A0A0K5473.2e-29483.03Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G336570 PE=4 SV=1[more]
A0A6J1H7V37.3e-29181.54pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT3G61520.14.2e-16649.67Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G28460.11.6e-16549.25Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G28370.12.8e-15447.50Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G12700.11.6e-6928.83ATP binding;nucleic acid binding;helicases [more]
AT1G09900.11.4e-6830.22Pentatricopeptide repeat (PPR-like) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 98..220
e-value: 8.8E-23
score: 83.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 396..484
e-value: 1.0E-26
score: 95.4
coord: 485..606
e-value: 3.4E-30
score: 106.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 334..395
e-value: 2.3E-16
score: 61.8
coord: 2..97
e-value: 9.9E-8
score: 33.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 221..326
e-value: 2.9E-36
score: 127.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 381..539
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 34..357
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 397..429
e-value: 7.7E-11
score: 41.5
coord: 326..359
e-value: 2.7E-9
score: 36.6
coord: 223..254
e-value: 4.1E-9
score: 36.0
coord: 361..392
e-value: 2.3E-7
score: 30.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 43..66
e-value: 0.0053
score: 16.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 188..217
e-value: 2.6E-7
score: 28.4
coord: 403..437
e-value: 2.0E-7
score: 28.8
coord: 298..331
e-value: 2.1E-7
score: 28.7
coord: 510..544
e-value: 1.1E-9
score: 35.9
coord: 369..401
e-value: 6.0E-6
score: 24.1
coord: 333..367
e-value: 1.3E-9
score: 35.6
coord: 155..187
e-value: 3.3E-6
score: 24.9
coord: 546..575
e-value: 3.1E-4
score: 18.7
coord: 474..508
e-value: 1.4E-7
score: 29.2
coord: 228..262
e-value: 2.8E-11
score: 40.9
coord: 263..297
e-value: 1.3E-11
score: 41.9
coord: 438..472
e-value: 3.3E-7
score: 28.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 435..484
e-value: 2.6E-14
score: 53.1
coord: 260..309
e-value: 6.8E-19
score: 67.8
coord: 507..554
e-value: 7.6E-17
score: 61.3
coord: 155..199
e-value: 7.4E-12
score: 45.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 296..330
score: 11.662881
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 472..506
score: 11.476539
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 38..72
score: 8.659485
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 543..577
score: 9.876189
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 508..542
score: 12.232868
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 366..400
score: 9.985802
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 186..220
score: 11.257313
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 226..260
score: 13.822257
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 331..365
score: 12.726127
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 436..471
score: 10.599635
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 401..435
score: 12.145178
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 261..295
score: 14.052444
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 151..185
score: 9.799459
NoneNo IPR availablePANTHERPTHR45613PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 222..603
coord: 109..465
coord: 12..222

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022627.1HG10022627.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding