HG10022628 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022628
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr05: 26427935 .. 26429788 (+)
RNA-Seq ExpressionHG10022628
SyntenyHG10022628
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTTACTCAACATCCCAGAAGGTTGTTCATCTGCTTCGGCGCCTTGAAAGGACTGGCATGGTTGATGAGGCGCTTACTGCGTTTTCTACACTCGATTCACATGCGAAAAACACAAAAGTTCGTAATGTGATCATTGATTTGCTTCTGAAATTTGGGCGAGTTGACAATGCATTGAAAGTGCTCGACGAAATGCTTCTTCCAGGTTCGAAGTTTCGACCCAATTATGTTACTGCTGATGTCGTTTTCAATAAATTGTTGAAGATAAATGAGTCGGAGGGGAGAGTCAAGGAAGATGATATTGCCGGGTTGGTAGCCAAATTTTGTAAACATAACGTTGTTCCACATCCCTTTACATTGACGCAGTTGATCTGCAAGCTTTGTAGTAATAGAAATACAAATCTTGCATGGAATGTTTTAGATAATGTGATGATGTTGAATGGTCTTAGGCATGCTGCACCTTGCAATGCGCTTTTGTCAGGATTGGAAAAGGAGAGGAAGTTTGAGAAGATGAATCTGTTGTTGCGAAAGATGACAGACATGAATATTCAGCCTAATGCTTTTACTTTTCAAATTCTTATCAACCATTTGTGCAAGTTAAGAAGGATTGAAGATGCAGTAAAAGTGTTTATGAAAATGAAAGGGGAAAAGGAGCAGGCCAAGGTTGTTGTACCTGATGCAATCATGTACAATACTTTGATTGATGGGCTCTGTAAAGTGGGGAGGCATGAAGAAGGATTGCGCTTGATGAGAATGATGAGATCAGATGACTGTGCACCCAATACTGTTACTTACAATTGTTTGATTGATGGGTTCTGCAAGGTAGGTGAAGTTGAGAGAGCTCATGAGCTCTTCAATGAGATGGTAAGTGAACAAGTTGTACCGAACGTAATTACCCTCAATACTTTAGTCGATGGAATGTGTAAGCATGACAGAGTAAGCAGCGCAGTTGACTTCTTCAGGGAAATGCAGCAGAAGGGCTTGAGAGGAAATTCTGTTACTTACACGGTGTTCACTAATGCTTTTTGCAAAGCCAACAATATGGACAAGGCAATGGAATTTTTGGATGAAATGTCCAAAGATGGATGTGTGCCTGATGCCAATATTTATTATACTTTGATACGTGATTTAGCCCAAGCTGGAAGGTTGGATGATGCCAGCTCTGTTGTGTCAAAGTTGAAAGAGGCAGGGTTCTGTCTAGATCTTGTTTGCTACAATATTCTTATCAGTGAGTTCTGTAAGAGGAACAAATTAGATAGAGCTAATGAATTGTTTAATGAAATGGAGGTGGTTGGAGTTAAGCCTGACAGTGTCACATACAACACTTTGATTTCCTACTTCAGTAAAATTGGGAATTTCAAACTAGCTCACGAATTTATGAATAAGATGACCAAGGAGGAAGGTCTTTTACCCTCTGTCTTCACTTATGGAGCTCTTATTCATGCATATTGCTTGAACAACAACACTGATGAAGCCATGGAGCTTTTCAAGGAAATGAGTGCTACATCATCAAAGGTTCCTCCTGACACAGTAATATACAATATATTGATAGATTCTTTATGCGAAAATAATGAGGTCAATTACGCTCTTTCTCTATTGGATGATATGGAATTTAGAGGGGTGAGGCCAAACACCACAACGTACAATGCTATTTTCAAAGCCCTTAGGGAGAAGAATTGGTTGCACAAAGCATTCGAACTTATGGATAGAATGGGCGAGCAGGCGTGTCATGCCGATGATATAACAATGAAGATTTTAACCGAATGGCTTTCTTCAGATGGTGAAACTACACAATTGGAGGAGTGTACGCTGGGATACAGTGTTTCTGATTCTGTCTATAGTCTGCAAGCATGA

mRNA sequence

ATGCCTTACTCAACATCCCAGAAGGTTGTTCATCTGCTTCGGCGCCTTGAAAGGACTGGCATGGTTGATGAGGCGCTTACTGCGTTTTCTACACTCGATTCACATGCGAAAAACACAAAAGTTCGTAATGTGATCATTGATTTGCTTCTGAAATTTGGGCGAGTTGACAATGCATTGAAAGTGCTCGACGAAATGCTTCTTCCAGGTTCGAAGTTTCGACCCAATTATGTTACTGCTGATGTCGTTTTCAATAAATTGTTGAAGATAAATGAGTCGGAGGGGAGAGTCAAGGAAGATGATATTGCCGGGTTGGTAGCCAAATTTTGTAAACATAACGTTGTTCCACATCCCTTTACATTGACGCAGTTGATCTGCAAGCTTTGTAGTAATAGAAATACAAATCTTGCATGGAATGTTTTAGATAATGTGATGATGTTGAATGGTCTTAGGCATGCTGCACCTTGCAATGCGCTTTTGTCAGGATTGGAAAAGGAGAGGAAGTTTGAGAAGATGAATCTGTTGTTGCGAAAGATGACAGACATGAATATTCAGCCTAATGCTTTTACTTTTCAAATTCTTATCAACCATTTGTGCAAGTTAAGAAGGATTGAAGATGCAGTAAAAGTGTTTATGAAAATGAAAGGGGAAAAGGAGCAGGCCAAGGTTGTTGTACCTGATGCAATCATGTACAATACTTTGATTGATGGGCTCTGTAAAGTGGGGAGGCATGAAGAAGGATTGCGCTTGATGAGAATGATGAGATCAGATGACTGTGCACCCAATACTGTTACTTACAATTGTTTGATTGATGGGTTCTGCAAGGTAGGTGAAGTTGAGAGAGCTCATGAGCTCTTCAATGAGATGGTAAGTGAACAAGTTGTACCGAACGTAATTACCCTCAATACTTTAGTCGATGGAATGTGTAAGCATGACAGAGTAAGCAGCGCAGTTGACTTCTTCAGGGAAATGCAGCAGAAGGGCTTGAGAGGAAATTCTGTTACTTACACGGTGTTCACTAATGCTTTTTGCAAAGCCAACAATATGGACAAGGCAATGGAATTTTTGGATGAAATGTCCAAAGATGGATGTGTGCCTGATGCCAATATTTATTATACTTTGATACGTGATTTAGCCCAAGCTGGAAGGTTGGATGATGCCAGCTCTGTTGTGTCAAAGTTGAAAGAGGCAGGGTTCTGTCTAGATCTTGTTTGCTACAATATTCTTATCAGTGAGTTCTGTAAGAGGAACAAATTAGATAGAGCTAATGAATTGTTTAATGAAATGGAGGTGGTTGGAGTTAAGCCTGACAGTGTCACATACAACACTTTGATTTCCTACTTCAGTAAAATTGGGAATTTCAAACTAGCTCACGAATTTATGAATAAGATGACCAAGGAGGAAGGTCTTTTACCCTCTGTCTTCACTTATGGAGCTCTTATTCATGCATATTGCTTGAACAACAACACTGATGAAGCCATGGAGCTTTTCAAGGAAATGAGTGCTACATCATCAAAGGTTCCTCCTGACACAGTAATATACAATATATTGATAGATTCTTTATGCGAAAATAATGAGGTCAATTACGCTCTTTCTCTATTGGATGATATGGAATTTAGAGGGGTGAGGCCAAACACCACAACGTACAATGCTATTTTCAAAGCCCTTAGGGAGAAGAATTGGTTGCACAAAGCATTCGAACTTATGGATAGAATGGGCGAGCAGGCGTGTCATGCCGATGATATAACAATGAAGATTTTAACCGAATGGCTTTCTTCAGATGGTGAAACTACACAATTGGAGGAGTGTACGCTGGGATACAGTGTTTCTGATTCTGTCTATAGTCTGCAAGCATGA

Coding sequence (CDS)

ATGCCTTACTCAACATCCCAGAAGGTTGTTCATCTGCTTCGGCGCCTTGAAAGGACTGGCATGGTTGATGAGGCGCTTACTGCGTTTTCTACACTCGATTCACATGCGAAAAACACAAAAGTTCGTAATGTGATCATTGATTTGCTTCTGAAATTTGGGCGAGTTGACAATGCATTGAAAGTGCTCGACGAAATGCTTCTTCCAGGTTCGAAGTTTCGACCCAATTATGTTACTGCTGATGTCGTTTTCAATAAATTGTTGAAGATAAATGAGTCGGAGGGGAGAGTCAAGGAAGATGATATTGCCGGGTTGGTAGCCAAATTTTGTAAACATAACGTTGTTCCACATCCCTTTACATTGACGCAGTTGATCTGCAAGCTTTGTAGTAATAGAAATACAAATCTTGCATGGAATGTTTTAGATAATGTGATGATGTTGAATGGTCTTAGGCATGCTGCACCTTGCAATGCGCTTTTGTCAGGATTGGAAAAGGAGAGGAAGTTTGAGAAGATGAATCTGTTGTTGCGAAAGATGACAGACATGAATATTCAGCCTAATGCTTTTACTTTTCAAATTCTTATCAACCATTTGTGCAAGTTAAGAAGGATTGAAGATGCAGTAAAAGTGTTTATGAAAATGAAAGGGGAAAAGGAGCAGGCCAAGGTTGTTGTACCTGATGCAATCATGTACAATACTTTGATTGATGGGCTCTGTAAAGTGGGGAGGCATGAAGAAGGATTGCGCTTGATGAGAATGATGAGATCAGATGACTGTGCACCCAATACTGTTACTTACAATTGTTTGATTGATGGGTTCTGCAAGGTAGGTGAAGTTGAGAGAGCTCATGAGCTCTTCAATGAGATGGTAAGTGAACAAGTTGTACCGAACGTAATTACCCTCAATACTTTAGTCGATGGAATGTGTAAGCATGACAGAGTAAGCAGCGCAGTTGACTTCTTCAGGGAAATGCAGCAGAAGGGCTTGAGAGGAAATTCTGTTACTTACACGGTGTTCACTAATGCTTTTTGCAAAGCCAACAATATGGACAAGGCAATGGAATTTTTGGATGAAATGTCCAAAGATGGATGTGTGCCTGATGCCAATATTTATTATACTTTGATACGTGATTTAGCCCAAGCTGGAAGGTTGGATGATGCCAGCTCTGTTGTGTCAAAGTTGAAAGAGGCAGGGTTCTGTCTAGATCTTGTTTGCTACAATATTCTTATCAGTGAGTTCTGTAAGAGGAACAAATTAGATAGAGCTAATGAATTGTTTAATGAAATGGAGGTGGTTGGAGTTAAGCCTGACAGTGTCACATACAACACTTTGATTTCCTACTTCAGTAAAATTGGGAATTTCAAACTAGCTCACGAATTTATGAATAAGATGACCAAGGAGGAAGGTCTTTTACCCTCTGTCTTCACTTATGGAGCTCTTATTCATGCATATTGCTTGAACAACAACACTGATGAAGCCATGGAGCTTTTCAAGGAAATGAGTGCTACATCATCAAAGGTTCCTCCTGACACAGTAATATACAATATATTGATAGATTCTTTATGCGAAAATAATGAGGTCAATTACGCTCTTTCTCTATTGGATGATATGGAATTTAGAGGGGTGAGGCCAAACACCACAACGTACAATGCTATTTTCAAAGCCCTTAGGGAGAAGAATTGGTTGCACAAAGCATTCGAACTTATGGATAGAATGGGCGAGCAGGCGTGTCATGCCGATGATATAACAATGAAGATTTTAACCGAATGGCTTTCTTCAGATGGTGAAACTACACAATTGGAGGAGTGTACGCTGGGATACAGTGTTTCTGATTCTGTCTATAGTCTGCAAGCATGA

Protein sequence

MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALKVLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTLTQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLEECTLGYSVSDSVYSLQA
Homology
BLAST of HG10022628 vs. NCBI nr
Match: XP_038898803.1 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benincasa hispida] >XP_038898805.1 pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benincasa hispida] >XP_038898806.1 pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benincasa hispida] >XP_038898807.1 pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benincasa hispida])

HSP 1 Score: 1000.0 bits (2584), Expect = 9.2e-288
Identity = 501/612 (81.86%), Postives = 547/612 (89.38%), Query Frame = 0

Query: 1   MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALK 60
           M   T+Q  VHLLRRL RTGMVDEAL AFSTLDSHAKNTKVRNVII LLLK GRVDNAL 
Sbjct: 1   MSLLTAQSAVHLLRRLGRTGMVDEALAAFSTLDSHAKNTKVRNVIIALLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSKFRPNYVTADVVFNKLL-KINESEGRVKEDDIAGLVAKFCKHNVVPHPFT 120
           VLDEMLLPGS+FRPN  TAD+VFNKLL K+N  EG  KED+IAGLVAKF KHNVVP+  T
Sbjct: 61  VLDEMLLPGSEFRPNDRTADIVFNKLLEKMNGPEGGAKEDEIAGLVAKFGKHNVVPNTIT 120

Query: 121 LTQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMT 180
           LTQLI KLC NRNTNLAW+VLDNVMMLNGL+ AAPCNALL+GL K R+FEKMNLL+RKM 
Sbjct: 121 LTQLITKLCWNRNTNLAWDVLDNVMMLNGLKDAAPCNALLTGLGKAREFEKMNLLMRKMK 180

Query: 181 DMNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCK 240
           DMNIQPN  TF ILINHLCK RRI+DA++VF K+K EKE+AK V PD IMYNTLIDGLCK
Sbjct: 181 DMNIQPNVITFGILINHLCKFRRIDDALEVFEKLKAEKEEAKDVAPDVIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVIT 300
           VGR EEGLRLM MMR DDCAPNTVTYNCLIDG+CK GE+ERAHELFN+MVSEQVVPN+IT
Sbjct: 241 VGRQEEGLRLMGMMRLDDCAPNTVTYNCLIDGYCKSGEIERAHELFNQMVSEQVVPNIIT 300

Query: 301 LNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           +NTLV+GMC H R+SSAV+FFREMQQKGL+GNSVTYTVF NAFCK NNM+KAMEF DEMS
Sbjct: 301 VNTLVNGMCNHSRISSAVEFFREMQQKGLKGNSVTYTVFINAFCKVNNMNKAMEFWDEMS 360

Query: 361 KDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC+PD  +YYTLI  LAQAGRLDDA+SVVSKLKEAGFCLDLVCYNILISEFCK+ KLD
Sbjct: 361 KDGCLPDVIVYYTLICGLAQAGRLDDANSVVSKLKEAGFCLDLVCYNILISEFCKKKKLD 420

Query: 421 RANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGAL 480
           RA EL NEME+VGVKPD VTYNTLIS+FSKIGNFKLAHEFM KMTKE+GLLP+VFTYGAL
Sbjct: 421 RAYELLNEMELVGVKPDCVTYNTLISHFSKIGNFKLAHEFMKKMTKEDGLLPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFR 540
           IHAYCLNNNTDEAM+LFKEMSAT SKVPP+TVIYNILIDSLC+ NEVNYALSLLDDM+FR
Sbjct: 481 IHAYCLNNNTDEAMKLFKEMSAT-SKVPPNTVIYNILIDSLCKRNEVNYALSLLDDMKFR 540

Query: 541 GVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQL 600
            V+PNTTTYN IF+ALREKNWL KAF+LMDRM E+AC+AD ITM+ILTEWLSS GETT+L
Sbjct: 541 RVKPNTTTYNTIFEALREKNWLDKAFKLMDRMVEEACNADYITMEILTEWLSSVGETTKL 600

Query: 601 EECTLGYSVSDS 612
           +  T GYSV DS
Sbjct: 601 KLFTQGYSVCDS 611

BLAST of HG10022628 vs. NCBI nr
Match: XP_022147524.1 (pentatricopeptide repeat-containing protein At5g28460 [Momordica charantia])

HSP 1 Score: 954.9 bits (2467), Expect = 3.4e-274
Identity = 476/606 (78.55%), Postives = 532/606 (87.79%), Query Frame = 0

Query: 6   SQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALKVLDEM 65
           S   V LLRRL R GM +EAL  FS LDSH KNT VRNVIIDLLLK GRVD+A+KVLDEM
Sbjct: 158 SPAAVLLLRRLGRAGMABEALAVFSELDSHTKNTYVRNVIIDLLLKTGRVDSAMKVLDEM 217

Query: 66  LLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTLTQLIC 125
           LLPGS+FRPN +TAD+VF KLLK+N  EGRV+ED+IAGLVAKF KH+V P+  TLTQLI 
Sbjct: 218 LLPGSEFRPNDITADIVFTKLLKVNGWEGRVREDEIAGLVAKFGKHHVFPNAITLTQLIS 277

Query: 126 KLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQP 185
           +LC +RNT+LAWNVLD+VM LNGL+  APCNALL+GL KER+FEKMNLL+R+M DM+IQP
Sbjct: 278 RLCRSRNTDLAWNVLDDVMKLNGLKDVAPCNALLTGLGKEREFEKMNLLMRRMKDMDIQP 337

Query: 186 NAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCKVGRHEE 245
           N  TF ILINHLCK RRI+DA++VF +MKGEKE AKVV PDAI YNTLIDGLCKVGR EE
Sbjct: 338 NVITFGILINHLCKFRRIDDALEVFERMKGEKE-AKVVAPDAITYNTLIDGLCKVGRQEE 397

Query: 246 GLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLVD 305
           G  LMRMMRSD C PNTVTYNCLIDG+CK GE+E+AHELFN+M +EQVVPNVITLNTLVD
Sbjct: 398 GSSLMRMMRSDGCVPNTVTYNCLIDGYCKSGEIEKAHELFNQMANEQVVPNVITLNTLVD 457

Query: 306 GMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCVP 365
           GMCKH R+SSAV+F REMQQKGL+GNSVTYTVF NAFCK NN+DKAMEF DEM K GC P
Sbjct: 458 GMCKHGRISSAVEFLREMQQKGLKGNSVTYTVFINAFCKXNNIDKAMEFFDEMFKAGCFP 517

Query: 366 DANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELF 425
           D+ +YYTLI  LAQAGRLDDAS V+SKLKEAGF LDLVC+NILISEFCKRNKLD+ANEL 
Sbjct: 518 DSIVYYTLICGLAQAGRLDDASFVMSKLKEAGFRLDLVCFNILISEFCKRNKLDKANELL 577

Query: 426 NEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYCL 485
           NEME+VGVKPDSVTYNTLISYFS+ GNFKLAH+FM +MTK EGLLP+VFTYGALIHAYCL
Sbjct: 578 NEMELVGVKPDSVTYNTLISYFSRTGNFKLAHKFMKRMTKHEGLLPTVFTYGALIHAYCL 637

Query: 486 NNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPNT 545
           NNNTDEAM+LFKEMSA +SKVPP+TVIYNILIDSLC+ NEVN ALSL DDM+ RGV+PNT
Sbjct: 638 NNNTDEAMKLFKEMSA-ASKVPPNTVIYNILIDSLCKKNEVNSALSLFDDMKVRGVKPNT 697

Query: 546 TTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLEECTLG 605
           TTYNA+FKAL EKNWL KAFELMDRM EQAC+AD ITM+ILTEWLSS GETT+L++ T G
Sbjct: 698 TTYNAVFKALWEKNWLDKAFELMDRMVEQACNADYITMEILTEWLSSVGETTKLKKFTQG 757

Query: 606 YSVSDS 612
           Y VS S
Sbjct: 758 YKVSTS 761

BLAST of HG10022628 vs. NCBI nr
Match: XP_011659175.1 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial [Cucumis sativus] >KGN44578.1 hypothetical protein Csa_016553 [Cucumis sativus])

HSP 1 Score: 938.7 bits (2425), Expect = 2.5e-269
Identity = 465/612 (75.98%), Postives = 526/612 (85.95%), Query Frame = 0

Query: 1   MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALK 60
           M  ST+Q  VHLLRRL R GMVDEAL AFSTLDSHAKNT VRN II+LLLK GRVDNA+ 
Sbjct: 1   MSLSTAQSSVHLLRRLGRIGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNAMN 60

Query: 61  VLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTL 120
           VLDEMLLP S+FRPN  TA +VFN LLKI+  EGRVKED+IAGLV+KF KHN+ P    L
Sbjct: 61  VLDEMLLPESEFRPNDKTAGIVFNNLLKIDGLEGRVKEDEIAGLVSKFGKHNIFPDTIAL 120

Query: 121 TQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTD 180
           TQLI KLC + NTNLAWN+LDN+MMLNGL+ AAPCNALL+GL K R+F KMNLL+RKM D
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNLMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKV-VVPDAIMYNTLIDGLCK 240
           MNIQP   TF ILINHLCK RRI+DA++VF KMKGEKE+ KV V PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINHLCKFRRIDDALEVFEKMKGEKEETKVFVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVIT 300
           VGR EE L LM  MRSD CAP T T+NCLI+G+C+ GE+E AH+LFNEM + Q+ PNVIT
Sbjct: 241 VGRQEEALCLMGKMRSDQCAPTTATFNCLINGYCRSGEIEVAHKLFNEMENAQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AV+FFR MQQKGL+GN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVEFFRVMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA +YYTLI  LAQAGRLDDASSVVSKLKEAGFCLD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGAL 480
           RA E  NEME+ GVKPDSVTYNTLISYFSKIGNFKLAH+FM KMT+EEGL P+VFTYGAL
Sbjct: 421 RAQEWLNEMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEEGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFR 540
           IHAYCLNNN DEA+++FKEM+  +SKVPP+TVIYNILIDSLC+  +VN+ALSLLDDM+FR
Sbjct: 481 IHAYCLNNNIDEAIKIFKEMNNVASKVPPNTVIYNILIDSLCKQTQVNFALSLLDDMKFR 540

Query: 541 GVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQL 600
           GV PNTTTYN+IFKALR+KNWL KAF+LMDRM EQAC+ D ITM+ILTEWLS+ GE T+L
Sbjct: 541 GVMPNTTTYNSIFKALRDKNWLDKAFKLMDRMVEQACNPDYITMEILTEWLSAVGEITKL 600

Query: 601 EECTLGYSVSDS 612
           ++ T G  VSDS
Sbjct: 601 KKFTQGCMVSDS 612

BLAST of HG10022628 vs. NCBI nr
Match: KAG7025273.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 937.2 bits (2421), Expect = 7.3e-269
Identity = 464/610 (76.07%), Postives = 527/610 (86.39%), Query Frame = 0

Query: 1   MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALK 60
           M +ST+Q  V+LLRRL R+GMVDEAL  F+ LD H KNT VRNVIIDLLLK GRVDNAL 
Sbjct: 1   MAHSTAQNAVNLLRRLGRSGMVDEALAVFTELDPHKKNTYVRNVIIDLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTL 120
           VLDEMLLPGS+FRPN +TAD+VF KLLKIN S+ RVKE++IAGLVAKF KH+VVP+P TL
Sbjct: 61  VLDEMLLPGSEFRPNDITADIVFTKLLKINGSDWRVKENEIAGLVAKFGKHHVVPNPITL 120

Query: 121 TQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTD 180
           TQLI KLC +RN +LAW+VLD++M  NGL+ AAPCNALL+GL KER+FEKMNLL+RKM D
Sbjct: 121 TQLISKLCRSRNIDLAWSVLDDLMTRNGLKDAAPCNALLTGLGKEREFEKMNLLMRKMKD 180

Query: 181 MNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCKV 240
           MN+QPN  TF ILINH+CK R I+DA++VF KMKGEKE+AK V PDAI YNTLIDGLCKV
Sbjct: 181 MNVQPNVITFGILINHMCKSRMIDDALEVFEKMKGEKEEAKAVAPDAITYNTLIDGLCKV 240

Query: 241 GRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITL 300
           GR EEGLRL+  MRSD C PNTVTYNCLIDG+CK G +E AH L++EMV+EQVVPNVITL
Sbjct: 241 GRQEEGLRLLGTMRSDGCKPNTVTYNCLIDGYCKAGAIETAHALYDEMVNEQVVPNVITL 300

Query: 301 NTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSK 360
           NTLVDGMCKH R+SSAV+FFREMQQKGL+GNSVTYTVF NAFCK NN+DKA+EF DEMSK
Sbjct: 301 NTLVDGMCKHGRISSAVEFFREMQQKGLKGNSVTYTVFINAFCKVNNIDKAIEFFDEMSK 360

Query: 361 DGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDR 420
            GC PDA +YYTLI  LAQAGRLDDA+SV S++K AGF LDLVCYN+LISEFCK+NKLD 
Sbjct: 361 AGCFPDAIVYYTLICGLAQAGRLDDATSVASRMKAAGFRLDLVCYNLLISEFCKKNKLDE 420

Query: 421 ANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALI 480
           ANEL +EMEVVG+KPDSVTYNTLISYFSK+GN +LA +FM KMTKE  LLPSVFTYGA+I
Sbjct: 421 ANELLDEMEVVGIKPDSVTYNTLISYFSKMGNLELALKFMEKMTKEACLLPSVFTYGAVI 480

Query: 481 HAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRG 540
           HAYCLNNNTDEAM+LFKEMSAT SKVPP+TVIYNILIDSLC+ NEV+ AL+L DDM   G
Sbjct: 481 HAYCLNNNTDEAMKLFKEMSAT-SKVPPNTVIYNILIDSLCKRNEVSSALALFDDMNVNG 540

Query: 541 VRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLE 600
           V+PNTTTYNAI K L+EK WL KA E+MDRM EQAC+AD ITM+ILTEWLSS GET +L+
Sbjct: 541 VKPNTTTYNAILKGLKEKKWLEKALEVMDRMVEQACNADYITMEILTEWLSSVGETAKLK 600

Query: 601 ECTLGYSVSD 611
             T GY VSD
Sbjct: 601 RFTQGYKVSD 609

BLAST of HG10022628 vs. NCBI nr
Match: XP_023514674.1 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 937.2 bits (2421), Expect = 7.3e-269
Identity = 464/610 (76.07%), Postives = 526/610 (86.23%), Query Frame = 0

Query: 1   MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALK 60
           M +ST+Q  V+LLRRL R+GMVD+AL  FS LD H KNT VRNVIIDLLLK GRVDNAL 
Sbjct: 1   MAHSTAQNAVNLLRRLGRSGMVDKALAVFSELDPHKKNTYVRNVIIDLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTL 120
           VLDEMLLP S+FRPN +TAD+VF KLLKIN S+ RVKE++IAGLVAKF KH+VVP+P TL
Sbjct: 61  VLDEMLLPDSEFRPNDITADIVFTKLLKINGSDWRVKENEIAGLVAKFGKHHVVPNPITL 120

Query: 121 TQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTD 180
           TQLI KLC +RN +LAW+VLD++M  NGL+ AAPCNALL+GL KER+FEKMNLL+RKM D
Sbjct: 121 TQLISKLCRSRNIDLAWSVLDDLMTWNGLKDAAPCNALLTGLGKEREFEKMNLLMRKMKD 180

Query: 181 MNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCKV 240
           MN+QPN  TF ILINH+CK R I+DA++VF KMKGEKE+AK V PDAI YNTLIDGLCKV
Sbjct: 181 MNVQPNVITFGILINHMCKFRMIDDALEVFEKMKGEKEEAKAVAPDAITYNTLIDGLCKV 240

Query: 241 GRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITL 300
           GR EEGLRL+  MRSD C PNTVTYNCLIDG+CK G +E AH L+NEMV+EQVVPNVITL
Sbjct: 241 GRQEEGLRLLGTMRSDGCKPNTVTYNCLIDGYCKAGAIEPAHALYNEMVNEQVVPNVITL 300

Query: 301 NTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSK 360
           NTLVDGMCKH R+SSAV+FFREMQQKGL+GNSVTYTVF NAFCK NN+DKA+EF DEMSK
Sbjct: 301 NTLVDGMCKHGRISSAVEFFREMQQKGLKGNSVTYTVFINAFCKVNNIDKAIEFFDEMSK 360

Query: 361 DGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDR 420
            GC PDA +YYTLI  LAQAGRLDDA+SV S++K AGF LDLVCYN+LISEFCK+NKLD 
Sbjct: 361 AGCFPDAIVYYTLICGLAQAGRLDDATSVASRMKAAGFRLDLVCYNLLISEFCKKNKLDE 420

Query: 421 ANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALI 480
           ANEL +EMEVVG+KPDSVTYNTLISYFSK+GN +LA +FM KMTKE  LLPSVFTYGA+I
Sbjct: 421 ANELLDEMEVVGIKPDSVTYNTLISYFSKMGNLELAFKFMEKMTKEACLLPSVFTYGAVI 480

Query: 481 HAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRG 540
           HAYCLNNNTDEAM+LFKEMSAT SKVPP+TVIYNILIDSLC+ NEV+ AL+L DDM   G
Sbjct: 481 HAYCLNNNTDEAMKLFKEMSAT-SKVPPNTVIYNILIDSLCKRNEVSSALALFDDMNVNG 540

Query: 541 VRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLE 600
           V+PNTTTYNAI K L+EK WL KA E+MDRM EQAC+AD ITM+ILTEWLSS GET +L+
Sbjct: 541 VKPNTTTYNAILKGLKEKKWLEKALEVMDRMVEQACNADYITMEILTEWLSSVGETAKLK 600

Query: 601 ECTLGYSVSD 611
             T GY VSD
Sbjct: 601 RFTQGYKVSD 609

BLAST of HG10022628 vs. ExPASy Swiss-Prot
Match: Q9M316 (Pentatricopeptide repeat-containing protein At3g61520, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g61520 PE=2 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 3.2e-150
Identity = 280/601 (46.59%), Postives = 396/601 (65.89%), Query Frame = 0

Query: 12  LLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALKVLDEMLLPGSK 71
           L+R   R GMV++++  +  LDS+ KN++VRNV++D+LL+ G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADVVFNKLLKINESEGR-VKEDDIAGLVAKFCKHNVVPHPFTLTQLICKLCSN 131
           F PN +TAD+V +++ K     GR + E+ I  L+++F  H V P+   LT+ I  LC N
Sbjct: 218 FPPNRITADIVLHEVWK-----GRLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKN 277

Query: 132 RNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQPNAFTF 191
              N AW++L ++M       A P NALLS L +     +MN L+ KM ++ I+P+  T 
Sbjct: 278 ARANAAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTL 337

Query: 192 QILINHLCKLRRIEDAVKVFMKMKGEK-EQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRL 251
            ILIN LCK RR+++A++VF KM+G++ +   V+  D+I +NTLIDGLCKVGR +E   L
Sbjct: 338 GILINTLCKSRRVDEALEVFEKMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEEL 397

Query: 252 M-RMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLVDGMC 311
           + RM   + CAPN VTYNCLIDG+C+ G++E A E+ + M  +++ PNV+T+NT+V GMC
Sbjct: 398 LVRMKLEERCAPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMC 457

Query: 312 KHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCVPDAN 371
           +H  ++ AV FF +M+++G++GN VTY    +A C  +N++KAM + ++M + GC PDA 
Sbjct: 458 RHHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAK 517

Query: 372 IYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELFNEM 431
           IYY LI  L Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+  +M
Sbjct: 518 IYYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNTEKVYEMLTDM 577

Query: 432 EVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNN 491
           E  G KPDS+TYNTLIS+F K  +F+     M +M +E+GL P+V TYGA+I AYC    
Sbjct: 578 EKEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGE 637

Query: 492 TDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPNTTTY 551
            DEA++LFK+M    SKV P+TVIYNILI++  +      ALSL ++M+ + VRPN  TY
Sbjct: 638 LDEALKLFKDM-GLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETY 697

Query: 552 NAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLEECTLGYSV 610
           NA+FK L EK       +LMD M EQ+C  + ITM+IL E LS   E  +L +   GYSV
Sbjct: 698 NALFKCLNEKTQGETLLKLMDEMVEQSCEPNQITMEILMERLSGSDELVKLRKFMQGYSV 751

BLAST of HG10022628 vs. ExPASy Swiss-Prot
Match: Q9LKU8 (Pentatricopeptide repeat-containing protein At5g28460 OS=Arabidopsis thaliana OX=3702 GN=At5g28460 PE=2 SV=1)

HSP 1 Score: 531.6 bits (1368), Expect = 1.2e-149
Identity = 277/600 (46.17%), Postives = 394/600 (65.67%), Query Frame = 0

Query: 12  LLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALKVLDEMLLPGSK 71
           L+R   R GMV++++  +  LDS+ KN++VRNV++D+LL+ G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTLTQLICKLCSNR 131
           F PN +TAD+V +++ K    E  + E+ I  L+++F  H V P+   LT+ I  LC N 
Sbjct: 218 FPPNRITADIVLHEVWK----ERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 277

Query: 132 NTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQPNAFTFQ 191
             N AW++L ++M       A P NALLS L +     +MN L+ KM ++ I+P+  T  
Sbjct: 278 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 337

Query: 192 ILINHLCKLRRIEDAVKVFMKMKGEK-EQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLM 251
           ILIN LCK RR+++A++VF +M+G++ +   V+  D+I +NTLIDGLCKVGR +E   L+
Sbjct: 338 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 397

Query: 252 -RMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLVDGMCK 311
            RM   + C PN VTYNCLIDG+C+ G++E A E+ + M  +++ PNV+T+NT+V GMC+
Sbjct: 398 VRMKLEERCVPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMCR 457

Query: 312 HDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCVPDANI 371
           H  ++ AV FF +M+++G++GN VTY    +A C  +N++KAM + ++M + GC PDA I
Sbjct: 458 HHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAKI 517

Query: 372 YYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELFNEME 431
           YY LI  L Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+  +ME
Sbjct: 518 YYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNAEKVYEMLTDME 577

Query: 432 VVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNNT 491
             G KPDS+TYNTLIS+F K  +F+     M +M +E+GL P+V TYGA+I AYC     
Sbjct: 578 KEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGEL 637

Query: 492 DEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPNTTTYN 551
           DEA++LFK+M    SKV P+TVIYNILI++  +      ALSL ++M+ + VRPN  TYN
Sbjct: 638 DEALKLFKDM-GLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETYN 697

Query: 552 AIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLEECTLGYSVS 610
           A+FK L EK       +LMD M EQ+C  + ITM+IL E LS   E  +L +   GYSV+
Sbjct: 698 ALFKCLNEKTQGETLLKLMDEMVEQSCEPNQITMEILMERLSGSDELVKLRKFMQGYSVA 751

BLAST of HG10022628 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 6.8e-68
Identity = 169/591 (28.60%), Postives = 278/591 (47.04%), Query Frame = 0

Query: 12  LLRRLERTGMVDEALTAFSTLDSH--AKNTKVRNVIIDLLLKFGRVDNALKVLDEM---- 71
           +L RL R+G  D+       + S      T    ++I+   +F   D  L V+D M    
Sbjct: 89  ILLRLGRSGSFDDMKKILEDMKSSRCEMGTSTFLILIESYAQFELQDEILSVVDWMIDEF 148

Query: 72  -LLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTLTQLI 131
            L P + F          +N++L +      +K  +I+   AK     + P   T   LI
Sbjct: 149 GLKPDTHF----------YNRMLNLLVDGNSLKLVEISH--AKMSVWGIKPDVSTFNVLI 208

Query: 132 CKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQ 191
             LC       A  +L+++     +        ++ G  +E   +    +  +M +    
Sbjct: 209 KALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCS 268

Query: 192 PNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCKVGRHE 251
            +  +  ++++  CK  R+EDA+    +M  +        PD   +NTL++GLCK G  +
Sbjct: 269 WSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDG----FFPDQYTFNTLVNGLCKAGHVK 328

Query: 252 EGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLV 311
             + +M +M  +   P+  TYN +I G CK+GEV+ A E+ ++M++    PN +T NTL+
Sbjct: 329 HAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLI 388

Query: 312 DGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCV 371
             +CK ++V  A +  R +  KG+  +  T+       C   N   AME  +EM   GC 
Sbjct: 389 STLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCE 448

Query: 372 PDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANEL 431
           PD   Y  LI  L   G+LD+A +++ +++ +G    ++ YN LI  FCK NK   A E+
Sbjct: 449 PDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEI 508

Query: 432 FNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYC 491
           F+EMEV GV  +SVTYNTLI    K    + A + M++M   EG  P  +TY +L+  +C
Sbjct: 509 FDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIM-EGQKPDKYTYNSLLTHFC 568

Query: 492 LNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPN 551
              +  +A ++ + M  TS+   PD V Y  LI  LC+   V  A  LL  ++ +G+   
Sbjct: 569 RGGDIKKAADIVQAM--TSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLT 628

Query: 552 TTTYNAIFKALREKNWLHKAFELMDRMGEQ-ACHADDITMKILTEWLSSDG 595
              YN + + L  K    +A  L   M EQ     D ++ +I+   L + G
Sbjct: 629 PHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGG 660

BLAST of HG10022628 vs. ExPASy Swiss-Prot
Match: Q0WKV3 (Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12300 PE=2 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 2.4e-65
Identity = 166/587 (28.28%), Postives = 290/587 (49.40%), Query Frame = 0

Query: 40  KVRNVIIDLLLKFGRVDNALKVLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKED 99
           ++R+ ++D+     + D+A+ +  +M+   S+  P  +    +F+ + K  + +      
Sbjct: 59  RLRSGLVDI-----KADDAIDLFRDMI--HSRPLPTVIDFSRLFSAIAKTKQYD------ 118

Query: 100 DIAGLVAKFCKH----NVVPHPFTLTQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPC 159
               LV   CK      +  + +TL+ +I   C  R   LA++ +  ++ L    +    
Sbjct: 119 ----LVLALCKQMELKGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITF 178

Query: 160 NALLSGLEKERKFEKMNLLLRKMTDMNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKG 219
           + L++GL  E +  +   L+ +M +M  +P+  T   L+N LC   +  +A+ +  KM  
Sbjct: 179 STLINGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVE 238

Query: 220 EKEQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKV 279
              Q     P+A+ Y  +++ +CK G+    + L+R M   +   + V Y+ +IDG CK 
Sbjct: 239 YGCQ-----PNAVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKH 298

Query: 280 GEVERAHELFNEMVSEQVVPNVITLNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTY 339
           G ++ A  LFNEM  + +  N+IT N L+ G C   R        R+M ++ +  N VT+
Sbjct: 299 GSLDNAFNLFNEMEMKGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTF 358

Query: 340 TVFTNAFCKANNMDKAMEFLDEMSKDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKE 399
           +V  ++F K   + +A E   EM   G  PD   Y +LI    +   LD A+ +V  +  
Sbjct: 359 SVLIDSFVKEGKLREAEELHKEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVDLMVS 418

Query: 400 AGFCLDLVCYNILISEFCKRNKLDRANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKL 459
            G   ++  +NILI+ +CK N++D   ELF +M + GV  D+VTYNTLI  F ++G   +
Sbjct: 419 KGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLNV 478

Query: 460 AHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNI 519
           A E   +M   + + P++ TY  L+   C N  +++A+E+F+++    SK+  D  IYNI
Sbjct: 479 AKELFQEMVSRK-VPPNIVTYKILLDGLCDNGESEKALEIFEKIE--KSKMELDIGIYNI 538

Query: 520 LIDSLCENNEVNYALSLLDDMEFRGVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQA 579
           +I  +C  ++V+ A  L   +  +GV+P   TYN +   L +K  L +A  L  +M E  
Sbjct: 539 IIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKMEEDG 598

Query: 580 CHADDITMKILTEWLSSDGETT-------QLEECTLGYSVSDSVYSL 616
              D  T  IL      DG+ T       +L+ C  G+SV  S   +
Sbjct: 599 HAPDGWTYNILIRAHLGDGDATKSVKLIEELKRC--GFSVDASTIKM 618

BLAST of HG10022628 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 3.2e-65
Identity = 137/434 (31.57%), Postives = 231/434 (53.23%), Query Frame = 0

Query: 156 NALLSG-LEKERKFEKMNLLLRKMTDMNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMK 215
           NA+L   +  +R       + ++M +  + PN FT+ ILI   C    I+ A+ +F KM 
Sbjct: 173 NAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKM- 232

Query: 216 GEKEQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCK 275
               + K  +P+ + YNTLIDG CK+ + ++G +L+R M      PN ++YN +I+G C+
Sbjct: 233 ----ETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCR 292

Query: 276 VGEVERAHELFNEMVSEQVVPNVITLNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVT 335
            G ++    +  EM       + +T NTL+ G CK      A+    EM + GL  + +T
Sbjct: 293 EGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVIT 352

Query: 336 YTVFTNAFCKANNMDKAMEFLDEMSKDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLK 395
           YT   ++ CKA NM++AMEFLD+M   G  P+   Y TL+   +Q G +++A  V+ ++ 
Sbjct: 353 YTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMN 412

Query: 396 EAGFCLDLVCYNILISEFCKRNKLDRANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFK 455
           + GF   +V YN LI+  C   K++ A  +  +M+  G+ PD V+Y+T++S F +  +  
Sbjct: 413 DNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVD 472

Query: 456 LAHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYN 515
            A     +M  E+G+ P   TY +LI  +C    T EA +L++EM      +PPD   Y 
Sbjct: 473 EALRVKREMV-EKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVG--LPPDEFTYT 532

Query: 516 ILIDSLCENNEVNYALSLLDDMEFRGVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQ 575
            LI++ C   ++  AL L ++M  +GV P+  TY+ +   L +++   +A  L+ ++  +
Sbjct: 533 ALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYE 592

Query: 576 ACHADDITMKILTE 589
                D+T   L E
Sbjct: 593 ESVPSDVTYHTLIE 598

BLAST of HG10022628 vs. ExPASy TrEMBL
Match: A0A6J1D191 (pentatricopeptide repeat-containing protein At5g28460 OS=Momordica charantia OX=3673 GN=LOC111016427 PE=4 SV=1)

HSP 1 Score: 955.7 bits (2469), Expect = 9.6e-275
Identity = 477/606 (78.71%), Postives = 532/606 (87.79%), Query Frame = 0

Query: 6   SQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALKVLDEM 65
           S   V LLRRL R GM DEAL  FS LDSH KNT VRNVIIDLLLK GRVD+A+KVLDEM
Sbjct: 158 SPAAVLLLRRLGRAGMADEALAVFSELDSHTKNTYVRNVIIDLLLKTGRVDSAMKVLDEM 217

Query: 66  LLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTLTQLIC 125
           LLPGS+FRPN +TAD+VF KLLK+N  EGRV+ED+IAGLVAKF KH+V P+  TLTQLI 
Sbjct: 218 LLPGSEFRPNDITADIVFTKLLKVNGWEGRVREDEIAGLVAKFGKHHVFPNAITLTQLIS 277

Query: 126 KLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQP 185
           +LC +RNT+LAWNVLD+VM LNGL+  APCNALL+GL KER+FEKMNLL+R+M DM+IQP
Sbjct: 278 RLCRSRNTDLAWNVLDDVMKLNGLKDVAPCNALLTGLGKEREFEKMNLLMRRMKDMDIQP 337

Query: 186 NAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCKVGRHEE 245
           N  TF ILINHLCK RRI+DA++VF +MKGEKE AKVV PDAI YNTLIDGLCKVGR EE
Sbjct: 338 NVITFGILINHLCKFRRIDDALEVFERMKGEKE-AKVVAPDAITYNTLIDGLCKVGRQEE 397

Query: 246 GLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLVD 305
           G  LMRMMRSD C PNTVTYNCLIDG+CK GE+E+AHELFN+M +EQVVPNVITLNTLVD
Sbjct: 398 GSSLMRMMRSDGCVPNTVTYNCLIDGYCKSGEIEKAHELFNQMANEQVVPNVITLNTLVD 457

Query: 306 GMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCVP 365
           GMCKH R+SSAV+F REMQQKGL+GNSVTYTVF NAFCK NN+DKAMEF DEM K GC P
Sbjct: 458 GMCKHGRISSAVEFLREMQQKGLKGNSVTYTVFINAFCKXNNIDKAMEFFDEMFKAGCFP 517

Query: 366 DANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELF 425
           D+ +YYTLI  LAQAGRLDDAS V+SKLKEAGF LDLVC+NILISEFCKRNKLD+ANEL 
Sbjct: 518 DSIVYYTLICGLAQAGRLDDASFVMSKLKEAGFRLDLVCFNILISEFCKRNKLDKANELL 577

Query: 426 NEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYCL 485
           NEME+VGVKPDSVTYNTLISYFS+ GNFKLAH+FM +MTK EGLLP+VFTYGALIHAYCL
Sbjct: 578 NEMELVGVKPDSVTYNTLISYFSRTGNFKLAHKFMKRMTKHEGLLPTVFTYGALIHAYCL 637

Query: 486 NNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPNT 545
           NNNTDEAM+LFKEMSA +SKVPP+TVIYNILIDSLC+ NEVN ALSL DDM+ RGV+PNT
Sbjct: 638 NNNTDEAMKLFKEMSA-ASKVPPNTVIYNILIDSLCKKNEVNSALSLFDDMKVRGVKPNT 697

Query: 546 TTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLEECTLG 605
           TTYNA+FKAL EKNWL KAFELMDRM EQAC+AD ITM+ILTEWLSS GETT+L++ T G
Sbjct: 698 TTYNAVFKALWEKNWLDKAFELMDRMVEQACNADYITMEILTEWLSSVGETTKLKKFTQG 757

Query: 606 YSVSDS 612
           Y VS S
Sbjct: 758 YKVSTS 761

BLAST of HG10022628 vs. ExPASy TrEMBL
Match: A0A0A0K547 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G336570 PE=4 SV=1)

HSP 1 Score: 938.7 bits (2425), Expect = 1.2e-269
Identity = 465/612 (75.98%), Postives = 526/612 (85.95%), Query Frame = 0

Query: 1   MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALK 60
           M  ST+Q  VHLLRRL R GMVDEAL AFSTLDSHAKNT VRN II+LLLK GRVDNA+ 
Sbjct: 1   MSLSTAQSSVHLLRRLGRIGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNAMN 60

Query: 61  VLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTL 120
           VLDEMLLP S+FRPN  TA +VFN LLKI+  EGRVKED+IAGLV+KF KHN+ P    L
Sbjct: 61  VLDEMLLPESEFRPNDKTAGIVFNNLLKIDGLEGRVKEDEIAGLVSKFGKHNIFPDTIAL 120

Query: 121 TQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTD 180
           TQLI KLC + NTNLAWN+LDN+MMLNGL+ AAPCNALL+GL K R+F KMNLL+RKM D
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNLMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKV-VVPDAIMYNTLIDGLCK 240
           MNIQP   TF ILINHLCK RRI+DA++VF KMKGEKE+ KV V PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINHLCKFRRIDDALEVFEKMKGEKEETKVFVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVIT 300
           VGR EE L LM  MRSD CAP T T+NCLI+G+C+ GE+E AH+LFNEM + Q+ PNVIT
Sbjct: 241 VGRQEEALCLMGKMRSDQCAPTTATFNCLINGYCRSGEIEVAHKLFNEMENAQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AV+FFR MQQKGL+GN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVEFFRVMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA +YYTLI  LAQAGRLDDASSVVSKLKEAGFCLD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLAQAGRLDDASSVVSKLKEAGFCLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGAL 480
           RA E  NEME+ GVKPDSVTYNTLISYFSKIGNFKLAH+FM KMT+EEGL P+VFTYGAL
Sbjct: 421 RAQEWLNEMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEEGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFR 540
           IHAYCLNNN DEA+++FKEM+  +SKVPP+TVIYNILIDSLC+  +VN+ALSLLDDM+FR
Sbjct: 481 IHAYCLNNNIDEAIKIFKEMNNVASKVPPNTVIYNILIDSLCKQTQVNFALSLLDDMKFR 540

Query: 541 GVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQL 600
           GV PNTTTYN+IFKALR+KNWL KAF+LMDRM EQAC+ D ITM+ILTEWLS+ GE T+L
Sbjct: 541 GVMPNTTTYNSIFKALRDKNWLDKAFKLMDRMVEQACNPDYITMEILTEWLSAVGEITKL 600

Query: 601 EECTLGYSVSDS 612
           ++ T G  VSDS
Sbjct: 601 KKFTQGCMVSDS 612

BLAST of HG10022628 vs. ExPASy TrEMBL
Match: A0A1S3BGI7 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103489753 PE=4 SV=1)

HSP 1 Score: 934.1 bits (2413), Expect = 3.0e-268
Identity = 466/612 (76.14%), Postives = 530/612 (86.60%), Query Frame = 0

Query: 1   MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALK 60
           M  ST+Q  VHLLR L R GMVDEAL AFSTLDSHAKNT VRN II+LLLK GRVDNAL 
Sbjct: 1   MSLSTAQSAVHLLRHLGRVGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTL 120
           VL EMLLP S+FRPN  TA +VFNK+LKI+ SEGR KED+IAGLV+KF K+N+ P    L
Sbjct: 61  VLYEMLLPESEFRPNDKTAGIVFNKMLKIDGSEGRAKEDEIAGLVSKFGKYNIFPDTIAL 120

Query: 121 TQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTD 180
           TQLI KLC + NTNLAWN+LDN+MMLNGL+ AAPCNALL+GL K R+F KMNLL+RKM D
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNMMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVV-PDAIMYNTLIDGLCK 240
           MNIQP   TF ILIN+LCK RRI+DA++VF KMKGEKE+A+VVV PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINYLCKFRRIDDALEVFEKMKGEKEEAEVVVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVIT 300
           VGR EEGLRLM  MRS  CAP T TYNCLI+G+C+ GE+E A++LF+EMVSEQ+ PNVIT
Sbjct: 241 VGRQEEGLRLMGTMRSGQCAPTTATYNCLINGYCRAGEIEVANKLFSEMVSEQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AV FFR+MQQKGL+GN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVKFFRDMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA +YYTLI  LA+AGRLDD+SSVVSKLKEAGF LD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLARAGRLDDSSSVVSKLKEAGFFLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGAL 480
           RA E  N+ME+ GVKPDSVTYNTLISYFSKIGNFKLAH+FM KMT+E+GL P+VFTYGAL
Sbjct: 421 RAQEWLNQMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEDGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFR 540
           IHAYCLNNN DEA++LFKEM+  +SKVPP+TVIYNILIDSLC+  EVN+ALSLLDDM+FR
Sbjct: 481 IHAYCLNNNIDEAIKLFKEMN-VASKVPPNTVIYNILIDSLCKQTEVNFALSLLDDMKFR 540

Query: 541 GVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQL 600
           GV PNTTTYN+IFKALRE NWL KAF+LMDRM EQAC+AD ITM+ILTEWLSS GETT+L
Sbjct: 541 GVMPNTTTYNSIFKALRENNWLDKAFKLMDRMVEQACNADYITMEILTEWLSSVGETTKL 600

Query: 601 EECTLGYSVSDS 612
           ++ T G  VSDS
Sbjct: 601 KKFTQGCMVSDS 611

BLAST of HG10022628 vs. ExPASy TrEMBL
Match: A0A5D3DT18 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold124G00300 PE=4 SV=1)

HSP 1 Score: 934.1 bits (2413), Expect = 3.0e-268
Identity = 466/612 (76.14%), Postives = 530/612 (86.60%), Query Frame = 0

Query: 1   MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALK 60
           M  ST+Q  VHLLR L R GMVDEAL AFSTLDSHAKNT VRN II+LLLK GRVDNAL 
Sbjct: 1   MSLSTAQSAVHLLRHLGRVGMVDEALAAFSTLDSHAKNTNVRNEIINLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTL 120
           VL EMLLP S+FRPN  TA +VFNK+LKI+ SEGR KED+IAGLV+KF K+N+ P    L
Sbjct: 61  VLYEMLLPESEFRPNDKTAGIVFNKMLKIDGSEGRAKEDEIAGLVSKFGKYNIFPDTIAL 120

Query: 121 TQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTD 180
           TQLI KLC + NTNLAWN+LDN+MMLNGL+ AAPCNALL+GL K R+F KMNLL+RKM D
Sbjct: 121 TQLISKLCRSGNTNLAWNILDNMMMLNGLKDAAPCNALLTGLGKAREFGKMNLLMRKMKD 180

Query: 181 MNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVV-PDAIMYNTLIDGLCK 240
           MNIQP   TF ILIN+LCK RRI+DA++VF KMKGEKE+A+VVV PD IMYNTLIDGLCK
Sbjct: 181 MNIQPTVITFGILINYLCKFRRIDDALEVFEKMKGEKEEAEVVVAPDTIMYNTLIDGLCK 240

Query: 241 VGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVIT 300
           VGR EEGLRLM  MRS  CAP T TYNCLI+G+C+ GE+E A++LF+EMVSEQ+ PNVIT
Sbjct: 241 VGRQEEGLRLMGTMRSGQCAPTTATYNCLINGYCRAGEIEVANKLFSEMVSEQIEPNVIT 300

Query: 301 LNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMS 360
           LNTLVDGMCKH+R+S+AV FFR+MQQKGL+GN+VTYTVF NAFC  NNM+KAMEFLDEMS
Sbjct: 301 LNTLVDGMCKHNRISTAVKFFRDMQQKGLKGNNVTYTVFINAFCNVNNMNKAMEFLDEMS 360

Query: 361 KDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLD 420
           KDGC PDA +YYTLI  LA+AGRLDD+SSVVSKLKEAGF LD VCYN+LISEFCK+NKLD
Sbjct: 361 KDGCFPDAVVYYTLICGLARAGRLDDSSSVVSKLKEAGFFLDRVCYNVLISEFCKKNKLD 420

Query: 421 RANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGAL 480
           RA E  N+ME+ GVKPDSVTYNTLISYFSKIGNFKLAH+FM KMT+E+GL P+VFTYGAL
Sbjct: 421 RAQEWLNQMELAGVKPDSVTYNTLISYFSKIGNFKLAHKFMKKMTEEDGLSPTVFTYGAL 480

Query: 481 IHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFR 540
           IHAYCLNNN DEA++LFKEM+  +SKVPP+TVIYNILIDSLC+  EVN+ALSLLDDM+FR
Sbjct: 481 IHAYCLNNNIDEAIKLFKEMN-VASKVPPNTVIYNILIDSLCKQTEVNFALSLLDDMKFR 540

Query: 541 GVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQL 600
           GV PNTTTYN+IFKALRE NWL KAF+LMDRM EQAC+AD ITM+ILTEWLSS GETT+L
Sbjct: 541 GVMPNTTTYNSIFKALRENNWLDKAFKLMDRMVEQACNADYITMEILTEWLSSVGETTKL 600

Query: 601 EECTLGYSVSDS 612
           ++ T G  VSDS
Sbjct: 601 KKFTQGCMVSDS 611

BLAST of HG10022628 vs. ExPASy TrEMBL
Match: A0A6J1H7V3 (pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111460949 PE=4 SV=1)

HSP 1 Score: 932.2 bits (2408), Expect = 1.1e-267
Identity = 461/610 (75.57%), Postives = 525/610 (86.07%), Query Frame = 0

Query: 1   MPYSTSQKVVHLLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALK 60
           M +ST+Q  V+LLRRL R+GMVDEAL  F+ LD H KNT VRNVIIDLLLK GRVDNAL 
Sbjct: 1   MAHSTTQNAVNLLRRLGRSGMVDEALAVFTELDPHKKNTYVRNVIIDLLLKSGRVDNALN 60

Query: 61  VLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTL 120
           VLDEMLLPGS+FRPN +TAD+VF KLLKIN S+ RVKE++IAGLVAKF KH+VVP+P TL
Sbjct: 61  VLDEMLLPGSEFRPNDITADIVFTKLLKINGSDWRVKENEIAGLVAKFGKHHVVPNPITL 120

Query: 121 TQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTD 180
           TQLI KLC +RN +LAW+VLD++M  NGL+ AAPCNALL+GL KER+FEKMNLL+RKM D
Sbjct: 121 TQLISKLCRSRNIDLAWSVLDDLMTWNGLKDAAPCNALLTGLGKEREFEKMNLLMRKMKD 180

Query: 181 MNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCKV 240
           MN+QPN  TF ILINH+CK R I+DA++VF KMKGEKE+AK V PDAI YNTLIDGLCKV
Sbjct: 181 MNVQPNVITFGILINHMCKFRMIDDALEVFEKMKGEKEEAKAVAPDAITYNTLIDGLCKV 240

Query: 241 GRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITL 300
           GR EEGLRL+  MRSD C PNTVTYNCLIDG+CK G +E AH L++EMV+EQVVPNVITL
Sbjct: 241 GRQEEGLRLLGTMRSDGCKPNTVTYNCLIDGYCKAGAIETAHALYDEMVNEQVVPNVITL 300

Query: 301 NTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSK 360
           NTLVDGMCKH R+SSAV+FFREMQQKGL+GNSVTYTVF NAFCK NN+DKA+EF DEMSK
Sbjct: 301 NTLVDGMCKHGRISSAVEFFREMQQKGLKGNSVTYTVFINAFCKVNNIDKAIEFFDEMSK 360

Query: 361 DGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDR 420
            GC PDA +YYTLI  LAQAGRLDDA+SV  ++K AGF LDLVCYN+LISEFCK+NKLD 
Sbjct: 361 AGCFPDAIVYYTLICGLAQAGRLDDATSVALRMKAAGFRLDLVCYNLLISEFCKKNKLDE 420

Query: 421 ANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALI 480
           ANEL +EMEVVG+KPDSVTYNTLISYFSK+GN +LA +FM KMTKE  LLPSVFTYGA+I
Sbjct: 421 ANELLDEMEVVGIKPDSVTYNTLISYFSKMGNLELALKFMEKMTKEACLLPSVFTYGAVI 480

Query: 481 HAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRG 540
           HAYCL NNTDEAM+LFKEM+AT SKVPP+TVIYNILIDSLC+ NEV+ AL+L DDM   G
Sbjct: 481 HAYCLTNNTDEAMKLFKEMTAT-SKVPPNTVIYNILIDSLCKRNEVSSALALFDDMNVNG 540

Query: 541 VRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLE 600
           V+PNTTTYNAI K L+EK WL KA E+MDRM EQAC+AD ITM+ILTEWLSS GET +L+
Sbjct: 541 VKPNTTTYNAILKGLKEKKWLEKALEVMDRMVEQACNADYITMEILTEWLSSVGETAKLK 600

Query: 601 ECTLGYSVSD 611
             T GY VSD
Sbjct: 601 RFTQGYKVSD 609

BLAST of HG10022628 vs. TAIR 10
Match: AT3G61520.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 533.5 bits (1373), Expect = 2.3e-151
Identity = 280/601 (46.59%), Postives = 396/601 (65.89%), Query Frame = 0

Query: 12  LLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALKVLDEMLLPGSK 71
           L+R   R GMV++++  +  LDS+ KN++VRNV++D+LL+ G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADVVFNKLLKINESEGR-VKEDDIAGLVAKFCKHNVVPHPFTLTQLICKLCSN 131
           F PN +TAD+V +++ K     GR + E+ I  L+++F  H V P+   LT+ I  LC N
Sbjct: 218 FPPNRITADIVLHEVWK-----GRLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKN 277

Query: 132 RNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQPNAFTF 191
              N AW++L ++M       A P NALLS L +     +MN L+ KM ++ I+P+  T 
Sbjct: 278 ARANAAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTL 337

Query: 192 QILINHLCKLRRIEDAVKVFMKMKGEK-EQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRL 251
            ILIN LCK RR+++A++VF KM+G++ +   V+  D+I +NTLIDGLCKVGR +E   L
Sbjct: 338 GILINTLCKSRRVDEALEVFEKMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEEL 397

Query: 252 M-RMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLVDGMC 311
           + RM   + CAPN VTYNCLIDG+C+ G++E A E+ + M  +++ PNV+T+NT+V GMC
Sbjct: 398 LVRMKLEERCAPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMC 457

Query: 312 KHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCVPDAN 371
           +H  ++ AV FF +M+++G++GN VTY    +A C  +N++KAM + ++M + GC PDA 
Sbjct: 458 RHHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAK 517

Query: 372 IYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELFNEM 431
           IYY LI  L Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+  +M
Sbjct: 518 IYYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNTEKVYEMLTDM 577

Query: 432 EVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNN 491
           E  G KPDS+TYNTLIS+F K  +F+     M +M +E+GL P+V TYGA+I AYC    
Sbjct: 578 EKEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGE 637

Query: 492 TDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPNTTTY 551
            DEA++LFK+M    SKV P+TVIYNILI++  +      ALSL ++M+ + VRPN  TY
Sbjct: 638 LDEALKLFKDM-GLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETY 697

Query: 552 NAIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLEECTLGYSV 610
           NA+FK L EK       +LMD M EQ+C  + ITM+IL E LS   E  +L +   GYSV
Sbjct: 698 NALFKCLNEKTQGETLLKLMDEMVEQSCEPNQITMEILMERLSGSDELVKLRKFMQGYSV 751

BLAST of HG10022628 vs. TAIR 10
Match: AT5G28460.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 531.6 bits (1368), Expect = 8.6e-151
Identity = 277/600 (46.17%), Postives = 394/600 (65.67%), Query Frame = 0

Query: 12  LLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALKVLDEMLLPGSK 71
           L+R   R GMV++++  +  LDS+ KN++VRNV++D+LL+ G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTLTQLICKLCSNR 131
           F PN +TAD+V +++ K    E  + E+ I  L+++F  H V P+   LT+ I  LC N 
Sbjct: 218 FPPNRITADIVLHEVWK----ERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 277

Query: 132 NTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQPNAFTFQ 191
             N AW++L ++M       A P NALLS L +     +MN L+ KM ++ I+P+  T  
Sbjct: 278 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 337

Query: 192 ILINHLCKLRRIEDAVKVFMKMKGEK-EQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLM 251
           ILIN LCK RR+++A++VF +M+G++ +   V+  D+I +NTLIDGLCKVGR +E   L+
Sbjct: 338 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 397

Query: 252 -RMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLVDGMCK 311
            RM   + C PN VTYNCLIDG+C+ G++E A E+ + M  +++ PNV+T+NT+V GMC+
Sbjct: 398 VRMKLEERCVPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMCR 457

Query: 312 HDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCVPDANI 371
           H  ++ AV FF +M+++G++GN VTY    +A C  +N++KAM + ++M + GC PDA I
Sbjct: 458 HHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAKI 517

Query: 372 YYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELFNEME 431
           YY LI  L Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+  +ME
Sbjct: 518 YYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNAEKVYEMLTDME 577

Query: 432 VVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNNT 491
             G KPDS+TYNTLIS+F K  +F+     M +M +E+GL P+V TYGA+I AYC     
Sbjct: 578 KEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGEL 637

Query: 492 DEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPNTTTYN 551
           DEA++LFK+M    SKV P+TVIYNILI++  +      ALSL ++M+ + VRPN  TYN
Sbjct: 638 DEALKLFKDM-GLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETYN 697

Query: 552 AIFKALREKNWLHKAFELMDRMGEQACHADDITMKILTEWLSSDGETTQLEECTLGYSVS 610
           A+FK L EK       +LMD M EQ+C  + ITM+IL E LS   E  +L +   GYSV+
Sbjct: 698 ALFKCLNEKTQGETLLKLMDEMVEQSCEPNQITMEILMERLSGSDELVKLRKFMQGYSVA 751

BLAST of HG10022628 vs. TAIR 10
Match: AT5G28370.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 503.4 bits (1295), Expect = 2.5e-142
Identity = 261/564 (46.28%), Postives = 372/564 (65.96%), Query Frame = 0

Query: 12  LLRRLERTGMVDEALTAFSTLDSHAKNTKVRNVIIDLLLKFGRVDNALKVLDEMLLPGSK 71
           L+R   R GMV++++  +  LDS+ KN++VRNV++D+LL+ G VD+A KVLDEML   S 
Sbjct: 158 LIRWFGRMGMVNQSVLVYERLDSNMKNSQVRNVVVDVLLRNGLVDDAFKVLDEMLQKESV 217

Query: 72  FRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTLTQLICKLCSNR 131
           F PN +TAD+V +++ K    E  + E+ I  L+++F  H V P+   LT+ I  LC N 
Sbjct: 218 FPPNRITADIVLHEVWK----ERLLTEEKIIALISRFSSHGVSPNSVWLTRFISSLCKNA 277

Query: 132 NTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQPNAFTFQ 191
             N AW++L ++M       A P NALLS L +     +MN L+ KM ++ I+P+  T  
Sbjct: 278 RANTAWDILSDLMKNKTPLEAPPFNALLSCLGRNMDISRMNDLVLKMDEVKIRPDVVTLG 337

Query: 192 ILINHLCKLRRIEDAVKVFMKMKGEK-EQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLM 251
           ILIN LCK RR+++A++VF +M+G++ +   V+  D+I +NTLIDGLCKVGR +E   L+
Sbjct: 338 ILINTLCKSRRVDEALEVFEQMRGKRTDDGNVIKADSIHFNTLIDGLCKVGRLKEAEELL 397

Query: 252 -RMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLVDGMCK 311
            RM   + C PN VTYNCLIDG+C+ G++E A E+ + M  +++ PNV+T+NT+V GMC+
Sbjct: 398 VRMKLEERCVPNAVTYNCLIDGYCRAGKLETAKEVVSRMKEDEIKPNVVTVNTIVGGMCR 457

Query: 312 HDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCVPDANI 371
           H  ++ AV FF +M+++G++GN VTY    +A C  +N++KAM + ++M + GC PDA I
Sbjct: 458 HHGLNMAVVFFMDMEKEGVKGNVVTYMTLIHACCSVSNVEKAMYWYEKMLEAGCSPDAKI 517

Query: 372 YYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANELFNEME 431
           YY LI  L Q  R  DA  VV KLKE GF LDL+ YN+LI  FC +N  ++  E+  +ME
Sbjct: 518 YYALISGLCQVRRDHDAIRVVEKLKEGGFSLDLLAYNMLIGLFCDKNNAEKVYEMLTDME 577

Query: 432 VVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNNT 491
             G KPDS+TYNTLIS+F K  +F+     M +M +E+GL P+V TYGA+I AYC     
Sbjct: 578 KEGKKPDSITYNTLISFFGKHKDFESVERMMEQM-REDGLDPTVTTYGAVIDAYCSVGEL 637

Query: 492 DEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPNTTTYN 551
           DEA++LFK+M    SKV P+TVIYNILI++  +      ALSL ++M+ + VRPN  TYN
Sbjct: 638 DEALKLFKDM-GLHSKVNPNTVIYNILINAFSKLGNFGQALSLKEEMKMKMVRPNVETYN 697

Query: 552 AIFKALREKNWLHKAFELMDRMGE 574
           A+FK L EK       +LMD M E
Sbjct: 698 ALFKCLNEKTQGETLLKLMDEMVE 715

BLAST of HG10022628 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 260.0 bits (663), Expect = 4.8e-69
Identity = 169/591 (28.60%), Postives = 278/591 (47.04%), Query Frame = 0

Query: 12  LLRRLERTGMVDEALTAFSTLDSH--AKNTKVRNVIIDLLLKFGRVDNALKVLDEM---- 71
           +L RL R+G  D+       + S      T    ++I+   +F   D  L V+D M    
Sbjct: 89  ILLRLGRSGSFDDMKKILEDMKSSRCEMGTSTFLILIESYAQFELQDEILSVVDWMIDEF 148

Query: 72  -LLPGSKFRPNYVTADVVFNKLLKINESEGRVKEDDIAGLVAKFCKHNVVPHPFTLTQLI 131
            L P + F          +N++L +      +K  +I+   AK     + P   T   LI
Sbjct: 149 GLKPDTHF----------YNRMLNLLVDGNSLKLVEISH--AKMSVWGIKPDVSTFNVLI 208

Query: 132 CKLCSNRNTNLAWNVLDNVMMLNGLRHAAPCNALLSGLEKERKFEKMNLLLRKMTDMNIQ 191
             LC       A  +L+++     +        ++ G  +E   +    +  +M +    
Sbjct: 209 KALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCS 268

Query: 192 PNAFTFQILINHLCKLRRIEDAVKVFMKMKGEKEQAKVVVPDAIMYNTLIDGLCKVGRHE 251
            +  +  ++++  CK  R+EDA+    +M  +        PD   +NTL++GLCK G  +
Sbjct: 269 WSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDG----FFPDQYTFNTLVNGLCKAGHVK 328

Query: 252 EGLRLMRMMRSDDCAPNTVTYNCLIDGFCKVGEVERAHELFNEMVSEQVVPNVITLNTLV 311
             + +M +M  +   P+  TYN +I G CK+GEV+ A E+ ++M++    PN +T NTL+
Sbjct: 329 HAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLI 388

Query: 312 DGMCKHDRVSSAVDFFREMQQKGLRGNSVTYTVFTNAFCKANNMDKAMEFLDEMSKDGCV 371
             +CK ++V  A +  R +  KG+  +  T+       C   N   AME  +EM   GC 
Sbjct: 389 STLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCE 448

Query: 372 PDANIYYTLIRDLAQAGRLDDASSVVSKLKEAGFCLDLVCYNILISEFCKRNKLDRANEL 431
           PD   Y  LI  L   G+LD+A +++ +++ +G    ++ YN LI  FCK NK   A E+
Sbjct: 449 PDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEI 508

Query: 432 FNEMEVVGVKPDSVTYNTLISYFSKIGNFKLAHEFMNKMTKEEGLLPSVFTYGALIHAYC 491
           F+EMEV GV  +SVTYNTLI    K    + A + M++M   EG  P  +TY +L+  +C
Sbjct: 509 FDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIM-EGQKPDKYTYNSLLTHFC 568

Query: 492 LNNNTDEAMELFKEMSATSSKVPPDTVIYNILIDSLCENNEVNYALSLLDDMEFRGVRPN 551
              +  +A ++ + M  TS+   PD V Y  LI  LC+   V  A  LL  ++ +G+   
Sbjct: 569 RGGDIKKAADIVQAM--TSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLT 628

Query: 552 TTTYNAIFKALREKNWLHKAFELMDRMGEQ-ACHADDITMKILTEWLSSDG 595
              YN + + L  K    +A  L   M EQ     D ++ +I+   L + G
Sbjct: 629 PHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGG 660

BLAST of HG10022628 vs. TAIR 10
Match: AT1G12300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 251.5 bits (641), Expect = 1.7e-66
Identity = 166/587 (28.28%), Postives = 290/587 (49.40%), Query Frame = 0

Query: 40  KVRNVIIDLLLKFGRVDNALKVLDEMLLPGSKFRPNYVTADVVFNKLLKINESEGRVKED 99
           ++R+ ++D+     + D+A+ +  +M+   S+  P  +    +F+ + K  + +      
Sbjct: 59  RLRSGLVDI-----KADDAIDLFRDMI--HSRPLPTVIDFSRLFSAIAKTKQYD------ 118

Query: 100 DIAGLVAKFCKH----NVVPHPFTLTQLICKLCSNRNTNLAWNVLDNVMMLNGLRHAAPC 159
               LV   CK      +  + +TL+ +I   C  R   LA++ +  ++ L    +    
Sbjct: 119 ----LVLALCKQMELKGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITF 178

Query: 160 NALLSGLEKERKFEKMNLLLRKMTDMNIQPNAFTFQILINHLCKLRRIEDAVKVFMKMKG 219
           + L++GL  E +  +   L+ +M +M  +P+  T   L+N LC   +  +A+ +  KM  
Sbjct: 179 STLINGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVE 238

Query: 220 EKEQAKVVVPDAIMYNTLIDGLCKVGRHEEGLRLMRMMRSDDCAPNTVTYNCLIDGFCKV 279
              Q     P+A+ Y  +++ +CK G+    + L+R M   +   + V Y+ +IDG CK 
Sbjct: 239 YGCQ-----PNAVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKH 298

Query: 280 GEVERAHELFNEMVSEQVVPNVITLNTLVDGMCKHDRVSSAVDFFREMQQKGLRGNSVTY 339
           G ++ A  LFNEM  + +  N+IT N L+ G C   R        R+M ++ +  N VT+
Sbjct: 299 GSLDNAFNLFNEMEMKGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTF 358

Query: 340 TVFTNAFCKANNMDKAMEFLDEMSKDGCVPDANIYYTLIRDLAQAGRLDDASSVVSKLKE 399
           +V  ++F K   + +A E   EM   G  PD   Y +LI    +   LD A+ +V  +  
Sbjct: 359 SVLIDSFVKEGKLREAEELHKEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVDLMVS 418

Query: 400 AGFCLDLVCYNILISEFCKRNKLDRANELFNEMEVVGVKPDSVTYNTLISYFSKIGNFKL 459
            G   ++  +NILI+ +CK N++D   ELF +M + GV  D+VTYNTLI  F ++G   +
Sbjct: 419 KGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLNV 478

Query: 460 AHEFMNKMTKEEGLLPSVFTYGALIHAYCLNNNTDEAMELFKEMSATSSKVPPDTVIYNI 519
           A E   +M   + + P++ TY  L+   C N  +++A+E+F+++    SK+  D  IYNI
Sbjct: 479 AKELFQEMVSRK-VPPNIVTYKILLDGLCDNGESEKALEIFEKIE--KSKMELDIGIYNI 538

Query: 520 LIDSLCENNEVNYALSLLDDMEFRGVRPNTTTYNAIFKALREKNWLHKAFELMDRMGEQA 579
           +I  +C  ++V+ A  L   +  +GV+P   TYN +   L +K  L +A  L  +M E  
Sbjct: 539 IIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKMEEDG 598

Query: 580 CHADDITMKILTEWLSSDGETT-------QLEECTLGYSVSDSVYSL 616
              D  T  IL      DG+ T       +L+ C  G+SV  S   +
Sbjct: 599 HAPDGWTYNILIRAHLGDGDATKSVKLIEELKRC--GFSVDASTIKM 618

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038898803.19.2e-28881.86pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Benin... [more]
XP_022147524.13.4e-27478.55pentatricopeptide repeat-containing protein At5g28460 [Momordica charantia][more]
XP_011659175.12.5e-26975.98pentatricopeptide repeat-containing protein At3g61520, mitochondrial [Cucumis sa... [more]
KAG7025273.17.3e-26976.07Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_023514674.17.3e-26976.07pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like [Cucur... [more]
Match NameE-valueIdentityDescription
Q9M3163.2e-15046.59Pentatricopeptide repeat-containing protein At3g61520, mitochondrial OS=Arabidop... [more]
Q9LKU81.2e-14946.17Pentatricopeptide repeat-containing protein At5g28460 OS=Arabidopsis thaliana OX... [more]
Q9LFF16.8e-6828.60Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
Q0WKV32.4e-6528.28Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidop... [more]
Q9FIX33.2e-6531.57Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1D1919.6e-27578.71pentatricopeptide repeat-containing protein At5g28460 OS=Momordica charantia OX=... [more]
A0A0A0K5471.2e-26975.98Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G336570 PE=4 SV=1[more]
A0A1S3BGI73.0e-26876.14pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like OS=Cuc... [more]
A0A5D3DT183.0e-26876.14Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1H7V31.1e-26775.57pentatricopeptide repeat-containing protein At3g61520, mitochondrial-like OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT3G61520.12.3e-15146.59Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G28460.18.6e-15146.17Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G28370.12.5e-14246.28Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G53700.14.8e-6928.60Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G12300.11.7e-6628.28Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 155..199
e-value: 4.8E-10
score: 39.5
coord: 260..309
e-value: 1.2E-18
score: 67.1
coord: 435..484
e-value: 1.2E-13
score: 51.0
coord: 508..555
e-value: 4.6E-15
score: 55.6
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 397..429
e-value: 3.4E-11
score: 42.7
coord: 223..254
e-value: 4.6E-9
score: 35.9
coord: 326..359
e-value: 9.1E-9
score: 34.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 511..545
e-value: 1.7E-8
score: 32.1
coord: 188..216
e-value: 1.1E-5
score: 23.3
coord: 474..501
e-value: 1.9E-7
score: 28.8
coord: 263..297
e-value: 9.5E-12
score: 42.3
coord: 228..262
e-value: 1.1E-10
score: 39.0
coord: 298..331
e-value: 3.5E-7
score: 28.0
coord: 438..472
e-value: 7.7E-7
score: 26.9
coord: 403..437
e-value: 3.9E-8
score: 31.0
coord: 333..367
e-value: 3.2E-10
score: 37.5
coord: 155..187
e-value: 4.4E-4
score: 18.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 43..66
e-value: 0.014
score: 15.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 331..365
score: 12.912469
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 226..260
score: 13.065928
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 296..330
score: 11.421732
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 186..220
score: 9.799459
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 261..295
score: 13.898986
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 436..471
score: 10.47906
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 401..435
score: 12.539784
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 509..543
score: 12.013642
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 472..506
score: 11.016164
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 366..400
score: 9.317163
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 151..185
score: 8.571795
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 544..578
score: 8.867749
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 291..359
e-value: 2.8E-20
score: 74.6
coord: 430..503
e-value: 3.9E-21
score: 77.4
coord: 152..216
e-value: 2.8E-13
score: 51.8
coord: 217..290
e-value: 3.7E-26
score: 93.8
coord: 360..429
e-value: 7.9E-17
score: 63.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 504..604
e-value: 5.7E-22
score: 79.9
coord: 3..151
e-value: 1.1E-11
score: 46.5
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 32..386
NoneNo IPR availablePANTHERPTHR47942:SF64F21B23.6 PROTEINcoord: 10..610
NoneNo IPR availablePANTHERPTHR47942TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATEDcoord: 10..610

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022628.1HG10022628.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding