HG10002532 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10002532
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr11: 7902161 .. 7904629 (-)
RNA-Seq Expression: HG10002532
Synteny: HG10002532
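The location string above can be turned into a feature length with a few lines of Python. This is only a sketch: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state. Under that assumption the span is 2469 bp, which matches an 822-residue protein plus one stop codon (823 codons).

```python
import re

def parse_location(text):
    """Split a location such as 'Chr11: 7902161 .. 7904629 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr11: 7902161 .. 7904629 (-)")
length = span_length(start, end)
print(chrom, strand, length)
```

Because the gene, mRNA, and CDS sequences listed on this page are identical, the whole span is coding, so the length divides evenly into codons.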
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATTTGACTAGATTTAAAATCAATAAGACAGTTCCTGTATTGTTTCCCTTCTCTCGCCGGCTGGCCTGTGCGTTATCCACCCAACCGCATAAAGAACACCACCAGGACCCGCCATGGCAGCTCCAGGATCAGTTGCTCTTTTGGGTATCTTCTATTCTCTCTAATTCGTCTCTCGACTCTTCTAAATGTAGAGCCCTCTTGCCCCATTTGTCTCCTTTTCAGTTTGATCAGCTCTTCTTCTCCGTTGGATTGAAAGCCAACCCCAACACTTGTCTTAATTTCTTTTACTTTGCGTCTGATTCTTTCAAGTTTCGATTTACCATTCGTTCTTATTGTATATTGATTCTTTTGCTTGTTCATTCCAAGTTTTTACACCCCGCGAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTCGGATACGAATAAGCTTCACATTGAGATAGCCAATGCATTGTTTGGTTTAACCTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTGTTGATACATGTATACGGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTATTGATGTGTTTTATTTGTTTGCTCGTAAGGGAATCTTTCCATCGTTAAAGACTTGTAGTTTTTTATTGAGCTCTTTGGTAAAGGCTAATGAACTTGAGAAATGTTGTGAACTATTTGAAGTGATGTCCCAAGGTGTTCGTCCAGATGTTTTCTTATTTACGAATGTGATAAATGCTTTATGCAAGGGAGGGAAGATGGAAAAAGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGTATTTCTCCCAATGTTGTTACTTATAATATTATTATTCATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAGGAGAAGATGAAAATGGAAGGGGTAAAGCCAAGTCATATAACTTATAACGTGCTTATTAATGGTTTGACAAAACTGGGACACTTTGACAAAGTGAATCATGTTTTAGATGAAATGGTTGATGCGGGTTTTGTTCCGAATGCAGTTGACTACAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGATGAAGCGCTTTGGATTAAAGATTTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATACTCTCATGCAAGGATTCTGCGAGAGCAATCAAATCGAGCAAGCAGAGAATGCTCTTGAGGAGATATTATCACGTGGGCTATCTATAAACCCTGATACTTGTTATTCGGTTGTCCACTGGTTATGTAAGAAGTGTAGGGTACATTCTGCATTCCGATTCACTAAGGTGATGTTATCGAAGAACTTCAGGCCTAGAGATCAACTCTTAACCATATTGGTACGTGAGCTATGTAAGGACGGTAAACATTTAGAAGCAACGGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTATGGCAACCTCCAACGCTCTAATACATGGACTTTGTGGTGCTGGTAATTTGCCAGAGGCTGTTAGAATAGTCGAAGAGATGTTGGAGAGGGGTTTTCCAGTGGATGAGATCACATTCAACACACTCATCTTAGGTTGTTGCAGAGAGGGAAAAGTTGAGGAATGCTTTAGACTTAAAGAAGAGATGACCAAACAAGGAATTCAACCAGATATCTATACTTGCAATTTTCTATTGCGTGGACTATGCAGTGCAGGAAAATTGGACGATGCTATTAAGCTTTGGGATGAATACAAAGCTAGTGGGCTGGTTTCTAATGTTCACACTTACGGAGTAATGATGGATGGTTATTGTAAAGCTAACAGAATGGAAGATGTTGAAAAATTATTTAATGAATTAGTCGCTAAGAGAATGGAGCTGAATACCATTGTCTACAATATATTTATCCGAGCAAATTGCCACAATGGAAATGTCGCTGCAGCTTTGCAACTTCGTGATGATATGAAAAGCAAGGGAATTTTACCAACTTGTGCCACATATTC
TTCTCTAATACATGGCATGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAGAGGAGGGATTGTCGCCAAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGTTGGGGCAAATGGATATTGCTGAAGCTACTTGGCTTGAGATGATCTCTTTTAACATACGACCTAACAAATTTACCTACACCGTCATGATTGATGGGTACTGTAAATTGGGGAATATGGAAGAAGCAAATAACCTTCTGGCCAAAATGAAGGAAAGTCGAATCGTTCCAGATGTTGTTACATACAATGCCTTGACTAATGGATTTTGCAAGGGAAAGAACATGGATAAAGCTTTTAAAGTATGCGATCAAATGGCCACTGGAGGATTATCTTTAGATGAAATTACTTACACTACTCTCGTACATGGTTGGAATCGACCTACAATTACTAGCCAAGACTGA

mRNA sequence

ATGCATTTGACTAGATTTAAAATCAATAAGACAGTTCCTGTATTGTTTCCCTTCTCTCGCCGGCTGGCCTGTGCGTTATCCACCCAACCGCATAAAGAACACCACCAGGACCCGCCATGGCAGCTCCAGGATCAGTTGCTCTTTTGGGTATCTTCTATTCTCTCTAATTCGTCTCTCGACTCTTCTAAATGTAGAGCCCTCTTGCCCCATTTGTCTCCTTTTCAGTTTGATCAGCTCTTCTTCTCCGTTGGATTGAAAGCCAACCCCAACACTTGTCTTAATTTCTTTTACTTTGCGTCTGATTCTTTCAAGTTTCGATTTACCATTCGTTCTTATTGTATATTGATTCTTTTGCTTGTTCATTCCAAGTTTTTACACCCCGCGAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTCGGATACGAATAAGCTTCACATTGAGATAGCCAATGCATTGTTTGGTTTAACCTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTGTTGATACATGTATACGGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTATTGATGTGTTTTATTTGTTTGCTCGTAAGGGAATCTTTCCATCGTTAAAGACTTGTAGTTTTTTATTGAGCTCTTTGGTAAAGGCTAATGAACTTGAGAAATGTTGTGAACTATTTGAAGTGATGTCCCAAGGTGTTCGTCCAGATGTTTTCTTATTTACGAATGTGATAAATGCTTTATGCAAGGGAGGGAAGATGGAAAAAGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGTATTTCTCCCAATGTTGTTACTTATAATATTATTATTCATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAGGAGAAGATGAAAATGGAAGGGGTAAAGCCAAGTCATATAACTTATAACGTGCTTATTAATGGTTTGACAAAACTGGGACACTTTGACAAAGTGAATCATGTTTTAGATGAAATGGTTGATGCGGGTTTTGTTCCGAATGCAGTTGACTACAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGATGAAGCGCTTTGGATTAAAGATTTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATACTCTCATGCAAGGATTCTGCGAGAGCAATCAAATCGAGCAAGCAGAGAATGCTCTTGAGGAGATATTATCACGTGGGCTATCTATAAACCCTGATACTTGTTATTCGGTTGTCCACTGGTTATGTAAGAAGTGTAGGGTACATTCTGCATTCCGATTCACTAAGGTGATGTTATCGAAGAACTTCAGGCCTAGAGATCAACTCTTAACCATATTGGTACGTGAGCTATGTAAGGACGGTAAACATTTAGAAGCAACGGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTATGGCAACCTCCAACGCTCTAATACATGGACTTTGTGGTGCTGGTAATTTGCCAGAGGCTGTTAGAATAGTCGAAGAGATGTTGGAGAGGGGTTTTCCAGTGGATGAGATCACATTCAACACACTCATCTTAGGTTGTTGCAGAGAGGGAAAAGTTGAGGAATGCTTTAGACTTAAAGAAGAGATGACCAAACAAGGAATTCAACCAGATATCTATACTTGCAATTTTCTATTGCGTGGACTATGCAGTGCAGGAAAATTGGACGATGCTATTAAGCTTTGGGATGAATACAAAGCTAGTGGGCTGGTTTCTAATGTTCACACTTACGGAGTAATGATGGATGGTTATTGTAAAGCTAACAGAATGGAAGATGTTGAAAAATTATTTAATGAATTAGTCGCTAAGAGAATGGAGCTGAATACCATTGTCTACAATATATTTATCCGAGCAAATTGCCACAATGGAAATGTCGCTGCAGCTTTGCAACTTCGTGATGATATGAAAAGCAAGGGAATTTTACCAACTTGTGCCACATATTC
TTCTCTAATACATGGCATGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAGAGGAGGGATTGTCGCCAAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGTTGGGGCAAATGGATATTGCTGAAGCTACTTGGCTTGAGATGATCTCTTTTAACATACGACCTAACAAATTTACCTACACCGTCATGATTGATGGGTACTGTAAATTGGGGAATATGGAAGAAGCAAATAACCTTCTGGCCAAAATGAAGGAAAGTCGAATCGTTCCAGATGTTGTTACATACAATGCCTTGACTAATGGATTTTGCAAGGGAAAGAACATGGATAAAGCTTTTAAAGTATGCGATCAAATGGCCACTGGAGGATTATCTTTAGATGAAATTACTTACACTACTCTCGTACATGGTTGGAATCGACCTACAATTACTAGCCAAGACTGA

Coding sequence (CDS)

ATGCATTTGACTAGATTTAAAATCAATAAGACAGTTCCTGTATTGTTTCCCTTCTCTCGCCGGCTGGCCTGTGCGTTATCCACCCAACCGCATAAAGAACACCACCAGGACCCGCCATGGCAGCTCCAGGATCAGTTGCTCTTTTGGGTATCTTCTATTCTCTCTAATTCGTCTCTCGACTCTTCTAAATGTAGAGCCCTCTTGCCCCATTTGTCTCCTTTTCAGTTTGATCAGCTCTTCTTCTCCGTTGGATTGAAAGCCAACCCCAACACTTGTCTTAATTTCTTTTACTTTGCGTCTGATTCTTTCAAGTTTCGATTTACCATTCGTTCTTATTGTATATTGATTCTTTTGCTTGTTCATTCCAAGTTTTTACACCCCGCGAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTCGGATACGAATAAGCTTCACATTGAGATAGCCAATGCATTGTTTGGTTTAACCTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTGTTGATACATGTATACGGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTATTGATGTGTTTTATTTGTTTGCTCGTAAGGGAATCTTTCCATCGTTAAAGACTTGTAGTTTTTTATTGAGCTCTTTGGTAAAGGCTAATGAACTTGAGAAATGTTGTGAACTATTTGAAGTGATGTCCCAAGGTGTTCGTCCAGATGTTTTCTTATTTACGAATGTGATAAATGCTTTATGCAAGGGAGGGAAGATGGAAAAAGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGTATTTCTCCCAATGTTGTTACTTATAATATTATTATTCATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAGGAGAAGATGAAAATGGAAGGGGTAAAGCCAAGTCATATAACTTATAACGTGCTTATTAATGGTTTGACAAAACTGGGACACTTTGACAAAGTGAATCATGTTTTAGATGAAATGGTTGATGCGGGTTTTGTTCCGAATGCAGTTGACTACAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGATGAAGCGCTTTGGATTAAAGATTTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATACTCTCATGCAAGGATTCTGCGAGAGCAATCAAATCGAGCAAGCAGAGAATGCTCTTGAGGAGATATTATCACGTGGGCTATCTATAAACCCTGATACTTGTTATTCGGTTGTCCACTGGTTATGTAAGAAGTGTAGGGTACATTCTGCATTCCGATTCACTAAGGTGATGTTATCGAAGAACTTCAGGCCTAGAGATCAACTCTTAACCATATTGGTACGTGAGCTATGTAAGGACGGTAAACATTTAGAAGCAACGGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTATGGCAACCTCCAACGCTCTAATACATGGACTTTGTGGTGCTGGTAATTTGCCAGAGGCTGTTAGAATAGTCGAAGAGATGTTGGAGAGGGGTTTTCCAGTGGATGAGATCACATTCAACACACTCATCTTAGGTTGTTGCAGAGAGGGAAAAGTTGAGGAATGCTTTAGACTTAAAGAAGAGATGACCAAACAAGGAATTCAACCAGATATCTATACTTGCAATTTTCTATTGCGTGGACTATGCAGTGCAGGAAAATTGGACGATGCTATTAAGCTTTGGGATGAATACAAAGCTAGTGGGCTGGTTTCTAATGTTCACACTTACGGAGTAATGATGGATGGTTATTGTAAAGCTAACAGAATGGAAGATGTTGAAAAATTATTTAATGAATTAGTCGCTAAGAGAATGGAGCTGAATACCATTGTCTACAATATATTTATCCGAGCAAATTGCCACAATGGAAATGTCGCTGCAGCTTTGCAACTTCGTGATGATATGAAAAGCAAGGGAATTTTACCAACTTGTGCCACATATTC
TTCTCTAATACATGGCATGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAGAGGAGGGATTGTCGCCAAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGTTGGGGCAAATGGATATTGCTGAAGCTACTTGGCTTGAGATGATCTCTTTTAACATACGACCTAACAAATTTACCTACACCGTCATGATTGATGGGTACTGTAAATTGGGGAATATGGAAGAAGCAAATAACCTTCTGGCCAAAATGAAGGAAAGTCGAATCGTTCCAGATGTTGTTACATACAATGCCTTGACTAATGGATTTTGCAAGGGAAAGAACATGGATAAAGCTTTTAAAGTATGCGATCAAATGGCCACTGGAGGATTATCTTTAGATGAAATTACTTACACTACTCTCGTACATGGTTGGAATCGACCTACAATTACTAGCCAAGACTGA

Protein sequence

MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLDSSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEATWLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD
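The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. The sketch below assumes the standard genetic code (the page does not name a translation table), and `translate` is a hypothetical helper, not part of any tool referenced here. It is applied to the first 30 and last 12 nucleotides of the CDS shown above.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGCATTTGACTAGATTTAAAATCAATAAG"))
print(translate("AGCCAAGACTGA"))
```

The first call yields the opening residues of the protein (MHLTRFKINK) and the second yields its final residues followed by the stop codon (SQD*).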
Homology
BLAST of HG10002532 vs. NCBI nr
Match: XP_038884789.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida] >XP_038884795.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida] >XP_038884803.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida] >XP_038884810.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida] >XP_038884818.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa hispida])

HSP 1 Score: 1502.3 bits (3888), Expect = 0.0e+00
Identity = 730/822 (88.81%), Positives = 771/822 (93.80%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           M LTRF INKTVPV FPFSRRLAC LSTQPHKEHHQDPP  +Q+QL +WVSS+LSNSSLD
Sbjct: 1   MRLTRFNINKTVPVFFPFSRRLACLLSTQPHKEHHQDPPRHIQEQLHYWVSSVLSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYCILILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPITCLNFFYFASDSFKFRFTIRSYCILILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDG LPVLNSD+NKLHIEIANALFGLTSVVGRFEWTQ FD LIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANALFGLTSVVGRFEWTQLFDFLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQF+N G +CA+DVFYLFARKGIFPS+KTC+FLLSSLVKANELEKCCE FEVMSQGVR
Sbjct: 181 YSTQFKNFGLNCAVDVFYLFARKGIFPSIKTCNFLLSSLVKANELEKCCEGFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVFLFTN INALCKGGKMEKAIEL MKMEKLGISPNVVTYN IIHGLCQNGRLDNAFEL
Sbjct: 241 PDVFLFTNAINALCKGGKMEKAIELLMKMEKLGISPNVVTYNCIIHGLCQNGRLDNAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM MEGV+P+  TY  LINGLTKL +FDKVNHVLDEMVDAG  PN + YNNLIDGYCK
Sbjct: 301 KEKMTMEGVQPNLKTYGALINGLTKLKYFDKVNHVLDEMVDAGIDPNVIVYNNLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MGNI+EAL IKD+M+SKNI+PTSVTLYTLMQGFC+S+QIEQAENALEEILS GLSINPDT
Sbjct: 361 MGNINEALRIKDVMMSKNISPTSVTLYTLMQGFCKSDQIEQAENALEEILSNGLSINPDT 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSVVHWLCKK R +SAF+FTKVML+KNFRPRDQLLTILVR LC+DGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKSRYYSAFQFTKVMLAKNFRPRDQLLTILVRGLCEDGKHLEATELWFRLL 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS  TSNALIHGLCGAGNLPEAVRIV+EMLERG P+D +T+N LILG C+EGKVE
Sbjct: 481 EKGSPASTLTSNALIHGLCGAGNLPEAVRIVKEMLERGIPMDRMTYNALILGFCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
           E F+LKE+MTKQGIQPDIYTCNFLL GLC+AGKLDDAIKLWDE+KASGLVSNVHTYGVMM
Sbjct: 541 EGFKLKEKMTKQGIQPDIYTCNFLLHGLCNAGKLDDAIKLWDEFKASGLVSNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           D YCKANR+EDVEKLFNELV+K+ME NTIVYN+FIRANCHNGNVAAALQL DDMKSKGIL
Sbjct: 601 DVYCKANRIEDVEKLFNELVSKKMEPNTIVYNLFIRANCHNGNVAAALQLCDDMKSKGIL 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           P CATYSSLIHGMCNIGLVE+AKHLIDEMR+EGL PNVVCYTALIGGYCKLGQMD AEAT
Sbjct: 661 PNCATYSSLIHGMCNIGLVENAKHLIDEMRKEGLLPNVVCYTALIGGYCKLGQMDTAEAT 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           WLEMISFNI PNKFTYTVMIDGYCKLGNMEEANNLL+KMKES IVPDVVTYNALTNGFCK
Sbjct: 721 WLEMISFNIAPNKFTYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
           GK+MDKAFKVCDQMATGGLSLDEITYTTLVHGWNR TITSQD
Sbjct: 781 GKDMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRSTITSQD 822
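The percentages in each HSP header follow directly from the counts beside them (matching columns divided by alignment length). The sketch below re-derives them from a header line; the names `HSP_STATS`, `percent`, and `check_hsp_stats` are hypothetical, and the pattern is an assumption based only on the header layout used on this page.

```python
import re

# Matches the statistics line of an HSP header as laid out on this page.
# "Pos\w+" is deliberately loose because the label spelling can vary.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def percent(count, total):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * count / total, 2)

def check_hsp_stats(line):
    """Return True if the reported percentages agree with the counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found in line")
    id_n, id_total, id_pct, pos_n, pos_total, pos_pct = m.groups()
    ok_identity = abs(percent(int(id_n), int(id_total)) - float(id_pct)) < 0.01
    ok_positive = abs(percent(int(pos_n), int(pos_total)) - float(pos_pct)) < 0.01
    return ok_identity and ok_positive

line = "Identity = 730/822 (88.81%), Positives = 771/822 (93.80%), Query Frame = 0"
print(check_hsp_stats(line))
```

The same check can be run over the header of every hit in this section, since all of them use the same layout.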

BLAST of HG10002532 vs. NCBI nr
Match: XP_023552294.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023552295.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023552296.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 722/822 (87.83%), Positives = 762/822 (92.70%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKTVPV+FPFSR++AC LST+PHKEHHQDPPWQLQDQLL+ VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTVPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLID  LPVLNSD NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDRKLPVLNSDLNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNLGFS A+DVFYLFAR GIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGV 
Sbjct: 181 YSTQFRNLGFSYAVDVFYLFARNGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVS 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLGISPNVVTYN IIHGLCQNGRL +AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSIIHGLCQNGRLGDAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGVKPS ITY+VLINGLTKL  FDK N VL+EMVDAGFVPNAV YN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVDAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MG I+EAL I+D+M+SKNITPTSVTLYTL+QGFC++NQIEQAEN LEEILS+G  INP T
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTLYTLLQGFCKNNQIEQAENTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSV+HWLC K R H A RFT VMLSKNFRP DQLLTILV  LCKDGKHLEATELWFRLL
Sbjct: 421 CYSVIHWLCTKSRFHYALRFTMVMLSKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS ATSNALIHGLCGAG + EAVRI++EMLERGF +D IT+NTLILGCC+EGKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTLILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYTCN LL GLC+AGKLDDAIKLWDE+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNLLLYGLCNAGKLDDAIKLWDEFKASGLISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           D YCKANRMEDVEKLFNELV K+MELN+IVYNIFIRA+C NGNVAAALQLRDDMKSKGI 
Sbjct: 601 DVYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           PTCATYSSLIHGMCNIG VEDAKHLIDEMREEGL PNVVCYTALIGGYCKLGQMDIAEAT
Sbjct: 661 PTCATYSSLIHGMCNIGRVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           WLEM SFNIRPNK TYTVMIDGYCK+GNMEEANNLL+KMKES IVPDVVTYNALTNGFCK
Sbjct: 721 WLEMTSFNIRPNKITYTVMIDGYCKIGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
           GK+MDKAFK CD+MATGGLSLDEITYTTLVHGWN+PTITSQD
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVHGWNQPTITSQD 822

BLAST of HG10002532 vs. NCBI nr
Match: KAG6577115.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 722/822 (87.83%), Positives = 762/822 (92.70%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKTVPV+FPFSR++ C LST+PHKEHHQDPPWQLQDQLL+ VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTVPVVFPFSRQVVCVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNLGFS A+DVFYLFAR GIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLGFSYAVDVFYLFARNGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLGISPNVVTYN IIHGLCQNGRL +AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSIIHGLCQNGRLGDAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGVKPS ITY+VLINGLTKL  FDK N VL+EMVD GFVPNAV YN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVDVGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MG I+EAL I+D+M+SKNITPTSVTLYTL+QGFC+SNQIEQAEN LEEILS+G  INP T
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTLYTLLQGFCKSNQIEQAENTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSV+HWLC K R H A RFT VMLSKNFRP DQLLTILV  LCKDGKHLEATELWFRLL
Sbjct: 421 CYSVIHWLCTKSRFHYALRFTMVMLSKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS ATSNALIHGLCGAG + EAVRI++EMLERGF +D IT+NTLILGCC+EGKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTLILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYTCN LL GLC+AGKLDDAIKLWDE+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNLLLYGLCNAGKLDDAIKLWDEFKASGLISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           D YCKANRMEDVEKLF+ELV K+MELN+IVYNIFIRA+C NGNVAAALQLRDDMKSKGI 
Sbjct: 601 DVYCKANRMEDVEKLFDELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           PTCATYSSLIHGMCNIGLVEDAK LIDEMREEGL PNVVCYTALIGGYCKLGQMDIAEAT
Sbjct: 661 PTCATYSSLIHGMCNIGLVEDAKRLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           WLEM S NIRPNK TYTVMIDGYCKLGNMEEANNLL+KMKES IVPDVVTYNALTNGFCK
Sbjct: 721 WLEMTSLNIRPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
           GK+MDKAFK CD+MATGGLSLDEITYTTLVHGWN+PTITSQD
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVHGWNQPTITSQD 822

BLAST of HG10002532 vs. NCBI nr
Match: XP_022984601.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita maxima] >XP_022984602.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita maxima] >XP_022984603.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1481.1 bits (3833), Expect = 0.0e+00
Identity = 720/822 (87.59%), Positives = 762/822 (92.70%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKT+PV+FPFSR++AC LST+PHKEHHQDPPWQLQDQLL+ VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTLPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNL FS A+DVFYLFARKGIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLCFSYAVDVFYLFARKGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLGISPNVVTYN +IHGLCQNGRL +AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSVIHGLCQNGRLGDAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGVKPS ITY+VLINGLTKL  FDK N VL+EMVDAGFVPNAV YN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVDAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MG I+EAL I+D+M+SKNITPTSVT YTL+QGFC+SNQIEQA+N LEEILS+G  INP T
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTFYTLLQGFCKSNQIEQAQNTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSV+HWLC K R H A RFT VML KNFRP DQLLTILV  LCKDGKHLEATELWFRLL
Sbjct: 421 CYSVIHWLCTKFRFHYALRFTTVMLLKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS ATSNALIHGLCGAG + EAVRI++EMLERGF +D IT+NTLILGCC+EGKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTLILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYTCNFLL GLC+AGKLDDAIKLWDE+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNFLLYGLCNAGKLDDAIKLWDEFKASGLISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           D YCKANRMEDVEKLFNELV K+MELN+IVYNIFIRA+C NGNVAAALQLRDDMKSKGI 
Sbjct: 601 DAYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           PTCATYSSLIHGMCNIG VEDAKHLIDEMREEGL PNVVCYTALIGGYCKLGQMDIAEAT
Sbjct: 661 PTCATYSSLIHGMCNIGRVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           WLEM S NI PNK TYTVMIDGYCKLGNMEEANNLL+KMKES IVPDVVTYNALTNGFCK
Sbjct: 721 WLEMTSLNISPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
           GK+MDKAFK CD+MATGGLSLDEITYTTLVHGWN+PTITSQD
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVHGWNQPTITSQD 822

BLAST of HG10002532 vs. NCBI nr
Match: XP_022931380.1 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata] >XP_022931381.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata] >XP_022931382.1 pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1473.4 bits (3813), Expect = 0.0e+00
Identity = 719/822 (87.47%), Positives = 759/822 (92.34%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKTVPV+FPFSR++AC LST+PHKEHHQDPPWQLQDQLL+ VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTVPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFD LFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDHLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNLGFS A+DVFYLFAR GIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLGFSYAVDVFYLFARNGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLGISPNVVTYN IIHGLCQNGRL +AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSIIHGLCQNGRLGDAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGVKPS ITY+VLINGLTKL  FDK N VL+EMV AGFVPNAV YN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVGAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MG I+EAL I+D+M+SKNITPTSVTLYTL+QGFC+SNQIEQAEN LEEILS+G  INP T
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTLYTLLQGFCKSNQIEQAENTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSV+HWLC K R H A RFT VMLSKNFRP DQLLTILV  LCKDGKHLEATELWFRL 
Sbjct: 421 CYSVIHWLCTKSRFHYALRFTMVMLSKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLF 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS ATSNALIHGLCGAG + EAVRI++EMLERGF +D IT+NT ILGCC+EGKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTFILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYTCN LL GLC+AGKLDDAIKLW E+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNLLLYGLCNAGKLDDAIKLWGEFKASGLISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           D YCKANRMEDVEKLFNELV K+MELN+IVYNIFIRA+C NGNVAAALQLRDDMKSKGI 
Sbjct: 601 DVYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           PTCATYSSLIHGMCNIGLVEDAK LIDEMREEGL PNVVCYTALIGGYCKLGQMDIAEAT
Sbjct: 661 PTCATYSSLIHGMCNIGLVEDAKRLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           +LEM S NIRPNK TYTVMIDGYCKLGNMEEANNLL+KMKES IVPDVVTYNALTNGFCK
Sbjct: 721 FLEMTSLNIRPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
           GK+MDKAFK CD+MATGGLSLDEITYTTLVHGWN+PTITSQD
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVHGWNQPTITSQD 822

BLAST of HG10002532 vs. ExPASy Swiss-Prot
Match: Q940A6 (Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g19440 PE=2 SV=2)

HSP 1 Score: 729.6 bits (1882), Expect = 4.0e-209
Identity = 376/787 (47.78%), Positives = 514/787 (65.31%), Query Frame = 0

Query: 29  QPHKEHHQDPPWQLQDQLLFWVSSILSNSSLDSSKCRALLPHLSPFQFDQLFFSVGLKAN 88
           +P K         L ++L    SS+LS  SLD  +C+ L+  LSP +FD+LF     K N
Sbjct: 63  RPDKSEETSSDRHLHERL----SSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVN 122

Query: 89  PNTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLHPARLLLIRLIDGNLPVLNSDTN 148
           P T L+FF  ASDSF F F++RSYC+LI LL+ +  L  AR++LIRLI+GN+PVL     
Sbjct: 123 PKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR 182

Query: 149 KLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSCAIDVFYLFARKGIFPS 208
              + IA+A+  L+         +  DLLI VY TQF+  G   A+DVF + A KG+FPS
Sbjct: 183 DSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPS 242

Query: 209 LKTCSFLLSSLVKANELEKCCELFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMK 268
             TC+ LL+SLV+ANE +KCCE F+V+ +GV PDV+LFT  INA CKGGK+E+A++LF K
Sbjct: 243 KTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSK 302

Query: 269 MEKLGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHITYNVLINGLTKLGH 328
           ME+ G++PNVVT+N +I GL   GR D AF  KEKM   G++P+ ITY++L+ GLT+   
Sbjct: 303 MEEAGVAPNVVTFNTVIDGLGMCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKR 362

Query: 329 FDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEALWIKDLMISKNITPTSVTLYT 388
                 VL EM   GF PN + YNNLID + + G++++A+ IKDLM+SK ++ TS T  T
Sbjct: 363 IGDAYFVLKEMTKKGFPPNVIVYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNT 422

Query: 389 LMQGFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWLCKKCRVHSAFRFTKVMLSKN 448
           L++G+C++ Q + AE  L+E+LS G ++N  +  SV+  LC      SA RF   ML +N
Sbjct: 423 LIKGYCKNGQADNAERLLKEMLSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRN 482

Query: 449 FRPRDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASMATSNALIHGLCGAGNLPEAV 508
             P   LLT L+  LCK GKH +A ELWF+ L KG      TSNAL+HGLC AG L EA 
Sbjct: 483 MSPGGGLLTTLISGLCKHGKHSKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAF 542

Query: 509 RIVEEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEEMTKQGIQPDIYTCNFLLRGL 568
           RI +E+L RG  +D +++NTLI GCC + K++E F   +EM K+G++PD YT + L+ GL
Sbjct: 543 RIQKEILGRGCVMDRVSYNTLISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGL 602

Query: 569 CSAGKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANRMEDVEKLFNELVAKRMELNT 628
            +  K+++AI+ WD+ K +G++ +V+TY VM+DG CKA R E+ ++ F+E+++K ++ NT
Sbjct: 603 FNMNKVEEAIQFWDDCKRNGMLPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNT 662

Query: 629 IVYNIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSSLIHGMCNIGLVEDAKHLIDE 688
           +VYN  IRA C +G ++ AL+LR+DMK KGI P  ATY+SLI GM  I  VE+AK L +E
Sbjct: 663 VVYNHLIRAYCRSGRLSMALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEE 722

Query: 689 MREEGLSPNVVCYTALIGGYCKLGQMDIAEATWLEMISFNIRPNKFTYTVMIDGYCKLGN 748
           MR EGL PNV  YTALI GY KLGQM   E    EM S N+ PNK TYTVMI GY + GN
Sbjct: 723 MRMEGLEPNVFHYTALIDGYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGN 782

Query: 749 MEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKAFKVCDQMATGGLSLDEITYTT 808
           + EA+ LL +M+E  IVPD +TY     G+ K   + +AFK            DE  Y  
Sbjct: 783 VTEASRLLNEMREKGIVPDSITYKEFIYGYLKQGGVLEAFK----------GSDEENYAA 835

Query: 809 LVHGWNR 816
           ++ GWN+
Sbjct: 843 IIEGWNK 835

BLAST of HG10002532 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 344.4 bits (882), Expect = 3.7e-93
Identity = 224/745 (30.07%), Positives = 347/745 (46.58%), Query Frame = 0

Query: 109 IRSYCILILLLVHSKFLHPARLLL--IRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVG 168
           ++  CI   +LV ++   PAR +L  + L+ G                ++ +FG      
Sbjct: 72  VQLVCITTHILVRARMYDPARHILKELSLMSGK---------------SSFVFGALMTTY 131

Query: 169 RF--EWTQAFDLLIHVYGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANE 228
           R        +D+LI VY    R      ++++F L    G  PS+ TC+ +L S+VK+ E
Sbjct: 132 RLCNSNPSVYDILIRVY---LREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGE 191

Query: 229 -LEKCCELFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNI 288
            +     L E++ + + PDV  F  +IN LC  G  EK+  L  KMEK G +P +VTYN 
Sbjct: 192 DVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNT 251

Query: 289 IIHGLCQNGRLDNAFELKEKMKMEGV---------------------------------- 348
           ++H  C+ GR   A EL + MK +GV                                  
Sbjct: 252 VLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRM 311

Query: 349 -KPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEAL 408
             P+ +TYN LING +  G     + +L+EM+  G  PN V +N LIDG+   GN  EAL
Sbjct: 312 IHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEAL 371

Query: 409 WIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWL 468
            +  +M +K +TP+ V+   L+ G C++ + + A      +   G+ +   T   ++  L
Sbjct: 372 KMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGL 431

Query: 469 CKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASM 528
           CK   +  A      M      P     + L+   CK G+   A E+  R+   G   + 
Sbjct: 432 CKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNG 491

Query: 529 ATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEE 588
              + LI+  C  G L EA+RI E M+  G   D  TFN L+   C+ GKV E       
Sbjct: 492 IIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRC 551

Query: 589 MTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANR 648
           MT  GI P+  + + L+ G  ++G+   A  ++DE    G      TYG ++ G CK   
Sbjct: 552 MTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGH 611

Query: 649 MEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSS 708
           + + EK    L A    ++T++YN  + A C +GN+A A+ L  +M  + ILP   TY+S
Sbjct: 612 LREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTS 671

Query: 709 LIHGMCNIGLVEDAKHLIDEMREEG-LSPNVVCYTALIGGYCKLGQMDIAEATWLEMISF 768
           LI G+C  G    A     E    G + PN V YT  + G  K GQ         +M + 
Sbjct: 672 LISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNL 731

Query: 769 NIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKA 813
              P+  T   MIDGY ++G +E+ N+LL +M      P++ TYN L +G+ K K++  +
Sbjct: 732 GHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTS 791

BLAST of HG10002532 vs. ExPASy Swiss-Prot
Match: Q9FJE6 (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana OX=3702 GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 338.6 bits (867), Expect = 2.0e-91
Identity = 229/802 (28.55%), Positives = 367/802 (45.76%), Query Frame = 0

Query: 83  VGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLHPARLLLIRLIDGNLPV 142
           +G   +P   L FF F      F  +  S+CILI  LV +    PA  LL  L+   L  
Sbjct: 78  IGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLL---LRA 137

Query: 143 LNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSCAIDVFYLFAR 202
           L         ++ N LF       +   + +FDLLI  Y    R L     + VF +   
Sbjct: 138 LKPS------DVFNVLFSCYEKC-KLSSSSSFDLLIQHYVRSRRVLD---GVLVFKMMIT 197

Query: 203 K-GIFPSLKTCSFLLSSLVKANELEKCCELF-EVMSQGVRPDVFLFTNVINALCKGGKME 262
           K  + P ++T S LL  LVK        ELF +++S G+RPDV+++T VI +LC+   + 
Sbjct: 198 KVSLLPEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLS 257

Query: 263 KAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHITYNVLI 322
           +A E+   ME  G   N+V YN++I GLC+  ++  A  +K+ +  + +KP  +TY  L+
Sbjct: 258 RAKEMIAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLV 317

Query: 323 NGLTKLGHFDKVNHVLDEM-----------------------------------VDAGFV 382
            GL K+  F+    ++DEM                                   VD G  
Sbjct: 318 YGLCKVQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVS 377

Query: 383 PNAVDYNNLIDGYCKMGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENA 442
           PN   YN LID  CK     EA  + D M    + P  VT   L+  FC   +++ A + 
Sbjct: 378 PNLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSF 437

Query: 443 LEEILSRGLSINPDTCYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCK 502
           L E++  GL ++     S+++  CK   + +A  F   M++K   P     T L+   C 
Sbjct: 438 LGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCS 497

Query: 503 DGKHLEATELWFRLLEKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEIT 562
            GK  +A  L+  +  KG   S+ T   L+ GL  AG + +AV++  EM E     + +T
Sbjct: 498 KGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVT 557

Query: 563 FNTLILGCCREGKVEECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYK 622
           +N +I G C EG + + F   +EMT++GI PD Y+   L+ GLC  G+  +A    D   
Sbjct: 558 YNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLH 617

Query: 623 ASGLVSNVHTYGVMMDGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVA 682
                 N   Y  ++ G+C+  ++E+   +  E+V + ++L+ + Y + I  +  + +  
Sbjct: 618 KGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRK 677

Query: 683 AALQLRDDMKSKGILPTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALI 742
               L  +M  +G+ P    Y+S+I      G  ++A  + D M  EG  PN V YTA+I
Sbjct: 678 LFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVI 737

Query: 743 GGYCKLGQMDIAEATWLEMISFNIRPNKF------------------------------- 802
            G CK G ++ AE    +M   +  PN+                                
Sbjct: 738 NGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLL 797

Query: 803 ----TYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKAFKV 813
               TY ++I G+C+ G +EEA+ L+ +M    + PD +TY  + N  C+  ++ KA ++
Sbjct: 798 ANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIEL 857

BLAST of HG10002532 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 1.5e-83
Identity = 229/830 (27.59%), Positives = 399/830 (48.07%), Query Frame = 0

Query: 5   RFKINKTVPVLFPFSRRLACALS-TQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLDSS- 64
           +F  + TVP   P +RR  C++S    +    +     +  +LL    SILS  +   S 
Sbjct: 26  KFSTDVTVP--SPVTRRQFCSVSPLLRNLPEEESDSMSVPHRLL----SILSKPNWHKSP 85

Query: 65  KCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLVHS 124
             ++++  +SP     LF    L  +P T LNF ++ S + +++ ++ SY  L+ LL+++
Sbjct: 86  SLKSMVSAISPSHVSSLF---SLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINN 145

Query: 125 KF---LHPARLLLIRLIDG---NLPVL------NSDTN-----KLHIEIANAL------F 184
            +   +   RLL+I+  D     L VL      N D       KL I   N L      F
Sbjct: 146 GYVGVVFKIRLLMIKSCDSVGDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLARF 205

Query: 185 GLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSC-------AIDVFYLFARKGIFPSLKTC 244
           GL   + +       D +     T  + +   C       A          G+ P   T 
Sbjct: 206 GLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTY 265

Query: 245 SFLLSSLVKANELEKCCELF-EVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEK 304
           + L+    +  +L+   ++F E+  +G R +   +T++I+ LC   ++++A++LF+KM+ 
Sbjct: 266 TSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKD 325

Query: 305 LGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHITYNVLINGLTKLGHFDK 364
               P V TY ++I  LC + R   A  L ++M+  G+KP+  TY VLI+ L     F+K
Sbjct: 326 DECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEK 385

Query: 365 VNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEALWIKDLMISKNITPTSVTLYTLMQ 424
              +L +M++ G +PN + YN LI+GYCK G I++A+ + +LM S+ ++P + T   L++
Sbjct: 386 ARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIK 445

Query: 425 GFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWLCKKCRVHSAFRFTKVMLSKNFRP 484
           G+C+SN + +A   L ++L R +  +  T  S++   C+     SA+R   +M  +   P
Sbjct: 446 GYCKSN-VHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVP 505

Query: 485 RDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASMATSNALIHGLCGAGNLPEAVRIV 544
                T ++  LCK  +  EA +L+  L +KG   ++    ALI G C AG + EA  ++
Sbjct: 506 DQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLML 565

Query: 545 EEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEEMTKQGIQPDIYTCNFLLRGLCSA 604
           E+ML +    + +TFN LI G C +GK++E   L+E+M K G+QP + T   L+  L   
Sbjct: 566 EKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKD 625

Query: 605 GKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANRMEDVEKLFNELVAKRMELNTIVY 664
           G  D A   + +  +SG   + HTY   +  YC+  R+ D E +  ++    +  +   Y
Sbjct: 626 GDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTY 685

Query: 665 NIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSSLIHGMCNIGLVEDAKHLIDEM-- 724
           +  I+     G    A  +   M+  G  P+  T+ SLI            KHL++    
Sbjct: 686 SSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLI------------KHLLEMKYG 745

Query: 725 REEGLSPNVVCYTALIGGYCKLGQMDIAEATWLEMISFNIRPNKFTYTVMIDGYCKLGNM 784
           +++G  P +   + ++       + D       +M+  ++ PN  +Y  +I G C++GN+
Sbjct: 746 KQKGSEPELCAMSNMM-------EFDTVVELLEKMVEHSVTPNAKSYEKLILGICEVGNL 805

Query: 785 EEANNLLAKMKESR-IVPDVVTYNALTNGFCKGKNMDKAFKVCDQMATGG 799
             A  +   M+ +  I P  + +NAL +  CK K  ++A KV D M   G
Sbjct: 806 RVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDMICVG 826

BLAST of HG10002532 vs. ExPASy Swiss-Prot
Match: Q9M907 (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX=3702 GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 303.5 bits (776), Expect = 7.2e-81
Identity = 184/675 (27.26%), Positives = 324/675 (48.00%), Query Frame = 0

Query: 157 ALFGLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSCAIDVFYL----FARKG-------I 216
           A    T+++G F      D+++ ++  Q + LG+   + +F      FA++G       +
Sbjct: 167 AFSAYTTLIGAFSAVNHSDMMLTLF-QQMQELGYEPTVHLFTTLIRGFAKEGRVDSALSL 226

Query: 217 FPSLKTCSF---------LLSSLVKANELEKCCELF-EVMSQGVRPDVFLFTNVINALCK 276
              +K+ S           + S  K  +++   + F E+ + G++PD   +T++I  LCK
Sbjct: 227 LDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAWKFFHEIEANGLKPDEVTYTSMIGVLCK 286

Query: 277 GGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHIT 336
             ++++A+E+F  +EK    P    YN +I G    G+ D A+ L E+ + +G  PS I 
Sbjct: 287 ANRLDEAVEMFEHLEKNRRVPCTYAYNTMIMGYGSAGKFDEAYSLLERQRAKGSIPSVIA 346

Query: 337 YNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEALWIKDLMI 396
           YN ++  L K+G  D+   V +EM      PN   YN LID  C+ G +D A  ++D M 
Sbjct: 347 YNCILTCLRKMGKVDEALKVFEEM-KKDAAPNLSTYNILIDMLCRAGKLDTAFELRDSMQ 406

Query: 397 SKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWLCKKCRVH 456
              + P   T+  ++   C+S ++++A    EE+  +  + +  T  S++  L K  RV 
Sbjct: 407 KAGLFPNVRTVNIMVDRLCKSQKLDEACAMFEEMDYKVCTPDEITFCSLIDGLGKVGRVD 466

Query: 457 SAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASMATSNALI 516
            A++  + ML  + R    + T L++     G+  +  +++  ++ +     +   N  +
Sbjct: 467 DAYKVYEKMLDSDCRTNSIVYTSLIKNFFNHGRKEDGHKIYKDMINQNCSPDLQLLNTYM 526

Query: 517 HGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEEMTKQGIQ 576
             +  AG   +   + EE+  R F  D  +++ LI G  + G   E + L   M +QG  
Sbjct: 527 DCMFKAGEPEKGRAMFEEIKARRFVPDARSYSILIHGLIKAGFANETYELFYSMKEQGCV 586

Query: 577 PDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANRMEDVEKL 636
            D    N ++ G C  GK++ A +L +E K  G    V TYG ++DG  K +R+++   L
Sbjct: 587 LDTRAYNIVIDGFCKCGKVNKAYQLLEEMKTKGFEPTVVTYGSVIDGLAKIDRLDEAYML 646

Query: 637 FNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSSLIHGMCN 696
           F E  +KR+ELN ++Y+  I      G +  A  + +++  KG+ P   T++SL+  +  
Sbjct: 647 FEEAKSKRIELNVVIYSSLIDGFGKVGRIDEAYLILEELMQKGLTPNLYTWNSLLDALVK 706

Query: 697 IGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEATWLEMISFNIRPNKFT 756
              + +A      M+E   +PN V Y  LI G CK+ + + A   W EM    ++P+  +
Sbjct: 707 AEEINEALVCFQSMKELKCTPNQVTYGILINGLCKVRKFNKAFVFWQEMQKQGMKPSTIS 766

Query: 757 YTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKAFKVCDQMA 811
           YT MI G  K GN+ EA  L  + K +  VPD   YNA+  G   G     AF + ++  
Sbjct: 767 YTTMISGLAKAGNIAEAGALFDRFKANGGVPDSACYNAMIEGLSNGNRAMDAFSLFEETR 826

BLAST of HG10002532 vs. ExPASy TrEMBL
Match: A0A6J1J2L6 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111482844 PE=4 SV=1)

HSP 1 Score: 1481.1 bits (3833), Expect = 0.0e+00
Identity = 720/822 (87.59%), Positives = 762/822 (92.70%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKT+PV+FPFSR++AC LST+PHKEHHQDPPWQLQDQLL+ VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTLPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFDQLFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNL FS A+DVFYLFARKGIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLCFSYAVDVFYLFARKGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLGISPNVVTYN +IHGLCQNGRL +AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSVIHGLCQNGRLGDAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGVKPS ITY+VLINGLTKL  FDK N VL+EMVDAGFVPNAV YN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVDAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MG I+EAL I+D+M+SKNITPTSVT YTL+QGFC+SNQIEQA+N LEEILS+G  INP T
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTFYTLLQGFCKSNQIEQAQNTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSV+HWLC K R H A RFT VML KNFRP DQLLTILV  LCKDGKHLEATELWFRLL
Sbjct: 421 CYSVIHWLCTKFRFHYALRFTTVMLLKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS ATSNALIHGLCGAG + EAVRI++EMLERGF +D IT+NTLILGCC+EGKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTLILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYTCNFLL GLC+AGKLDDAIKLWDE+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNFLLYGLCNAGKLDDAIKLWDEFKASGLISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           D YCKANRMEDVEKLFNELV K+MELN+IVYNIFIRA+C NGNVAAALQLRDDMKSKGI 
Sbjct: 601 DAYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           PTCATYSSLIHGMCNIG VEDAKHLIDEMREEGL PNVVCYTALIGGYCKLGQMDIAEAT
Sbjct: 661 PTCATYSSLIHGMCNIGRVEDAKHLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           WLEM S NI PNK TYTVMIDGYCKLGNMEEANNLL+KMKES IVPDVVTYNALTNGFCK
Sbjct: 721 WLEMTSLNISPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
           GK+MDKAFK CD+MATGGLSLDEITYTTLVHGWN+PTITSQD
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVHGWNQPTITSQD 822

BLAST of HG10002532 vs. ExPASy TrEMBL
Match: A0A6J1ETG9 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111437580 PE=4 SV=1)

HSP 1 Score: 1473.4 bits (3813), Expect = 0.0e+00
Identity = 719/822 (87.47%), Positives = 759/822 (92.34%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKTVPV+FPFSR++AC LST+PHKEHHQDPPWQLQDQLL+ VSSILSNSSLD
Sbjct: 1   MHLTRFKINKTVPVVFPFSRQVACVLSTEPHKEHHQDPPWQLQDQLLYSVSSILSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKCRALLPHLSPFQFD LFFSVGLKANP TCLNFFYFASDSFKFRFTIRSYC+LILLLV
Sbjct: 61  SSKCRALLPHLSPFQFDHLFFSVGLKANPKTCLNFFYFASDSFKFRFTIRSYCLLILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDG LPVLNSD+NKLHIEIAN LFGLTSVVGRFE T AFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGKLPVLNSDSNKLHIEIANELFGLTSVVGRFECTHAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNLGFS A+DVFYLFAR GIFPSLKTC+FLLSSLVKANELEKCCE+FEVMSQGVR
Sbjct: 181 YSTQFRNLGFSYAVDVFYLFARNGIFPSLKTCNFLLSSLVKANELEKCCEVFEVMSQGVR 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVFLFTNVINALCKGGKME A+EL M MEKLGISPNVVTYN IIHGLCQNGRL +AFEL
Sbjct: 241 PDVFLFTNVINALCKGGKMENAMELLMNMEKLGISPNVVTYNSIIHGLCQNGRLGDAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGVKPS ITY+VLINGLTKL  FDK N VL+EMV AGFVPNAV YN LIDGYCK
Sbjct: 301 KEKMTIEGVKPSLITYSVLINGLTKLEKFDKANDVLNEMVGAGFVPNAVVYNTLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MG I+EAL I+D+M+SKNITPTSVTLYTL+QGFC+SNQIEQAEN LEEILS+G  INP T
Sbjct: 361 MGKINEALKIRDVMVSKNITPTSVTLYTLLQGFCKSNQIEQAENTLEEILSQGFPINPVT 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSV+HWLC K R H A RFT VMLSKNFRP DQLLTILV  LCKDGKHLEATELWFRL 
Sbjct: 421 CYSVIHWLCTKSRFHYALRFTMVMLSKNFRPSDQLLTILVCGLCKDGKHLEATELWFRLF 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS ATSNALIHGLCGAG + EAVRI++EMLERGF +D IT+NT ILGCC+EGKVE
Sbjct: 481 EKGSPASTATSNALIHGLCGAGKMAEAVRILKEMLERGFSLDRITYNTFILGCCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
           ECFRLKEEMT QGIQPDIYTCN LL GLC+AGKLDDAIKLW E+KASGL+SNVHTYGVMM
Sbjct: 541 ECFRLKEEMTNQGIQPDIYTCNLLLYGLCNAGKLDDAIKLWGEFKASGLISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           D YCKANRMEDVEKLFNELV K+MELN+IVYNIFIRA+C NGNVAAALQLRDDMKSKGI 
Sbjct: 601 DVYCKANRMEDVEKLFNELVTKKMELNSIVYNIFIRAHCRNGNVAAALQLRDDMKSKGIF 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           PTCATYSSLIHGMCNIGLVEDAK LIDEMREEGL PNVVCYTALIGGYCKLGQMDIAEAT
Sbjct: 661 PTCATYSSLIHGMCNIGLVEDAKRLIDEMREEGLLPNVVCYTALIGGYCKLGQMDIAEAT 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           +LEM S NIRPNK TYTVMIDGYCKLGNMEEANNLL+KMKES IVPDVVTYNALTNGFCK
Sbjct: 721 FLEMTSLNIRPNKITYTVMIDGYCKLGNMEEANNLLSKMKESGIVPDVVTYNALTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
           GK+MDKAFK CD+MATGGLSLDEITYTTLVHGWN+PTITSQD
Sbjct: 781 GKDMDKAFKTCDEMATGGLSLDEITYTTLVHGWNQPTITSQD 822

BLAST of HG10002532 vs. ExPASy TrEMBL
Match: A0A1S4DYY2 (pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493057 PE=4 SV=1)

HSP 1 Score: 1456.4 bits (3769), Expect = 0.0e+00
Identity = 706/822 (85.89%), Positives = 751/822 (91.36%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKT+PVLFPFSRRLAC  STQPHKEHHQDPPWQ QDQL  WVSS+LSNSSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKC ALLPHLSPFQFDQLFFS+GLKANP TCLNFFYFASDSFKFRFTI SYCILILLLV
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDGNLPVLNSD  K HIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNLGF CAIDVFYL ARKG FPSLKTC+FLLSSLVKANE EKCCE+F+VMS+GV 
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVF FTNVINALCKGGKMEKA ELFMKMEKLGISPNVVTYN II+GLCQNGRLD+AFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEKLGISPNVVTYNCIINGLCQNGRLDHAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGV+P+  TY  L+NGL KL  FDKVNH+LDEM+ AGF PN V +NNLIDGYCK
Sbjct: 301 KEKMTIEGVQPNLKTYGALVNGLIKLKCFDKVNHILDEMIGAGFYPNVVVFNNLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MGNI EAL IKD+MISKNITPTSVTLYTL+QGFC+S+QIEQAENALEEILS GLSI+PD 
Sbjct: 361 MGNIKEALRIKDVMISKNITPTSVTLYTLLQGFCKSDQIEQAENALEEILSNGLSIHPDK 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSVVHWLCKK R HSAFRFTK+MLS+NFRP D LLTILV  LCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDPLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS  TSNALIHGLC AGNLPEA RIV+EMLERG P+D IT+N LILG C+EGKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCEAGNLPEASRIVKEMLERGLPLDRITYNALILGFCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
            CFRLKEEMTK+GIQPDIYT NFLLRGLC+AGKLDDAIKLWDE+KASG +SNVHTYGVMM
Sbjct: 541 GCFRLKEEMTKRGIQPDIYTYNFLLRGLCNAGKLDDAIKLWDEFKASGPISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           DGYCKANR+EDVE LFNEL++K+MELN+IVYNI IRA+C NGNVAAALQLR++MKSKGIL
Sbjct: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIRAHCQNGNVAAALQLRENMKSKGIL 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           P CATYSSLIHGMC+IGLVEDAKHLIDEMR+EG  PNVVCYTALIGGYCKLGQMD AE+T
Sbjct: 661 PNCATYSSLIHGMCDIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           WLEMISFNI PNKFTYTVMIDGYCKLGNME+A NLL KMKES IVPDVVTYN LTNGFCK
Sbjct: 721 WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKAYNLLTKMKESGIVPDVVTYNVLTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
             +MD AFKVCDQMAT GLS+DEITYTTLVHGWNRPTIT QD
Sbjct: 781 ANDMDNAFKVCDQMATEGLSVDEITYTTLVHGWNRPTITGQD 822

BLAST of HG10002532 vs. ExPASy TrEMBL
Match: A0A5D3CYQ1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G002960 PE=4 SV=1)

HSP 1 Score: 1456.4 bits (3769), Expect = 0.0e+00
Identity = 706/822 (85.89%), Positives = 751/822 (91.36%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKT+PVLFPFSRRLAC  STQPHKEHHQDPPWQ QDQL  WVSS+LSNSSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKC ALLPHLSPFQFDQLFFS+GLKANP TCLNFFYFASDSFKFRFTI SYCILILLLV
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDGNLPVLNSD  K HIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNLGF CAIDVFYL ARKG FPSLKTC+FLLSSLVKANE EKCCE+F+VMS+GV 
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVF FTNVINALCKGGKMEKA ELFMKMEKLGISPNVVTYN II+GLCQNGRLD+AFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEKLGISPNVVTYNCIINGLCQNGRLDHAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGV+P+  TY  L+NGL KL  FDKVNH+LDEM+ AGF PN V +NNLIDGYCK
Sbjct: 301 KEKMTIEGVQPNLKTYGALVNGLIKLKCFDKVNHILDEMIGAGFYPNVVVFNNLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MGNI EAL IKD+MISKNITPTSVTLYTL+QGFC+S+QIEQAENALEEILS GLSI+PD 
Sbjct: 361 MGNIKEALRIKDVMISKNITPTSVTLYTLLQGFCKSDQIEQAENALEEILSNGLSIHPDK 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSVVHWLCKK R HSAFRFTK+MLS+NFRP D LLTILV  LCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDPLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS  TSNALIHGLC AGNLPEA RIV+EMLERG P+D IT+N LILG C+EGKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCEAGNLPEASRIVKEMLERGLPLDRITYNALILGFCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
            CFRLKEEMTK+GIQPDIYT NFLLRGLC+AGKLDDAIKLWDE+KASG +SNVHTYGVMM
Sbjct: 541 GCFRLKEEMTKRGIQPDIYTYNFLLRGLCNAGKLDDAIKLWDEFKASGPISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           DGYCKANR+EDVE LFNEL++K+MELN+IVYNI IRA+C NGNVAAALQLR++MKSKGIL
Sbjct: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIRAHCQNGNVAAALQLRENMKSKGIL 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           P CATYSSLIHGMC+IGLVEDAKHLIDEMR+EG  PNVVCYTALIGGYCKLGQMD AE+T
Sbjct: 661 PNCATYSSLIHGMCDIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           WLEMISFNI PNKFTYTVMIDGYCKLGNME+A NLL KMKES IVPDVVTYN LTNGFCK
Sbjct: 721 WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKAYNLLTKMKESGIVPDVVTYNVLTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
             +MD AFKVCDQMAT GLS+DEITYTTLVHGWNRPTIT QD
Sbjct: 781 ANDMDNAFKVCDQMATEGLSVDEITYTTLVHGWNRPTITGQD 822

BLAST of HG10002532 vs. ExPASy TrEMBL
Match: A0A5A7TTX4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G002890 PE=4 SV=1)

HSP 1 Score: 1451.8 bits (3757), Expect = 0.0e+00
Identity = 705/822 (85.77%), Positives = 750/822 (91.24%), Query Frame = 0

Query: 1   MHLTRFKINKTVPVLFPFSRRLACALSTQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLD 60
           MHLTRFKINKT+PVLFPFSRRLAC  STQPHKEHHQDPPWQ QDQL  WVSS+LSNSSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLV 120
           SSKC ALLPHLSPFQFDQLFFS+GLKANP TCLNFFYFASDSFKFRFTI SYCILILLLV
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 HSKFLHPARLLLIRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           HSKFL PARLLLIRLIDGNLPVLNSD  K HIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANELEKCCELFEVMSQGVR 240
           Y TQFRNLGF CAIDVFYL ARKG FPSLKTC+FLLSSLVKANE EKCCE+F+VMS+GV 
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFEL 300
           PDVF FTNVINALCKGGKMEKA ELFMKMEKLGISPNVVTYN II+GLCQNGRLD+AFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEKLGISPNVVTYNCIINGLCQNGRLDHAFEL 300

Query: 301 KEKMKMEGVKPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCK 360
           KEKM +EGV+P+  TY  L+NGL KL  FDKVNH+LDEM+ AGF PN V +NNLIDGYCK
Sbjct: 301 KEKMTIEGVQPNLKTYGALVNGLIKLKCFDKVNHILDEMIGAGFYPNVVVFNNLIDGYCK 360

Query: 361 MGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDT 420
           MGNI EAL IKD+MISKNITPTSVTLYTL+QGFC+S+QIEQAENALEEILS GLSI+PD 
Sbjct: 361 MGNIKEALRIKDVMISKNITPTSVTLYTLLQGFCKSDQIEQAENALEEILSNGLSIHPDK 420

Query: 421 CYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLL 480
           CYSVVHWLCKK R HSAFRFTK+MLS+NFRP D LLTILV  LCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDPLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVE 540
           EKGSPAS  TSNALIHGLC AGNLPEA RIV+EMLERG P+D IT+N LILG C+EGKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCEAGNLPEASRIVKEMLERGLPLDRITYNALILGFCKEGKVE 540

Query: 541 ECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMM 600
            CFRLKEEMTK+GIQPDIYT NFLLRGLC+AGKLDDAIKLWDE+KASG +SNVHTYGVMM
Sbjct: 541 GCFRLKEEMTKRGIQPDIYTYNFLLRGLCNAGKLDDAIKLWDEFKASGPISNVHTYGVMM 600

Query: 601 DGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGIL 660
           DGYCKANR+EDVE LFNEL++K+MELN+IVYNI IRA+C NGNVAAALQLR++MKSKGIL
Sbjct: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIRAHCQNGNVAAALQLRENMKSKGIL 660

Query: 661 PTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALIGGYCKLGQMDIAEAT 720
           P CATYSSLIHGMC+IGLVEDAKHLIDEMR+EG  PNVVCYTALIGGYCKLGQMD AE+T
Sbjct: 661 PNCATYSSLIHGMCDIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720

Query: 721 WLEMISFNIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCK 780
           WLEMISFNI PNKFTYTVMIDGY KLGNME+A NLL KMKES IVPDVVTYN LTNGFCK
Sbjct: 721 WLEMISFNIHPNKFTYTVMIDGYGKLGNMEKAYNLLTKMKESGIVPDVVTYNVLTNGFCK 780

Query: 781 GKNMDKAFKVCDQMATGGLSLDEITYTTLVHGWNRPTITSQD 823
             +MD AFKVCDQMAT GLS+DEITYTTLVHGWNRPTIT QD
Sbjct: 781 ANDMDNAFKVCDQMATEGLSVDEITYTTLVHGWNRPTITGQD 822

BLAST of HG10002532 vs. TAIR 10
Match: AT4G19440.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 729.6 bits (1882), Expect = 2.9e-210
Identity = 376/787 (47.78%), Positives = 514/787 (65.31%), Query Frame = 0

Query: 29  QPHKEHHQDPPWQLQDQLLFWVSSILSNSSLDSSKCRALLPHLSPFQFDQLFFSVGLKAN 88
           +P K         L ++L    SS+LS  SLD  +C+ L+  LSP +FD+LF     K N
Sbjct: 50  RPDKSEETSSDRHLHERL----SSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVN 109

Query: 89  PNTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLHPARLLLIRLIDGNLPVLNSDTN 148
           P T L+FF  ASDSF F F++RSYC+LI LL+ +  L  AR++LIRLI+GN+PVL     
Sbjct: 110 PKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR 169

Query: 149 KLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSCAIDVFYLFARKGIFPS 208
              + IA+A+  L+         +  DLLI VY TQF+  G   A+DVF + A KG+FPS
Sbjct: 170 DSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPS 229

Query: 209 LKTCSFLLSSLVKANELEKCCELFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMK 268
             TC+ LL+SLV+ANE +KCCE F+V+ +GV PDV+LFT  INA CKGGK+E+A++LF K
Sbjct: 230 KTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSK 289

Query: 269 MEKLGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHITYNVLINGLTKLGH 328
           ME+ G++PNVVT+N +I GL   GR D AF  KEKM   G++P+ ITY++L+ GLT+   
Sbjct: 290 MEEAGVAPNVVTFNTVIDGLGMCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKR 349

Query: 329 FDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEALWIKDLMISKNITPTSVTLYT 388
                 VL EM   GF PN + YNNLID + + G++++A+ IKDLM+SK ++ TS T  T
Sbjct: 350 IGDAYFVLKEMTKKGFPPNVIVYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNT 409

Query: 389 LMQGFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWLCKKCRVHSAFRFTKVMLSKN 448
           L++G+C++ Q + AE  L+E+LS G ++N  +  SV+  LC      SA RF   ML +N
Sbjct: 410 LIKGYCKNGQADNAERLLKEMLSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRN 469

Query: 449 FRPRDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASMATSNALIHGLCGAGNLPEAV 508
             P   LLT L+  LCK GKH +A ELWF+ L KG      TSNAL+HGLC AG L EA 
Sbjct: 470 MSPGGGLLTTLISGLCKHGKHSKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAF 529

Query: 509 RIVEEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEEMTKQGIQPDIYTCNFLLRGL 568
           RI +E+L RG  +D +++NTLI GCC + K++E F   +EM K+G++PD YT + L+ GL
Sbjct: 530 RIQKEILGRGCVMDRVSYNTLISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGL 589

Query: 569 CSAGKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANRMEDVEKLFNELVAKRMELNT 628
            +  K+++AI+ WD+ K +G++ +V+TY VM+DG CKA R E+ ++ F+E+++K ++ NT
Sbjct: 590 FNMNKVEEAIQFWDDCKRNGMLPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNT 649

Query: 629 IVYNIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSSLIHGMCNIGLVEDAKHLIDE 688
           +VYN  IRA C +G ++ AL+LR+DMK KGI P  ATY+SLI GM  I  VE+AK L +E
Sbjct: 650 VVYNHLIRAYCRSGRLSMALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEE 709

Query: 689 MREEGLSPNVVCYTALIGGYCKLGQMDIAEATWLEMISFNIRPNKFTYTVMIDGYCKLGN 748
           MR EGL PNV  YTALI GY KLGQM   E    EM S N+ PNK TYTVMI GY + GN
Sbjct: 710 MRMEGLEPNVFHYTALIDGYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGN 769

Query: 749 MEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKAFKVCDQMATGGLSLDEITYTT 808
           + EA+ LL +M+E  IVPD +TY     G+ K   + +AFK            DE  Y  
Sbjct: 770 VTEASRLLNEMREKGIVPDSITYKEFIYGYLKQGGVLEAFK----------GSDEENYAA 822

Query: 809 LVHGWNR 816
           ++ GWN+
Sbjct: 830 IIEGWNK 822

BLAST of HG10002532 vs. TAIR 10
Match: AT4G19440.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 729.6 bits (1882), Expect = 2.9e-210
Identity = 376/787 (47.78%), Positives = 514/787 (65.31%), Query Frame = 0

Query: 29  QPHKEHHQDPPWQLQDQLLFWVSSILSNSSLDSSKCRALLPHLSPFQFDQLFFSVGLKAN 88
           +P K         L ++L    SS+LS  SLD  +C+ L+  LSP +FD+LF     K N
Sbjct: 50  RPDKSEETSSDRHLHERL----SSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVN 109

Query: 89  PNTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLHPARLLLIRLIDGNLPVLNSDTN 148
           P T L+FF  ASDSF F F++RSYC+LI LL+ +  L  AR++LIRLI+GN+PVL     
Sbjct: 110 PKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR 169

Query: 149 KLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSCAIDVFYLFARKGIFPS 208
              + IA+A+  L+         +  DLLI VY TQF+  G   A+DVF + A KG+FPS
Sbjct: 170 DSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPS 229

Query: 209 LKTCSFLLSSLVKANELEKCCELFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMK 268
             TC+ LL+SLV+ANE +KCCE F+V+ +GV PDV+LFT  INA CKGGK+E+A++LF K
Sbjct: 230 KTTCNILLTSLVRANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSK 289

Query: 269 MEKLGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHITYNVLINGLTKLGH 328
           ME+ G++PNVVT+N +I GL   GR D AF  KEKM   G++P+ ITY++L+ GLT+   
Sbjct: 290 MEEAGVAPNVVTFNTVIDGLGMCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKR 349

Query: 329 FDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEALWIKDLMISKNITPTSVTLYT 388
                 VL EM   GF PN + YNNLID + + G++++A+ IKDLM+SK ++ TS T  T
Sbjct: 350 IGDAYFVLKEMTKKGFPPNVIVYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNT 409

Query: 389 LMQGFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWLCKKCRVHSAFRFTKVMLSKN 448
           L++G+C++ Q + AE  L+E+LS G ++N  +  SV+  LC      SA RF   ML +N
Sbjct: 410 LIKGYCKNGQADNAERLLKEMLSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRN 469

Query: 449 FRPRDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASMATSNALIHGLCGAGNLPEAV 508
             P   LLT L+  LCK GKH +A ELWF+ L KG      TSNAL+HGLC AG L EA 
Sbjct: 470 MSPGGGLLTTLISGLCKHGKHSKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAF 529

Query: 509 RIVEEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEEMTKQGIQPDIYTCNFLLRGL 568
           RI +E+L RG  +D +++NTLI GCC + K++E F   +EM K+G++PD YT + L+ GL
Sbjct: 530 RIQKEILGRGCVMDRVSYNTLISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGL 589

Query: 569 CSAGKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANRMEDVEKLFNELVAKRMELNT 628
            +  K+++AI+ WD+ K +G++ +V+TY VM+DG CKA R E+ ++ F+E+++K ++ NT
Sbjct: 590 FNMNKVEEAIQFWDDCKRNGMLPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNT 649

Query: 629 IVYNIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSSLIHGMCNIGLVEDAKHLIDE 688
           +VYN  IRA C +G ++ AL+LR+DMK KGI P  ATY+SLI GM  I  VE+AK L +E
Sbjct: 650 VVYNHLIRAYCRSGRLSMALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEE 709

Query: 689 MREEGLSPNVVCYTALIGGYCKLGQMDIAEATWLEMISFNIRPNKFTYTVMIDGYCKLGN 748
           MR EGL PNV  YTALI GY KLGQM   E    EM S N+ PNK TYTVMI GY + GN
Sbjct: 710 MRMEGLEPNVFHYTALIDGYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGN 769

Query: 749 MEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKAFKVCDQMATGGLSLDEITYTT 808
           + EA+ LL +M+E  IVPD +TY     G+ K   + +AFK            DE  Y  
Sbjct: 770 VTEASRLLNEMREKGIVPDSITYKEFIYGYLKQGGVLEAFK----------GSDEENYAA 822

Query: 809 LVHGWNR 816
           ++ GWN+
Sbjct: 830 IIEGWNK 822

BLAST of HG10002532 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 344.4 bits (882), Expect = 2.6e-94
Identity = 224/745 (30.07%), Positives = 347/745 (46.58%), Query Frame = 0

Query: 109 IRSYCILILLLVHSKFLHPARLLL--IRLIDGNLPVLNSDTNKLHIEIANALFGLTSVVG 168
           ++  CI   +LV ++   PAR +L  + L+ G                ++ +FG      
Sbjct: 112 VQLVCITTHILVRARMYDPARHILKELSLMSGK---------------SSFVFGALMTTY 171

Query: 169 RF--EWTQAFDLLIHVYGTQFRNLGFSCAIDVFYLFARKGIFPSLKTCSFLLSSLVKANE 228
           R        +D+LI VY    R      ++++F L    G  PS+ TC+ +L S+VK+ E
Sbjct: 172 RLCNSNPSVYDILIRVY---LREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGE 231

Query: 229 -LEKCCELFEVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEKLGISPNVVTYNI 288
            +     L E++ + + PDV  F  +IN LC  G  EK+  L  KMEK G +P +VTYN 
Sbjct: 232 DVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNT 291

Query: 289 IIHGLCQNGRLDNAFELKEKMKMEGV---------------------------------- 348
           ++H  C+ GR   A EL + MK +GV                                  
Sbjct: 292 VLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRM 351

Query: 349 -KPSHITYNVLINGLTKLGHFDKVNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEAL 408
             P+ +TYN LING +  G     + +L+EM+  G  PN V +N LIDG+   GN  EAL
Sbjct: 352 IHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEAL 411

Query: 409 WIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWL 468
            +  +M +K +TP+ V+   L+ G C++ + + A      +   G+ +   T   ++  L
Sbjct: 412 KMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGL 471

Query: 469 CKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASM 528
           CK   +  A      M      P     + L+   CK G+   A E+  R+   G   + 
Sbjct: 472 CKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNG 531

Query: 529 ATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEE 588
              + LI+  C  G L EA+RI E M+  G   D  TFN L+   C+ GKV E       
Sbjct: 532 IIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRC 591

Query: 589 MTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANR 648
           MT  GI P+  + + L+ G  ++G+   A  ++DE    G      TYG ++ G CK   
Sbjct: 592 MTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGH 651

Query: 649 MEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSS 708
           + + EK    L A    ++T++YN  + A C +GN+A A+ L  +M  + ILP   TY+S
Sbjct: 652 LREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTS 711

Query: 709 LIHGMCNIGLVEDAKHLIDEMREEG-LSPNVVCYTALIGGYCKLGQMDIAEATWLEMISF 768
           LI G+C  G    A     E    G + PN V YT  + G  K GQ         +M + 
Sbjct: 712 LISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNL 771

Query: 769 NIRPNKFTYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKA 813
              P+  T   MIDGY ++G +E+ N+LL +M      P++ TYN L +G+ K K++  +
Sbjct: 772 GHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTS 831

BLAST of HG10002532 vs. TAIR 10
Match: AT5G59900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 338.6 bits (867), Expect = 1.4e-92
Identity = 229/802 (28.55%), Positives = 367/802 (45.76%), Query Frame = 0

Query: 83  VGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLVHSKFLHPARLLLIRLIDGNLPV 142
           +G   +P   L FF F      F  +  S+CILI  LV +    PA  LL  L+   L  
Sbjct: 78  IGTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLL---LRA 137

Query: 143 LNSDTNKLHIEIANALFGLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSCAIDVFYLFAR 202
           L         ++ N LF       +   + +FDLLI  Y    R L     + VF +   
Sbjct: 138 LKPS------DVFNVLFSCYEKC-KLSSSSSFDLLIQHYVRSRRVLD---GVLVFKMMIT 197

Query: 203 K-GIFPSLKTCSFLLSSLVKANELEKCCELF-EVMSQGVRPDVFLFTNVINALCKGGKME 262
           K  + P ++T S LL  LVK        ELF +++S G+RPDV+++T VI +LC+   + 
Sbjct: 198 KVSLLPEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLS 257

Query: 263 KAIELFMKMEKLGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHITYNVLI 322
           +A E+   ME  G   N+V YN++I GLC+  ++  A  +K+ +  + +KP  +TY  L+
Sbjct: 258 RAKEMIAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLV 317

Query: 323 NGLTKLGHFDKVNHVLDEM-----------------------------------VDAGFV 382
            GL K+  F+    ++DEM                                   VD G  
Sbjct: 318 YGLCKVQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVS 377

Query: 383 PNAVDYNNLIDGYCKMGNIDEALWIKDLMISKNITPTSVTLYTLMQGFCESNQIEQAENA 442
           PN   YN LID  CK     EA  + D M    + P  VT   L+  FC   +++ A + 
Sbjct: 378 PNLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSF 437

Query: 443 LEEILSRGLSINPDTCYSVVHWLCKKCRVHSAFRFTKVMLSKNFRPRDQLLTILVRELCK 502
           L E++  GL ++     S+++  CK   + +A  F   M++K   P     T L+   C 
Sbjct: 438 LGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCS 497

Query: 503 DGKHLEATELWFRLLEKGSPASMATSNALIHGLCGAGNLPEAVRIVEEMLERGFPVDEIT 562
            GK  +A  L+  +  KG   S+ T   L+ GL  AG + +AV++  EM E     + +T
Sbjct: 498 KGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVT 557

Query: 563 FNTLILGCCREGKVEECFRLKEEMTKQGIQPDIYTCNFLLRGLCSAGKLDDAIKLWDEYK 622
           +N +I G C EG + + F   +EMT++GI PD Y+   L+ GLC  G+  +A    D   
Sbjct: 558 YNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLH 617

Query: 623 ASGLVSNVHTYGVMMDGYCKANRMEDVEKLFNELVAKRMELNTIVYNIFIRANCHNGNVA 682
                 N   Y  ++ G+C+  ++E+   +  E+V + ++L+ + Y + I  +  + +  
Sbjct: 618 KGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRK 677

Query: 683 AALQLRDDMKSKGILPTCATYSSLIHGMCNIGLVEDAKHLIDEMREEGLSPNVVCYTALI 742
               L  +M  +G+ P    Y+S+I      G  ++A  + D M  EG  PN V YTA+I
Sbjct: 678 LFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVI 737

Query: 743 GGYCKLGQMDIAEATWLEMISFNIRPNKF------------------------------- 802
            G CK G ++ AE    +M   +  PN+                                
Sbjct: 738 NGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLL 797

Query: 803 ----TYTVMIDGYCKLGNMEEANNLLAKMKESRIVPDVVTYNALTNGFCKGKNMDKAFKV 813
               TY ++I G+C+ G +EEA+ L+ +M    + PD +TY  + N  C+  ++ KA ++
Sbjct: 798 ANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIEL 857

BLAST of HG10002532 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 312.4 bits (799), Expect = 1.1e-84
Identity = 229/830 (27.59%), Positives = 399/830 (48.07%), Query Frame = 0

Query: 5   RFKINKTVPVLFPFSRRLACALS-TQPHKEHHQDPPWQLQDQLLFWVSSILSNSSLDSS- 64
           +F  + TVP   P +RR  C++S    +    +     +  +LL    SILS  +   S 
Sbjct: 26  KFSTDVTVP--SPVTRRQFCSVSPLLRNLPEEESDSMSVPHRLL----SILSKPNWHKSP 85

Query: 65  KCRALLPHLSPFQFDQLFFSVGLKANPNTCLNFFYFASDSFKFRFTIRSYCILILLLVHS 124
             ++++  +SP     LF    L  +P T LNF ++ S + +++ ++ SY  L+ LL+++
Sbjct: 86  SLKSMVSAISPSHVSSLF---SLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINN 145

Query: 125 KF---LHPARLLLIRLIDG---NLPVL------NSDTN-----KLHIEIANAL------F 184
            +   +   RLL+I+  D     L VL      N D       KL I   N L      F
Sbjct: 146 GYVGVVFKIRLLMIKSCDSVGDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLARF 205

Query: 185 GLTSVVGRFEWTQAFDLLIHVYGTQFRNLGFSC-------AIDVFYLFARKGIFPSLKTC 244
           GL   + +       D +     T  + +   C       A          G+ P   T 
Sbjct: 206 GLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTY 265

Query: 245 SFLLSSLVKANELEKCCELF-EVMSQGVRPDVFLFTNVINALCKGGKMEKAIELFMKMEK 304
           + L+    +  +L+   ++F E+  +G R +   +T++I+ LC   ++++A++LF+KM+ 
Sbjct: 266 TSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKD 325

Query: 305 LGISPNVVTYNIIIHGLCQNGRLDNAFELKEKMKMEGVKPSHITYNVLINGLTKLGHFDK 364
               P V TY ++I  LC + R   A  L ++M+  G+KP+  TY VLI+ L     F+K
Sbjct: 326 DECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEK 385

Query: 365 VNHVLDEMVDAGFVPNAVDYNNLIDGYCKMGNIDEALWIKDLMISKNITPTSVTLYTLMQ 424
              +L +M++ G +PN + YN LI+GYCK G I++A+ + +LM S+ ++P + T   L++
Sbjct: 386 ARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIK 445

Query: 425 GFCESNQIEQAENALEEILSRGLSINPDTCYSVVHWLCKKCRVHSAFRFTKVMLSKNFRP 484
           G+C+SN + +A   L ++L R +  +  T  S++   C+     SA+R   +M  +   P
Sbjct: 446 GYCKSN-VHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVP 505

Query: 485 RDQLLTILVRELCKDGKHLEATELWFRLLEKGSPASMATSNALIHGLCGAGNLPEAVRIV 544
                T ++  LCK  +  EA +L+  L +KG   ++    ALI G C AG + EA  ++
Sbjct: 506 DQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLML 565

Query: 545 EEMLERGFPVDEITFNTLILGCCREGKVEECFRLKEEMTKQGIQPDIYTCNFLLRGLCSA 604
           E+ML +    + +TFN LI G C +GK++E   L+E+M K G+QP + T   L+  L   
Sbjct: 566 EKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKD 625

Query: 605 GKLDDAIKLWDEYKASGLVSNVHTYGVMMDGYCKANRMEDVEKLFNELVAKRMELNTIVY 664
           G  D A   + +  +SG   + HTY   +  YC+  R+ D E +  ++    +  +   Y
Sbjct: 626 GDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTY 685

Query: 665 NIFIRANCHNGNVAAALQLRDDMKSKGILPTCATYSSLIHGMCNIGLVEDAKHLIDEM-- 724
           +  I+     G    A  +   M+  G  P+  T+ SLI            KHL++    
Sbjct: 686 SSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLI------------KHLLEMKYG 745

Query: 725 REEGLSPNVVCYTALIGGYCKLGQMDIAEATWLEMISFNIRPNKFTYTVMIDGYCKLGNM 784
           +++G  P +   + ++       + D       +M+  ++ PN  +Y  +I G C++GN+
Sbjct: 746 KQKGSEPELCAMSNMM-------EFDTVVELLEKMVEHSVTPNAKSYEKLILGICEVGNL 805

Query: 785 EEANNLLAKMKESR-IVPDVVTYNALTNGFCKGKNMDKAFKVCDQMATGG 799
             A  +   M+ +  I P  + +NAL +  CK K  ++A KV D M   G
Sbjct: 806 RVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDMICVG 826
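
The identity and positives percentages in each HSP header above are the matched counts divided by the alignment length (for example 376/787 = 47.78%). A minimal Python sketch of that arithmetic, assuming the header layout shown in these alignments; the function name `parse_hsp_stats` is hypothetical and not part of any BLAST tool:

```python
import re

# Matches the "Identity = a/n (x%), Positives = b/n (y%)" header line.
# "Posi?tives" also tolerates the "Postives" spelling seen in some reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP header."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed in the header, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 376/787 (47.78%), Positives = 514/787 (65.31%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when copying hit statistics out of a report like this one.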

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038884789.1 | 0.0e+00 | 88.81 | pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Benincasa ...
XP_023552294.1 | 0.0e+00 | 87.83 | pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucur...
KAG6577115.1 | 0.0e+00 | 87.83 | Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
XP_022984601.1 | 0.0e+00 | 87.59 | pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucur...
XP_022931380.1 | 0.0e+00 | 87.47 | pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like [Cucur...

Match Name | E-value | Identity | Description
Q940A6 | 4.0e-209 | 47.78 | Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidop...
Q9LVQ5 | 3.7e-93 | 30.07 | Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX...
Q9FJE6 | 2.0e-91 | 28.55 | Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
Q9LSL9 | 1.5e-83 | 27.59 | Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX...
Q9M907 | 7.2e-81 | 27.26 | Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity | Description
A0A6J1J2L6 | 0.0e+00 | 87.59 | pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like OS=Cuc...
A0A6J1ETG9 | 0.0e+00 | 87.47 | pentatricopeptide repeat-containing protein At4g19440, chloroplastic-like OS=Cuc...
A0A1S4DYY2 | 0.0e+00 | 85.89 | pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 ...
A0A5D3CYQ1 | 0.0e+00 | 85.89 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5A7TTX4 | 0.0e+00 | 85.77 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity | Description
AT4G19440.1 | 2.9e-210 | 47.78 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G19440.2 | 2.9e-210 | 47.78 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G55840.1 | 2.6e-94 | 30.07 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G59900.1 | 1.4e-92 | 28.55 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G65560.1 | 1.1e-84 | 27.59 | Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment

IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR
  coord: 457..483, e-value: 0.19, score: 12.1
  coord: 490..519, e-value: 1.5E-5, score: 24.9
  coord: 211..235, e-value: 0.24, score: 11.8

IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756
  coord: 384..417, e-value: 0.0018, score: 16.3
  coord: 629..663, e-value: 4.6E-6, score: 24.5
  coord: 492..523, e-value: 2.2E-6, score: 25.5
  coord: 734..768, e-value: 3.0E-10, score: 37.6
  coord: 211..242, e-value: 0.0029, score: 15.6
  coord: 314..348, e-value: 1.6E-9, score: 35.3
  coord: 559..592, e-value: 6.7E-7, score: 27.1
  coord: 665..698, e-value: 5.1E-9, score: 33.8
  coord: 524..558, e-value: 2.1E-9, score: 35.0
  coord: 769..802, e-value: 5.0E-7, score: 27.5
  coord: 245..278, e-value: 1.3E-8, score: 32.5
  coord: 594..627, e-value: 1.1E-7, score: 29.6
  coord: 279..312, e-value: 2.5E-9, score: 34.8
  coord: 699..732, e-value: 3.6E-7, score: 28.0
  coord: 351..382, e-value: 2.5E-6, score: 25.3

IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2
  coord: 276..324, e-value: 1.5E-18, score: 66.7
  coord: 522..570, e-value: 2.8E-16, score: 59.5
  coord: 627..675, e-value: 1.1E-13, score: 51.2
  coord: 696..745, e-value: 4.9E-16, score: 58.7
  coord: 346..394, e-value: 2.7E-10, score: 40.3
  coord: 766..812, e-value: 9.5E-13, score: 48.1

IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1
  coord: 238..270, e-value: 1.5E-8, score: 34.2
  coord: 588..618, e-value: 2.0E-6, score: 27.4

IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
  coord: 277..311, score: 13.942831
  coord: 662..696, score: 12.397287
  coord: 627..661, score: 11.39981
  coord: 592..626, score: 11.454616
  coord: 382..416, score: 9.086975
  coord: 697..731, score: 11.673842
  coord: 312..346, score: 12.101333
  coord: 767..801, score: 11.388848
  coord: 557..591, score: 11.597113
  coord: 732..766, score: 13.350921
  coord: 522..556, score: 13.482456
  coord: 487..521, score: 11.224429
  coord: 242..276, score: 12.901507
  coord: 347..381, score: 12.408249

IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain
  coord: 480..549, e-value: 1.9E-17, score: 65.4
  coord: 761..816, e-value: 4.9E-15, score: 57.5
  coord: 550..621, e-value: 2.2E-21, score: 78.2
  coord: 342..408, e-value: 6.5E-15, score: 57.1
  coord: 690..760, e-value: 1.8E-22, score: 81.8
  coord: 622..689, e-value: 3.1E-16, score: 61.5
  coord: 409..479, e-value: 7.3E-6, score: 27.6
  coord: 185..341, e-value: 9.5E-45, score: 155.3

None | No IPR available | PANTHER | PTHR47938:SF7 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
  coord: 27..807

None | No IPR available | PANTHER | PTHR47938 | RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATED
  coord: 27..807

None | No IPR available | SUPERFAMILY | 81901 | HCP-like
  coord: 463..797
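
The PROSITE PS51375 hits listed above are 35-residue windows, and most of them sit back to back. A short Python sketch, using only the coordinates from this table, that sorts the hits and groups adjacent windows into runs; the helper name `contiguous_runs` is hypothetical and this is an illustration, not part of the InterPro analysis itself:

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro table,
# in the order they are listed there (1-based, inclusive).
ps51375_hits = [
    (277, 311), (662, 696), (627, 661), (592, 626), (382, 416),
    (697, 731), (312, 346), (767, 801), (557, 591), (732, 766),
    (522, 556), (487, 521), (242, 276), (347, 381),
]

def contiguous_runs(hits):
    """Group (start, end) hits into runs where each hit starts right after the previous one ends.

    Returns a list of (run_start, run_end, repeat_count) tuples.
    """
    runs = []
    for start, end in sorted(hits):
        if runs and start == runs[-1][1] + 1:
            first, _, count = runs[-1]
            runs[-1] = (first, end, count + 1)  # extend the current run
        else:
            runs.append((start, end, 1))        # start a new run
    return runs

runs = contiguous_runs(ps51375_hits)
print(runs)  # [(242, 416, 5), (487, 801, 9)]
```

On these coordinates the 14 hits fall into two tandem arrays, residues 242..416 and 487..801. Among the IPR002885 entries, the only hit lying wholly inside the gap between them is the weak PF01535 match at 457..483 (e-value 0.19).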

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10002532.1 | HG10002532.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0005515 | protein binding