HG10021841 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10021841
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr05: 17465093 .. 17467486 (-)
RNA-Seq ExpressionHG10021841
SyntenyHG10021841
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTACGAGAGTTTCGTTTTGGTAATTTCACCAAAATTTAACGTTAAGTTGGTCATTCATCCGGCAAATCTAAAGTTTAGTTGGACTAACAAGGGACATCAGACAAGAGAGTACTTCAAAAAGCGGGTCATTGGCTTCGTGGGATATGCAGATGGCAGAGAAACGAAACTAGGCGGCCATTTGTCATGGGCTATGGAAGCTCTAAGCGTTCCATTGATTTCTCTCCAGAATTTCCCAACCCCAAACAACAATCTTCCTTTCAGAAACCATCAAATTCTCTCTACAATCGATCAATGTTCAAGTTCAAAGCAATTGAAGCAAGTTCACGCTCACATGCTCCGCACCGGCCTCTTCTTCGACCCCTTCTCCGCCAGCAAGCTCTTCACAGCTTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCGACGTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTATGCTTCCAGCTCCGACCCTTTTCAGAGTTTTGTAATATTTTTGGATTTGCTTGATAAATGTGAGGATTTGCCCAATAATTTCACTTTCCCATTTGTCATTAAGGCGGCTTCGGAGCTAAAAGCATCACGGGTCGGGAGAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATTTGTATATTCTTAATTCGCTTGTGCGATTCTATGGGGCATGTGGGGATTTGAATATGGCCGAGCGATTGTTTGAGGGTATTTCCTGCAAAGATGTGGTGTCTTGGAATTCGATGATCTCGGCTTTTGCTCAGGGGAACTTTCCAGAAGATGCATTGGACTTGTTTTTGAAAATGGAGGGGGAGAATGTGATGCCCAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGCGCAAAGAAGTTGGATTTGGAGTTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGAAATCAAAGTGGATTTAACTCTGTGTAACGCCATGCTTGATATGTATTCGAAGTGTGGAAGCATTGATGTTGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCACCATGCTTGATGGGTATGCAAAAATGGGAGACTTCGATGCTGCTCGGCAAGTGTTCGATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTGTGAACAAAATGGTAAGCCTAAGGAAGCTTTGGCCACTTTTAATGAGTTGCAGGTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACCCTGTCAGCTTGTGCTCAACTGGGGGCAATTGATTTGGGTGGGTGGATTCATGTTTACATAAAAAGGGAGGGGATAGATCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAAAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCATGGCCGTGGGAAAGCGGCAATTGATCTATTCTTCGAAATGCAGGAAGCTAAAGTGAAGCCAAATAATGTGACTTTTACAAATGTATTATGTGCCTGCAGCCATGCCGGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTCCCTGGGACTAAGCACTATGCGTGTATGGTCGATATTCTCGGTCGTGCAGGGTTTCTAGAAGAAGCTATGGAGTTGATCAATGAAATGCCTATAACTCCAAGCGCCTCCGTTTGGGGTGCTTTGCTTGGTGCCTGCAGCCTTCATATGAATGTTGAGCTTGCAGAATTGGCTAGTGATCAATTGCTCAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCAAACATATATGCCAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGATTCTGAACTGAAAAAGGAACCAGGTTGTAGTTCCATTGAAGTCAATGGCAACGTCCACGAGTTTCTAGTAGGCGATAATTCCCACCTGTTATCCAGCGACATCTATTCGAAGTTAGACGAGATTGCAACGAAACTAAAATCAGTTGGATATGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGACGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATCGCATTCGGGCTTATTAGTTTGGCTCCATCTCAACCAATTCGTGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTTGTATCTAGAGTATATGACAGAGAGATATTACTCCGTGATCGATACCGATTCCATCATTTTCGAGATGGGCATTGCTCATGTATGGATTACTGGTAA

mRNA sequence

ATGTACGAGAGTTTCGTTTTGGTAATTTCACCAAAATTTAACGTTAAGTTGGTCATTCATCCGGCAAATCTAAAGTTTAGTTGGACTAACAAGGGACATCAGACAAGAGAGTACTTCAAAAAGCGGGTCATTGGCTTCGTGGGATATGCAGATGGCAGAGAAACGAAACTAGGCGGCCATTTGTCATGGGCTATGGAAGCTCTAAGCGTTCCATTGATTTCTCTCCAGAATTTCCCAACCCCAAACAACAATCTTCCTTTCAGAAACCATCAAATTCTCTCTACAATCGATCAATGTTCAAGTTCAAAGCAATTGAAGCAAGTTCACGCTCACATGCTCCGCACCGGCCTCTTCTTCGACCCCTTCTCCGCCAGCAAGCTCTTCACAGCTTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCGACGTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTATGCTTCCAGCTCCGACCCTTTTCAGAGTTTTGTAATATTTTTGGATTTGCTTGATAAATGTGAGGATTTGCCCAATAATTTCACTTTCCCATTTGTCATTAAGGCGGCTTCGGAGCTAAAAGCATCACGGGTCGGGAGAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATTTGTATATTCTTAATTCGCTTGTGCGATTCTATGGGGCATGTGGGGATTTGAATATGGCCGAGCGATTGTTTGAGGGTATTTCCTGCAAAGATGTGGTGTCTTGGAATTCGATGATCTCGGCTTTTGCTCAGGGGAACTTTCCAGAAGATGCATTGGACTTGTTTTTGAAAATGGAGGGGGAGAATGTGATGCCCAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGCGCAAAGAAGTTGGATTTGGAGTTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGAAATCAAAGTGGATTTAACTCTGTGTAACGCCATGCTTGATATGTATTCGAAGTGTGGAAGCATTGATGTTGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCACCATGCTTGATGGGTATGCAAAAATGGGAGACTTCGATGCTGCTCGGCAAGTGTTCGATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTGTGAACAAAATGGTAAGCCTAAGGAAGCTTTGGCCACTTTTAATGAGTTGCAGGTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACCCTGTCAGCTTGTGCTCAACTGGGGGCAATTGATTTGGGTGGGTGGATTCATGTTTACATAAAAAGGGAGGGGATAGATCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAAAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCATGGCCGTGGGAAAGCGGCAATTGATCTATTCTTCGAAATGCAGGAAGCTAAAGTGAAGCCAAATAATGTGACTTTTACAAATGTATTATGTGCCTGCAGCCATGCCGGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTCCCTGGGACTAAGCACTATGCGTGTATGGTCGATATTCTCGGTCGTGCAGGGTTTCTAGAAGAAGCTATGGAGTTGATCAATGAAATGCCTATAACTCCAAGCGCCTCCGTTTGGGGTGCTTTGCTTGGTGCCTGCAGCCTTCATATGAATGTTGAGCTTGCAGAATTGGCTAGTGATCAATTGCTCAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCAAACATATATGCCAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGATTCTGAACTGAAAAAGGAACCAGGTTGTAGTTCCATTGAAGTCAATGGCAACGTCCACGAGTTTCTAGTAGGCGATAATTCCCACCTGTTATCCAGCGACATCTATTCGAAGTTAGACGAGATTGCAACGAAACTAAAATCAGTTGGATATGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGACGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATCGCATTCGGGCTTATTAGTTTGGCTCCATCTCAACCAATTCGTGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTTGTATCTAGAGTATATGACAGAGAGATATTACTCCGTGATCGATACCGATTCCATCATTTTCGAGATGGGCATTGCTCATGTATGGATTACTGGTAA

Coding sequence (CDS)

ATGTACGAGAGTTTCGTTTTGGTAATTTCACCAAAATTTAACGTTAAGTTGGTCATTCATCCGGCAAATCTAAAGTTTAGTTGGACTAACAAGGGACATCAGACAAGAGAGTACTTCAAAAAGCGGGTCATTGGCTTCGTGGGATATGCAGATGGCAGAGAAACGAAACTAGGCGGCCATTTGTCATGGGCTATGGAAGCTCTAAGCGTTCCATTGATTTCTCTCCAGAATTTCCCAACCCCAAACAACAATCTTCCTTTCAGAAACCATCAAATTCTCTCTACAATCGATCAATGTTCAAGTTCAAAGCAATTGAAGCAAGTTCACGCTCACATGCTCCGCACCGGCCTCTTCTTCGACCCCTTCTCCGCCAGCAAGCTCTTCACAGCTTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCGACGTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTATGCTTCCAGCTCCGACCCTTTTCAGAGTTTTGTAATATTTTTGGATTTGCTTGATAAATGTGAGGATTTGCCCAATAATTTCACTTTCCCATTTGTCATTAAGGCGGCTTCGGAGCTAAAAGCATCACGGGTCGGGAGAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATTTGTATATTCTTAATTCGCTTGTGCGATTCTATGGGGCATGTGGGGATTTGAATATGGCCGAGCGATTGTTTGAGGGTATTTCCTGCAAAGATGTGGTGTCTTGGAATTCGATGATCTCGGCTTTTGCTCAGGGGAACTTTCCAGAAGATGCATTGGACTTGTTTTTGAAAATGGAGGGGGAGAATGTGATGCCCAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGCGCAAAGAAGTTGGATTTGGAGTTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGAAATCAAAGTGGATTTAACTCTGTGTAACGCCATGCTTGATATGTATTCGAAGTGTGGAAGCATTGATGTTGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCACCATGCTTGATGGGTATGCAAAAATGGGAGACTTCGATGCTGCTCGGCAAGTGTTCGATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTGTGAACAAAATGGTAAGCCTAAGGAAGCTTTGGCCACTTTTAATGAGTTGCAGGTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACCCTGTCAGCTTGTGCTCAACTGGGGGCAATTGATTTGGGTGGGTGGATTCATGTTTACATAAAAAGGGAGGGGATAGATCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAAAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCATGGCCGTGGGAAAGCGGCAATTGATCTATTCTTCGAAATGCAGGAAGCTAAAGTGAAGCCAAATAATGTGACTTTTACAAATGTATTATGTGCCTGCAGCCATGCCGGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTCCCTGGGACTAAGCACTATGCGTGTATGGTCGATATTCTCGGTCGTGCAGGGTTTCTAGAAGAAGCTATGGAGTTGATCAATGAAATGCCTATAACTCCAAGCGCCTCCGTTTGGGGTGCTTTGCTTGGTGCCTGCAGCCTTCATATGAATGTTGAGCTTGCAGAATTGGCTAGTGATCAATTGCTCAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCAAACATATATGCCAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGATTCTGAACTGAAAAAGGAACCAGGTTGTAGTTCCATTGAAGTCAATGGCAACGTCCACGAGTTTCTAGTAGGCGATAATTCCCACCTGTTATCCAGCGACATCTATTCGAAGTTAGACGAGATTGCAACGAAACTAAAATCAGTTGGATATGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGACGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATCGCATTCGGGCTTATTAGTTTGGCTCCATCTCAACCAATTCGTGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTTGTATCTAGAGTATATGACAGAGAGATATTACTCCGTGATCGATACCGATTCCATCATTTTCGAGATGGGCATTGCTCATGTATGGATTACTGGTAA

Protein sequence

MYESFVLVISPKFNVKLVIHPANLKFSWTNKGHQTREYFKKRVIGFVGYADGRETKLGGHLSWAMEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW
Homology
BLAST of HG10021841 vs. NCBI nr
Match: XP_038893523.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa hispida])

HSP 1 Score: 1418.7 bits (3671), Expect = 0.0e+00
Identity = 699/733 (95.36%), Postives = 715/733 (97.54%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           MEALSVPLISLQNFPTPN+NLPFRNHQILSTIDQCSS KQLKQVHAHMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFPTPNDNLPFRNHQILSTIDQCSSPKQLKQVHAHMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKLFTASALSSFSTLDYA +VFDQI  PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYALNVFDQISHPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           +DLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYG CGDLNMA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGTCGDLNMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLFEGISCKDVVSWNSMISAFAQGN PEDALDLFLKME ENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALDLFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMY+KCGSID AQKLFDEMPERDVFSWTTML
Sbjct: 241 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGDFDAAR+VFDAMPVKEIAAWNVLISA EQNG PKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDFDAARRVFDAMPVKEIAAWNVLISAYEQNGNPKEALATFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVSTLSAC+QLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACSQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VE +DVYVWSAMIAGLGMHGRGKAAI+LFFEMQEAKVKPN+VTF NVLCACSHAGLVDEG
Sbjct: 421 VEVRDVYVWSAMIAGLGMHGRGKAAINLFFEMQEAKVKPNSVTFMNVLCACSHAGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           RAF HEMEP+YGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSAS+WGALLGACS
Sbjct: 481 RAFLHEMEPIYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASIWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDS+LKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSKLKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SSIEV+GNVHEFLVGDNSH LSS IY KLDEIATKLKSVGYEPNKSHLLQ IE+DDLKEQ
Sbjct: 601 SSIEVDGNVHEFLVGDNSHPLSSKIYLKLDEIATKLKSVGYEPNKSHLLQFIEEDDLKEQ 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSC DYW
Sbjct: 721 HFRDGHCSCRDYW 733

BLAST of HG10021841 vs. NCBI nr
Match: KAA0031814.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1393.6 bits (3606), Expect = 0.0e+00
Identity = 689/733 (94.00%), Postives = 709/733 (96.73%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. NCBI nr
Match: XP_008457379.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis melo] >TYJ97320.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1391.3 bits (3600), Expect = 0.0e+00
Identity = 688/733 (93.86%), Postives = 708/733 (96.59%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. NCBI nr
Match: XP_004145320.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sativus] >KGN65801.1 hypothetical protein Csa_023315 [Cucumis sativus])

HSP 1 Score: 1385.5 bits (3585), Expect = 0.0e+00
Identity = 684/733 (93.32%), Postives = 707/733 (96.45%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           MEALSVP ISLQNF T NNNL FRNHQILSTID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKLFTASALSSFSTLDYAR++FDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           EDLPN FTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERK IKVDLTLCNAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           R FFHEMEPVYGVVP  KHYACMVDILGRAGFLEEAMELINEM  TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SSIE NGNVHEFLVGDN+H LSS+IYSKL+EIATKLKSVGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH  AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. NCBI nr
Match: XP_022964665.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1359.4 bits (3517), Expect = 0.0e+00
Identity = 666/733 (90.86%), Postives = 699/733 (95.36%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           ME LS PL+SL N    +NNL FRNHQILSTIDQCSS KQLKQVHA MLRTGLFFDPFSA
Sbjct: 1   METLSAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKL  ASAL S STL+YARDVFDQIP PNLYTWNTLIRAYASS+DPFQSFVIFL LLD+C
Sbjct: 61  SKLIAASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDEC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           +DLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLS GMD YILNSLVRFYGACGDLNMA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLFEGISCKDVVSWNSMISAFAQGN PEDAL+LFLKMEG NVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERKEI VDLTLCNAMLDMY+KCGSI  A+KLFDEMPERDVFSWTTML
Sbjct: 241 LDLEFGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGDF+AAR+VFD MPVKEIAAWN LISA E+NGKPKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVS+LSACAQLGAIDLGGWIHVYIKREGI+LN HLI+SL+DMYAKCGALEKALEVFY+
Sbjct: 361 VTLVSSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYA 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEEKDVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN+VTFTN+LCACSHAGLVDEG
Sbjct: 421 VEEKDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           RA FHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMP TPSASVWGALLGACS
Sbjct: 481 RALFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVELAELASDQLLKLEPRNHGAI+LLSN+YAKTGRW+KVSELRKLMRDSELKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SS+EVNG VHEFLVGDNSH LS DIYSKLDEIA KLKSVGYEPNKSHLLQLIE+DD+KE 
Sbjct: 601 SSVEVNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEH 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKL+SRVY+R+IL++DRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 939.9 bits (2428), Expect = 1.9e-272
Identity = 447/727 (61.49%), Postives = 569/727 (78.27%), Query Frame = 0

Query: 71  PLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTA 130
           P  S  N PT NN       + +S I++C S +QLKQ H HM+RTG F DP+SASKLF  
Sbjct: 16  PNFSNPNQPTTNN----ERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75

Query: 131 SALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 190
           +ALSSF++L+YAR VFD+IP+PN + WNTLIRAYAS  DP  S   FLD++ + +  PN 
Sbjct: 76  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135

Query: 191 FTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEG 250
           +TFPF+IKAA+E+ +  +G+++HGMA+K + G D+++ NSL+  Y +CGDL+ A ++F  
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195

Query: 251 ISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFG 310
           I  KDVVSWNSMI+ F Q   P+ AL+LF KME E+V  + VTMVGVLSACAK  +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255

Query: 311 RWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKM 370
           R VCSYIE   + V+LTL NAMLDMY+KCGSI+ A++LFD M E+D  +WTTMLDGYA  
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315

Query: 371 GDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVST 430
            D++AAR+V ++MP K+I AWN LISA EQNGKP EAL  F+ELQ+ K  K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375

Query: 431 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDV 490
           LSACAQ+GA++LG WIH YIK+ GI +N H+ S+L+ MY+KCG LEK+ EVF SVE++DV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435

Query: 491 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHE 550
           +VWSAMI GL MHG G  A+D+F++MQEA VKPN VTFTNV CACSH GLVDE  + FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495

Query: 551 MEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVE 610
           ME  YG+VP  KHYAC+VD+LGR+G+LE+A++ I  MPI PS SVWGALLGAC +H N+ 
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555

Query: 611 LAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVN 670
           LAE+A  +LL+LEPRN GA VLLSNIYAK G+WE VSELRK MR + LKKEPGCSSIE++
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615

Query: 671 GNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHS 730
           G +HEFL GDN+H +S  +Y KL E+  KLKS GYEP  S +LQ+IE++++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675

Query: 731 EKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGH 790
           EKLAI +GLIS    + IRV+KNLR+CGDCH VAKL+S++YDREI++RDRYRFHHFR+G 
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735

Query: 791 CSCMDYW 798
           CSC D+W
Sbjct: 736 CSCNDFW 738

BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 605.1 bits (1559), Expect = 1.1e-171
Identity = 321/768 (41.80%), Postives = 459/768 (59.77%), Query Frame = 0

Query: 68  LSVPLISLQNFPTPNNNLP----FRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFS 127
           L+VP  S      P+++ P     RNH  LS +  C + + L+ +HA M++ GL    ++
Sbjct: 8   LTVPSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYA 67

Query: 128 ASKLFTASALS-SFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLD 187
            SKL     LS  F  L YA  VF  I +PNL  WNT+ R +A SSDP  +  +++ ++ 
Sbjct: 68  LSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI- 127

Query: 188 KCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNS------------ 247
               LPN++TFPFV+K+ ++ KA + G+ +HG  +KL   +DLY+  S            
Sbjct: 128 SLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLE 187

Query: 248 -------------------LVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNF 307
                              L++ Y + G +  A++LF+ I  KDVVSWN+MIS +A+   
Sbjct: 188 DAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGN 247

Query: 308 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 367
            ++AL+LF  M   NV P+  TMV V+SACA+   +E GR V  +I+      +L + NA
Sbjct: 248 YKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNA 307

Query: 368 MLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAW 427
           ++D+YSKCG ++ A  LF+ +P +DV SW T++ GY  M  +                  
Sbjct: 308 LIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY------------------ 367

Query: 428 NVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 487
                        KEAL  F E+  S    P++VT++S L ACA LGAID+G WIHVYI 
Sbjct: 368 -------------KEALLLFQEMLRSG-ETPNDVTMLSILPACAHLGAIDIGRWIHVYID 427

Query: 488 R--EGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAA 547
           +  +G+     L +SL+DMYAKCG +E A +VF S+  K +  W+AMI G  MHGR  A+
Sbjct: 428 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 487

Query: 548 IDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVD 607
            DLF  M++  ++P+++TF  +L ACSH+G++D GR  F  M   Y + P  +HY CM+D
Sbjct: 488 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 547

Query: 608 ILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGA 667
           +LG +G  +EA E+IN M + P   +W +LL AC +H NVEL E  ++ L+K+EP N G+
Sbjct: 548 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 607

Query: 668 IVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDI 727
            VLLSNIYA  GRW +V++ R L+ D  +KK PGCSSIE++  VHEF++GD  H  + +I
Sbjct: 608 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 667

Query: 728 YSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIR 787
           Y  L+E+   L+  G+ P+ S +LQ +E ++ KE AL  HSEKLAIAFGLIS  P   + 
Sbjct: 668 YGMLEEMEVLLEKAGFVPDTSEVLQEME-EEWKEGALRHHSEKLAIAFGLISTKPGTKLT 727

Query: 788 VVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
           +VKNLR+C +CHE  KL+S++Y REI+ RDR RFHHFRDG CSC DYW
Sbjct: 728 IVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match: O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 566.2 bits (1458), Expect = 5.7e-160
Identity = 281/713 (39.41%), Postives = 438/713 (61.43%), Query Frame = 0

Query: 92  ILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQ 151
           IL  +  C S   +KQ+HAH+LRT    +    S LF  S  SS   L YA +VF  IP 
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 152 -PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGR 211
            P    +N  +R  + SS+P ++ ++F   +       + F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 212 AVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGN 271
            +HG+A K++   D ++    +  Y +CG +N A  +F+ +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 272 FPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCN 331
             ++A  LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++   ++++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 332 AMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAA 391
           A++ MY+  G +D+A++ F +M  R++F  T M+ GY+K G  D A+ +FD    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 392 WNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 451
           W  +ISA  ++  P+EAL  F E+  S I KPD V++ S +SACA LG +D   W+H  I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 452 KREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAI 511
              G++    + ++L++MYAKCG L+   +VF  +  ++V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 512 DLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDI 571
            LF  M++  V+PN VTF  VL  CSH+GLV+EG+  F  M   Y + P  +HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 572 LGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAI 631
            GRA  L EA+E+I  MP+  +  +WG+L+ AC +H  +EL + A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 632 VLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIY 691
           VL+SNIYA+  RWE V  +R++M +  + KE G S I+ NG  HEFL+GD  H  S++IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 692 SKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQP--- 751
           +KLDE+ +KLK  GY P+   +L  +E+++ K+  L  HSEKLA+ FGL++    +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 752 ---IRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
              IR+VKNLR+C DCH   KLVS+VY+REI++RDR RFH +++G CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 543.5 bits (1399), Expect = 4.0e-153
Identity = 279/706 (39.52%), Postives = 425/706 (60.20%), Query Frame = 0

Query: 94  STIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQPN 153
           S ID  +   QLKQ+HA +L  GL F  F  +KL  AS  SSF  + +AR VFD +P+P 
Sbjct: 26  SLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS--SSFGDITFARQVFDDLPRPQ 85

Query: 154 LYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVH 213
           ++ WN +IR Y S ++ FQ  ++    +      P++FTFP ++KA S L   ++GR VH
Sbjct: 86  IFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVH 145

Query: 214 GMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISC--KDVVSWNSMISAFAQGNF 273
               +L F  D+++ N L+  Y  C  L  A  +FEG+    + +VSW +++SA+AQ   
Sbjct: 146 AQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGE 205

Query: 274 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 333
           P +AL++F +M   +V P+ V +V VL+A     DL+ GR + + + +  ++++  L  +
Sbjct: 206 PMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLIS 265

Query: 334 MLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAW 393
           +  MY+KCG +  A+ LFD+M   ++  W  M+ GYAK                      
Sbjct: 266 LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK---------------------- 325

Query: 394 NVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 453
                    NG  +EA+  F+E+ ++K  +PD +++ S +SACAQ+G+++    ++ Y+ 
Sbjct: 326 ---------NGYAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVG 385

Query: 454 REGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAID 513
           R     +  + S+L+DM+AKCG++E A  VF    ++DV VWSAMI G G+HGR + AI 
Sbjct: 386 RSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAIS 445

Query: 514 LFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDIL 573
           L+  M+   V PN+VTF  +L AC+H+G+V EG  FF+ M   + + P  +HYAC++D+L
Sbjct: 446 LYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 505

Query: 574 GRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIV 633
           GRAG L++A E+I  MP+ P  +VWGALL AC  H +VEL E A+ QL  ++P N G  V
Sbjct: 506 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 565

Query: 634 LLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYS 693
            LSN+YA    W++V+E+R  M++  L K+ GCS +EV G +  F VGD SH    +I  
Sbjct: 566 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 625

Query: 694 KLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVV 753
           +++ I ++LK  G+  NK   L  + D++  E+ L  HSE++AIA+GLIS     P+R+ 
Sbjct: 626 QVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHSERIAIAYGLISTPQGTPLRIT 685

Query: 754 KNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
           KNLR C +CH   KL+S++ DREI++RD  RFHHF+DG CSC DYW
Sbjct: 686 KNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of HG10021841 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 525.8 bits (1353), Expect = 8.6e-148
Identity = 264/695 (37.99%), Postives = 414/695 (59.57%), Query Frame = 0

Query: 107 QVHAHMLRTGLFFDPFSASKLFTASALSSF----STLDYARDVFDQIPQPNLYTWNTLIR 166
           Q+H  +++ G       A  LF  ++L  F      LD AR VFD++ + N+ +W ++I 
Sbjct: 155 QIHGLIVKMGY------AKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMIC 214

Query: 167 AYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFG 226
            YA       +  +F  ++   E  PN+ T   VI A ++L+    G  V+         
Sbjct: 215 GYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIE 274

Query: 227 MDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKM 286
           ++  ++++LV  Y  C  +++A+RLF+     ++   N+M S + +     +AL +F  M
Sbjct: 275 VNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLM 334

Query: 287 EGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSI 346
               V P+ ++M+  +S+C++  ++ +G+    Y+ R   +    +CNA++DMY KC   
Sbjct: 335 MDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQ 394

Query: 347 DVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNG 406
           D A ++FD M  + V +W +++ GY + G+ DAA + F+ MP K I +WN +IS   Q  
Sbjct: 395 DTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGS 454

Query: 407 KPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLI 466
             +EA+  F  +Q  +    D VT++S  SAC  LGA+DL  WI+ YI++ GI L+  L 
Sbjct: 455 LFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLG 514

Query: 467 SSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVK 526
           ++LVDM+++CG  E A+ +F S+  +DV  W+A I  + M G  + AI+LF +M E  +K
Sbjct: 515 TTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLK 574

Query: 527 PNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAME 586
           P+ V F   L ACSH GLV +G+  F+ M  ++GV P   HY CMVD+LGRAG LEEA++
Sbjct: 575 PDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQ 634

Query: 587 LINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGR 646
           LI +MP+ P+  +W +LL AC +  NVE+A  A++++  L P   G+ VLLSN+YA  GR
Sbjct: 635 LIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGR 694

Query: 647 WEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKS 706
           W  ++++R  M++  L+K PG SSI++ G  HEF  GD SH    +I + LDE++ +   
Sbjct: 695 WNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASH 754

Query: 707 VGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHE 766
           +G+ P+ S++L  +++ + K   LS HSEKLA+A+GLIS      IR+VKNLR+C DCH 
Sbjct: 755 LGHVPDLSNVLMDVDEKE-KIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHS 814

Query: 767 VAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
            AK  S+VY+REI+LRD  RFH+ R G CSC D+W
Sbjct: 815 FAKFASKVYNREIILRDNNRFHYIRQGKCSCGDFW 842

BLAST of HG10021841 vs. ExPASy TrEMBL
Match: A0A5A7SKX2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00570 PE=3 SV=1)

HSP 1 Score: 1393.6 bits (3606), Expect = 0.0e+00
Identity = 689/733 (94.00%), Postives = 709/733 (96.73%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. ExPASy TrEMBL
Match: A0A5D3BBW6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001260 PE=3 SV=1)

HSP 1 Score: 1391.3 bits (3600), Expect = 0.0e+00
Identity = 688/733 (93.86%), Postives = 708/733 (96.59%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. ExPASy TrEMBL
Match: A0A1S3C623 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103497087 PE=3 SV=1)

HSP 1 Score: 1391.3 bits (3600), Expect = 0.0e+00
Identity = 688/733 (93.86%), Postives = 708/733 (96.59%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           MEALSVPLISLQNF T NNNLPFRNHQILS ID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKLFTASALSSFSTLDYAR+VFDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERK IK+DLTL NAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SSIEVNGNVHEFLVGDN H LSS+IYSKLD+IATKLK VGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGL+SLAPSQPIRVVKNLRICGDCHE AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. ExPASy TrEMBL
Match: A0A0A0M0R9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G530130 PE=3 SV=1)

HSP 1 Score: 1385.5 bits (3585), Expect = 0.0e+00
Identity = 684/733 (93.32%), Postives = 707/733 (96.45%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           MEALSVP ISLQNF T NNNL FRNHQILSTID+CSSSKQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKLFTASALSSFSTLDYAR++FDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           EDLPN FTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACGDL+MA
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLF+GISCKDVVSWNSMISAFAQGN PEDAL+LFLKME ENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERK IKVDLTLCNAMLDMY+KCGS+D AQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISA EQNGKPKEALA FNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEE+DVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPN+VTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           R FFHEMEPVYGVVP  KHYACMVDILGRAGFLEEAMELINEM  TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SSIE NGNVHEFLVGDN+H LSS+IYSKL+EIATKLKSVGYEPNKSHLLQLIE+DDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH  AKLVSRVYDR+ILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. ExPASy TrEMBL
Match: A0A6J1HLG4 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111464676 PE=3 SV=1)

HSP 1 Score: 1359.4 bits (3517), Expect = 0.0e+00
Identity = 666/733 (90.86%), Postives = 699/733 (95.36%), Query Frame = 0

Query: 65  MEALSVPLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSA 124
           ME LS PL+SL N    +NNL FRNHQILSTIDQCSS KQLKQVHA MLRTGLFFDPFSA
Sbjct: 1   METLSAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSA 60

Query: 125 SKLFTASALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 184
           SKL  ASAL S STL+YARDVFDQIP PNLYTWNTLIRAYASS+DPFQSFVIFL LLD+C
Sbjct: 61  SKLIAASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDEC 120

Query: 185 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMA 244
           +DLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLS GMD YILNSLVRFYGACGDLNMA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMA 180

Query: 245 ERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKK 304
           ERLFEGISCKDVVSWNSMISAFAQGN PEDAL+LFLKMEG NVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKK 240

Query: 305 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTML 364
           LDLEFGRWVCSYIERKEI VDLTLCNAMLDMY+KCGSI  A+KLFDEMPERDVFSWTTML
Sbjct: 241 LDLEFGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTML 300

Query: 365 DGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDE 424
           DGYAKMGDF+AAR+VFD MPVKEIAAWN LISA E+NGKPKEALATFNELQ+SKIAKPDE
Sbjct: 301 DGYAKMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDE 360

Query: 425 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 484
           VTLVS+LSACAQLGAIDLGGWIHVYIKREGI+LN HLI+SL+DMYAKCGALEKALEVFY+
Sbjct: 361 VTLVSSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYA 420

Query: 485 VEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEG 544
           VEEKDVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN+VTFTN+LCACSHAGLVDEG
Sbjct: 421 VEEKDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEG 480

Query: 545 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACS 604
           RA FHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMP TPSASVWGALLGACS
Sbjct: 481 RALFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACS 540

Query: 605 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGC 664
           LHMNVELAELASDQLLKLEPRNHGAI+LLSN+YAKTGRW+KVSELRKLMRDSELKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGC 600

Query: 665 SSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQ 724
           SS+EVNG VHEFLVGDNSH LS DIYSKLDEIA KLKSVGYEPNKSHLLQLIE+DD+KE 
Sbjct: 601 SSVEVNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEH 660

Query: 725 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFH 784
           ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKL+SRVY+R+IL++DRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFH 720

Query: 785 HFRDGHCSCMDYW 798
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of HG10021841 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 939.9 bits (2428), Expect = 1.4e-273
Identity = 447/727 (61.49%), Postives = 569/727 (78.27%), Query Frame = 0

Query: 71  PLISLQNFPTPNNNLPFRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTA 130
           P  S  N PT NN       + +S I++C S +QLKQ H HM+RTG F DP+SASKLF  
Sbjct: 16  PNFSNPNQPTTNN----ERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75

Query: 131 SALSSFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 190
           +ALSSF++L+YAR VFD+IP+PN + WNTLIRAYAS  DP  S   FLD++ + +  PN 
Sbjct: 76  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135

Query: 191 FTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEG 250
           +TFPF+IKAA+E+ +  +G+++HGMA+K + G D+++ NSL+  Y +CGDL+ A ++F  
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195

Query: 251 ISCKDVVSWNSMISAFAQGNFPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFG 310
           I  KDVVSWNSMI+ F Q   P+ AL+LF KME E+V  + VTMVGVLSACAK  +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255

Query: 311 RWVCSYIERKEIKVDLTLCNAMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKM 370
           R VCSYIE   + V+LTL NAMLDMY+KCGSI+ A++LFD M E+D  +WTTMLDGYA  
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315

Query: 371 GDFDAARQVFDAMPVKEIAAWNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVST 430
            D++AAR+V ++MP K+I AWN LISA EQNGKP EAL  F+ELQ+ K  K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375

Query: 431 LSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDV 490
           LSACAQ+GA++LG WIH YIK+ GI +N H+ S+L+ MY+KCG LEK+ EVF SVE++DV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435

Query: 491 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHE 550
           +VWSAMI GL MHG G  A+D+F++MQEA VKPN VTFTNV CACSH GLVDE  + FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495

Query: 551 MEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVE 610
           ME  YG+VP  KHYAC+VD+LGR+G+LE+A++ I  MPI PS SVWGALLGAC +H N+ 
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555

Query: 611 LAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVN 670
           LAE+A  +LL+LEPRN GA VLLSNIYAK G+WE VSELRK MR + LKKEPGCSSIE++
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615

Query: 671 GNVHEFLVGDNSHLLSSDIYSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHS 730
           G +HEFL GDN+H +S  +Y KL E+  KLKS GYEP  S +LQ+IE++++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675

Query: 731 EKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGH 790
           EKLAI +GLIS    + IRV+KNLR+CGDCH VAKL+S++YDREI++RDRYRFHHFR+G 
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735

Query: 791 CSCMDYW 798
           CSC D+W
Sbjct: 736 CSCNDFW 738

BLAST of HG10021841 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 605.1 bits (1559), Expect = 7.9e-173
Identity = 321/768 (41.80%), Postives = 459/768 (59.77%), Query Frame = 0

Query: 68  LSVPLISLQNFPTPNNNLP----FRNHQILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFS 127
           L+VP  S      P+++ P     RNH  LS +  C + + L+ +HA M++ GL    ++
Sbjct: 8   LTVPSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYA 67

Query: 128 ASKLFTASALS-SFSTLDYARDVFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLD 187
            SKL     LS  F  L YA  VF  I +PNL  WNT+ R +A SSDP  +  +++ ++ 
Sbjct: 68  LSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI- 127

Query: 188 KCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNS------------ 247
               LPN++TFPFV+K+ ++ KA + G+ +HG  +KL   +DLY+  S            
Sbjct: 128 SLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLE 187

Query: 248 -------------------LVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNF 307
                              L++ Y + G +  A++LF+ I  KDVVSWN+MIS +A+   
Sbjct: 188 DAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGN 247

Query: 308 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 367
            ++AL+LF  M   NV P+  TMV V+SACA+   +E GR V  +I+      +L + NA
Sbjct: 248 YKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNA 307

Query: 368 MLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAW 427
           ++D+YSKCG ++ A  LF+ +P +DV SW T++ GY  M  +                  
Sbjct: 308 LIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY------------------ 367

Query: 428 NVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 487
                        KEAL  F E+  S    P++VT++S L ACA LGAID+G WIHVYI 
Sbjct: 368 -------------KEALLLFQEMLRSG-ETPNDVTMLSILPACAHLGAIDIGRWIHVYID 427

Query: 488 R--EGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAA 547
           +  +G+     L +SL+DMYAKCG +E A +VF S+  K +  W+AMI G  MHGR  A+
Sbjct: 428 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 487

Query: 548 IDLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVD 607
            DLF  M++  ++P+++TF  +L ACSH+G++D GR  F  M   Y + P  +HY CM+D
Sbjct: 488 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 547

Query: 608 ILGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGA 667
           +LG +G  +EA E+IN M + P   +W +LL AC +H NVEL E  ++ L+K+EP N G+
Sbjct: 548 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 607

Query: 668 IVLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDI 727
            VLLSNIYA  GRW +V++ R L+ D  +KK PGCSSIE++  VHEF++GD  H  + +I
Sbjct: 608 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 667

Query: 728 YSKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIR 787
           Y  L+E+   L+  G+ P+ S +LQ +E ++ KE AL  HSEKLAIAFGLIS  P   + 
Sbjct: 668 YGMLEEMEVLLEKAGFVPDTSEVLQEME-EEWKEGALRHHSEKLAIAFGLISTKPGTKLT 727

Query: 788 VVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
           +VKNLR+C +CHE  KL+S++Y REI+ RDR RFHHFRDG CSC DYW
Sbjct: 728 IVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of HG10021841 vs. TAIR 10
Match: AT4G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 566.2 bits (1458), Expect = 4.1e-161
Identity = 281/713 (39.41%), Postives = 438/713 (61.43%), Query Frame = 0

Query: 92  ILSTIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQ 151
           IL  +  C S   +KQ+HAH+LRT    +    S LF  S  SS   L YA +VF  IP 
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 152 -PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGR 211
            P    +N  +R  + SS+P ++ ++F   +       + F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 212 AVHGMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGN 271
            +HG+A K++   D ++    +  Y +CG +N A  +F+ +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 272 FPEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCN 331
             ++A  LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++   ++++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 332 AMLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAA 391
           A++ MY+  G +D+A++ F +M  R++F  T M+ GY+K G  D A+ +FD    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 392 WNVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 451
           W  +ISA  ++  P+EAL  F E+  S I KPD V++ S +SACA LG +D   W+H  I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 452 KREGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAI 511
              G++    + ++L++MYAKCG L+   +VF  +  ++V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 512 DLFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDI 571
            LF  M++  V+PN VTF  VL  CSH+GLV+EG+  F  M   Y + P  +HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 572 LGRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAI 631
            GRA  L EA+E+I  MP+  +  +WG+L+ AC +H  +EL + A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 632 VLLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIY 691
           VL+SNIYA+  RWE V  +R++M +  + KE G S I+ NG  HEFL+GD  H  S++IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 692 SKLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQP--- 751
           +KLDE+ +KLK  GY P+   +L  +E+++ K+  L  HSEKLA+ FGL++    +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 752 ---IRVVKNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
              IR+VKNLR+C DCH   KLVS+VY+REI++RDR RFH +++G CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of HG10021841 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 543.5 bits (1399), Expect = 2.8e-154
Identity = 279/706 (39.52%), Postives = 425/706 (60.20%), Query Frame = 0

Query: 94  STIDQCSSSKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQPN 153
           S ID  +   QLKQ+HA +L  GL F  F  +KL  AS  SSF  + +AR VFD +P+P 
Sbjct: 26  SLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS--SSFGDITFARQVFDDLPRPQ 85

Query: 154 LYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVH 213
           ++ WN +IR Y S ++ FQ  ++    +      P++FTFP ++KA S L   ++GR VH
Sbjct: 86  IFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVH 145

Query: 214 GMAIKLSFGMDLYILNSLVRFYGACGDLNMAERLFEGISC--KDVVSWNSMISAFAQGNF 273
               +L F  D+++ N L+  Y  C  L  A  +FEG+    + +VSW +++SA+AQ   
Sbjct: 146 AQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGE 205

Query: 274 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNA 333
           P +AL++F +M   +V P+ V +V VL+A     DL+ GR + + + +  ++++  L  +
Sbjct: 206 PMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLIS 265

Query: 334 MLDMYSKCGSIDVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAW 393
           +  MY+KCG +  A+ LFD+M   ++  W  M+ GYAK                      
Sbjct: 266 LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK---------------------- 325

Query: 394 NVLISACEQNGKPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 453
                    NG  +EA+  F+E+ ++K  +PD +++ S +SACAQ+G+++    ++ Y+ 
Sbjct: 326 ---------NGYAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVG 385

Query: 454 REGIDLNCHLISSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAID 513
           R     +  + S+L+DM+AKCG++E A  VF    ++DV VWSAMI G G+HGR + AI 
Sbjct: 386 RSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAIS 445

Query: 514 LFFEMQEAKVKPNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDIL 573
           L+  M+   V PN+VTF  +L AC+H+G+V EG  FF+ M   + + P  +HYAC++D+L
Sbjct: 446 LYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 505

Query: 574 GRAGFLEEAMELINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIV 633
           GRAG L++A E+I  MP+ P  +VWGALL AC  H +VEL E A+ QL  ++P N G  V
Sbjct: 506 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 565

Query: 634 LLSNIYAKTGRWEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYS 693
            LSN+YA    W++V+E+R  M++  L K+ GCS +EV G +  F VGD SH    +I  
Sbjct: 566 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 625

Query: 694 KLDEIATKLKSVGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVV 753
           +++ I ++LK  G+  NK   L  + D++  E+ L  HSE++AIA+GLIS     P+R+ 
Sbjct: 626 QVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHSERIAIAYGLISTPQGTPLRIT 685

Query: 754 KNLRICGDCHEVAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
           KNLR C +CH   KL+S++ DREI++RD  RFHHF+DG CSC DYW
Sbjct: 686 KNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of HG10021841 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 525.8 bits (1353), Expect = 6.1e-149
Identity = 264/695 (37.99%), Postives = 414/695 (59.57%), Query Frame = 0

Query: 107 QVHAHMLRTGLFFDPFSASKLFTASALSSF----STLDYARDVFDQIPQPNLYTWNTLIR 166
           Q+H  +++ G       A  LF  ++L  F      LD AR VFD++ + N+ +W ++I 
Sbjct: 155 QIHGLIVKMGY------AKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMIC 214

Query: 167 AYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFG 226
            YA       +  +F  ++   E  PN+ T   VI A ++L+    G  V+         
Sbjct: 215 GYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIE 274

Query: 227 MDLYILNSLVRFYGACGDLNMAERLFEGISCKDVVSWNSMISAFAQGNFPEDALDLFLKM 286
           ++  ++++LV  Y  C  +++A+RLF+     ++   N+M S + +     +AL +F  M
Sbjct: 275 VNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLM 334

Query: 287 EGENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYSKCGSI 346
               V P+ ++M+  +S+C++  ++ +G+    Y+ R   +    +CNA++DMY KC   
Sbjct: 335 MDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQ 394

Query: 347 DVAQKLFDEMPERDVFSWTTMLDGYAKMGDFDAARQVFDAMPVKEIAAWNVLISACEQNG 406
           D A ++FD M  + V +W +++ GY + G+ DAA + F+ MP K I +WN +IS   Q  
Sbjct: 395 DTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGS 454

Query: 407 KPKEALATFNELQVSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLI 466
             +EA+  F  +Q  +    D VT++S  SAC  LGA+DL  WI+ YI++ GI L+  L 
Sbjct: 455 LFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLG 514

Query: 467 SSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVK 526
           ++LVDM+++CG  E A+ +F S+  +DV  W+A I  + M G  + AI+LF +M E  +K
Sbjct: 515 TTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLK 574

Query: 527 PNNVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAME 586
           P+ V F   L ACSH GLV +G+  F+ M  ++GV P   HY CMVD+LGRAG LEEA++
Sbjct: 575 PDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQ 634

Query: 587 LINEMPITPSASVWGALLGACSLHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGR 646
           LI +MP+ P+  +W +LL AC +  NVE+A  A++++  L P   G+ VLLSN+YA  GR
Sbjct: 635 LIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGR 694

Query: 647 WEKVSELRKLMRDSELKKEPGCSSIEVNGNVHEFLVGDNSHLLSSDIYSKLDEIATKLKS 706
           W  ++++R  M++  L+K PG SSI++ G  HEF  GD SH    +I + LDE++ +   
Sbjct: 695 WNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASH 754

Query: 707 VGYEPNKSHLLQLIEDDDLKEQALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHE 766
           +G+ P+ S++L  +++ + K   LS HSEKLA+A+GLIS      IR+VKNLR+C DCH 
Sbjct: 755 LGHVPDLSNVLMDVDEKE-KIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHS 814

Query: 767 VAKLVSRVYDREILLRDRYRFHHFRDGHCSCMDYW 798
            AK  S+VY+REI+LRD  RFH+ R G CSC D+W
Sbjct: 815 FAKFASKVYNREIILRDNNRFHYIRQGKCSCGDFW 842

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038893523.10.0e+0095.36pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa ... [more]
KAA0031814.10.0e+0094.00pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_008457379.10.0e+0093.86PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ... [more]
XP_004145320.10.0e+0093.32pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sa... [more]
XP_022964665.10.0e+0090.86pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
O823801.9e-27261.49Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9LN011.1e-17141.80Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
O233375.7e-16039.41Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX... [more]
Q9LTV84.0e-15339.52Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9LUJ28.6e-14837.99Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A5A7SKX20.0e+0094.00Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5D3BBW60.0e+0093.86Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3C6230.0e+0093.86pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis ... [more]
A0A0A0M0R90.0e+0093.32DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G5301... [more]
A0A6J1HLG40.0e+0090.86pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT2G29760.11.4e-27361.49Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.17.9e-17341.80Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G14820.14.1e-16139.41Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.12.8e-15439.52mitochondrial editing factor 22 [more]
AT3G22690.26.1e-14937.99INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 155..181
e-value: 0.072
score: 13.4
coord: 228..250
e-value: 0.88
score: 10.0
coord: 390..417
e-value: 3.2E-5
score: 23.9
coord: 564..587
e-value: 0.0022
score: 18.1
coord: 329..356
e-value: 3.8E-5
score: 23.7
coord: 358..386
e-value: 3.5E-6
score: 26.9
coord: 635..656
e-value: 0.9
score: 9.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 491..524
e-value: 5.8E-7
score: 27.3
coord: 329..357
e-value: 5.1E-4
score: 18.0
coord: 358..385
e-value: 9.0E-7
score: 26.7
coord: 526..559
e-value: 4.2E-4
score: 18.3
coord: 564..587
e-value: 0.003
score: 15.6
coord: 390..423
e-value: 3.2E-4
score: 18.7
coord: 257..290
e-value: 4.7E-6
score: 24.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 488..536
e-value: 1.7E-10
score: 40.9
coord: 254..303
e-value: 1.1E-9
score: 38.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 489..523
score: 10.884628
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 325..355
score: 9.941957
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 356..390
score: 11.443655
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 255..289
score: 11.73961
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 390..551
e-value: 4.7E-36
score: 126.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 553..676
e-value: 3.2E-12
score: 48.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 209..308
e-value: 2.6E-20
score: 74.5
coord: 81..208
e-value: 8.8E-9
score: 36.9
coord: 309..389
e-value: 4.4E-20
score: 73.8
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 367..646
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 662..787
e-value: 4.7E-35
score: 120.2
NoneNo IPR availablePANTHERPTHR47926:SF99BNAANNG32650D PROTEINcoord: 368..783
coord: 76..250
coord: 239..369
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 368..783
coord: 76..250
coord: 239..369

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10021841.1HG10021841.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding