HG10001188 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10001188
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr09: 14780866 .. 14782944 (-)
RNA-Seq Expression: HG10001188
Synteny: HG10001188
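
The Location field packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it and derive the length of the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state; `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

# Pattern for strings such as "Chr09: 14780866 .. 14782944 (-)".
LOCATION = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> dict:
    """Split a location string into chromosome, start, end, strand, and length."""
    match = LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: both endpoints are part of the feature (1-based, inclusive).
        "length": end - start + 1,
    }


loc = parse_location("Chr09: 14780866 .. 14782944 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span is 2079 bp. With half-open coordinates the `+ 1` would be dropped.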
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACATTATTATCTGCCTTGAAAACTTGCGCTAGCTCCAAATTACTGAAACAAGGCAAGCTTCTTCATCAGAAAATATTTTCTTTAGGCTTTCAATCCAACATCCCCCTCTGCAAAACCCTCATCGGCTTTTACTTTTCCTGCCATGATTACAGATCAGCTGAGCTTGTTTTCCAGACCAACGATTCCCCATTGGATGTTTCTCTGTGGAATGCTCTTCTCTCTGCTAACACCAAGAACTTCAGGTTCGTTGAAGCTCTGCAACTCTTTCACCAATTGAACTGTCATTCTTATGTAAGACCTGATTGTTACACTTACCCAGTTGTTCTTAAGGCGTGTGGTGGATTGGGTAGAGTTGTTTATGGGAGAAGAATCCATAACCATTTGATAAAAACGGGTTTGATATGGGATGTTTTTGTGGGGAGTTCTATGATGAATATGTATGCGAAATGTGATCAGTTTCATGATGCCATTAACCTGTTCGATGAATTTCCTCAGAGAGATGTGGGGTGTTGGAACACAGTTATCTCTTGTTATTTTAAAGATGATAAGGCTGAGACTGCCCTGAAGATGTTCGATAAAATGAAAGAGTTGGGTTTTGAGCCTAATTCAGTAACTTTTACTGTTGCTGTCTCATCATGTACAAGGCTTTTGAATTTGGAAAGAGGTAAGGAGATTCAGAGGGAGTTGATAGATAGAGGGACTTTGTTGGATGCTTTTGTTCTTTCTGCGCTTGTGGATATGTATGGAAAATGTGGTTGTTTAGAAATGGCCAAAGAAGTTTTTGAGCAAATCCCAAGGAAGAATACGATCACTTGGAATGCTATGATCACAGGGTACAGTTTGAAAGGTGATAGCAGATCCTGCATTGAACTTCTCATGAGGATGAATGATGAAGGATCCAAACCGACTTTGACGACTTTGACCAGCATTATATATGCTAGCTCAAGGTCTGTTCATCTTCGGCACGGAAAATTTATACATGGATATATTTTGAGAAATAGAATAGATGTTGATATCTTCATCGACGTTTCTCTCATTGATTTCTACTTTAAATGTGGATATGTTTCTTCAGCTGAAACCATCTTCAGAAGTATATCCAAGAATGAAGTAGTTTCTTGGAATGTCATGATTTCCGGATATGTCATGTTGGGTAATCACATTCTGGCTCTCCGTACCTATGATAATATGAAAGAACATCATGTAAAACCAGATGCCGTAACATTTTCTAGCACCTTACCAGCTTGTTCACAGCTAGCAGCCTTGGATAAGGGTAGGGAGCTTCACCACTGCATCATCAGTCATAAGTTGGAATCCAATAAAATCGTTATGGGGGCTCTTCTTGATATGTATGCTAAATGTGGTGATGTCGATGAAGCACGGAAACTCTTTCATCAACTACCAAAGAGGGATCTTGTGTCGTGGACATCAATGATTGCTGCTTATGGATCTCATGGCCAGGCTTCAGAAGCTTTGAGGCTTTTTGATGAAATGCAGAAGTCGAACGTACAGGCTGATTCAGTTACATTCCTAGCAGTTCTATCTGCTTGTAGCCATGCTGGATTGGTTGATGAAGGCTACAAGTATTTCAACGAAATGATCATTCAGTATGACATTAAGCCCGGGATTGAACAGTATTCATGCTTGATAGATCTACTTGGACGTGCTGGAAGACTACATGAAGCTTATGAGATTCTCCAAAGATCAAAAGAGACTAGGAACGATATCGGATTGTTAAGCACACTGTTTTCTGCATGTCGCTTGCATAATGATTTTGTTTTAGGCATACAAATCGGGAAAATGCTCATAGAGATAGATCCTGATGATCCATCTACTTACATTTTGCTGTCGAATATGTATGCTTCTGTCAATAAATGGGAGGAGGTATGTAAAGTACGACGAAAAATGAGAGAAATAGGATTGAAGAAGAACCCTGGTTGCAGCTGGATTGAGATAAACCAGAGGATCCATCCATTCTTTGTTGAAGATAAGTCAAACCC
TCTGGCTGATGGGGTTTATGAATGTCTAAACATTCTAGCTTGTCATATGGAGAAGAATGAACTAGAGTTAGTAGATTAG

mRNA sequence

ATGACATTATTATCTGCCTTGAAAACTTGCGCTAGCTCCAAATTACTGAAACAAGGCAAGCTTCTTCATCAGAAAATATTTTCTTTAGGCTTTCAATCCAACATCCCCCTCTGCAAAACCCTCATCGGCTTTTACTTTTCCTGCCATGATTACAGATCAGCTGAGCTTGTTTTCCAGACCAACGATTCCCCATTGGATGTTTCTCTGTGGAATGCTCTTCTCTCTGCTAACACCAAGAACTTCAGGTTCGTTGAAGCTCTGCAACTCTTTCACCAATTGAACTGTCATTCTTATGTAAGACCTGATTGTTACACTTACCCAGTTGTTCTTAAGGCGTGTGGTGGATTGGGTAGAGTTGTTTATGGGAGAAGAATCCATAACCATTTGATAAAAACGGGTTTGATATGGGATGTTTTTGTGGGGAGTTCTATGATGAATATGTATGCGAAATGTGATCAGTTTCATGATGCCATTAACCTGTTCGATGAATTTCCTCAGAGAGATGTGGGGTGTTGGAACACAGTTATCTCTTGTTATTTTAAAGATGATAAGGCTGAGACTGCCCTGAAGATGTTCGATAAAATGAAAGAGTTGGGTTTTGAGCCTAATTCAGTAACTTTTACTGTTGCTGTCTCATCATGTACAAGGCTTTTGAATTTGGAAAGAGGTAAGGAGATTCAGAGGGAGTTGATAGATAGAGGGACTTTGTTGGATGCTTTTGTTCTTTCTGCGCTTGTGGATATGTATGGAAAATGTGGTTGTTTAGAAATGGCCAAAGAAGTTTTTGAGCAAATCCCAAGGAAGAATACGATCACTTGGAATGCTATGATCACAGGGTACAGTTTGAAAGGTGATAGCAGATCCTGCATTGAACTTCTCATGAGGATGAATGATGAAGGATCCAAACCGACTTTGACGACTTTGACCAGCATTATATATGCTAGCTCAAGCACCTTACCAGCTTGTTCACAGCTAGCAGCCTTGGATAAGGGTAGGGAGCTTCACCACTGCATCATCAGTCATAAGTTGGAATCCAATAAAATCGTTATGGGGGCTCTTCTTGATATGTATGCTAAATGTGGTGATGTCGATGAAGCACGGAAACTCTTTCATCAACTACCAAAGAGGGATCTTGTGTCGTGGACATCAATGATTGCTGCTTATGGATCTCATGGCCAGGCTTCAGAAGCTTTGAGGCTTTTTGATGAAATGCAGAAGTCGAACGTACAGGCTGATTCAGTTACATTCCTAGCAGTTCTATCTGCTTGTAGCCATGCTGGATTGGTTGATGAAGGCTACAAGTATTTCAACGAAATGATCATTCAGTATGACATTAAGCCCGGGATTGAACAGTATTCATGCTTGATAGATCTACTTGGACGTGCTGGAAGACTACATGAAGCTTATGAGATTCTCCAAAGATCAAAAGAGACTAGGAACGATATCGGATTGTTAAGCACACTGTTTTCTGCATGTCGCTTGCATAATGATTTTGTTTTAGGCATACAAATCGGGAAAATGCTCATAGAGATAGATCCTGATGATCCATCTACTTACATTTTGCTGTCGAATATGTATGCTTCTGTCAATAAATGGGAGGAGGTATGTAAAGTACGACGAAAAATGAGAGAAATAGGATTGAAGAAGAACCCTGGTTGCAGCTGGATTGAGATAAACCAGAGGATCCATCCATTCTTTGTTGAAGATAAGTCAAACCCTCTGGCTGATGGGGTTTATGAATGTCTAAACATTCTAGCTTGTCATATGGAGAAGAATGAACTAGAGTTAGTAGATTAG

Coding sequence (CDS)

ATGACATTATTATCTGCCTTGAAAACTTGCGCTAGCTCCAAATTACTGAAACAAGGCAAGCTTCTTCATCAGAAAATATTTTCTTTAGGCTTTCAATCCAACATCCCCCTCTGCAAAACCCTCATCGGCTTTTACTTTTCCTGCCATGATTACAGATCAGCTGAGCTTGTTTTCCAGACCAACGATTCCCCATTGGATGTTTCTCTGTGGAATGCTCTTCTCTCTGCTAACACCAAGAACTTCAGGTTCGTTGAAGCTCTGCAACTCTTTCACCAATTGAACTGTCATTCTTATGTAAGACCTGATTGTTACACTTACCCAGTTGTTCTTAAGGCGTGTGGTGGATTGGGTAGAGTTGTTTATGGGAGAAGAATCCATAACCATTTGATAAAAACGGGTTTGATATGGGATGTTTTTGTGGGGAGTTCTATGATGAATATGTATGCGAAATGTGATCAGTTTCATGATGCCATTAACCTGTTCGATGAATTTCCTCAGAGAGATGTGGGGTGTTGGAACACAGTTATCTCTTGTTATTTTAAAGATGATAAGGCTGAGACTGCCCTGAAGATGTTCGATAAAATGAAAGAGTTGGGTTTTGAGCCTAATTCAGTAACTTTTACTGTTGCTGTCTCATCATGTACAAGGCTTTTGAATTTGGAAAGAGGTAAGGAGATTCAGAGGGAGTTGATAGATAGAGGGACTTTGTTGGATGCTTTTGTTCTTTCTGCGCTTGTGGATATGTATGGAAAATGTGGTTGTTTAGAAATGGCCAAAGAAGTTTTTGAGCAAATCCCAAGGAAGAATACGATCACTTGGAATGCTATGATCACAGGGTACAGTTTGAAAGGTGATAGCAGATCCTGCATTGAACTTCTCATGAGGATGAATGATGAAGGATCCAAACCGACTTTGACGACTTTGACCAGCATTATATATGCTAGCTCAAGCACCTTACCAGCTTGTTCACAGCTAGCAGCCTTGGATAAGGGTAGGGAGCTTCACCACTGCATCATCAGTCATAAGTTGGAATCCAATAAAATCGTTATGGGGGCTCTTCTTGATATGTATGCTAAATGTGGTGATGTCGATGAAGCACGGAAACTCTTTCATCAACTACCAAAGAGGGATCTTGTGTCGTGGACATCAATGATTGCTGCTTATGGATCTCATGGCCAGGCTTCAGAAGCTTTGAGGCTTTTTGATGAAATGCAGAAGTCGAACGTACAGGCTGATTCAGTTACATTCCTAGCAGTTCTATCTGCTTGTAGCCATGCTGGATTGGTTGATGAAGGCTACAAGTATTTCAACGAAATGATCATTCAGTATGACATTAAGCCCGGGATTGAACAGTATTCATGCTTGATAGATCTACTTGGACGTGCTGGAAGACTACATGAAGCTTATGAGATTCTCCAAAGATCAAAAGAGACTAGGAACGATATCGGATTGTTAAGCACACTGTTTTCTGCATGTCGCTTGCATAATGATTTTGTTTTAGGCATACAAATCGGGAAAATGCTCATAGAGATAGATCCTGATGATCCATCTACTTACATTTTGCTGTCGAATATGTATGCTTCTGTCAATAAATGGGAGGAGGTATGTAAAGTACGACGAAAAATGAGAGAAATAGGATTGAAGAAGAACCCTGGTTGCAGCTGGATTGAGATAAACCAGAGGATCCATCCATTCTTTGTTGAAGATAAGTCAAACCCTCTGGCTGATGGGGTTTATGAATGTCTAAACATTCTAGCTTGTCATATGGAGAAGAATGAACTAGAGTTAGTAGATTAG

Protein sequence

MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQTNDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVVYGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYFKDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAFVLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGSKPTLTTLTSIIYASSSTLPACSQLAALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKWEEVCKVRRKMREIGLKKNPGCSWIEINQRIHPFFVEDKSNPLADGVYECLNILACHMEKNELELVD
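
The protein sequence above should correspond to a translation of the CDS. The sketch below checks this on the first and last codons of the CDS shown on this page. It assumes the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, and unknown codons are reported as `X`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 nucleotides of the CDS listed on this page.
cds_start = "ATGACATTATTATCTGCCTTGAAAACTTGC"
cds_end = "GAACTAGAGTTAGTAGATTAG"

print(translate(cds_start))  # MTLLSALKTC
print(translate(cds_end))    # ELELVD
```

Both fragments agree with the start (`MTLLSALKTC`) and end (`ELELVD`) of the protein sequence listed above.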
Homology
BLAST of HG10001188 vs. NCBI nr
Match: XP_038902098.1 (pentatricopeptide repeat-containing protein At5g27110 [Benincasa hispida])

HSP 1 Score: 1092.0 bits (2823), Expect = 0.0e+00
Identity = 553/690 (80.14%), Positives = 573/690 (83.04%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           M LLSAL+TCASSKLLKQGKL+HQKIFSLGFQSNI LCKTLIGFYFSCHDYRSAELVFQT
Sbjct: 4   MILLSALRTCASSKLLKQGKLIHQKIFSLGFQSNIVLCKTLIGFYFSCHDYRSAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           N SPLDVSLWNALLSA TKNF FVEALQLF QLNCHS+VRPDCYTYPVVLKACGGLGRVV
Sbjct: 64  NGSPLDVSLWNALLSAYTKNFGFVEALQLFDQLNCHSHVRPDCYTYPVVLKACGGLGRVV 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHLIKTGLIWDVFVGSS+MNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF
Sbjct: 124 YGRRIHNHLIKTGLIWDVFVGSSLMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           KDDKAETALKMFDKMKELGFEPNSVTFTV VSSCTRLLNLERGKEI RELIDRG LLDAF
Sbjct: 184 KDDKAETALKMFDKMKELGFEPNSVTFTVVVSSCTRLLNLERGKEIHRELIDRGILLDAF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCG  EMAKE+FEQIPRKN ITWNAMITGYSLKGDSRSCIELLMRMNDEG
Sbjct: 244 VLSALVDMYGKCGYFEMAKEIFEQIPRKNAITWNAMITGYSLKGDSRSCIELLMRMNDEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           +KPTLTTLTSII+AS                                             
Sbjct: 304 TKPTLTTLTSIIHASSRSVQLRHGKFIHGYILRNRIDVDIFIDVSLIDFYFKCGYVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TIFRNISKNEVVSWNVMISGYVMVGNHIQALRIYDNMKEHHVKPDAITFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           AL+KGRELHHCIISHKLE+N+IVMGALLDMYAKCGDVD+A KLFHQLPKRDLVSWTSMI 
Sbjct: 424 ALEKGRELHHCIISHKLETNEIVMGALLDMYAKCGDVDDAWKLFHQLPKRDLVSWTSMIT 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFDEMQKSNV+ADSVTFLAVLSACSHAGLVDEGYKYFNEMI Q +IK
Sbjct: 484 AYGSHGQASEALRLFDEMQKSNVRADSVTFLAVLSACSHAGLVDEGYKYFNEMIAQ-NIK 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 597
             IE YSCLIDLLGRAGRLHEAYEILQRSKETR DIGLLSTLFSACRLHNDFVLG+QIGK
Sbjct: 544 ASIEHYSCLIDLLGRAGRLHEAYEILQRSKETRKDIGLLSTLFSACRLHNDFVLGVQIGK 603
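
The percentages in each HSP header are the reported counts divided by the alignment length. A minimal sketch, using the counts from the hit above, reproduces them; `percent` is a hypothetical helper, and rounding to two decimals is assumed from the values shown on this page.

```python
def percent(count: int, aligned_length: int) -> str:
    """Return count / aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100 * count / aligned_length:.2f}%"


# Counts taken from the HSP header of the XP_038902098.1 hit.
identity = percent(553, 690)
similarity = percent(573, 690)
print(identity, similarity)  # 80.14% 83.04%
```

The alignment length (690) is longer than the query (597 residues) because it includes gap columns, so these percentages are relative to the alignment, not to the query protein.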

BLAST of HG10001188 vs. NCBI nr
Match: TYJ96117.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 538/690 (77.97%), Positives = 571/690 (82.75%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           +TLLSAL+TC SSKLLKQGKL+HQ+IFS GFQSNI LCK+LIGFYFSCHDY SAELVFQT
Sbjct: 4   VTLLSALRTCTSSKLLKQGKLIHQRIFSCGFQSNILLCKSLIGFYFSCHDYASAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           N+ PLDVSLWN+LLSA T NFRFVEALQLF QLNC+S+VRPD YTYPVVLKACGGLGRV+
Sbjct: 64  NECPLDVSLWNSLLSAYTNNFRFVEALQLFDQLNCNSHVRPDFYTYPVVLKACGGLGRVI 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHL+KTGLIWDVFVGSS+MNMYAKCDQF DAI LFDEFPQRDVGCWN VISCYF
Sbjct: 124 YGRRIHNHLLKTGLIWDVFVGSSLMNMYAKCDQFVDAIKLFDEFPQRDVGCWNAVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           KD KAETALK FDKMKELGFEPNSVTFTV VSSCTRLLNL+RGKEI RELIDR  LLDAF
Sbjct: 184 KDGKAETALKTFDKMKELGFEPNSVTFTVVVSSCTRLLNLKRGKEIHRELIDRRILLDAF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCGCLEMAKEVFEQIPRKN ITWNAMITGYSLKGDSRSCIELLMRMNDEG
Sbjct: 244 VLSALVDMYGKCGCLEMAKEVFEQIPRKNAITWNAMITGYSLKGDSRSCIELLMRMNDEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           +KPTLTTLTSI+YAS                                             
Sbjct: 304 TKPTLTTLTSIVYASSRSVQLRHGKFIHGYILRNRIDVDIFIDVSLIDFYFKCGYVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TIFRSISKNEVVSWNVMVSGYVMVGNHIQALRIYDNMKEHHVKPDALTFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           ALDKGRELH+CII+HKLE+N+IVMGALLDMYAKCGDVDEA+KLFHQLPKRDLVSWTSMI+
Sbjct: 424 ALDKGRELHYCIINHKLEANEIVMGALLDMYAKCGDVDEAQKLFHQLPKRDLVSWTSMIS 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFDEMQKSNV+ADSV FLAVLSACSHAGLVDEGY YFNEMI+QYDIK
Sbjct: 484 AYGSHGQASEALRLFDEMQKSNVRADSVAFLAVLSACSHAGLVDEGYTYFNEMIVQYDIK 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 597
           PGIE YSCLIDLLGRAGRLHEAYEILQRSKET++DIGLLSTLFSACRLHNDFVLGIQIGK
Sbjct: 544 PGIEHYSCLIDLLGRAGRLHEAYEILQRSKETKSDIGLLSTLFSACRLHNDFVLGIQIGK 603

BLAST of HG10001188 vs. NCBI nr
Match: XP_004140282.1 (pentatricopeptide repeat-containing protein At5g27110 [Cucumis sativus] >KGN48106.1 hypothetical protein Csa_003806 [Cucumis sativus])

HSP 1 Score: 1070.1 bits (2766), Expect = 6.9e-309
Identity = 538/690 (77.97%), Positives = 565/690 (81.88%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           +TLLSAL+TC SSKLLKQGKL+HQ+IFS GFQSNI L K+LIGFYFSCHDY SAELVFQT
Sbjct: 4   VTLLSALRTCTSSKLLKQGKLIHQRIFSCGFQSNIVLSKSLIGFYFSCHDYASAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           ND PLDVSLWNALLSA T NFRFVEALQLF QLNC+SYVRPD YTYPVVLKACGGLGRV+
Sbjct: 64  NDCPLDVSLWNALLSAYTNNFRFVEALQLFDQLNCNSYVRPDFYTYPVVLKACGGLGRVI 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHL+KTGLIWDVFVGSS+MNMYAKCDQF DAI LFDEFPQRDVGCWN VISCYF
Sbjct: 124 YGRRIHNHLLKTGLIWDVFVGSSLMNMYAKCDQFVDAIKLFDEFPQRDVGCWNAVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           KD KAE ALK FDKMKELGFEPNSVTFTV VSSCTRLLNLERGKE+ RELI+R  LLDAF
Sbjct: 184 KDGKAEMALKTFDKMKELGFEPNSVTFTVVVSSCTRLLNLERGKEVHRELIERRILLDAF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCGCLEMAKEVFE+IPRKN ITWNAMITGYSLKGDSRSCIELLMRMNDEG
Sbjct: 244 VLSALVDMYGKCGCLEMAKEVFEKIPRKNAITWNAMITGYSLKGDSRSCIELLMRMNDEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           +KPTL TLTSIIYAS                                             
Sbjct: 304 TKPTLMTLTSIIYASSRSVQLRHGKFIHGYILRNRIDVDIFIDVSLIDFYFKCGYVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TIFRTISKNEVVSWNVMISGHVMVGNHIQALHIYDNMKEHHVKPDALTFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           ALDKGRELH+CII+HKLE+N+IVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMI 
Sbjct: 424 ALDKGRELHYCIINHKLEANEIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIF 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFDEMQK NV+ADSVTFLAVLSACSHAGLVDEGY YFNEM++QYDIK
Sbjct: 484 AYGSHGQASEALRLFDEMQKLNVRADSVTFLAVLSACSHAGLVDEGYMYFNEMVVQYDIK 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 597
           PGIE YSCLIDLLGRAGRLHEAYEILQRSKETR+DIGLLSTLFSAC LHN+FVLGIQIGK
Sbjct: 544 PGIEHYSCLIDLLGRAGRLHEAYEILQRSKETRSDIGLLSTLFSACLLHNNFVLGIQIGK 603

BLAST of HG10001188 vs. NCBI nr
Match: XP_023513314.1 (pentatricopeptide repeat-containing protein At5g27110 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1059.3 bits (2738), Expect = 1.2e-305
Identity = 531/690 (76.96%), Positives = 559/690 (81.01%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           MTLLSAL+TCASSKLLKQGKL+HQ+IF  GFQSN+ LCK LIGFYFSC+DYRSAELVFQT
Sbjct: 4   MTLLSALRTCASSKLLKQGKLIHQRIFCSGFQSNVTLCKALIGFYFSCYDYRSAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           ND PLDVSLWNALLSA TK+ RFVEALQLF QL  HS+VRPDCYTYPVVLKACGGLGRVV
Sbjct: 64  NDCPLDVSLWNALLSAYTKSSRFVEALQLFDQLKSHSHVRPDCYTYPVVLKACGGLGRVV 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDA+NLFDE PQRDVGCWN VISCYF
Sbjct: 124 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAVNLFDEIPQRDVGCWNAVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           +DDK E ALKMFDKMKELGFEPNSVTFTV VSSC RLLNL RGKEI RELIDR  LLDAF
Sbjct: 184 QDDKPEAALKMFDKMKELGFEPNSVTFTVVVSSCARLLNLGRGKEIHRELIDRSVLLDAF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCGCLEMAKEVFEQIPRKN +TWN+MITGYSLKGDSRSCIELLMRMNDEG
Sbjct: 244 VLSALVDMYGKCGCLEMAKEVFEQIPRKNAMTWNSMITGYSLKGDSRSCIELLMRMNDEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           +KPTLTTLTSIIYAS                                             
Sbjct: 304 TKPTLTTLTSIIYASSRSVQLRHGKFIHGYILRNRIDVDIFIDVSLIDLYFKCGSVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TVFRNVSKHEVVSWNVMISGYVMVGNHIQALRVYDNMKEHHVKPDAVTFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           ALDKGRELHHCIISHKLE N+IVMGALLDMYAKCGDV+EARKLFHQLP+RDLVSWTSMI 
Sbjct: 424 ALDKGRELHHCIISHKLECNEIVMGALLDMYAKCGDVNEARKLFHQLPERDLVSWTSMIT 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFDEMQKSNV+ADSVTFLAVLSACSHAGLVDEGY YFNEMI QYDI+
Sbjct: 484 AYGSHGQASEALRLFDEMQKSNVRADSVTFLAVLSACSHAGLVDEGYIYFNEMITQYDIR 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 597
           PGIE YSCLIDLLGRAGRLHEAYEIL+RS+ETRND GLLSTLFSACRLHNDF LGI+IGK
Sbjct: 544 PGIEHYSCLIDLLGRAGRLHEAYEILERSEETRNDTGLLSTLFSACRLHNDFGLGIKIGK 603

BLAST of HG10001188 vs. NCBI nr
Match: KAG7010290.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1058.9 bits (2737), Expect = 1.6e-305
Identity = 531/690 (76.96%), Positives = 559/690 (81.01%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           MTLLSAL+TCASSKLLKQGKL+HQ+IF  GFQSN+ LCK LIGFYFSC+DYRSAELVFQT
Sbjct: 4   MTLLSALRTCASSKLLKQGKLIHQRIFCSGFQSNVTLCKALIGFYFSCYDYRSAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           ND PLDVSLWNALLSA TK+ RFVEALQLF QL  HS+VRPDCYTYPVVLKACGGLGRVV
Sbjct: 64  NDCPLDVSLWNALLSAYTKSSRFVEALQLFDQLKSHSHVRPDCYTYPVVLKACGGLGRVV 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDA+NLFDE PQRDVGCWN VISCYF
Sbjct: 124 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAVNLFDEIPQRDVGCWNAVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           +DDK E ALKMFDKMKELGFEPNSVTFTV VSSC RLLNLERGKEI RELIDR  LLDAF
Sbjct: 184 QDDKPEAALKMFDKMKELGFEPNSVTFTVVVSSCARLLNLERGKEIHRELIDRSVLLDAF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCGCLEMAKEVFEQIPRKN +TWN+MITGYSLKGDSRSCIELLMRMN EG
Sbjct: 244 VLSALVDMYGKCGCLEMAKEVFEQIPRKNAMTWNSMITGYSLKGDSRSCIELLMRMNCEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           ++PTLTTLTSIIYAS                                             
Sbjct: 304 TEPTLTTLTSIIYASSRSIQLRHGKFIHGYILRNRIDVDIFIDVSLIDLYFKCGSVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TVFRNVSKNEVVSWNVMISGYVMVGNHIQALRVYDNMKEHHVKPDAVTFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           ALDKGRELHHCIISHKLE N+IVMGALLDMYAKCGDV+EARKLFHQLP+RDLVSWTSMI 
Sbjct: 424 ALDKGRELHHCIISHKLECNEIVMGALLDMYAKCGDVNEARKLFHQLPERDLVSWTSMIT 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFDEMQKSNV+ADSVTFLAVLSACSHAGLVDEGY YFNEMI QYDI+
Sbjct: 484 AYGSHGQASEALRLFDEMQKSNVRADSVTFLAVLSACSHAGLVDEGYIYFNEMITQYDIR 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 597
           PGIE YSCLIDLLGRAGRLHEAYEIL+RS+ETRND GLLSTLFSACRLHNDF LGIQIGK
Sbjct: 544 PGIEHYSCLIDLLGRAGRLHEAYEILERSEETRNDTGLLSTLFSACRLHNDFGLGIQIGK 603

BLAST of HG10001188 vs. ExPASy Swiss-Prot
Match: O04659 (Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E14 PE=2 SV=2)

HSP 1 Score: 627.5 bits (1617), Expect = 1.6e-178
Identity = 320/686 (46.65%), Positives = 446/686 (65.01%), Query Frame = 0

Query: 3   LLSALKTCA-SSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQTN 62
           LLS L+ C  S+K L++ KL+HQ+I +LG + ++ LCK+LI  YF+C D+ SA  VF+  
Sbjct: 6   LLSLLRECTNSTKSLRRIKLVHQRILTLGLRRDVVLCKSLINVYFTCKDHCSARHVFENF 65

Query: 63  DSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVVY 122
           D   DV +WN+L+S  +KN  F + L++F +L   S   PD +T+P V+KA G LGR   
Sbjct: 66  DIRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREFL 125

Query: 123 GRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYFK 182
           GR IH  ++K+G + DV V SS++ MYAK + F +++ +FDE P+RDV  WNTVISC+++
Sbjct: 126 GRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFYQ 185

Query: 183 DDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAFV 242
             +AE AL++F +M+  GFEPNSV+ TVA+S+C+RLL LERGKEI R+ + +G  LD +V
Sbjct: 186 SGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEYV 245

Query: 243 LSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGS 302
            SALVDMYGKC CLE+A+EVF+++PRK+ + WN+MI GY  KGDS+SC+E+L RM  EG+
Sbjct: 246 NSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMIIEGT 305

Query: 303 KPTLTTLTSIIYASS--------------------------------------------- 362
           +P+ TTLTSI+ A S                                             
Sbjct: 306 RPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAET 365

Query: 363 -------------------------------------------------STLPACSQLAA 422
                                                            S LPACSQLAA
Sbjct: 366 VFSKTQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSVLPACSQLAA 425

Query: 423 LDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIAA 482
           L+KG+++H  I   +LE++++++ ALLDMY+KCG+  EA ++F+ +PK+D+VSWT MI+A
Sbjct: 426 LEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRIFNSIPKKDVVSWTVMISA 485

Query: 483 YGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIKP 542
           YGSHGQ  EAL  FDEMQK  ++ D VT LAVLSAC HAGL+DEG K+F++M  +Y I+P
Sbjct: 486 YGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACGHAGLIDEGLKFFSQMRSKYGIEP 545

Query: 543 GIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGKM 594
            IE YSC+ID+LGRAGRL EAYEI+Q++ ET ++  LLSTLFSAC LH +  LG +I ++
Sbjct: 546 IIEHYSCMIDILGRAGRLLEAYEIIQQTPETSDNAELLSTLFSACCLHLEHSLGDRIARL 605

BLAST of HG10001188 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 380.6 bits (976), Expect = 3.4e-104
Identity = 212/579 (36.61%), Positives = 321/579 (55.44%), Query Frame = 0

Query: 2   TLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQTN 61
           TL S +  C++   L +G+ LH     LGF SN  +   L+  Y  C D  +A L +   
Sbjct: 391 TLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETA-LDYFLE 450

Query: 62  DSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVVY 121
               +V LWN +L A         + ++F Q+     V P+ YTYP +LK C  LG +  
Sbjct: 451 TEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIV-PNQYTYPSILKTCIRLGDLEL 510

Query: 122 GRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYFK 181
           G +IH+ +IKT    + +V S +++MYAK  +   A ++   F  +DV  W T+I+ Y +
Sbjct: 511 GEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQ 570

Query: 182 DDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAFV 241
            +  + AL  F +M + G   + V  T AVS+C  L  L+ G++I  +    G   D   
Sbjct: 571 YNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPF 630

Query: 242 LSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGS 301
            +ALV +Y +CG +E +   FEQ    + I WNA+++G+   G++   + + +RMN EG 
Sbjct: 631 QNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGI 690

Query: 302 KPTLTTLTSIIYASSSTLPACSQLAALDKGRELHHCIISHKLESNKIVMGALLDMYAKCG 361
                T  S + A+S T       A + +G+++H  I     +S   V  AL+ MYAKCG
Sbjct: 691 DNNNFTFGSAVKAASET-------ANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCG 750

Query: 362 DVDEARKLFHQLPKRDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLS 421
            + +A K F ++  ++ VSW ++I AY  HG  SEAL  FD+M  SNV+ + VT + VLS
Sbjct: 751 SISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLS 810

Query: 422 ACSHAGLVDEGYKYFNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRND 481
           ACSH GLVD+G  YF  M  +Y + P  E Y C++D+L RAG L  A E +Q     + D
Sbjct: 811 ACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQ-EMPIKPD 870

Query: 482 IGLLSTLFSACRLHNDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKWEEVCKVRRK 541
             +  TL SAC +H +  +G      L+E++P+D +TY+LLSN+YA   KW+     R+K
Sbjct: 871 ALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQK 930

Query: 542 MREIGLKKNPGCSWIEINQRIHPFFVEDKSNPLADGVYE 581
           M+E G+KK PG SWIE+   IH F+V D+++PLAD ++E
Sbjct: 931 MKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHE 959

BLAST of HG10001188 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 374.4 bits (960), Expect = 2.4e-102
Identity = 211/656 (32.16%), Positives = 338/656 (51.52%), Query Frame = 0

Query: 7   LKTCASSKLLK-QGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVF----QTN 66
           L +C  SKL     + +H  +   GF + I +   LI  Y  C        VF    Q N
Sbjct: 26  LDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRN 85

Query: 67  --------------------DS------PLDVSLWNALLSANTKNFRFVEALQLFHQLNC 126
                               DS        D   WN+++S   ++ R  EAL  F  ++ 
Sbjct: 86  IYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHK 145

Query: 127 HSYVRPDCYTYPVVLKACGGLGRVVYGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFH 186
             +V  + Y++  VL AC GL  +  G ++H+ + K+  + DV++GS++++MY+KC   +
Sbjct: 146 EGFVLNE-YSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVN 205

Query: 187 DAINLFDEFPQRDVGCWNTVISCYFKDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCT 246
           DA  +FDE   R+V  WN++I+C+ ++  A  AL +F  M E   EP+ VT    +S+C 
Sbjct: 206 DAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACA 265

Query: 247 RLLNLERGKEIQRELIDRGTLLDAFVLS-ALVDMYGKCGCLEMAKEVFEQIP-------- 306
            L  ++ G+E+   ++    L +  +LS A VDMY KC  ++ A+ +F+ +P        
Sbjct: 266 SLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAET 325

Query: 307 -----------------------RKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGSKP 366
                                   +N ++WNA+I GY+  G++   + L   +  E   P
Sbjct: 326 SMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCP 385

Query: 367 TLTTLTSIIYASSSTLPACSQLAALDKGRELHHCIISHKL------ESNKIVMGALLDMY 426
           T        Y+ ++ L AC+ LA L  G + H  ++ H        E +  V  +L+DMY
Sbjct: 386 T-------HYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMY 445

Query: 427 AKCGDVDEARKLFHQLPKRDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQADSVTFL 486
            KCG V+E   +F ++ +RD VSW +MI  +  +G  +EAL LF EM +S  + D +T +
Sbjct: 446 VKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMI 505

Query: 487 AVLSACSHAGLVDEGYKYFNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEILQRSKE 546
            VLSAC HAG V+EG  YF+ M   + + P  + Y+C++DLLGRAG L EA  +++    
Sbjct: 506 GVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIE-EMP 565

Query: 547 TRNDIGLLSTLFSACRLHNDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKWEEVCK 594
            + D  +  +L +AC++H +  LG  + + L+E++P +   Y+LLSNMYA + KWE+V  
Sbjct: 566 MQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMN 625

BLAST of HG10001188 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 5.3e-102
Identity = 193/558 (34.59%), Positives = 320/558 (57.35%), Query Frame = 0

Query: 18  QGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQTNDSPLDVSLWNALLSAN 77
           Q K +H ++  LG Q +  L   LI    S  D   A  VF     P  +  WNA++   
Sbjct: 36  QLKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRP-QIFPWNAIIRGY 95

Query: 78  TKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVVYGRRIHNHLIKTGLIWD 137
           ++N  F +AL ++  +   + V PD +T+P +LKAC GL  +  GR +H  + + G   D
Sbjct: 96  SRNNHFQDALLMYSNMQL-ARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDAD 155

Query: 138 VFVGSSMMNMYAKCDQFHDAINLFD--EFPQRDVGCWNTVISCYFKDDKAETALKMFDKM 197
           VFV + ++ +YAKC +   A  +F+    P+R +  W  ++S Y ++ +   AL++F +M
Sbjct: 156 VFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQM 215

Query: 198 KELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAFVLSALVDMYGKCGCL 257
           +++  +P+ V     +++ T L +L++G+ I   ++  G  ++  +L +L  MY KCG +
Sbjct: 216 RKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQV 275

Query: 258 EMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGSKPTLTTLTSIIYAS 317
             AK +F+++   N I WNAMI+GY+  G +R  I++   M ++  +P   ++TS I   
Sbjct: 276 ATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAI--- 335

Query: 318 SSTLPACSQLAALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPK 377
                AC+Q+ +L++ R ++  +       +  +  AL+DM+AKCG V+ AR +F +   
Sbjct: 336 ----SACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLD 395

Query: 378 RDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKY 437
           RD+V W++MI  YG HG+A EA+ L+  M++  V  + VTFL +L AC+H+G+V EG+ +
Sbjct: 396 RDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWF 455

Query: 438 FNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLH 497
           FN M   + I P  + Y+C+IDLLGRAG L +AYE++ +    +  + +   L SAC+ H
Sbjct: 456 FNRM-ADHKINPQQQHYACVIDLLGRAGHLDQAYEVI-KCMPVQPGVTVWGALLSACKKH 515

Query: 498 NDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKWEEVCKVRRKMREIGLKKNPGCSW 557
               LG    + L  IDP +   Y+ LSN+YA+   W+ V +VR +M+E GL K+ GCSW
Sbjct: 516 RHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSW 575

Query: 558 IEINQRIHPFFVEDKSNP 574
           +E+  R+  F V DKS+P
Sbjct: 576 VEVRGRLEAFRVGDKSHP 582

BLAST of HG10001188 vs. ExPASy Swiss-Prot
Match: Q7Y211 (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 372.9 bits (956), Expect = 7.0e-102
Identity = 203/599 (33.89%), Positives = 337/599 (56.26%), Query Frame = 0

Query: 2   TLLSALKTCASSKLLKQGKLLHQKIFSLGFQS---NIPLCKTLIGFYFSCHDYRSAELVF 61
           TL+S +  C S+  + +G ++ +++ + G +    N  +  TL+  Y       S++++ 
Sbjct: 201 TLVSVVTAC-SNLPMPEGLMMGKQVHAYGLRKGELNSFIINTLVAMYGKLGKLASSKVLL 260

Query: 62  QTNDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGR 121
            +     D+  WN +LS+  +N + +EAL+   ++     V PD +T   VL AC  L  
Sbjct: 261 GSFGG-RDLVTWNTVLSSLCQNEQLLEALEYLREMVLEG-VEPDEFTISSVLPACSHLEM 320

Query: 122 VVYGRRIHNHLIKTG-LIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVIS 181
           +  G+ +H + +K G L  + FVGS++++MY  C Q      +FD    R +G WN +I+
Sbjct: 321 LRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIA 380

Query: 182 CYFKDDKAETALKMFDKMKE-LGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTL 241
            Y +++  + AL +F  M+E  G   NS T    V +C R     R + I   ++ RG  
Sbjct: 381 GYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLD 440

Query: 242 LDAFVLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRM 301
            D FV + L+DMY + G +++A  +F ++  ++ +TWN MITGY         + LL +M
Sbjct: 441 RDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKM 500

Query: 302 NDEGSKPTLTTLTSIIYASSST----LPACSQLAALDKGRELHHCIISHKLESNKIVMGA 361
            +   K +       +  +S T    LP+C+ L+AL KG+E+H   I + L ++  V  A
Sbjct: 501 QNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSA 560

Query: 362 LLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQAD 421
           L+DMYAKCG +  +RK+F Q+P++++++W  +I AYG HG   EA+ L   M    V+ +
Sbjct: 561 LVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPN 620

Query: 422 SVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEIL 481
            VTF++V +ACSH+G+VDEG + F  M   Y ++P  + Y+C++DLLGRAGR+ EAY+++
Sbjct: 621 EVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLM 680

Query: 482 QRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKW 541
                  N  G  S+L  A R+HN+  +G    + LI+++P+  S Y+LL+N+Y+S   W
Sbjct: 681 NMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLW 740

Query: 542 EEVCKVRRKMREIGLKKNPGCSWIEINQRIHPFFVEDKSNPLADGVYECLNILACHMEK 592
           ++  +VRR M+E G++K PGCSWIE    +H F   D S+P ++ +   L  L   M K
Sbjct: 741 DKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRK 796

BLAST of HG10001188 vs. ExPASy TrEMBL
Match: A0A5D3BAG7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold182G00800 PE=4 SV=1)

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 538/690 (77.97%), Positives = 571/690 (82.75%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           +TLLSAL+TC SSKLLKQGKL+HQ+IFS GFQSNI LCK+LIGFYFSCHDY SAELVFQT
Sbjct: 4   VTLLSALRTCTSSKLLKQGKLIHQRIFSCGFQSNILLCKSLIGFYFSCHDYASAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           N+ PLDVSLWN+LLSA T NFRFVEALQLF QLNC+S+VRPD YTYPVVLKACGGLGRV+
Sbjct: 64  NECPLDVSLWNSLLSAYTNNFRFVEALQLFDQLNCNSHVRPDFYTYPVVLKACGGLGRVI 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHL+KTGLIWDVFVGSS+MNMYAKCDQF DAI LFDEFPQRDVGCWN VISCYF
Sbjct: 124 YGRRIHNHLLKTGLIWDVFVGSSLMNMYAKCDQFVDAIKLFDEFPQRDVGCWNAVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           KD KAETALK FDKMKELGFEPNSVTFTV VSSCTRLLNL+RGKEI RELIDR  LLDAF
Sbjct: 184 KDGKAETALKTFDKMKELGFEPNSVTFTVVVSSCTRLLNLKRGKEIHRELIDRRILLDAF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCGCLEMAKEVFEQIPRKN ITWNAMITGYSLKGDSRSCIELLMRMNDEG
Sbjct: 244 VLSALVDMYGKCGCLEMAKEVFEQIPRKNAITWNAMITGYSLKGDSRSCIELLMRMNDEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           +KPTLTTLTSI+YAS                                             
Sbjct: 304 TKPTLTTLTSIVYASSRSVQLRHGKFIHGYILRNRIDVDIFIDVSLIDFYFKCGYVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TIFRSISKNEVVSWNVMVSGYVMVGNHIQALRIYDNMKEHHVKPDALTFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           ALDKGRELH+CII+HKLE+N+IVMGALLDMYAKCGDVDEA+KLFHQLPKRDLVSWTSMI+
Sbjct: 424 ALDKGRELHYCIINHKLEANEIVMGALLDMYAKCGDVDEAQKLFHQLPKRDLVSWTSMIS 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFDEMQKSNV+ADSV FLAVLSACSHAGLVDEGY YFNEMI+QYDIK
Sbjct: 484 AYGSHGQASEALRLFDEMQKSNVRADSVAFLAVLSACSHAGLVDEGYTYFNEMIVQYDIK 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 597
           PGIE YSCLIDLLGRAGRLHEAYEILQRSKET++DIGLLSTLFSACRLHNDFVLGIQIGK
Sbjct: 544 PGIEHYSCLIDLLGRAGRLHEAYEILQRSKETKSDIGLLSTLFSACRLHNDFVLGIQIGK 603

BLAST of HG10001188 vs. ExPASy TrEMBL
Match: A0A0A0KK21 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G432790 PE=4 SV=1)

HSP 1 Score: 1070.1 bits (2766), Expect = 3.3e-309
Identity = 538/690 (77.97%), Positives = 565/690 (81.88%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           +TLLSAL+TC SSKLLKQGKL+HQ+IFS GFQSNI L K+LIGFYFSCHDY SAELVFQT
Sbjct: 4   VTLLSALRTCTSSKLLKQGKLIHQRIFSCGFQSNIVLSKSLIGFYFSCHDYASAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           ND PLDVSLWNALLSA T NFRFVEALQLF QLNC+SYVRPD YTYPVVLKACGGLGRV+
Sbjct: 64  NDCPLDVSLWNALLSAYTNNFRFVEALQLFDQLNCNSYVRPDFYTYPVVLKACGGLGRVI 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHL+KTGLIWDVFVGSS+MNMYAKCDQF DAI LFDEFPQRDVGCWN VISCYF
Sbjct: 124 YGRRIHNHLLKTGLIWDVFVGSSLMNMYAKCDQFVDAIKLFDEFPQRDVGCWNAVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           KD KAE ALK FDKMKELGFEPNSVTFTV VSSCTRLLNLERGKE+ RELI+R  LLDAF
Sbjct: 184 KDGKAEMALKTFDKMKELGFEPNSVTFTVVVSSCTRLLNLERGKEVHRELIERRILLDAF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCGCLEMAKEVFE+IPRKN ITWNAMITGYSLKGDSRSCIELLMRMNDEG
Sbjct: 244 VLSALVDMYGKCGCLEMAKEVFEKIPRKNAITWNAMITGYSLKGDSRSCIELLMRMNDEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           +KPTL TLTSIIYAS                                             
Sbjct: 304 TKPTLMTLTSIIYASSRSVQLRHGKFIHGYILRNRIDVDIFIDVSLIDFYFKCGYVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TIFRTISKNEVVSWNVMISGHVMVGNHIQALHIYDNMKEHHVKPDALTFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           ALDKGRELH+CII+HKLE+N+IVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMI 
Sbjct: 424 ALDKGRELHYCIINHKLEANEIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIF 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFDEMQK NV+ADSVTFLAVLSACSHAGLVDEGY YFNEM++QYDIK
Sbjct: 484 AYGSHGQASEALRLFDEMQKLNVRADSVTFLAVLSACSHAGLVDEGYMYFNEMVVQYDIK 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 597
           PGIE YSCLIDLLGRAGRLHEAYEILQRSKETR+DIGLLSTLFSAC LHN+FVLGIQIGK
Sbjct: 544 PGIEHYSCLIDLLGRAGRLHEAYEILQRSKETRSDIGLLSTLFSACLLHNNFVLGIQIGK 603

BLAST of HG10001188 vs. ExPASy TrEMBL
Match: A0A6J1FXU4 (pentatricopeptide repeat-containing protein At5g27110 OS=Cucurbita moschata OX=3662 GN=LOC111448220 PE=4 SV=1)

HSP 1 Score: 1057.4 bits (2733), Expect = 2.3e-305
Identity = 530/690 (76.81%), Positives = 560/690 (81.16%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           MTLLSAL+TCASSKLLKQGKL+HQ+IF  GFQSN+ LCK LIGFYFSC+DYRSAELVFQT
Sbjct: 4   MTLLSALRTCASSKLLKQGKLIHQRIFCSGFQSNVTLCKALIGFYFSCYDYRSAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           ND PLDVSLWNALLSA TK+ RFVEALQLF QL  HS+VRPDCYTYPVVLKACGGLGRVV
Sbjct: 64  NDCPLDVSLWNALLSAYTKSSRFVEALQLFDQLKSHSHVRPDCYTYPVVLKACGGLGRVV 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDA++LFDE PQRDVGCWN VISCYF
Sbjct: 124 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAVDLFDEIPQRDVGCWNAVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           +DDK E ALKMFDKMKELGFEPNSVTFTV VSSC RLLNLERGKEI RELIDR  LLDAF
Sbjct: 184 QDDKPEAALKMFDKMKELGFEPNSVTFTVVVSSCARLLNLERGKEIHRELIDRSVLLDAF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCGCLEMAKEVFEQIPRKN +TWN+MITGYSLKGDSRSCIELLMRMN EG
Sbjct: 244 VLSALVDMYGKCGCLEMAKEVFEQIPRKNAMTWNSMITGYSLKGDSRSCIELLMRMNYEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           ++PTLTTLTSIIYAS                                             
Sbjct: 304 TEPTLTTLTSIIYASSRSVQLRHGKFIHGYILRNRIDVDIFIDVSLIDLYFKCGSVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TVFRNVSKNDVVSWNVMISGYVMVGNHIQALRVYDNMKEHHVKPDAVTFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           ALDKGRELHHCIISHKLESN+IVMGALLDMYAKCGDV+EARKLFHQLP+RDLVSWTSMI 
Sbjct: 424 ALDKGRELHHCIISHKLESNEIVMGALLDMYAKCGDVNEARKLFHQLPERDLVSWTSMIT 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFDEM+KSNV+ADSVTFLAVLSACSHAGLVDEGY YFNEMI QYDI+
Sbjct: 484 AYGSHGQASEALRLFDEMKKSNVRADSVTFLAVLSACSHAGLVDEGYIYFNEMITQYDIR 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 597
           PGIE YSCLIDLLGRAGRLHEAYEIL+RS+ETRND GLLSTLFSACRLHNDF LGIQIGK
Sbjct: 544 PGIEHYSCLIDLLGRAGRLHEAYEILERSEETRNDTGLLSTLFSACRLHNDFGLGIQIGK 603

BLAST of HG10001188 vs. ExPASy TrEMBL
Match: A0A6J1JHK4 (pentatricopeptide repeat-containing protein At5g27110 OS=Cucurbita maxima OX=3661 GN=LOC111484569 PE=4 SV=1)

HSP 1 Score: 1056.6 bits (2731), Expect = 3.9e-305
Identity = 530/691 (76.70%), Positives = 560/691 (81.04%), Query Frame = 0

Query: 1   MTLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQT 60
           MTLLSAL+TCASSKLLKQGKL+HQ+IF  GFQSN+ LCK LIGFYFSC+DYRSAELVFQT
Sbjct: 4   MTLLSALRTCASSKLLKQGKLIHQRIFCSGFQSNLTLCKALIGFYFSCYDYRSAELVFQT 63

Query: 61  NDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVV 120
           ND PLDVSLWNALLSA TK+FRFVEALQLF QL  HS+VRPDCYTYPVVLKACGGLGRVV
Sbjct: 64  NDCPLDVSLWNALLSAYTKSFRFVEALQLFDQLKSHSHVRPDCYTYPVVLKACGGLGRVV 123

Query: 121 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYF 180
           YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDA+NLFDEFPQRDVGCWN VISCYF
Sbjct: 124 YGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAVNLFDEFPQRDVGCWNAVISCYF 183

Query: 181 KDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAF 240
           +DDK E ALKMFDKMKELGFEPNSVTFTV VSSC RLLNLERGKEI RELIDR  LLD F
Sbjct: 184 QDDKPEAALKMFDKMKELGFEPNSVTFTVVVSSCARLLNLERGKEIHRELIDRSVLLDDF 243

Query: 241 VLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEG 300
           VLSALVDMYGKCGCLEMAKEVFEQIPRKN +TWN+MITGYSLKGDSRSCIELLMRM+ EG
Sbjct: 244 VLSALVDMYGKCGCLEMAKEVFEQIPRKNAMTWNSMITGYSLKGDSRSCIELLMRMSYEG 303

Query: 301 SKPTLTTLTSIIYAS--------------------------------------------- 360
           ++PTLTTLTSIIYAS                                             
Sbjct: 304 TEPTLTTLTSIIYASSRSVQLRHGKFIHGYILRNRIDVDIFIDVSLIDFYFKCGSVSSAE 363

Query: 361 -------------------------------------------------SSTLPACSQLA 420
                                                            SSTL ACSQLA
Sbjct: 364 TVFRNVSKNEVVSWNVMISGYVMVGNHIQALCVYDNMKEHHVKPDAVTFSSTLSACSQLA 423

Query: 421 ALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIA 480
           ALDKGRELHHCIISHKLE N+IVMGALLDMYAKCGDV+EARKLFHQLP+RDLVSWTSMI 
Sbjct: 424 ALDKGRELHHCIISHKLECNEIVMGALLDMYAKCGDVNEARKLFHQLPERDLVSWTSMIT 483

Query: 481 AYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIK 540
           AYGSHGQASEALRLFD MQKSNV+ADSVTFLAVLSACSHAGLVDEGY YFNEMI QYDI+
Sbjct: 484 AYGSHGQASEALRLFDAMQKSNVRADSVTFLAVLSACSHAGLVDEGYIYFNEMITQYDIR 543

Query: 541 PGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGK 598
           PGIE YSCLIDLLGRAGRLHEAYEIL+RS+ETRND GLLSTLFSACRLHNDF LG QIGK
Sbjct: 544 PGIEHYSCLIDLLGRAGRLHEAYEILERSEETRNDTGLLSTLFSACRLHNDFGLGKQIGK 603

BLAST of HG10001188 vs. ExPASy TrEMBL
Match: A0A6J1DJC4 (pentatricopeptide repeat-containing protein At5g27110 OS=Momordica charantia OX=3673 GN=LOC111020588 PE=4 SV=1)

HSP 1 Score: 1013.8 bits (2620), Expect = 2.9e-292
Identity = 512/688 (74.42%), Positives = 551/688 (80.09%), Query Frame = 0

Query: 3   LLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQTND 62
           LLSAL+TCASSKLLKQGKL+HQ+IFS GFQS+I LCKTLI FY SCHDYRSAELVF+T  
Sbjct: 6   LLSALRTCASSKLLKQGKLIHQRIFSSGFQSHIALCKTLIDFYSSCHDYRSAELVFETTC 65

Query: 63  SPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVVYG 122
             LDVSLWNALLSA TKNF FVEALQL+ QL CHS VRPDCYTYPVVLKACGGLGRVV G
Sbjct: 66  CSLDVSLWNALLSAYTKNFMFVEALQLYDQLKCHSEVRPDCYTYPVVLKACGGLGRVVCG 125

Query: 123 RRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYFKD 182
           RR+H+HLIKTGLIWDVFV SS+MNMY KC +FH AINLFDE P RDVGCWNTVISCYF+D
Sbjct: 126 RRVHSHLIKTGLIWDVFVSSSLMNMYVKCGRFHYAINLFDEMPHRDVGCWNTVISCYFQD 185

Query: 183 DKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAFVL 242
           DKAETALKMFD+MK+ GFEPNSVTFT+ +SSCTRLLNLE+GKEI RELIDR  LLDAFVL
Sbjct: 186 DKAETALKMFDEMKDSGFEPNSVTFTIVISSCTRLLNLEKGKEIHRELIDRRVLLDAFVL 245

Query: 243 SALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGSK 302
           SALVD+YGKCGCLEMAKEVFEQIPRKN ITWN+MITGYSL GDSRSCIELL RM D G+K
Sbjct: 246 SALVDLYGKCGCLEMAKEVFEQIPRKNAITWNSMITGYSLTGDSRSCIELLKRMTDAGTK 305

Query: 303 PTLTTLTSIIYAS----------------------------------------------- 362
           PTLTTLTSII AS                                               
Sbjct: 306 PTLTTLTSIISASSRSAQPRHGKFIHGFILRNGMNSDIFIDVSLIDLYFKCGYVYSAETI 365

Query: 363 -----------------------------------------------SSTLPACSQLAAL 422
                                                          SSTL ACSQLAAL
Sbjct: 366 FRNISKNEVVSWNVMISGYVLVGNHIQALRVYDNMKEHDVKPDAVTFSSTLSACSQLAAL 425

Query: 423 DKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIAAY 482
           +KGRELH+CIISHKLE+N+IVMGALLDM+AKCGDVDEARKLF QLP+RDLVSWT+MI AY
Sbjct: 426 EKGRELHNCIISHKLETNEIVMGALLDMFAKCGDVDEARKLFLQLPERDLVSWTTMITAY 485

Query: 483 GSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIKPG 542
           GSHGQASEALRLFDEMQKSNV ADSVTFLAVLSACSHAGLVDEGYKYFN+MIIQYDIKPG
Sbjct: 486 GSHGQASEALRLFDEMQKSNVGADSVTFLAVLSACSHAGLVDEGYKYFNKMIIQYDIKPG 545

Query: 543 IEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGKML 597
           IE YSCLIDLLGRAGRLHEAYEILQRS++TRN IGLLSTLFSACRL NDFVLGI+IGKM+
Sbjct: 546 IEHYSCLIDLLGRAGRLHEAYEILQRSEDTRNHIGLLSTLFSACRLRNDFVLGIEIGKMI 605

BLAST of HG10001188 vs. TAIR 10
Match: AT5G27110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 627.5 bits (1617), Expect = 1.1e-179
Identity = 320/686 (46.65%), Positives = 446/686 (65.01%), Query Frame = 0

Query: 3   LLSALKTCA-SSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQTN 62
           LLS L+ C  S+K L++ KL+HQ+I +LG + ++ LCK+LI  YF+C D+ SA  VF+  
Sbjct: 6   LLSLLRECTNSTKSLRRIKLVHQRILTLGLRRDVVLCKSLINVYFTCKDHCSARHVFENF 65

Query: 63  DSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVVY 122
           D   DV +WN+L+S  +KN  F + L++F +L   S   PD +T+P V+KA G LGR   
Sbjct: 66  DIRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREFL 125

Query: 123 GRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYFK 182
           GR IH  ++K+G + DV V SS++ MYAK + F +++ +FDE P+RDV  WNTVISC+++
Sbjct: 126 GRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFYQ 185

Query: 183 DDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAFV 242
             +AE AL++F +M+  GFEPNSV+ TVA+S+C+RLL LERGKEI R+ + +G  LD +V
Sbjct: 186 SGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEYV 245

Query: 243 LSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGS 302
            SALVDMYGKC CLE+A+EVF+++PRK+ + WN+MI GY  KGDS+SC+E+L RM  EG+
Sbjct: 246 NSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMIIEGT 305

Query: 303 KPTLTTLTSIIYASS--------------------------------------------- 362
           +P+ TTLTSI+ A S                                             
Sbjct: 306 RPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAET 365

Query: 363 -------------------------------------------------STLPACSQLAA 422
                                                            S LPACSQLAA
Sbjct: 366 VFSKTQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSVLPACSQLAA 425

Query: 423 LDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIAA 482
           L+KG+++H  I   +LE++++++ ALLDMY+KCG+  EA ++F+ +PK+D+VSWT MI+A
Sbjct: 426 LEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRIFNSIPKKDVVSWTVMISA 485

Query: 483 YGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIKP 542
           YGSHGQ  EAL  FDEMQK  ++ D VT LAVLSAC HAGL+DEG K+F++M  +Y I+P
Sbjct: 486 YGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACGHAGLIDEGLKFFSQMRSKYGIEP 545

Query: 543 GIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGKM 594
            IE YSC+ID+LGRAGRL EAYEI+Q++ ET ++  LLSTLFSAC LH +  LG +I ++
Sbjct: 546 IIEHYSCMIDILGRAGRLLEAYEIIQQTPETSDNAELLSTLFSACCLHLEHSLGDRIARL 605

BLAST of HG10001188 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 380.6 bits (976), Expect = 2.4e-105
Identity = 212/579 (36.61%), Positives = 321/579 (55.44%), Query Frame = 0

Query: 2   TLLSALKTCASSKLLKQGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQTN 61
           TL S +  C++   L +G+ LH     LGF SN  +   L+  Y  C D  +A L +   
Sbjct: 391 TLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETA-LDYFLE 450

Query: 62  DSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVVY 121
               +V LWN +L A         + ++F Q+     V P+ YTYP +LK C  LG +  
Sbjct: 451 TEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIV-PNQYTYPSILKTCIRLGDLEL 510

Query: 122 GRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVISCYFK 181
           G +IH+ +IKT    + +V S +++MYAK  +   A ++   F  +DV  W T+I+ Y +
Sbjct: 511 GEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQ 570

Query: 182 DDKAETALKMFDKMKELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAFV 241
            +  + AL  F +M + G   + V  T AVS+C  L  L+ G++I  +    G   D   
Sbjct: 571 YNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPF 630

Query: 242 LSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGS 301
            +ALV +Y +CG +E +   FEQ    + I WNA+++G+   G++   + + +RMN EG 
Sbjct: 631 QNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGI 690

Query: 302 KPTLTTLTSIIYASSSTLPACSQLAALDKGRELHHCIISHKLESNKIVMGALLDMYAKCG 361
                T  S + A+S T       A + +G+++H  I     +S   V  AL+ MYAKCG
Sbjct: 691 DNNNFTFGSAVKAASET-------ANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCG 750

Query: 362 DVDEARKLFHQLPKRDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLS 421
            + +A K F ++  ++ VSW ++I AY  HG  SEAL  FD+M  SNV+ + VT + VLS
Sbjct: 751 SISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLS 810

Query: 422 ACSHAGLVDEGYKYFNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRND 481
           ACSH GLVD+G  YF  M  +Y + P  E Y C++D+L RAG L  A E +Q     + D
Sbjct: 811 ACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQ-EMPIKPD 870

Query: 482 IGLLSTLFSACRLHNDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKWEEVCKVRRK 541
             +  TL SAC +H +  +G      L+E++P+D +TY+LLSN+YA   KW+     R+K
Sbjct: 871 ALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQK 930

Query: 542 MREIGLKKNPGCSWIEINQRIHPFFVEDKSNPLADGVYE 581
           M+E G+KK PG SWIE+   IH F+V D+++PLAD ++E
Sbjct: 931 MKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHE 959

BLAST of HG10001188 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 374.4 bits (960), Expect = 1.7e-103
Identity = 211/656 (32.16%), Positives = 338/656 (51.52%), Query Frame = 0

Query: 7   LKTCASSKLLK-QGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVF----QTN 66
           L +C  SKL     + +H  +   GF + I +   LI  Y  C        VF    Q N
Sbjct: 26  LDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRN 85

Query: 67  --------------------DS------PLDVSLWNALLSANTKNFRFVEALQLFHQLNC 126
                               DS        D   WN+++S   ++ R  EAL  F  ++ 
Sbjct: 86  IYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHK 145

Query: 127 HSYVRPDCYTYPVVLKACGGLGRVVYGRRIHNHLIKTGLIWDVFVGSSMMNMYAKCDQFH 186
             +V  + Y++  VL AC GL  +  G ++H+ + K+  + DV++GS++++MY+KC   +
Sbjct: 146 EGFVLNE-YSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGNVN 205

Query: 187 DAINLFDEFPQRDVGCWNTVISCYFKDDKAETALKMFDKMKELGFEPNSVTFTVAVSSCT 246
           DA  +FDE   R+V  WN++I+C+ ++  A  AL +F  M E   EP+ VT    +S+C 
Sbjct: 206 DAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACA 265

Query: 247 RLLNLERGKEIQRELIDRGTLLDAFVLS-ALVDMYGKCGCLEMAKEVFEQIP-------- 306
            L  ++ G+E+   ++    L +  +LS A VDMY KC  ++ A+ +F+ +P        
Sbjct: 266 SLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAET 325

Query: 307 -----------------------RKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGSKP 366
                                   +N ++WNA+I GY+  G++   + L   +  E   P
Sbjct: 326 SMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCP 385

Query: 367 TLTTLTSIIYASSSTLPACSQLAALDKGRELHHCIISHKL------ESNKIVMGALLDMY 426
           T        Y+ ++ L AC+ LA L  G + H  ++ H        E +  V  +L+DMY
Sbjct: 386 T-------HYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMY 445

Query: 427 AKCGDVDEARKLFHQLPKRDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQADSVTFL 486
            KCG V+E   +F ++ +RD VSW +MI  +  +G  +EAL LF EM +S  + D +T +
Sbjct: 446 VKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMI 505

Query: 487 AVLSACSHAGLVDEGYKYFNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEILQRSKE 546
            VLSAC HAG V+EG  YF+ M   + + P  + Y+C++DLLGRAG L EA  +++    
Sbjct: 506 GVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIE-EMP 565

Query: 547 TRNDIGLLSTLFSACRLHNDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKWEEVCK 594
            + D  +  +L +AC++H +  LG  + + L+E++P +   Y+LLSNMYA + KWE+V  
Sbjct: 566 MQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMN 625

BLAST of HG10001188 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 373.2 bits (957), Expect = 3.8e-103
Identity = 193/558 (34.59%), Positives = 320/558 (57.35%), Query Frame = 0

Query: 18  QGKLLHQKIFSLGFQSNIPLCKTLIGFYFSCHDYRSAELVFQTNDSPLDVSLWNALLSAN 77
           Q K +H ++  LG Q +  L   LI    S  D   A  VF     P  +  WNA++   
Sbjct: 36  QLKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRP-QIFPWNAIIRGY 95

Query: 78  TKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGRVVYGRRIHNHLIKTGLIWD 137
           ++N  F +AL ++  +   + V PD +T+P +LKAC GL  +  GR +H  + + G   D
Sbjct: 96  SRNNHFQDALLMYSNMQL-ARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDAD 155

Query: 138 VFVGSSMMNMYAKCDQFHDAINLFD--EFPQRDVGCWNTVISCYFKDDKAETALKMFDKM 197
           VFV + ++ +YAKC +   A  +F+    P+R +  W  ++S Y ++ +   AL++F +M
Sbjct: 156 VFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQM 215

Query: 198 KELGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTLLDAFVLSALVDMYGKCGCL 257
           +++  +P+ V     +++ T L +L++G+ I   ++  G  ++  +L +L  MY KCG +
Sbjct: 216 RKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQV 275

Query: 258 EMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRMNDEGSKPTLTTLTSIIYAS 317
             AK +F+++   N I WNAMI+GY+  G +R  I++   M ++  +P   ++TS I   
Sbjct: 276 ATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAI--- 335

Query: 318 SSTLPACSQLAALDKGRELHHCIISHKLESNKIVMGALLDMYAKCGDVDEARKLFHQLPK 377
                AC+Q+ +L++ R ++  +       +  +  AL+DM+AKCG V+ AR +F +   
Sbjct: 336 ----SACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLD 395

Query: 378 RDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQADSVTFLAVLSACSHAGLVDEGYKY 437
           RD+V W++MI  YG HG+A EA+ L+  M++  V  + VTFL +L AC+H+G+V EG+ +
Sbjct: 396 RDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWF 455

Query: 438 FNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEILQRSKETRNDIGLLSTLFSACRLH 497
           FN M   + I P  + Y+C+IDLLGRAG L +AYE++ +    +  + +   L SAC+ H
Sbjct: 456 FNRM-ADHKINPQQQHYACVIDLLGRAGHLDQAYEVI-KCMPVQPGVTVWGALLSACKKH 515

Query: 498 NDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKWEEVCKVRRKMREIGLKKNPGCSW 557
               LG    + L  IDP +   Y+ LSN+YA+   W+ V +VR +M+E GL K+ GCSW
Sbjct: 516 RHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSW 575

Query: 558 IEINQRIHPFFVEDKSNP 574
           +E+  R+  F V DKS+P
Sbjct: 576 VEVRGRLEAFRVGDKSHP 582

BLAST of HG10001188 vs. TAIR 10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 372.9 bits (956), Expect = 5.0e-103
Identity = 203/599 (33.89%), Positives = 337/599 (56.26%), Query Frame = 0

Query: 2   TLLSALKTCASSKLLKQGKLLHQKIFSLGFQS---NIPLCKTLIGFYFSCHDYRSAELVF 61
           TL+S +  C S+  + +G ++ +++ + G +    N  +  TL+  Y       S++++ 
Sbjct: 201 TLVSVVTAC-SNLPMPEGLMMGKQVHAYGLRKGELNSFIINTLVAMYGKLGKLASSKVLL 260

Query: 62  QTNDSPLDVSLWNALLSANTKNFRFVEALQLFHQLNCHSYVRPDCYTYPVVLKACGGLGR 121
            +     D+  WN +LS+  +N + +EAL+   ++     V PD +T   VL AC  L  
Sbjct: 261 GSFGG-RDLVTWNTVLSSLCQNEQLLEALEYLREMVLEG-VEPDEFTISSVLPACSHLEM 320

Query: 122 VVYGRRIHNHLIKTG-LIWDVFVGSSMMNMYAKCDQFHDAINLFDEFPQRDVGCWNTVIS 181
           +  G+ +H + +K G L  + FVGS++++MY  C Q      +FD    R +G WN +I+
Sbjct: 321 LRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIA 380

Query: 182 CYFKDDKAETALKMFDKMKE-LGFEPNSVTFTVAVSSCTRLLNLERGKEIQRELIDRGTL 241
            Y +++  + AL +F  M+E  G   NS T    V +C R     R + I   ++ RG  
Sbjct: 381 GYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLD 440

Query: 242 LDAFVLSALVDMYGKCGCLEMAKEVFEQIPRKNTITWNAMITGYSLKGDSRSCIELLMRM 301
            D FV + L+DMY + G +++A  +F ++  ++ +TWN MITGY         + LL +M
Sbjct: 441 RDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKM 500

Query: 302 NDEGSKPTLTTLTSIIYASSST----LPACSQLAALDKGRELHHCIISHKLESNKIVMGA 361
            +   K +       +  +S T    LP+C+ L+AL KG+E+H   I + L ++  V  A
Sbjct: 501 QNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSA 560

Query: 362 LLDMYAKCGDVDEARKLFHQLPKRDLVSWTSMIAAYGSHGQASEALRLFDEMQKSNVQAD 421
           L+DMYAKCG +  +RK+F Q+P++++++W  +I AYG HG   EA+ L   M    V+ +
Sbjct: 561 LVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPN 620

Query: 422 SVTFLAVLSACSHAGLVDEGYKYFNEMIIQYDIKPGIEQYSCLIDLLGRAGRLHEAYEIL 481
            VTF++V +ACSH+G+VDEG + F  M   Y ++P  + Y+C++DLLGRAGR+ EAY+++
Sbjct: 621 EVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLM 680

Query: 482 QRSKETRNDIGLLSTLFSACRLHNDFVLGIQIGKMLIEIDPDDPSTYILLSNMYASVNKW 541
                  N  G  S+L  A R+HN+  +G    + LI+++P+  S Y+LL+N+Y+S   W
Sbjct: 681 NMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLW 740

Query: 542 EEVCKVRRKMREIGLKKNPGCSWIEINQRIHPFFVEDKSNPLADGVYECLNILACHMEK 592
           ++  +VRR M+E G++K PGCSWIE    +H F   D S+P ++ +   L  L   M K
Sbjct: 741 DKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRK 796
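
Each HSP header above reports identity and positives as a count over the alignment length, followed by a rounded percentage. The following is a minimal Python sketch that recomputes those percentages from the counts, assuming the header keeps the layout printed on this page. The helper name `parse_hsp_stats` is hypothetical and is not part of any BLAST tool; the pattern also tolerates the `Postives` spelling that some exports of this page use.

```python
import re

# Hypothetical helper for the HSP statistics line as printed on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed and reported percentages for one HSP header line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Percentages recomputed from the raw counts.
        "identity_pct": 100.0 * int(ident) / int(ident_len),
        "positives_pct": 100.0 * int(pos) / int(pos_len),
        # Percentages as printed in the header, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# Header of the Cucumis melo var. makuwa TrEMBL hit (A0A5D3BAG7).
stats = parse_hsp_stats(
    "Identity = 538/690 (77.97%), Positives = 571/690 (82.75%), Query Frame = 0"
)
print(round(stats["identity_pct"], 2), round(stats["positives_pct"], 2))
```

Comparing the recomputed values with the reported ones is a quick consistency check when alignments are copied between reports.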

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038902098.1  0.0e+00   80.14     pentatricopeptide repeat-containing protein At5g27110 [Benincasa hispida]
TYJ96117.1      0.0e+00   77.97     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_004140282.1  6.9e-309  77.97     pentatricopeptide repeat-containing protein At5g27110 [Cucumis sativus] >KGN4810...
XP_023513314.1  1.2e-305  76.96     pentatricopeptide repeat-containing protein At5g27110 [Cucurbita pepo subsp. pep...
KAG7010290.1    1.6e-305  76.96     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

Match Name  E-value   Identity  Description
O04659      1.6e-178  46.65     Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX...
Q9SVP7      3.4e-104  36.61     Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
Q9SIT7      2.4e-102  32.16     Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9LTV8      5.3e-102  34.59     Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q7Y211      7.0e-102  33.89     Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...

Match Name  E-value   Identity  Description
A0A5D3BAG7  0.0e+00   77.97     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A0A0KK21  3.3e-309  77.97     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G432790 PE=4 SV=1
A0A6J1FXU4  2.3e-305  76.81     pentatricopeptide repeat-containing protein At5g27110 OS=Cucurbita moschata OX=3...
A0A6J1JHK4  3.9e-305  76.70     pentatricopeptide repeat-containing protein At5g27110 OS=Cucurbita maxima OX=366...
A0A6J1DJC4  2.9e-292  74.42     pentatricopeptide repeat-containing protein At5g27110 OS=Momordica charantia OX=...

Match Name   E-value   Identity  Description
AT5G27110.1  1.1e-179  46.65     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G13650.1  2.4e-105  36.61     Pentatricopeptide repeat (PPR) superfamily protein
AT2G13600.1  1.7e-103  32.16     Pentatricopeptide repeat (PPR) superfamily protein
AT3G12770.1  3.8e-103  34.59     mitochondrial editing factor 22
AT3G57430.1  5.0e-103  33.89     Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 115..218  e-value: 3.6E-24  score: 87.1
    coord: 219..316  e-value: 4.7E-19  score: 70.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 1..114  e-value: 3.1E-12  score: 48.5
    coord: 317..439  e-value: 6.1E-34  score: 119.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 440..576  e-value: 1.7E-9  score: 39.6
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 148..539
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 69..92  e-value: 0.09  score: 13.1
    coord: 518..547  e-value: 0.28  score: 11.5
    coord: 142..167  e-value: 0.012  score: 15.8
    coord: 452..475  e-value: 0.015  score: 15.5
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 171..204  e-value: 1.7E-9  score: 35.2
    coord: 271..304  e-value: 3.3E-4  score: 18.6
    coord: 379..412  e-value: 4.9E-8  score: 30.7
    coord: 414..447  e-value: 1.5E-4  score: 19.7
    coord: 352..378  e-value: 3.7E-4  score: 18.5
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 377..424  e-value: 2.0E-12  score: 47.1
    coord: 168..214  e-value: 7.9E-14  score: 51.6
    coord: 268..313  e-value: 6.9E-8  score: 32.6
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 238..268  score: 8.506026
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 168..202  score: 12.178061
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 412..447  score: 9.273317
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 377..411  score: 12.298636
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 515..549  score: 9.185627
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 269..303  score: 10.873667
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 346..376  score: 9.097937
None  No IPR available  PANTHER  PTHR47925  OS01G0913400 PROTEIN-RELATED
    coord: 316..593
None  No IPR available  PANTHER  PTHR47925  OS01G0913400 PROTEIN-RELATED
    coord: 3..305
None  No IPR available  PANTHER  PTHR47925:SF20  SUBFAMILY NOT NAMED
    coord: 316..593
None  No IPR available  PANTHER  PTHR47925:SF20  SUBFAMILY NOT NAMED
    coord: 3..305
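
The PROSITE PS51375 (PPR) hits listed above are given out of positional order, and two of them abut each other. The following is a minimal Python sketch that sorts the hits and merges overlapping or directly adjacent ones into contiguous blocks. It assumes that `coord: a..b` means 1-based, inclusive amino-acid positions, which this page does not state explicitly. The function name `merge_intervals` is illustrative only.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        # With inclusive coordinates, (238, 268) and (269, 303) are adjacent.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# PROSITE PS51375 coordinates copied from the InterPro annotation above.
ppr_prosite = [
    (238, 268), (168, 202), (412, 447), (377, 411),
    (515, 549), (269, 303), (346, 376),
]

blocks = merge_intervals(ppr_prosite)
# Residues covered by at least one PPR profile hit (assuming inclusive ends).
covered = sum(end - start + 1 for start, end in blocks)

for start, end in blocks:
    print("%d..%d" % (start, end))
print("covered residues:", covered)
```

The same function can be applied to the PFAM or TIGRFAM coordinates to compare how much of the protein each source assigns to PPR motifs.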

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10001188.1  HG10001188.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009451      RNA modification
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003723      RNA binding