HG10006016 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10006016
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr07: 11448341 .. 11452145 (-)
RNA-Seq Expression: HG10006016
Synteny: HG10006016
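The location field can be turned into a chromosome, a coordinate pair, and a strand with a few lines of code. The following is a minimal sketch, not part of the database: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr07: 11448341 .. 11452145 (-)'.

    Returns (chromosome, start, end, strand). The layout of the string
    is taken from the Location field above; other layouts are rejected.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr07: 11448341 .. 11452145 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 3805 bp on the minus strand of Chr07.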
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATGCAGTAATGCCCAGTGAGTTTAGTTAAGAAATGTTTGCAATATGAATACTTTGTAAACGAGTAAATGAAATCTTGTGTAATTTGTTTTGAAATTTTGAATAGAACTGAAAATTTTACAAGTTTATGAACAAAAAAAGACATGCACGTAACACTGTGAAATTTAGATAAATTTAAACCAATTGCCTCAAGTTATTGGAGTTTATTTTCATTAATTTTTGAGTTTGAAGTAAAAAAATTATTTAATTTGGAGAAATAATTGGTTTATTTGATTTTGCTTAGATAAAAATATTAAAAAAGATTTGGAATCCAAGAAGAAAAACAGAAAGTTTGACCTAGGTCAAGGTAAAAATTAGGTAAAGATTGATATATGTGAGTTAAAATAAAATAAAATAAAATAAAATATTAACTTTCTTTTTCCAAGTCCAGGCGCCAGAAACCTCCCATCTCCTCTCTCACTGTCGCACAAACCTCCTCACGCTCTGCCCTTCCTTCCGCCGGTCGCCAACCTCAGCGGCCCGCGCCGCCCGACTGCAACGCCCGCCGCCAGTCTCCGATCGCTGTCGCCGCCGTCCGTGAGTAACCTCTGGATATTTCCGTCTCTCTCTCTAAACTCGGTCCCTTTTTGTATCTTTAATTCCGGCCACGCCGCTCGGGTTTCTGGCCTGCAGGTTGTGCCGCCGCTCATTCCCGTCGTCAACAGTGCGCCGGTGAGGTAATTCCCCCTTTTCACAGCTAAAAGTTATATTTTACTTATTAATATTTTGTTCAATTTGCATTGTTTATGAATGTGCCTTACTAATTGAGTGTGGTTTGATTTGCTTCGTTGTAGGGTTATATTTGGGATGCTTGTGGCTTTGCCTATTACACTGGTGACTCTAAGACTGTTTCACTCCAAATACAATTCTTTTTTCTTGAGTTCTATTTTCTTCTCTTTCTTCCGCATTTTTCAAACAAAACAAAGAGAAAACACCCATAGGTTTTAGAAGAACTTTGGGTTGAAATGAGAGGTTGATAGAAGGGGATTTTTTTTTGTAAGCCCAAGCATTAGTGTGGTCCACTCTAACAGTACCTTTTTCCTTCTTTCTTCTCACTGCCATCATTCACCAACACCGATCCATCCGGTTTCAAAGTGCTCTTAATTGTTATTTTCCTATCTTATATAGGATTTTTGCATTGCCATTATTGAAAGGCCGCATTTCTGTTTGCTTTTTCTTCTTCAATCTTCAGTGTTATGATACATCACCATTGTGGCTCATTTCTTAGTCGAATCCTCACCACTTTAGTTCAATTCTATTCTACCTCTACAACTTCTCCTCCCACCATTCCCTTCACCTCCTTGTTGAGACAATGCAGGACGTTGATCAATGCCAAGCTTGCTCACCAGCAAATTTTTGTCAATGGCTTCACCGAAATGGCCACCTACGCCGTTGGTGCCTACATCGAGTGTGGTGCTTTTGCAGAAGCTGTATCGCTCCTCCAACGTCTTATTCCGTCGCATTCCACTGTTTTCTGGTGGAATGCACTGATTCGACGTTCTGTGAGACTTGGTTTCCTTGATGATGCATTGGGTTTTTATTGTCAGATGCAGAGGCTTGGGTGGTTGCCTGACCATTACACATTTCCTTTTGTTCTCAAAGCCTGTGGTGAAATCCCATCGTTTCGACGTGGTGCTTCAGTTCACGCCATAGTTTGTGCAAATGGGTTTGAGTCAAATGTATTTATTTGTAATTCAATTGTGGCTATGTATGGGCGATGTGGGGCATTGGATGATGCACGCCAAGTGTTTAATGAGGTGCTTGAAAGAAAGATAGAGGACATTGTGTCTTGGAATTCAATTCTTGCTGCCTATGTACAAGGTGGGGAGTCAAGAACTGCCCTTAGAGTTGCTTTACGGATGGCTAACCACTACAGTTCTAAACTTCGGCCAGATGCAATTACACTTGTGAATATTCTTCCTGCTTGTGCGTCAGCATTTGCACCTCAACATGGTAAGC
AGGTACATGGATTTTCAGTACGAAGTGGATTGGTGGATGATGTATTTGTAGGCAATGCTCTCGTGGATATGTACGCCAAATGCTCGAAGATGAATGAGGCTAACAAGGTGTTTGAGCGGATGAAGGAGAAGGACGTGGTTTCTTGGAATGCTATGGTCACTGGGTATTCTCAGATTGGTAGCTTTGATAGTGCTCTCTCCTTATTTAAGAGGATGCAAGAGGAAGATATCGAGTTAAATGTTGTAACATGGAGCGCTGTAATTGCTGGGTACGCTCAAAGGGGGCATGGTTTTGAAGCCCTTGATGTATTTAGACAAATGCAACTTTGTGGGTGGGAGCCCAATGTTGTTACTCTTGTGTCTCTTCTTTCAGGTTGTGCTTCTGTGGGAGCATTGCTTCATGGAAAGCAAACACATGCGTATGCCATTAAAAATATTCTCAACTTGGATTGGAGTGATCCAGGGGGTGACTTGATGGTTCTCAACGGTCTAATTGATATGTATGCTAAATGCAAAAGCTATAAAGTTGCTCGCAACATTTTTGACTTGATAGCAGGAAAAGACAAGAATGTGGTGACTTGGACCGTGTTGATTGGTGGATATGCTCAGCATGGAGAAGCCAATGATGCATTAGAACTTTTTGCTCAGATATTTAAACAAGAGACCTCTTTAAAGCCTAATGCCTTTACTCTATCATGTGCCTTGATGGCTTGTGCACGTTTGGGTGCATTAAGGCTTGGAAGACAACTCCATGCCTATGCTTTGCGACATGAAAATGGGTCTGAGGTTTTATTTGTAGCCAATTGTCTTATTGATATGTATTCCAAATCAGGGGACATTGATGCTGCTCAGGCCGTGTTCGACAACATGAAAGTACGGAATGCTGTTTCTTGGACTTCTTTGATGACGGGTTATGGTATGCACGGTCGTGGTGAAGAAGCTTTGCATGTTTTTCATCAAATGCGGCAAGCGGGCTTTGTCATTGATGGGGTTACCTTTCTTGTCGTTTTATATGCTTGTAGCCATTCTGGAATGGTTGATCAAGGCATGGACTACTTCCATGGTATGATCAAGTGCTTTGGGGTTACCCCTGGAGCTGAACATTATGCATGTATGGTTGATCTCTTGGGTCGTGCAGGTCGTTTTAATGAAGCAATGGAACTCATCAAAAGCATGCCAATGGAGCCGACCGCAGTTGTATGGGTGGCACTACTAAGTGCCAGCAGAATCCATGCAAATATTGAGCTTGGGGAATATGCAGCAAGCAGATTGTTAGAGTCGGGGGCAGAGAACGATGGTTCATACACATTGCTTTCGAACTTGTATGCAAATGCACGACGTTGGAAAGATGTAGCTAGAATCAGGTCATTGATGAAGAATACTGGGATCAAGAAGAGACCGGGATGTAGTTGGATACAAGGGAAAAAAACCACTACAACCTTCTTTGTGGGTGATAGAAGTCATCCAGAATCAGACCAAATATACAACCTTCTTTCCGAGTTGATTAAACAGATCAAAGACATGGGGTACGTTCCTCAAACGAGCTTTGCTCTTCATGATGTTGATGATGAAGAGAAAGGTGATCTCTTGTTTGAGCATAGTGAGAAGTTGGCTGTTGCATATGGGATTTTAACATCAGCTCCAGGACAGCCCATTCGAATAAACAAGAACTTGCGCATCTGCGGCGATTGCCACAGTGCCTTAACCTACATTTCTATGATTATTGACCATGAGATCATATTGAGAGACTCGAGTAGGTTCCATCATTTCAAGAAAGGCTCATGTTCTTGTAGAAGCTATTGGTGA

mRNA sequence

ATGCATGCAGTAATGCCCAGCGCCAGAAACCTCCCATCTCCTCTCTCACTGTCGCACAAACCTCCTCACGCTCTGCCCTTCCTTCCGCCGGTCGCCAACCTCAGCGGCCCGCGCCGCCCGACTGCAACGCCCGCCGCCAGTCTCCGATCGCTGTCGCCGCCGTCCGTGAGTAACCTCTGGATATTTCCGTCTCTCTCTCTAAACTCGGTCCCTTTTTGTATCTTTAATTCCGGCCACGCCGCTCGGGTTTCTGGCCTGCAGGTTGTGCCGCCGCTCATTCCCGTCGTCAACAGTGCGCCGGTGAGTGTTATGATACATCACCATTGTGGCTCATTTCTTAGTCGAATCCTCACCACTTTAGTTCAATTCTATTCTACCTCTACAACTTCTCCTCCCACCATTCCCTTCACCTCCTTGTTGAGACAATGCAGGACGTTGATCAATGCCAAGCTTGCTCACCAGCAAATTTTTGTCAATGGCTTCACCGAAATGGCCACCTACGCCGTTGGTGCCTACATCGAGTGTGGTGCTTTTGCAGAAGCTGTATCGCTCCTCCAACGTCTTATTCCGTCGCATTCCACTGTTTTCTGGTGGAATGCACTGATTCGACGTTCTGTGAGACTTGGTTTCCTTGATGATGCATTGGGTTTTTATTGTCAGATGCAGAGGCTTGGGTGGTTGCCTGACCATTACACATTTCCTTTTGTTCTCAAAGCCTGTGGTGAAATCCCATCGTTTCGACGTGGTGCTTCAGTTCACGCCATAGTTTGTGCAAATGGGTTTGAGTCAAATGTATTTATTTGTAATTCAATTGTGGCTATGTATGGGCGATGTGGGGCATTGGATGATGCACGCCAAGTGTTTAATGAGGTGCTTGAAAGAAAGATAGAGGACATTGTGTCTTGGAATTCAATTCTTGCTGCCTATGTACAAGGTGGGGAGTCAAGAACTGCCCTTAGAGTTGCTTTACGGATGGCTAACCACTACAGTTCTAAACTTCGGCCAGATGCAATTACACTTGTGAATATTCTTCCTGCTTGTGCGTCAGCATTTGCACCTCAACATGGTAAGCAGGTACATGGATTTTCAGTACGAAGTGGATTGGTGGATGATGTATTTGTAGGCAATGCTCTCGTGGATATGTACGCCAAATGCTCGAAGATGAATGAGGCTAACAAGGTGTTTGAGCGGATGAAGGAGAAGGACGTGGTTTCTTGGAATGCTATGGTCACTGGGTATTCTCAGATTGGTAGCTTTGATAGTGCTCTCTCCTTATTTAAGAGGATGCAAGAGGAAGATATCGAGTTAAATGTTGTAACATGGAGCGCTGTAATTGCTGGGTACGCTCAAAGGGGGCATGGTTTTGAAGCCCTTGATGTATTTAGACAAATGCAACTTTGTGGGTGGGAGCCCAATGTTGTTACTCTTGTGTCTCTTCTTTCAGGTTGTGCTTCTGTGGGAGCATTGCTTCATGGAAAGCAAACACATGCGTATGCCATTAAAAATATTCTCAACTTGGATTGGAGTGATCCAGGGGGTGACTTGATGGTTCTCAACGGTCTAATTGATATGTATGCTAAATGCAAAAGCTATAAAGTTGCTCGCAACATTTTTGACTTGATAGCAGGAAAAGACAAGAATGTGGTGACTTGGACCGTGTTGATTGGTGGATATGCTCAGCATGGAGAAGCCAATGATGCATTAGAACTTTTTGCTCAGATATTTAAACAAGAGACCTCTTTAAAGCCTAATGCCTTTACTCTATCATGTGCCTTGATGGCTTGTGCACGTTTGGGTGCATTAAGGCTTGGAAGACAACTCCATGCCTATGCTTTGCGACATGAAAATGGGTCTGAGGTTTTATTTGTAGCCAATTGTCTTATTGATATGTATTCCAAATCAGGGGACATTGATGCTGCTCAGGCCGTGTTCGACAACATGAAAGTACGGAATGCTGTTTCTTGGACTTCTTTGATGACGGGTTATGGTATGCACGGTCG
TGGTGAAGAAGCTTTGCATGTTTTTCATCAAATGCGGCAAGCGGGCTTTGTCATTGATGGGGTTACCTTTCTTGTCGTTTTATATGCTTGTAGCCATTCTGGAATGGTTGATCAAGGCATGGACTACTTCCATGGTATGATCAAGTGCTTTGGGGTTACCCCTGGAGCTGAACATTATGCATGTATGGTTGATCTCTTGGGTCGTGCAGGTCGTTTTAATGAAGCAATGGAACTCATCAAAAGCATGCCAATGGAGCCGACCGCAGTTGTATGGGTGGCACTACTAAGTGCCAGCAGAATCCATGCAAATATTGAGCTTGGGGAATATGCAGCAAGCAGATTGTTAGAGTCGGGGGCAGAGAACGATGGTTCATACACATTGCTTTCGAACTTGTATGCAAATGCACGACGTTGGAAAGATGTAGCTAGAATCAGGTCATTGATGAAGAATACTGGGATCAAGAAGAGACCGGGATGTAGTTGGATACAAGGGAAAAAAACCACTACAACCTTCTTTGTGGGTGATAGAAGTCATCCAGAATCAGACCAAATATACAACCTTCTTTCCGAGTTGATTAAACAGATCAAAGACATGGGGTACGTTCCTCAAACGAGCTTTGCTCTTCATGATGTTGATGATGAAGAGAAAGGTGATCTCTTGTTTGAGCATAGTGAGAAGTTGGCTGTTGCATATGGGATTTTAACATCAGCTCCAGGACAGCCCATTCGAATAAACAAGAACTTGCGCATCTGCGGCGATTGCCACAGTGCCTTAACCTACATTTCTATGATTATTGACCATGAGATCATATTGAGAGACTCGAGTAGGTTCCATCATTTCAAGAAAGGCTCATGTTCTTGTAGAAGCTATTGGTGA

Coding sequence (CDS)

ATGCATGCAGTAATGCCCAGCGCCAGAAACCTCCCATCTCCTCTCTCACTGTCGCACAAACCTCCTCACGCTCTGCCCTTCCTTCCGCCGGTCGCCAACCTCAGCGGCCCGCGCCGCCCGACTGCAACGCCCGCCGCCAGTCTCCGATCGCTGTCGCCGCCGTCCGTGAGTAACCTCTGGATATTTCCGTCTCTCTCTCTAAACTCGGTCCCTTTTTGTATCTTTAATTCCGGCCACGCCGCTCGGGTTTCTGGCCTGCAGGTTGTGCCGCCGCTCATTCCCGTCGTCAACAGTGCGCCGGTGAGTGTTATGATACATCACCATTGTGGCTCATTTCTTAGTCGAATCCTCACCACTTTAGTTCAATTCTATTCTACCTCTACAACTTCTCCTCCCACCATTCCCTTCACCTCCTTGTTGAGACAATGCAGGACGTTGATCAATGCCAAGCTTGCTCACCAGCAAATTTTTGTCAATGGCTTCACCGAAATGGCCACCTACGCCGTTGGTGCCTACATCGAGTGTGGTGCTTTTGCAGAAGCTGTATCGCTCCTCCAACGTCTTATTCCGTCGCATTCCACTGTTTTCTGGTGGAATGCACTGATTCGACGTTCTGTGAGACTTGGTTTCCTTGATGATGCATTGGGTTTTTATTGTCAGATGCAGAGGCTTGGGTGGTTGCCTGACCATTACACATTTCCTTTTGTTCTCAAAGCCTGTGGTGAAATCCCATCGTTTCGACGTGGTGCTTCAGTTCACGCCATAGTTTGTGCAAATGGGTTTGAGTCAAATGTATTTATTTGTAATTCAATTGTGGCTATGTATGGGCGATGTGGGGCATTGGATGATGCACGCCAAGTGTTTAATGAGGTGCTTGAAAGAAAGATAGAGGACATTGTGTCTTGGAATTCAATTCTTGCTGCCTATGTACAAGGTGGGGAGTCAAGAACTGCCCTTAGAGTTGCTTTACGGATGGCTAACCACTACAGTTCTAAACTTCGGCCAGATGCAATTACACTTGTGAATATTCTTCCTGCTTGTGCGTCAGCATTTGCACCTCAACATGGTAAGCAGGTACATGGATTTTCAGTACGAAGTGGATTGGTGGATGATGTATTTGTAGGCAATGCTCTCGTGGATATGTACGCCAAATGCTCGAAGATGAATGAGGCTAACAAGGTGTTTGAGCGGATGAAGGAGAAGGACGTGGTTTCTTGGAATGCTATGGTCACTGGGTATTCTCAGATTGGTAGCTTTGATAGTGCTCTCTCCTTATTTAAGAGGATGCAAGAGGAAGATATCGAGTTAAATGTTGTAACATGGAGCGCTGTAATTGCTGGGTACGCTCAAAGGGGGCATGGTTTTGAAGCCCTTGATGTATTTAGACAAATGCAACTTTGTGGGTGGGAGCCCAATGTTGTTACTCTTGTGTCTCTTCTTTCAGGTTGTGCTTCTGTGGGAGCATTGCTTCATGGAAAGCAAACACATGCGTATGCCATTAAAAATATTCTCAACTTGGATTGGAGTGATCCAGGGGGTGACTTGATGGTTCTCAACGGTCTAATTGATATGTATGCTAAATGCAAAAGCTATAAAGTTGCTCGCAACATTTTTGACTTGATAGCAGGAAAAGACAAGAATGTGGTGACTTGGACCGTGTTGATTGGTGGATATGCTCAGCATGGAGAAGCCAATGATGCATTAGAACTTTTTGCTCAGATATTTAAACAAGAGACCTCTTTAAAGCCTAATGCCTTTACTCTATCATGTGCCTTGATGGCTTGTGCACGTTTGGGTGCATTAAGGCTTGGAAGACAACTCCATGCCTATGCTTTGCGACATGAAAATGGGTCTGAGGTTTTATTTGTAGCCAATTGTCTTATTGATATGTATTCCAAATCAGGGGACATTGATGCTGCTCAGGCCGTGTTCGACAACATGAAAGTACGGAATGCTGTTTCTTGGACTTCTTTGATGACGGGTTATGGTATGCACGGTCG
TGGTGAAGAAGCTTTGCATGTTTTTCATCAAATGCGGCAAGCGGGCTTTGTCATTGATGGGGTTACCTTTCTTGTCGTTTTATATGCTTGTAGCCATTCTGGAATGGTTGATCAAGGCATGGACTACTTCCATGGTATGATCAAGTGCTTTGGGGTTACCCCTGGAGCTGAACATTATGCATGTATGGTTGATCTCTTGGGTCGTGCAGGTCGTTTTAATGAAGCAATGGAACTCATCAAAAGCATGCCAATGGAGCCGACCGCAGTTGTATGGGTGGCACTACTAAGTGCCAGCAGAATCCATGCAAATATTGAGCTTGGGGAATATGCAGCAAGCAGATTGTTAGAGTCGGGGGCAGAGAACGATGGTTCATACACATTGCTTTCGAACTTGTATGCAAATGCACGACGTTGGAAAGATGTAGCTAGAATCAGGTCATTGATGAAGAATACTGGGATCAAGAAGAGACCGGGATGTAGTTGGATACAAGGGAAAAAAACCACTACAACCTTCTTTGTGGGTGATAGAAGTCATCCAGAATCAGACCAAATATACAACCTTCTTTCCGAGTTGATTAAACAGATCAAAGACATGGGGTACGTTCCTCAAACGAGCTTTGCTCTTCATGATGTTGATGATGAAGAGAAAGGTGATCTCTTGTTTGAGCATAGTGAGAAGTTGGCTGTTGCATATGGGATTTTAACATCAGCTCCAGGACAGCCCATTCGAATAAACAAGAACTTGCGCATCTGCGGCGATTGCCACAGTGCCTTAACCTACATTTCTATGATTATTGACCATGAGATCATATTGAGAGACTCGAGTAGGTTCCATCATTTCAAGAAAGGCTCATGTTCTTGTAGAAGCTATTGGTGA

Protein sequence

MHAVMPSARNLPSPLSLSHKPPHALPFLPPVANLSGPRRPTATPAASLRSLSPPSVSNLWIFPSLSLNSVPFCIFNSGHAARVSGLQVVPPLIPVVNSAPVSVMIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGFTEMATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW
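The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and the two short test fragments are the first 24 and last 12 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored, and codons
    containing characters other than A, C, G, T are rendered as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCATGCAGTAATGCCCAGCGCC"))  # start of the CDS
print(translate("AGCTATTGGTGA"))              # end of the CDS, including TGA
```

The two fragments translate to the first eight residues (MHAVMPSA) and the last three residues (SYW) of the protein sequence listed above.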
Homology
BLAST of HG10006016 vs. NCBI nr
Match: XP_038889862.1 (pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889863.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889864.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889865.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889866.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889867.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889868.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889869.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889870.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889871.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889872.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889873.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889874.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_038889875.1 pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida])

HSP 1 Score: 1635.2 bits (4233), Expect = 0.0e+00
Identity = 792/855 (92.63%), Positives = 833/855 (97.43%), Query Frame = 0

Query: 104 MIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGFTE 163
           MIHH+CGS+L+R+L+T VQFYSTST SPPTIPF S+L+QC+TLINAKLAHQQIFVNGFTE
Sbjct: 1   MIHHYCGSYLNRVLSTSVQFYSTSTISPPTIPFISILKQCKTLINAKLAHQQIFVNGFTE 60

Query: 164 MATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQMQR 223
           + +YAVGAYIECGAF EAV+LLQRLIPSHSTVFWWNALIRRSVRLGFLDD LGFYCQMQR
Sbjct: 61  IISYAVGAYIECGAFVEAVTLLQRLIPSHSTVFWWNALIRRSVRLGFLDDTLGFYCQMQR 120

Query: 224 LGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALDD 283
           LGWLPDHYTFPFVLKACGEIPSFR GASVHAIVCANGFESNVFICNS+VAMYGRCGAL D
Sbjct: 121 LGWLPDHYTFPFVLKACGEIPSFRCGASVHAIVCANGFESNVFICNSLVAMYGRCGALGD 180

Query: 284 ARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLVNI 343
           ARQVF+EVLERKIEDIVSWNSILAAYVQG ES+TALR+A RMANHYS KLRPDAITLVNI
Sbjct: 181 ARQVFDEVLERKIEDIVSWNSILAAYVQGRESKTALRIAFRMANHYSFKLRPDAITLVNI 240

Query: 344 LPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDV 403
           LPACASAFAPQHGKQVHGFS+RSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDV
Sbjct: 241 LPACASAFAPQHGKQVHGFSIRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDV 300

Query: 404 VSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVFRQ 463
           VSWNAMVTGYSQIGSFDSALSLFKRMQEEDI L+VVTWSAVIAGY+QRGHGFEAL+VFRQ
Sbjct: 301 VSWNAMVTGYSQIGSFDSALSLFKRMQEEDIALDVVTWSAVIAGYSQRGHGFEALNVFRQ 360

Query: 464 MQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGLID 523
           MQLCG EPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDW DPG DLMVLNGLID
Sbjct: 361 MQLCGLEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWRDPGDDLMVLNGLID 420

Query: 524 MYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKP 583
           MYAKC+SY+VARNIFDLIAGKDK+VVTWTV+IGGYAQHGEANDALELFAQIFKQETSLKP
Sbjct: 421 MYAKCQSYRVARNIFDLIAGKDKDVVTWTVMIGGYAQHGEANDALELFAQIFKQETSLKP 480

Query: 584 NAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAV 643
           NAFTLSCALMACARLGALRLGRQLHAYALRHEN SEVL+VANCLIDMYSKSGDIDAA+AV
Sbjct: 481 NAFTLSCALMACARLGALRLGRQLHAYALRHENESEVLYVANCLIDMYSKSGDIDAARAV 540

Query: 644 FDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMV 703
           FDNMKVRN++SWTSLMTGYG+HG GEEALHVF QMRQAGFV+DG+TFLVVLYACSHSGMV
Sbjct: 541 FDNMKVRNSISWTSLMTGYGIHGCGEEALHVFDQMRQAGFVVDGITFLVVLYACSHSGMV 600

Query: 704 DQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLS 763
           DQG++YF+GMIKCFGVTPGAEHYACMVDLLGRAGR  +AM LIKSMPMEPTAVVWVALLS
Sbjct: 601 DQGVNYFNGMIKCFGVTPGAEHYACMVDLLGRAGRLKDAMGLIKSMPMEPTAVVWVALLS 660

Query: 764 ASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKR 823
           ASRIH+NIELGEYAAS+L+E GAENDGSYTLLSNLYANARRWKDVARIRSLMK+TGIKKR
Sbjct: 661 ASRIHSNIELGEYAASKLIELGAENDGSYTLLSNLYANARRWKDVARIRSLMKHTGIKKR 720

Query: 824 PGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEK 883
           PGCSWIQGKK+TTTFFVGDRSHPESDQIYNLLSELIK+IKD+GYVPQTSFALHDVDDEEK
Sbjct: 721 PGCSWIQGKKSTTTFFVGDRSHPESDQIYNLLSELIKRIKDIGYVPQTSFALHDVDDEEK 780

Query: 884 GDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSR 943
           GDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSR
Sbjct: 781 GDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSR 840

Query: 944 FHHFKKGSCSCRSYW 959
           FHHFKKGSCSCRSYW
Sbjct: 841 FHHFKKGSCSCRSYW 855
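The percentages on the HSP statistics line are ratios of matched positions to alignment length. As a minimal sketch (the function name `hsp_percentages` is hypothetical, and the regular expression is written to tolerate a dropped "i" in the second label, which is an assumption about how such reports may be spelled), they can be recomputed from the raw counts:

```python
import re

# Matches "Identity = 792/855" and "Positives = 833/855" (or "Postives").
_STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(stats_line):
    """Recompute identity and positive percentages from an HSP statistics line."""
    result = {}
    for label, matched, length in _STAT.findall(stats_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result


line = "Identity = 792/855 (92.63%), Positives = 833/855 (97.43%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first Benincasa hispida hit this reproduces the reported 92.63% identity and 97.43% positives over an alignment of 855 positions.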

BLAST of HG10006016 vs. NCBI nr
Match: XP_008455181.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo] >XP_008455182.1 PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo] >XP_008455183.1 PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo] >XP_008455184.1 PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo] >XP_008455185.1 PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo] >XP_008455186.1 PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo] >XP_008455189.1 PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo] >XP_016901762.1 PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo])

HSP 1 Score: 1587.8 bits (4110), Expect = 0.0e+00
Identity = 770/857 (89.85%), Positives = 820/857 (95.68%), Query Frame = 0

Query: 102 SVMIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGF 161
           +VMIH  CGS+LSRIL T V FYST TTSPPTIP  SLLRQC+TLINAKLAHQQIFV+GF
Sbjct: 12  NVMIHPRCGSYLSRILITSVHFYSTFTTSPPTIPLISLLRQCKTLINAKLAHQQIFVHGF 71

Query: 162 TEMATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQM 221
           TEM +YAVGAYIECGA AEAVSLLQR+IPSHSTVFWWNALIRRSVRLG LDD LGFYCQM
Sbjct: 72  TEMFSYAVGAYIECGASAEAVSLLQRIIPSHSTVFWWNALIRRSVRLGLLDDTLGFYCQM 131

Query: 222 QRLGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGAL 281
           Q LGWLPDHYTFPFVLKACGEIPSFR GASVHA+VCA GFESNVFICNSIVAMYGRCGAL
Sbjct: 132 QSLGWLPDHYTFPFVLKACGEIPSFRHGASVHAVVCAKGFESNVFICNSIVAMYGRCGAL 191

Query: 282 DDARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLV 341
           DDARQ+F+EVLER+IEDIVSWNSILAAYVQGG+SRTALR+A +M NHYS KLRPDAITLV
Sbjct: 192 DDARQMFDEVLERRIEDIVSWNSILAAYVQGGKSRTALRIAFQMGNHYSLKLRPDAITLV 251

Query: 342 NILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEK 401
           NILPACAS FA QHGKQVHGFSVRSGLVDDVFVGNALV MYAKCSKMNEANKVFER+K+K
Sbjct: 252 NILPACASIFAIQHGKQVHGFSVRSGLVDDVFVGNALVSMYAKCSKMNEANKVFERIKKK 311

Query: 402 DVVSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVF 461
           DVVSWNAMVTGYSQIGSFDSALSLFK MQEEDI+L+V+TWSAVIAGYAQ+GHGFEALDVF
Sbjct: 312 DVVSWNAMVTGYSQIGSFDSALSLFKMMQEEDIKLDVITWSAVIAGYAQKGHGFEALDVF 371

Query: 462 RQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGL 521
           RQMQL G EPNVVTLVSLLSGCASVGALL+GKQTHAY IKNILNL+WSD G D++VLNGL
Sbjct: 372 RQMQLYGLEPNVVTLVSLLSGCASVGALLYGKQTHAYVIKNILNLNWSDKGDDMLVLNGL 431

Query: 522 IDMYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSL 581
           IDMYAKCKSY+VARNIFD IAGKDK+VVTWTV+IGGYAQHGEANDAL+LFAQIF+Q+TSL
Sbjct: 432 IDMYAKCKSYRVARNIFDSIAGKDKDVVTWTVMIGGYAQHGEANDALQLFAQIFEQKTSL 491

Query: 582 KPNAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQ 641
           KPNAFTLSCALMACARLG LRLGRQLHAYALR+EN SEVL+VANCLIDMYSKSGDIDAA+
Sbjct: 492 KPNAFTLSCALMACARLGELRLGRQLHAYALRNENESEVLYVANCLIDMYSKSGDIDAAR 551

Query: 642 AVFDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSG 701
           AVF+NMK+RN VSWTSLMTGYGMHGRGEEALHVF QMRQ GFV+DG+TFLVVLYACSHSG
Sbjct: 552 AVFNNMKLRNVVSWTSLMTGYGMHGRGEEALHVFDQMRQLGFVVDGITFLVVLYACSHSG 611

Query: 702 MVDQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVAL 761
           +VDQGM+YFH M+KCFG+TPGAEHYACMVDLLGRAGR NEAMELIKSM MEPTAVVWVAL
Sbjct: 612 LVDQGMNYFHDMVKCFGITPGAEHYACMVDLLGRAGRLNEAMELIKSMSMEPTAVVWVAL 671

Query: 762 LSASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIK 821
           LSASRIHANIELGEYAAS+L+E GAENDGSYTLLSNLYANARRWKDVARIRSLMK+TGI+
Sbjct: 672 LSASRIHANIELGEYAASKLIELGAENDGSYTLLSNLYANARRWKDVARIRSLMKHTGIR 731

Query: 822 KRPGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDE 881
           KRPGCSWIQGKK+TTTFFVGDRSHPES+QIYNLLS+LIK+IKDMGYVPQTSFALHDVDDE
Sbjct: 732 KRPGCSWIQGKKSTTTFFVGDRSHPESEQIYNLLSDLIKRIKDMGYVPQTSFALHDVDDE 791

Query: 882 EKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDS 941
           EKGDLLFEHSEKLAVAYGILT+APGQPIRI+KNLRICGDCHSALTYISMIIDHEIILRDS
Sbjct: 792 EKGDLLFEHSEKLAVAYGILTTAPGQPIRIHKNLRICGDCHSALTYISMIIDHEIILRDS 851

Query: 942 SRFHHFKKGSCSCRSYW 959
           SRFHHFKKGSCSCRSYW
Sbjct: 852 SRFHHFKKGSCSCRSYW 868

BLAST of HG10006016 vs. NCBI nr
Match: XP_004137054.2 (pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658790.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658791.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658792.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658793.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658794.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658795.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658796.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658797.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658798.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658799.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_011658800.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_031744346.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_031744347.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_031744348.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_031744349.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_031744350.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_031744351.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_031744352.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_031744353.1 pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus])

HSP 1 Score: 1569.7 bits (4063), Expect = 0.0e+00
Identity = 763/857 (89.03%), Positives = 810/857 (94.52%), Query Frame = 0

Query: 102 SVMIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGF 161
           SVMIHHHCGS+LSRIL T V FYST TTSPPTIP  SLLRQC+TLINAKLAHQQIFV+GF
Sbjct: 12  SVMIHHHCGSYLSRILITSVHFYSTFTTSPPTIPLISLLRQCKTLINAKLAHQQIFVHGF 71

Query: 162 TEMATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQM 221
           TEM +YAVGAYIECGA AEAVSLLQRLIPSHSTVFWWNALIRRSV+LG LDD LGFYCQM
Sbjct: 72  TEMFSYAVGAYIECGASAEAVSLLQRLIPSHSTVFWWNALIRRSVKLGLLDDTLGFYCQM 131

Query: 222 QRLGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGAL 281
           QRLGWLPDHYTFPFVLKACGEIPS R GASVHAIVCANG  SNVFICNSIVAMYGRCGAL
Sbjct: 132 QRLGWLPDHYTFPFVLKACGEIPSLRHGASVHAIVCANGLGSNVFICNSIVAMYGRCGAL 191

Query: 282 DDARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLV 341
           DDA Q+F+EVLERKIEDIVSWNSILAAYVQGG+SRTALR+A RM NHYS KLRPDAITLV
Sbjct: 192 DDAHQMFDEVLERKIEDIVSWNSILAAYVQGGQSRTALRIAFRMGNHYSLKLRPDAITLV 251

Query: 342 NILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEK 401
           NILPACAS FA QHGKQVHGFSVR+GLVDDVFVGNALV MYAKCSKMNEANKVFE +K+K
Sbjct: 252 NILPACASVFALQHGKQVHGFSVRNGLVDDVFVGNALVSMYAKCSKMNEANKVFEGIKKK 311

Query: 402 DVVSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVF 461
           DVVSWNAMVTGYSQIGSFDSALSLFK MQEEDI+L+V+TWSAVIAGYAQ+GHGFEALDVF
Sbjct: 312 DVVSWNAMVTGYSQIGSFDSALSLFKMMQEEDIKLDVITWSAVIAGYAQKGHGFEALDVF 371

Query: 462 RQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGL 521
           RQMQL G EPNVVTL SLLSGCASVGALL+GKQTHAY IKNILNL+W+D   DL+VLNGL
Sbjct: 372 RQMQLYGLEPNVVTLASLLSGCASVGALLYGKQTHAYVIKNILNLNWNDKEDDLLVLNGL 431

Query: 522 IDMYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSL 581
           IDMYAKCKSY+VAR+IFD I GKDKNVVTWTV+IGGYAQHGEANDAL+LFAQIFKQ+TSL
Sbjct: 432 IDMYAKCKSYRVARSIFDSIEGKDKNVVTWTVMIGGYAQHGEANDALKLFAQIFKQKTSL 491

Query: 582 KPNAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQ 641
           KPNAFTLSCALMACARLG LRLGRQLHAYALR+EN SEVL+V NCLIDMYSKSGDIDAA+
Sbjct: 492 KPNAFTLSCALMACARLGELRLGRQLHAYALRNENESEVLYVGNCLIDMYSKSGDIDAAR 551

Query: 642 AVFDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSG 701
           AVFDNMK+RN VSWTSLMTGYGMHGRGEEALH+F QM++ GF +DG+TFLVVLYACSHSG
Sbjct: 552 AVFDNMKLRNVVSWTSLMTGYGMHGRGEEALHLFDQMQKLGFAVDGITFLVVLYACSHSG 611

Query: 702 MVDQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVAL 761
           MVDQGM YFH M+K FG+TPGAEHYACMVDLLGRAGR NEAMELIK+M MEPTAVVWVAL
Sbjct: 612 MVDQGMIYFHDMVKGFGITPGAEHYACMVDLLGRAGRLNEAMELIKNMSMEPTAVVWVAL 671

Query: 762 LSASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIK 821
           LSASRIHANIELGEYAAS+L E GAENDGSYTLLSNLYANARRWKDVARIRSLMK+TGI+
Sbjct: 672 LSASRIHANIELGEYAASKLTELGAENDGSYTLLSNLYANARRWKDVARIRSLMKHTGIR 731

Query: 822 KRPGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDE 881
           KRPGCSWIQGKK+TTTFFVGDRSHPES+QIYNLL +LIK+IKDMGYVPQTSFALHDVDDE
Sbjct: 732 KRPGCSWIQGKKSTTTFFVGDRSHPESEQIYNLLLDLIKRIKDMGYVPQTSFALHDVDDE 791

Query: 882 EKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDS 941
           EKGDLLFEHSEKLAVAYGILT+APGQPIRI+KNLRICGDCHSALTYISMIIDHEI+LRDS
Sbjct: 792 EKGDLLFEHSEKLAVAYGILTTAPGQPIRIHKNLRICGDCHSALTYISMIIDHEIVLRDS 851

Query: 942 SRFHHFKKGSCSCRSYW 959
           SRFHHFKKGSCSCRSYW
Sbjct: 852 SRFHHFKKGSCSCRSYW 868

BLAST of HG10006016 vs. NCBI nr
Match: XP_022143067.1 (pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143068.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143069.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143070.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143071.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143072.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143074.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143075.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143076.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143077.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143078.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143079.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143080.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143081.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia] >XP_022143082.1 pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica charantia])

HSP 1 Score: 1562.4 bits (4044), Expect = 0.0e+00
Identity = 759/855 (88.77%), Positives = 805/855 (94.15%), Query Frame = 0

Query: 104 MIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGFTE 163
           MIHH C S++SRIL + V  YSTS TS   IP  SLL+QCRTLINAKLAHQQI VNGFT+
Sbjct: 1   MIHHSCASYVSRILPSSVPCYSTSATS---IPLISLLQQCRTLINAKLAHQQILVNGFTQ 60

Query: 164 MATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQMQR 223
           M TYA+GAYIECGA A+AVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDD LGFYCQMQR
Sbjct: 61  MVTYAIGAYIECGASAQAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDVLGFYCQMQR 120

Query: 224 LGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALDD 283
           LGW PDHYTFPFVLKACGEIPSFRRGASVHA+VCANGFESNVFICNSIVAMYGRCGALDD
Sbjct: 121 LGWSPDHYTFPFVLKACGEIPSFRRGASVHAVVCANGFESNVFICNSIVAMYGRCGALDD 180

Query: 284 ARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLVNI 343
           ARQVF+EVLERKIEDIVSWNSILAAYVQGGES+TALR+A+RMANHY+ KL PDAITLVNI
Sbjct: 181 ARQVFDEVLERKIEDIVSWNSILAAYVQGGESKTALRIAVRMANHYNCKLLPDAITLVNI 240

Query: 344 LPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDV 403
           LPACAS  APQHGKQVHG++VRSGLVDDVFVGNALVDMYAKC KM+EA++VFE MKEKDV
Sbjct: 241 LPACASTLAPQHGKQVHGYAVRSGLVDDVFVGNALVDMYAKCWKMDEASRVFELMKEKDV 300

Query: 404 VSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVFRQ 463
           VSWNAMVTGYSQI  FD ALSLFKRMQEEDIELNVVTWSA+IAGY+QRG GFEALDVFRQ
Sbjct: 301 VSWNAMVTGYSQISRFDDALSLFKRMQEEDIELNVVTWSALIAGYSQRGLGFEALDVFRQ 360

Query: 464 MQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGLID 523
           MQLCG EPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILN DW+DPG DLMVLNGLID
Sbjct: 361 MQLCGLEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNFDWNDPGDDLMVLNGLID 420

Query: 524 MYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKP 583
           MYAKCKS KVARNIFDLI  K+KNVVTWTV+IGGYAQHGEANDALELF+Q+FK ETSLKP
Sbjct: 421 MYAKCKSSKVARNIFDLITRKNKNVVTWTVMIGGYAQHGEANDALELFSQMFKHETSLKP 480

Query: 584 NAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAV 643
           NAFTLSCALMACARLGALRLGRQ+HAYALRHEN +EVL+VANCLIDMYSKSGDIDAAQ V
Sbjct: 481 NAFTLSCALMACARLGALRLGRQIHAYALRHENENEVLYVANCLIDMYSKSGDIDAAQTV 540

Query: 644 FDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMV 703
           FDNMKVRNAVSWTSLMTGYGMHGRGEEALH+F QM+QA   +DGVTFLVVLYACSHSGMV
Sbjct: 541 FDNMKVRNAVSWTSLMTGYGMHGRGEEALHIFDQMQQADLAVDGVTFLVVLYACSHSGMV 600

Query: 704 DQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLS 763
           DQGM+YFHGMIK FGV PGAEHYACMVDLLGRAGR NEAMELIKSM  EPTAVVWVALLS
Sbjct: 601 DQGMNYFHGMIKYFGVAPGAEHYACMVDLLGRAGRLNEAMELIKSMSTEPTAVVWVALLS 660

Query: 764 ASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKR 823
           ASRIHAN+ELGEYAA++L+ESG ENDGSYTLLSNLYANARRWKDVARIRSLMK+TGIKKR
Sbjct: 661 ASRIHANVELGEYAANKLIESGLENDGSYTLLSNLYANARRWKDVARIRSLMKHTGIKKR 720

Query: 824 PGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEK 883
           PGCSW+QGKK TTTFFVGDRSHP+SDQIY +L++LI++IKDMGYVPQTSFALHDVDDEEK
Sbjct: 721 PGCSWVQGKKGTTTFFVGDRSHPQSDQIYGILADLIQRIKDMGYVPQTSFALHDVDDEEK 780

Query: 884 GDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSR 943
           GDLLFEHSEKLAVAYGILTS+PGQPIRINKNLRICGDCHSALTYISMII+HEIILRDSSR
Sbjct: 781 GDLLFEHSEKLAVAYGILTSSPGQPIRINKNLRICGDCHSALTYISMIIEHEIILRDSSR 840

Query: 944 FHHFKKGSCSCRSYW 959
           FHHFK GSCSCR YW
Sbjct: 841 FHHFKNGSCSCRGYW 852

BLAST of HG10006016 vs. NCBI nr
Match: KAA0031472.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK06925.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1547.3 bits (4005), Expect = 0.0e+00
Identity = 753/839 (89.75%), Positives = 802/839 (95.59%), Query Frame = 0

Query: 104 MIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGFTE 163
           MIH  CGS+LSRIL T V FYST TTSPPTIP  SLLRQC+TLINAKLAHQQIFV+GFTE
Sbjct: 1   MIHPRCGSYLSRILITSVHFYSTFTTSPPTIPLISLLRQCKTLINAKLAHQQIFVHGFTE 60

Query: 164 MATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQMQR 223
           M +YAVGAYIECGA AEAVSLLQR+IPSHSTVFWWNALIRRSVRLG LDD LGFYCQMQ 
Sbjct: 61  MFSYAVGAYIECGASAEAVSLLQRIIPSHSTVFWWNALIRRSVRLGLLDDTLGFYCQMQS 120

Query: 224 LGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALDD 283
           LGWLPDHYTFPFVLKACGEIPSFR GASVHA+VCA GFESNVFICNSIVAMYGRCGALDD
Sbjct: 121 LGWLPDHYTFPFVLKACGEIPSFRHGASVHAVVCAKGFESNVFICNSIVAMYGRCGALDD 180

Query: 284 ARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLVNI 343
           ARQ+F+EVLER+IEDIVSWNSILAAYVQGG+SRTALR+A +M NHYS KLRPDAITLVNI
Sbjct: 181 ARQMFDEVLERRIEDIVSWNSILAAYVQGGKSRTALRIAFQMGNHYSLKLRPDAITLVNI 240

Query: 344 LPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDV 403
           LPACAS FA QHGKQVHGFSVRSGLVDDVFVGNALV MYAKCSKMNEANKVFER+K+KDV
Sbjct: 241 LPACASIFAIQHGKQVHGFSVRSGLVDDVFVGNALVSMYAKCSKMNEANKVFERIKKKDV 300

Query: 404 VSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVFRQ 463
           VSWNAMVTGYSQIGSFDSALSLFK MQEEDI+L+V+TWSAVIAGYAQ+GHGFEALDVFRQ
Sbjct: 301 VSWNAMVTGYSQIGSFDSALSLFKMMQEEDIKLDVITWSAVIAGYAQKGHGFEALDVFRQ 360

Query: 464 MQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGLID 523
           MQL G EPNVVTLVSLLSGCASVGALL+GKQTHAY IKNILNL+WSD G D++VLNGLID
Sbjct: 361 MQLYGLEPNVVTLVSLLSGCASVGALLYGKQTHAYVIKNILNLNWSDKGDDMLVLNGLID 420

Query: 524 MYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKP 583
           MYAKCKSY+VARNIFD IAGKDK+VVTWTV+IGGYAQHGEANDAL+LFAQIF+Q+TSLKP
Sbjct: 421 MYAKCKSYRVARNIFDSIAGKDKDVVTWTVMIGGYAQHGEANDALQLFAQIFEQKTSLKP 480

Query: 584 NAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAV 643
           NAFTLSCALMACARLG LRLGRQLHAYALR+EN SEVL+VANCLIDMYSKSGDIDAA+AV
Sbjct: 481 NAFTLSCALMACARLGELRLGRQLHAYALRNENESEVLYVANCLIDMYSKSGDIDAARAV 540

Query: 644 FDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMV 703
           F+NMK+RN VSWTSLMTGYGMHGRGEEALHVF QMRQ GFV+DG+TFLVVLYACSHSG+V
Sbjct: 541 FNNMKLRNVVSWTSLMTGYGMHGRGEEALHVFDQMRQLGFVVDGITFLVVLYACSHSGLV 600

Query: 704 DQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLS 763
           DQGM+YFH M+KCFG+TPGAEHYACMVDLLGRAGR NEAMELIKSM MEPTAVVWVALLS
Sbjct: 601 DQGMNYFHDMVKCFGITPGAEHYACMVDLLGRAGRLNEAMELIKSMSMEPTAVVWVALLS 660

Query: 764 ASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKR 823
           ASRIHANIELGEYAAS+L+E GAENDGSYTLLSNLYANARRWKDVARIRSLMK+TGI+KR
Sbjct: 661 ASRIHANIELGEYAASKLIELGAENDGSYTLLSNLYANARRWKDVARIRSLMKHTGIRKR 720

Query: 824 PGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEK 883
           PGCSWIQGKK+TTTFFVGDRSHPES+QIYNLLS+LIK+IKDMGYVPQTSFALHDVDDEEK
Sbjct: 721 PGCSWIQGKKSTTTFFVGDRSHPESEQIYNLLSDLIKRIKDMGYVPQTSFALHDVDDEEK 780

Query: 884 GDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSS 943
           GDLLFEHSEKLAVAYGILT+APGQPIRI+KNLRICGDCHSALTYISMIIDHEIILRDSS
Sbjct: 781 GDLLFEHSEKLAVAYGILTTAPGQPIRIHKNLRICGDCHSALTYISMIIDHEIILRDSS 839
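
The percentages on each HSP summary line follow directly from the counts: identical (or positive-scoring) aligned positions divided by the alignment length. Below is a minimal Python sketch of that arithmetic, checked against the summary line for the KAA0031472.1 hit above. The regular expression and the function name `parse_hsp_stats` are illustrative assumptions and are not part of the database export or of any BLAST tool.

```python
import re

# Matches the summary line printed under each "HSP" header. The optional "i"
# accepts both "Positives" and the "Postives" spelling seen in some exports.
# (This pattern is an assumption based on the lines shown in this report.)
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, ident_pct, positive, pos_len, pos_pct = match.groups()
    identical, ident_len = int(identical), int(ident_len)
    positive, pos_len = int(positive), int(pos_len)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": ident_len,
        # Recompute the percentages from the raw counts.
        "identity_pct": round(100 * identical / ident_len, 2),
        "positive_pct": round(100 * positive / pos_len, 2),
        # Keep the values printed in the report for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    sample = "Identity = 753/839 (89.75%), Positives = 802/839 (95.59%), Query Frame = 0"
    stats = parse_hsp_stats(sample)
    print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed values with the reported ones is a quick sanity check when these lines are copied by hand into a table.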

BLAST of HG10006016 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 1034.2 bits (2673), Expect = 8.9e-301
Identity = 498/834 (59.71%), Positives = 630/834 (75.54%), Query Frame = 0

Query: 127 STTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGF--TEMATYAVGAYIECGAFAEAVSL 186
           ST++P   P    + +C+T+   KL HQ++   G     + ++ +  YI  G  + AVSL
Sbjct: 24  STSAPEITP--PFIHKCKTISQVKLIHQKLLSFGILTLNLTSHLISTYISVGCLSHAVSL 83

Query: 187 LQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIP 246
           L+R  PS + V+ WN+LIR     G  +  L  +  M  L W PD+YTFPFV KACGEI 
Sbjct: 84  LRRFPPSDAGVYHWNSLIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGEIS 143

Query: 247 SFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNS 306
           S R G S HA+    GF SNVF+ N++VAMY RC +L DAR+VF+E+    + D+VSWNS
Sbjct: 144 SVRCGESAHALSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEM---SVWDVVSWNS 203

Query: 307 ILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSV 366
           I+ +Y + G+ + AL +  RM N +    RPD ITLVN+LP CAS      GKQ+H F+V
Sbjct: 204 IIESYAKLGKPKVALEMFSRMTNEFG--CRPDNITLVNVLPPCASLGTHSLGKQLHCFAV 263

Query: 367 RSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALS 426
            S ++ ++FVGN LVDMYAKC  M+EAN VF  M  KDVVSWNAMV GYSQIG F+ A+ 
Sbjct: 264 TSEMIQNMFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVR 323

Query: 427 LFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCA 486
           LF++MQEE I+++VVTWSA I+GYAQRG G+EAL V RQM   G +PN VTL+S+LSGCA
Sbjct: 324 LFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCA 383

Query: 487 SVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGK 546
           SVGAL+HGK+ H YAIK  ++L  +  G + MV+N LIDMYAKCK    AR +FD ++ K
Sbjct: 384 SVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPK 443

Query: 547 DKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLG 606
           +++VVTWTV+IGGY+QHG+AN ALEL +++F+++   +PNAFT+SCAL+ACA L ALR+G
Sbjct: 444 ERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIG 503

Query: 607 RQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGM 666
           +Q+HAYALR++  +  LFV+NCLIDMY+K G I  A+ VFDNM  +N V+WTSLMTGYGM
Sbjct: 504 KQIHAYALRNQQNAVPLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGM 563

Query: 667 HGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAE 726
           HG GEEAL +F +MR+ GF +DGVT LVVLYACSHSGM+DQGM+YF+ M   FGV+PG E
Sbjct: 564 HGYGEEALGIFDEMRRIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPE 623

Query: 727 HYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLES 786
           HYAC+VDLLGRAGR N A+ LI+ MPMEP  VVWVA LS  RIH  +ELGEYAA ++ E 
Sbjct: 624 HYACLVDLLGRAGRLNAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITEL 683

Query: 787 GAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRS 846
            + +DGSYTLLSNLYANA RWKDV RIRSLM++ G+KKRPGCSW++G K TTTFFVGD++
Sbjct: 684 ASNHDGSYTLLSNLYANAGRWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKT 743

Query: 847 HPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSA 906
           HP + +IY +L + +++IKD+GYVP+T FALHDVDDEEK DLLFEHSEKLA+AYGILT+ 
Sbjct: 744 HPHAKEIYQVLLDHMQRIKDIGYVPETGFALHDVDDEEKDDLLFEHSEKLALAYGILTTP 803

Query: 907 PGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW 959
            G  IRI KNLR+CGDCH+A TY+S IIDH+IILRDSSRFHHFK GSCSC+ YW
Sbjct: 804 QGAAIRITKNLRVCGDCHTAFTYMSRIIDHDIILRDSSRFHHFKNGSCSCKGYW 850

BLAST of HG10006016 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 3.9e-155
Identity = 302/827 (36.52%), Positives = 460/827 (55.62%), Query Frame = 0

Query: 138 SLLRQC---RTLINAKLAHQQIFVNGF---TEMATYAVGAYIECGAFAEAVSLLQRLIPS 197
           S+L+ C   ++L + K     I  NGF   + + +     Y  CG   EA  +   +   
Sbjct: 99  SVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEV--K 158

Query: 198 HSTVFWWNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIPSFRRGAS 257
                +WN L+    + G    ++G + +M   G   D YTF  V K+   + S   G  
Sbjct: 159 IEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQ 218

Query: 258 VHAIVCANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQ 317
           +H  +  +GF     + NS+VA Y +   +D AR+VF+E+ ER   D++SWNSI+  YV 
Sbjct: 219 LHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTER---DVISWNSIINGYVS 278

Query: 318 GGESRTALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDD 377
            G +   L V ++M     S +  D  T+V++   CA +     G+ VH   V++    +
Sbjct: 279 NGLAEKGLSVFVQM---LVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSRE 338

Query: 378 VFVGNALVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQE 437
               N L+DMY+KC  ++ A  VF  M ++ VVS+ +M+ GY++ G    A+ LF+ M+E
Sbjct: 339 DRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEE 398

Query: 438 EDIELNVVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLH 497
           E                                   G  P+V T+ ++L+ CA    L  
Sbjct: 399 E-----------------------------------GISPDVYTVTAVLNCCARYRLLDE 458

Query: 498 GKQTHAYAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTW 557
           GK+ H +  +N       D G D+ V N L+DMYAKC S + A  +F  +  KD  +++W
Sbjct: 459 GKRVHEWIKEN-------DLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKD--IISW 518

Query: 558 TVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYA 617
             +IGGY+++  AN+AL LF  +  +E    P+  T++C L ACA L A   GR++H Y 
Sbjct: 519 NTIIGGYSKNCYANEALSLF-NLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYI 578

Query: 618 LRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEA 677
           +R+   S+   VAN L+DMY+K G +  A  +FD++  ++ VSWT ++ GYGMHG G+EA
Sbjct: 579 MRNGYFSD-RHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEA 638

Query: 678 LHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVD 737
           + +F+QMRQAG   D ++F+ +LYACSHSG+VD+G  +F+ M     + P  EHYAC+VD
Sbjct: 639 IALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVD 698

Query: 738 LLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGS 797
           +L R G   +A   I++MP+ P A +W ALL   RIH +++L E  A ++ E   EN G 
Sbjct: 699 MLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGY 758

Query: 798 YTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQI 857
           Y L++N+YA A +W+ V R+R  +   G++K PGCSWI+ K     F  GD S+PE++ I
Sbjct: 759 YVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENI 818

Query: 858 YNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRI 917
              L ++  ++ + GY P T +AL D ++ EK + L  HSEKLA+A GI++S  G+ IR+
Sbjct: 819 EAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRV 871

Query: 918 NKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW 959
            KNLR+CGDCH    ++S +   EI+LRDS+RFH FK G CSCR +W
Sbjct: 879 TKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871

BLAST of HG10006016 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 1.8e-152
Identity = 293/827 (35.43%), Positives = 461/827 (55.74%), Query Frame = 0

Query: 135 PFTSLLRQCRTLINAKLAHQQIFVNGFTE---MATYAVGAYIECGAFAEAVSLLQRLIPS 194
           P   LL +C +L   +     +F NG  +     T  V  +   G+  EA  + + +   
Sbjct: 39  PAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSK 98

Query: 195 HSTVFWWNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIPSFRRGAS 254
            + ++  + +++   ++  LD AL F+ +M+     P  Y F ++LK CG+    R G  
Sbjct: 99  LNVLY--HTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKE 158

Query: 255 VHAIVCANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQ 314
           +H ++  +GF  ++F    +  MY +C  +++AR+VF+ + ER   D+VSWN+I+A Y Q
Sbjct: 159 IHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPER---DLVSWNTIVAGYSQ 218

Query: 315 GGESRTALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDD 374
            G +R AL +   M       L+P  IT+V++LPA ++      GK++HG+++RSG    
Sbjct: 219 NGMARMALEMVKSMC---EENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSL 278

Query: 375 VFVGNALVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQE 434
           V +  ALVDMYAKC  +  A ++F+ M E++VVSWN+M+  Y Q  +   A+ +F++M +
Sbjct: 279 VNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLD 338

Query: 435 EDIELNVVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLH 494
           E                                   G +P  V+++  L  CA +G L  
Sbjct: 339 E-----------------------------------GVKPTDVSVMGALHACADLGDLER 398

Query: 495 GKQTHAYAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTW 554
           G+  H  +++  L LD      ++ V+N LI MY KCK    A ++F  +  + + +V+W
Sbjct: 399 GRFIHKLSVE--LGLD-----RNVSVVNSLISMYCKCKEVDTAASMFGKL--QSRTLVSW 458

Query: 555 TVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYA 614
             +I G+AQ+G   DAL  F+Q+  +  ++KP+ FT    + A A L      + +H   
Sbjct: 459 NAMILGFAQNGRPIDALNYFSQM--RSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVV 518

Query: 615 LRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEA 674
           +R      V FV   L+DMY+K G I  A+ +FD M  R+  +W +++ GYG HG G+ A
Sbjct: 519 MRSCLDKNV-FVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 578

Query: 675 LHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVD 734
           L +F +M++     +GVTFL V+ ACSHSG+V+ G+  F+ M + + +    +HY  MVD
Sbjct: 579 LELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVD 638

Query: 735 LLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGS 794
           LLGRAGR NEA + I  MP++P   V+ A+L A +IH N+   E AA RL E   ++ G 
Sbjct: 639 LLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGY 698

Query: 795 YTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQI 854
           + LL+N+Y  A  W+ V ++R  M   G++K PGCS ++ K    +FF G  +HP+S +I
Sbjct: 699 HVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKI 758

Query: 855 YNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRI 914
           Y  L +LI  IK+ GYVP T+  L  V+++ K  LL  HSEKLA+++G+L +  G  I +
Sbjct: 759 YAFLEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHV 809

Query: 915 NKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW 959
            KNLR+C DCH+A  YIS++   EI++RD  RFHHFK G+CSC  YW
Sbjct: 819 RKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of HG10006016 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 536.2 bits (1380), Expect = 7.6e-151
Identity = 283/726 (38.98%), Positives = 426/726 (58.68%), Query Frame = 0

Query: 266 FICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRM 325
           F  N++++ Y + G +D   + F+++ +R   D VSW +++  Y   G+   A+RV   M
Sbjct: 81  FSWNTVLSAYSKRGDMDSTCEFFDQLPQR---DSVSWTTMIVGYKNIGQYHKAIRV---M 140

Query: 326 ANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKC 385
            +     + P   TL N+L + A+    + GK+VH F V+ GL  +V V N+L++MYAKC
Sbjct: 141 GDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKC 200

Query: 386 SKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVI 445
                A  VF+RM  +D+ SWNAM+  + Q+G  D A++ F++M E DI    VTW+++I
Sbjct: 201 GDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDI----VTWNSMI 260

Query: 446 AGYAQRGHGFEALDVFRQMQLCG-WEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNIL 505
           +G+ QRG+   ALD+F +M       P+  TL S+LS CA++  L  GKQ H++ +    
Sbjct: 261 SGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGF 320

Query: 506 NLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFD------------------------- 565
           ++         +VLN LI MY++C   + AR + +                         
Sbjct: 321 DISG-------IVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDM 380

Query: 566 ------LIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALM 625
                  ++ KD++VV WT +I GY QHG   +A+ LF  +       +PN++TL+  L 
Sbjct: 381 NQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMV--GGGQRPNSYTLAAMLS 440

Query: 626 ACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKV-RNA 685
             + L +L  G+Q+H  A++      V  V+N LI MY+K+G+I +A   FD ++  R+ 
Sbjct: 441 VASSLASLSHGKQIHGSAVKSGEIYSV-SVSNALITMYAKAGNITSASRAFDLIRCERDT 500

Query: 686 VSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHG 745
           VSWTS++     HG  EEAL +F  M   G   D +T++ V  AC+H+G+V+QG  YF  
Sbjct: 501 VSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDM 560

Query: 746 MIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIE 805
           M     + P   HYACMVDL GRAG   EA E I+ MP+EP  V W +LLSA R+H NI+
Sbjct: 561 MKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNID 620

Query: 806 LGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGK 865
           LG+ AA RLL    EN G+Y+ L+NLY+   +W++ A+IR  MK+  +KK  G SWI+ K
Sbjct: 621 LGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVK 680

Query: 866 KTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSE 925
                F V D +HPE ++IY  + ++  +IK MGYVP T+  LHD+++E K  +L  HSE
Sbjct: 681 HKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSE 740

Query: 926 KLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSC 959
           KLA+A+G++++     +RI KNLR+C DCH+A+ +IS ++  EII+RD++RFHHFK G C
Sbjct: 741 KLAIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFC 786

BLAST of HG10006016 vs. ExPASy Swiss-Prot
Match: Q0WN60 (Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H8 PE=2 SV=2)

HSP 1 Score: 534.3 bits (1375), Expect = 2.9e-150
Identity = 302/866 (34.87%), Positives = 465/866 (53.70%), Query Frame = 0

Query: 164 MATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQM-Q 223
           + T  +  Y  CG+  ++  +   L      +F WNA+I    R    D+ L  + +M  
Sbjct: 122 LCTRIITMYAMCGSPDDSRFVFDAL--RSKNLFQWNAVISSYSRNELYDEVLETFIEMIS 181

Query: 224 RLGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALD 283
               LPDH+T+P V+KAC  +     G +VH +V   G   +VF+ N++V+ YG  G + 
Sbjct: 182 TTDLLPDHFTYPCVIKACAGMSDVGIGLAVHGLVVKTGLVEDVFVGNALVSFYGTHGFVT 241

Query: 284 DARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTA-LRVALRMANHYSSKLRPDAITLV 343
           DA Q+F+ + ER   ++VSWNS++  +   G S  + L +   M  +      PD  TLV
Sbjct: 242 DALQLFDIMPER---NLVSWNSMIRVFSDNGFSEESFLLLGEMMEENGDGAFMPDVATLV 301

Query: 344 NILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEK 403
            +LP CA       GK VHG++V+  L  ++ + NAL+DMY+KC  +  A  +F+    K
Sbjct: 302 TVLPVCAREREIGLGKGVHGWAVKLRLDKELVLNNALMDMYSKCGCITNAQMIFKMNNNK 361

Query: 404 DVVSWNAMVTGYSQIGSFDSALSLFKRMQE--EDIELNVVT------------------- 463
           +VVSWN MV G+S  G       + ++M    ED++ + VT                   
Sbjct: 362 NVVSWNTMVGGFSAEGDTHGTFDVLRQMLAGGEDVKADEVTILNAVPVCFHESFLPSLKE 421

Query: 464 -----------------------------------------------WSAVIAGYAQRGH 523
                                                          W+A+I G+AQ   
Sbjct: 422 LHCYSLKQEFVYNELVANAFVASYAKCGSLSYAQRVFHGIRSKTVNSWNALIGGHAQSND 481

Query: 524 GFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGG 583
              +LD   QM++ G  P+  T+ SLLS C+ + +L  GK+ H + I+N L         
Sbjct: 482 PRLSLDAHLQMKISGLLPDSFTVCSLLSACSKLKSLRLGKEVHGFIIRNWLE-------R 541

Query: 584 DLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQ 643
           DL V   ++ +Y  C      + +FD  A +DK++V+W  +I GY Q+G  + AL +F Q
Sbjct: 542 DLFVYLSVLSLYIHCGELCTVQALFD--AMEDKSLVSWNTVITGYLQNGFPDRALGVFRQ 601

Query: 644 IFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSK 703
           +      L     ++     AC+ L +LRLGR+ HAYAL+H    +  F+A  LIDMY+K
Sbjct: 602 MVLYGIQL--CGISMMPVFGACSLLPSLRLGREAHAYALKHLLEDDA-FIACSLIDMYAK 661

Query: 704 SGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVV 763
           +G I  +  VF+ +K ++  SW +++ GYG+HG  +EA+ +F +M++ G   D +TFL V
Sbjct: 662 NGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKLFEEMQRTGHNPDDLTFLGV 721

Query: 764 LYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELI-KSMPME 823
           L AC+HSG++ +G+ Y   M   FG+ P  +HYAC++D+LGRAG+ ++A+ ++ + M  E
Sbjct: 722 LTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGRAGQLDKALRVVAEEMSEE 781

Query: 824 PTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIR 883
               +W +LLS+ RIH N+E+GE  A++L E   E   +Y LLSNLYA   +W+DV ++R
Sbjct: 782 ADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYVLLSNLYAGLGKWEDVRKVR 841

Query: 884 SLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTS 943
             M    ++K  GCSWI+  +   +F VG+R     ++I +L S L  +I  MGY P T 
Sbjct: 842 QRMNEMSLRKDAGCSWIELNRKVFSFVVGERFLDGFEEIKSLWSILEMKISKMGYRPDTM 901

Query: 944 FALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMII 959
              HD+ +EEK + L  HSEKLA+ YG++ ++ G  IR+ KNLRIC DCH+A   IS ++
Sbjct: 902 SVQHDLSEEEKIEQLRGHSEKLALTYGLIKTSEGTTIRVYKNLRICVDCHNAAKLISKVM 961

BLAST of HG10006016 vs. ExPASy TrEMBL
Match: A0A1S3C0G3 (pentatricopeptide repeat-containing protein At5g16860 OS=Cucumis melo OX=3656 GN=LOC103495413 PE=3 SV=1)

HSP 1 Score: 1587.8 bits (4110), Expect = 0.0e+00
Identity = 770/857 (89.85%), Positives = 820/857 (95.68%), Query Frame = 0

Query: 102 SVMIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGF 161
           +VMIH  CGS+LSRIL T V FYST TTSPPTIP  SLLRQC+TLINAKLAHQQIFV+GF
Sbjct: 12  NVMIHPRCGSYLSRILITSVHFYSTFTTSPPTIPLISLLRQCKTLINAKLAHQQIFVHGF 71

Query: 162 TEMATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQM 221
           TEM +YAVGAYIECGA AEAVSLLQR+IPSHSTVFWWNALIRRSVRLG LDD LGFYCQM
Sbjct: 72  TEMFSYAVGAYIECGASAEAVSLLQRIIPSHSTVFWWNALIRRSVRLGLLDDTLGFYCQM 131

Query: 222 QRLGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGAL 281
           Q LGWLPDHYTFPFVLKACGEIPSFR GASVHA+VCA GFESNVFICNSIVAMYGRCGAL
Sbjct: 132 QSLGWLPDHYTFPFVLKACGEIPSFRHGASVHAVVCAKGFESNVFICNSIVAMYGRCGAL 191

Query: 282 DDARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLV 341
           DDARQ+F+EVLER+IEDIVSWNSILAAYVQGG+SRTALR+A +M NHYS KLRPDAITLV
Sbjct: 192 DDARQMFDEVLERRIEDIVSWNSILAAYVQGGKSRTALRIAFQMGNHYSLKLRPDAITLV 251

Query: 342 NILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEK 401
           NILPACAS FA QHGKQVHGFSVRSGLVDDVFVGNALV MYAKCSKMNEANKVFER+K+K
Sbjct: 252 NILPACASIFAIQHGKQVHGFSVRSGLVDDVFVGNALVSMYAKCSKMNEANKVFERIKKK 311

Query: 402 DVVSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVF 461
           DVVSWNAMVTGYSQIGSFDSALSLFK MQEEDI+L+V+TWSAVIAGYAQ+GHGFEALDVF
Sbjct: 312 DVVSWNAMVTGYSQIGSFDSALSLFKMMQEEDIKLDVITWSAVIAGYAQKGHGFEALDVF 371

Query: 462 RQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGL 521
           RQMQL G EPNVVTLVSLLSGCASVGALL+GKQTHAY IKNILNL+WSD G D++VLNGL
Sbjct: 372 RQMQLYGLEPNVVTLVSLLSGCASVGALLYGKQTHAYVIKNILNLNWSDKGDDMLVLNGL 431

Query: 522 IDMYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSL 581
           IDMYAKCKSY+VARNIFD IAGKDK+VVTWTV+IGGYAQHGEANDAL+LFAQIF+Q+TSL
Sbjct: 432 IDMYAKCKSYRVARNIFDSIAGKDKDVVTWTVMIGGYAQHGEANDALQLFAQIFEQKTSL 491

Query: 582 KPNAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQ 641
           KPNAFTLSCALMACARLG LRLGRQLHAYALR+EN SEVL+VANCLIDMYSKSGDIDAA+
Sbjct: 492 KPNAFTLSCALMACARLGELRLGRQLHAYALRNENESEVLYVANCLIDMYSKSGDIDAAR 551

Query: 642 AVFDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSG 701
           AVF+NMK+RN VSWTSLMTGYGMHGRGEEALHVF QMRQ GFV+DG+TFLVVLYACSHSG
Sbjct: 552 AVFNNMKLRNVVSWTSLMTGYGMHGRGEEALHVFDQMRQLGFVVDGITFLVVLYACSHSG 611

Query: 702 MVDQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVAL 761
           +VDQGM+YFH M+KCFG+TPGAEHYACMVDLLGRAGR NEAMELIKSM MEPTAVVWVAL
Sbjct: 612 LVDQGMNYFHDMVKCFGITPGAEHYACMVDLLGRAGRLNEAMELIKSMSMEPTAVVWVAL 671

Query: 762 LSASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIK 821
           LSASRIHANIELGEYAAS+L+E GAENDGSYTLLSNLYANARRWKDVARIRSLMK+TGI+
Sbjct: 672 LSASRIHANIELGEYAASKLIELGAENDGSYTLLSNLYANARRWKDVARIRSLMKHTGIR 731

Query: 822 KRPGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDE 881
           KRPGCSWIQGKK+TTTFFVGDRSHPES+QIYNLLS+LIK+IKDMGYVPQTSFALHDVDDE
Sbjct: 732 KRPGCSWIQGKKSTTTFFVGDRSHPESEQIYNLLSDLIKRIKDMGYVPQTSFALHDVDDE 791

Query: 882 EKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDS 941
           EKGDLLFEHSEKLAVAYGILT+APGQPIRI+KNLRICGDCHSALTYISMIIDHEIILRDS
Sbjct: 792 EKGDLLFEHSEKLAVAYGILTTAPGQPIRIHKNLRICGDCHSALTYISMIIDHEIILRDS 851

Query: 942 SRFHHFKKGSCSCRSYW 959
           SRFHHFKKGSCSCRSYW
Sbjct: 852 SRFHHFKKGSCSCRSYW 868

BLAST of HG10006016 vs. ExPASy TrEMBL
Match: A0A6J1CPQ5 (pentatricopeptide repeat-containing protein At5g16860 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111013040 PE=3 SV=1)

HSP 1 Score: 1562.4 bits (4044), Expect = 0.0e+00
Identity = 759/855 (88.77%), Positives = 805/855 (94.15%), Query Frame = 0

Query: 104 MIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGFTE 163
           MIHH C S++SRIL + V  YSTS TS   IP  SLL+QCRTLINAKLAHQQI VNGFT+
Sbjct: 1   MIHHSCASYVSRILPSSVPCYSTSATS---IPLISLLQQCRTLINAKLAHQQILVNGFTQ 60

Query: 164 MATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQMQR 223
           M TYA+GAYIECGA A+AVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDD LGFYCQMQR
Sbjct: 61  MVTYAIGAYIECGASAQAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDVLGFYCQMQR 120

Query: 224 LGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALDD 283
           LGW PDHYTFPFVLKACGEIPSFRRGASVHA+VCANGFESNVFICNSIVAMYGRCGALDD
Sbjct: 121 LGWSPDHYTFPFVLKACGEIPSFRRGASVHAVVCANGFESNVFICNSIVAMYGRCGALDD 180

Query: 284 ARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLVNI 343
           ARQVF+EVLERKIEDIVSWNSILAAYVQGGES+TALR+A+RMANHY+ KL PDAITLVNI
Sbjct: 181 ARQVFDEVLERKIEDIVSWNSILAAYVQGGESKTALRIAVRMANHYNCKLLPDAITLVNI 240

Query: 344 LPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDV 403
           LPACAS  APQHGKQVHG++VRSGLVDDVFVGNALVDMYAKC KM+EA++VFE MKEKDV
Sbjct: 241 LPACASTLAPQHGKQVHGYAVRSGLVDDVFVGNALVDMYAKCWKMDEASRVFELMKEKDV 300

Query: 404 VSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVFRQ 463
           VSWNAMVTGYSQI  FD ALSLFKRMQEEDIELNVVTWSA+IAGY+QRG GFEALDVFRQ
Sbjct: 301 VSWNAMVTGYSQISRFDDALSLFKRMQEEDIELNVVTWSALIAGYSQRGLGFEALDVFRQ 360

Query: 464 MQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGLID 523
           MQLCG EPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILN DW+DPG DLMVLNGLID
Sbjct: 361 MQLCGLEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNFDWNDPGDDLMVLNGLID 420

Query: 524 MYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKP 583
           MYAKCKS KVARNIFDLI  K+KNVVTWTV+IGGYAQHGEANDALELF+Q+FK ETSLKP
Sbjct: 421 MYAKCKSSKVARNIFDLITRKNKNVVTWTVMIGGYAQHGEANDALELFSQMFKHETSLKP 480

Query: 584 NAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAV 643
           NAFTLSCALMACARLGALRLGRQ+HAYALRHEN +EVL+VANCLIDMYSKSGDIDAAQ V
Sbjct: 481 NAFTLSCALMACARLGALRLGRQIHAYALRHENENEVLYVANCLIDMYSKSGDIDAAQTV 540

Query: 644 FDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMV 703
           FDNMKVRNAVSWTSLMTGYGMHGRGEEALH+F QM+QA   +DGVTFLVVLYACSHSGMV
Sbjct: 541 FDNMKVRNAVSWTSLMTGYGMHGRGEEALHIFDQMQQADLAVDGVTFLVVLYACSHSGMV 600

Query: 704 DQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLS 763
           DQGM+YFHGMIK FGV PGAEHYACMVDLLGRAGR NEAMELIKSM  EPTAVVWVALLS
Sbjct: 601 DQGMNYFHGMIKYFGVAPGAEHYACMVDLLGRAGRLNEAMELIKSMSTEPTAVVWVALLS 660

Query: 764 ASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKR 823
           ASRIHAN+ELGEYAA++L+ESG ENDGSYTLLSNLYANARRWKDVARIRSLMK+TGIKKR
Sbjct: 661 ASRIHANVELGEYAANKLIESGLENDGSYTLLSNLYANARRWKDVARIRSLMKHTGIKKR 720

Query: 824 PGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEK 883
           PGCSW+QGKK TTTFFVGDRSHP+SDQIY +L++LI++IKDMGYVPQTSFALHDVDDEEK
Sbjct: 721 PGCSWVQGKKGTTTFFVGDRSHPQSDQIYGILADLIQRIKDMGYVPQTSFALHDVDDEEK 780

Query: 884 GDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSR 943
           GDLLFEHSEKLAVAYGILTS+PGQPIRINKNLRICGDCHSALTYISMII+HEIILRDSSR
Sbjct: 781 GDLLFEHSEKLAVAYGILTSSPGQPIRINKNLRICGDCHSALTYISMIIEHEIILRDSSR 840

Query: 944 FHHFKKGSCSCRSYW 959
           FHHFK GSCSCR YW
Sbjct: 841 FHHFKNGSCSCRGYW 852

BLAST of HG10006016 vs. ExPASy TrEMBL
Match: A0A5A7SK77 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold13G002350 PE=3 SV=1)

HSP 1 Score: 1547.3 bits (4005), Expect = 0.0e+00
Identity = 753/839 (89.75%), Positives = 802/839 (95.59%), Query Frame = 0

Query: 104 MIHHHCGSFLSRILTTLVQFYSTSTTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGFTE 163
           MIH  CGS+LSRIL T V FYST TTSPPTIP  SLLRQC+TLINAKLAHQQIFV+GFTE
Sbjct: 1   MIHPRCGSYLSRILITSVHFYSTFTTSPPTIPLISLLRQCKTLINAKLAHQQIFVHGFTE 60

Query: 164 MATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQMQR 223
           M +YAVGAYIECGA AEAVSLLQR+IPSHSTVFWWNALIRRSVRLG LDD LGFYCQMQ 
Sbjct: 61  MFSYAVGAYIECGASAEAVSLLQRIIPSHSTVFWWNALIRRSVRLGLLDDTLGFYCQMQS 120

Query: 224 LGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALDD 283
           LGWLPDHYTFPFVLKACGEIPSFR GASVHA+VCA GFESNVFICNSIVAMYGRCGALDD
Sbjct: 121 LGWLPDHYTFPFVLKACGEIPSFRHGASVHAVVCAKGFESNVFICNSIVAMYGRCGALDD 180

Query: 284 ARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLVNI 343
           ARQ+F+EVLER+IEDIVSWNSILAAYVQGG+SRTALR+A +M NHYS KLRPDAITLVNI
Sbjct: 181 ARQMFDEVLERRIEDIVSWNSILAAYVQGGKSRTALRIAFQMGNHYSLKLRPDAITLVNI 240

Query: 344 LPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDV 403
           LPACAS FA QHGKQVHGFSVRSGLVDDVFVGNALV MYAKCSKMNEANKVFER+K+KDV
Sbjct: 241 LPACASIFAIQHGKQVHGFSVRSGLVDDVFVGNALVSMYAKCSKMNEANKVFERIKKKDV 300

Query: 404 VSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVFRQ 463
           VSWNAMVTGYSQIGSFDSALSLFK MQEEDI+L+V+TWSAVIAGYAQ+GHGFEALDVFRQ
Sbjct: 301 VSWNAMVTGYSQIGSFDSALSLFKMMQEEDIKLDVITWSAVIAGYAQKGHGFEALDVFRQ 360

Query: 464 MQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGLID 523
           MQL G EPNVVTLVSLLSGCASVGALL+GKQTHAY IKNILNL+WSD G D++VLNGLID
Sbjct: 361 MQLYGLEPNVVTLVSLLSGCASVGALLYGKQTHAYVIKNILNLNWSDKGDDMLVLNGLID 420

Query: 524 MYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKP 583
           MYAKCKSY+VARNIFD IAGKDK+VVTWTV+IGGYAQHGEANDAL+LFAQIF+Q+TSLKP
Sbjct: 421 MYAKCKSYRVARNIFDSIAGKDKDVVTWTVMIGGYAQHGEANDALQLFAQIFEQKTSLKP 480

Query: 584 NAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAV 643
           NAFTLSCALMACARLG LRLGRQLHAYALR+EN SEVL+VANCLIDMYSKSGDIDAA+AV
Sbjct: 481 NAFTLSCALMACARLGELRLGRQLHAYALRNENESEVLYVANCLIDMYSKSGDIDAARAV 540

Query: 644 FDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMV 703
           F+NMK+RN VSWTSLMTGYGMHGRGEEALHVF QMRQ GFV+DG+TFLVVLYACSHSG+V
Sbjct: 541 FNNMKLRNVVSWTSLMTGYGMHGRGEEALHVFDQMRQLGFVVDGITFLVVLYACSHSGLV 600

Query: 704 DQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLS 763
           DQGM+YFH M+KCFG+TPGAEHYACMVDLLGRAGR NEAMELIKSM MEPTAVVWVALLS
Sbjct: 601 DQGMNYFHDMVKCFGITPGAEHYACMVDLLGRAGRLNEAMELIKSMSMEPTAVVWVALLS 660

Query: 764 ASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKR 823
           ASRIHANIELGEYAAS+L+E GAENDGSYTLLSNLYANARRWKDVARIRSLMK+TGI+KR
Sbjct: 661 ASRIHANIELGEYAASKLIELGAENDGSYTLLSNLYANARRWKDVARIRSLMKHTGIRKR 720

Query: 824 PGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEK 883
           PGCSWIQGKK+TTTFFVGDRSHPES+QIYNLLS+LIK+IKDMGYVPQTSFALHDVDDEEK
Sbjct: 721 PGCSWIQGKKSTTTFFVGDRSHPESEQIYNLLSDLIKRIKDMGYVPQTSFALHDVDDEEK 780

Query: 884 GDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSS 943
           GDLLFEHSEKLAVAYGILT+APGQPIRI+KNLRICGDCHSALTYISMIIDHEIILRDSS
Sbjct: 781 GDLLFEHSEKLAVAYGILTTAPGQPIRIHKNLRICGDCHSALTYISMIIDHEIILRDSS 839

BLAST of HG10006016 vs. ExPASy TrEMBL
Match: A0A6J1H912 (pentatricopeptide repeat-containing protein At5g16860 OS=Cucurbita moschata OX=3662 GN=LOC111461191 PE=3 SV=1)

HSP 1 Score: 1513.0 bits (3916), Expect = 0.0e+00
Identity = 728/821 (88.67%), Positives = 777/821 (94.64%), Query Frame = 0

Query: 138 SLLRQCRTLINAKLAHQQIFVNGFTEMATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFW 197
           S L+QCRTLI+AKL HQQI VNGFT++ T+A+G YIEC AF +AVSLL+RL+PSHSTVFW
Sbjct: 2   SFLKQCRTLIDAKLVHQQILVNGFTDLVTHAIGGYIECNAFGQAVSLLERLVPSHSTVFW 61

Query: 198 WNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVC 257
           WNALIRRSVRLGFLDDAL FY QMQRLGW PDHYTFPFVLKACGE  SFR G SVHA+VC
Sbjct: 62  WNALIRRSVRLGFLDDALCFYRQMQRLGWWPDHYTFPFVLKACGEKLSFRCGTSVHAMVC 121

Query: 258 ANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQGGESRT 317
           A GFESNVFICNS+VAMYGRCGALDDARQVF+EVLERKI+DIVSWNSILAAYVQGGES+ 
Sbjct: 122 AYGFESNVFICNSVVAMYGRCGALDDARQVFDEVLERKIDDIVSWNSILAAYVQGGESKA 181

Query: 318 ALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNA 377
           ALR+A +MA HY+ KLRPDAITLVN+LPACAS FA QHG+QVHGF+VRSGLVDDVFVGNA
Sbjct: 182 ALRIAFQMAKHYNFKLRPDAITLVNVLPACASTFATQHGRQVHGFAVRSGLVDDVFVGNA 241

Query: 378 LVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELN 437
           LVDMYAKCSKMNEANKVFE+MKEKDVVSWNA+VTGYSQIGSFD ALSLFKRMQEEDIELN
Sbjct: 242 LVDMYAKCSKMNEANKVFEQMKEKDVVSWNALVTGYSQIGSFDDALSLFKRMQEEDIELN 301

Query: 438 VVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHA 497
           VVTWSAVIAGY+QRGHG EALDVFRQMQ CG EPNVVTLVSLLSGCASVGALLHGKQTHA
Sbjct: 302 VVTWSAVIAGYSQRGHGCEALDVFRQMQHCGLEPNVVTLVSLLSGCASVGALLHGKQTHA 361

Query: 498 YAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGG 557
           YAIKNILNLDWSDPG D+MV NGLIDMYAKCKS +VARNIFD I GKDKNVVTWTV+IGG
Sbjct: 362 YAIKNILNLDWSDPGDDMMVFNGLIDMYAKCKSSRVARNIFDSIIGKDKNVVTWTVMIGG 421

Query: 558 YAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYALRHENG 617
           YAQHGEANDA+ELF+Q+FKQETSLKPNAFTLSCALMACARLGALRLG+Q+HAYALRHEN 
Sbjct: 422 YAQHGEANDAVELFSQMFKQETSLKPNAFTLSCALMACARLGALRLGKQIHAYALRHENE 481

Query: 618 SEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQ 677
           SEVL VANCLIDMYSKSGDIDAAQ VFDNMKVRNAVSWTSLMTGYG+HGRGEEAL VF+Q
Sbjct: 482 SEVLHVANCLIDMYSKSGDIDAAQIVFDNMKVRNAVSWTSLMTGYGIHGRGEEALRVFNQ 541

Query: 678 MRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAG 737
           MRQ G  +DGVTFLVVLYACSHSGMVDQGM+YFHGM+K FGV PGAEHYACMVDLLGRAG
Sbjct: 542 MRQVGLSVDGVTFLVVLYACSHSGMVDQGMNYFHGMVKYFGVAPGAEHYACMVDLLGRAG 601

Query: 738 RFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGSYTLLSN 797
           R NEAMELIKSMPMEPT VVWVALLSASR HAN+ELGEYAAS+L+ESGAENDGSYTLLSN
Sbjct: 602 RLNEAMELIKSMPMEPTPVVWVALLSASRTHANVELGEYAASKLMESGAENDGSYTLLSN 661

Query: 798 LYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSE 857
           LYANARRWKDVARIR LMK+TGIKKRPGCSW+QGKK+TTTFFVGD+SHP+SDQIYN+LS+
Sbjct: 662 LYANARRWKDVARIRRLMKHTGIKKRPGCSWVQGKKSTTTFFVGDKSHPQSDQIYNILSD 721

Query: 858 LIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRI 917
           LI++IKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRI
Sbjct: 722 LIQRIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRI 781

Query: 918 CGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW 959
           CGDCHSALTYISMII+HEIILRDSSRFHHFKKGSCSCR YW
Sbjct: 782 CGDCHSALTYISMIIEHEIILRDSSRFHHFKKGSCSCRGYW 822
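
The middle line of each alignment block records the per-column result: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. As a minimal sketch, assuming that standard BLAST midline convention, the Python below counts identities and positives for one block copied from the alignment above (query positions 438 to 497). The function name `score_midline` is illustrative and is not part of any BLAST tool.

```python
def score_midline(midline):
    """Return (identities, positives) for one BLAST alignment block."""
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    conservative = midline.count("+")                 # '+' marks a conservative substitution
    return identities, identities + conservative


# One 60-column block copied from the alignment above.
query   = "VVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHA"
midline = "VVTWSAVIAGY+QRGHG EALDVFRQMQ CG EPNVVTLVSLLSGCASVGALLHGKQTHA"
sbjct   = "VVTWSAVIAGYSQRGHGCEALDVFRQMQHCGLEPNVVTLVSLLSGCASVGALLHGKQTHA"

identities, positives = score_midline(midline)

# Cross-check: count the columns where query and subject carry the same residue.
direct = sum(q == s for q, s in zip(query, sbjct))

print(identities, positives, direct)
```

Summing these per-block counts over every block of an HSP should reproduce the numerators reported on its `Identity = ...` summary line.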

BLAST of HG10006016 vs. ExPASy TrEMBL
Match: A0A6J1KUL7 (pentatricopeptide repeat-containing protein At5g16860 OS=Cucurbita maxima OX=3661 GN=LOC111497759 PE=3 SV=1)

HSP 1 Score: 1501.5 bits (3886), Expect = 0.0e+00
Identity = 720/821 (87.70%), Positives = 778/821 (94.76%), Query Frame = 0

Query: 138 SLLRQCRTLINAKLAHQQIFVNGFTEMATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFW 197
           S L+QCRTLI+AKL HQQI VNGFT++ T+A+G YIEC AFA+AVSLL+RL+PSHS VFW
Sbjct: 2   SFLKQCRTLIDAKLVHQQILVNGFTDLVTHAIGGYIECNAFAQAVSLLERLVPSHSAVFW 61

Query: 198 WNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVC 257
           WNALIRRSVRLGFLDDAL FY QM+RLGW PD+YTFPFVLKACGE  SFR GASVHA+VC
Sbjct: 62  WNALIRRSVRLGFLDDALCFYRQMERLGWSPDYYTFPFVLKACGEKLSFRCGASVHAMVC 121

Query: 258 ANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQGGESRT 317
           A GFESNVFICNS+VAMYGRCGALDDARQVF+EVLERKI+DIVSWNSILAAYVQGGES+ 
Sbjct: 122 AYGFESNVFICNSVVAMYGRCGALDDARQVFDEVLERKIDDIVSWNSILAAYVQGGESKA 181

Query: 318 ALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNA 377
           ALR+A +MA HY+ KL PDAITLVN+LPACAS FA +HG+QVHGF+VRSGLVDDVFVGNA
Sbjct: 182 ALRIAFQMAKHYNFKLFPDAITLVNVLPACASTFATEHGRQVHGFAVRSGLVDDVFVGNA 241

Query: 378 LVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELN 437
           LVDMYAKCSKMNEANK+FE+MKEKDVVSWNA+VTGYSQIGSFD ALSLFKRMQEEDIELN
Sbjct: 242 LVDMYAKCSKMNEANKMFEQMKEKDVVSWNALVTGYSQIGSFDDALSLFKRMQEEDIELN 301

Query: 438 VVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHA 497
           VVTWSAVIAGY+QRGHG EALDVFRQMQ CG EPNVVTLVSLLSGCASVGALLHGKQTHA
Sbjct: 302 VVTWSAVIAGYSQRGHGCEALDVFRQMQNCGLEPNVVTLVSLLSGCASVGALLHGKQTHA 361

Query: 498 YAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGG 557
           YAIKNILNLDWSDPG D+MV NGLIDMYAKCKS +VAR+IFD I GKDKNVVTWTV+IGG
Sbjct: 362 YAIKNILNLDWSDPGDDMMVFNGLIDMYAKCKSSRVARSIFDSIIGKDKNVVTWTVMIGG 421

Query: 558 YAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYALRHENG 617
           YAQHGEANDA+ELF+Q+FKQETSLKPNAFTLSCALMACARLGALRLG+Q+HAYALRHEN 
Sbjct: 422 YAQHGEANDAIELFSQMFKQETSLKPNAFTLSCALMACARLGALRLGKQIHAYALRHENE 481

Query: 618 SEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQ 677
           SEVL+VANCLIDMYSKSGDIDAAQ VFDNMKV+NAVSWTSLMTGYG+HGRGEEAL VF+Q
Sbjct: 482 SEVLYVANCLIDMYSKSGDIDAAQIVFDNMKVQNAVSWTSLMTGYGIHGRGEEALRVFNQ 541

Query: 678 MRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAG 737
           MR+ G  +DGVTFLVVLYACSHSGMVDQGM+YFHGM+K FGV PGAEHYACMVDLLGRAG
Sbjct: 542 MREVGLSVDGVTFLVVLYACSHSGMVDQGMNYFHGMVKYFGVAPGAEHYACMVDLLGRAG 601

Query: 738 RFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGSYTLLSN 797
           R NEAMELIKSMPMEPT VVWVALLSASR HAN+ELGEYAAS+L+ESGAENDGSYTLLSN
Sbjct: 602 RLNEAMELIKSMPMEPTPVVWVALLSASRTHANVELGEYAASKLIESGAENDGSYTLLSN 661

Query: 798 LYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSE 857
           LYANARRWKDVARIR LMK+TGIKKRPGCSW+QGKK+TTTFFVGD+SHP+SDQIYN+L++
Sbjct: 662 LYANARRWKDVARIRRLMKHTGIKKRPGCSWVQGKKSTTTFFVGDKSHPQSDQIYNILAD 721

Query: 858 LIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRI 917
           LI++IKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRI
Sbjct: 722 LIQRIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRI 781

Query: 918 CGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW 959
           CGDCHSALTYISMII+HEIILRDSSRFHHFKKGSCSCR YW
Sbjct: 782 CGDCHSALTYISMIIEHEIILRDSSRFHHFKKGSCSCRGYW 822
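
Each HSP summary line reports identities and positives as a fraction of the alignment length plus a rounded percentage. As a hedged sketch, the Python below parses such a line and recomputes the percentages from the fractions. The helper name `parse_hsp_summary` and the regular expression are assumptions made for illustration, not part of the BLAST output specification.

```python
import re


def parse_hsp_summary(line):
    """Map each 'Name = hits/length (pct%)' field to (hits, length, pct)."""
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    return {
        name: (int(hits), int(length), float(pct))
        for name, hits, length, pct in re.findall(pattern, line)
    }


# Summary line for the Cucurbita maxima match (A0A6J1KUL7) shown above.
summary = "Identity = 720/821 (87.70%), Positives = 778/821 (94.76%), Query Frame = 0"
fields = parse_hsp_summary(summary)

# Recompute each percentage from its fraction and compare with the reported value.
recomputed = {
    name: round(100 * hits / length, 2)
    for name, (hits, length, _) in fields.items()
}
for name, (hits, length, reported) in fields.items():
    print(name, f"{hits}/{length}", reported, recomputed[name])
```

The `Query Frame = 0` field is skipped because it carries no fraction.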

BLAST of HG10006016 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 1034.2 bits (2673), Expect = 6.4e-302
Identity = 498/834 (59.71%), Positives = 630/834 (75.54%), Query Frame = 0

Query: 127 STTSPPTIPFTSLLRQCRTLINAKLAHQQIFVNGF--TEMATYAVGAYIECGAFAEAVSL 186
           ST++P   P    + +C+T+   KL HQ++   G     + ++ +  YI  G  + AVSL
Sbjct: 24  STSAPEITP--PFIHKCKTISQVKLIHQKLLSFGILTLNLTSHLISTYISVGCLSHAVSL 83

Query: 187 LQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIP 246
           L+R  PS + V+ WN+LIR     G  +  L  +  M  L W PD+YTFPFV KACGEI 
Sbjct: 84  LRRFPPSDAGVYHWNSLIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGEIS 143

Query: 247 SFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNS 306
           S R G S HA+    GF SNVF+ N++VAMY RC +L DAR+VF+E+    + D+VSWNS
Sbjct: 144 SVRCGESAHALSLVTGFISNVFVGNALVAMYSRCRSLSDARKVFDEM---SVWDVVSWNS 203

Query: 307 ILAAYVQGGESRTALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSV 366
           I+ +Y + G+ + AL +  RM N +    RPD ITLVN+LP CAS      GKQ+H F+V
Sbjct: 204 IIESYAKLGKPKVALEMFSRMTNEFG--CRPDNITLVNVLPPCASLGTHSLGKQLHCFAV 263

Query: 367 RSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALS 426
            S ++ ++FVGN LVDMYAKC  M+EAN VF  M  KDVVSWNAMV GYSQIG F+ A+ 
Sbjct: 264 TSEMIQNMFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVR 323

Query: 427 LFKRMQEEDIELNVVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCA 486
           LF++MQEE I+++VVTWSA I+GYAQRG G+EAL V RQM   G +PN VTL+S+LSGCA
Sbjct: 324 LFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCA 383

Query: 487 SVGALLHGKQTHAYAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGK 546
           SVGAL+HGK+ H YAIK  ++L  +  G + MV+N LIDMYAKCK    AR +FD ++ K
Sbjct: 384 SVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPK 443

Query: 547 DKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLG 606
           +++VVTWTV+IGGY+QHG+AN ALEL +++F+++   +PNAFT+SCAL+ACA L ALR+G
Sbjct: 444 ERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIG 503

Query: 607 RQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGM 666
           +Q+HAYALR++  +  LFV+NCLIDMY+K G I  A+ VFDNM  +N V+WTSLMTGYGM
Sbjct: 504 KQIHAYALRNQQNAVPLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGM 563

Query: 667 HGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAE 726
           HG GEEAL +F +MR+ GF +DGVT LVVLYACSHSGM+DQGM+YF+ M   FGV+PG E
Sbjct: 564 HGYGEEALGIFDEMRRIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPE 623

Query: 727 HYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLES 786
           HYAC+VDLLGRAGR N A+ LI+ MPMEP  VVWVA LS  RIH  +ELGEYAA ++ E 
Sbjct: 624 HYACLVDLLGRAGRLNAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITEL 683

Query: 787 GAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRS 846
            + +DGSYTLLSNLYANA RWKDV RIRSLM++ G+KKRPGCSW++G K TTTFFVGD++
Sbjct: 684 ASNHDGSYTLLSNLYANAGRWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFVGDKT 743

Query: 847 HPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSA 906
           HP + +IY +L + +++IKD+GYVP+T FALHDVDDEEK DLLFEHSEKLA+AYGILT+ 
Sbjct: 744 HPHAKEIYQVLLDHMQRIKDIGYVPETGFALHDVDDEEKDDLLFEHSEKLALAYGILTTP 803

Query: 907 PGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW 959
            G  IRI KNLR+CGDCH+A TY+S IIDH+IILRDSSRFHHFK GSCSC+ YW
Sbjct: 804 QGAAIRITKNLRVCGDCHTAFTYMSRIIDHDIILRDSSRFHHFKNGSCSCKGYW 850

BLAST of HG10006016 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 550.4 bits (1417), Expect = 2.8e-156
Identity = 302/827 (36.52%), Positives = 460/827 (55.62%), Query Frame = 0

Query: 138 SLLRQC---RTLINAKLAHQQIFVNGF---TEMATYAVGAYIECGAFAEAVSLLQRLIPS 197
           S+L+ C   ++L + K     I  NGF   + + +     Y  CG   EA  +   +   
Sbjct: 99  SVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEV--K 158

Query: 198 HSTVFWWNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIPSFRRGAS 257
                +WN L+    + G    ++G + +M   G   D YTF  V K+   + S   G  
Sbjct: 159 IEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQ 218

Query: 258 VHAIVCANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQ 317
           +H  +  +GF     + NS+VA Y +   +D AR+VF+E+ ER   D++SWNSI+  YV 
Sbjct: 219 LHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTER---DVISWNSIINGYVS 278

Query: 318 GGESRTALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDD 377
            G +   L V ++M     S +  D  T+V++   CA +     G+ VH   V++    +
Sbjct: 279 NGLAEKGLSVFVQM---LVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSRE 338

Query: 378 VFVGNALVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQE 437
               N L+DMY+KC  ++ A  VF  M ++ VVS+ +M+ GY++ G    A+ LF+ M+E
Sbjct: 339 DRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEE 398

Query: 438 EDIELNVVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLH 497
           E                                   G  P+V T+ ++L+ CA    L  
Sbjct: 399 E-----------------------------------GISPDVYTVTAVLNCCARYRLLDE 458

Query: 498 GKQTHAYAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTW 557
           GK+ H +  +N       D G D+ V N L+DMYAKC S + A  +F  +  KD  +++W
Sbjct: 459 GKRVHEWIKEN-------DLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKD--IISW 518

Query: 558 TVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYA 617
             +IGGY+++  AN+AL LF  +  +E    P+  T++C L ACA L A   GR++H Y 
Sbjct: 519 NTIIGGYSKNCYANEALSLF-NLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYI 578

Query: 618 LRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEA 677
           +R+   S+   VAN L+DMY+K G +  A  +FD++  ++ VSWT ++ GYGMHG G+EA
Sbjct: 579 MRNGYFSD-RHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEA 638

Query: 678 LHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVD 737
           + +F+QMRQAG   D ++F+ +LYACSHSG+VD+G  +F+ M     + P  EHYAC+VD
Sbjct: 639 IALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVD 698

Query: 738 LLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGS 797
           +L R G   +A   I++MP+ P A +W ALL   RIH +++L E  A ++ E   EN G 
Sbjct: 699 MLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGY 758

Query: 798 YTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQI 857
           Y L++N+YA A +W+ V R+R  +   G++K PGCSWI+ K     F  GD S+PE++ I
Sbjct: 759 YVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENI 818

Query: 858 YNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRI 917
              L ++  ++ + GY P T +AL D ++ EK + L  HSEKLA+A GI++S  G+ IR+
Sbjct: 819 EAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRV 871

Query: 918 NKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW 959
            KNLR+CGDCH    ++S +   EI+LRDS+RFH FK G CSCR +W
Sbjct: 879 TKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871

BLAST of HG10006016 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 541.6 bits (1394), Expect = 1.3e-153
Identity = 293/827 (35.43%), Positives = 461/827 (55.74%), Query Frame = 0

Query: 135 PFTSLLRQCRTLINAKLAHQQIFVNGFTE---MATYAVGAYIECGAFAEAVSLLQRLIPS 194
           P   LL +C +L   +     +F NG  +     T  V  +   G+  EA  + + +   
Sbjct: 39  PAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSK 98

Query: 195 HSTVFWWNALIRRSVRLGFLDDALGFYCQMQRLGWLPDHYTFPFVLKACGEIPSFRRGAS 254
            + ++  + +++   ++  LD AL F+ +M+     P  Y F ++LK CG+    R G  
Sbjct: 99  LNVLY--HTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKE 158

Query: 255 VHAIVCANGFESNVFICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQ 314
           +H ++  +GF  ++F    +  MY +C  +++AR+VF+ + ER   D+VSWN+I+A Y Q
Sbjct: 159 IHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPER---DLVSWNTIVAGYSQ 218

Query: 315 GGESRTALRVALRMANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDD 374
            G +R AL +   M       L+P  IT+V++LPA ++      GK++HG+++RSG    
Sbjct: 219 NGMARMALEMVKSMC---EENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSL 278

Query: 375 VFVGNALVDMYAKCSKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQE 434
           V +  ALVDMYAKC  +  A ++F+ M E++VVSWN+M+  Y Q  +   A+ +F++M +
Sbjct: 279 VNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLD 338

Query: 435 EDIELNVVTWSAVIAGYAQRGHGFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLH 494
           E                                   G +P  V+++  L  CA +G L  
Sbjct: 339 E-----------------------------------GVKPTDVSVMGALHACADLGDLER 398

Query: 495 GKQTHAYAIKNILNLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTW 554
           G+  H  +++  L LD      ++ V+N LI MY KCK    A ++F  +  + + +V+W
Sbjct: 399 GRFIHKLSVE--LGLD-----RNVSVVNSLISMYCKCKEVDTAASMFGKL--QSRTLVSW 458

Query: 555 TVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYA 614
             +I G+AQ+G   DAL  F+Q+  +  ++KP+ FT    + A A L      + +H   
Sbjct: 459 NAMILGFAQNGRPIDALNYFSQM--RSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVV 518

Query: 615 LRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEA 674
           +R      V FV   L+DMY+K G I  A+ +FD M  R+  +W +++ GYG HG G+ A
Sbjct: 519 MRSCLDKNV-FVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 578

Query: 675 LHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVD 734
           L +F +M++     +GVTFL V+ ACSHSG+V+ G+  F+ M + + +    +HY  MVD
Sbjct: 579 LELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVD 638

Query: 735 LLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGS 794
           LLGRAGR NEA + I  MP++P   V+ A+L A +IH N+   E AA RL E   ++ G 
Sbjct: 639 LLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGY 698

Query: 795 YTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQI 854
           + LL+N+Y  A  W+ V ++R  M   G++K PGCS ++ K    +FF G  +HP+S +I
Sbjct: 699 HVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKI 758

Query: 855 YNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRI 914
           Y  L +LI  IK+ GYVP T+  L  V+++ K  LL  HSEKLA+++G+L +  G  I +
Sbjct: 759 YAFLEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHV 809

Query: 915 NKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSCSCRSYW 959
            KNLR+C DCH+A  YIS++   EI++RD  RFHHFK G+CSC  YW
Sbjct: 819 RKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of HG10006016 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 536.2 bits (1380), Expect = 5.4e-152
Identity = 283/726 (38.98%), Positives = 426/726 (58.68%), Query Frame = 0

Query: 266 FICNSIVAMYGRCGALDDARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTALRVALRM 325
           F  N++++ Y + G +D   + F+++ +R   D VSW +++  Y   G+   A+RV   M
Sbjct: 81  FSWNTVLSAYSKRGDMDSTCEFFDQLPQR---DSVSWTTMIVGYKNIGQYHKAIRV---M 140

Query: 326 ANHYSSKLRPDAITLVNILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKC 385
            +     + P   TL N+L + A+    + GK+VH F V+ GL  +V V N+L++MYAKC
Sbjct: 141 GDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKC 200

Query: 386 SKMNEANKVFERMKEKDVVSWNAMVTGYSQIGSFDSALSLFKRMQEEDIELNVVTWSAVI 445
                A  VF+RM  +D+ SWNAM+  + Q+G  D A++ F++M E DI    VTW+++I
Sbjct: 201 GDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDI----VTWNSMI 260

Query: 446 AGYAQRGHGFEALDVFRQMQLCG-WEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNIL 505
           +G+ QRG+   ALD+F +M       P+  TL S+LS CA++  L  GKQ H++ +    
Sbjct: 261 SGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGF 320

Query: 506 NLDWSDPGGDLMVLNGLIDMYAKCKSYKVARNIFD------------------------- 565
           ++         +VLN LI MY++C   + AR + +                         
Sbjct: 321 DISG-------IVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDM 380

Query: 566 ------LIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQIFKQETSLKPNAFTLSCALM 625
                  ++ KD++VV WT +I GY QHG   +A+ LF  +       +PN++TL+  L 
Sbjct: 381 NQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMV--GGGQRPNSYTLAAMLS 440

Query: 626 ACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSKSGDIDAAQAVFDNMKV-RNA 685
             + L +L  G+Q+H  A++      V  V+N LI MY+K+G+I +A   FD ++  R+ 
Sbjct: 441 VASSLASLSHGKQIHGSAVKSGEIYSV-SVSNALITMYAKAGNITSASRAFDLIRCERDT 500

Query: 686 VSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVVLYACSHSGMVDQGMDYFHG 745
           VSWTS++     HG  EEAL +F  M   G   D +T++ V  AC+H+G+V+QG  YF  
Sbjct: 501 VSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDM 560

Query: 746 MIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELIKSMPMEPTAVVWVALLSASRIHANIE 805
           M     + P   HYACMVDL GRAG   EA E I+ MP+EP  V W +LLSA R+H NI+
Sbjct: 561 MKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNID 620

Query: 806 LGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIRSLMKNTGIKKRPGCSWIQGK 865
           LG+ AA RLL    EN G+Y+ L+NLY+   +W++ A+IR  MK+  +KK  G SWI+ K
Sbjct: 621 LGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVK 680

Query: 866 KTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTSFALHDVDDEEKGDLLFEHSE 925
                F V D +HPE ++IY  + ++  +IK MGYVP T+  LHD+++E K  +L  HSE
Sbjct: 681 HKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSE 740

Query: 926 KLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMIIDHEIILRDSSRFHHFKKGSC 959
           KLA+A+G++++     +RI KNLR+C DCH+A+ +IS ++  EII+RD++RFHHFK G C
Sbjct: 741 KLAIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFC 786

BLAST of HG10006016 vs. TAIR 10
Match: AT1G18485.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 534.3 bits (1375), Expect = 2.1e-151
Identity = 302/866 (34.87%), Positives = 465/866 (53.70%), Query Frame = 0

Query: 164 MATYAVGAYIECGAFAEAVSLLQRLIPSHSTVFWWNALIRRSVRLGFLDDALGFYCQM-Q 223
           + T  +  Y  CG+  ++  +   L      +F WNA+I    R    D+ L  + +M  
Sbjct: 122 LCTRIITMYAMCGSPDDSRFVFDAL--RSKNLFQWNAVISSYSRNELYDEVLETFIEMIS 181

Query: 224 RLGWLPDHYTFPFVLKACGEIPSFRRGASVHAIVCANGFESNVFICNSIVAMYGRCGALD 283
               LPDH+T+P V+KAC  +     G +VH +V   G   +VF+ N++V+ YG  G + 
Sbjct: 182 TTDLLPDHFTYPCVIKACAGMSDVGIGLAVHGLVVKTGLVEDVFVGNALVSFYGTHGFVT 241

Query: 284 DARQVFNEVLERKIEDIVSWNSILAAYVQGGESRTA-LRVALRMANHYSSKLRPDAITLV 343
           DA Q+F+ + ER   ++VSWNS++  +   G S  + L +   M  +      PD  TLV
Sbjct: 242 DALQLFDIMPER---NLVSWNSMIRVFSDNGFSEESFLLLGEMMEENGDGAFMPDVATLV 301

Query: 344 NILPACASAFAPQHGKQVHGFSVRSGLVDDVFVGNALVDMYAKCSKMNEANKVFERMKEK 403
            +LP CA       GK VHG++V+  L  ++ + NAL+DMY+KC  +  A  +F+    K
Sbjct: 302 TVLPVCAREREIGLGKGVHGWAVKLRLDKELVLNNALMDMYSKCGCITNAQMIFKMNNNK 361

Query: 404 DVVSWNAMVTGYSQIGSFDSALSLFKRMQE--EDIELNVVT------------------- 463
           +VVSWN MV G+S  G       + ++M    ED++ + VT                   
Sbjct: 362 NVVSWNTMVGGFSAEGDTHGTFDVLRQMLAGGEDVKADEVTILNAVPVCFHESFLPSLKE 421

Query: 464 -----------------------------------------------WSAVIAGYAQRGH 523
                                                          W+A+I G+AQ   
Sbjct: 422 LHCYSLKQEFVYNELVANAFVASYAKCGSLSYAQRVFHGIRSKTVNSWNALIGGHAQSND 481

Query: 524 GFEALDVFRQMQLCGWEPNVVTLVSLLSGCASVGALLHGKQTHAYAIKNILNLDWSDPGG 583
              +LD   QM++ G  P+  T+ SLLS C+ + +L  GK+ H + I+N L         
Sbjct: 482 PRLSLDAHLQMKISGLLPDSFTVCSLLSACSKLKSLRLGKEVHGFIIRNWLE-------R 541

Query: 584 DLMVLNGLIDMYAKCKSYKVARNIFDLIAGKDKNVVTWTVLIGGYAQHGEANDALELFAQ 643
           DL V   ++ +Y  C      + +FD  A +DK++V+W  +I GY Q+G  + AL +F Q
Sbjct: 542 DLFVYLSVLSLYIHCGELCTVQALFD--AMEDKSLVSWNTVITGYLQNGFPDRALGVFRQ 601

Query: 644 IFKQETSLKPNAFTLSCALMACARLGALRLGRQLHAYALRHENGSEVLFVANCLIDMYSK 703
           +      L     ++     AC+ L +LRLGR+ HAYAL+H    +  F+A  LIDMY+K
Sbjct: 602 MVLYGIQL--CGISMMPVFGACSLLPSLRLGREAHAYALKHLLEDDA-FIACSLIDMYAK 661

Query: 704 SGDIDAAQAVFDNMKVRNAVSWTSLMTGYGMHGRGEEALHVFHQMRQAGFVIDGVTFLVV 763
           +G I  +  VF+ +K ++  SW +++ GYG+HG  +EA+ +F +M++ G   D +TFL V
Sbjct: 662 NGSITQSSKVFNGLKEKSTASWNAMIMGYGIHGLAKEAIKLFEEMQRTGHNPDDLTFLGV 721

Query: 764 LYACSHSGMVDQGMDYFHGMIKCFGVTPGAEHYACMVDLLGRAGRFNEAMELI-KSMPME 823
           L AC+HSG++ +G+ Y   M   FG+ P  +HYAC++D+LGRAG+ ++A+ ++ + M  E
Sbjct: 722 LTACNHSGLIHEGLRYLDQMKSSFGLKPNLKHYACVIDMLGRAGQLDKALRVVAEEMSEE 781

Query: 824 PTAVVWVALLSASRIHANIELGEYAASRLLESGAENDGSYTLLSNLYANARRWKDVARIR 883
               +W +LLS+ RIH N+E+GE  A++L E   E   +Y LLSNLYA   +W+DV ++R
Sbjct: 782 ADVGIWKSLLSSCRIHQNLEMGEKVAAKLFELEPEKPENYVLLSNLYAGLGKWEDVRKVR 841

Query: 884 SLMKNTGIKKRPGCSWIQGKKTTTTFFVGDRSHPESDQIYNLLSELIKQIKDMGYVPQTS 943
             M    ++K  GCSWI+  +   +F VG+R     ++I +L S L  +I  MGY P T 
Sbjct: 842 QRMNEMSLRKDAGCSWIELNRKVFSFVVGERFLDGFEEIKSLWSILEMKISKMGYRPDTM 901

Query: 944 FALHDVDDEEKGDLLFEHSEKLAVAYGILTSAPGQPIRINKNLRICGDCHSALTYISMII 959
              HD+ +EEK + L  HSEKLA+ YG++ ++ G  IR+ KNLRIC DCH+A   IS ++
Sbjct: 902 SVQHDLSEEEKIEQLRGHSEKLALTYGLIKTSEGTTIRVYKNLRICVDCHNAAKLISKVM 961

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038889862.1  0.0e+00   92.63     pentatricopeptide repeat-containing protein At5g16860 [Benincasa hispida] >XP_03...
XP_008455181.1  0.0e+00   89.85     PREDICTED: pentatricopeptide repeat-containing protein At5g16860 [Cucumis melo] ...
XP_004137054.2  0.0e+00   89.03     pentatricopeptide repeat-containing protein At5g16860 [Cucumis sativus] >XP_0116...
XP_022143067.1  0.0e+00   88.77     pentatricopeptide repeat-containing protein At5g16860 isoform X1 [Momordica char...
KAA0031472.1    0.0e+00   89.75     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK06925...

Match Name      E-value   Identity  Description
Q9LFL5          8.9e-301  59.71     Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX...
Q9SN39          3.9e-155  36.52     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q3E6Q1          1.8e-152  35.43     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9SHZ8          7.6e-151  38.98     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q0WN60          2.9e-150  34.87     Pentatricopeptide repeat-containing protein At1g18485 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
A0A1S3C0G3      0.0e+00   89.85     pentatricopeptide repeat-containing protein At5g16860 OS=Cucumis melo OX=3656 GN...
A0A6J1CPQ5      0.0e+00   88.77     pentatricopeptide repeat-containing protein At5g16860 isoform X1 OS=Momordica ch...
A0A5A7SK77      0.0e+00   89.75     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1H912      0.0e+00   88.67     pentatricopeptide repeat-containing protein At5g16860 OS=Cucurbita moschata OX=3...
A0A6J1KUL7      0.0e+00   87.70     pentatricopeptide repeat-containing protein At5g16860 OS=Cucurbita maxima OX=366...

Match Name      E-value   Identity  Description
AT5G16860.1     6.4e-302  59.71     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1     2.8e-156  36.52     Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1     1.3e-153  35.43     Pentatricopeptide repeat (PPR) superfamily protein
AT2G22070.1     5.4e-152  38.98     pentatricopeptide (PPR) repeat-containing protein
AT1G18485.1     2.1e-151  34.87     Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 437..484  e-value: 1.5E-10  score: 41.1
    coord: 651..698  e-value: 2.1E-8  score: 34.2
    coord: 546..596  e-value: 1.3E-7  score: 31.7
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 258..310  e-value: 3.0E-4  score: 20.8
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 625..650  e-value: 6.2E-4  score: 19.8
    coord: 404..434  e-value: 1.5E-7  score: 31.1
    coord: 198..225  e-value: 0.0067  score: 16.6
    coord: 726..750  e-value: 0.014  score: 15.6
    coord: 376..403  e-value: 1.9E-4  score: 21.4
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 439..473  e-value: 9.7E-8  score: 29.7
    coord: 726..749  e-value: 9.0E-4  score: 17.3
    coord: 549..574  e-value: 5.1E-4  score: 18.0
    coord: 404..438  e-value: 4.8E-8  score: 30.7
    coord: 688..721  e-value: 0.0033  score: 15.5
    coord: 198..229  e-value: 1.9E-4  score: 19.4
    coord: 653..686  e-value: 2.2E-8  score: 31.8
    coord: 376..404  e-value: 8.2E-4  score: 17.4
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 651..685  score: 11.717688
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 264..298  score: 9.920034
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 402..436  score: 12.265752
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 194..228  score: 9.656963
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 371..401  score: 9.415814
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 437..471  score: 11.586152
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 547..581  score: 9.930995
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 246..361  e-value: 1.7E-17  score: 65.8
    coord: 509..620  e-value: 3.1E-17  score: 65.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 622..845  e-value: 4.3E-42  score: 146.5
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 376..502  e-value: 1.8E-35  score: 124.0
    coord: 121..245  e-value: 4.5E-10  score: 41.1
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 171..428
IPR032867  DYW domain  PFAM  PF14432  DYW_deaminase
    coord: 824..948  e-value: 3.2E-49  score: 166.0
None  No IPR available  PANTHER  PTHR47929  FAMILY NOT NAMED
    coord: 298..440
    coord: 331..872
    coord: 122..325
None  No IPR available  PANTHER  PTHR47929:SF20  BNACNNG07920D PROTEIN
    coord: 298..440
    coord: 331..872
    coord: 122..325
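
The `coord` values in the InterPro annotations give the start and end of each hit on the protein. As a minimal sketch, assuming the coordinates are 1-based and inclusive at both ends (the page does not state this), the Python below turns a coordinate string into a start, end, and length. The helper name `coord_span` is hypothetical.

```python
import re


def coord_span(text):
    """Parse 'coord: START..END' and return (start, end, length in residues)."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no START..END range found in {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    # Assumption: both endpoints are inclusive, so the length is end - start + 1.
    return start, end, end - start + 1


# Example: the DYW_deaminase (PF14432) hit listed in the InterPro annotations.
print(coord_span("coord: 824..948"))
```

Under that assumption the DYW_deaminase hit spans 125 residues near the C-terminal end of the aligned query, which ends at position 959.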

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10006016.1  HG10006016.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding