HG10004076 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004076
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Chr08: 13381746 .. 13383083 (+)
RNA-Seq Expression: HG10004076
Synteny: HG10004076
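
Note on the Location field: the sketch below shows how the span length can be derived from the coordinates. It assumes the coordinates are 1-based and inclusive (GFF-style), which this page does not state explicitly. `parse_location` and `feature_length` are hypothetical helper names for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr08: 13381746 .. 13383083 (+)' into parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def feature_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr08: 13381746 .. 13383083 (+)")
length = feature_length(start, end)
print(chrom, strand, length)  # prints: Chr08 + 1338
```

Under that assumption the span is 1338 bp, a multiple of three, which is consistent with a single uninterrupted coding region.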
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTCTACAAATATTCTACAAAGTTGTTGAATAAATGTGTGGTCTCATTTTGTAAATCTCAGCAAATGCAGAAAGCTGAAAATGTCATAAGAAACGGCATTCGGTTTGGAGTTCTACCTGATGTAGTAACTTACAATACTTTGCTTGATGGATATTGTCGATTTAGCGGCATGGATGCTGCGTATTCTGTTTTTAATAGAATGAGGGAAGCTGGTATCAGTCCAGATGTGATTACTTATAATTCTTTGATAGCTGGTGCAACCAGGAATTGTTCATTAGAACAATCCCTTAATCTGTTTGAAGAAATGCTTCAATCAGGTATCACACCTAATATACGGAGTTACAACACTTTAATGCACTGTTTCTTCAAACTAGGAAAGCCAGATGAAGCCATTAGAGTTTTTCAAGATATTATACTTAAAGACCTATATCCTGATCCGGCTACCTTTACTGTGATTATTGATGGCCTCTGCAAATTTGGATATACAAGTAATGCTATTATGTTATTCAGAAATTTACAGAGCCATGGATTTGTTCCTCAATTAGTTACATGTAACATTCTTATTAATGGACTTTGCAAGGTGGGTAGGTTGAAGGCTGCTTGGATGATGCTCAATGAGGCCATTGATTTGGGTTTTGAGCCTGATGCAACATACACAACTTTGATGAAAAGTTGCTTTAATTATAGGAAATATGAACATGGATTAGAGATTTTCTTTGCAATGAAAAACAAAGGATTTGCTTACTGCACTGCTATTAGTGCTTTTCTTAAGTTAGGTAGGTTTGAAGAGGCAAATTTTTGGATGGCACAGATGATAAAGAATGGAGAGGGAATTGATTTAGTTCTTTATAACACATTTCTTAATTTCTATTGTAAACAAGGTAAATTGGAGGCTGCATATAAGTTGTTGAATGATATGGAGTCACAAGGACTATGCAACCACTACACACATGCTATAATAATTGATGGGTTGTGCAGGGCTGGTAATATCGAGGGGGCTCGACGATATTGGAATTACATGTATATAGGAGGTTTTACGTCGAACTTGGTAGCGACGAATTGTCTAATTGATAGGTTATGTAAGGCTAGACAAATTGATCAAGCAATGAAATTGTTTGAATCAATGGAAACAAGGGATTCTGTTACCTACACTTCTTTGGTGCATAATCTTTGCAAGGCAAGGAGATTTAGTTGTGCGTCAAAGTTATTGCTTGCTTGCTTAAGAGGTGGTATGAAGATTCATAAGTCTATACAACGTGCAGTTATCAATGGTCTTTGTTCTTCCAGATTTACAAGTGAAGCAAGGAAGCTCCAAACAGAAATATGCTTGGCTTAG

mRNA sequence

ATGGTCTACAAATATTCTACAAAGTTGTTGAATAAATGTGTGGTCTCATTTTGTAAATCTCAGCAAATGCAGAAAGCTGAAAATGTCATAAGAAACGGCATTCGGTTTGGAGTTCTACCTGATGTAGTAACTTACAATACTTTGCTTGATGGATATTGTCGATTTAGCGGCATGGATGCTGCGTATTCTGTTTTTAATAGAATGAGGGAAGCTGGTATCAGTCCAGATGTGATTACTTATAATTCTTTGATAGCTGGTGCAACCAGGAATTGTTCATTAGAACAATCCCTTAATCTGTTTGAAGAAATGCTTCAATCAGGTATCACACCTAATATACGGAGTTACAACACTTTAATGCACTGTTTCTTCAAACTAGGAAAGCCAGATGAAGCCATTAGAGTTTTTCAAGATATTATACTTAAAGACCTATATCCTGATCCGGCTACCTTTACTGTGATTATTGATGGCCTCTGCAAATTTGGATATACAAGTAATGCTATTATGTTATTCAGAAATTTACAGAGCCATGGATTTGTTCCTCAATTAGTTACATGTAACATTCTTATTAATGGACTTTGCAAGGTGGGTAGGTTGAAGGCTGCTTGGATGATGCTCAATGAGGCCATTGATTTGGGTTTTGAGCCTGATGCAACATACACAACTTTGATGAAAAGTTGCTTTAATTATAGGAAATATGAACATGGATTAGAGATTTTCTTTGCAATGAAAAACAAAGGATTTGCTTACTGCACTGCTATTAGTGCTTTTCTTAAGTTAGGTAGGTTTGAAGAGGCAAATTTTTGGATGGCACAGATGATAAAGAATGGAGAGGGAATTGATTTAGTTCTTTATAACACATTTCTTAATTTCTATTGTAAACAAGGTAAATTGGAGGCTGCATATAAGTTGTTGAATGATATGGAGTCACAAGGACTATGCAACCACTACACACATGCTATAATAATTGATGGGTTGTGCAGGGCTGGTAATATCGAGGGGGCTCGACGATATTGGAATTACATGTATATAGGAGGTTTTACGTCGAACTTGGTAGCGACGAATTGTCTAATTGATAGGTTATGTAAGGCTAGACAAATTGATCAAGCAATGAAATTGTTTGAATCAATGGAAACAAGGGATTCTGTTACCTACACTTCTTTGGTGCATAATCTTTGCAAGGCAAGGAGATTTAGTTGTGCGTCAAAGTTATTGCTTGCTTGCTTAAGAGGTGGTATGAAGATTCATAAGTCTATACAACGTGCAGTTATCAATGGTCTTTGTTCTTCCAGATTTACAAGTGAAGCAAGGAAGCTCCAAACAGAAATATGCTTGGCTTAG

Coding sequence (CDS)

ATGGTCTACAAATATTCTACAAAGTTGTTGAATAAATGTGTGGTCTCATTTTGTAAATCTCAGCAAATGCAGAAAGCTGAAAATGTCATAAGAAACGGCATTCGGTTTGGAGTTCTACCTGATGTAGTAACTTACAATACTTTGCTTGATGGATATTGTCGATTTAGCGGCATGGATGCTGCGTATTCTGTTTTTAATAGAATGAGGGAAGCTGGTATCAGTCCAGATGTGATTACTTATAATTCTTTGATAGCTGGTGCAACCAGGAATTGTTCATTAGAACAATCCCTTAATCTGTTTGAAGAAATGCTTCAATCAGGTATCACACCTAATATACGGAGTTACAACACTTTAATGCACTGTTTCTTCAAACTAGGAAAGCCAGATGAAGCCATTAGAGTTTTTCAAGATATTATACTTAAAGACCTATATCCTGATCCGGCTACCTTTACTGTGATTATTGATGGCCTCTGCAAATTTGGATATACAAGTAATGCTATTATGTTATTCAGAAATTTACAGAGCCATGGATTTGTTCCTCAATTAGTTACATGTAACATTCTTATTAATGGACTTTGCAAGGTGGGTAGGTTGAAGGCTGCTTGGATGATGCTCAATGAGGCCATTGATTTGGGTTTTGAGCCTGATGCAACATACACAACTTTGATGAAAAGTTGCTTTAATTATAGGAAATATGAACATGGATTAGAGATTTTCTTTGCAATGAAAAACAAAGGATTTGCTTACTGCACTGCTATTAGTGCTTTTCTTAAGTTAGGTAGGTTTGAAGAGGCAAATTTTTGGATGGCACAGATGATAAAGAATGGAGAGGGAATTGATTTAGTTCTTTATAACACATTTCTTAATTTCTATTGTAAACAAGGTAAATTGGAGGCTGCATATAAGTTGTTGAATGATATGGAGTCACAAGGACTATGCAACCACTACACACATGCTATAATAATTGATGGGTTGTGCAGGGCTGGTAATATCGAGGGGGCTCGACGATATTGGAATTACATGTATATAGGAGGTTTTACGTCGAACTTGGTAGCGACGAATTGTCTAATTGATAGGTTATGTAAGGCTAGACAAATTGATCAAGCAATGAAATTGTTTGAATCAATGGAAACAAGGGATTCTGTTACCTACACTTCTTTGGTGCATAATCTTTGCAAGGCAAGGAGATTTAGTTGTGCGTCAAAGTTATTGCTTGCTTGCTTAAGAGGTGGTATGAAGATTCATAAGTCTATACAACGTGCAGTTATCAATGGTCTTTGTTCTTCCAGATTTACAAGTGAAGCAAGGAAGCTCCAAACAGAAATATGCTTGGCTTAG

Protein sequence

MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDATYTTLMKSCFNYRKYEHGLEIFFAMKNKGFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQGKLEAAYKLLNDMESQGLCNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHKSIQRAVINGLCSSRFTSEARKLQTEICLA
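
Note on the CDS and protein sequences: the sketch below shows how the protein can be derived from the CDS by translating codon by codon. It assumes the standard nuclear genetic code (NCBI translation table 1). `translate` is a hypothetical helper written for this note, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate an in-frame coding sequence; optionally stop at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGGTCTACAAATAT"))  # first five codons of the CDS: MVYKY
print(translate("TTGGCTTAG"))        # last three codons (TAG is the stop): LA
```

The two fragments reproduce the start (MVYKY) and end (LA) of the protein sequence listed above.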
Homology
BLAST of HG10004076 vs. NCBI nr
Match: XP_038887759.1 (putative pentatricopeptide repeat-containing protein At4g17915 [Benincasa hispida] >XP_038887760.1 putative pentatricopeptide repeat-containing protein At4g17915 [Benincasa hispida] >XP_038887761.1 putative pentatricopeptide repeat-containing protein At4g17915 [Benincasa hispida])

HSP 1 Score: 705.7 bits (1820), Expect = 2.6e-199
Identity = 356/452 (78.76%), Positives = 391/452 (86.50%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV KYSTKLLN CV SFCKSQQMQKAE VI + IR GVLPD+VTYNTL+DGYCRFSG+DA
Sbjct: 1   MVCKYSTKLLNICVASFCKSQQMQKAEIVIIDAIRIGVLPDLVTYNTLVDGYCRFSGIDA 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV +RMREAGI PDVITYNSLIAGATRNCSLE SL LF+EM+QSGI+P+I SYNTLMH
Sbjct: 61  AYSVLHRMREAGIRPDVITYNSLIAGATRNCSLEHSLKLFKEMIQSGISPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           CFF  GKPDEA RVFQ+IILKDL+P P TF  +I+GLCK GYTSNAIMLFRNLQ HGFVP
Sbjct: 121 CFFISGKPDEANRVFQEIILKDLHPHPVTFNTMINGLCKHGYTSNAIMLFRNLQHHGFVP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKVGRLKAAW MLNEA+D G EPDA TY TLMKSCF  RKYEHGLEIF
Sbjct: 181 QLVTYNILINGLCKVGRLKAAWRMLNEAMDSGLEPDAITYLTLMKSCFRRRKYEHGLEIF 240

Query: 241 FAMKNKG-----FAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
           F M NKG     FAYCT I AFLKLG FEEANFWM +MIKNG GIDLV YNTF+N YCK+
Sbjct: 241 FEMINKGFGFDAFAYCTVIGAFLKLGWFEEANFWMQRMIKNGMGIDLVSYNTFVNLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKL+AAYKLL+++ES+GL C+ YTHAI++DGLCR GNIEGAR++ NYMY  GFTSNLVA 
Sbjct: 301 GKLDAAYKLLDEIESKGLECDDYTHAIMVDGLCRDGNIEGARQHLNYMYTTGFTSNLVAL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLIDR CKA QIDQAMKLF SMETRDS TYTSLVHNLCKARRF  ASKLLL+ LRGGM+
Sbjct: 361 NCLIDRYCKAGQIDQAMKLFASMETRDSFTYTSLVHNLCKARRFRSASKLLLSGLRGGME 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           + ++ QRAVI+GLCSS FTSEARKL+T+I LA
Sbjct: 421 VLETTQRAVIDGLCSSGFTSEARKLRTKIRLA 452
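
Note on the HSP statistics: the percentages reported for an HSP are the match counts divided by the alignment length (gap columns included). The sketch below recomputes them for the hit above and counts identical columns in the final alignment row. `count_identities` and `percent` are hypothetical helper names, not BLAST functions.

```python
def count_identities(query, subject):
    """Count alignment columns where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

def percent(matches, alignment_length):
    """Express a match count as a percentage of the alignment length."""
    return round(100.0 * matches / alignment_length, 2)

# Final row of the alignment against XP_038887759.1.
last_query = "IHKSIQRAVINGLCSSRFTSEARKLQTEICLA"
last_sbjct = "VLETTQRAVIDGLCSSGFTSEARKLRTKIRLA"

print(count_identities(last_query, last_sbjct))  # 22 identical columns in this row
print(percent(356, 452))                         # 78.76
print(percent(391, 452))                         # 86.5
```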

BLAST of HG10004076 vs. NCBI nr
Match: XP_022147934.1 (putative pentatricopeptide repeat-containing protein At4g17915 [Momordica charantia] >XP_022147935.1 putative pentatricopeptide repeat-containing protein At4g17915 [Momordica charantia] >XP_022147936.1 putative pentatricopeptide repeat-containing protein At4g17915 [Momordica charantia] >XP_022147937.1 putative pentatricopeptide repeat-containing protein At4g17915 [Momordica charantia] >XP_022147938.1 putative pentatricopeptide repeat-containing protein At4g17915 [Momordica charantia])

HSP 1 Score: 701.8 bits (1810), Expect = 3.7e-198
Identity = 355/452 (78.54%), Positives = 389/452 (86.06%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV K+STK LN CV S+CKS+QMQKAE VI +GIR GVLPDVVTYNTLLDGYCRF GMDA
Sbjct: 1   MVCKFSTKFLNICVASYCKSRQMQKAEAVIIDGIRLGVLPDVVTYNTLLDGYCRFIGMDA 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV  RMREAGISPDVITYNSLIAGATRNCSLEQSL+LFEEMLQSGITP+I SYNTLMH
Sbjct: 61  AYSVIYRMREAGISPDVITYNSLIAGATRNCSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           CFF+LGKPDEA RVF+DIILKDL P P TF  +I+GLCK+GYTSNAIMLFRNLQ HGFVP
Sbjct: 121 CFFRLGKPDEANRVFKDIILKDLSPHPVTFNTMINGLCKYGYTSNAIMLFRNLQRHGFVP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKVGRLKAA  MLNEA D G EPDA TYTTLMKSC   R+Y+HG EIF
Sbjct: 181 QLVTYNILINGLCKVGRLKAARRMLNEARDSGLEPDAITYTTLMKSCLRSRQYKHGFEIF 240

Query: 241 FAMKNK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
           F MKNK     GFAYCT I AFLKLGRFEEAN  M QMI+N  GIDLV YNTF++ YCK+
Sbjct: 241 FEMKNKGYAFDGFAYCTVIGAFLKLGRFEEANVCMEQMIRNRMGIDLVFYNTFIHLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKL+AAYKLL+++ES+GL  ++YTH II DGLCRAGNI+GARR+ NYMY  G  SNLV  
Sbjct: 301 GKLDAAYKLLDEIESRGLEFDNYTHTIITDGLCRAGNIDGARRHLNYMYTTGLASNLVPL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLIDRLCKA QID AMKLFESMETRDS TYTSLVHNLCKARRF CASKLLL+C+RGGMK
Sbjct: 361 NCLIDRLCKAGQIDHAMKLFESMETRDSFTYTSLVHNLCKARRFRCASKLLLSCIRGGMK 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           + KS QRAVI+GLCSS FTSEARKL++++ LA
Sbjct: 421 VLKSTQRAVIDGLCSSGFTSEARKLKSKLHLA 452

BLAST of HG10004076 vs. NCBI nr
Match: XP_022973869.1 (putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 [Cucurbita maxima])

HSP 1 Score: 691.8 bits (1784), Expect = 3.9e-195
Identity = 349/452 (77.21%), Positives = 387/452 (85.62%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV KYSTK LN CV SFCKSQQMQKAE+VI +GIR GVLPDVVTYNTL+DGYCRFSGMDA
Sbjct: 1   MVSKYSTKFLNICVASFCKSQQMQKAEDVIIDGIRLGVLPDVVTYNTLIDGYCRFSGMDA 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV  RMREAGISPDVITYNSLIAGA+RN SLEQSL+LFEEMLQSGITP+I SYNTLMH
Sbjct: 61  AYSVLYRMREAGISPDVITYNSLIAGASRNRSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           C F+LGKPDEA R+F+DIILK L P P TF  +I+GLCK+GYTSNAIMLFRNLQ HGFVP
Sbjct: 121 CLFRLGKPDEANRIFKDIILKGLSPHPVTFNTMINGLCKYGYTSNAIMLFRNLQRHGFVP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKVGR+KAAW MLNEA+D G EP A TYTTLMKSCF  R+Y+HG EIF
Sbjct: 181 QLVTYNILINGLCKVGRMKAAWRMLNEAMDSGLEPGAVTYTTLMKSCFRCRQYKHGFEIF 240

Query: 241 FAMKNK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
           F MKNK     GFAYCT I AFLKLGRFEEAN  M QMI+NG GIDLV YNT +N YCK+
Sbjct: 241 FEMKNKGYAFDGFAYCTVIGAFLKLGRFEEANVCMEQMIRNGLGIDLVFYNTLINLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKLEAAYKLL+++ES GL  + YTH+II DGLCR GNIEGA R+ NYMY  GFTSNLVA 
Sbjct: 301 GKLEAAYKLLDELESLGLEYDDYTHSIITDGLCRNGNIEGAWRHLNYMYTTGFTSNLVAL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLIDRL KA QID AMKLFESME RDS+TYTSLVHNLCKARRF CASKLL++C++GGMK
Sbjct: 361 NCLIDRLGKAGQIDHAMKLFESMEIRDSITYTSLVHNLCKARRFRCASKLLISCIKGGMK 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           + KS QR VI+GL SS FTSEARK+++++ +A
Sbjct: 421 VLKSTQRTVIDGLRSSGFTSEARKVRSKLRMA 452

BLAST of HG10004076 vs. NCBI nr
Match: XP_023554660.1 (putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 687.6 bits (1773), Expect = 7.3e-194
Identity = 349/452 (77.21%), Positives = 386/452 (85.40%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV KYSTK LN CV SFCKSQQMQKAE+VI +GIR GVLPDVVTYNTL+DGYCRFSGMD 
Sbjct: 1   MVSKYSTKFLNICVASFCKSQQMQKAEDVIIDGIRLGVLPDVVTYNTLIDGYCRFSGMDT 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV  RMREAGISPDVITYNSLIAGA+RN SLEQSL+LFEEMLQSGITP+I SYNTLMH
Sbjct: 61  AYSVLYRMREAGISPDVITYNSLIAGASRNRSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           C F+LGKPDEA RVF+DIILKDL P P TF  +I+GLCK+GYTSNAIMLFRNLQ HGFVP
Sbjct: 121 CLFRLGKPDEANRVFKDIILKDLSPQPVTFNTMINGLCKYGYTSNAIMLFRNLQRHGFVP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKVGR+KAA  MLNEA+D G EPDA TYTTLMKSCF  R+Y+HG EIF
Sbjct: 181 QLVTYNILINGLCKVGRMKAARRMLNEAMDSGLEPDAVTYTTLMKSCFRCRQYKHGFEIF 240

Query: 241 FAMKNK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
           F MKNK     GFAYCT I AFLKLGRFEEAN  M QMI+NG GIDLV YNTF+N YCK+
Sbjct: 241 FEMKNKGYAFDGFAYCTVIGAFLKLGRFEEANVCMEQMIRNGLGIDLVFYNTFINLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKLEAAYKLL+++ESQGL  + YTH+II +GLC  GNIEGA R+ NYMY  GF SNLVA 
Sbjct: 301 GKLEAAYKLLDELESQGLEYDDYTHSIITNGLCWNGNIEGAWRHLNYMYTTGFASNLVAL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLIDRL KA QID AMKLFESME RDS TYTSLVHNLCKARRF CASKLL++C++GGMK
Sbjct: 361 NCLIDRLGKAGQIDHAMKLFESMEIRDSFTYTSLVHNLCKARRFRCASKLLISCIKGGMK 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           + KS QR VI+GL SS FTSEARK+++++ +A
Sbjct: 421 VLKSTQRTVIDGLRSSGFTSEARKVRSKLRMA 452

BLAST of HG10004076 vs. NCBI nr
Match: XP_031740560.1 (putative pentatricopeptide repeat-containing protein At4g17915 [Cucumis sativus] >KAE8649736.1 hypothetical protein Csa_012878 [Cucumis sativus])

HSP 1 Score: 675.6 bits (1742), Expect = 2.9e-190
Identity = 343/452 (75.88%), Positives = 380/452 (84.07%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV KYSTK LN CV SFCKSQQMQKAE VI +GIR GVLPDVVTYNTL+DGYCRFSGMDA
Sbjct: 1   MVCKYSTKFLNICVASFCKSQQMQKAEAVIIDGIRIGVLPDVVTYNTLIDGYCRFSGMDA 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV  RMREAGISPDVITYNSLIAGATRN SLEQSL+LFEEMLQSGITP+I SYNTLMH
Sbjct: 61  AYSVLYRMREAGISPDVITYNSLIAGATRNFSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           CFF LGKPDEA RVF+DIILKDL P P TF  +I+GLCK GYTSNAIMLFRNLQ HGF+P
Sbjct: 121 CFFILGKPDEAYRVFKDIILKDLSPHPVTFNTMINGLCKHGYTSNAIMLFRNLQRHGFIP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKV RL+AA  MLNEA+D G EP+A TYTTLMKSCF  R+YE G EIF
Sbjct: 181 QLVTYNILINGLCKVDRLRAAIRMLNEAMDSGLEPNAVTYTTLMKSCFRSRQYERGFEIF 240

Query: 241 FAMKNK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
             MKNK     GFAYCT   AFLKLGRFEEA F M QMIKN  GID+  YNTF+N YCK+
Sbjct: 241 SKMKNKGYAFDGFAYCTVSGAFLKLGRFEEAKFCMEQMIKNDVGIDITFYNTFINLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKLEAAYKL +++E +GL C+ YTH+II +GLCR GNIEGA ++ N +Y  GF SNLVA 
Sbjct: 301 GKLEAAYKLFDEIEPRGLECDVYTHSIITNGLCRVGNIEGAMQHLNCVYTTGFASNLVAL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLIDRLCKA QID+A++LFESMETRDS TYTSLVHNLCKARRF CASKLL++C R GMK
Sbjct: 361 NCLIDRLCKAGQIDRAIRLFESMETRDSFTYTSLVHNLCKARRFRCASKLLISCSRDGMK 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           + K+ +RAVI+GLCSS FTSEARKL+ ++ LA
Sbjct: 421 VLKATRRAVIDGLCSSGFTSEARKLKFKLRLA 452

BLAST of HG10004076 vs. ExPASy Swiss-Prot
Match: P0C043 (Putative pentatricopeptide repeat-containing protein At4g17915 OS=Arabidopsis thaliana OX=3702 GN=At4g17915 PE=3 SV=1)

HSP 1 Score: 434.9 bits (1117), Expect = 1.1e-120
Identity = 230/447 (51.45%), Positives = 303/447 (67.79%), Query Frame = 0

Query: 6   STKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVF 65
           ST+LLN CV S CK ++++KAE++I +GIR GV PDVVTYNTL+ GYCRF G++ AY+V 
Sbjct: 12  STRLLNICVDSLCKFRKLEKAESLIIDGIRLGVDPDVVTYNTLISGYCRFVGIEEAYAVT 71

Query: 66  NRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKL 125
            RMR+AGI PDV TYNSLIAGA R   L+  L LF+EML+ GI P++ SYNTLM C+FKL
Sbjct: 72  RRMRDAGIRPDVATYNSLIAGAARRLMLDHVLYLFDEMLEWGIYPDLWSYNTLMCCYFKL 131

Query: 126 GKPDEAIRV-FQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVT 185
           GK +EA RV ++D+ L  L P P T+ V++D LCK GY  NA+ LF+ +QS  F P+L+T
Sbjct: 132 GKHEEAFRVLYKDLQLAGLNPGPDTYNVLLDALCKCGYIDNALELFKEMQSR-FKPELMT 191

Query: 186 CNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIFFAMK 245
            NILINGLCK  R+  A  ML E    G+ P+A TYTT++K  F  R+   GL++F  MK
Sbjct: 192 YNILINGLCKSRRVGTAKWMLTELKKSGYTPNAVTYTTILKLYFKTRRIRRGLQLFLEMK 251

Query: 246 NK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQGKLE 305
            +     G+AY   +SA +K GR +EA  +M ++++ G   D+V YNT LN Y K G L+
Sbjct: 252 REGYTYDGYAYFAVVSALIKTGRTKEAYEYMQELVRKGRRHDIVSYNTLLNLYFKDGNLD 311

Query: 306 AAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCLI 365
           A   LL ++E +G+  + YTH II++GL R G    A  ++  M   G   NLV  NCL+
Sbjct: 312 AVDDLLGEIERRGMKADEYTHTIIVNGLLRTGQTRRAEEHFVSMGEMGIGLNLVTCNCLV 371

Query: 366 DRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHKS 425
           D LCKA  +D+AM+ FESME +D  TYTS+VHNLCK  RF CASKLLL+C   G+KI  S
Sbjct: 372 DGLCKAGHVDRAMRYFESMEVKDEYTYTSVVHNLCKDMRFVCASKLLLSCYNKGIKIPTS 431

Query: 426 IQRAVINGLCSSRFTSEARKLQTEICL 445
            +RAV++GL  S    EARK + E+ L
Sbjct: 432 ARRAVLSGLRMSGCYGEARKAKAEMKL 457

BLAST of HG10004076 vs. ExPASy Swiss-Prot
Match: Q56XR6 (Pentatricopeptide repeat-containing protein At5g46680 OS=Arabidopsis thaliana OX=3702 GN=At5g46680 PE=2 SV=2)

HSP 1 Score: 413.7 bits (1062), Expect = 2.7e-114
Identity = 221/446 (49.55%), Positives = 298/446 (66.82%), Query Frame = 0

Query: 6   STKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVF 65
           STKLLN  V S CK + +++AE ++ +GIR GVLPDV+TYNTL+ GY RF G+D AY+V 
Sbjct: 12  STKLLNISVNSLCKFRNLERAETLLIDGIRLGVLPDVITYNTLIKGYTRFIGIDEAYAVT 71

Query: 66  NRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKL 125
            RMREAGI PDV TYNSLI+GA +N  L + L LF+EML SG++P++ SYNTLM C+FKL
Sbjct: 72  RRMREAGIEPDVTTYNSLISGAAKNLMLNRVLQLFDEMLHSGLSPDMWSYNTLMSCYFKL 131

Query: 126 GKPDEAIRVF-QDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVT 185
           G+  EA ++  +DI L  L P   T+ +++D LCK G+T NAI LF++L+S    P+L+T
Sbjct: 132 GRHGEAFKILHEDIHLAGLVPGIDTYNILLDALCKSGHTDNAIELFKHLKSR-VKPELMT 191

Query: 186 CNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIFFAMK 245
            NILINGLCK  R+ +   M+ E    G+ P+A TYTT++K  F  ++ E GL++F  MK
Sbjct: 192 YNILINGLCKSRRVGSVDWMMRELKKSGYTPNAVTYTTMLKMYFKTKRIEKGLQLFLKMK 251

Query: 246 NK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNG-EGIDLVLYNTFLNFYCKQGKL 305
            +     GFA C  +SA +K GR EEA   M +++++G    D+V YNT LN Y K G L
Sbjct: 252 KEGYTFDGFANCAVVSALIKTGRAEEAYECMHELVRSGTRSQDIVSYNTLLNLYFKDGNL 311

Query: 306 EAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCL 365
           +A   LL ++E +GL  + YTH II++GL   GN  GA ++   +   G   ++V  NCL
Sbjct: 312 DAVDDLLEEIEMKGLKPDDYTHTIIVNGLLNIGNTGGAEKHLACIGEMGMQPSVVTCNCL 371

Query: 366 IDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHK 425
           ID LCKA  +D+AM+LF SME RD  TYTS+VHNLCK  R  CASKLLL+C   GMKI  
Sbjct: 372 IDGLCKAGHVDRAMRLFASMEVRDEFTYTSVVHNLCKDGRLVCASKLLLSCYNKGMKIPS 431

Query: 426 SIQRAVINGLCSSRFTSEARKLQTEI 443
           S +RAV++G+  +     ARK   +I
Sbjct: 432 SARRAVLSGIRETVSYQAARKTHIKI 456

BLAST of HG10004076 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 211.1 bits (536), Expect = 2.6e-53
Identity = 129/432 (29.86%), Positives = 205/432 (47.45%), Query Frame = 0

Query: 18  CKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVFNRMREAGISPDV 77
           C+  ++++A +++      G  PDV++Y+T+++GYCRF  +D  + +   M+  G+ P+ 
Sbjct: 257 CQLGRIKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNS 316

Query: 78  ITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKLGKPDEAIRVFQD 137
             Y S+I    R C L ++   F EM++ GI P+   Y TL+  F K G    A + F +
Sbjct: 317 YIYGSIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYE 376

Query: 138 IILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVTCNILINGLCKVGR 197
           +  +D+ PD  T+T II G C+ G    A  LF  +   G  P  VT   LING CK G 
Sbjct: 377 MHSRDITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGH 436

Query: 198 LKAAWMMLNEAIDLGFEPD-ATYTTLMKSCFNYRKYEHGLEIFFAMKNKG-----FAYCT 257
           +K A+ + N  I  G  P+  TYTTL+         +   E+   M   G     F Y +
Sbjct: 437 MKDAFRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNS 496

Query: 258 AISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQGKLEAAYKLLNDMESQG 317
            ++   K G  EEA   + +    G   D V Y T ++ YCK G+++ A ++L +M  +G
Sbjct: 497 IVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKG 556

Query: 318 L-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCLIDRLCKARQIDQAM 377
           L     T  ++++G C  G +E   +  N+M   G   N    N L+ + C    +  A 
Sbjct: 557 LQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAAT 616

Query: 378 KLFESMETR----DSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHKSIQRAVINGL 437
            +++ M +R    D  TY +LV   CKAR    A  L       G  +  S    +I G 
Sbjct: 617 AIYKDMCSRGVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGF 676

Query: 438 CSSRFTSEARKL 439
              +   EAR++
Sbjct: 677 LKRKKFLEAREV 688

BLAST of HG10004076 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 4.9e-52
Identity = 131/452 (28.98%), Positives = 225/452 (49.78%), Query Frame = 0

Query: 4   KYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYS 63
           K      N  + + C++ Q++ A  ++ +   +G++PD  T+ T++ GY     +D A  
Sbjct: 186 KPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALR 245

Query: 64  VFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEML-QSGITPNIRSYNTLMHCF 123
           +  +M E G S   ++ N ++ G  +   +E +LN  +EM  Q G  P+  ++NTL++  
Sbjct: 246 IREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGL 305

Query: 124 FKLGKPDEAIRVFQDIILKDLY-PDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQ 183
            K G    AI +  D++L++ Y PD  T+  +I GLCK G    A+ +   + +    P 
Sbjct: 306 CKAGHVKHAIEI-MDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPN 365

Query: 184 LVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPD-ATYTTLMKSCFNYRKYEHGLEIFF 243
            VT N LI+ LCK  +++ A  +       G  PD  T+ +L++     R +   +E+F 
Sbjct: 366 TVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFE 425

Query: 244 AMKNKG-----FAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQG 303
            M++KG     F Y   I +    G+ +EA   + QM  +G    ++ YNT ++ +CK  
Sbjct: 426 EMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKAN 485

Query: 304 KLEAAYKLLNDMESQGLC-NHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATN 363
           K   A ++ ++ME  G+  N  T+  +IDGLC++  +E A +  + M + G   +    N
Sbjct: 486 KTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYN 545

Query: 364 CLIDRLCKARQIDQAMKLFESMETR----DSVTYTSLVHNLCKARRFSCASKLLLACLRG 423
            L+   C+   I +A  + ++M +     D VTY +L+  LCKA R   ASKLL +    
Sbjct: 546 SLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMK 605

Query: 424 GMKIHKSIQRAVINGLCSSRFTSEARKLQTEI 443
           G+ +       VI GL   R T+EA  L  E+
Sbjct: 606 GINLTPHAYNPVIQGLFRKRKTTEAINLFREM 636

BLAST of HG10004076 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 6.4e-52
Identity = 122/444 (27.48%), Positives = 210/444 (47.30%), Query Frame = 0

Query: 11  NKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVFNRMRE 70
           N  + + CK+Q M KA  V+   ++ GV+PD +TYN++L GYC       A     +MR 
Sbjct: 235 NSIIAALCKAQAMDKAMEVLNTMVKNGVMPDCMTYNSILHGYCSSGQPKEAIGFLKKMRS 294

Query: 71  AGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKLGKPDE 130
            G+ PDV+TY+ L+    +N    ++  +F+ M + G+ P I +Y TL+  +   G   E
Sbjct: 295 DGVEPDVVTYSLLMDYLCKNGRCMEARKIFDSMTKRGLKPEITTYGTLLQGYATKGALVE 354

Query: 131 AIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVTCNILIN 190
              +   ++   ++PD   F+++I    K G    A+++F  ++  G  P  VT   +I 
Sbjct: 355 MHGLLDLMVRNGIHPDHYVFSILICAYAKQGKVDQAMLVFSKMRQQGLNPNAVTYGAVIG 414

Query: 191 GLCKVGRLKAAWMMLNEAIDLGFEP-DATYTTLMKSCFNYRKYEHGLEIFFAMKNKGFA- 250
            LCK GR++ A +   + ID G  P +  Y +L+       K+E   E+   M ++G   
Sbjct: 415 ILCKSGRVEDAMLYFEQMIDEGLSPGNIVYNSLIHGLCTCNKWERAEELILEMLDRGICL 474

Query: 251 ----YCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQGKLEAAYKLL 310
               + + I +  K GR  E+      M++ G   +++ YNT +N YC  GK++ A KLL
Sbjct: 475 NTIFFNSIIDSHCKEGRVIESEKLFELMVRIGVKPNVITYNTLINGYCLAGKMDEAMKLL 534

Query: 311 NDMESQGLCNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCLIDRLCKAR 370
           + M S GL                                    N V  + LI+  CK  
Sbjct: 535 SGMVSVGL----------------------------------KPNTVTYSTLINGYCKIS 594

Query: 371 QIDQAMKLFESMETR----DSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHKSIQR 430
           +++ A+ LF+ ME+     D +TY  ++  L + RR + A +L +     G +I  S   
Sbjct: 595 RMEDALVLFKEMESSGVSPDIITYNIILQGLFQTRRTAAAKELYVRITESGTQIELSTYN 644

Query: 431 AVINGLCSSRFTSEARKLQTEICL 445
            +++GLC ++ T +A ++   +CL
Sbjct: 655 IILHGLCKNKLTDDALQMFQNLCL 644

BLAST of HG10004076 vs. ExPASy TrEMBL
Match: A0A6J1D2P0 (putative pentatricopeptide repeat-containing protein At4g17915 OS=Momordica charantia OX=3673 GN=LOC111016743 PE=4 SV=1)

HSP 1 Score: 701.8 bits (1810), Expect = 1.8e-198
Identity = 355/452 (78.54%), Positives = 389/452 (86.06%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV K+STK LN CV S+CKS+QMQKAE VI +GIR GVLPDVVTYNTLLDGYCRF GMDA
Sbjct: 1   MVCKFSTKFLNICVASYCKSRQMQKAEAVIIDGIRLGVLPDVVTYNTLLDGYCRFIGMDA 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV  RMREAGISPDVITYNSLIAGATRNCSLEQSL+LFEEMLQSGITP+I SYNTLMH
Sbjct: 61  AYSVIYRMREAGISPDVITYNSLIAGATRNCSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           CFF+LGKPDEA RVF+DIILKDL P P TF  +I+GLCK+GYTSNAIMLFRNLQ HGFVP
Sbjct: 121 CFFRLGKPDEANRVFKDIILKDLSPHPVTFNTMINGLCKYGYTSNAIMLFRNLQRHGFVP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKVGRLKAA  MLNEA D G EPDA TYTTLMKSC   R+Y+HG EIF
Sbjct: 181 QLVTYNILINGLCKVGRLKAARRMLNEARDSGLEPDAITYTTLMKSCLRSRQYKHGFEIF 240

Query: 241 FAMKNK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
           F MKNK     GFAYCT I AFLKLGRFEEAN  M QMI+N  GIDLV YNTF++ YCK+
Sbjct: 241 FEMKNKGYAFDGFAYCTVIGAFLKLGRFEEANVCMEQMIRNRMGIDLVFYNTFIHLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKL+AAYKLL+++ES+GL  ++YTH II DGLCRAGNI+GARR+ NYMY  G  SNLV  
Sbjct: 301 GKLDAAYKLLDEIESRGLEFDNYTHTIITDGLCRAGNIDGARRHLNYMYTTGLASNLVPL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLIDRLCKA QID AMKLFESMETRDS TYTSLVHNLCKARRF CASKLLL+C+RGGMK
Sbjct: 361 NCLIDRLCKAGQIDHAMKLFESMETRDSFTYTSLVHNLCKARRFRCASKLLLSCIRGGMK 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           + KS QRAVI+GLCSS FTSEARKL++++ LA
Sbjct: 421 VLKSTQRAVIDGLCSSGFTSEARKLKSKLHLA 452

BLAST of HG10004076 vs. ExPASy TrEMBL
Match: A0A6J1IFW8 (putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472462 PE=4 SV=1)

HSP 1 Score: 691.8 bits (1784), Expect = 1.9e-195
Identity = 349/452 (77.21%), Positives = 387/452 (85.62%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV KYSTK LN CV SFCKSQQMQKAE+VI +GIR GVLPDVVTYNTL+DGYCRFSGMDA
Sbjct: 1   MVSKYSTKFLNICVASFCKSQQMQKAEDVIIDGIRLGVLPDVVTYNTLIDGYCRFSGMDA 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV  RMREAGISPDVITYNSLIAGA+RN SLEQSL+LFEEMLQSGITP+I SYNTLMH
Sbjct: 61  AYSVLYRMREAGISPDVITYNSLIAGASRNRSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           C F+LGKPDEA R+F+DIILK L P P TF  +I+GLCK+GYTSNAIMLFRNLQ HGFVP
Sbjct: 121 CLFRLGKPDEANRIFKDIILKGLSPHPVTFNTMINGLCKYGYTSNAIMLFRNLQRHGFVP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKVGR+KAAW MLNEA+D G EP A TYTTLMKSCF  R+Y+HG EIF
Sbjct: 181 QLVTYNILINGLCKVGRMKAAWRMLNEAMDSGLEPGAVTYTTLMKSCFRCRQYKHGFEIF 240

Query: 241 FAMKNK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
           F MKNK     GFAYCT I AFLKLGRFEEAN  M QMI+NG GIDLV YNT +N YCK+
Sbjct: 241 FEMKNKGYAFDGFAYCTVIGAFLKLGRFEEANVCMEQMIRNGLGIDLVFYNTLINLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKLEAAYKLL+++ES GL  + YTH+II DGLCR GNIEGA R+ NYMY  GFTSNLVA 
Sbjct: 301 GKLEAAYKLLDELESLGLEYDDYTHSIITDGLCRNGNIEGAWRHLNYMYTTGFTSNLVAL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLIDRL KA QID AMKLFESME RDS+TYTSLVHNLCKARRF CASKLL++C++GGMK
Sbjct: 361 NCLIDRLGKAGQIDHAMKLFESMEIRDSITYTSLVHNLCKARRFRCASKLLISCIKGGMK 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           + KS QR VI+GL SS FTSEARK+++++ +A
Sbjct: 421 VLKSTQRTVIDGLRSSGFTSEARKVRSKLRMA 452

BLAST of HG10004076 vs. ExPASy TrEMBL
Match: A0A1S3B4M5 (putative pentatricopeptide repeat-containing protein At4g17915 OS=Cucumis melo OX=3656 GN=LOC103486102 PE=4 SV=1)

HSP 1 Score: 672.2 bits (1733), Expect = 1.5e-189
Identity = 341/452 (75.44%), Positives = 381/452 (84.29%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV KYSTK LN CV SFCKSQQMQKAE VI +GIR GVLP+VVTYNTL+DGYCRFSGMDA
Sbjct: 1   MVCKYSTKFLNICVASFCKSQQMQKAEAVIIDGIRIGVLPNVVTYNTLIDGYCRFSGMDA 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV  RMREAGISPDVITYNSLIAGATRN SLEQSL+LFEEMLQSGITP+I SYNTLMH
Sbjct: 61  AYSVLYRMREAGISPDVITYNSLIAGATRNFSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           CFF LGKPDEA RVF+DIILKDL P P TF  +I+GLCK GYTSNA+MLFRNLQ HGF+P
Sbjct: 121 CFFILGKPDEAYRVFKDIILKDLSPHPVTFNTMINGLCKHGYTSNAVMLFRNLQRHGFIP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKV RL+AA  MLNEA+D G EP+A TYTTLMKSCF  R+YEHG EIF
Sbjct: 181 QLVTYNILINGLCKVSRLRAAIRMLNEAVDSGLEPNAVTYTTLMKSCFRSRQYEHGFEIF 240

Query: 241 FAMKNK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
             MK+K     GFAYCT I AFLKLGRFEEAN    QMIKN  GID+  YNT +N YCK+
Sbjct: 241 SKMKSKGYAFDGFAYCTVIGAFLKLGRFEEANSCTEQMIKNDVGIDMTFYNTLINLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKLEAAYKLL+ +ES+GL C+ YTH+II +GLCR GNIEGA ++ N +Y  GF SNLVA 
Sbjct: 301 GKLEAAYKLLDQIESRGLECDDYTHSIITNGLCRVGNIEGAMQHLNCVYTTGFASNLVAL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLIDRLCKA QID+A++LFESMETRDS TYTSLVHNLCKARRF CASKLL++C RGG+K
Sbjct: 361 NCLIDRLCKAGQIDRAIRLFESMETRDSFTYTSLVHNLCKARRFRCASKLLISCSRGGIK 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           I ++ +RAVI+GL SS FTSEARKL+ ++ LA
Sbjct: 421 ILRATRRAVIDGLYSSGFTSEARKLKFKLHLA 452

BLAST of HG10004076 vs. ExPASy TrEMBL
Match: A0A6J1EXR2 (putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439553 PE=4 SV=1)

HSP 1 Score: 670.6 bits (1729), Expect = 4.5e-189
Identity = 337/452 (74.56%), Positives = 382/452 (84.51%), Query Frame = 0

Query: 1   MVYKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDA 60
           MV KYSTK LN CV SFCKSQQMQKAE+VI +GIR GVLPDVVTYNTL+DGYCRFSG+D 
Sbjct: 1   MVSKYSTKFLNICVASFCKSQQMQKAEDVIIDGIRLGVLPDVVTYNTLIDGYCRFSGVDT 60

Query: 61  AYSVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMH 120
           AYSV  RMREAGISPDVITYNSLIAGA+RN SLEQSL+LFEEMLQSGITP+I SYNTLMH
Sbjct: 61  AYSVLYRMREAGISPDVITYNSLIAGASRNRSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFFKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVP 180
           C F+LGKPDEA R+F+DIILK L P P TF  +I+GLCK+GYTSNAIMLFRNLQ HGFVP
Sbjct: 121 CLFRLGKPDEANRIFKDIILKGLSPHPVTFNTMINGLCKYGYTSNAIMLFRNLQRHGFVP 180

Query: 181 QLVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIF 240
           QLVT NILINGLCKVGR++AA  MLNEA+D G EP+A TYTTLMKSCF  R+Y+HG EIF
Sbjct: 181 QLVTYNILINGLCKVGRMRAARRMLNEAMDSGLEPNAVTYTTLMKSCFRCRQYKHGFEIF 240

Query: 241 FAMKNK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQ 300
           F MKN+     GFAYCT I AFLKLGRFEEAN  M QMIKNG G DLV YNTF+N YCK+
Sbjct: 241 FEMKNRGYAFDGFAYCTVIGAFLKLGRFEEANVCMEQMIKNGLGFDLVFYNTFINLYCKE 300

Query: 301 GKLEAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVAT 360
           GKLEAAYK+L+++ESQGL  + YTH+II DGLCR GNIEGA R+ NYMY  GF SN VA 
Sbjct: 301 GKLEAAYKMLDELESQGLEYDDYTHSIITDGLCRNGNIEGAWRHLNYMYTTGFVSNSVAL 360

Query: 361 NCLIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMK 420
           NCLI+RL KA QID AMKLFESME RDS  YTSLVHNLCKARRF CAS+LL++C++GGMK
Sbjct: 361 NCLIERLGKAGQIDHAMKLFESMEIRDSFAYTSLVHNLCKARRFRCASRLLISCIKGGMK 420

Query: 421 IHKSIQRAVINGLCSSRFTSEARKLQTEICLA 446
           + KS +R VI+GL SS +TSEA K+++++ +A
Sbjct: 421 VLKSTRRTVIDGLISSGYTSEAGKVRSKLRMA 452
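
The percentages on each HSP statistics line are the match counts divided by the alignment length. As a minimal sketch (not part of the database output), the Python below parses one such line and recomputes both figures. The names `HSP_LINE`, `parse_hsp_stats` and `recomputed_percent` are hypothetical helpers introduced here for illustration; the numbers are taken from the A0A6J1EXR2 hit above.

```python
import re

# Statistics line for the A0A6J1EXR2 hit. The parser matches any word label,
# so the exact spelling of the labels does not matter.
HSP_LINE = "Identity = 337/452 (74.56%), Positives = 382/452 (84.51%), Query Frame = 0"


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)} for each ratio on the line."""
    pattern = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in pattern.findall(line)
    }


def recomputed_percent(matches, aligned_length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


stats = parse_hsp_stats(HSP_LINE)
for label, (matches, length, reported) in stats.items():
    # Print the label, the recomputed value and the value reported in the record.
    print(label, recomputed_percent(matches, length), reported)
```

The same check can be run on any other HSP line in this record by replacing `HSP_LINE`.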

BLAST of HG10004076 vs. ExPASy TrEMBL
Match: A0A0A0KYL3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G481190 PE=4 SV=1)

HSP 1 Score: 645.6 bits (1664), Expect = 1.5e-181
Identity = 327/432 (75.69%), Positives = 364/432 (84.26%), Query Frame = 0

Query: 21  QQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVFNRMREAGISPDVITY 80
           QQMQKAE VI +GIR GVLPDVVTYNTL+DGYCRFSGMDAAYSV  RMREAGISPDVITY
Sbjct: 11  QQMQKAEAVIIDGIRIGVLPDVVTYNTLIDGYCRFSGMDAAYSVLYRMREAGISPDVITY 70

Query: 81  NSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKLGKPDEAIRVFQDIIL 140
           NSLIAGATRN SLEQSL+LFEEMLQSGITP+I SYNTLMHCFF LGKPDEA RVF+DIIL
Sbjct: 71  NSLIAGATRNFSLEQSLDLFEEMLQSGITPDIWSYNTLMHCFFILGKPDEAYRVFKDIIL 130

Query: 141 KDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVTCNILINGLCKVGRLKA 200
           KDL P P TF  +I+GLCK GYTSNAIMLFRNLQ HGF+PQLVT NILINGLCKV RL+A
Sbjct: 131 KDLSPHPVTFNTMINGLCKHGYTSNAIMLFRNLQRHGFIPQLVTYNILINGLCKVDRLRA 190

Query: 201 AWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIFFAMKNK-----GFAYCTAIS 260
           A  MLNEA+D G EP+A TYTTLMKSCF  R+YE G EIF  MKNK     GFAYCT   
Sbjct: 191 AIRMLNEAMDSGLEPNAVTYTTLMKSCFRSRQYERGFEIFSKMKNKGYAFDGFAYCTVSG 250

Query: 261 AFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQGKLEAAYKLLNDMESQGL-C 320
           AFLKLGRFEEA F M QMIKN  GID+  YNTF+N YCK+GKLEAAYKL +++E +GL C
Sbjct: 251 AFLKLGRFEEAKFCMEQMIKNDVGIDITFYNTFINLYCKEGKLEAAYKLFDEIEPRGLEC 310

Query: 321 NHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCLIDRLCKARQIDQAMKLF 380
           + YTH+II +GLCR GNIEGA ++ N +Y  GF SNLVA NCLIDRLCKA QID+A++LF
Sbjct: 311 DVYTHSIITNGLCRVGNIEGAMQHLNCVYTTGFASNLVALNCLIDRLCKAGQIDRAIRLF 370

Query: 381 ESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHKSIQRAVINGLCSSRFTS 440
           ESMETRDS TYTSLVHNLCKARRF CASKLL++C R GMK+ K+ +RAVI+GLCSS FTS
Sbjct: 371 ESMETRDSFTYTSLVHNLCKARRFRCASKLLISCSRDGMKVLKATRRAVIDGLCSSGFTS 430

Query: 441 EARKLQTEICLA 446
           EARKL+ ++ LA
Sbjct: 431 EARKLKFKLRLA 442

BLAST of HG10004076 vs. TAIR 10
Match: AT5G46680.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 413.7 bits (1062), Expect = 1.9e-115
Identity = 221/446 (49.55%), Positives = 298/446 (66.82%), Query Frame = 0

Query: 6   STKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVF 65
           STKLLN  V S CK + +++AE ++ +GIR GVLPDV+TYNTL+ GY RF G+D AY+V 
Sbjct: 12  STKLLNISVNSLCKFRNLERAETLLIDGIRLGVLPDVITYNTLIKGYTRFIGIDEAYAVT 71

Query: 66  NRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKL 125
            RMREAGI PDV TYNSLI+GA +N  L + L LF+EML SG++P++ SYNTLM C+FKL
Sbjct: 72  RRMREAGIEPDVTTYNSLISGAAKNLMLNRVLQLFDEMLHSGLSPDMWSYNTLMSCYFKL 131

Query: 126 GKPDEAIRVF-QDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVT 185
           G+  EA ++  +DI L  L P   T+ +++D LCK G+T NAI LF++L+S    P+L+T
Sbjct: 132 GRHGEAFKILHEDIHLAGLVPGIDTYNILLDALCKSGHTDNAIELFKHLKSR-VKPELMT 191

Query: 186 CNILINGLCKVGRLKAAWMMLNEAIDLGFEPDA-TYTTLMKSCFNYRKYEHGLEIFFAMK 245
            NILINGLCK  R+ +   M+ E    G+ P+A TYTT++K  F  ++ E GL++F  MK
Sbjct: 192 YNILINGLCKSRRVGSVDWMMRELKKSGYTPNAVTYTTMLKMYFKTKRIEKGLQLFLKMK 251

Query: 246 NK-----GFAYCTAISAFLKLGRFEEANFWMAQMIKNG-EGIDLVLYNTFLNFYCKQGKL 305
            +     GFA C  +SA +K GR EEA   M +++++G    D+V YNT LN Y K G L
Sbjct: 252 KEGYTFDGFANCAVVSALIKTGRAEEAYECMHELVRSGTRSQDIVSYNTLLNLYFKDGNL 311

Query: 306 EAAYKLLNDMESQGL-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCL 365
           +A   LL ++E +GL  + YTH II++GL   GN  GA ++   +   G   ++V  NCL
Sbjct: 312 DAVDDLLEEIEMKGLKPDDYTHTIIVNGLLNIGNTGGAEKHLACIGEMGMQPSVVTCNCL 371

Query: 366 IDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHK 425
           ID LCKA  +D+AM+LF SME RD  TYTS+VHNLCK  R  CASKLLL+C   GMKI  
Sbjct: 372 IDGLCKAGHVDRAMRLFASMEVRDEFTYTSVVHNLCKDGRLVCASKLLLSCYNKGMKIPS 431

Query: 426 SIQRAVINGLCSSRFTSEARKLQTEI 443
           S +RAV++G+  +     ARK   +I
Sbjct: 432 SARRAVLSGIRETVSYQAARKTHIKI 456

BLAST of HG10004076 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 211.1 bits (536), Expect = 1.9e-54
Identity = 129/432 (29.86%), Positives = 205/432 (47.45%), Query Frame = 0

Query: 18  CKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVFNRMREAGISPDV 77
           C+  ++++A +++      G  PDV++Y+T+++GYCRF  +D  + +   M+  G+ P+ 
Sbjct: 257 CQLGRIKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNS 316

Query: 78  ITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKLGKPDEAIRVFQD 137
             Y S+I    R C L ++   F EM++ GI P+   Y TL+  F K G    A + F +
Sbjct: 317 YIYGSIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYE 376

Query: 138 IILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVTCNILINGLCKVGR 197
           +  +D+ PD  T+T II G C+ G    A  LF  +   G  P  VT   LING CK G 
Sbjct: 377 MHSRDITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGH 436

Query: 198 LKAAWMMLNEAIDLGFEPD-ATYTTLMKSCFNYRKYEHGLEIFFAMKNKG-----FAYCT 257
           +K A+ + N  I  G  P+  TYTTL+         +   E+   M   G     F Y +
Sbjct: 437 MKDAFRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNS 496

Query: 258 AISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQGKLEAAYKLLNDMESQG 317
            ++   K G  EEA   + +    G   D V Y T ++ YCK G+++ A ++L +M  +G
Sbjct: 497 IVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKG 556

Query: 318 L-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCLIDRLCKARQIDQAM 377
           L     T  ++++G C  G +E   +  N+M   G   N    N L+ + C    +  A 
Sbjct: 557 LQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAAT 616

Query: 378 KLFESMETR----DSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHKSIQRAVINGL 437
            +++ M +R    D  TY +LV   CKAR    A  L       G  +  S    +I G 
Sbjct: 617 AIYKDMCSRGVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGF 676

Query: 438 CSSRFTSEARKL 439
              +   EAR++
Sbjct: 677 LKRKKFLEAREV 688

BLAST of HG10004076 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 211.1 bits (536), Expect = 1.9e-54
Identity = 129/432 (29.86%), Positives = 205/432 (47.45%), Query Frame = 0

Query: 18  CKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYSVFNRMREAGISPDV 77
           C+  ++++A +++      G  PDV++Y+T+++GYCRF  +D  + +   M+  G+ P+ 
Sbjct: 257 CQLGRIKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNS 316

Query: 78  ITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCFFKLGKPDEAIRVFQD 137
             Y S+I    R C L ++   F EM++ GI P+   Y TL+  F K G    A + F +
Sbjct: 317 YIYGSIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYE 376

Query: 138 IILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQLVTCNILINGLCKVGR 197
           +  +D+ PD  T+T II G C+ G    A  LF  +   G  P  VT   LING CK G 
Sbjct: 377 MHSRDITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGH 436

Query: 198 LKAAWMMLNEAIDLGFEPD-ATYTTLMKSCFNYRKYEHGLEIFFAMKNKG-----FAYCT 257
           +K A+ + N  I  G  P+  TYTTL+         +   E+   M   G     F Y +
Sbjct: 437 MKDAFRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNS 496

Query: 258 AISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQGKLEAAYKLLNDMESQG 317
            ++   K G  EEA   + +    G   D V Y T ++ YCK G+++ A ++L +M  +G
Sbjct: 497 IVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKG 556

Query: 318 L-CNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNCLIDRLCKARQIDQAM 377
           L     T  ++++G C  G +E   +  N+M   G   N    N L+ + C    +  A 
Sbjct: 557 LQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAAT 616

Query: 378 KLFESMETR----DSVTYTSLVHNLCKARRFSCASKLLLACLRGGMKIHKSIQRAVINGL 437
            +++ M +R    D  TY +LV   CKAR    A  L       G  +  S    +I G 
Sbjct: 617 AIYKDMCSRGVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGF 676

Query: 438 CSSRFTSEARKL 439
              +   EAR++
Sbjct: 677 LKRKKFLEAREV 688

BLAST of HG10004076 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 206.8 bits (525), Expect = 3.5e-53
Identity = 131/452 (28.98%), Positives = 225/452 (49.78%), Query Frame = 0

Query: 4   KYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAYS 63
           K      N  + + C++ Q++ A  ++ +   +G++PD  T+ T++ GY     +D A  
Sbjct: 186 KPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALR 245

Query: 64  VFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEML-QSGITPNIRSYNTLMHCF 123
           +  +M E G S   ++ N ++ G  +   +E +LN  +EM  Q G  P+  ++NTL++  
Sbjct: 246 IREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGL 305

Query: 124 FKLGKPDEAIRVFQDIILKDLY-PDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQ 183
            K G    AI +  D++L++ Y PD  T+  +I GLCK G    A+ +   + +    P 
Sbjct: 306 CKAGHVKHAIEI-MDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPN 365

Query: 184 LVTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPD-ATYTTLMKSCFNYRKYEHGLEIFF 243
            VT N LI+ LCK  +++ A  +       G  PD  T+ +L++     R +   +E+F 
Sbjct: 366 TVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFE 425

Query: 244 AMKNKG-----FAYCTAISAFLKLGRFEEANFWMAQMIKNGEGIDLVLYNTFLNFYCKQG 303
            M++KG     F Y   I +    G+ +EA   + QM  +G    ++ YNT ++ +CK  
Sbjct: 426 EMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKAN 485

Query: 304 KLEAAYKLLNDMESQGLC-NHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATN 363
           K   A ++ ++ME  G+  N  T+  +IDGLC++  +E A +  + M + G   +    N
Sbjct: 486 KTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYN 545

Query: 364 CLIDRLCKARQIDQAMKLFESMETR----DSVTYTSLVHNLCKARRFSCASKLLLACLRG 423
            L+   C+   I +A  + ++M +     D VTY +L+  LCKA R   ASKLL +    
Sbjct: 546 SLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMK 605

Query: 424 GMKIHKSIQRAVINGLCSSRFTSEARKLQTEI 443
           G+ +       VI GL   R T+EA  L  E+
Sbjct: 606 GINLTPHAYNPVIQGLFRKRKTTEAINLFREM 636

BLAST of HG10004076 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 206.1 bits (523), Expect = 6.0e-53
Identity = 120/400 (30.00%), Positives = 204/400 (51.00%), Query Frame = 0

Query: 3   YKYSTKLLNKCVVSFCKSQQMQKAENVIRNGIRFGVLPDVVTYNTLLDGYCRFSGMDAAY 62
           Y+ +T   N  +       +  +A  +I   +  G  PD+VTY  +++G C+    D A+
Sbjct: 182 YQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAF 241

Query: 63  SVFNRMREAGISPDVITYNSLIAGATRNCSLEQSLNLFEEMLQSGITPNIRSYNTLMHCF 122
           ++ N+M +  + P V+ YN++I G  +   ++ +LNLF+EM   GI PN+ +Y++L+ C 
Sbjct: 242 NLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCL 301

Query: 123 FKLGKPDEAIRVFQDIILKDLYPDPATFTVIIDGLCKFGYTSNAIMLFRNLQSHGFVPQL 182
              G+  +A R+  D+I + + PD  TF+ +ID   K G    A  L+  +      P +
Sbjct: 302 CNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSI 361

Query: 183 VTCNILINGLCKVGRLKAAWMMLNEAIDLGFEPD-ATYTTLMKSCFNYRKYEHGLEIFFA 242
           VT + LING C   RL  A  M    +     PD  TY TL+K    Y++ E G+E+F  
Sbjct: 362 VTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFRE 421

Query: 243 MKNKGFAYCTAISAFLKLGRFEEANFWMAQ-----MIKNGEGIDLVLYNTFLNFYCKQGK 302
           M  +G    T     L  G F+  +  MAQ     M+ +G   +++ YNT L+  CK GK
Sbjct: 422 MSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGK 481

Query: 303 LEAAYKLLNDME-SQGLCNHYTHAIIIDGLCRAGNIEGARRYWNYMYIGGFTSNLVATNC 362
           LE A  +   ++ S+     YT+ I+I+G+C+AG +E     +  + + G   ++VA N 
Sbjct: 482 LEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNT 541

Query: 363 LIDRLCKARQIDQAMKLFESMETRDSVTYTSLVHNLCKAR 396
           +I   C+    ++A  LF+ M+   ++  +   + L +AR
Sbjct: 542 MISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRAR 581

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_038887759.1	2.6e-199	78.76	putative pentatricopeptide repeat-containing protein At4g17915 [Benincasa hispid...
XP_022147934.1	3.7e-198	78.54	putative pentatricopeptide repeat-containing protein At4g17915 [Momordica charan...
XP_022973869.1	3.9e-195	77.21	putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 [Cucur...
XP_023554660.1	7.3e-194	77.21	putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 [Cucur...
XP_031740560.1	2.9e-190	75.88	putative pentatricopeptide repeat-containing protein At4g17915 [Cucumis sativus]...

Match Name	E-value	Identity (%)	Description
P0C043	1.1e-120	51.45	Putative pentatricopeptide repeat-containing protein At4g17915 OS=Arabidopsis th...
Q56XR6	2.7e-114	49.55	Pentatricopeptide repeat-containing protein At5g46680 OS=Arabidopsis thaliana OX...
Q0WVK7	2.6e-53	29.86	Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
Q9LFF1	4.9e-52	28.98	Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Q76C99	6.4e-52	27.48	Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...

Match Name	E-value	Identity (%)	Description
A0A6J1D2P0	1.8e-198	78.54	putative pentatricopeptide repeat-containing protein At4g17915 OS=Momordica char...
A0A6J1IFW8	1.9e-195	77.21	putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 OS=Cuc...
A0A1S3B4M5	1.5e-189	75.44	putative pentatricopeptide repeat-containing protein At4g17915 OS=Cucumis melo O...
A0A6J1EXR2	4.5e-189	74.56	putative pentatricopeptide repeat-containing protein At4g17915 isoform X1 OS=Cuc...
A0A0A0KYL3	1.5e-181	75.69	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G481190 PE=4 SV=1

Match Name	E-value	Identity (%)	Description
AT5G46680.1	1.9e-115	49.55	Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.1	1.9e-54	29.86	Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.2	1.9e-54	29.86	Pentatricopeptide repeat (PPR-like) superfamily protein
AT3G53700.1	3.5e-53	28.98	Pentatricopeptide repeat (PPR) superfamily protein
AT1G62670.1	6.0e-53	30.00	rna processing factor 2
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Coordinates	E-value	Score
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	248..276	0.0089	16.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	78..112	5.7E-7	27.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	282..312	7.4E-8	30.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	218..247	4.0E-5	21.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	149..180	6.7E-5	20.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	248..278	0.0012	16.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	43..77	2.1E-9	35.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	316..345	8.5E-6	23.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	114..146	1.3E-8	32.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	183..217	2.0E-9	35.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	354..380	8.2E-6	23.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	40..86	2.5E-16	59.6
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	349..393	1.1E-9	38.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	280..327	2.3E-14	53.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	110..159	4.6E-14	52.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	180..227	1.1E-10	41.5
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	41..75	-	13.723605
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	280..314	-	11.421732
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	146..180	-	11.114816
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	76..110	-	12.057487
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	111..145	-	11.465577
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	181..215	-	11.202506
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	349..383	-	9.656963
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	1..90	1.5E-21	78.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	311..379	2.2E-13	52.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	247..310	2.6E-11	45.4
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	380..443	1.3E-6	30.0
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	91..246	4.5E-39	136.6
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	244..384	-	-
None	No IPR available	PANTHER	PTHR47932:SF33	OS02G0793200 PROTEIN	2..444	-	-
None	No IPR available	PANTHER	PTHR47932	ATPASE EXPRESSION PROTEIN 3	2..444	-	-
None	No IPR available	SUPERFAMILY	81901	HCP-like	52..244	-	-
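
The InterPro coordinates above can be summarised by merging the per-repeat intervals into contiguous blocks. The sketch below does this for the seven PROSITE PS51375 (PPR) matches only, assuming the coordinates are 1-based and inclusive. The names `PROSITE_PPR`, `merge_intervals` and `covered_residues` are hypothetical helpers introduced for illustration and are not part of the InterPro output.

```python
# PROSITE PS51375 (PPR) match coordinates from the InterPro table,
# in the order listed there; assumed to be 1-based and inclusive.
PROSITE_PPR = [
    (41, 75), (280, 314), (146, 180), (76, 110),
    (111, 145), (181, 215), (349, 383),
]


def merge_intervals(intervals):
    """Merge inclusive residue intervals that overlap or are directly adjacent."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous block instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(intervals):
    """Count residues covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


blocks = merge_intervals(PROSITE_PPR)
print(blocks)                         # contiguous PPR blocks
print(covered_residues(PROSITE_PPR))  # residues inside any PROSITE PPR match
```

For these coordinates the seven 35-residue matches collapse into three blocks (41..215, 280..314 and 349..383) covering 245 residues in total.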

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10004076.1	HG10004076.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding