HG10001495 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10001495
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr09: 17585726 .. 17587759 (-)
RNA-Seq ExpressionHG10001495
SyntenyHG10001495
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTCTCTCTTGCCGGCGGTGCTGTATTTCTCAGGGCTCCGAACAGATTTCCGGCCATCCTTCGTGGGTTTTGTTCCTCGTCCAATGAATATATTGAGCTCATCGAGACGTGTGGACGCGGTCGAGAATTGAACTTCGGGAGGTTAATTCACGCTCGATTGATCATCAATGGACTAGCCCGTTTAACACATTTTGCTGCGAAGATCATAGCATTCTACGCCGCATGTGGCAAAATCAAGGATGCACGGACATTGTTCGACAAAATTCCCCAAACGAATCCCCGCCGGTGGATTGTTTTGATTGGGGCATATTCTCGTTGTGGGTATTACACAGAAGCTCTGAGTGTATTTTGTGAGCTGCAGAGAGAAGGATTGAGGCCTAGCGAGTACATCATTCCCAGTGTCTTGAAAGCATGTGGCCATCTCTCTGATAACACTACAGGAAGAAAATTACATACCTTAATCCTCAAACACTCGCTTGAATCCGATGCTTATGTATGCAGCGCATTGATAGATATGTATGCAAAAAATGGACAGGTTGAGAAAGCTCGGCGAGTGTTTGAATCAATGGCTGGGAAGGATTTGGTGGCATTGAATGCAATGGTTTCAGGGTATGCCCACCATGGATTGGCTGAGGAAGCTTTGAATCTGGTGGAGGAGATGCAAGTATTGGGTGTAAAACCCAACTTGGTGACTTGGAACACTTTGGTTACAGGGTTTTCTCAGATGGGTGAAGAAGAGATTGTTCATGAGCTTTTCAAAGAGATGGAAGCCAATGGGATACAACCAGATGTAGTATCTTGGACATCTGTGATATCTGGGTTTGTACAGAACTTTAGAAATGAGGAGGCTTTTAGTACGTTTAGAAGGATGTTGAATGCTGGGTTCTGTCCAACTTCTGCTACAATCAGTAGTCTTTTGCCTGCTTGCGCATCCGTGGGGAACGGGCGGTGTGGGAAGGAGATCCATGGACATTCTCTGGTTCTTGGAGTTGAAAAGGATGTGTATGTGAGAACTGCATTGGTTGATATGTATGCAAAATGTGGATATTTTTATGAAGCAAGGATATTATTTTGGAGGATGTCTGAAAGGAATTCAGCAACTTGGAACTCCATGATTTTTGGCTATGCAAATCATGGATATTGCAATGAAGCAATAGAGCTTTTCCATCAGATGAAGGATGATGATGAGAATAAACTTGATCATTTGACATTTACAGCAGTTCTCACTGCTTGTGGCCATGCTGGGATGGTTGATTTGGGAAGGAGTTTGTTCCAATTGATGCAAAGTAAATATGGTATTATTCCTAGAGTTGAGCATTATGCCTGTATGGTCGACGTGTTTGGTCGAGCGGGGAAGCTGGCTGAGGCTTATGATTTGATCAAGACGATGCCAGTTGAGCCGGATTTATATGTATGGGGAGCTTTATTAGGTGCATGTAGGAAGCATGGAGAAATAGAACTTGCTGAGGAAGCAGCCAAACATTTATCAGAATTGGAACCTGGAAGTATAGGGAACAGTTTACTATTGTCTGATCTGTATGCTAATGCAGGGAGTTGGGGACATGTTGTGAAGCTGAAAAGGATGATGAAGAAGAGGAAGCTGAAGAAATTTCCTGGGTGCAGTTGGATAGAGACAGCTTAGCAAAATCTTTCTTCAAAACAAGAGATTTTTCTAAAAGTATTCTACTAGAAGAAAGGGTATATTATGGCTGCAAGCTGCCAAGTTCTTCTAGTTGTTTATCTTTCTTGAGAGCTCCAAAGAGTTCATGGTTGTCAAAATGCTTCAAAAATGATTCTCTCAACAGAATTTCAGCAAAATTAAGCTGAGAGTAGGTTGTAGTTCATTAGAGATAACCCAATAAACAAAATCTAATGTACTCAAAATTTTCATTAAGATTCTAATTACTAACTTCAGTTTGTAATTCCAGTCAAACAGAGAAAGAGAAAGAGAAAGGGAGAGAGTAATCATGCTCCATTCAAGATCAAGATGAAATCTTCTCAAACTTCATTTATCATTTCGAGTTTGGATTAG

mRNA sequence

ATGCTCTCTCTTGCCGGCGGTGCTGTATTTCTCAGGGCTCCGAACAGATTTCCGGCCATCCTTCGTGGGTTTTGTTCCTCGTCCAATGAATATATTGAGCTCATCGAGACGTGTGGACGCGGTCGAGAATTGAACTTCGGGAGGTTAATTCACGCTCGATTGATCATCAATGGACTAGCCCGTTTAACACATTTTGCTGCGAAGATCATAGCATTCTACGCCGCATGTGGCAAAATCAAGGATGCACGGACATTGTTCGACAAAATTCCCCAAACGAATCCCCGCCGGTGGATTGTTTTGATTGGGGCATATTCTCGTTGTGGGTATTACACAGAAGCTCTGAGTGTATTTTGTGAGCTGCAGAGAGAAGGATTGAGGCCTAGCGAGTACATCATTCCCAGTGTCTTGAAAGCATGTGGCCATCTCTCTGATAACACTACAGGAAGAAAATTACATACCTTAATCCTCAAACACTCGCTTGAATCCGATGCTTATGTATGCAGCGCATTGATAGATATGTATGCAAAAAATGGACAGGTTGAGAAAGCTCGGCGAGTGTTTGAATCAATGGCTGGGAAGGATTTGGTGGCATTGAATGCAATGGTTTCAGGGTATGCCCACCATGGATTGGCTGAGGAAGCTTTGAATCTGGTGGAGGAGATGCAAGTATTGGGTGTAAAACCCAACTTGGTGACTTGGAACACTTTGGTTACAGGGTTTTCTCAGATGGGTGAAGAAGAGATTGTTCATGAGCTTTTCAAAGAGATGGAAGCCAATGGGATACAACCAGATGTAGTATCTTGGACATCTGTGATATCTGGGTTTGTACAGAACTTTAGAAATGAGGAGGCTTTTAGTACGTTTAGAAGGATGTTGAATGCTGGGTTCTGTCCAACTTCTGCTACAATCAGTAGTCTTTTGCCTGCTTGCGCATCCGTGGGGAACGGGCGGTGTGGGAAGGAGATCCATGGACATTCTCTGGTTCTTGGAGTTGAAAAGGATGTGTATTTTGTAATTCCAGTCAAACAGAGAAAGAGAAAGAGAAAGGGAGAGAGTAATCATGCTCCATTCAAGATCAAGATGAAATCTTCTCAAACTTCATTTATCATTTCGAGTTTGGATTAG

Coding sequence (CDS)

ATGCTCTCTCTTGCCGGCGGTGCTGTATTTCTCAGGGCTCCGAACAGATTTCCGGCCATCCTTCGTGGGTTTTGTTCCTCGTCCAATGAATATATTGAGCTCATCGAGACGTGTGGACGCGGTCGAGAATTGAACTTCGGGAGGTTAATTCACGCTCGATTGATCATCAATGGACTAGCCCGTTTAACACATTTTGCTGCGAAGATCATAGCATTCTACGCCGCATGTGGCAAAATCAAGGATGCACGGACATTGTTCGACAAAATTCCCCAAACGAATCCCCGCCGGTGGATTGTTTTGATTGGGGCATATTCTCGTTGTGGGTATTACACAGAAGCTCTGAGTGTATTTTGTGAGCTGCAGAGAGAAGGATTGAGGCCTAGCGAGTACATCATTCCCAGTGTCTTGAAAGCATGTGGCCATCTCTCTGATAACACTACAGGAAGAAAATTACATACCTTAATCCTCAAACACTCGCTTGAATCCGATGCTTATGTATGCAGCGCATTGATAGATATGTATGCAAAAAATGGACAGGTTGAGAAAGCTCGGCGAGTGTTTGAATCAATGGCTGGGAAGGATTTGGTGGCATTGAATGCAATGGTTTCAGGGTATGCCCACCATGGATTGGCTGAGGAAGCTTTGAATCTGGTGGAGGAGATGCAAGTATTGGGTGTAAAACCCAACTTGGTGACTTGGAACACTTTGGTTACAGGGTTTTCTCAGATGGGTGAAGAAGAGATTGTTCATGAGCTTTTCAAAGAGATGGAAGCCAATGGGATACAACCAGATGTAGTATCTTGGACATCTGTGATATCTGGGTTTGTACAGAACTTTAGAAATGAGGAGGCTTTTAGTACGTTTAGAAGGATGTTGAATGCTGGGTTCTGTCCAACTTCTGCTACAATCAGTAGTCTTTTGCCTGCTTGCGCATCCGTGGGGAACGGGCGGTGTGGGAAGGAGATCCATGGACATTCTCTGGTTCTTGGAGTTGAAAAGGATGTGTATTTTGTAATTCCAGTCAAACAGAGAAAGAGAAAGAGAAAGGGAGAGAGTAATCATGCTCCATTCAAGATCAAGATGAAATCTTCTCAAACTTCATTTATCATTTCGAGTTTGGATTAG

Protein sequence

MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLARLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCELQREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQVEKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGFSQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTSATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVYFVIPVKQRKRKRKGESNHAPFKIKMKSSQTSFIISSLD
Homology
BLAST of HG10001495 vs. NCBI nr
Match: XP_038901081.1 (pentatricopeptide repeat-containing protein At5g59600-like [Benincasa hispida])

HSP 1 Score: 644.8 bits (1662), Expect = 4.5e-181
Identity = 317/336 (94.35%), Postives = 327/336 (97.32%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSLAGGAVFLRA NRFPAI RGFCS SNEYIELIETCGR RELNFG+L+HARLIINGLA
Sbjct: 1   MLSLAGGAVFLRALNRFPAISRGFCSLSNEYIELIETCGRDRELNFGKLLHARLIINGLA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK IAFYAACG+IKDAR LFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL
Sbjct: 61  RLTHFAAKFIAFYAACGRIKDARILFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR GLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILK+SLESDAYVCSALIDMYAK+G++
Sbjct: 121 QRGGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKYSLESDAYVCSALIDMYAKSGEI 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQMGEEE+VHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAF TFRRMLNAGFCPTS
Sbjct: 241 SQMGEEEMVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFGTFRRMLNAGFCPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           +TISSLLPACASVGNGR GKEIHGHS+VLGVEKDVY
Sbjct: 301 STISSLLPACASVGNGRRGKEIHGHSMVLGVEKDVY 336

BLAST of HG10001495 vs. NCBI nr
Match: XP_008464099.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g59600 [Cucumis melo] >KAA0061874.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK15382.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 641.0 bits (1652), Expect = 6.6e-180
Identity = 316/336 (94.05%), Postives = 325/336 (96.73%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSLAGGAVFLRAPNR PAILRGFCSSS+EYIELIETCGR R+LNFGRL+HARLIING A
Sbjct: 1   MLSLAGGAVFLRAPNRLPAILRGFCSSSDEYIELIETCGRNRDLNFGRLLHARLIINGSA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK IAFYAACGKIKDAR LFDKIPQTNPRRWIVLIGAYSRCGYY EALSVF EL
Sbjct: 61  RLTHFAAKFIAFYAACGKIKDARVLFDKIPQTNPRRWIVLIGAYSRCGYYPEALSVFGEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR GLRPSEYIIPSVLKACGHLS+ TTGRKLHTLILKHSLESDAYVCSALIDMYAK+G+V
Sbjct: 121 QRGGLRPSEYIIPSVLKACGHLSEKTTGRKLHTLILKHSLESDAYVCSALIDMYAKSGEV 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLG+KPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGIKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQMGEEE+VHELFKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAGF PTS
Sbjct: 241 SQMGEEEMVHELFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFDTFRRMLNAGFHPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY
Sbjct: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 336

BLAST of HG10001495 vs. NCBI nr
Match: XP_004143130.1 (pentatricopeptide repeat-containing protein At5g59600 [Cucumis sativus])

HSP 1 Score: 628.6 bits (1620), Expect = 3.4e-176
Identity = 309/336 (91.96%), Postives = 322/336 (95.83%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSL GGAVFLRAPNRFPAILRGFCSSS+ YIELIETCGR R+LNFGR +HARLII+G A
Sbjct: 1   MLSLTGGAVFLRAPNRFPAILRGFCSSSDGYIELIETCGRNRDLNFGRSLHARLIIDGSA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK IAFYAACGKIKDAR LFDKIP+TNPRRWIVLIGAYSRCGYY EALSVFCEL
Sbjct: 61  RLTHFAAKFIAFYAACGKIKDARILFDKIPRTNPRRWIVLIGAYSRCGYYPEALSVFCEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR GLRPSEYIIPSVLKACGHLS+ TTGRKLHTLILK+SLESDAYVCSALIDMYAK+G+V
Sbjct: 121 QRGGLRPSEYIIPSVLKACGHLSEKTTGRKLHTLILKNSLESDAYVCSALIDMYAKSGEV 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLG+KPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGIKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQ+GEEE+V ELFKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAGF PTS
Sbjct: 241 SQIGEEEMVRELFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFDTFRRMLNAGFHPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           ATISSLLPACASVGNGRCGKEIHGHSL LGVEKDVY
Sbjct: 301 ATISSLLPACASVGNGRCGKEIHGHSLALGVEKDVY 336

BLAST of HG10001495 vs. NCBI nr
Match: XP_031743085.1 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g59600-like [Cucumis sativus])

HSP 1 Score: 625.5 bits (1612), Expect = 2.9e-175
Identity = 307/336 (91.37%), Postives = 322/336 (95.83%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSL GGAVFLRAPNRFPAILRGFCSSS+ YIELIETCGR R+LNFGR +HARLII+G A
Sbjct: 1   MLSLTGGAVFLRAPNRFPAILRGFCSSSDGYIELIETCGRNRDLNFGRSLHARLIIDGSA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK IAFYAACGKIKDAR LFDKIP+TNPRRWIVLIGAYSRCGYY EALSVFCEL
Sbjct: 61  RLTHFAAKFIAFYAACGKIKDARILFDKIPRTNPRRWIVLIGAYSRCGYYPEALSVFCEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR GLRPSEYIIPSVLKACGHLS+ TTGRKLHTLILK+SLESDAYVCSALIDMYAK+G+V
Sbjct: 121 QRGGLRPSEYIIPSVLKACGHLSEKTTGRKLHTLILKNSLESDAYVCSALIDMYAKSGEV 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLG+KPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGIKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQ+GE+++V ELFKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAGF PTS
Sbjct: 241 SQIGEKKMVRELFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFDTFRRMLNAGFHPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           ATISSLLPACASVGNGRCGKEIHGHSL LGVEKDVY
Sbjct: 301 ATISSLLPACASVGNGRCGKEIHGHSLALGVEKDVY 336

BLAST of HG10001495 vs. NCBI nr
Match: XP_023511811.1 (pentatricopeptide repeat-containing protein At5g59600 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 613.6 bits (1581), Expect = 1.1e-171
Identity = 305/336 (90.77%), Postives = 319/336 (94.94%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSLAGGA FL+A NRFPAI RGFCSSS+EYI+LIET GR RELNFGRL+HARLIINGLA
Sbjct: 1   MLSLAGGAAFLKATNRFPAIRRGFCSSSDEYIKLIETYGRDRELNFGRLLHARLIINGLA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK+IAFYAACGKI DAR +FDKIPQTNPRRWIVLIGAYSR G+YTEALSVFCEL
Sbjct: 61  RLTHFAAKLIAFYAACGKISDAREVFDKIPQTNPRRWIVLIGAYSRYGFYTEALSVFCEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR+G RPSEYIIPSVLKACGHLSD  TGRKLH LILK+SLESDAYVCSALIDMYAK+GQV
Sbjct: 121 QRQGSRPSEYIIPSVLKACGHLSDIPTGRKLHALILKYSLESDAYVCSALIDMYAKSGQV 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVE+MQVLGVKPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEKMQVLGVKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQM EEE+VHELFKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAG  PTS
Sbjct: 241 SQMDEEEMVHELFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFGTFRRMLNAGLWPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           ATISSLLPACASVGNGR GKEIHGH+LVLGVEKDVY
Sbjct: 301 ATISSLLPACASVGNGRRGKEIHGHALVLGVEKDVY 336

BLAST of HG10001495 vs. ExPASy Swiss-Prot
Match: Q9FGR2 (Pentatricopeptide repeat-containing protein At5g59600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E1 PE=2 SV=1)

HSP 1 Score: 328.6 bits (841), Expect = 9.5e-89
Identity = 156/306 (50.98%), Postives = 223/306 (72.88%), Query Frame = 0

Query: 27  SSNEYIELIETCGRGRELNFGRLIHARLIINGLARLTHFAAKIIAFYAACGKIKDARTLF 86
           S   Y+ELIE  GR R    GR++HA L+ +G+ARLT  AAK++ FY  CGK+ DAR +F
Sbjct: 15  SIGSYVELIEANGRDRLFCRGRVLHAHLVTSGIARLTRIAAKLVTFYVECGKVLDARKVF 74

Query: 87  DKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCELQREGLRPSEYIIPSVLKACGHLSDNT 146
           D++P+ +    +V+IGA +R GYY E+L  F E+ ++GL+   +I+PS+LKA  +L D  
Sbjct: 75  DEMPKRDISGCVVMIGACARNGYYQESLDFFREMYKDGLKLDAFIVPSLLKASRNLLDRE 134

Query: 147 TGRKLHTLILKHSLESDAYVCSALIDMYAKNGQVEKARRVFESMAGKDLVALNAMVSGYA 206
            G+ +H L+LK S ESDA++ S+LIDMY+K G+V  AR+VF  +  +DLV  NAM+SGYA
Sbjct: 135 FGKMIHCLVLKFSYESDAFIVSSLIDMYSKFGEVGNARKVFSDLGEQDLVVFNAMISGYA 194

Query: 207 HHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGFSQMGEEEIVHELFKEMEANGIQPDVV 266
           ++  A+EALNLV++M++LG+KP+++TWN L++GFS M  EE V E+ + M  +G +PDVV
Sbjct: 195 NNSQADEALNLVKDMKLLGIKPDVITWNALISGFSHMRNEEKVSEILELMCLDGYKPDVV 254

Query: 267 SWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTSATISSLLPACASVGNGRCGKEIHGHS 326
           SWTS+ISG V NF+NE+AF  F++ML  G  P SATI +LLPAC ++   + GKEIHG+S
Sbjct: 255 SWTSIISGLVHNFQNEKAFDAFKQMLTHGLYPNSATIITLLPACTTLAYMKHGKEIHGYS 314

Query: 327 LVLGVE 333
           +V G+E
Sbjct: 315 VVTGLE 320

BLAST of HG10001495 vs. ExPASy Swiss-Prot
Match: Q9FXH1 (Pentatricopeptide repeat-containing protein At1g19720 OS=Arabidopsis thaliana OX=3702 GN=DYW7 PE=2 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 4.4e-54
Identity = 104/307 (33.88%), Postives = 181/307 (58.96%), Query Frame = 0

Query: 31  YIELIETCGRGRELNFGRLIHARLIINGLARL--THFAAKIIAFYAACGKIKDARTLFDK 90
           Y++L+E+C     ++ GR++HAR    GL          K+++ YA CG I DAR +FD 
Sbjct: 84  YLKLLESCIDSGSIHLGRILHARF---GLFTEPDVFVETKLLSMYAKCGCIADARKVFDS 143

Query: 91  IPQTNPRRWIVLIGAYSRCGYYTEALSVFCELQREGLRPSEYIIPSVLKACGHLSDNTTG 150
           + + N   W  +IGAYSR   + E   +F  + ++G+ P +++ P +L+ C +  D   G
Sbjct: 144 MRERNLFTWSAMIGAYSRENRWREVAKLFRLMMKDGVLPDDFLFPKILQGCANCGDVEAG 203

Query: 151 RKLHTLILKHSLESDAYVCSALIDMYAKNGQVEKARRVFESMAGKDLVALNAMVSGYAHH 210
           + +H++++K  + S   V ++++ +YAK G+++ A + F  M  +D++A N+++  Y  +
Sbjct: 204 KVIHSVVIKLGMSSCLRVSNSILAVYAKCGELDFATKFFRRMRERDVIAWNSVLLAYCQN 263

Query: 211 GLAEEALNLVEEMQVLGVKPNLVTWNTLVTGFSQMGEEEIVHELFKEMEANGIQPDVVSW 270
           G  EEA+ LV+EM+  G+ P LVTWN L+ G++Q+G+ +   +L ++ME  GI  DV +W
Sbjct: 264 GKHEEAVELVKEMEKEGISPGLVTWNILIGGYNQLGKCDAAMDLMQKMETFGITADVFTW 323

Query: 271 TSVISGFVQNFRNEEAFSTFRRMLNAGFCPTSATISSLLPACASVGNGRCGKEIHGHSLV 330
           T++ISG + N    +A   FR+M  AG  P + TI S + AC+ +     G E+H  ++ 
Sbjct: 324 TAMISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSAVSACSCLKVINQGSEVHSIAVK 383

Query: 331 LGVEKDV 336
           +G   DV
Sbjct: 384 MGFIDDV 387

BLAST of HG10001495 vs. ExPASy Swiss-Prot
Match: Q9FM64 (Pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR21 PE=2 SV=1)

HSP 1 Score: 179.1 bits (453), Expect = 9.3e-44
Identity = 91/280 (32.50%), Postives = 162/280 (57.86%), Query Frame = 0

Query: 47  GRLIHARLIINGLARLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSR 106
           G+  HA  I+NG+         ++ FY   G I+ A  +FD++ + +   W ++I  Y +
Sbjct: 293 GKQSHAIAIVNGMELDNILGTSLLNFYCKVGLIEYAEMVFDRMFEKDVVTWNLIISGYVQ 352

Query: 107 CGYYTEALSVFCELQR-EGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAY 166
            G   +A+   C+L R E L+     + +++ A     +   G+++    ++HS ESD  
Sbjct: 353 QGLVEDAI-YMCQLMRLEKLKYDCVTLATLMSAAARTENLKLGKEVQCYCIRHSFESDIV 412

Query: 167 VCSALIDMYAKNGQVEKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLG 226
           + S ++DMYAK G +  A++VF+S   KDL+  N +++ YA  GL+ EAL L   MQ+ G
Sbjct: 413 LASTVMDMYAKCGSIVDAKKVFDSTVEKDLILWNTLLAAYAESGLSGEALRLFYGMQLEG 472

Query: 227 VKPNLVTWNTLVTGFSQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAF 286
           V PN++TWN ++    + G+ +   ++F +M+++GI P+++SWT++++G VQN  +EEA 
Sbjct: 473 VPPNVITWNLIILSLLRNGQVDEAKDMFLQMQSSGIIPNLISWTTMMNGMVQNGCSEEAI 532

Query: 287 STFRRMLNAGFCPTSATISSLLPACASVGNGRCGKEIHGH 326
              R+M  +G  P + +I+  L ACA + +   G+ IHG+
Sbjct: 533 LFLRKMQESGLRPNAFSITVALSACAHLASLHIGRTIHGY 571

BLAST of HG10001495 vs. ExPASy Swiss-Prot
Match: Q9SV26 (Pentatricopeptide repeat-containing protein At4g01030, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H65 PE=3 SV=2)

HSP 1 Score: 172.6 bits (436), Expect = 8.7e-42
Identity = 100/340 (29.41%), Postives = 168/340 (49.41%), Query Frame = 0

Query: 32  IELIETCGRGRELNFGRLIHARLIINGLARLTHFAAKIIAFYAACGKIKDARTLFDKIPQ 91
           ++L++ C        GR IH  ++  GL         +I  Y+  GK++ +R +F+ +  
Sbjct: 93  VKLLQVCSNKEGFAEGRQIHGYVLRLGLESNVSMCNSLIVMYSRNGKLELSRKVFNSMKD 152

Query: 92  TNPRRWIVLIGAYSRCGYYTEALSVFCE-------------------------------- 151
            N   W  ++ +Y++ GY  +A+ +  E                                
Sbjct: 153 RNLSSWNSILSSYTKLGYVDDAIGLLDEMEICGLKPDIVTWNSLLSGYASKGLSKDAIAV 212

Query: 152 ---LQREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAK 211
              +Q  GL+PS   I S+L+A         G+ +H  IL++ L  D YV + LIDMY K
Sbjct: 213 LKRMQIAGLKPSTSSISSLLQAVAEPGHLKLGKAIHGYILRNQLWYDVYVETTLIDMYIK 272

Query: 212 NGQVEKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTL 271
            G +  AR VF+ M  K++VA N++VSG ++  L ++A  L+  M+  G+KP+ +TWN+L
Sbjct: 273 TGYLPYARMVFDMMDAKNIVAWNSLVSGLSYACLLKDAEALMIRMEKEGIKPDAITWNSL 332

Query: 272 VTGFSQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGF 331
            +G++ +G+ E   ++  +M+  G+ P+VVSWT++ SG  +N     A   F +M   G 
Sbjct: 333 ASGYATLGKPEKALDVIGKMKEKGVAPNVVSWTAIFSGCSKNGNFRNALKVFIKMQEEGV 392

Query: 332 CPTSATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
            P +AT+S+LL     +     GKE+HG  L   +  D Y
Sbjct: 393 GPNAATMSTLLKILGCLSLLHSGKEVHGFCLRKNLICDAY 432

BLAST of HG10001495 vs. ExPASy Swiss-Prot
Match: Q4V389 (Pentatricopeptide repeat-containing protein At1g22830 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E24 PE=2 SV=1)

HSP 1 Score: 171.8 bits (434), Expect = 1.5e-41
Identity = 112/379 (29.55%), Postives = 178/379 (46.97%), Query Frame = 0

Query: 26  SSSNEYI-----ELIETCGRGRELNFGRLIHARLIINGLARLTHFAAKIIAFYAACGKIK 85
           S S+E++      L+ TC    E   G+ +HA  I +GL   +    K++ FY+A   + 
Sbjct: 76  SGSHEFVLYSSASLLSTCVGFNEFVPGQQLHAHCISSGLEFDSVLVPKLVTFYSAFNLLD 135

Query: 86  DARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCELQREGLRPSEYIIPSVLKACG 145
           +A+T+ +     +P  W VLIG+Y R   + E++SV+  +  +G+R  E+  PSV+KAC 
Sbjct: 136 EAQTITENSEILHPLPWNVLIGSYIRNKRFQESVSVYKRMMSKGIRADEFTYPSVIKACA 195

Query: 146 HLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQVEKARRVFESMAGKDLVALNA 205
            L D   GR +H  I   S   + YVC+ALI MY + G+V+ ARR+F+ M+ +D V+ NA
Sbjct: 196 ALLDFAYGRVVHGSIEVSSHRCNLYVCNALISMYKRFGKVDVARRLFDRMSERDAVSWNA 255

Query: 206 MVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNT------------------------- 265
           +++ Y       EA  L++ M + GV+ ++VTWNT                         
Sbjct: 256 IINCYTSEEKLGEAFKLLDRMYLSGVEASIVTWNTIAGGCLEAGNYIGALNCVVGMRNCN 315

Query: 266 -----------------------------------------------LVTGFSQMGEEEI 325
                                                          L+T +S+  +   
Sbjct: 316 VRIGSVAMINGLKACSHIGALKWGKVFHCLVIRSCSFSHDIDNVRNSLITMYSRCSDLRH 375

Query: 326 VHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTSATISSLLP 328
              +F+++EAN +     +W S+ISGF  N R+EE     + ML +GF P   T++S+LP
Sbjct: 376 AFIVFQQVEANSLS----TWNSIISGFAYNERSEETSFLLKEMLLSGFHPNHITLASILP 435

BLAST of HG10001495 vs. ExPASy TrEMBL
Match: A0A5A7V3K8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold571G00210 PE=4 SV=1)

HSP 1 Score: 641.0 bits (1652), Expect = 3.2e-180
Identity = 316/336 (94.05%), Postives = 325/336 (96.73%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSLAGGAVFLRAPNR PAILRGFCSSS+EYIELIETCGR R+LNFGRL+HARLIING A
Sbjct: 1   MLSLAGGAVFLRAPNRLPAILRGFCSSSDEYIELIETCGRNRDLNFGRLLHARLIINGSA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK IAFYAACGKIKDAR LFDKIPQTNPRRWIVLIGAYSRCGYY EALSVF EL
Sbjct: 61  RLTHFAAKFIAFYAACGKIKDARVLFDKIPQTNPRRWIVLIGAYSRCGYYPEALSVFGEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR GLRPSEYIIPSVLKACGHLS+ TTGRKLHTLILKHSLESDAYVCSALIDMYAK+G+V
Sbjct: 121 QRGGLRPSEYIIPSVLKACGHLSEKTTGRKLHTLILKHSLESDAYVCSALIDMYAKSGEV 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLG+KPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGIKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQMGEEE+VHELFKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAGF PTS
Sbjct: 241 SQMGEEEMVHELFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFDTFRRMLNAGFHPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY
Sbjct: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 336

BLAST of HG10001495 vs. ExPASy TrEMBL
Match: A0A1S3CL56 (pentatricopeptide repeat-containing protein At5g59600 OS=Cucumis melo OX=3656 GN=LOC103502062 PE=4 SV=1)

HSP 1 Score: 641.0 bits (1652), Expect = 3.2e-180
Identity = 316/336 (94.05%), Postives = 325/336 (96.73%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSLAGGAVFLRAPNR PAILRGFCSSS+EYIELIETCGR R+LNFGRL+HARLIING A
Sbjct: 1   MLSLAGGAVFLRAPNRLPAILRGFCSSSDEYIELIETCGRNRDLNFGRLLHARLIINGSA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK IAFYAACGKIKDAR LFDKIPQTNPRRWIVLIGAYSRCGYY EALSVF EL
Sbjct: 61  RLTHFAAKFIAFYAACGKIKDARVLFDKIPQTNPRRWIVLIGAYSRCGYYPEALSVFGEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR GLRPSEYIIPSVLKACGHLS+ TTGRKLHTLILKHSLESDAYVCSALIDMYAK+G+V
Sbjct: 121 QRGGLRPSEYIIPSVLKACGHLSEKTTGRKLHTLILKHSLESDAYVCSALIDMYAKSGEV 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLG+KPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGIKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQMGEEE+VHELFKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAGF PTS
Sbjct: 241 SQMGEEEMVHELFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFDTFRRMLNAGFHPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY
Sbjct: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 336

BLAST of HG10001495 vs. ExPASy TrEMBL
Match: A0A0A0KBQ9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G190210 PE=4 SV=1)

HSP 1 Score: 628.6 bits (1620), Expect = 1.6e-176
Identity = 309/336 (91.96%), Postives = 322/336 (95.83%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSL GGAVFLRAPNRFPAILRGFCSSS+ YIELIETCGR R+LNFGR +HARLII+G A
Sbjct: 1   MLSLTGGAVFLRAPNRFPAILRGFCSSSDGYIELIETCGRNRDLNFGRSLHARLIIDGSA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK IAFYAACGKIKDAR LFDKIP+TNPRRWIVLIGAYSRCGYY EALSVFCEL
Sbjct: 61  RLTHFAAKFIAFYAACGKIKDARILFDKIPRTNPRRWIVLIGAYSRCGYYPEALSVFCEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR GLRPSEYIIPSVLKACGHLS+ TTGRKLHTLILK+SLESDAYVCSALIDMYAK+G+V
Sbjct: 121 QRGGLRPSEYIIPSVLKACGHLSEKTTGRKLHTLILKNSLESDAYVCSALIDMYAKSGEV 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLG+KPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGIKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQ+GEEE+V ELFKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAGF PTS
Sbjct: 241 SQIGEEEMVRELFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFDTFRRMLNAGFHPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           ATISSLLPACASVGNGRCGKEIHGHSL LGVEKDVY
Sbjct: 301 ATISSLLPACASVGNGRCGKEIHGHSLALGVEKDVY 336

BLAST of HG10001495 vs. ExPASy TrEMBL
Match: A0A6J1JH75 (pentatricopeptide repeat-containing protein At5g59600 OS=Cucurbita maxima OX=3661 GN=LOC111484460 PE=4 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 1.6e-171
Identity = 303/336 (90.18%), Postives = 317/336 (94.35%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRGFCSSSNEYIELIETCGRGRELNFGRLIHARLIINGLA 60
           MLSLAGGA F RA NRFPAI RGFCSSS+EYI+LIET GR RELNFGRL+HARLIINGLA
Sbjct: 1   MLSLAGGAAFFRATNRFPAIRRGFCSSSDEYIKLIETYGRDRELNFGRLLHARLIINGLA 60

Query: 61  RLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCEL 120
           RLTHFAAK+IAFYAACGKI DAR LFDKIPQTNPRRWIVLIGAYSR G+YTEALSVFCEL
Sbjct: 61  RLTHFAAKLIAFYAACGKINDARELFDKIPQTNPRRWIVLIGAYSRYGFYTEALSVFCEL 120

Query: 121 QREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQV 180
           QR+G  PSEYIIPSVLKACGHLSD  TGRKLH LILK+SLESDAYVCSALIDMYAK+GQV
Sbjct: 121 QRQGSTPSEYIIPSVLKACGHLSDIPTGRKLHALILKYSLESDAYVCSALIDMYAKSGQV 180

Query: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGF 240
           EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVE+MQVLGVKPNLVTWNTLVTGF
Sbjct: 181 EKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEKMQVLGVKPNLVTWNTLVTGF 240

Query: 241 SQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTS 300
           SQM EEE+VHELFKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAG CPTS
Sbjct: 241 SQMDEEEMVHELFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFGTFRRMLNAGLCPTS 300

Query: 301 ATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           ATISSLLPACAS GNGR GKEIHG++LVLGVEKD+Y
Sbjct: 301 ATISSLLPACASAGNGRRGKEIHGYALVLGVEKDIY 336

BLAST of HG10001495 vs. ExPASy TrEMBL
Match: A0A6J1FW47 (pentatricopeptide repeat-containing protein At5g59600 OS=Cucurbita moschata OX=3662 GN=LOC111449033 PE=4 SV=1)

HSP 1 Score: 608.2 bits (1567), Expect = 2.3e-170
Identity = 303/337 (89.91%), Postives = 318/337 (94.36%), Query Frame = 0

Query: 1   MLSLAGGAVFLRAPNRFPAILRG-FCSSSNEYIELIETCGRGRELNFGRLIHARLIINGL 60
           MLSLAGGA FLRA NRFPAI RG FCSSS+EYI+LIET GR RELNFGRL+HARLIINGL
Sbjct: 1   MLSLAGGAAFLRATNRFPAIRRGFFCSSSDEYIKLIETYGRDRELNFGRLLHARLIINGL 60

Query: 61  ARLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCE 120
           ARLTHFAAK+IAFYAACGKI DAR +FDKIPQTNPRRWIVLIGAYSR G+YTEALSVFCE
Sbjct: 61  ARLTHFAAKLIAFYAACGKINDAREVFDKIPQTNPRRWIVLIGAYSRYGFYTEALSVFCE 120

Query: 121 LQREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQ 180
           LQR+G RPSEYIIPSVLKACGHLSD  TGRKLH LILK+S ESDAYVCSALIDMYAK+GQ
Sbjct: 121 LQRQGSRPSEYIIPSVLKACGHLSDILTGRKLHALILKYSFESDAYVCSALIDMYAKSGQ 180

Query: 181 VEKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTG 240
           VEKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVE+MQVLGVKPNLVTWNTLVTG
Sbjct: 181 VEKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEKMQVLGVKPNLVTWNTLVTG 240

Query: 241 FSQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPT 300
           FSQM EEE+VHE+FKEMEANGI+PDVVSWTSVISGFVQNFRNEEAF TFRRMLNAG CPT
Sbjct: 241 FSQMDEEEMVHEIFKEMEANGIEPDVVSWTSVISGFVQNFRNEEAFGTFRRMLNAGLCPT 300

Query: 301 SATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
           SATISSLLPACAS GNGR GKEIHG++LVLGVEKDVY
Sbjct: 301 SATISSLLPACASAGNGRRGKEIHGYALVLGVEKDVY 337

BLAST of HG10001495 vs. TAIR 10
Match: AT5G59600.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 328.6 bits (841), Expect = 6.7e-90
Identity = 156/306 (50.98%), Postives = 223/306 (72.88%), Query Frame = 0

Query: 27  SSNEYIELIETCGRGRELNFGRLIHARLIINGLARLTHFAAKIIAFYAACGKIKDARTLF 86
           S   Y+ELIE  GR R    GR++HA L+ +G+ARLT  AAK++ FY  CGK+ DAR +F
Sbjct: 15  SIGSYVELIEANGRDRLFCRGRVLHAHLVTSGIARLTRIAAKLVTFYVECGKVLDARKVF 74

Query: 87  DKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCELQREGLRPSEYIIPSVLKACGHLSDNT 146
           D++P+ +    +V+IGA +R GYY E+L  F E+ ++GL+   +I+PS+LKA  +L D  
Sbjct: 75  DEMPKRDISGCVVMIGACARNGYYQESLDFFREMYKDGLKLDAFIVPSLLKASRNLLDRE 134

Query: 147 TGRKLHTLILKHSLESDAYVCSALIDMYAKNGQVEKARRVFESMAGKDLVALNAMVSGYA 206
            G+ +H L+LK S ESDA++ S+LIDMY+K G+V  AR+VF  +  +DLV  NAM+SGYA
Sbjct: 135 FGKMIHCLVLKFSYESDAFIVSSLIDMYSKFGEVGNARKVFSDLGEQDLVVFNAMISGYA 194

Query: 207 HHGLAEEALNLVEEMQVLGVKPNLVTWNTLVTGFSQMGEEEIVHELFKEMEANGIQPDVV 266
           ++  A+EALNLV++M++LG+KP+++TWN L++GFS M  EE V E+ + M  +G +PDVV
Sbjct: 195 NNSQADEALNLVKDMKLLGIKPDVITWNALISGFSHMRNEEKVSEILELMCLDGYKPDVV 254

Query: 267 SWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTSATISSLLPACASVGNGRCGKEIHGHS 326
           SWTS+ISG V NF+NE+AF  F++ML  G  P SATI +LLPAC ++   + GKEIHG+S
Sbjct: 255 SWTSIISGLVHNFQNEKAFDAFKQMLTHGLYPNSATIITLLPACTTLAYMKHGKEIHGYS 314

Query: 327 LVLGVE 333
           +V G+E
Sbjct: 315 VVTGLE 320

BLAST of HG10001495 vs. TAIR 10
Match: AT1G19720.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 213.4 bits (542), Expect = 3.1e-55
Identity = 104/307 (33.88%), Postives = 181/307 (58.96%), Query Frame = 0

Query: 31  YIELIETCGRGRELNFGRLIHARLIINGLARL--THFAAKIIAFYAACGKIKDARTLFDK 90
           Y++L+E+C     ++ GR++HAR    GL          K+++ YA CG I DAR +FD 
Sbjct: 84  YLKLLESCIDSGSIHLGRILHARF---GLFTEPDVFVETKLLSMYAKCGCIADARKVFDS 143

Query: 91  IPQTNPRRWIVLIGAYSRCGYYTEALSVFCELQREGLRPSEYIIPSVLKACGHLSDNTTG 150
           + + N   W  +IGAYSR   + E   +F  + ++G+ P +++ P +L+ C +  D   G
Sbjct: 144 MRERNLFTWSAMIGAYSRENRWREVAKLFRLMMKDGVLPDDFLFPKILQGCANCGDVEAG 203

Query: 151 RKLHTLILKHSLESDAYVCSALIDMYAKNGQVEKARRVFESMAGKDLVALNAMVSGYAHH 210
           + +H++++K  + S   V ++++ +YAK G+++ A + F  M  +D++A N+++  Y  +
Sbjct: 204 KVIHSVVIKLGMSSCLRVSNSILAVYAKCGELDFATKFFRRMRERDVIAWNSVLLAYCQN 263

Query: 211 GLAEEALNLVEEMQVLGVKPNLVTWNTLVTGFSQMGEEEIVHELFKEMEANGIQPDVVSW 270
           G  EEA+ LV+EM+  G+ P LVTWN L+ G++Q+G+ +   +L ++ME  GI  DV +W
Sbjct: 264 GKHEEAVELVKEMEKEGISPGLVTWNILIGGYNQLGKCDAAMDLMQKMETFGITADVFTW 323

Query: 271 TSVISGFVQNFRNEEAFSTFRRMLNAGFCPTSATISSLLPACASVGNGRCGKEIHGHSLV 330
           T++ISG + N    +A   FR+M  AG  P + TI S + AC+ +     G E+H  ++ 
Sbjct: 324 TAMISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSAVSACSCLKVINQGSEVHSIAVK 383

Query: 331 LGVEKDV 336
           +G   DV
Sbjct: 384 MGFIDDV 387

BLAST of HG10001495 vs. TAIR 10
Match: AT5G55740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 179.1 bits (453), Expect = 6.6e-45
Identity = 91/280 (32.50%), Postives = 162/280 (57.86%), Query Frame = 0

Query: 47  GRLIHARLIINGLARLTHFAAKIIAFYAACGKIKDARTLFDKIPQTNPRRWIVLIGAYSR 106
           G+  HA  I+NG+         ++ FY   G I+ A  +FD++ + +   W ++I  Y +
Sbjct: 293 GKQSHAIAIVNGMELDNILGTSLLNFYCKVGLIEYAEMVFDRMFEKDVVTWNLIISGYVQ 352

Query: 107 CGYYTEALSVFCELQR-EGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAY 166
            G   +A+   C+L R E L+     + +++ A     +   G+++    ++HS ESD  
Sbjct: 353 QGLVEDAI-YMCQLMRLEKLKYDCVTLATLMSAAARTENLKLGKEVQCYCIRHSFESDIV 412

Query: 167 VCSALIDMYAKNGQVEKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLG 226
           + S ++DMYAK G +  A++VF+S   KDL+  N +++ YA  GL+ EAL L   MQ+ G
Sbjct: 413 LASTVMDMYAKCGSIVDAKKVFDSTVEKDLILWNTLLAAYAESGLSGEALRLFYGMQLEG 472

Query: 227 VKPNLVTWNTLVTGFSQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAF 286
           V PN++TWN ++    + G+ +   ++F +M+++GI P+++SWT++++G VQN  +EEA 
Sbjct: 473 VPPNVITWNLIILSLLRNGQVDEAKDMFLQMQSSGIIPNLISWTTMMNGMVQNGCSEEAI 532

Query: 287 STFRRMLNAGFCPTSATISSLLPACASVGNGRCGKEIHGH 326
              R+M  +G  P + +I+  L ACA + +   G+ IHG+
Sbjct: 533 LFLRKMQESGLRPNAFSITVALSACAHLASLHIGRTIHGY 571

BLAST of HG10001495 vs. TAIR 10
Match: AT4G01030.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 172.6 bits (436), Expect = 6.2e-43
Identity = 100/340 (29.41%), Postives = 168/340 (49.41%), Query Frame = 0

Query: 32  IELIETCGRGRELNFGRLIHARLIINGLARLTHFAAKIIAFYAACGKIKDARTLFDKIPQ 91
           ++L++ C        GR IH  ++  GL         +I  Y+  GK++ +R +F+ +  
Sbjct: 93  VKLLQVCSNKEGFAEGRQIHGYVLRLGLESNVSMCNSLIVMYSRNGKLELSRKVFNSMKD 152

Query: 92  TNPRRWIVLIGAYSRCGYYTEALSVFCE-------------------------------- 151
            N   W  ++ +Y++ GY  +A+ +  E                                
Sbjct: 153 RNLSSWNSILSSYTKLGYVDDAIGLLDEMEICGLKPDIVTWNSLLSGYASKGLSKDAIAV 212

Query: 152 ---LQREGLRPSEYIIPSVLKACGHLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAK 211
              +Q  GL+PS   I S+L+A         G+ +H  IL++ L  D YV + LIDMY K
Sbjct: 213 LKRMQIAGLKPSTSSISSLLQAVAEPGHLKLGKAIHGYILRNQLWYDVYVETTLIDMYIK 272

Query: 212 NGQVEKARRVFESMAGKDLVALNAMVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNTL 271
            G +  AR VF+ M  K++VA N++VSG ++  L ++A  L+  M+  G+KP+ +TWN+L
Sbjct: 273 TGYLPYARMVFDMMDAKNIVAWNSLVSGLSYACLLKDAEALMIRMEKEGIKPDAITWNSL 332

Query: 272 VTGFSQMGEEEIVHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGF 331
            +G++ +G+ E   ++  +M+  G+ P+VVSWT++ SG  +N     A   F +M   G 
Sbjct: 333 ASGYATLGKPEKALDVIGKMKEKGVAPNVVSWTAIFSGCSKNGNFRNALKVFIKMQEEGV 392

Query: 332 CPTSATISSLLPACASVGNGRCGKEIHGHSLVLGVEKDVY 337
            P +AT+S+LL     +     GKE+HG  L   +  D Y
Sbjct: 393 GPNAATMSTLLKILGCLSLLHSGKEVHGFCLRKNLICDAY 432

BLAST of HG10001495 vs. TAIR 10
Match: AT1G22830.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 171.8 bits (434), Expect = 1.1e-42
Identity = 112/379 (29.55%), Postives = 178/379 (46.97%), Query Frame = 0

Query: 26  SSSNEYI-----ELIETCGRGRELNFGRLIHARLIINGLARLTHFAAKIIAFYAACGKIK 85
           S S+E++      L+ TC    E   G+ +HA  I +GL   +    K++ FY+A   + 
Sbjct: 76  SGSHEFVLYSSASLLSTCVGFNEFVPGQQLHAHCISSGLEFDSVLVPKLVTFYSAFNLLD 135

Query: 86  DARTLFDKIPQTNPRRWIVLIGAYSRCGYYTEALSVFCELQREGLRPSEYIIPSVLKACG 145
           +A+T+ +     +P  W VLIG+Y R   + E++SV+  +  +G+R  E+  PSV+KAC 
Sbjct: 136 EAQTITENSEILHPLPWNVLIGSYIRNKRFQESVSVYKRMMSKGIRADEFTYPSVIKACA 195

Query: 146 HLSDNTTGRKLHTLILKHSLESDAYVCSALIDMYAKNGQVEKARRVFESMAGKDLVALNA 205
            L D   GR +H  I   S   + YVC+ALI MY + G+V+ ARR+F+ M+ +D V+ NA
Sbjct: 196 ALLDFAYGRVVHGSIEVSSHRCNLYVCNALISMYKRFGKVDVARRLFDRMSERDAVSWNA 255

Query: 206 MVSGYAHHGLAEEALNLVEEMQVLGVKPNLVTWNT------------------------- 265
           +++ Y       EA  L++ M + GV+ ++VTWNT                         
Sbjct: 256 IINCYTSEEKLGEAFKLLDRMYLSGVEASIVTWNTIAGGCLEAGNYIGALNCVVGMRNCN 315

Query: 266 -----------------------------------------------LVTGFSQMGEEEI 325
                                                          L+T +S+  +   
Sbjct: 316 VRIGSVAMINGLKACSHIGALKWGKVFHCLVIRSCSFSHDIDNVRNSLITMYSRCSDLRH 375

Query: 326 VHELFKEMEANGIQPDVVSWTSVISGFVQNFRNEEAFSTFRRMLNAGFCPTSATISSLLP 328
              +F+++EAN +     +W S+ISGF  N R+EE     + ML +GF P   T++S+LP
Sbjct: 376 AFIVFQQVEANSLS----TWNSIISGFAYNERSEETSFLLKEMLLSGFHPNHITLASILP 435

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038901081.14.5e-18194.35pentatricopeptide repeat-containing protein At5g59600-like [Benincasa hispida][more]
XP_008464099.16.6e-18094.05PREDICTED: pentatricopeptide repeat-containing protein At5g59600 [Cucumis melo] ... [more]
XP_004143130.13.4e-17691.96pentatricopeptide repeat-containing protein At5g59600 [Cucumis sativus][more]
XP_031743085.12.9e-17591.37LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g59600-like ... [more]
XP_023511811.11.1e-17190.77pentatricopeptide repeat-containing protein At5g59600 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
Q9FGR29.5e-8950.98Pentatricopeptide repeat-containing protein At5g59600 OS=Arabidopsis thaliana OX... [more]
Q9FXH14.4e-5433.88Pentatricopeptide repeat-containing protein At1g19720 OS=Arabidopsis thaliana OX... [more]
Q9FM649.3e-4432.50Pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Arabidop... [more]
Q9SV268.7e-4229.41Pentatricopeptide repeat-containing protein At4g01030, mitochondrial OS=Arabidop... [more]
Q4V3891.5e-4129.55Pentatricopeptide repeat-containing protein At1g22830 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A5A7V3K83.2e-18094.05Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3CL563.2e-18094.05pentatricopeptide repeat-containing protein At5g59600 OS=Cucumis melo OX=3656 GN... [more]
A0A0A0KBQ91.6e-17691.96Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G190210 PE=4 SV=1[more]
A0A6J1JH751.6e-17190.18pentatricopeptide repeat-containing protein At5g59600 OS=Cucurbita maxima OX=366... [more]
A0A6J1FW472.3e-17089.91pentatricopeptide repeat-containing protein At5g59600 OS=Cucurbita moschata OX=3... [more]
Match NameE-valueIdentityDescription
AT5G59600.16.7e-9050.98Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G19720.13.1e-5533.88Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT5G55740.16.6e-4532.50Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G01030.16.2e-4329.41pentatricopeptide (PPR) repeat-containing protein [more]
AT1G22830.11.1e-4229.55Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 165..194
e-value: 1.0E-6
score: 26.6
coord: 197..229
e-value: 1.3E-6
score: 26.2
coord: 231..265
e-value: 1.8E-7
score: 28.9
coord: 266..299
e-value: 1.6E-5
score: 22.8
coord: 99..129
e-value: 2.5E-5
score: 22.2
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 217..277
e-value: 6.2E-12
score: 45.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 97..125
e-value: 2.8E-5
score: 24.1
coord: 166..193
e-value: 8.5E-6
score: 25.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 264..298
score: 11.213468
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 229..263
score: 12.221907
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 194..228
score: 11.925952
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 163..193
score: 9.985802
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 93..127
score: 10.98328
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 256..323
e-value: 6.0E-10
score: 40.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 145..248
e-value: 8.9E-27
score: 95.6
coord: 22..144
e-value: 2.5E-14
score: 55.0
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 66..195
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 276..325
NoneNo IPR availablePANTHERPTHR24015:SF1603PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 276..325
coord: 147..276
NoneNo IPR availablePANTHERPTHR24015:SF1603PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 30..232
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 147..276
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 30..232
coord: 230..292
NoneNo IPR availablePANTHERPTHR24015:SF1603PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 230..292

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10001495.1HG10001495.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding