Bhi02G001054 (gene) Wax gourd

Name: Bhi02G001054
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: Pentatricopeptide repeat
Location: chr2 : 30313868 .. 30316086 (+)
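As a quick sanity check on the location above, the span length can be derived from the coordinates. The following is a minimal sketch only: `parse_location` and `span_length` are hypothetical helper names, and the code assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a string such as 'chr2 : 30313868 .. 30316086 (+)'.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    pattern = r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("chr2 : 30313868 .. 30316086 (+)")
print(seqid, strand, span_length(start, end))
```

Under the stated assumption, the feature spans 2,219 bp on the plus strand of chr2.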
The following sequences are available for this feature:

Gene sequence (with intron)

GTTCGATCTTCATCGATATAGATGAGGTGTACAAGTGAGGCCTACCTCATGGATTCCTTATCCAAAGGCTAATGCTCTACAATAGAATCCACTTGGATTAGATTTGCTTTTGATATTTCTGTTGCTCGTTTCTCTTGGCACTAGAGAAAAAGAAACGTTTCCAAATCCTCCTCAATTCAAACAAAAGACGATGCTCCATCTCCAAGAACTTCAAATGGCATAGCAATCTCCATCAACGAGCATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAACCCATTATCGAAGCCAACAATTCCCCGATCACATTCAGACTCCCTCGTCACTCGCAAATTTTCAAACAAAACCCATCTCAGAAATGGCGCATCTTCTGCTGAATCCAGAGAACCCCATTTCTCCAATCTCCATAACAGAGATGCCCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAATGAGTCGCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTTAAACCTGATGTTGTTCTCTGTACGAAGCTCATTAAAGGGTTTTTTAATTCGAGGAATTTGAAGAAAGCTATGAGGGTTATGGAGATTTTGGAAACTTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATTAGTGGGTTTAGTAAAGCTAATCAAATTGAGTCTGCAAACAAGGTGTTTGATAGAATGCGCAGCAGGGGATTTTCCCCTGATGTTGTTACATACAATATAATGATTGGGTGTTTGTGTAGTAGGGGAAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGATGTAAACCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAATTGCTGTCGAGGGGCCTCCGTCCCGACTTGTATACATACAATGCCATCATTAGGGGTATTTGCAAGGAAGGAATGGAGGATCGAGCTGTGGAATTTGTTCAGGGTTTATCAGCTAGAGGGTGTAATCCAGATGTGATTTCATACAATATTCTGCTGCGTTCCTTTTTAAACAAAAGCAGGTGGGCAGACGGGGAGAAGCTTATGAAAGACATGGTTTTAATTGGCTGTGAGCCGAATGTCGTTACTCACAGCATCTTAATTAGTTCGTTGTGTCGTGAAGGGAGAGTTGGGGAAGCAGTGAATGTGTTGAAGGTGATGAAGGAGAAAGGCTTAACCCCAGATGCATATAGCTATGATCCACTGATTTCTGCCTTCTGCAAAGAAGGGAGATTAGATTTAGCAATTGAGTATTTGCACAAAATGGTTTCTGATGGTTGTTTGCCTGATATTGTTAACTACAATACAATTTTAGCTACTCTTTGTAAATTTGGTAGCGCTGATCTGGCTTTAGACATCTTTGAGAAGCTAGATGAAGTGGGTTGCCCTCCAAATGTGAGCTCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAAGAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCTGACGAGATAACGTACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTCGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAACTTCCAGCCAACAGTGATCAGCTTCAACATTGTTCTGCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGTGTACCGAACAAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCTGGGTGGCGAGCAGAGGCTATGGAGTTGGCTAACGCTCTGTACAGATTGGGAGTTATTTGTGAGGATTCTTCCAAGCGTTTGAATAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAACCTGATGTTGACTACAGA
AGTTTCAACTTTTGGCCTATTTTGTGTGTAATGCTTTAAAAGGAAAATGATTCTGATTTTTTTTTTCCCTGTTCAGCTCAGTGATTAGAAAGCAAAGGAAGCAAATTTTTTGATAATCTAGAAGATTAAGGTCATCAACTCATGCAGCTTGTATTTACCTTCTAAGGGAGATACTCAAATTAACCTATAAGATCGGGCTCTAACCATATTTGACTACAA

mRNA sequence

GTTCGATCTTCATCGATATAGATGAGGTGTACAAGTGAGGCCTACCTCATGGATTCCTTATCCAAAGGCTAATGCTCTACAATAGAATCCACTTGGATTAGATTTGCTTTTGATATTTCTGTTGCTCGTTTCTCTTGGCACTAGAGAAAAAGAAACGTTTCCAAATCCTCCTCAATTCAAACAAAAGACGATGCTCCATCTCCAAGAACTTCAAATGGCATAGCAATCTCCATCAACGAGCATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAACCCATTATCGAAGCCAACAATTCCCCGATCACATTCAGACTCCCTCGTCACTCGCAAATTTTCAAACAAAACCCATCTCAGAAATGGCGCATCTTCTGCTGAATCCAGAGAACCCCATTTCTCCAATCTCCATAACAGAGATGCCCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAATGAGTCGCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTTAAACCTGATGTTGTTCTCTGTACGAAGCTCATTAAAGGGTTTTTTAATTCGAGGAATTTGAAGAAAGCTATGAGGGTTATGGAGATTTTGGAAACTTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATTAGTGGGTTTAGTAAAGCTAATCAAATTGAGTCTGCAAACAAGGTGTTTGATAGAATGCGCAGCAGGGGATTTTCCCCTGATGTTGTTACATACAATATAATGATTGGGTGTTTGTGTAGTAGGGGAAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGATGTAAACCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAATTGCTGTCGAGGGGCCTCCGTCCCGACTTGTATACATACAATGCCATCATTAGGGGTATTTGCAAGGAAGGAATGGAGGATCGAGCTGTGGAATTTGTTCAGGGTTTATCAGCTAGAGGGTGTAATCCAGATGTGATTTCATACAATATTCTGCTGCGTTCCTTTTTAAACAAAAGCAGGTGGGCAGACGGGGAGAAGCTTATGAAAGACATGGTTTTAATTGGCTGTGAGCCGAATGTCGTTACTCACAGCATCTTAATTAGTTCGTTGTGTCGTGAAGGGAGAGTTGGGGAAGCAGTGAATGTGTTGAAGGTGATGAAGGAGAAAGGCTTAACCCCAGATGCATATAGCTATGATCCACTGATTTCTGCCTTCTGCAAAGAAGGGAGATTAGATTTAGCAATTGAGTATTTGCACAAAATGGTTTCTGATGGTTGTTTGCCTGATATTGTTAACTACAATACAATTTTAGCTACTCTTTGTAAATTTGGTAGCGCTGATCTGGCTTTAGACATCTTTGAGAAGCTAGATGAAGTGGGTTGCCCTCCAAATGTGAGCTCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAAGAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCTGACGAGATAACGTACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTCGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAACTTCCAGCCAACAGTGATCAGCTTCAACATTGTTCTGCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGTGTACCGAACAAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCTGGGTGGCGAGCAGAGGCTATGGAGTTGGCTAACGCTCTGTACAGATTGGGAGTTATTTGTGAGGATTCTTCCAAGCGTTTGAATAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAACCTGATGTTGACTACAGA
AGTTTCAACTTTTGGCCTATTTTGTGTGTAATGCTTTAAAAGGAAAATGATTCTGATTTTTTTTTTCCCTGTTCAGCTCAGTGATTAGAAAGCAAAGGAAGCAAATTTTTTGATAATCTAGAAGATTAAGGTCATCAACTCATGCAGCTTGTATTTACCTTCTAAGGGAGATACTCAAATTAACCTATAAGATCGGGCTCTAACCATATTTGACTACAA

Coding sequence (CDS)

ATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAACCCATTATCGAAGCCAACAATTCCCCGATCACATTCAGACTCCCTCGTCACTCGCAAATTTTCAAACAAAACCCATCTCAGAAATGGCGCATCTTCTGCTGAATCCAGAGAACCCCATTTCTCCAATCTCCATAACAGAGATGCCCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAATGAGTCGCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTTAAACCTGATGTTGTTCTCTGTACGAAGCTCATTAAAGGGTTTTTTAATTCGAGGAATTTGAAGAAAGCTATGAGGGTTATGGAGATTTTGGAAACTTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATTAGTGGGTTTAGTAAAGCTAATCAAATTGAGTCTGCAAACAAGGTGTTTGATAGAATGCGCAGCAGGGGATTTTCCCCTGATGTTGTTACATACAATATAATGATTGGGTGTTTGTGTAGTAGGGGAAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGATGTAAACCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAATTGCTGTCGAGGGGCCTCCGTCCCGACTTGTATACATACAATGCCATCATTAGGGGTATTTGCAAGGAAGGAATGGAGGATCGAGCTGTGGAATTTGTTCAGGGTTTATCAGCTAGAGGGTGTAATCCAGATGTGATTTCATACAATATTCTGCTGCGTTCCTTTTTAAACAAAAGCAGGTGGGCAGACGGGGAGAAGCTTATGAAAGACATGGTTTTAATTGGCTGTGAGCCGAATGTCGTTACTCACAGCATCTTAATTAGTTCGTTGTGTCGTGAAGGGAGAGTTGGGGAAGCAGTGAATGTGTTGAAGGTGATGAAGGAGAAAGGCTTAACCCCAGATGCATATAGCTATGATCCACTGATTTCTGCCTTCTGCAAAGAAGGGAGATTAGATTTAGCAATTGAGTATTTGCACAAAATGGTTTCTGATGGTTGTTTGCCTGATATTGTTAACTACAATACAATTTTAGCTACTCTTTGTAAATTTGGTAGCGCTGATCTGGCTTTAGACATCTTTGAGAAGCTAGATGAAGTGGGTTGCCCTCCAAATGTGAGCTCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAAGAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCTGACGAGATAACGTACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTCGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAACTTCCAGCCAACAGTGATCAGCTTCAACATTGTTCTGCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGTGTACCGAACAAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCTGGGTGGCGAGCAGAGGCTATGGAGTTGGCTAACGCTCTGTACAGATTGGGAGTTATTTGTGAGGATTCTTCCAAGCGTTTGAATAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAACCTGA

Protein sequence

MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGFSPDVVTYNIMIGCLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYNAIIRGICKEGMEDRAVEFVQGLSARGCNPDVISYNILLRSFLNKSRWADGEKLMKDMVLIGCEPNVVTHSILISSLCREGRVGEAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLAIEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSALWSCGKKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATNFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCVPNKTSYVLLIEGIAYAGWRAEAMELANALYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQT
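
The protein sequence above is the conceptual translation of the CDS. The sketch below illustrates that step, assuming the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper name, and only the first ten and last three codons of the CDS are used, for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTTTTCATCGGAATTTCTCCCTCAGAGC"))  # first ten codons of the CDS
print(translate("CAAACCTGA"))                       # last three codons, including the stop
```

The two fragments translate to MFSSEFLPQS and QT, matching the first ten and last two residues of the protein sequence listed above.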
BLAST of Bhi02G001054 vs. Swiss-Prot
Match: sp|Q9SR00|PP213_ARATH (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 3.5e-18
Identity = 44/76 (57.89%), Positives = 59/76 (77.63%), Query Frame = 0

Query: 49  ESREPHFSNLHNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFF 108
           E R+ H  +L  RD  ++K+ +RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF
Sbjct: 76  ERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFF 135

Query: 109 NSRNLKKAMRVMEILE 125
             RN+ KA+RVMEILE
Sbjct: 136 TLRNIPKAVRVMEILE 151
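
The Identity and Positives figures reported for this HSP can be recomputed from the alignment rows. The sketch below is illustrative, not the BLAST implementation: `hsp_stats` is a hypothetical helper, and it assumes the usual BLAST display convention in which the midline marks conservative substitutions with `+`.

```python
def hsp_stats(query, midline, subject):
    """Return (identities, positives, alignment_length) for one HSP.

    Identities are aligned columns carrying the same residue; positives are
    identities plus the conservative substitutions marked '+' on the midline.
    """
    if len(query) != len(subject):
        raise ValueError("query and subject rows must have equal length")
    identities = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Rows of HSP 1 against sp|Q9SR00|PP213_ARATH, with the two display lines joined.
query = ("ESREPHFSNLHNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFF"
         "NSRNLKKAMRVMEILE")
midline = ("E R+ H  +L  RD  ++K+ +RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF"
           "  RN+ KA+RVMEILE")
subject = ("ERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFF"
           "TLRNIPKAVRVMEILE")

identities, positives, length = hsp_stats(query, midline, subject)
print(f"Identity = {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"Positives = {positives}/{length} ({100 * positives / length:.2f}%)")
```

For this HSP the counts come out as 44/76 and 59/76, in agreement with the values reported in the HSP header.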

BLAST of Bhi02G001054 vs. Swiss-Prot
Match: sp|O80647|PP195_ARATH (Pentatricopeptide repeat-containing protein At2g39620 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E33 PE=3 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 4.4e-05
Identity = 30/86 (34.88%), Positives = 46/86 (53.49%), Query Frame = 0

Query: 74  RAGKHNESL-YFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG-DPDV 133
           RAG H E+L +F      KG  PD    T  +K    S + KK +R+ +++   G + DV
Sbjct: 76  RAGLHREALGFFGYMSEEKGIDPDKYSFTFALKACAGSMDFKKGLRIHDLIAEMGLESDV 135

Query: 134 YSYNAMISGFSKANQIESANKVFDRM 158
           Y   A++  + KA  + SA +VFD+M
Sbjct: 136 YIGTALVEMYCKARDLVSARQVFDKM 161

BLAST of Bhi02G001054 vs. Swiss-Prot
Match: sp|Q9LN22|PPR54_ARATH (Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g20300 PE=2 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 2.9e-04
Identity = 25/91 (27.47%), Positives = 44/91 (48.35%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG 127
           L+ R  RAG  +E+++    +   G  PD +  + +I      R   +A    + L+   
Sbjct: 192 LIRRYVRAGLASEAVHCFNRMEDYGCVPDKIAFSIVISNLSRKRRASEAQSFFDSLKDRF 251

Query: 128 DPDVYSYNAMISGFSKANQIESANKVFDRMR 159
           +PDV  Y  ++ G+ +A +I  A KVF  M+
Sbjct: 252 EPDVIVYTNLVRGWCRAGEISEAEKVFKEMK 282

BLAST of Bhi02G001054 vs. TAIR10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 94.7 bits (234), Expect = 1.9e-19
Identity = 44/76 (57.89%), Positives = 59/76 (77.63%), Query Frame = 0

Query: 49  ESREPHFSNLHNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFF 108
           E R+ H  +L  RD  ++K+ +RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF
Sbjct: 76  ERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFF 135

Query: 109 NSRNLKKAMRVMEILE 125
             RN+ KA+RVMEILE
Sbjct: 136 TLRNIPKAVRVMEILE 151

BLAST of Bhi02G001054 vs. TAIR10
Match: AT2G39620.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 51.2 bits (121), Expect = 2.5e-06
Identity = 30/86 (34.88%), Positives = 46/86 (53.49%), Query Frame = 0

Query: 74  RAGKHNESL-YFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG-DPDV 133
           RAG H E+L +F      KG  PD    T  +K    S + KK +R+ +++   G + DV
Sbjct: 76  RAGLHREALGFFGYMSEEKGIDPDKYSFTFALKACAGSMDFKKGLRIHDLIAEMGLESDV 135

Query: 134 YSYNAMISGFSKANQIESANKVFDRM 158
           Y   A++  + KA  + SA +VFD+M
Sbjct: 136 YIGTALVEMYCKARDLVSARQVFDKM 161

BLAST of Bhi02G001054 vs. TAIR10
Match: AT1G20300.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 48.5 bits (114), Expect = 1.6e-05
Identity = 25/91 (27.47%), Positives = 44/91 (48.35%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG 127
           L+ R  RAG  +E+++    +   G  PD +  + +I      R   +A    + L+   
Sbjct: 192 LIRRYVRAGLASEAVHCFNRMEDYGCVPDKIAFSIVISNLSRKRRASEAQSFFDSLKDRF 251

Query: 128 DPDVYSYNAMISGFSKANQIESANKVFDRMR 159
           +PDV  Y  ++ G+ +A +I  A KVF  M+
Sbjct: 252 EPDVIVYTNLVRGWCRAGEISEAEKVFKEMK 282

BLAST of Bhi02G001054 vs. TAIR10
Match: AT3G09650.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 46.2 bits (108), Expect = 7.9e-05
Identity = 27/86 (31.40%), Positives = 43/86 (50.00%), Query Frame = 0

Query: 81  SLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDP----DVYSYNA 140
           +L F   + ++G  P  +  T L+K F  S   K A RV +  E   DP    D+ ++N 
Sbjct: 542 ALAFFNEMRTRGIAPTKISYTTLMKAFAMSGQPKLANRVFD--EMMNDPRVKVDLIAWNM 601

Query: 141 MISGFSKANQIESANKVFDRMRSRGF 163
           ++ G+ +   IE A +V  RM+  GF
Sbjct: 602 LVEGYCRLGLIEDAQRVVSRMKENGF 625

BLAST of Bhi02G001054 vs. TAIR10
Match: AT2G01740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 45.4 bits (106), Expect = 1.4e-04
Identity = 28/97 (28.87%), Positives = 50/97 (51.55%), Query Frame = 0

Query: 64  HLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEIL 123
           H+ +L+N +C       SL FL  +VS+G+ P       ++        +K A  ++  +
Sbjct: 27  HIHQLINSNCGI----LSLKFLAYLVSRGYTPHRSSFNSVVSFVCKLGQVKFAEDIVHSM 86

Query: 124 ETYG-DPDVYSYNAMISGFSKANQIESANKVFDRMRS 160
             +G +PDV SYN++I G  +   I SA+ V + +R+
Sbjct: 87  PRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRA 119

BLAST of Bhi02G001054 vs. TrEMBL
Match: tr|A0A1S3B9K3|A0A1S3B9K3_CUCME (pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487275 PE=4 SV=1)

HSP 1 Score: 300.4 bits (768), Expect = 8.5e-78
Identity = 149/162 (91.98%), Positives = 157/162 (96.91%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60
           MFSSEFLPQSLHFTNPLSKPTIP+SHSDS+ TR+FSNKT+LRN  SSAESR+PHF NL N
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMRSRGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 162

BLAST of Bhi02G001054 vs. TrEMBL
Match: tr|A0A0A0M3C6|A0A0A0M3C6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G666460 PE=4 SV=1)

HSP 1 Score: 292.7 bits (748), Expect = 1.8e-75
Identity = 146/162 (90.12%), Positives = 153/162 (94.44%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60
           MFSSEFLPQSLHFTNPL+KPTIP+S SDS+   +FSNKTHLRN  SSAE R+PHF NL N
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMRSRGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 162

BLAST of Bhi02G001054 vs. TrEMBL
Match: tr|A0A1U8LF97|A0A1U8LF97_GOSHI (pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like OS=Gossypium hirsutum OX=3635 GN=LOC107926841 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 3.7e-41
Identity = 95/174 (54.60%), Positives = 125/174 (71.84%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKP-TIPRSHSDSLVT-----------RKFSNKTHLRNGASSA 60
           +FS+E +P  L F     KP +   SH  SLV+           RK  N   +R    SA
Sbjct: 4   LFSTELIPHGLPFHPQQLKPVSNSSSHHTSLVSCLSHEGTKDSIRKSRNNQKVR---VSA 63

Query: 61  ESREPHFSNLHNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFF 120
           E+R  H S+   +++HLMKLLNRSC++GK++E+ YFLE +V KG+KPDVVLCTK+IKGFF
Sbjct: 64  ETRPTHLSSFDFKESHLMKLLNRSCKSGKYHEAFYFLECMVGKGYKPDVVLCTKMIKGFF 123

Query: 121 NSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           N RN++KA+RVME+LETYG+PDV++YNA+ISGF K N+++ ANKV DRMRSRGF
Sbjct: 124 NGRNVEKAIRVMEMLETYGEPDVFAYNALISGFCKMNRLDFANKVLDRMRSRGF 174

BLAST of Bhi02G001054 vs. TrEMBL
Match: tr|A0A0D2TML8|A0A0D2TML8_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_007G275900 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 3.7e-41
Identity = 95/174 (54.60%), Positives = 125/174 (71.84%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKP-TIPRSHSDSLVT-----------RKFSNKTHLRNGASSA 60
           +FS+E +P  L F     KP +   SH  SLV+           RK  N   +R    SA
Sbjct: 4   LFSTELIPHGLPFHPQQLKPVSNSSSHHTSLVSCLSHEGTKDSIRKSRNNQKVR---VSA 63

Query: 61  ESREPHFSNLHNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFF 120
           E+R  H S+   +++HLMKLLNRSC++GK++E+ YFLE +V KG+KPDVVLCTK+IKGFF
Sbjct: 64  ETRPTHLSSFDFKESHLMKLLNRSCKSGKYHEAFYFLECMVGKGYKPDVVLCTKMIKGFF 123

Query: 121 NSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           N RN++KA+RVME+LETYG+PDV++YNA+ISGF K N+++ ANKV DRMRSRGF
Sbjct: 124 NGRNVEKAIRVMEMLETYGEPDVFAYNALISGFCKMNRLDFANKVLDRMRSRGF 174

BLAST of Bhi02G001054 vs. TrEMBL
Match: tr|A0A2P5YVS8|A0A2P5YVS8_GOSBA (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_AA00837 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 3.7e-41
Identity = 95/174 (54.60%), Positives = 125/174 (71.84%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKP-TIPRSHSDSLVT-----------RKFSNKTHLRNGASSA 60
           +FS+E +P  L F     KP +   SH  SLV+           RK  N   +R    SA
Sbjct: 4   LFSTELIPHGLPFHPQQLKPVSNSSSHHTSLVSCLSHEGTKDSIRKSRNNQKVR---VSA 63

Query: 61  ESREPHFSNLHNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFF 120
           E+R  H S+   +++HLMKLLNRSC++GK++E+ YFLE +V KG+KPDVVLCTK+IKGFF
Sbjct: 64  ETRPTHLSSFDFKESHLMKLLNRSCKSGKYHEAFYFLECMVGKGYKPDVVLCTKMIKGFF 123

Query: 121 NSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           N RN++KA+RVME+LETYG+PDV++YNA+ISGF K N+++ ANKV DRMRSRGF
Sbjct: 124 NGRNVEKAIRVMEMLETYGEPDVFAYNALISGFCKMNRLDFANKVLDRMRSRGF 174

BLAST of Bhi02G001054 vs. NCBI nr
Match: XP_008443759.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis melo])

HSP 1 Score: 300.4 bits (768), Expect = 1.3e-77
Identity = 149/162 (91.98%), Positives = 157/162 (96.91%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60
           MFSSEFLPQSLHFTNPLSKPTIP+SHSDS+ TR+FSNKT+LRN  SSAESR+PHF NL N
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMRSRGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 162

BLAST of Bhi02G001054 vs. NCBI nr
Match: XP_004142590.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis sativus] >KGN66736.1 hypothetical protein Csa_1G666460 [Cucumis sativus])

HSP 1 Score: 292.7 bits (748), Expect = 2.7e-75
Identity = 146/162 (90.12%), Positives = 153/162 (94.44%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60
           MFSSEFLPQSLHFTNPL+KPTIP+S SDS+   +FSNKTHLRN  SSAE R+PHF NL N
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMRSRGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGF 162

BLAST of Bhi02G001054 vs. NCBI nr
Match: XP_022960811.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita moschata])

HSP 1 Score: 281.6 bits (719), Expect = 6.2e-72
Identity = 144/162 (88.89%), Positives = 148/162 (91.36%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60
           MFSSE L QSLHF NPLS PTIP+SHS S  TR+F NKTHLRNGASSAE+REPH   L N
Sbjct: 1   MFSSELLSQSLHFINPLSNPTIPQSHSSSF-TRRFPNKTHLRNGASSAETREPHDPILDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           R+ HLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RETHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMR RGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRRRGF 161

BLAST of Bhi02G001054 vs. NCBI nr
Match: XP_023515375.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 280.0 bits (715), Expect = 1.8e-71
Identity = 143/162 (88.27%), Positives = 148/162 (91.36%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60
           MFSSE L QSLHF NPLS PTIP+SHS S  TR+F NKTHLRNGASSAE+REPH   L N
Sbjct: 1   MFSSELLSQSLHFINPLSNPTIPQSHSSSF-TRRFPNKTHLRNGASSAETREPHDPILDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           R+ HLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RETHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQIESAN+VFDRMR RGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANEVFDRMRRRGF 161

BLAST of Bhi02G001054 vs. NCBI nr
Match: XP_022988060.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita maxima])

HSP 1 Score: 279.3 bits (713), Expect = 3.1e-71
Identity = 143/162 (88.27%), Positives = 147/162 (90.74%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60
           MFSSE L QSLHF NPLS PTIP+SHS S  TR+F NKTHLRNGASSAE+REPH   L N
Sbjct: 1   MFSSELLSQSLHFINPLSNPTIPQSHSSSF-TRRFPNKTHLRNGASSAETREPHDPILDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           R+ HLMKLLNRSCRAGKHNESLYFLESVVSKG KPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RETHLMKLLNRSCRAGKHNESLYFLESVVSKGLKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGF 163
           EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMR RGF
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRRRGF 161

The following BLAST results are available for this feature:
Swiss-Prot
Match Name             E-value  Identity (%)  Description
sp|Q9SR00|PP213_ARATH  3.5e-18  57.89         Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...
sp|O80647|PP195_ARATH  4.4e-05  34.88         Pentatricopeptide repeat-containing protein At2g39620 OS=Arabidopsis thaliana OX...
sp|Q9LN22|PPR54_ARATH  2.9e-04  27.47         Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidop...

TAIR10
Match Name   E-value  Identity (%)  Description
AT3G04760.1  1.9e-19  57.89         Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G39620.1  2.5e-06  34.88         Pentatricopeptide repeat (PPR) superfamily protein
AT1G20300.1  1.6e-05  27.47         Pentatricopeptide repeat (PPR) superfamily protein
AT3G09650.1  7.9e-05  31.40         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G01740.1  1.4e-04  28.87         Tetratricopeptide repeat (TPR)-like superfamily protein

TrEMBL
Match Name                      E-value  Identity (%)  Description
tr|A0A1S3B9K3|A0A1S3B9K3_CUCME  8.5e-78  91.98         pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucumis ...
tr|A0A0A0M3C6|A0A0A0M3C6_CUCSA  1.8e-75  90.12         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G666460 PE=4 SV=1
tr|A0A1U8LF97|A0A1U8LF97_GOSHI  3.7e-41  54.60         pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like OS=Gos...
tr|A0A0D2TML8|A0A0D2TML8_GOSRA  3.7e-41  54.60         Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_007G275900 PE=4 ...
tr|A0A2P5YVS8|A0A2P5YVS8_GOSBA  3.7e-41  54.60         Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_AA00837 PE=4 SV...

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_008443759.1  1.3e-77  91.98         PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
XP_004142590.1  2.7e-75  90.12         PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
XP_022960811.1  6.2e-72  88.89         pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita ...
XP_023515375.1  1.8e-71  88.27         pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita ...
XP_022988060.1  3.1e-71  88.27         pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucurbita ...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0009507      chloroplast
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi02M001054  Bhi02M001054  mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term: IPR011990, IPR Description: Tetratricopeptide-like helical domain superfamily, Source: GENE3D, Source Term: G3DSA:1.25.40.10
coord: 192..261, e-value: 7.4E-19, score: 69.9
coord: 262..333, e-value: 5.8E-19, score: 70.3
coord: 45..191, e-value: 4.9E-33, score: 117.0
coord: 334..433, e-value: 1.1E-27, score: 99.4
coord: 434..555, e-value: 1.0E-26, score: 95.4

IPR Term: IPR002885, IPR Description: Pentatricopeptide repeat, Source: TIGRFAM, Source Term: TIGR00756, Source Description: TIGR00756
coord: 307..341, e-value: 9.8E-9, score: 32.9
coord: 132..166, e-value: 4.6E-11, score: 40.2
coord: 272..306, e-value: 9.5E-5, score: 20.3
coord: 342..376, e-value: 2.0E-9, score: 35.0
coord: 202..235, e-value: 1.2E-6, score: 26.3
coord: 482..515, e-value: 3.2E-5, score: 21.8
coord: 379..411, e-value: 2.3E-5, score: 22.2
coord: 413..446, e-value: 4.1E-4, score: 18.3
coord: 237..271, e-value: 4.8E-7, score: 27.5
coord: 447..480, e-value: 1.2E-8, score: 32.6
coord: 167..201, e-value: 3.3E-10, score: 37.5

IPR Term: IPR002885, IPR Description: Pentatricopeptide repeat, Source: PFAM, Source Term: PF13812, Source Description: PPR_3
coord: 85..122, e-value: 5.0E-6, score: 26.4

IPR Term: IPR002885, IPR Description: Pentatricopeptide repeat, Source: PFAM, Source Term: PF13041, Source Description: PPR_2
coord: 129..178, e-value: 2.5E-19, score: 69.1
coord: 374..421, e-value: 3.2E-11, score: 43.1
coord: 444..493, e-value: 1.3E-13, score: 50.8
coord: 234..281, e-value: 7.7E-11, score: 41.9

IPR Term: IPR002885, IPR Description: Pentatricopeptide repeat, Source: PFAM, Source Term: PF12854, Source Description: PPR_1
coord: 195..226, e-value: 8.8E-8, score: 31.6
coord: 335..368, e-value: 3.6E-9, score: 36.1
coord: 301..333, e-value: 2.1E-13, score: 49.6

IPR Term: IPR002885, IPR Description: Pentatricopeptide repeat, Source: PROSITE, Source Term: PS51375, Source Description: PPR
coord: 61..95, score: 7.278
coord: 375..409, score: 11.345
coord: 410..444, score: 11.268
coord: 165..199, score: 13.439
coord: 445..479, score: 12.781
coord: 200..234, score: 11.641
coord: 480..514, score: 10.128
coord: 235..269, score: 11.707
coord: 130..164, score: 13.877
coord: 305..339, score: 12.792
coord: 96..126, score: 6.697
coord: 515..549, score: 6.456
coord: 340..374, score: 12.715
coord: 270..304, score: 10.808

IPR Term: None, IPR Description: No IPR available, Source: PANTHER, Source Term: PTHR24015:SF572, Source Description: SUBFAMILY NOT NAMED
coord: 9..542

IPR Term: None, IPR Description: No IPR available, Source: PANTHER, Source Term: PTHR24015, Source Description: FAMILY NOT NAMED
coord: 9..542

IPR Term: None, IPR Description: No IPR available, Source: SUPERFAMILY, Source Term: SSF81901, Source Description: HCP-like
coord: 104..256
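
The PROSITE PS51375 (PPR) matches listed above are reported out of positional order. One way to see how they tile the protein is to sort them and merge adjacent intervals. This is a minimal sketch under stated assumptions: `merge_intervals` is a hypothetical helper, and the coordinates are taken to be 1-based, inclusive residue positions.

```python
# PROSITE PS51375 (PPR) match coordinates, in the order listed in the annotation.
ppr_hits = [
    (61, 95), (375, 409), (410, 444), (165, 199), (445, 479),
    (200, 234), (480, 514), (235, 269), (130, 164), (305, 339),
    (96, 126), (515, 549), (340, 374), (270, 304),
]

def merge_intervals(intervals, max_gap=0):
    """Merge inclusive intervals that overlap or lie within max_gap residues."""
    merged = []
    for start, end in sorted(intervals):
        # Gap between this interval and the previous merged block, in residues.
        if merged and start - merged[-1][1] - 1 <= max_gap:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]

blocks = merge_intervals(ppr_hits)
covered = sum(end - start + 1 for start, end in blocks)
print(len(ppr_hits), "PPR matches in", len(blocks), "contiguous blocks:", blocks)
print("residues covered:", covered)
```

With these coordinates, the 14 matches collapse into two contiguous blocks, 61..126 and 130..549, covering 486 residues.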