Bhi08G000123 (gene) Wax gourd

Name: Bhi08G000123
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: Pentatricopeptide repeat-containing protein
Location: chr8:5432029..5435874 (+)
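
The location field can be turned into a gene span length. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its four parts."""
    m = re.search(r"(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr8 : 5432029 .. 5435874 (+)")
print(seqid, strand, span_length(start, end))  # chr8 + 3846
```

Under that assumption the gene model spans 3846 bp on the forward strand of chr8.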
The following sequences are available for this feature:

Gene sequence (with intron)

ATCTTCATCGACTTAGGGCTGGAAGTCAATACGTTCTCCTCTGTGAGAGAAAAATAACACGCAGAGGGTTTTGGGGTATTTGAGCCTTCTTGTTCATACCAGCTGTATGGTTCGAACCACTTTCCCATTTACTTGCATTTTCGATTTCAATTCTGGCGGGTCATGTCAGTTTACGAACTTCCTGTCCACTAGGAATCTTCTGCATTGCTCCTACGCTGTAAGTTGAACCTCTAAATTTCTTTTCTTTTTGTTTGTTTGTGTAATATTGCACCTTGCCATTTGATTTTTATGCTATGAACTGTGTTTTTATCGATTAAATTCGACCCAGAAATTCATTGATTTCTCTGGGCGTGTTTGGGAGCCATTTTAAATGCTTAAAATCACTCTAAAAATGCTTTTAATCATTCAAATTCAATTTTAATAGTATGAAAAATGTATTAAAATTAATCATTCAATCCAAACAGGCCCTCAATTGATTTTTTGAGTGACGAAGGGTGTGTTTCTGAGTAATTTTAAAGATGATAAAAGTGATTTTAATTGTTCCAAAATCATTCCCAAACATGCACTCCGTTTTGAAGTTGATGGAGGTATAAACATGTTCAAATTAAAGTTTATGTCAATAAAATTTTTGCTTCTCGCCTGTTGATGATTCAATCAACACTGCTGAGGATTATCCGTAGATGTTGTTGATAGATGATAGGTGGAATCCAGATCATTTCTCAATGTCTTTTCTAGGTCTCCGTTGTGAAGGCTTTGCTATTGATCATGTTTTTATAGAGCATATAACCAATGTTTGTTCACCAACTTAATTTACTAAATTTTTGTTCCTTCACGTCAACTATTGGAAACCCAGATTTTGGAAGCTCAATAGTTGTTTAATAACTCATCAATGCCATTTATCTTTCTTAATTCAACTTAGGTTGAGAACTACATCTTATCCACCTTCATATCTAACCTGCATATATCTTTTAGATGATTACACATGACTTATGCCTTGCGAATTAAATATTTAATTGTCTCTACATGTCTAATACAGAATAGCATTGCATCTGTCCCAGTTGGTAACTCTCAATTTTGGCCGCTTTATGCCATCAGACTCCTTAGCCATCAGTCATCTAGTACAAATATCTGTCCTGATGAAGTGAAAGTGGGGGATGAAGTCTTGAATCAGATTATTGCTCCAAGGGAAAATGCCTCAAGGTGTAGCCATGAGACCTTTGATGCTTGCATTGATAAGATGTGTCGAATTGGACATCTTGCAGCTGCTGCTCAATTACTTAAATCATTGTGCGATGGGAAAATACCTCTTAGCTCCTCCAAGGCATATGATATGGTTTTGCTTGCAGCAAGTGAAAGCGGAGACACCACCCTTTTATTTCAAGTTTTTAAAGATTCCCTGGTTTCCTGTAAATCATTGAGTTCGACCTCTTACAAGAGTTTTGCCAACGCCTTTACCAGGACAAATGATAGTAACAAGCTACTGGAATATGTCAAAGAAATAATTGAGATGACCTTTCCAAACTGCATAGTTATAAACAGAATTATCTTTGCCTTCTCCAAATGTAGGGAGATTGATAAAGCCCTTCAGATATTTAATCAGATGAAGCTTCTGTCATGCAGACCAGATTTGTATACGTACAACATCATTTTGGATATGCTAGGTCGTGCAGGTCGCGTGGATGAAATTCTTCATTTATTTGTTTCCATGAAAGAAGATGGCATTGCCCCAGATATCGTGTCCTATAATACATTGATAAATAGTTTTAGGAAGGTGGGTAGACTAGATATGTGCTTGGTGTACTTCAAGGAAATGGTTGCAGTGAGAATTGAACCCGATTTGCTTACTTATACAGCTTTGATAGAGAGTTTTGGTCGATCTGGAAACATCGAGGAAGCTTGGACACTCCTCAGGGAGATGAAGCTTAAGAATATCTGTCCTTCAAGCTATATCTACAAGTCCCTTATCGGAAATTCAATGAAGATGGGGAAGGTGGAATTGG
CTATGAACCTTCTCAAGGAAATGAAATTAAGTGATTCAAAACTTGCTGGTCCAAAGGATTTCAAACGAAGAAAAAGTTAACCAATTACAAGGTTTCTTAGTGACTTTGAGGCTGATAAACCTGTGAATGGCATGGCTTTAGTCTTCAAACTGAGATCTGCCAGAGTGACTCCATCTACAGGTAATATTTGTACTCATTTATCTACCTCTCTTACATCTAACATGAAATACAAAAGTTATAATTATCTTCAAACGGAGATTCATTCTAGGACTAATATATGAAAACCTAAGATGAATATTTAGTTTGTTTGATTGTCTGTCATGTGGTTTAAGTGTGTTAACTGTCATTTCAAGTAAAAGGATAGGCAGAAAAATTTCATGTTTCCACAATGTTTATATCTCTCTTTATGACAGATTTACAGAGAATTATAGTGATTCAGGTGAATTTTAGTTTACAAAAATTAATGATAATAATATTTAGATTGAAGAGCCAGAGAAAAATCTGTAACTACTTGTATAGGATACCTTGGGAATCATGAGTGATTTTTTTGTTTGTTTTGTATCATAGGAGACTTGAACCAAGGTTTTGTTGGGGTTGGGTGAATCATCATGTCAAACCAGTCAGTTTGGCTTATATTAGTCTATTAGATTCTCCGAGTGACTATTATATAAAACATGGGTGGCCTCTTTACCCCAGATTTAGATTTTCCATGCTTGTGTATATATGGTTAATGGTGGAATATACCATTTAAAGAAACTACTATGTTGAAATGTTACTTTTTTTAACAATATAATTCCATTACCAGAATTTTACTTCGTTTTGTTTTGCTGCCAAAAAAATTTGTATGGTACAATCATTAGTTGAATCTTACTATTTTTAGTGAGGGACGAAAAAAATATGACAATGATCATTTGTTTTACTCTAAATGCATATGGTAGTCTGTACCAGAGAGATAAAGTTTGGAATTTTAAATTTATAACTATAAATGTCATAGCTTTGCTTGAGTCACTCAACAATGGATACTGTGATGCTAAACAGCATATTGGTACTAGTACTAACAGTAAAATCTGGTCGATCTTTGGCCACTGTCCAGTTTGGTCAGATCGGAAGGTCCCCGAACTGTGATGCCGAAGAGAAGATAGCTCAAACAGAGGACACTCTCTTGGAAGTTATGTACTTAAAATGTTGGTAATATATGATGATCGCTAATTGGGAGCAGATATAGGGCTATTCGGACGAGTTGAAATTTAAGGACCGACGTGATGAAGATGAGGAATTGATCGATTATTTGGAAGAATCGCTAGGTAGCTTTGAGAGAGCTGTAAAGAAAATGGGAATGCAACGGTTTCCTTGAACAATGGATAATCTTGATTCTGTTTCCTTTGCAATAGAAAATAGCAAAGGAGAGCCAACTCATCAGATTCAATTTCTGTTTTCTTGAAGCCAAACCTTTCTTGAAGGAAATTCCATGTGAATCTTTTGATGGTTTCTACTTCAGTTCTGCCATTTGAATTGATTTCCAACTTTTACATCCAAGAATTTCCTCTGCAGTCTTTCTTTTCACTTATATATTCTTTGGCTACAATTGGAATATAGAATTTTCTTTCGACTGTTAAAAGGCATATTTGTAACCGTGGGAACGGTCGCTTTCAAGCTGGAAATGTCGACATTGATAGATATTTTTTTGTAAATCAGTTTGCATTCTAAACATGCATAACACAATCAGTTGGAAACTGTGCTTCCAATATTTCCAAAATGTAAGCCTTGAAAATTATTTACAAACATCAAATATAAATCCATTGAGAATTGTATACACAAAATATTAATAGATTTCAATAGAAAATAT

mRNA sequence

ATCTTCATCGACTTAGGGCTGGAAGTCAATACGTTCTCCTCTGTGAGAGAAAAATAACACGCAGAGGGTTTTGGGGTATTTGAGCCTTCTTGTTCATACCAGCTGTATGGTTCGAACCACTTTCCCATTTACTTGCATTTTCGATTTCAATTCTGGCGGGTCATGTCAGTTTACGAACTTCCTGTCCACTAGGAATCTTCTGCATTGCTCCTACGCTAATAGCATTGCATCTGTCCCAGTTGGTAACTCTCAATTTTGGCCGCTTTATGCCATCAGACTCCTTAGCCATCAGTCATCTAGTACAAATATCTGTCCTGATGAAGTGAAAGTGGGGGATGAAGTCTTGAATCAGATTATTGCTCCAAGGGAAAATGCCTCAAGGTGTAGCCATGAGACCTTTGATGCTTGCATTGATAAGATGTGTCGAATTGGACATCTTGCAGCTGCTGCTCAATTACTTAAATCATTGTGCGATGGGAAAATACCTCTTAGCTCCTCCAAGGCATATGATATGGTTTTGCTTGCAGCAAGTGAAAGCGGAGACACCACCCTTTTATTTCAAGTTTTTAAAGATTCCCTGGTTTCCTGTAAATCATTGAGTTCGACCTCTTACAAGAGTTTTGCCAACGCCTTTACCAGGACAAATGATAGTAACAAGCTACTGGAATATGTCAAAGAAATAATTGAGATGACCTTTCCAAACTGCATAGTTATAAACAGAATTATCTTTGCCTTCTCCAAATGTAGGGAGATTGATAAAGCCCTTCAGATATTTAATCAGATGAAGCTTCTGTCATGCAGACCAGATTTGTATACGTACAACATCATTTTGGATATGCTAGGTCGTGCAGGTCGCGTGGATGAAATTCTTCATTTATTTGTTTCCATGAAAGAAGATGGCATTGCCCCAGATATCGTGTCCTATAATACATTGATAAATAGTTTTAGGAAGGTGGGTAGACTAGATATGTGCTTGGTGTACTTCAAGGAAATGGTTGCAGTGAGAATTGAACCCGATTTGCTTACTTATACAGCTTTGATAGAGAGTTTTGGTCGATCTGGAAACATCGAGGAAGCTTGGACACTCCTCAGGGAGATGAAGCTTAAGAATATCTGTCCTTCAAGCTATATCTACAAGTCCCTTATCGGAAATTCAATGAAGATGGGGAAGGTGGAATTGGCTATGAACCTTCTCAAGGAAATGAAATTAAGTGATTCAAAACTTGCTGGTCCAAAGGATTTCAAACGAAGAAAAAGTTAACCAATTACAAGGTTTCTTAGTGACTTTGAGGCTGATAAACCTGTGAATGGCATGGCTTTAGTCTTCAAACTGAGATCTGCCAGAGTGACTCCATCTACAGTTTGGTCAGATCGGAAGGTCCCCGAACTGTGATGCCGAAGAGAAGATAGCTCAAACAGAGGACACTCTCTTGGAAGTTATGTACTTAAAATGTTGGTAATATATGATGATCGCTAATTGGGAGCAGATATAGGGCTATTCGGACGAGTTGAAATTTAAGGACCGACGTGATGAAGATGAGGAATTGATCGATTATTTGGAAGAATCGCTAGGTAGCTTTGAGAGAGCTGTAAAGAAAATGGGAATGCAACGGTTTCCTTGAACAATGGATAATCTTGATTCTGTTTCCTTTGCAATAGAAAATAGCAAAGGAGAGCCAACTCATCAGATTCAATTTCTGTTTTCTTGAAGCCAAACCTTTCTTGAAGGAAATTCCATGTGAATCTTTTGATGGTTTCTACTTCAGTTCTGCCATTTGAATTGATTTCCAACTTTTACATCCAAGAATTTCCTCTGCAGTCTTTCTTTTCACTTATATATTCTTTGGCTACAATTGGAATATAGAATTTTCTTTCGACTGTTAAAAGGCATATTTGTAACCGTGGGAACGGTCGCTTTCAAGCTGGAAATGTCGACATTGATAGATATTTTTTTGTAAATCAGTTTGCATTCTAAACATGCATAACACAATCAGTTGGAA
ACTGTGCTTCCAATATTTCCAAAATGTAAGCCTTGAAAATTATTTACAAACATCAAATATAAATCCATTGAGAATTGTATACACAAAATATTAATAGATTTCAATAGAAAATAT

Coding sequence (CDS)

ATGGTTCGAACCACTTTCCCATTTACTTGCATTTTCGATTTCAATTCTGGCGGGTCATGTCAGTTTACGAACTTCCTGTCCACTAGGAATCTTCTGCATTGCTCCTACGCTAATAGCATTGCATCTGTCCCAGTTGGTAACTCTCAATTTTGGCCGCTTTATGCCATCAGACTCCTTAGCCATCAGTCATCTAGTACAAATATCTGTCCTGATGAAGTGAAAGTGGGGGATGAAGTCTTGAATCAGATTATTGCTCCAAGGGAAAATGCCTCAAGGTGTAGCCATGAGACCTTTGATGCTTGCATTGATAAGATGTGTCGAATTGGACATCTTGCAGCTGCTGCTCAATTACTTAAATCATTGTGCGATGGGAAAATACCTCTTAGCTCCTCCAAGGCATATGATATGGTTTTGCTTGCAGCAAGTGAAAGCGGAGACACCACCCTTTTATTTCAAGTTTTTAAAGATTCCCTGGTTTCCTGTAAATCATTGAGTTCGACCTCTTACAAGAGTTTTGCCAACGCCTTTACCAGGACAAATGATAGTAACAAGCTACTGGAATATGTCAAAGAAATAATTGAGATGACCTTTCCAAACTGCATAGTTATAAACAGAATTATCTTTGCCTTCTCCAAATGTAGGGAGATTGATAAAGCCCTTCAGATATTTAATCAGATGAAGCTTCTGTCATGCAGACCAGATTTGTATACGTACAACATCATTTTGGATATGCTAGGTCGTGCAGGTCGCGTGGATGAAATTCTTCATTTATTTGTTTCCATGAAAGAAGATGGCATTGCCCCAGATATCGTGTCCTATAATACATTGATAAATAGTTTTAGGAAGGTGGGTAGACTAGATATGTGCTTGGTGTACTTCAAGGAAATGGTTGCAGTGAGAATTGAACCCGATTTGCTTACTTATACAGCTTTGATAGAGAGTTTTGGTCGATCTGGAAACATCGAGGAAGCTTGGACACTCCTCAGGGAGATGAAGCTTAAGAATATCTGTCCTTCAAGCTATATCTACAAGTCCCTTATCGGAAATTCAATGAAGATGGGGAAGGTGGAATTGGCTATGAACCTTCTCAAGGAAATGAAATTAAGTGATTCAAAACTTGCTGGTCCAAAGGATTTCAAACGAAGAAAAAGTTAA

Protein sequence

MVRTTFPFTCIFDFNSGGSCQFTNFLSTRNLLHCSYANSIASVPVGNSQFWPLYAIRLLSHQSSSTNICPDEVKVGDEVLNQIIAPRENASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKAYDMVLLAASESGDTTLLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEIIEMTFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCRPDLYTYNIILDMLGRAGRVDEILHLFVSMKEDGIAPDIVSYNTLINSFRKVGRLDMCLVYFKEMVAVRIEPDLLTYTALIESFGRSGNIEEAWTLLREMKLKNICPSSYIYKSLIGNSMKMGKVELAMNLLKEMKLSDSKLAGPKDFKRRKS
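
The protein sequence should follow from the CDS by codon-wise translation. The sketch below is a minimal illustration that assumes the standard nuclear genetic code; the page does not say which translation table the annotation pipeline used, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and last 18 nt of the CDS listed above.
print(translate("ATGGTTCGAACCACTTTCCCA"))  # MVRTTFP
print(translate("AAACGAAGAAAAAGTTAA"))     # KRRKS
```

Both fragments agree with the start (MVRTTFP) and end (KRRKS) of the protein sequence shown above.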
BLAST of Bhi08G000123 vs. TrEMBL
Match: tr|A0A1S3CIJ8|A0A1S3CIJ8_CUCME (pentatricopeptide repeat-containing protein At1g11900 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103500823 PE=4 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 3.3e-86
Identity = 167/205 (81.46%), Positives = 179/205 (87.32%), Query Frame = 0

Query: 29  RNLLHCSYANSIASVPVGNSQFWPLYAIRLLSHQSSSTNICPDEVKVGDEVLNQIIAPRE 88
           RNLLH SYAN IAS+PVGNSQ WPLYAI+  SHQSSSTNI PDEVKVGDEVLNQIIAPRE
Sbjct: 10  RNLLHYSYANRIASIPVGNSQIWPLYAIKCFSHQSSSTNISPDEVKVGDEVLNQIIAPRE 69

Query: 89  NASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKAYDMVLLAASESGDTT 148
           NAS CSHE  DACIDK+C +GHLAAAAQLLKSLC+ KI L+SSKAYDMVLLAASE GDT 
Sbjct: 70  NASNCSHEIVDACIDKICGLGHLAAAAQLLKSLCNEKISLNSSKAYDMVLLAASERGDTP 129

Query: 149 LLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEIIEMTFPNCIVINRIIF 208
           LL QVFK ++VSCKSLSS SY SFA AFT+TNDS+KLLE VKEI+E+T  NC VINRIIF
Sbjct: 130 LLCQVFKVAVVSCKSLSSASYMSFARAFTKTNDSSKLLECVKEIVEVTSQNCSVINRIIF 189

Query: 209 AFSKCREIDKALQIFNQMKLLSCRP 234
           AFSKCREIDKA QIFNQMK LSC P
Sbjct: 190 AFSKCREIDKAFQIFNQMKCLSCTP 214
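
The percentages in an HSP header are the matched counts divided by the alignment length. The sketch below recomputes them from the header text; the regular expression and the helper names `parse_hsp_stats` and `percent` are hypothetical and only cover the header layout used on this page.

```python
import re

# Matches e.g. "Identity = 167/205 (81.46%)" and the analogous Positives field.
HSP_STATS = re.compile(r"(Identity|Positives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matched, alignment_length, reported_percent)}."""
    return {
        label: (int(matched), int(length), float(reported))
        for label, matched, length, reported in HSP_STATS.findall(line)
    }


def percent(matched, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * matched / length, 2)


header = "Identity = 167/205 (81.46%), Positives = 179/205 (87.32%), Query Frame = 0"
for label, (matched, length, reported) in parse_hsp_stats(header).items():
    print(label, percent(matched, length), reported)
```

For this hit, 167/205 gives 81.46% identity and 179/205 gives 87.32% positives, matching the reported values.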

BLAST of Bhi08G000123 vs. TrEMBL
Match: tr|A0A1S3CH34|A0A1S3CH34_CUCME (pentatricopeptide repeat-containing protein At1g11900 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103500823 PE=4 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 3.3e-86
Identity = 167/205 (81.46%), Positives = 179/205 (87.32%), Query Frame = 0

Query: 29  RNLLHCSYANSIASVPVGNSQFWPLYAIRLLSHQSSSTNICPDEVKVGDEVLNQIIAPRE 88
           RNLLH SYAN IAS+PVGNSQ WPLYAI+  SHQSSSTNI PDEVKVGDEVLNQIIAPRE
Sbjct: 10  RNLLHYSYANRIASIPVGNSQIWPLYAIKCFSHQSSSTNISPDEVKVGDEVLNQIIAPRE 69

Query: 89  NASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKAYDMVLLAASESGDTT 148
           NAS CSHE  DACIDK+C +GHLAAAAQLLKSLC+ KI L+SSKAYDMVLLAASE GDT 
Sbjct: 70  NASNCSHEIVDACIDKICGLGHLAAAAQLLKSLCNEKISLNSSKAYDMVLLAASERGDTP 129

Query: 149 LLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEIIEMTFPNCIVINRIIF 208
           LL QVFK ++VSCKSLSS SY SFA AFT+TNDS+KLLE VKEI+E+T  NC VINRIIF
Sbjct: 130 LLCQVFKVAVVSCKSLSSASYMSFARAFTKTNDSSKLLECVKEIVEVTSQNCSVINRIIF 189

Query: 209 AFSKCREIDKALQIFNQMKLLSCRP 234
           AFSKCREIDKA QIFNQMK LSC P
Sbjct: 190 AFSKCREIDKAFQIFNQMKCLSCTP 214

BLAST of Bhi08G000123 vs. TrEMBL
Match: tr|A0A0A0LXZ6|A0A0A0LXZ6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G132170 PE=4 SV=1)

HSP 1 Score: 295.0 bits (754), Expect = 2.4e-76
Identity = 156/205 (76.10%), Positives = 170/205 (82.93%), Query Frame = 0

Query: 29  RNLLHCSYANSIASVPVGNSQFWPLYAIRLLSHQSSSTNICPDEVKVGDEVLNQIIAPRE 88
           RN LH SYAN IAS PVGNSQ WPLYAI+ LSHQSS T I P+EVKVGDE LNQIIAP E
Sbjct: 10  RNFLHYSYANRIASAPVGNSQIWPLYAIKCLSHQSSGTKIFPNEVKVGDEDLNQIIAPTE 69

Query: 89  NASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKAYDMVLLAASESGDTT 148
           NAS+C HE  DACIDK+CR+GHLAAAA LLKSLC+ K+   SS+AYDMVLLAASE GDT 
Sbjct: 70  NASKCIHEIIDACIDKICRLGHLAAAAHLLKSLCNEKV-FKSSEAYDMVLLAASERGDTP 129

Query: 149 LLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEIIEMTFPNCIVINRIIF 208
           LL +VFK +L+SCKSLSS SY SFA AFT+TNDS KLLE VKEIIE+T   CIVINRIIF
Sbjct: 130 LLCEVFKVALLSCKSLSSASYMSFARAFTKTNDS-KLLECVKEIIEITSQKCIVINRIIF 189

Query: 209 AFSKCREIDKALQIFNQMKLLSCRP 234
           AFS+ REIDKA QIFNQMK LSC P
Sbjct: 190 AFSERREIDKAFQIFNQMKCLSCTP 212

BLAST of Bhi08G000123 vs. TrEMBL
Match: tr|A0A2P4JMC8|A0A2P4JMC8_QUESU (Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_76968 PE=4 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 8.5e-34
Identity = 101/217 (46.54%), Positives = 134/217 (61.75%), Query Frame = 0

Query: 19  SCQFTNFLSTRNLLHCSYANSIASVPVGNSQFWPLYAIRLLSH-----QSSSTNICPDEV 78
           S     F   ++ LH  Y N IAS+ VG    +PL    ++ +     Q  +T   P++ 
Sbjct: 21  SLPIPQFSIIQSRLHI-YYNCIASITVG----FPLPFFTIIRNFIGKCQFPATQASPNKE 80

Query: 79  KVGDEVLNQIIAPRENASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKA 138
           +V DEVLNQI+   ENA R + +   A  DK CR G+L+AAA+LL+SL    I L S KA
Sbjct: 81  EVTDEVLNQILTSVENAPRPNTKICTAYADKFCRAGNLSAAARLLQSLHHKHIFL-SPKA 140

Query: 139 YDMVLLAASESGDTTLLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEII 198
           Y ++L AASE  D  LL QVFKD L+S   LSST Y   A AFT+TND  +LL +VK++ 
Sbjct: 141 YKILLRAASERHDIDLLSQVFKDLLISNGVLSSTCYADVAKAFTKTNDGIQLLRFVKKVS 200

Query: 199 EMTFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLS 231
            +T P+  V+NRIIFAF+KC +IDKAL I++Q K LS
Sbjct: 201 ALTSPSATVVNRIIFAFAKCGQIDKALLIYDQFKSLS 231

BLAST of Bhi08G000123 vs. TrEMBL
Match: tr|E0CU43|E0CU43_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_12s0028g00220 PE=4 SV=1)

HSP 1 Score: 151.4 bits (381), Expect = 4.2e-33
Identity = 102/242 (42.15%), Positives = 151/242 (62.40%), Query Frame = 0

Query: 1   MVRTTFPFTCIFDFNSGGSCQFTNFLSTRNLLHCS----YANSIASVPVGNSQFWPLYAI 60
           M+  + PFT IF      S  F +F S    +  +    Y+N I+++P   S   P +AI
Sbjct: 1   MIALSRPFTKIFP-----SIFFRSFHSNLGPIAATRRHRYSNFISAIPDDVSWILPFFAI 60

Query: 61  ---RLLSHQSSSTNICPDEVKVGDEVLNQIIAPRENASRCSHETF-DACIDKMCRIGHLA 120
               + S+QS +T   PDE  V DE LN+I++  E + + S E      IDK+ + G+ +
Sbjct: 61  LSKSIGSYQSLATEASPDEEVVPDEFLNEILSDIERSPKFSSEKLCTTYIDKLLKAGNPS 120

Query: 121 AAAQLLKSLCDGKIPLSSSKAYDMVLLAASESGDTTLLFQVFKDSLVSCKSLSSTSYKSF 180
           AAA+ ++SL D  I LS + AY+++L+AASE+     L Q+FKD LVS K LSSTSY + 
Sbjct: 121 AAARFMQSLHDKHIFLSPN-AYNLLLVAASEANAIDFLSQIFKDLLVSNKPLSSTSYFNV 180

Query: 181 ANAFTRTNDSNKLLEYVKEIIEMTFP-NCIVINRIIFAFSKCREIDKALQIFNQMKLLSC 234
           A  FT+T+DS  LL++V+E+ E+TFP N  ++NRII AF++CR+I+K+L IF+ MK L C
Sbjct: 181 AKVFTKTDDS-VLLKFVREVSELTFPRNATILNRIIHAFAECRQIEKSLIIFDHMKSLKC 235

BLAST of Bhi08G000123 vs. NCBI nr
Match: XP_008462480.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g11900 isoform X1 [Cucumis melo])

HSP 1 Score: 327.8 bits (839), Expect = 5.0e-86
Identity = 167/205 (81.46%), Positives = 179/205 (87.32%), Query Frame = 0

Query: 29  RNLLHCSYANSIASVPVGNSQFWPLYAIRLLSHQSSSTNICPDEVKVGDEVLNQIIAPRE 88
           RNLLH SYAN IAS+PVGNSQ WPLYAI+  SHQSSSTNI PDEVKVGDEVLNQIIAPRE
Sbjct: 10  RNLLHYSYANRIASIPVGNSQIWPLYAIKCFSHQSSSTNISPDEVKVGDEVLNQIIAPRE 69

Query: 89  NASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKAYDMVLLAASESGDTT 148
           NAS CSHE  DACIDK+C +GHLAAAAQLLKSLC+ KI L+SSKAYDMVLLAASE GDT 
Sbjct: 70  NASNCSHEIVDACIDKICGLGHLAAAAQLLKSLCNEKISLNSSKAYDMVLLAASERGDTP 129

Query: 149 LLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEIIEMTFPNCIVINRIIF 208
           LL QVFK ++VSCKSLSS SY SFA AFT+TNDS+KLLE VKEI+E+T  NC VINRIIF
Sbjct: 130 LLCQVFKVAVVSCKSLSSASYMSFARAFTKTNDSSKLLECVKEIVEVTSQNCSVINRIIF 189

Query: 209 AFSKCREIDKALQIFNQMKLLSCRP 234
           AFSKCREIDKA QIFNQMK LSC P
Sbjct: 190 AFSKCREIDKAFQIFNQMKCLSCTP 214

BLAST of Bhi08G000123 vs. NCBI nr
Match: XP_008462492.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g11900 isoform X2 [Cucumis melo])

HSP 1 Score: 327.8 bits (839), Expect = 5.0e-86
Identity = 167/205 (81.46%), Positives = 179/205 (87.32%), Query Frame = 0

Query: 29  RNLLHCSYANSIASVPVGNSQFWPLYAIRLLSHQSSSTNICPDEVKVGDEVLNQIIAPRE 88
           RNLLH SYAN IAS+PVGNSQ WPLYAI+  SHQSSSTNI PDEVKVGDEVLNQIIAPRE
Sbjct: 10  RNLLHYSYANRIASIPVGNSQIWPLYAIKCFSHQSSSTNISPDEVKVGDEVLNQIIAPRE 69

Query: 89  NASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKAYDMVLLAASESGDTT 148
           NAS CSHE  DACIDK+C +GHLAAAAQLLKSLC+ KI L+SSKAYDMVLLAASE GDT 
Sbjct: 70  NASNCSHEIVDACIDKICGLGHLAAAAQLLKSLCNEKISLNSSKAYDMVLLAASERGDTP 129

Query: 149 LLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEIIEMTFPNCIVINRIIF 208
           LL QVFK ++VSCKSLSS SY SFA AFT+TNDS+KLLE VKEI+E+T  NC VINRIIF
Sbjct: 130 LLCQVFKVAVVSCKSLSSASYMSFARAFTKTNDSSKLLECVKEIVEVTSQNCSVINRIIF 189

Query: 209 AFSKCREIDKALQIFNQMKLLSCRP 234
           AFSKCREIDKA QIFNQMK LSC P
Sbjct: 190 AFSKCREIDKAFQIFNQMKCLSCTP 214

BLAST of Bhi08G000123 vs. NCBI nr
Match: XP_022970322.1 (pentatricopeptide repeat-containing protein At1g11900 [Cucurbita maxima])

HSP 1 Score: 320.1 bits (819), Expect = 1.0e-83
Identity = 168/209 (80.38%), Positives = 178/209 (85.17%), Query Frame = 0

Query: 24  NFLSTRNLLHCSYANSIASVPVGNSQFWPLYAIRLLSHQSSSTNICPDEVKVGDEVLNQI 83
           + LS RN+LH SY NSI S+PVGN Q W LYAIR   HQSS+ NI PDE KV DEVLNQI
Sbjct: 8   SLLSARNVLHYSYTNSITSIPVGNPQNWLLYAIRRFGHQSSTNNISPDEEKVKDEVLNQI 67

Query: 84  IAPRENASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKAYDMVLLAASE 143
            A RENASRCSHETFD CIDKMCR G+L AAAQLLKSLCD KI LSSSKAYDMVLLAASE
Sbjct: 68  TATRENASRCSHETFDVCIDKMCRSGNLTAAAQLLKSLCDRKISLSSSKAYDMVLLAASE 127

Query: 144 SGDTTLLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEIIEMTFPNCIVI 203
            GDTTLL QVFKDSLVS K LSSTSY +FA AF RT+DS+KLLEYVKEIIEMTFPN +VI
Sbjct: 128 RGDTTLLCQVFKDSLVSRKPLSSTSYMNFAKAFARTDDSSKLLEYVKEIIEMTFPNFLVI 187

Query: 204 NRIIFAFSKCREIDKALQIFNQMKLLSCR 233
           NRIIFAFS+CREIDKALQIFNQMKLLS R
Sbjct: 188 NRIIFAFSECREIDKALQIFNQMKLLSYR 216

BLAST of Bhi08G000123 vs. NCBI nr
Match: XP_022147838.1 (pentatricopeptide repeat-containing protein At1g11900-like isoform X1 [Momordica charantia])

HSP 1 Score: 314.3 bits (804), Expect = 5.7e-82
Identity = 166/233 (71.24%), Positives = 186/233 (79.83%), Query Frame = 0

Query: 2   VRTTFPFTCIFDFNSGGSCQFTNFLSTRNLLHCSYANSIASVPVGNSQFWPLYAI--RLL 61
           VRT   F+ I DF+SGGSC+F N  STR +LH  Y N IAS PVG+ Q W  +A   +  
Sbjct: 18  VRTASAFSYISDFSSGGSCRFRNLPSTRKVLHYPYTNCIASFPVGDPQIWLFFANMGKRF 77

Query: 62  SHQSSSTNICPDEVKVGDEVLNQIIAPRENASRCSHETFDACIDKMCRIGHLAAAAQLLK 121
           SHQS  T+  PDE KV DEVLNQI+A R+NASR SHETFDACI KMCR G+LAAAAQLLK
Sbjct: 78  SHQSYPTDTSPDEEKVIDEVLNQIVATRDNASRSSHETFDACIYKMCRSGNLAAAAQLLK 137

Query: 122 SLCDGKIPLSSSKAYDMVLLAASESGDTTLLFQVFKDSLVSCKSLSSTSYKSFANAFTRT 181
           SLCDGKI LS+SKAYDMVLLAASE GDT+L  QVFKD LVSCKSLSS +Y + A AF  T
Sbjct: 138 SLCDGKISLSASKAYDMVLLAASERGDTSLFCQVFKDCLVSCKSLSSATYMNLAKAFIST 197

Query: 182 NDSNKLLEYVKEIIEMTFPNCIVINRIIFAFSKCREIDKALQIFNQMKLLSCR 233
           ND  KLLEYVKE+IEMTFPN IVIN+IIFAFSKCREI+KAL+IFNQMKLLSC+
Sbjct: 198 NDVGKLLEYVKEVIEMTFPNLIVINKIIFAFSKCREIEKALRIFNQMKLLSCK 250

BLAST of Bhi08G000123 vs. NCBI nr
Match: XP_022964991.1 (pentatricopeptide repeat-containing protein At1g11900-like [Cucurbita moschata])

HSP 1 Score: 312.4 bits (799), Expect = 2.2e-81
Identity = 166/207 (80.19%), Positives = 174/207 (84.06%), Query Frame = 0

Query: 26  LSTRNLLHCSYANSIASVPVGNSQFWPLYAIRLLSHQSSSTNICPDEVKVGDEVLNQIIA 85
           LS RN+LH SY NSI SVPVGN Q W LYAIR   HQ S+TNI PDE KV DEVLNQI A
Sbjct: 10  LSARNVLHYSYTNSITSVPVGNPQNWLLYAIRRFGHQPSTTNISPDEEKVEDEVLNQITA 69

Query: 86  PRENASRCSHETFDACIDKMCRIGHLAAAAQLLKSLCDGKIPLSSSKAYDMVLLAASESG 145
            RENAS CSHETFD CIDKMCR  +L AAAQLLKS CD KI LSSSKAYDMVLLAASE G
Sbjct: 70  TRENASMCSHETFDICIDKMCRSDNLTAAAQLLKSSCDRKISLSSSKAYDMVLLAASERG 129

Query: 146 DTTLLFQVFKDSLVSCKSLSSTSYKSFANAFTRTNDSNKLLEYVKEIIEMTFPNCIVINR 205
           DTTLL QVFKDSLVS K LSSTSY +FA AF RT+DS+KLLEYVKEIIEMTFPN +VINR
Sbjct: 130 DTTLLCQVFKDSLVSRKPLSSTSYMNFAKAFARTDDSSKLLEYVKEIIEMTFPNFLVINR 189

Query: 206 IIFAFSKCREIDKALQIFNQMKLLSCR 233
           IIFAFS+CREIDKALQIFNQMKLLS R
Sbjct: 190 IIFAFSECREIDKALQIFNQMKLLSYR 216

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name                      E-value  Identity (%)  Description
tr|A0A1S3CIJ8|A0A1S3CIJ8_CUCME  3.3e-86  81.46         pentatricopeptide repeat-containing protein At1g11900 isoform X1 OS=Cucumis melo...
tr|A0A1S3CH34|A0A1S3CH34_CUCME  3.3e-86  81.46         pentatricopeptide repeat-containing protein At1g11900 isoform X2 OS=Cucumis melo...
tr|A0A0A0LXZ6|A0A0A0LXZ6_CUCSA  2.4e-76  76.10         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G132170 PE=4 SV=1
tr|A0A2P4JMC8|A0A2P4JMC8_QUESU  8.5e-34  46.54         Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_7...
tr|E0CU43|E0CU43_VITVI          4.2e-33  42.15         Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_12s0028g00220 PE=4 SV=...

BLAST vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_008462480.1  5.0e-86  81.46         PREDICTED: pentatricopeptide repeat-containing protein At1g11900 isoform X1 [Cuc...
XP_008462492.1  5.0e-86  81.46         PREDICTED: pentatricopeptide repeat-containing protein At1g11900 isoform X2 [Cuc...
XP_022970322.1  1.0e-83  80.38         pentatricopeptide repeat-containing protein At1g11900 [Cucurbita maxima]
XP_022147838.1  5.7e-82  71.24         pentatricopeptide repeat-containing protein At1g11900-like isoform X1 [Momordica...
XP_022964991.1  2.2e-81  80.19         pentatricopeptide repeat-containing protein At1g11900-like [Cucurbita moschata]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090304 nucleic acid metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi08M000123  Bhi08M000123  mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term   IPR Description                                    Source       Source Term       Source Description   Coord     E-value  Score
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10  -                    53..196   7.1E-6   27.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10  -                    197..377  3.9E-48  166.4
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                303..347  2.2E-9   37.2
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                233..281  4.0E-13  49.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            271..304  8.9E-8   29.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            307..339  3.4E-9   34.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            203..234  1.9E-5   22.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            236..270  1.4E-9   35.5
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                  204..229  0.0066   16.5
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  339..373  -        8.1
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  304..338  -        12.529
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  199..233  -        9.602
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  234..268  -        12.858
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  269..303  -        11.071
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  94..128   -        6.686
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  165..195  -        5.568
None       No IPR available                                   PANTHER      PTHR24015         FAMILY NOT NAMED     94..363   -        -
None       No IPR available                                   PANTHER      PTHR24015:SF488   SUBFAMILY NOT NAMED  94..363   -        -
None       No IPR available                                   SUPERFAMILY  SSF81901          HCP-like             169..365  -        -
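
The PROSITE PS51375 coordinates above can be checked for tandem arrangement by merging hits whose start immediately follows the previous hit's end. The sketch below is an illustration only: the helper names `parse_coord` and `contiguous_runs` are hypothetical, and treating directly adjacent hits as one tandem array is an assumption made for this example, not something the annotation states.

```python
def parse_coord(text):
    """Turn 'start..end' into an (int, int) tuple."""
    start, end = text.split("..")
    return int(start), int(end)


def contiguous_runs(spans):
    """Merge spans where one starts exactly one residue after the previous ends."""
    runs = []
    for start, end in sorted(spans):
        if runs and start == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], end)
        else:
            runs.append((start, end))
    return runs


# PROSITE PS51375 hit coordinates as listed in the table above.
prosite_ppr = ["339..373", "304..338", "199..233", "234..268",
               "269..303", "94..128", "165..195"]
spans = [parse_coord(c) for c in prosite_ppr]
print(contiguous_runs(spans))  # [(94, 128), (165, 195), (199, 373)]
```

On these coordinates, the five hits from residue 199 to 373 are each 35 residues long and directly adjacent, while the hits at 94..128 and 165..195 stand apart.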