Bhi01M000472 (mRNA) Wax gourd

NameBhi01M000472
TypemRNA
OrganismBenincasa hispida (Wax gourd)
DescriptionTetratricopeptide repeat (TPR)-like superfamily protein
Locationchr1 : 11971697 .. 11976595 (-)
Sequence length3877
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTGACAAATTTCTATCCATATTTATCACATAGTACCAATTTCATCGATTCCAATAAACTTATATGGGTCTCTATATTTATCGGTCTCAATTTTAAATAAAAACCACCCTTCTTTCAAAAGTGGGATTGTCCTCTTTAAACAAAACAAAAACTTGTTTCCATTACCCCAAAAGAGAAATTTGTTAAATTTCACTTTCCACTGAAATCCACCATCCCGTTTAATGATTTTTAAATAGAAGTTTTCTTTTTCTTCAAAGCTCCTCGGGATTTGATTTGAGTGTGATTCGGCAGCAGAAAGAAAGATCAATCGCTTCAATGTTGCACTCCACGGCCTCGTTTTCCATATACACTGATGATGAAAATCAAGAACAAGTAATTGGGTTAGAAGCATTTGAAAAGGGTATCATGATTGAAGTGAACAAAGAAGCTCTGGGTAGTATTAGTAATGATTTCAGTTTTTCTAAAGAGACTATGGGGTTGATTCAAGAGGAAGAAATGGAAGATGAGGATGGTCTCAACTCGATTACGAATCGAGGTTTTGACGACGGCGAGGTTAACCTGAGACCCGCCAGTCCTCCGCTCTATCTAGCGGCGGGGCTTGGAATGGATGCCTCTGGTCTTGGTGGTGGGTATGATTCTGTTGATTTCTTTGACGAGAAGATGGTGGACGAGCCGCACTCAGTCGATCATTCGTTGCTTCTGAGGAACTATGTGCAGTCTTTATGGGTAACTTCTCCTACTCCTCTCAGAGGTTTTAATTTCATCTGTATTTCATTGTGATGTACTGCTCAATCTTCATGAATTGGATAAATGTTAACCATCAATTTTTGGTTGATGCATTGAATTTCTGGCCGTTCTGCAAAAATTATGTTCCTGGGTTTGTTTTGAACTCCTTGTTCTTTCAGAATCTCTCTCGAGTTGATTTTCGAATTCTTAAGTTATGAAGTGAGACAGTGAGGGCTGGATCTGTTCTGTCTGTGGGCAGAGAGACAAATTCTTTTGGTTTTGTTTTAGAAGTTGTTCTATCAATTTGTGTATACCTGCGAAGAATCTTCTGAGAGCAATCTTAAAAATTGTAGTCAATACCATGTGTTTTTGATATTGTGATGACAGTCTGAGGGAAAACTCGATGAAGCTGAGGAACAGTGCTATCAAGCTACATTAACATATCCTGAAGATGGTGAAACTTTGATGCTGTATGCTCAGTTGGTTTGGGAAGTCCATCACGATCAAGCTAAAGCCTCAAATTACTTTGAACGTGCAGCTCTTGTTGCTCCAAATAACAGGTTGAATTCATTTGACAATAGAGTAATGAAATAGTTACCTCAACAACCAAATTATTGCAGATGTTCTTCTTCAATATGGATATTGACCTGAATTCAAGAGTGACAACTTTAGTGTTACTATCAGATGAACTGAATGTTTCTTTTTCTTAATCGGCAGCGATATTCTTGCAGCACGAGCTAAGTTCCTGTGGGAACTGAATGATGAGGATCAAACTATGGTGCTGTTACTCTTCCCTGCATAGTCTTTTTGTATTTTCAGAGCTAATATGTATAGGGTCTGATGTCCTGTTTATCTGCAGATATCAGGGGAAGAAGGCAGCAACATGGTTGATAGTTCATCCCCCAAAGAAAGAATTGAACCGGTAGCTGACACTAGTGAGAGTGATATGCAGGAGGATTATGAGAAGATGCTCAAGGAGAATCCTGTCGATCCATTGCTTCTGAAAAACTACGCTCGTTTCTTGCAACAGGTGAGACTATTTTTTGGTATTTAATCACAATGAGATCATTTTACATGACTAATACCTGCCACCAAAAGACTTTGATTGAAGTTTCAGCACTGGTCACCGATAATTAAGAGATAGAAGAAGTTTCCAACTCTTTCTGGGCGAGCATTCAGCTCACTATTTGAAATTTTGATCACTAGGTTGCACTGTGCTGAGATTGTTTGAAAAATGCTCCTTTATCTTAGAAACCATTTGTTTAGCAACTAGCATTGTTTTGAGATGTTGATATTCATGTTTGTTGACGCCGACAGTCCAAAGTAGACCTCCAAGGAGCAGAGAAATACTACTACCGTGGCATCCAAGCAGACCCAAGCGATGGGGAATTGCTATCAGAATATGCAAAATTGGTGTGGGAGCTTCATCATGACTATGATAAAGCATTAAATTACTTCGAGAGAGCAGTTGAAGCCTCTCCTACAAACAGGTACATCAAACTGCCTTCTTTTACTGCAGACCATCGTTAGCTTTGGGTGTCTTTTGCTGGTAATTAATATTTCATTGCCTTCTTTTTCTTTTTATTTCTTTCATTTTCAGTTATGTTCTTGGAGCATATGCAAGTTTCCTTTGGGAGACGGATGAACTCGAGGAGGAAGATGGGGCGAGCAAAAATGACTCTCAATGGCCTTCAAACACAGTGGCAGTTTCTGTAGGAAATGCTTGAATTTTGATAAACATAGCATAGTCAATGATATTATAATGTATTAAAGTGATGCACTTGAAGCAAATAATATTCTAGTTTTAACATACGAAATGAAATGTCATCAACTCCGAATGATAACAGTCTACAGAAAGTTGGTTTTCATTGAATCAGAAATTGGTTCAAAATTGTTGCTTTAAACAATAAGAAGCTAGTGCATCTTAATCTCAATACCATGACAATCTAAAAGAACTGCAGAAACCCACTATGTACATATATTTATAGAAACATGATGAAATTAATTTTGCATTGCATAAGAGTAATTGATGAATGAATACCAGAAAATCTGCAACAGGCTTTGTTGGTGAGTGAGACAAAATTGGATTATATCTAAAATAATGAATATTATAATTAAGCGTACCACAGGCAAAAACAAAGAGTTTCATGAAATGTTCTTGAATCAAGAAACTCAATGGAATAAATAATGCAATTCATATTCCCATAGACAAACAGCTTTAAACTGACTCAAATTATTCTTCAAAGTAAGGTGAGTGTGCCTCTGCTGAGCTGATAAATGCTAGAACCAAAACTTTTGCTTTGCTAAACGAGCTATGATGACTCTGATGGAGTTTCTCCATTTCCTGTTACTTGGTGGCTTATGTTCCCTTTCTCCTGTCGAGTAGTAGGTCGATAGAAGCCATTCCCCCTCGGTCTGATGTACTCACGGTCACTTCCATCTCGAACTCCCATATTGTAACTGTAACTGCGAGAACCATAATGCCCCTTCGAAGACTCGGTGTGATAGCTCCCTCTTCCTCTACCCCTTCCTGATCACAATTCAATATTTGGGGAACCAGTTGCAAAACACAATAAGATATTCAAAACATCTTGTTGTTTCACTTTAAATCAAGAATCTGTGCTACCACCGTGTATACAGAAAAATTTGTCCGTAAGCAGACAGACAAAAGGCCAGCTTTTAACTAATCAAATTAGTTTTGGTAGATTTCATGCATTGCAATTTTACAGCAACGCATCAGCAAAGCCGTTCACATGGTTTATCATATCATAGTAGCAGAGTGGGTGCATGTACTTTGACTTAGAAGATATTCAAAATTCTGAAGAAGCACAATGAGAAAGAATGCTTACTTCCCCCCCGGTGTGGGATGTTGCTATTTGCTCTCCGCTCCTCGATGTATACTTGACGACCAGCAACCTGGGCAGTACCCGCCTGGAAAGGGAAAAATGAACTATTAATCAGAATCTCTATAAAGTGAAGGCGTGATCTGCCAAGAGAACACTGAGTTAGGATTTCAGGTGAAGCAAAACCTTGACTGCATTTTGGACACCAGTGATGTCTTCGAATTCGACAAACGCATAGCAAAAACCAACATCCTGGAAGTCATGAGATAGTTAAGCAATCAAATAATTTAAGCCGGGCATCAAGAAATATGTTGGAAAGATTAAAAAAAAAAAAAAAGACTGCAGACCTTGCGACTCCTAATAACCACCCCGTCAGAACTAAGTTTGCCAAAATGCTTGAATTCCTCCTCAACCTCTGAAGCAGACACGGTGGATGGCAAGTTTCTTACATAAACAGACTTTATTTCACCTTCATATATCAAACATTGAAAAAGGTTATGCTTCGCATATAAATAGCTAAACAGATGAATAATGATCAATACGACATGACTAACCTTCATCATCCATAGATGGGGACTCCCCTCCCGTCTGTTCCCTTTCAGAGTTATTTTGGGGAGCTGAAGTTGCTTGCTGACTGGTGGGAGGTGGAGTGTAGTTTTGCTCGGAAGCAGGTGGTGTGCCCTTACTAACAGGATATTGAGGAGCAGCAACAGGAGCAGGAACATCTTGTCCTTTAGCAACTCTTAGCTAGAGAAATTAAAACAAATGAAGTCACTTCCATTCTATGCATATTGTAAATTGTTGTTTTTAAATATAGGAAAATGAACCAAAATATTTACAAATATAGTAAAATTTTACTGTTTATCTGCGATAGACCACGGTAGACTAGTATCTATGTCTACCTATCTCTTGATAGACACAAATAGTAGTCTATTGTAGTCTATCGCAGATAGACTGTGATATTTTACTATTATTTGTAAATATTTTCAACAATTTTGTTATTTAAAATAATTTCAGCTGTAAACACTGTTATAAATCTATGGTGAATTGATTATTAATGTCAAATCATACAATTGAAGCATATGTATGCTTTTGGGGCTCCTCAGACTGTTCTTCAACAGAAACAGGAAAATGATCTTGTGAAACAGTTGATGCATTCTGATGCATAGAAATTGGTTCTGCTGTATTTTCCTCAATAATGCTCTTAGGCTCTGGAACTTGCTGCACTTGCTGCTCTACAAACTTGTGGTTATCGATGTGACCATTTTCTTTGACAACAGGGGCTGCAAACTCTCGCGCCTGGACTGCTCCATTGAGTGAGTAATTGGGCA

mRNA sequence

TTGACAAATTTCTATCCATATTTATCACATAGTACCAATTTCATCGATTCCAATAAACTTATATGGGTCTCTATATTTATCGGTCTCAATTTTAAATAAAAACCACCCTTCTTTCAAAAGTGGGATTGTCCTCTTTAAACAAAACAAAAACTTGTTTCCATTACCCCAAAAGAGAAATTTGTTAAATTTCACTTTCCACTGAAATCCACCATCCCGTTTAATGATTTTTAAATAGAAGTTTTCTTTTTCTTCAAAGCTCCTCGGGATTTGATTTGAGTGTGATTCGGCAGCAGAAAGAAAGATCAATCGCTTCAATGTTGCACTCCACGGCCTCGTTTTCCATATACACTGATGATGAAAATCAAGAACAAGTAATTGGGTTAGAAGCATTTGAAAAGGGTATCATGATTGAAGTGAACAAAGAAGCTCTGGGTAGTATTAGTAATGATTTCAGTTTTTCTAAAGAGACTATGGGGTTGATTCAAGAGGAAGAAATGGAAGATGAGGATGGTCTCAACTCGATTACGAATCGAGGTTTTGACGACGGCGAGGTTAACCTGAGACCCGCCAGTCCTCCGCTCTATCTAGCGGCGGGGCTTGGAATGGATGCCTCTGGTCTTGGTGGTGGGTATGATTCTGTTGATTTCTTTGACGAGAAGATGGTGGACGAGCCGCACTCAGTCGATCATTCGTTGCTTCTGAGGAACTATGTGCAGTCTTTATGGTCTGAGGGAAAACTCGATGAAGCTGAGGAACAGTGCTATCAAGCTACATTAACATATCCTGAAGATGGTGAAACTTTGATGCTGTATGCTCAGTTGGTTTGGGAAGTCCATCACGATCAAGCTAAAGCCTCAAATTACTTTGAACGTGCAGCTCTTGTTGCTCCAAATAACAGCGATATTCTTGCAGCACGAGCTAAGTTCCTGTGGGAACTGAATGATGAGGATCAAACTATGATATCAGGGGAAGAAGGCAGCAACATGGTTGATAGTTCATCCCCCAAAGAAAGAATTGAACCGGTAGCTGACACTAGTGAGAGTGATATGCAGGAGGATTATGAGAAGATGCTCAAGGAGAATCCTGTCGATCCATTGCTTCTGAAAAACTACGCTCGTTTCTTGCAACAGTCCAAAGTAGACCTCCAAGGAGCAGAGAAATACTACTACCGTGGCATCCAAGCAGACCCAAGCGATGGGGAATTGCTATCAGAATATGCAAAATTGGTGTGGGAGCTTCATCATGACTATGATAAAGCATTAAATTACTTCGAGAGAGCAGTTGAAGCCTCTCCTACAAACAGTTATGTTCTTGGAGCATATGCAAGTTTCCTTTGGGAGACGGATGAACTCGAGGAGGAAGATGGGGCGAGCAAAAATGACTCTCAATGGCCTTCAAACACAGTGGCAGTTTCTGTAGGAAATGCTTGAATTTTGATAAACATAGCATAGTCAATGATATTATAATGTATTAAAGTGATGCACTTGAAGCAAATAATATTCTAGTTTTAACATACGAAATGAAATGTCATCAACTCCGAATGATAACAGTCTACAGAAAGTTGGTTTTCATTGAATCAGAAATTGGTTCAAAATTGTTGCTTTAAACAATAAGAAGCTAGTGCATCTTAATCTCAATACCATGACAATCTAAAAGAACTGCAGAAACCCACTATGTACATATATTTATAGAAACATGATGAAATTAATTTTGCATTGCATAAGAGTAATTGATGAATGAATACCAGAAAATCTGCAACAGGCTTTGTTGGTGAGTGAGACAAAATTGGATTATATCTAAAATAATGAATATTATAATTAAGCGTACCACAGGCAAAAACAAAGAGTTTCATGAAATGTTCTTGAATCAAGAAACTCAATGGAATAAATAATGCAATTCATATTCCCATAGACAAACAGCTTTAAACTGACTCAAATTATTCTTCAAAGTAAGGTGAGTGTGCCTCTGCTGAGCTGATAAATGCTAGAACCAAAACTTTTGCTTTGCTAAACGAGCTATGATGACTCTGATGGAGTTTCTCCATTTCCTGTTACTTGGTGGCTTATGTTCCCTTTCTCCTGTCGAGTAGTAGGTCGATAGAAGCCATTCCCCCTCGGTCTGATGTACTCACGGTCACTTCCATCTCGAACTCCCATATTGTAACTGTAACTGCGAGAACCATAATGCCCCTTCGAAGACTCGGTGTGATAGCTCCCTCTTCCTCTACCCCTTCCTGATCACAATTCAATATTTGGGGAACCAGTTGCAAAACACAATAAGATATTCAAAACATCTTGTTGTTTCACTTTAAATCAAGAATCTGTGCTACCACCGTGTATACAGAAAAATTTGTCCGTAAGCAGACAGACAAAAGGCCAGCTTTTAACTAATCAAATTAGTTTTGGTAGATTTCATGCATTGCAATTTTACAGCAACGCATCAGCAAAGCCGTTCACATGGTTTATCATATCATAGTAGCAGAGTGGGTGCATGTACTTTGACTTAGAAGATATTCAAAATTCTGAAGAAGCACAATGAGAAAGAATGCTTACTTCCCCCCCGGTGTGGGATGTTGCTATTTGCTCTCCGCTCCTCGATGTATACTTGACGACCAGCAACCTGGGCAGTACCCGCCTGGAAAGGGAAAAATGAACTATTAATCAGAATCTCTATAAAGTGAAGGCGTGATCTGCCAAGAGAACACTGAGTTAGGATTTCAGGTGAAGCAAAACCTTGACTGCATTTTGGACACCAGTGATGTCTTCGAATTCGACAAACGCATAGCAAAAACCAACATCCTGGAAGTCATGAGATAGTTAAGCAATCAAATAATTTAAGCCGGGCATCAAGAAATATGTTGGAAAGATTAAAAAAAAAAAAAAAGACTGCAGACCTTGCGACTCCTAATAACCACCCCGTCAGAACTAAGTTTGCCAAAATGCTTGAATTCCTCCTCAACCTCTGAAGCAGACACGGTGGATGGCAAGTTTCTTACATAAACAGACTTTATTTCACCTTCATATATCAAACATTGAAAAAGGTTATGCTTCGCATATAAATAGCTAAACAGATGAATAATGATCAATACGACATGACTAACCTTCATCATCCATAGATGGGGACTCCCCTCCCGTCTGTTCCCTTTCAGAGTTATTTTGGGGAGCTGAAGTTGCTTGCTGACTGGTGGGAGGTGGAGTGTAGTTTTGCTCGGAAGCAGGTGGTGTGCCCTTACTAACAGGATATTGAGGAGCAGCAACAGGAGCAGGAACATCTTGTCCTTTAGCAACTCTTAGCTAGAGAAATTAAAACAAATGAAGTCACTTCCATTCTATGCATATTGTAAATTGTTGTTTTTAAATATAGGAAAATGAACCAAAATATTTACAAATATAGTAAAATTTTACTGTTTATCTGCGATAGACCACGGTAGACTAGTATCTATGTCTACCTATCTCTTGATAGACACAAATAGTAGTCTATTGTAGTCTATCGCAGATAGACTGTGATATTTTACTATTATTTGTAAATATTTTCAACAATTTTGTTATTTAAAATAATTTCAGCTGTAAACACTGTTATAAATCTATGGTGAATTGATTATTAATGTCAAATCATACAATTGAAGCATATGTATGCTTTTGGGGCTCCTCAGACTGTTCTTCAACAGAAACAGGAAAATGATCTTGTGAAACAGTTGATGCATTCTGATGCATAGAAATTGGTTCTGCTGTATTTTCCTCAATAATGCTCTTAGGCTCTGGAACTTGCTGCACTTGCTGCTCTACAAACTTGTGGTTATCGATGTGACCATTTTCTTTGACAACAGGGGCTGCAAACTCTCGCGCCTGGACTGCTCCATTGAGTGAGTAATTGGGCA

Coding sequence (CDS)

ATGTTGCACTCCACGGCCTCGTTTTCCATATACACTGATGATGAAAATCAAGAACAAGTAATTGGGTTAGAAGCATTTGAAAAGGGTATCATGATTGAAGTGAACAAAGAAGCTCTGGGTAGTATTAGTAATGATTTCAGTTTTTCTAAAGAGACTATGGGGTTGATTCAAGAGGAAGAAATGGAAGATGAGGATGGTCTCAACTCGATTACGAATCGAGGTTTTGACGACGGCGAGGTTAACCTGAGACCCGCCAGTCCTCCGCTCTATCTAGCGGCGGGGCTTGGAATGGATGCCTCTGGTCTTGGTGGTGGGTATGATTCTGTTGATTTCTTTGACGAGAAGATGGTGGACGAGCCGCACTCAGTCGATCATTCGTTGCTTCTGAGGAACTATGTGCAGTCTTTATGGTCTGAGGGAAAACTCGATGAAGCTGAGGAACAGTGCTATCAAGCTACATTAACATATCCTGAAGATGGTGAAACTTTGATGCTGTATGCTCAGTTGGTTTGGGAAGTCCATCACGATCAAGCTAAAGCCTCAAATTACTTTGAACGTGCAGCTCTTGTTGCTCCAAATAACAGCGATATTCTTGCAGCACGAGCTAAGTTCCTGTGGGAACTGAATGATGAGGATCAAACTATGATATCAGGGGAAGAAGGCAGCAACATGGTTGATAGTTCATCCCCCAAAGAAAGAATTGAACCGGTAGCTGACACTAGTGAGAGTGATATGCAGGAGGATTATGAGAAGATGCTCAAGGAGAATCCTGTCGATCCATTGCTTCTGAAAAACTACGCTCGTTTCTTGCAACAGTCCAAAGTAGACCTCCAAGGAGCAGAGAAATACTACTACCGTGGCATCCAAGCAGACCCAAGCGATGGGGAATTGCTATCAGAATATGCAAAATTGGTGTGGGAGCTTCATCATGACTATGATAAAGCATTAAATTACTTCGAGAGAGCAGTTGAAGCCTCTCCTACAAACAGTTATGTTCTTGGAGCATATGCAAGTTTCCTTTGGGAGACGGATGAACTCGAGGAGGAAGATGGGGCGAGCAAAAATGACTCTCAATGGCCTTCAAACACAGTGGCAGTTTCTGTAGGAAATGCTTGA

Protein sequence

MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNKEALGSISNDFSFSKETMGLIQEEEMEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDEPHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYAQLVWEVHHDQAKASNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEGSNMVDSSSPKERIEPVADTSESDMQEDYEKMLKENPVDPLLLKNYARFLQQSKVDLQGAEKYYYRGIQADPSDGELLSEYAKLVWELHHDYDKALNYFERAVEASPTNSYVLGAYASFLWETDELEEEDGASKNDSQWPSNTVAVSVGNA
BLAST of Bhi01M000472 vs. TAIR10
Match: AT1G04530.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 80.1 bits (196), Expect = 3.2e-15
Identity = 73/232 (31.47%), Postives = 104/232 (44.83%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNKEALGSISN-----DFSFSKETMGL 60
           ML S AS SIY D  +     G ++ +    IE N E   +I +     +FSF+K     
Sbjct: 1   MLKSEASLSIYCDSGD-----GFKSEDPVTGIEENLERTVTIGDAIDGGEFSFAKHK--- 60

Query: 61  IQEEEMEDEDGLNSITNRGFDDG-------EVNLRPASPPLYLAAGLGMDASGLGG---- 120
            +E+  E E G+     +    G       E+  RP SPP++LAAGLG+D   L G    
Sbjct: 61  -EEDSGEGERGVFEEVIKKLGIGVRDELGFEIE-RPPSPPMHLAAGLGIDKFDLYGSEIK 120

Query: 121 ----GYDSV---DFFDEKMVDEPHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYP 180
               GYD     D++   + + P    H LLL+NY + L  +G L  AEE  ++ T+  P
Sbjct: 121 FDLPGYDDKNCGDYYKGMLEEYPL---HPLLLKNYAKFLEYKGDLSGAEEYYHKCTVVEP 180

Query: 181 EDGETLMLYAQLVWEVHHDQAKASNYFERAALVAPNNSDILAARAKFLWELN 210
            DG  L  Y +LV ++H D+AK                        FLWE+N
Sbjct: 181 SDGVALANYGRLVMKLHQDEAKXXXXXXXXXXXXXXXXXXXXXXXXFLWEIN 219

BLAST of Bhi01M000472 vs. TAIR10
Match: AT1G80130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 75.9 bits (185), Expect = 6.0e-14
Identity = 41/127 (32.28%), Postives = 74/127 (58.27%), Query Frame = 0

Query: 103 GGGYDSVDFFDEKMVDEPHSVDHSLLLRNYVQSLWS-EGKLDEAEEQCYQATLTYPEDGE 162
           G   D+ D +  +M+D   +  +SLL  NY + L   +G + +AEE C +A L    DG 
Sbjct: 159 GRSEDATDTYYREMIDS--NPGNSLLTGNYAKFLKEVKGDMKKAEEYCERAILGNTNDGN 218

Query: 163 TLMLYAQLVWEVHHDQAKASNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEG 222
            L LYA L+   H D+ +A +Y+++A  ++P +  + A+ A+FLW+++++++    GEE 
Sbjct: 219 VLSLYADLILHNHQDRQRAHSYYKQAVKMSPEDCYVQASYARFLWDVDEDEEDEALGEEE 278

Query: 223 SNMVDSS 229
            N+ D +
Sbjct: 279 ENLSDET 283

BLAST of Bhi01M000472 vs. TrEMBL
Match: tr|A0A0A0L4X1|A0A0A0L4X1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G134570 PE=4 SV=1)

HSP 1 Score: 490.0 bits (1260), Expect = 4.8e-135
Identity = 328/372 (88.17%), Postives = 343/372 (92.20%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNK-EALGSISNDFSFSKETMGLIQEE 60
           MLHSTASFSIYTDDENQEQ++GLEAFEKG+MIEVNK E LGS  +DFSFS+  MGLIQEE
Sbjct: 1   MLHSTASFSIYTDDENQEQIMGLEAFEKGVMIEVNKEEVLGSTGHDFSFSERAMGLIQEE 60

Query: 61  EMEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE 120
           EMEDEDGL    NRGFDD EVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE
Sbjct: 61  EMEDEDGL----NRGFDDSEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE 120

Query: 121 PHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYAQLVWEVHHDQAK 180
             S+  SL LR+YVQSLWSEGKLDEAEEQCYQAT+T+PEDGETLMLYAQLVWE+HHDQAK
Sbjct: 121 TPSIHPSLSLRDYVQSLWSEGKLDEAEEQCYQATITFPEDGETLMLYAQLVWELHHDQAK 180

Query: 181 ASNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEGSNMVDSSSPKERIEPVAD 240
           AS+YFERAALVAPNNS+ILAARAKFLWELN+ED+TMI GEE SN VDSSSP+ERIEP  D
Sbjct: 181 ASSYFERAALVAPNNSNILAARAKFLWELNEEDETMIPGEEDSNPVDSSSPEERIEPAPD 240

Query: 241 TSESDMQEDYEKMLKENPVDPLLLKNYARFLQQSKVDLQGAEXXXXXXXXXXXXXXXXXX 300
           T ESDMQE YEKMLKENP DPLLLKNYARFLQQSKVDLQG  XXXXXXXXXXXXXXXXXX
Sbjct: 241 TGESDMQEYYEKMLKENPTDPLLLKNYARFLQQSKVDLQGXXXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDELEEEDGASKNDSQW 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX E  EEDGASKNDSQW
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXE-HEEDGASKNDSQW 360

Query: 361 PSNTVAVSVGNA 372
           PSNTVAVSVGNA
Sbjct: 361 PSNTVAVSVGNA 367

BLAST of Bhi01M000472 vs. TrEMBL
Match: tr|A0A1S3AWE5|A0A1S3AWE5_CUCME (uncharacterized protein LOC103483542 OS=Cucumis melo OX=3656 GN=LOC103483542 PE=4 SV=1)

HSP 1 Score: 423.3 bits (1087), Expect = 5.6e-115
Identity = 300/372 (80.65%), Postives = 314/372 (84.41%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNK-EALGSISNDFSFSKETMGLIQEE 60
           MLHSTASFSIYTDDENQEQ++GLEAFEKG+MIEVNK E LGS  NDFSFSK  MGLIQEE
Sbjct: 1   MLHSTASFSIYTDDENQEQIMGLEAFEKGVMIEVNKEEVLGSTGNDFSFSKRAMGLIQEE 60

Query: 61  EMEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE 120
           EMEDEDGL    NRGFDD EVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE
Sbjct: 61  EMEDEDGL----NRGFDDSEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE 120

Query: 121 PHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYAQLVWEVHHDQAK 180
             S+  SL LR+YVQSLWSEGKLDEAEEQ YQAT+TYPEDGE L+LYAQLVWE+HHDQAK
Sbjct: 121 TPSIHPSLPLRDYVQSLWSEGKLDEAEEQSYQATITYPEDGEILVLYAQLVWELHHDQAK 180

Query: 181 ASNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEGSNMVDSSSPKERIEPVAD 240
           AS+YFE                          D+TMI GEE SN V+SSSP+ERIEP +D
Sbjct: 181 ASSYFEXXXXXXXXXXXXXXXXXXXXXXXXXXDETMIPGEEDSNKVNSSSPEERIEPASD 240

Query: 241 TSESDMQEDYEKMLKENPVDPLLLKNYARFLQQSKVDLQGAEXXXXXXXXXXXXXXXXXX 300
           T ESDMQE YEKMLKENP DPLLLKNYARFL+QSKVDLQ   XXXXXXXXXXXXXXXXXX
Sbjct: 241 TGESDMQEYYEKMLKENPTDPLLLKNYARFLKQSKVDLQXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDELEEEDGASKNDSQW 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX E  EEDGASKNDSQW
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEY-EEDGASKNDSQW 360

Query: 361 PSNTVAVSVGNA 372
           PSNTV VSVGNA
Sbjct: 361 PSNTVTVSVGNA 367

BLAST of Bhi01M000472 vs. TrEMBL
Match: tr|A0A1U8HS03|A0A1U8HS03_GOSHI (uncharacterized protein LOC107886606 isoform X2 OS=Gossypium hirsutum OX=3635 GN=LOC107886606 PE=4 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 1.0e-31
Identity = 107/316 (33.86%), Postives = 158/316 (50.00%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNKEALGSISNDFSFSKETMGLIQEEE 60
           MLHS  SFSI+ +   ++  +G E+ E+ + I  N +A+G  + DFSF K+ M LIQEEE
Sbjct: 122 MLHSAPSFSIFNEGV-EDGKLGEESLERTVTIGENIDAVG--NPDFSFRKKCMELIQEEE 181

Query: 61  -MEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDF-------- 120
             E +  LN I     D+ EV L P SPP+YLA GLG+D +G G   D VD         
Sbjct: 182 GDEGKKRLNRIRVSYNDEEEVELEPPSPPMYLATGLGIDCAGFGAMADGVDLSYMDLDEV 241

Query: 121 -----FDEKMVDEPHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLY 180
                F +++VDE     H L LRNY + L S+G L  AE+  ++ATL  PED E L+ Y
Sbjct: 242 DDQEEFHKRLVDEYPC--HPLFLRNYAKFLQSKGDLQGAEDYYHRATLADPEDSEILLQY 301

Query: 181 AQLVWEVHH-------DQAKASNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGE 240
           A+++W++HH                                      E  +++   +  E
Sbjct: 302 AKILWDLHHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEYGEQEYQGVKEE 361

Query: 241 EGSNMVDSSSPKER---------------IEPVADTSESDMQEDYEKMLKENPVDPLLLK 281
           E   +  +S P+E                +E   D  + D+++ + +M++ENP +P +L 
Sbjct: 362 EIMKVAKNSLPEEETKLARLSMQLPNPAGLEVHTDIQDIDIEDYHTRMVQENPGNPSVLS 421

BLAST of Bhi01M000472 vs. TrEMBL
Match: tr|A0A0D2UVC7|A0A0D2UVC7_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_011G179800 PE=4 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 1.3e-31
Identity = 106/316 (33.54%), Postives = 158/316 (50.00%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNKEALGSISNDFSFSKETMGLIQEEE 60
           MLHS  SFSI+ +   ++  +G E+ E+ + I  N +A+G  + DFSF K+ M LIQEEE
Sbjct: 122 MLHSAPSFSIFNEGV-EDGKLGEESLERTVTIGENIDAVG--NPDFSFRKKCMELIQEEE 181

Query: 61  MEDE-DGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDF-------- 120
            ++E   LN I     D+ EV L P SPP+YLA GLG+D +G G   D VD         
Sbjct: 182 GDEEKKRLNRIRVSHNDEEEVELEPPSPPMYLATGLGIDCAGFGAMADGVDLSYMDLDEV 241

Query: 121 -----FDEKMVDEPHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLY 180
                F +++VDE     H L LRNY + L S+G L  AE+  ++ATL  PED E L+ Y
Sbjct: 242 DDQEEFHKRLVDEYPC--HPLFLRNYAKFLQSKGDLQGAEDYYHRATLADPEDCEILLQY 301

Query: 181 AQLVWEVHHD-------QAKASNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGE 240
           A+++W++HHD                                     E  +++   +  E
Sbjct: 302 AKILWDLHHDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEYGEQEYQGVKEE 361

Query: 241 EGSNMVDSSSPKERIEPV---------------ADTSESDMQEDYEKMLKENPVDPLLLK 281
           E   +  +S  +E  +                  D  + D+++ + +M++ENP +P +L 
Sbjct: 362 ENMKVAKNSLHEEETKLARLSIHLPNPAGLGVHTDIQDIDIEDYHTRMVQENPGNPSVLS 421

BLAST of Bhi01M000472 vs. TrEMBL
Match: tr|A0A061EMH5|A0A061EMH5_THECC (Tetratricopeptide repeat-like superfamily protein, putative isoform 2 OS=Theobroma cacao OX=3641 GN=TCM_021008 PE=4 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 2.9e-31
Identity = 108/322 (33.54%), Postives = 157/322 (48.76%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNKEALGSISNDFSFSKETMGLIQEEE 60
           MLHS  SFSI+ +     Q  G EA E+ + I  + +A+G+   DFSF K+ M LIQEE 
Sbjct: 1   MLHSAPSFSIFNEGLEDGQG-GEEALERTVTIGESIDAVGNA--DFSFGKKCMELIQEEG 60

Query: 61  MEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDF--------- 120
            E+E+  N I +  +++ EV+L P SPP+YLA GLG+D  G G   D+VD          
Sbjct: 61  EEEEERGNRIQS-PYNEEEVDLEPPSPPMYLATGLGIDGPGFGTMADAVDLSSMDLDEAS 120

Query: 121 ----FDEKMVDEPHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYA 180
               F +++V+E     H L LRNY + L S+G +  AE+  ++ATL  PEDGE L  YA
Sbjct: 121 DLEEFHKRLVNEYPC--HPLFLRNYAKFLQSKGDVHGAEDYYHRATLADPEDGEILSQYA 180

Query: 181 QLVWEVHHDQAKASNYFERAALVAPNNSDILAARAKFLW--------ELNDEDQTMISGE 240
           ++VWE+H D+                                         E+   +  E
Sbjct: 181 KIVWELHQDKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXENREQEEYIKVEEE 240

Query: 241 EGSNMVDSSSPKERIEPVA----------------------DTSESDMQEDYEKMLKENP 280
           +   +  +S P E  +P +                         + D+++ Y +M+KENP
Sbjct: 241 KTLRLSKNSQPIEETDPASLSLHPPNPGGFGVHADYVKAAKGIRDVDVEDYYRRMVKENP 300

BLAST of Bhi01M000472 vs. NCBI nr
Match: XP_004134036.1 (PREDICTED: uncharacterized protein LOC101202732 [Cucumis sativus] >KGN56808.1 hypothetical protein Csa_3G134570 [Cucumis sativus])

HSP 1 Score: 490.0 bits (1260), Expect = 7.3e-135
Identity = 328/372 (88.17%), Postives = 343/372 (92.20%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNK-EALGSISNDFSFSKETMGLIQEE 60
           MLHSTASFSIYTDDENQEQ++GLEAFEKG+MIEVNK E LGS  +DFSFS+  MGLIQEE
Sbjct: 1   MLHSTASFSIYTDDENQEQIMGLEAFEKGVMIEVNKEEVLGSTGHDFSFSERAMGLIQEE 60

Query: 61  EMEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE 120
           EMEDEDGL    NRGFDD EVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE
Sbjct: 61  EMEDEDGL----NRGFDDSEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE 120

Query: 121 PHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYAQLVWEVHHDQAK 180
             S+  SL LR+YVQSLWSEGKLDEAEEQCYQAT+T+PEDGETLMLYAQLVWE+HHDQAK
Sbjct: 121 TPSIHPSLSLRDYVQSLWSEGKLDEAEEQCYQATITFPEDGETLMLYAQLVWELHHDQAK 180

Query: 181 ASNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEGSNMVDSSSPKERIEPVAD 240
           AS+YFERAALVAPNNS+ILAARAKFLWELN+ED+TMI GEE SN VDSSSP+ERIEP  D
Sbjct: 181 ASSYFERAALVAPNNSNILAARAKFLWELNEEDETMIPGEEDSNPVDSSSPEERIEPAPD 240

Query: 241 TSESDMQEDYEKMLKENPVDPLLLKNYARFLQQSKVDLQGAEXXXXXXXXXXXXXXXXXX 300
           T ESDMQE YEKMLKENP DPLLLKNYARFLQQSKVDLQG  XXXXXXXXXXXXXXXXXX
Sbjct: 241 TGESDMQEYYEKMLKENPTDPLLLKNYARFLQQSKVDLQGXXXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDELEEEDGASKNDSQW 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX E  EEDGASKNDSQW
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXE-HEEDGASKNDSQW 360

Query: 361 PSNTVAVSVGNA 372
           PSNTVAVSVGNA
Sbjct: 361 PSNTVAVSVGNA 367

BLAST of Bhi01M000472 vs. NCBI nr
Match: XP_008438453.1 (PREDICTED: uncharacterized protein LOC103483542 [Cucumis melo])

HSP 1 Score: 423.3 bits (1087), Expect = 8.4e-115
Identity = 300/372 (80.65%), Postives = 314/372 (84.41%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNK-EALGSISNDFSFSKETMGLIQEE 60
           MLHSTASFSIYTDDENQEQ++GLEAFEKG+MIEVNK E LGS  NDFSFSK  MGLIQEE
Sbjct: 1   MLHSTASFSIYTDDENQEQIMGLEAFEKGVMIEVNKEEVLGSTGNDFSFSKRAMGLIQEE 60

Query: 61  EMEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE 120
           EMEDEDGL    NRGFDD EVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE
Sbjct: 61  EMEDEDGL----NRGFDDSEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDE 120

Query: 121 PHSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYAQLVWEVHHDQAK 180
             S+  SL LR+YVQSLWSEGKLDEAEEQ YQAT+TYPEDGE L+LYAQLVWE+HHDQAK
Sbjct: 121 TPSIHPSLPLRDYVQSLWSEGKLDEAEEQSYQATITYPEDGEILVLYAQLVWELHHDQAK 180

Query: 181 ASNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEGSNMVDSSSPKERIEPVAD 240
           AS+YFE                          D+TMI GEE SN V+SSSP+ERIEP +D
Sbjct: 181 ASSYFEXXXXXXXXXXXXXXXXXXXXXXXXXXDETMIPGEEDSNKVNSSSPEERIEPASD 240

Query: 241 TSESDMQEDYEKMLKENPVDPLLLKNYARFLQQSKVDLQGAEXXXXXXXXXXXXXXXXXX 300
           T ESDMQE YEKMLKENP DPLLLKNYARFL+QSKVDLQ   XXXXXXXXXXXXXXXXXX
Sbjct: 241 TGESDMQEYYEKMLKENPTDPLLLKNYARFLKQSKVDLQXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDELEEEDGASKNDSQW 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX E  EEDGASKNDSQW
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEY-EEDGASKNDSQW 360

Query: 361 PSNTVAVSVGNA 372
           PSNTV VSVGNA
Sbjct: 361 PSNTVTVSVGNA 367

BLAST of Bhi01M000472 vs. NCBI nr
Match: XP_023526651.1 (uncharacterized protein LOC111790084 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 406.0 bits (1042), Expect = 1.4e-109
Identity = 211/280 (75.36%), Postives = 227/280 (81.07%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNKEALGSISNDFSFSKETMGLIQEEE 60
           MLHSTASFSIYTDDENQEQV+GLE FEKGI IEVNKEA+GS  NDF FSK  MGLIQEEE
Sbjct: 1   MLHSTASFSIYTDDENQEQVMGLEGFEKGITIEVNKEAVGS-GNDFCFSKGAMGLIQEEE 60

Query: 61  MEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDEP 120
           MED+DG+NSI   GFD GE +L PASPP+YLAAGLG+DASGLGG YDS+DFFDEKMVDEP
Sbjct: 61  MEDDDGMNSI--GGFDGGEFDLTPASPPMYLAAGLGLDASGLGGAYDSMDFFDEKMVDEP 120

Query: 121 HSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYAQLVWEVHHDQAKA 180
            +V  SLLLR YVQSLWSEGKLDEAEEQCYQAT+TYPED E LMLYAQLVWEVHHDQAKA
Sbjct: 121 PAVHPSLLLRKYVQSLWSEGKLDEAEEQCYQATVTYPEDAEILMLYAQLVWEVHHDQAKA 180

Query: 181 SNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEGSNMVDSSSPKERIEPVADT 240
           S+YFERAA                    +DED+TM+ GEE  N + SS PKER EPVAD 
Sbjct: 181 SSYFERAAXXXXXXXXXXXXXXXXXXXXDDEDETMVPGEEDGNPIGSSPPKERTEPVADN 240

Query: 241 SESDMQEDYEKMLKENPVDPLLLKNYARFLQQSKVDLQGA 281
             SD+QE YEKMLKENP DPLLLKNYARFLQQSK DLQGA
Sbjct: 241 GGSDLQESYEKMLKENPTDPLLLKNYARFLQQSKADLQGA 277

BLAST of Bhi01M000472 vs. NCBI nr
Match: XP_022924558.1 (uncharacterized protein LOC111432003 [Cucurbita moschata])

HSP 1 Score: 402.9 bits (1034), Expect = 1.2e-108
Identity = 212/280 (75.71%), Postives = 225/280 (80.36%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNKEALGSISNDFSFSKETMGLIQEEE 60
           MLHSTASFSIYTDDENQEQV+GLE FEKGI IEVNKEALGS  NDF FSK  MGLIQEEE
Sbjct: 1   MLHSTASFSIYTDDENQEQVMGLEEFEKGITIEVNKEALGS-GNDFCFSKGAMGLIQEEE 60

Query: 61  MEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDEP 120
           MED DG+NSI +  FD GE +L PASPP+YLAAGLG+DASGLGG YDSVDFFDEKMVDEP
Sbjct: 61  MEDGDGMNSIGD--FDGGEFDLTPASPPMYLAAGLGLDASGLGGAYDSVDFFDEKMVDEP 120

Query: 121 HSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYAQLVWEVHHDQAKA 180
            ++  SLLLR YVQSLWSEGKLDEAEEQCYQAT+TYPED E LMLYAQLVWEVHHDQAKA
Sbjct: 121 PAIHPSLLLRKYVQSLWSEGKLDEAEEQCYQATVTYPEDAEILMLYAQLVWEVHHDQAKA 180

Query: 181 SNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEGSNMVDSSSPKERIEPVADT 240
           S+YFERA                     +DED+TMI GEE  N V SS PKER EPVAD 
Sbjct: 181 SSYFERAXXXXXXXXXXXXXXXXXXXXXDDEDETMIPGEEDGNPVGSSPPKERTEPVADN 240

Query: 241 SESDMQEDYEKMLKENPVDPLLLKNYARFLQQSKVDLQGA 281
             SD+QE YEKMLKENP DPLLLKNYARFLQQSK DLQGA
Sbjct: 241 GGSDLQESYEKMLKENPTDPLLLKNYARFLQQSKADLQGA 277

BLAST of Bhi01M000472 vs. NCBI nr
Match: XP_022979332.1 (uncharacterized protein LOC111479088 [Cucurbita maxima])

HSP 1 Score: 399.1 bits (1024), Expect = 1.7e-107
Identity = 210/279 (75.27%), Postives = 221/279 (79.21%), Query Frame = 0

Query: 1   MLHSTASFSIYTDDENQEQVIGLEAFEKGIMIEVNKEALGSISNDFSFSKETMGLIQEEE 60
           MLHSTASFSIYTDDENQEQV+GLE FEKGI IEVNKEALGS  NDF FSK  MGLIQEEE
Sbjct: 1   MLHSTASFSIYTDDENQEQVMGLEEFEKGITIEVNKEALGS-GNDFCFSKGAMGLIQEEE 60

Query: 61  MEDEDGLNSITNRGFDDGEVNLRPASPPLYLAAGLGMDASGLGGGYDSVDFFDEKMVDEP 120
           MED+DG+NSI   GFD GE +L PASPP+YLAAGLG+DASGLGG YDSVDFFDEKMVDEP
Sbjct: 61  MEDDDGMNSI--GGFDGGEFDLTPASPPMYLAAGLGLDASGLGGAYDSVDFFDEKMVDEP 120

Query: 121 HSVDHSLLLRNYVQSLWSEGKLDEAEEQCYQATLTYPEDGETLMLYAQLVWEVHHDQAKA 180
            ++  SLLLR YVQSLWSEGKLDEAEEQCYQAT+TYPED E LMLYAQLVWEVHHDQAKA
Sbjct: 121 PAIHPSLLLRKYVQSLWSEGKLDEAEEQCYQATVTYPEDAEVLMLYAQLVWEVHHDQAKA 180

Query: 181 SNYFERAALVAPNNSDILAARAKFLWELNDEDQTMISGEEGSNMVDSSSPKERIEPVADT 240
           S+YFE                        DED+TMI GEE  N V SS PKER EPVAD 
Sbjct: 181 SSYFEXXXXXXXXXXXXXXXXXXXXXXXXDEDETMIPGEEDGNPVGSSPPKERTEPVADN 240

Query: 241 SESDMQEDYEKMLKENPVDPLLLKNYARFLQQSKVDLQG 280
             SDMQE YEK LKENP DPLLLKNYARFLQQSK DLQG
Sbjct: 241 GGSDMQESYEKTLKENPTDPLLLKNYARFLQQSKADLQG 276

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT1G04530.13.2e-1531.47Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G80130.16.0e-1432.28Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
tr|A0A0A0L4X1|A0A0A0L4X1_CUCSA4.8e-13588.17Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G134570 PE=4 SV=1[more]
tr|A0A1S3AWE5|A0A1S3AWE5_CUCME5.6e-11580.65uncharacterized protein LOC103483542 OS=Cucumis melo OX=3656 GN=LOC103483542 PE=... [more]
tr|A0A1U8HS03|A0A1U8HS03_GOSHI1.0e-3133.86uncharacterized protein LOC107886606 isoform X2 OS=Gossypium hirsutum OX=3635 GN... [more]
tr|A0A0D2UVC7|A0A0D2UVC7_GOSRA1.3e-3133.54Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_011G179800 PE=4 ... [more]
tr|A0A061EMH5|A0A061EMH5_THECC2.9e-3133.54Tetratricopeptide repeat-like superfamily protein, putative isoform 2 OS=Theobro... [more]
Match NameE-valueIdentityDescription
XP_004134036.17.3e-13588.17PREDICTED: uncharacterized protein LOC101202732 [Cucumis sativus] >KGN56808.1 hy... [more]
XP_008438453.18.4e-11580.65PREDICTED: uncharacterized protein LOC103483542 [Cucumis melo][more]
XP_023526651.11.4e-10975.36uncharacterized protein LOC111790084 [Cucurbita pepo subsp. pepo][more]
XP_022924558.11.2e-10875.71uncharacterized protein LOC111432003 [Cucurbita moschata][more]
XP_022979332.11.7e-10775.27uncharacterized protein LOC111479088 [Cucurbita maxima][more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR013026TPR-contain_dom
IPR011990TPR-like_helical_dom_sf
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Bhi01G000472Bhi01G000472gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Bhi01M000472Bhi01M000472-proteinpolypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Bhi01M000472.utr3p1Bhi01M000472.utr3p1three_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Bhi01M000472.exon6Bhi01M000472.exon6exon
Bhi01M000472.exon5Bhi01M000472.exon5exon
Bhi01M000472.exon4Bhi01M000472.exon4exon
Bhi01M000472.exon3Bhi01M000472.exon3exon
Bhi01M000472.exon2Bhi01M000472.exon2exon
Bhi01M000472.exon1Bhi01M000472.exon1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
cds.Bhi01M000472cds.Bhi01M000472CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Bhi01M000472.utr5p1Bhi01M000472.utr5p1five_prime_UTR


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 57..229
e-value: 1.8E-9
score: 39.2
coord: 239..351
e-value: 1.7E-14
score: 55.7
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 124..213
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 247..348
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 350..371
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 355..371
NoneNo IPR availablePANTHERPTHR26312FAMILY NOT NAMEDcoord: 46..218
coord: 238..357
NoneNo IPR availablePANTHERPTHR26312:SF89CARBOXYLATE CLAMP-TETRATRICOPEPTIDE REPEAT PROTEINcoord: 238..357
NoneNo IPR availablePANTHERPTHR26312:SF89CARBOXYLATE CLAMP-TETRATRICOPEPTIDE REPEAT PROTEINcoord: 46..218
IPR013026Tetratricopeptide repeat-containing domainPROSITEPS50293TPR_REGIONcoord: 126..194
score: 8.014
IPR013026Tetratricopeptide repeat-containing domainPROSITEPS50293TPR_REGIONcoord: 247..329
score: 13.774