Bhi04G002045 (gene) Wax gourd (B227) v1

Overview
Name: Bhi04G002045
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Unknown protein
Location: chr4: 67698049 .. 67702116 (+)
RNA-Seq Expression: Bhi04G002045
Synteny: Bhi04G002045
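The Location field can be read programmatically. The sketch below parses the string as printed on this page and computes the span of the feature. The helper names (`parse_location`, `span_length`) are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, as in GFF-style annotation; the page itself does not state the convention.

```python
import re

# Hypothetical helper: the pattern mirrors the "Location" field as printed on this page.
LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split 'chr4: 67698049 .. 67702116 (+)' into (seqid, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr4: 67698049 .. 67702116 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 4068 positions on the plus strand of chr4.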
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAAGTCGTTTTGGGTGGTCAAAGGGGAAGTGAAGGTTTGTTTCGGTTCAATTCTACACACTGCTTCCACATTCGAGAATGGAGCTCCATTGCGCAACTCTTCAAACTTCATTCTCTTTCTCTATCAGAAGCAAAGCTCTTGCTCATGGAGATGCCTCCGCCGTCTGCTCTCCCTCCTCGCCATCTTTCTCAAGAATTACAGCTCGAAATTTCTCAATGGGTTCAAAAAGTAGAGGTAAGGTGGCTCAAGTTTGCTAAGTTTCCGTTGAGACGCGAAGACTGAGCCATGAGATCGATACTTCTGTAGTATTGCTCAATGCTTTCGATTTTAGTCGGTTATTAAATGGGAACGCCGGGGCGACGGAAAGAGTAGAGATTTTGATTCTGTTCTCTTTTCGTATTCCGTTCATTATATGACTGAACTATTCACGACGTGGAAAAATGGTCCTTTAGAATTATTTAAACAGAGACGTTGGCCAATAAATTTTCTATATGTGTGTGTGTGTGTGTAGCTATGGATTGGTGTAGATACTGAATTTCTGTTGTTTAAATTACTGATTCCTTATAGTTCATGTTCCAAGCTTCATATTGACTTGGCGTAGCAGCCACAAAGTGAAGCTAGTATACATAGTCTAAAATTATAGAAACTTAAGAAAATTGTTACCATGTTCTTTATTGAACTTGAGTGATCAGTGAAGTATTAGTAATTTTGTTGAATAGCAGGGTTCCCTTCACTGGTGTGTCGAGTTAGATTGAAGAAGTCATCCTTTTCTGCCGTTGTCAGAGGGGTGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTGGTCAAATCCCACTCAATATGAGCCAAGAGATGTTCAAAATGCAAAAGATGGGGTCGAAAGCTTGGACCAGCCTAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGCCTCTGAAGACGATCCTGGGATGAGGCATAAACTACTCCGGCTGGGAAGAAAGTTGAAAGAGGTATGTCTTGGGATTTTAAAGTTATACGTACTAAAAAATTGGACTTGAAAATGGCAAGCCTTGTTTCTATAAATAATTAATTATGTAAATAAGAAGAGTGGAAACTAACACTTCGTATGCAAATGAGGTTAGGTCTATTGTTGTAAAAGCCACAAAGGATGTGTTGCTGAGAGAATAATTAAGATGAATTGTTGATGATAACATGGTTATTTTATTGATCAGATAGTTGCAGTATGTGAATTGTCAATCAACCTTTTCTTTCCTTCTATGAAATTTTCACAGATAGATGAAGATGTGAAGAGACATAATGAACTTCTTGAAGTGGTCAGAGCAGCAGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTGCATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGGTAATACTGCATACTTACGAGTTGGAAAAATAGTTATGAATAAATAGGTTCCTAAGGGGATGAGAAAGTGTAATTCAGGAAGCAAATATATTGGTTAAAAAAATACCTTTTTGGTCTCTAAGTTTTGAGTATAGTCTTAATTTGGTCCTTAAATTTTAAAATGTTATTTAATTTCTAAGATTTAAAATGTTATCCTTTGAAATGACCAGAAGGGAAACTATACTCAAAACTTAGGGCCAAAAGATATACTCTTTCCTATGTTGTTTTATGCTATTCAAAGAATGAGAGGCAACTTAAGAAATTGCTCCTTCCTTAAGAGTAGCTAGATTTGTTAAGACATGCATTTGTCAAATGTTCAAATCTATACCCCTACATATTTATTTATTTATTTATTCATTTATTATTATTATTATTATTATTATTGTTAATGATGT
TTTCTTTCTTCAGTGAACGTCTTTTTCATATATGCATATATAAGATATGGTATTGATGGATGAACTGATATATTTCCTCTTATATTTCAATTGATAAGCTTTGGCAAAACTTGGGAATTCCTGCCTTTCTGCTGTACAAACATATGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATAAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAACATGATGAAAGATGAGGTAATGCATTTGCGACCCTTGCTGTTCGCCATTGATGCAAAATTGTGATTGCATGGTCTTATTTCTGTGATATTGTAGGTTTTCAAAGCATCATCTGACCATTCAAGAAATTTCAAATTATTGTGTTAAGTCAGTAATTACTGAAACTTCCACAAATACTGTCATCTATTTGAATGAAATGACTACAGCTTAAGGCACCAGATTAAAATATATATATATATATATATTTCTCTTGATTTTATGTGTTTCTATGTTGGGTCATGCGAGCAAATGGAGAATGCACCTCTATTTTATTGTTGCATAATGACTTTCTGATGATGTAAGTGTTGATGTTATTTCTCTTTGTTAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTGAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCGTTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGTAAGCTTTTGGACCTCGTTGTTAAGAGTATATTGAAATACTAAATATGTCAGTCTATTAGCTTGAACCAAATTTAGTGATTTATTTTGATATCAGATCTTCTATGGTCAAACTCTAATGATTCTATTTTCTTATCTATAGTGTTGCTAGTCTACTTGTTCGATATTCGTCATTGTGCCCATTTTTGAGCCTTCAAATGGGATGTCAAGCAATAAGCATATATTCAAAGAATTAGATATAGCCTATAACTTATTAACACTGAGTTAATCCTCATTGTTATCGTATCTTTGTTAGAATACCTCCCAACTAGAATTAAGATTTGGAATATATTGAAATATAACGGAATAACTACAAGATAACCCAAGTATAAAGGCACTTGGAACCTCCCTCTCGACTCCGGAGTGTGCAAAATCAAGCCCTAGACCCACTCCACAGAATTGCCTACCTTCCCTCGCTCACCCTCCTGGCCTCCTATTTATAACGATATGCACTAACTTCCTATCTAATTACCATGATATCCCTTACTAATACTATATTAATATTCTCATCACACAAGGGAAAAAAAGCCTTAGTCCCATCTCCGTAGATCTCCTGAACCTTTGATGTCCATCTAGTTAAGAGCCAGGATATACCACCTTGACTTTGAATTTGCGCAACAGACCACTTGGAACGTCGTCTGGAAAACGACCTTCAACTGGACTTTCTTTTTTTAAAAAAAAGATATATAAATATACACATATAAAAGCAATCTTATTAGGTTTCTTCTTCATTTGTGCACACTTTTAGGACCCCAGAGAAGCTTCACACGTGGATAAAGACGGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAGGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAAGAATTGAAGCATCTCCTTGAGAAAAATTTCATGTGAGGTAAAGCATTTGGTAGTTAGGATTTATGTCTTTAGTTATGCATGAAAAATTGTATAGTGATTTTTTTTCTTTGTTGTATAACAAAATGAATATGAATAATGTATTCAAAATAGTTTATTTTCTCCATGGAAGTAGAGTGTGGTTCAAGTGGCTAGATAGTTGTACCTGTATTA
TGGCTTGTAGTTTCATGTTAGAATCTATTCTCTCTTTATTTTTCAATTTATTATTCTTTCACATAAAA

mRNA sequence

CAAAGTCGTTTTGGGTGGTCAAAGGGGAAGTGAAGGTTTGTTTCGGTTCAATTCTACACACTGCTTCCACATTCGAGAATGGAGCTCCATTGCGCAACTCTTCAAACTTCATTCTCTTTCTCTATCAGAAGCAAAGCTCTTGCTCATGGAGATGCCTCCGCCGTCTGCTCTCCCTCCTCGCCATCTTTCTCAAGAATTACAGCTCGAAATTTCTCAATGGGTTCAAAAAGTAGAGGGTTCCCTTCACTGGTGTGTCGAGTTAGATTGAAGAAGTCATCCTTTTCTGCCGTTGTCAGAGGGGTGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTGGTCAAATCCCACTCAATATGAGCCAAGAGATGTTCAAAATGCAAAAGATGGGGTCGAAAGCTTGGACCAGCCTAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGCCTCTGAAGACGATCCTGGGATGAGGCATAAACTACTCCGGCTGGGAAGAAAGTTGAAAGAGATAGATGAAGATGTGAAGAGACATAATGAACTTCTTGAAGTGGTCAGAGCAGCAGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTGCATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGCTTTGGCAAAACTTGGGAATTCCTGCCTTTCTGCTGTACAAACATATGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATAAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAACATGATGAAAGATGAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTGAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCGTTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGACCCCAGAGAAGCTTCACACGTGGATAAAGACGGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAGGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAAGAATTGAAGCATCTCCTTGAGAAAAATTTCATGTGAGGTAAAGCATTTGGTAGTTAGGATTTATGTCTTTAGTTATGCATGAAAAATTGTATAGTGATTTTTTTTCTTTGTTGTATAACAAAATGAATATGAATAATGTATTCAAAATAGTTTATTTTCTCCATGGAAGTAGAGTGTGGTTCAAGTGGCTAGATAGTTGTACCTGTATTATGGCTTGTAGTTTCATGTTAGAATCTATTCTCTCTTTATTTTTCAATTTATTATTCTTTCACATAAAA

Coding sequence (CDS)

ATGGAGCTCCATTGCGCAACTCTTCAAACTTCATTCTCTTTCTCTATCAGAAGCAAAGCTCTTGCTCATGGAGATGCCTCCGCCGTCTGCTCTCCCTCCTCGCCATCTTTCTCAAGAATTACAGCTCGAAATTTCTCAATGGGTTCAAAAAGTAGAGGGTTCCCTTCACTGGTGTGTCGAGTTAGATTGAAGAAGTCATCCTTTTCTGCCGTTGTCAGAGGGGTGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTGGTCAAATCCCACTCAATATGAGCCAAGAGATGTTCAAAATGCAAAAGATGGGGTCGAAAGCTTGGACCAGCCTAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGCCTCTGAAGACGATCCTGGGATGAGGCATAAACTACTCCGGCTGGGAAGAAAGTTGAAAGAGATAGATGAAGATGTGAAGAGACATAATGAACTTCTTGAAGTGGTCAGAGCAGCAGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTGCATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGCTTTGGCAAAACTTGGGAATTCCTGCCTTTCTGCTGTACAAACATATGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATAAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAACATGATGAAAGATGAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTGAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCGTTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGACCCCAGAGAAGCTTCACACGTGGATAAAGACGGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAGGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAAGAATTGAAGCATCTCCTTGAGAAAAATTTCATGTGA

Protein sequence

MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCRVRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEPRDVQNAKDGVESLDQPKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRKLKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQNALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQVIVKLEELKHLLEKNFM
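The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and not part of any tool used to build this page. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS above; matches the first 10 residues of the protein.
print(translate("ATGGAGCTCCATTGCGCAACTCTTCAAACT"))  # MELHCATLQT
# Last 12 nt of the CDS above, ending in the TGA stop codon.
print(translate("AATTTCATGTGA"))  # NFM
```

Passing the full CDS string to the same function would be the natural way to compare the complete translation against the listed protein sequence.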
Homology
BLAST of Bhi04G002045 vs. TAIR 10
Match: AT1G36320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37920.1); Has 93 Blast hits to 90 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 518.8 bits (1335), Expect = 4.1e-147
Identity = 247/338 (73.08%), Positives = 302/338 (89.35%), Query Frame = 0

Query: 101 QNAKDGVES--LDQPKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQ 160
           +  KDG E   +D  +M KVCDKLIEVFM+DKPTP+DWRRL+AFSKEWD+IRPHF++RCQ
Sbjct: 77  EEKKDGSEEVVVDNQRMIKVCDKLIEVFMVDKPTPSDWRRLLAFSKEWDSIRPHFYKRCQ 136

Query: 161 DRAASEDDPGMRHKLLRLGRKLKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTK 220
           +RA SED+P M+HK+ RL RKLKE+DED++RHNELL V++   P+E+GE+++RRRKDFT 
Sbjct: 137 ERADSEDNPEMKHKVHRLARKLKEVDEDIQRHNELLNVIKRTPPAEIGELVARRRKDFTN 196

Query: 221 EFFVHLHTVAESYYDDPAEQNALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINS 280
           EFF HLHTVAESYYD+P EQNALA LG   ++AVQ YD +TE+I+ALNAAE+K QDIINS
Sbjct: 197 EFFEHLHTVAESYYDNPDEQNALASLGKLSIAAVQAYDTSTESIDALNAAEMKLQDIINS 256

Query: 281 PTLDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQ 340
           P+LDAACRKID+LAEKNQLDSALVLMITKAWSAAKESNMMK+EVKDILYHLYVTA GNLQ
Sbjct: 257 PSLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKEEVKDILYHLYVTARGNLQ 316

Query: 341 RLMPKEIRILKYLLTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVD 400
           RLMPKE+RILKYLL+I+DP+E++SAL+DAFTPG+ELEG DVD LYTTPE L + +KTV++
Sbjct: 317 RLMPKEVRILKYLLSIEDPQEQISALQDAFTPGDELEGTDVDYLYTTPEHLQSLMKTVLE 376

Query: 401 AYHFSREGTLIKEARDLMNPQVIVKLEELKHLLEKNFM 437
           AYHFSREG+L+KEA+DLM+P++I K+E+LK L+EK +M
Sbjct: 377 AYHFSREGSLVKEAKDLMHPELIAKIEQLKKLVEKKYM 414
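The identity and positives percentages in the HSP summary are simple ratios of matching columns to alignment length. The sketch below reproduces the figures reported for this HSP (247/338 and 302/338); the helper name `percent` is hypothetical, and two-decimal rounding is assumed because that is how the values are printed on this page.

```python
def percent(count, aligned_length):
    """Format count/aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100 * count / aligned_length:.2f}"


# Values taken from the AT1G36320.1 HSP summary.
print(percent(247, 338))  # identity  -> 73.08
print(percent(302, 338))  # positives -> 89.35
```

The same calculation applies to the other hits on this page, for example 150/373 for AT4G37920.1.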

BLAST of Bhi04G002045 vs. TAIR 10
Match: AT4G37920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane, chloroplast, chloroplast envelope; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36320.1); Has 123 Blast hits to 120 proteins in 40 species: Archae - 2; Bacteria - 11; Metazoa - 8; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 299.3 bits (765), Expect = 5.1e-81
Identity = 150/373 (40.21%), Positives = 245/373 (65.68%), Query Frame = 0

Query: 68  FSAVVRGVSAVPSDCN----SETLDWSNPTQYEPRDVQNAKDGVESLDQPKMTKVCDKLI 127
           FSA + G   +         ++T+ ++  T  E +        VE  +   M + CDK+I
Sbjct: 32  FSAFINGGRKIRKSSTITFATDTVTYNGTTSAEVKSSVEDPMEVEVAEGYTMAQFCDKII 91

Query: 128 EVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRKLKEI 187
           ++F+ +KP    W+  +    EW+    +F++RC+ RA +E DP ++ KL+ L  K+K+I
Sbjct: 92  DLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRCRIRADTETDPILKQKLVSLESKVKKI 151

Query: 188 DEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQNALAK 247
           D+++++HN+LL+ ++   P+++  I ++RR+DFT EFF ++  ++E+  D   +++A+A+
Sbjct: 152 DKEMEKHNDLLKEIQ-ENPTDINAIAAKRRRDFTGEFFRYVTLLSET-LDGLEDRDAVAR 211

Query: 248 LGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDSALVL 307
           L   CLSAV  YD   E++E L+ A+ KF+DI+NSP++D+AC KI +LA+  +LDS+L+L
Sbjct: 212 LATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDSACEKIRSLAKAKELDSSLIL 271

Query: 308 MITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEEKLSA 367
           +I  A++AAKES  + +E KDI+YHLY     +L+ + PKEI++LKYLL I DPEE+ SA
Sbjct: 272 LINSAYAAAKESQTVTNEAKDIMYHLYKATKSSLRSITPKEIKLLKYLLNITDPEERFSA 331

Query: 368 LKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQVIVK 427
           L  AF+PG++ E +D   LYTTP++LH WIK ++DAYH ++E T IKEA+ +  P VI +
Sbjct: 332 LATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAKQMSQPIVIQR 391

Query: 428 LEELKHLLEKNFM 437
           L  LK  +E  ++
Sbjct: 392 LFILKDTIEDEYL 402

BLAST of Bhi04G002045 vs. ExPASy Swiss-Prot
Match: Q84WN0 (Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 PE=2 SV=2)

HSP 1 Score: 299.3 bits (765), Expect = 7.2e-80
Identity = 150/373 (40.21%), Positives = 245/373 (65.68%), Query Frame = 0

Query: 68  FSAVVRGVSAVPSDCN----SETLDWSNPTQYEPRDVQNAKDGVESLDQPKMTKVCDKLI 127
           FSA + G   +         ++T+ ++  T  E +        VE  +   M + CDK+I
Sbjct: 32  FSAFINGGRKIRKSSTITFATDTVTYNGTTSAEVKSSVEDPMEVEVAEGYTMAQFCDKII 91

Query: 128 EVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRKLKEI 187
           ++F+ +KP    W+  +    EW+    +F++RC+ RA +E DP ++ KL+ L  K+K+I
Sbjct: 92  DLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRCRIRADTETDPILKQKLVSLESKVKKI 151

Query: 188 DEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQNALAK 247
           D+++++HN+LL+ ++   P+++  I ++RR+DFT EFF ++  ++E+  D   +++A+A+
Sbjct: 152 DKEMEKHNDLLKEIQ-ENPTDINAIAAKRRRDFTGEFFRYVTLLSET-LDGLEDRDAVAR 211

Query: 248 LGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDSALVL 307
           L   CLSAV  YD   E++E L+ A+ KF+DI+NSP++D+AC KI +LA+  +LDS+L+L
Sbjct: 212 LATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDSACEKIRSLAKAKELDSSLIL 271

Query: 308 MITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEEKLSA 367
           +I  A++AAKES  + +E KDI+YHLY     +L+ + PKEI++LKYLL I DPEE+ SA
Sbjct: 272 LINSAYAAAKESQTVTNEAKDIMYHLYKATKSSLRSITPKEIKLLKYLLNITDPEERFSA 331

Query: 368 LKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQVIVK 427
           L  AF+PG++ E +D   LYTTP++LH WIK ++DAYH ++E T IKEA+ +  P VI +
Sbjct: 332 LATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAKQMSQPIVIQR 391

Query: 428 LEELKHLLEKNFM 437
           L  LK  +E  ++
Sbjct: 392 LFILKDTIEDEYL 402

BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match: A0A1S3B306 (uncharacterized protein At4g37920, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103485431 PE=4 SV=1)

HSP 1 Score: 792.3 bits (2045), Expect = 9.9e-226
Identity = 405/437 (92.68%), Positives = 415/437 (94.97%), Query Frame = 0

Query: 1   MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCR 60
           MELH ATLQTSFSFSIR K+LA GDASA CSPSSPS SRIT RNFS+GSKSRGFPSL+CR
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRGFPSLLCR 60

Query: 61  VRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKVC 120
            R KKSSFS  VRGVSAVPSDCNSETLD  NP+  E  RDVQNAKD VESLDQ KMTKVC
Sbjct: 61  DRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRK 180
           DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFF RCQDRAASEDDPGM+HKLLRLGRK
Sbjct: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGRK 180

Query: 181 LKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
           LKEIDEDV+RHNELLEVVRA APSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240

Query: 241 ALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
           ALAKLGNSCL+AVQTYDAATENIEAL+AAELKFQDIINSPTLDAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
           ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
           KLSALKDAFTPGEE+EGQDVDCLYTTP+KLH WIKTVVDAYHFSREGTLIKEARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNPQ 420

Query: 421 VIVKLEELKHLLEKNFM 437
           VIVKLEELKHLLEK FM
Sbjct: 421 VIVKLEELKHLLEKKFM 437

BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match: A0A1S3B2I9 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485431 PE=4 SV=1)

HSP 1 Score: 787.7 bits (2033), Expect = 2.4e-224
Identity = 405/438 (92.47%), Positives = 415/438 (94.75%), Query Frame = 0

Query: 1   MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSR-GFPSLVC 60
           MELH ATLQTSFSFSIR K+LA GDASA CSPSSPS SRIT RNFS+GSKSR GFPSL+C
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRAGFPSLLC 60

Query: 61  RVRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKV 120
           R R KKSSFS  VRGVSAVPSDCNSETLD  NP+  E  RDVQNAKD VESLDQ KMTKV
Sbjct: 61  RDRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKV 120

Query: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGR 180
           CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFF RCQDRAASEDDPGM+HKLLRLGR
Sbjct: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGR 180

Query: 181 KLKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240
           KLKEIDEDV+RHNELLEVVRA APSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ
Sbjct: 181 KLKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240

Query: 241 NALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300
           NALAKLGNSCL+AVQTYDAATENIEAL+AAELKFQDIINSPTLDAACRKIDNLAEKNQLD
Sbjct: 241 NALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300

Query: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360
           SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE
Sbjct: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360

Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNP 420
           EKLSALKDAFTPGEE+EGQDVDCLYTTP+KLH WIKTVVDAYHFSREGTLIKEARDLMNP
Sbjct: 361 EKLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNP 420

Query: 421 QVIVKLEELKHLLEKNFM 437
           QVIVKLEELKHLLEK FM
Sbjct: 421 QVIVKLEELKHLLEKKFM 438

BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match: A0A0A0LMH6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G403700 PE=4 SV=1)

HSP 1 Score: 773.1 bits (1995), Expect = 6.2e-220
Identity = 391/437 (89.47%), Positives = 408/437 (93.36%), Query Frame = 0

Query: 1   MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCR 60
           MELH ATL TSFSFSIRS  LAHGDASA CSPS PS SRIT RNFS+GSKSRGFPSLVC 
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITIRNFSLGSKSRGFPSLVCH 60

Query: 61  VRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKVC 120
            R KKSSFSA VRGV AVPSDCNSETLD  NP+  EP RDVQNAKD VE+LDQ KMTKVC
Sbjct: 61  DRPKKSSFSAFVRGVKAVPSDCNSETLDLLNPSSDEPVRDVQNAKDSVENLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRK 180
           DKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFF RCQDRAASEDDPGM+HKLLR GRK
Sbjct: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180

Query: 181 LKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
           LKEIDEDV+RHNELLEVVRA +PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+QN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240

Query: 241 ALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
           ALAKLGNSCL+AVQTYDAATENIEALNAAELKFQDIINSPT+DAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
           ALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
           KLSALKDAFTPGEELEGQDVDCLYTTPE+LHTW+KTVVDAYHFSREGTL++EARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420

Query: 421 VIVKLEELKHLLEKNFM 437
           +IVKLEELK L+EK FM
Sbjct: 421 LIVKLEELKGLIEKKFM 437

BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match: A0A6J1JSR8 (uncharacterized protein At4g37920-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489439 PE=4 SV=1)

HSP 1 Score: 756.9 bits (1953), Expect = 4.6e-215
Identity = 390/437 (89.24%), Positives = 406/437 (92.91%), Query Frame = 0

Query: 1   MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCR 60
           MELHCATLQ SFSF IR K L HGDASA CS SS S SRITAR+FS+GSKSRGFPSL  R
Sbjct: 1   MELHCATLQASFSFYIRGKTLPHGDASATCSSSSSSVSRITARSFSLGSKSRGFPSLTWR 60

Query: 61  VRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKVC 120
           VRLKKSS SAVVRG SA PS C+++TLD SN T  +  RDVQNAK+ VE LDQ KMTKVC
Sbjct: 61  VRLKKSSSSAVVRGGSAEPSHCSTDTLDSSNTTPDDSVRDVQNAKNDVECLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRK 180
           DKLIEVF+IDKPTPTDWRRLIAFSK WDNIRPHFFRRCQ+RAASEDDPGMRHKLLRLGRK
Sbjct: 121 DKLIEVFLIDKPTPTDWRRLIAFSKTWDNIRPHFFRRCQERAASEDDPGMRHKLLRLGRK 180

Query: 181 LKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
           LKEIDEDV+RHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYY DPAEQN
Sbjct: 181 LKEIDEDVQRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYADPAEQN 240

Query: 241 ALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
           ALAKLGNSCL AVQ YDAATENIEALNAAELKFQDIINSPTLDAACRKID+LAEKNQLDS
Sbjct: 241 ALAKLGNSCLVAVQAYDAATENIEALNAAELKFQDIINSPTLDAACRKIDSLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
           ALVLMI+KAWSAAKESNMMKDEVKDILYHLYVTA GNLQRLMPKEIRILKYLLTIKDPEE
Sbjct: 301 ALVLMISKAWSAAKESNMMKDEVKDILYHLYVTARGNLQRLMPKEIRILKYLLTIKDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
           KLSALKDAFTPGEE+EGQDVDCLYTTPEKLHTWIKTV+DAYHFSREGTL+KEARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEEVEGQDVDCLYTTPEKLHTWIKTVLDAYHFSREGTLVKEARDLMNPQ 420

Query: 421 VIVKLEELKHLLEKNFM 437
           VIVKLEELK L+EK FM
Sbjct: 421 VIVKLEELKLLVEKKFM 437

BLAST of Bhi04G002045 vs. ExPASy TrEMBL
Match: A0A6J1JWA0 (uncharacterized protein At4g37920-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489439 PE=4 SV=1)

HSP 1 Score: 752.3 bits (1941), Expect = 1.1e-213
Identity = 390/438 (89.04%), Positives = 406/438 (92.69%), Query Frame = 0

Query: 1   MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSR-GFPSLVC 60
           MELHCATLQ SFSF IR K L HGDASA CS SS S SRITAR+FS+GSKSR GFPSL  
Sbjct: 1   MELHCATLQASFSFYIRGKTLPHGDASATCSSSSSSVSRITARSFSLGSKSRAGFPSLTW 60

Query: 61  RVRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKV 120
           RVRLKKSS SAVVRG SA PS C+++TLD SN T  +  RDVQNAK+ VE LDQ KMTKV
Sbjct: 61  RVRLKKSSSSAVVRGGSAEPSHCSTDTLDSSNTTPDDSVRDVQNAKNDVECLDQHKMTKV 120

Query: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGR 180
           CDKLIEVF+IDKPTPTDWRRLIAFSK WDNIRPHFFRRCQ+RAASEDDPGMRHKLLRLGR
Sbjct: 121 CDKLIEVFLIDKPTPTDWRRLIAFSKTWDNIRPHFFRRCQERAASEDDPGMRHKLLRLGR 180

Query: 181 KLKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240
           KLKEIDEDV+RHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYY DPAEQ
Sbjct: 181 KLKEIDEDVQRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYADPAEQ 240

Query: 241 NALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300
           NALAKLGNSCL AVQ YDAATENIEALNAAELKFQDIINSPTLDAACRKID+LAEKNQLD
Sbjct: 241 NALAKLGNSCLVAVQAYDAATENIEALNAAELKFQDIINSPTLDAACRKIDSLAEKNQLD 300

Query: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360
           SALVLMI+KAWSAAKESNMMKDEVKDILYHLYVTA GNLQRLMPKEIRILKYLLTIKDPE
Sbjct: 301 SALVLMISKAWSAAKESNMMKDEVKDILYHLYVTARGNLQRLMPKEIRILKYLLTIKDPE 360

Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNP 420
           EKLSALKDAFTPGEE+EGQDVDCLYTTPEKLHTWIKTV+DAYHFSREGTL+KEARDLMNP
Sbjct: 361 EKLSALKDAFTPGEEVEGQDVDCLYTTPEKLHTWIKTVLDAYHFSREGTLVKEARDLMNP 420

Query: 421 QVIVKLEELKHLLEKNFM 437
           QVIVKLEELK L+EK FM
Sbjct: 421 QVIVKLEELKLLVEKKFM 438

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
AT1G36320.1  4.1e-147  73.08     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G37920.1  5.1e-81   40.21     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

Match Name   E-value   Identity  Description
Q84WN0       7.2e-80   40.21     Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 P...

Match Name   E-value   Identity  Description
A0A1S3B306   9.9e-226  92.68     uncharacterized protein At4g37920, chloroplastic isoform X2 OS=Cucumis melo OX=3...
A0A1S3B2I9   2.4e-224  92.47     uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3...
A0A0A0LMH6   6.2e-220  89.47     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G403700 PE=4 SV=1
A0A6J1JSR8   4.6e-215  89.24     uncharacterized protein At4g37920-like isoform X2 OS=Cucurbita maxima OX=3661 GN...
A0A6J1JWA0   1.1e-213  89.04     uncharacterized protein At4g37920-like isoform X1 OS=Cucurbita maxima OX=3661 GN...
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term   IPR Description                         Source   Source Term    Source Description    Alignment
None       No IPR available                        COILS    Coil           Coil                  coord: 180..200
None       No IPR available                        PANTHER  PTHR31755:SF3  FOLATE RECEPTOR-LIKE  coord: 6..436
IPR040320  Uncharacterized protein At4g37920-like  PANTHER  PTHR31755      FOLATE RECEPTOR-LIKE  coord: 6..436

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi04M002045  Bhi04M002045  mRNA