Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAGAAATAGAGGTGAGAAGTATGAGCGTGTCTCTTTCTAGACCTTTCCCTCCATTTCTTCTACACACTTTTTCTTCCTTTACTCATCACTGTCTCTTCCATTCGAAATCCATTCTCTGGAACTCACATTCCAAGCATCGTTATCTCTCTCTCCGTCTCTCCATGGCTACGCTAACTGCAGCTTCTTCTTCTTCTTCTTCTTACATGTGCTTAACCAAGGTATCATCACCTCCATTACCTTCCACTTCATTCCCCATTTCGTTCCCATATCATCCGAAGCTCCCTCGCGATTCTTCTTTCTCTTTGGCTTCCCCTGTGACTTCAAGGACAAGCGTTCGGTTCAATCCGTCTTTTGCACGGGACGACGAGTTCGGTGATTTTGTGGAAATGAAGGAAACAAGCGAGACGCGTTTGTACTCTCTATCGCCTTTTCCTTTACTGTTCATCGCTGCGCTTCCTGGAGGTACTCACTCACTTCTCTCGTTATTCAGTGTTTTTATGTTGTCTTGAATTTGAGTGATTGGTTTCTGTTTTCTTCTGGAGTCAACGCTTTACTGTATCTTCTCGATGTAGAATGATTTGTACTTTACTTGACCTTAGACTGTTTTTGATGTGTTAGTGGTATTACGAAAGGAGATTTTGTTGTCGCTCCGAACTTATGGGATTGTTGCGAAGAAACTTTTTGCAATGCTCACACTATCTCAATAAAAATTATGCGAGAGAAGTGAATTCGAACTTCTTTACGAATTCTGTGTTCTTCAAGGCACATTACGTCTGTACTGAACAGTTTATGCAGTTTAGACTTAATCTTGTTTACACTTTTCGAATCTTGTGTCTGCAGACGATCATCCTCCAATCATCTCTACAATGGTATGATATTGTCTACTTTAGACATAAATCTTCATGCTTTTGCTTTTGGTTTCACTTCGAAAGGCTTCATATCATTGGAGATAGTCGTTTTCCCTTATAAACTCATGATCACCCCTTATTTAAGCAATCGGGGATTTTGGTCGCATTTCTAACAATCCTCCTCTCTAACAAAGGATCACATAGTCTCCTCTCAAATAGTCCTCCAACCTTCTAACTCTTATCAAGGCTTGCTCTTGTTCACCAAAGTACCCAATCTATCCAGTAAAGTTCAACCACGGATCACATCGTGACTACTTTCGAGGCTCCACTACACCTCTGTTCGGCACTTGAGGATTCTACCGACTAGGTTAGGTTATCACTTGTTGGGAAAAACCTCTCACTTCTAGAAAAAACATAAAAATAACAGAAAAATTAAAAGAAATCAATAAAAACTGAGAATACAAGAATTTACATGGAAAACTCCTGATTTGGAGAAAGCCATAGGCTACCAGCAAGAAATCCATTTTGTGAAAAATTGTTACAATCACGTAGAACAATTTTCTCTTCTGATCCCAATTACAAGAGCACTCATTCAAAGCTTTTATATTATTTACACCCGCACACTCAAGCTTAAATTGTTTCTAATTGGGACGACTGAAAACCAAGGGCATGGACTCTTTCTATAGACTTAGAGCCCATCCCCATTTTTCAATTTAACCGAGGTGGGATTCTTCACTTTCAATATTTTTGCCCTGTTTCAATTTTGGAGTAAACTAATTTTGAATAGATTCTGCTCTTTTCTCAAAACTGTTTATCTTAATTTGTTGCCCCCTTTGAGAGTTGATAGATTAGGTTTTCCAGCTGGCGGTTTCAGAGTAAAGAAGGCAAGATTGAAGTCCATTTGGTGGACTAATTTCATTCTGATTGAGACGTGGGATATTCTGATGGAAGCAAATATAAGAGCATTTGAAAATAAGAAAAGAAACCTTATGAGTCGTTGGATTTAGATTACTAGTAGCCAAAGCATCTTGTAGTTAGCCCTTTTTTGGAATTCGCTCATTGAAGAATGAGGAGTTATCTAAAAGAAATATACATCTCCCTTTGCATTTCTTCACAGGACGACTTGTTCTTCCTTTTGTATATATGTTCATGAAATACAAGTTAATAAATTGTTTCTTTCTTAGACATATGCACATGAAAGCACAAATAGATAATATTTTTCTTTTCATTCTTTCCTTTTAATTGAGGTGAGATGCAATCTTTGGTCACTTGTCATGAACTTGAATGGATACCTAAATGACAGCGGGAACTGTGAGGTCTCTCTTTGGTCCTTTCGTTGAGCTTGTTAAATCTTGGAATCTTCCTGAATGGCTGGTACATTGGGGTCATCCTGGCAACATGGCAAGTTTTCTTTTGTTTTGCTTCTTATGTAGTAATGTTATCATTGATTTTCTAATCCGAGCTGTGTCCTGATTAGGCTGTTGTGCTCTTCGCCATGGGTGGCTATGGAACATATTTAGGTTTTCGAATCCGTTACTCTGACAACGTGGTATGAATATTAGCTTCAGACCTTTTCTGCACTTACAGAAAGTCTCCATTCACAATCTAGTTACATCGTACATTTGAACTGATCCAGGAGGAGAAGGCTAATGCCAAAGACTTGCATCCAAAGCTTCTAGGCGGGATGTTTTTCTTTTTCGCTCTTGGAGCAACAGGTGGAATCACATCTCTACTTACATCAGACAAACCTATATTCGAGAGGTACAGTGATAAACCAATTTCTCTACCCTTTTCTCCCTCCTTATCAAAGCACATTGGTTTAATTTCAATTGCATCATTCTGTCCAAATTCCGAACTACTTTATCAACTTTAACTCCTGCCCCAAGTTGATTATTTCTAAGAACAACTTTGGAGTTGGTTTTGTACCCCCTTAGTGAAAAGGGTAAAATGTTGTGGCAAGCGAGCTTCTTTTCTATTTTTTGGGGTATTTGATCCGAGTGCAATAGCAGTTTTTTTTAGGAATTGTGGAGAAATTTAGAGAGGTTTGGAATTTGGTGAGATTTAATGCTTTCTTTTGGGCATCCGTCACTTGATTGTTTTATAATTATGATATTGTTCTTGTTTTTGTTTTTCGTATTGGAGTCCTTTCTATAGTTAGTGTTGGGCTCCTGTTGTTTCGCTTATTTTTTGTCTCTGTATTCTTTCATTTTTCAATGAAAGTTAGGTTTCTTTTGAACGAAAAAAATTCATATGAAGCACTTCTATGACAGCTTATTAATTATTTTTATAATGACTTAGCATAAGTTCCTAGGACCAAAATTCAACTCAGACAAAAGCTGTCAATATAATCTTAACTTCGGTTTCGTCTTATCTCTAGTCCGCATGCTGTAACGGGGTTCATCGGCCTCACACTCTTGACTGTACAAACTCTCCTGCCTTCACTTTTTGAGGTAACGATTGACCAATGCTTAACCTTCCTTTCCCTACTACTTGTCCTCTTATTGTTATTCCCATGTGCAGAATAATCCTGGGCTGAGGAATGTTCATGGTATTTTGGGTAGTGGAATCATGACACTGTTTCTCATCCATGCTGCACTTGGACTTCAACTTGGCCTCAGTTACTAAATCTTAACGGCATGTATTGAAGAACCGCATAGGTTATTCCGAATCTTTCTTTCAATACAATCTTCTTTCATCGTCACGCAGCCTGCAATCGTAACAAAGATGCCTTGGCGCAATCCCCTCTCAGATGAGGAAAACTTCATTTTACTGCCTTTTCCTCGGGTATTATTCCGGGCTGCCTTCCGTGTTCACTGTTTATTCATTGTTCTCATATTTTCACCAATCTATGTAGGGCCTTTCATTTGCTACTTATTCTTTTTCTTGCTGTTGGTGATAGTATATGTTTAATCTTGTTTTCTAATTGCTTTGTTATGTATATTGAGACAATCAATTTATATGTTAAGTGTGATCATGTGTATTAATATACGGTTTGAACCCTTGAGATGGTTTTCTTACATTTAATCCTTTTTCGTCGTCGTTTA
mRNA sequence
TAGAAATAGAGGTGAGAAGTATGAGCGTGTCTCTTTCTAGACCTTTCCCTCCATTTCTTCTACACACTTTTTCTTCCTTTACTCATCACTGTCTCTTCCATTCGAAATCCATTCTCTGGAACTCACATTCCAAGCATCGTTATCTCTCTCTCCGTCTCTCCATGGCTACGCTAACTGCAGCTTCTTCTTCTTCTTCTTCTTACATGTGCTTAACCAAGGTATCATCACCTCCATTACCTTCCACTTCATTCCCCATTTCGTTCCCATATCATCCGAAGCTCCCTCGCGATTCTTCTTTCTCTTTGGCTTCCCCTGTGACTTCAAGGACAAGCGTTCGGTTCAATCCGTCTTTTGCACGGGACGACGAGTTCGGTGATTTTGTGGAAATGAAGGAAACAAGCGAGACGCGTTTGTACTCTCTATCGCCTTTTCCTTTACTGTTCATCGCTGCGCTTCCTGGAGCGGGAACTGTGAGGTCTCTCTTTGGTCCTTTCGTTGAGCTTGTTAAATCTTGGAATCTTCCTGAATGGCTGGTACATTGGGGTCATCCTGGCAACATGGCTGTTGTGCTCTTCGCCATGGGTGGCTATGGAACATATTTAGGTTTTCGAATCCGTTACTCTGACAACGTGGAGGAGAAGGCTAATGCCAAAGACTTGCATCCAAAGCTTCTAGGCGGGATGTTTTTCTTTTTCGCTCTTGGAGCAACAGGTGGAATCACATCTCTACTTACATCAGACAAACCTATATTCGAGAGTCCGCATGCTGTAACGGGGTTCATCGGCCTCACACTCTTGACTGTACAAACTCTCCTGCCTTCACTTTTTGAGAATAATCCTGGGCTGAGGAATGTTCATGGTATTTTGGGTAGTGGAATCATGACACTGTTTCTCATCCATGCTGCACTTGGACTTCAACTTGGCCTCAGTTACTAAATCTTAACGGCATGTATTGAAGAACCGCATAGGTTATTCCGAATCTTTCTTTCAATACAATCTTCTTTCATCGTCACGCAGCCTGCAATCGTAACAAAGATGCCTTGGCGCAATCCCCTCTCAGATGAGGAAAACTTCATTTTACTGCCTTTTCCTCGGGTATTATTCCGGGCTGCCTTCCGTGTTCACTGTTTATTCATTGTTCTCATATTTTCACCAATCTATGTAGGGCCTTTCATTTGCTACTTATTCTTTTTCTTGCTGTTGGTGATAGTATATGTTTAATCTTGTTTTCTAATTGCTTTGTTATGTATATTGAGACAATCAATTTATATGTTAAGTGTGATCATGTGTATTAATATACGGTTTGAACCCTTGAGATGGTTTTCTTACATTTAATCCTTTTTCGTCGTCGTTTA
Coding sequence (CDS)
ATGAGCGTGTCTCTTTCTAGACCTTTCCCTCCATTTCTTCTACACACTTTTTCTTCCTTTACTCATCACTGTCTCTTCCATTCGAAATCCATTCTCTGGAACTCACATTCCAAGCATCGTTATCTCTCTCTCCGTCTCTCCATGGCTACGCTAACTGCAGCTTCTTCTTCTTCTTCTTCTTACATGTGCTTAACCAAGGTATCATCACCTCCATTACCTTCCACTTCATTCCCCATTTCGTTCCCATATCATCCGAAGCTCCCTCGCGATTCTTCTTTCTCTTTGGCTTCCCCTGTGACTTCAAGGACAAGCGTTCGGTTCAATCCGTCTTTTGCACGGGACGACGAGTTCGGTGATTTTGTGGAAATGAAGGAAACAAGCGAGACGCGTTTGTACTCTCTATCGCCTTTTCCTTTACTGTTCATCGCTGCGCTTCCTGGAGCGGGAACTGTGAGGTCTCTCTTTGGTCCTTTCGTTGAGCTTGTTAAATCTTGGAATCTTCCTGAATGGCTGGTACATTGGGGTCATCCTGGCAACATGGCTGTTGTGCTCTTCGCCATGGGTGGCTATGGAACATATTTAGGTTTTCGAATCCGTTACTCTGACAACGTGGAGGAGAAGGCTAATGCCAAAGACTTGCATCCAAAGCTTCTAGGCGGGATGTTTTTCTTTTTCGCTCTTGGAGCAACAGGTGGAATCACATCTCTACTTACATCAGACAAACCTATATTCGAGAGTCCGCATGCTGTAACGGGGTTCATCGGCCTCACACTCTTGACTGTACAAACTCTCCTGCCTTCACTTTTTGAGAATAATCCTGGGCTGAGGAATGTTCATGGTATTTTGGGTAGTGGAATCATGACACTGTTTCTCATCCATGCTGCACTTGGACTTCAACTTGGCCTCAGTTACTAA
Protein sequence
MSVSLSRPFPPFLLHTFSSFTHHCLFHSKSILWNSHSKHRYLSLRLSMATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTSVRFNPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIMTLFLIHAALGLQLGLSY
Homology
BLAST of Bhi03G001198 vs. TAIR 10
Match:
AT2G36885.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 172 Blast hits to 172 proteins in 58 species: Archae - 0; Bacteria - 116; Metazoa - 0; Fungi - 0; Plants - 32; Viruses - 0; Other Eukaryotes - 24 (source: NCBI BLink). )
HSP 1 Score: 303.5 bits (776), Expect = 1.9e-82
Identity = 167/264 (63.26%), Postives = 192/264 (72.73%), Query Frame = 0
Query: 42 LSLRLSMATLTAASSSSSSYMCLTKVSSPPLPSTSFP-ISFPYHPKLPRDSSFSLASPVT 101
LS +S A+ +S +S CL++ S+ +SFP +S K+P + S
Sbjct: 10 LSNHISPASSLPSSRLLNSTQCLSRFSN----VSSFPALSTFRRRKIPLTPACSSIVDGD 69
Query: 102 SRTSVRFNPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVE 161
R DDE E ET + S+SP PLL +A+LPGA TVRS+FGP VE
Sbjct: 70 EEIEAR------GDDE-------NEIRETLMLSVSPLPLLLVASLPGAETVRSVFGPVVE 129
Query: 162 LVKSWNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGG 221
+VKS NLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD++EEKA AKDLHPKLL G
Sbjct: 130 IVKSLNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDIEEKAKAKDLHPKLLAG 189
Query: 222 MFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHG 281
MFFFFALGATGG+ SLLTSDKPIFESPHAVTG IGL LLTVQT+LPSLF+ P LRNVHG
Sbjct: 190 MFFFFALGATGGVISLLTSDKPIFESPHAVTGLIGLGLLTVQTILPSLFKEKPELRNVHG 249
Query: 282 ILGSGIMTLFLIHAALGLQLGLSY 305
ILGSGIM LFL+HAA GLQLGLS+
Sbjct: 250 ILGSGIMALFLVHAAFGLQLGLSF 256
BLAST of Bhi03G001198 vs. TAIR 10
Match:
AT2G36885.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 298.9 bits (764), Expect = 4.6e-81
Identity = 167/264 (63.26%), Postives = 192/264 (72.73%), Query Frame = 0
Query: 42 LSLRLSMATLTAASSSSSSYMCLTKVSSPPLPSTSFP-ISFPYHPKLPRDSSFSLASPVT 101
LS +S A+ +S +S CL++ S+ +SFP +S K+P + S
Sbjct: 10 LSNHISPASSLPSSRLLNSTQCLSRFSN----VSSFPALSTFRRRKIPLTPACSSIVDGD 69
Query: 102 SRTSVRFNPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVE 161
R DDE E ET + S+SP PLL +A+LPGA TVRS+FGP VE
Sbjct: 70 EEIEAR------GDDE-------NEIRETLMLSVSPLPLLLVASLPGAETVRSVFGPVVE 129
Query: 162 LVKSWNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGG 221
+VKS NLP+WLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD++EEKA AKDLHPKLL G
Sbjct: 130 IVKSLNLPDWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDIEEKAKAKDLHPKLLAG 189
Query: 222 MFFFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHG 281
MFFFFALGATGG+ SLLTSDKPIFESPHAVTG IGL LLTVQT+LPSLF+ P LRNVHG
Sbjct: 190 MFFFFALGATGGVISLLTSDKPIFESPHAVTGLIGLGLLTVQTILPSLFK-KPELRNVHG 249
Query: 282 ILGSGIMTLFLIHAALGLQLGLSY 305
ILGSGIM LFL+HAA GLQLGLS+
Sbjct: 250 ILGSGIMALFLVHAAFGLQLGLSF 255
BLAST of Bhi03G001198 vs. NCBI nr
Match:
XP_038880983.1 (uncharacterized protein LOC120072635 [Benincasa hispida])
HSP 1 Score: 515.4 bits (1326), Expect = 3.4e-142
Identity = 257/257 (100.00%), Postives = 257/257 (100.00%), Query Frame = 0
Query: 48 MATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTSVRF 107
MATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTSVRF
Sbjct: 1 MATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTSVRF 60
Query: 108 NPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNL 167
NPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNL
Sbjct: 61 NPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNL 120
Query: 168 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFAL 227
PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFAL
Sbjct: 121 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFAL 180
Query: 228 GATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIM 287
GATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIM
Sbjct: 181 GATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIM 240
Query: 288 TLFLIHAALGLQLGLSY 305
TLFLIHAALGLQLGLSY
Sbjct: 241 TLFLIHAALGLQLGLSY 257
BLAST of Bhi03G001198 vs. NCBI nr
Match:
XP_022977650.1 (uncharacterized protein LOC111477900 [Cucurbita maxima])
HSP 1 Score: 449.1 bits (1154), Expect = 3.0e-122
Identity = 227/256 (88.67%), Postives = 237/256 (92.58%), Query Frame = 0
Query: 48 MATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTSVRF 107
MATLT A SSSSSYMCLTKV PP STSFPI PY K+PRDS FSLASPVT R VRF
Sbjct: 1 MATLTGA-SSSSSYMCLTKV-LPPFSSTSFPILLPYLSKVPRDSPFSLASPVTGRRKVRF 60
Query: 108 NPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNL 167
NPSFARD EFGD VE +ET ETRLYSL+PFPLLF+AALPGAGTVRSLFGPFVELVKSWNL
Sbjct: 61 NPSFARDREFGDLVETRETRETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWNL 120
Query: 168 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFAL 227
PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFAL
Sbjct: 121 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSNDVEEKANAKDLHPKLLGGMFFFFAL 180
Query: 228 GATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIM 287
GATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE+NPGLRN+HGILGSGIM
Sbjct: 181 GATGGITSLLTSDKPIFESPHAVTGFIGLALLTLQSLLPSLFEDNPGLRNIHGILGSGIM 240
Query: 288 TLFLIHAALGLQLGLS 304
TLFLIHAALGLQLGLS
Sbjct: 241 TLFLIHAALGLQLGLS 254
BLAST of Bhi03G001198 vs. NCBI nr
Match:
XP_008462484.1 (PREDICTED: uncharacterized protein LOC103500827 [Cucumis melo])
HSP 1 Score: 448.4 bits (1152), Expect = 5.1e-122
Identity = 235/261 (90.04%), Postives = 241/261 (92.34%), Query Frame = 0
Query: 48 MATLTAA--SSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRD-SSFSLASPVTSRTS 107
MATLTAA SSSSSSYMCLTKV SPPLPSTSFPI FP PK PR+ SSFS ASP+ SRTS
Sbjct: 1 MATLTAASFSSSSSSYMCLTKV-SPPLPSTSFPIPFPTLPKTPRNSSSFSFASPLPSRTS 60
Query: 108 VRFNPSFARDDEFGDFVEMK-ETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVK 167
+RFNP FAR+D FGDFVEMK ETSE RLYSLSPFPLLFIAALPG GTVRSLFGPFVELVK
Sbjct: 61 LRFNPCFARNDHFGDFVEMKEETSEMRLYSLSPFPLLFIAALPGGGTVRSLFGPFVELVK 120
Query: 168 SWNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFF 227
S NLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFF
Sbjct: 121 SLNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAYAKDLHPKLLGGMFF 180
Query: 228 FFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILG 287
FFALGATGGI SLLTSDKPIFESPHAVTGFIGL LLTVQTLLPSLFE+NPGLRNVHGILG
Sbjct: 181 FFALGATGGIISLLTSDKPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILG 240
Query: 288 SGIMTLFLIHAALGLQLGLSY 305
SGIMTLFLIHAA GLQLGLSY
Sbjct: 241 SGIMTLFLIHAAFGLQLGLSY 260
BLAST of Bhi03G001198 vs. NCBI nr
Match:
KAG6604398.1 (hypothetical protein SDJN03_05007, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 448.4 bits (1152), Expect = 5.1e-122
Identity = 228/259 (88.03%), Postives = 239/259 (92.28%), Query Frame = 0
Query: 48 MATLTAA---SSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTS 107
MATLT A SSSSSSYMCLTKV PP STSFPI P K+PRDSSFSLASPVT R
Sbjct: 1 MATLTGASSSSSSSSSYMCLTKV-LPPFSSTSFPILLPNLSKVPRDSSFSLASPVTGRRK 60
Query: 108 VRFNPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKS 167
VRF+PSFARD EFGD VEM+ET ETRLYSL+PFPLLF+AALPGAGTVRSLFGPFVELVKS
Sbjct: 61 VRFSPSFARDREFGDLVEMRETRETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKS 120
Query: 168 WNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFF 227
WNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFF
Sbjct: 121 WNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSNDVEEKANAKDLHPKLLGGMFFF 180
Query: 228 FALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGS 287
FALGATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE+NPGLRN+HGILGS
Sbjct: 181 FALGATGGITSLLTSDKPIFESPHAVTGFIGLALLTLQSLLPSLFEDNPGLRNIHGILGS 240
Query: 288 GIMTLFLIHAALGLQLGLS 304
GIMTLFLIHAALGLQLGLS
Sbjct: 241 GIMTLFLIHAALGLQLGLS 258
BLAST of Bhi03G001198 vs. NCBI nr
Match:
XP_022925916.1 (uncharacterized protein LOC111433189 [Cucurbita moschata])
HSP 1 Score: 448.0 bits (1151), Expect = 6.6e-122
Identity = 227/256 (88.67%), Postives = 238/256 (92.97%), Query Frame = 0
Query: 48 MATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTSVRF 107
MATLT A SSSSSYMCLTKV PP STSFPI P K+PRDSSFSLASPVT R VRF
Sbjct: 1 MATLTGA-SSSSSYMCLTKV-LPPFSSTSFPILLPNLSKVPRDSSFSLASPVTGRRKVRF 60
Query: 108 NPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNL 167
+PSFARD EFGD VEM+ET ETRLYSL+PFPLLF+AALPGAGTVRSLFGPFVELVKSWNL
Sbjct: 61 SPSFARDREFGDLVEMRETRETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWNL 120
Query: 168 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFAL 227
PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFAL
Sbjct: 121 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSNDVEEKANAKDLHPKLLGGMFFFFAL 180
Query: 228 GATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIM 287
GATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE+NPGLRN+HGILGSGIM
Sbjct: 181 GATGGITSLLTSDKPIFESPHAVTGFIGLALLTLQSLLPSLFEDNPGLRNIHGILGSGIM 240
Query: 288 TLFLIHAALGLQLGLS 304
TLFLIHAALGLQLGLS
Sbjct: 241 TLFLIHAALGLQLGLS 254
BLAST of Bhi03G001198 vs. ExPASy TrEMBL
Match:
A0A6J1IMY7 (uncharacterized protein LOC111477900 OS=Cucurbita maxima OX=3661 GN=LOC111477900 PE=4 SV=1)
HSP 1 Score: 449.1 bits (1154), Expect = 1.4e-122
Identity = 227/256 (88.67%), Postives = 237/256 (92.58%), Query Frame = 0
Query: 48 MATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTSVRF 107
MATLT A SSSSSYMCLTKV PP STSFPI PY K+PRDS FSLASPVT R VRF
Sbjct: 1 MATLTGA-SSSSSYMCLTKV-LPPFSSTSFPILLPYLSKVPRDSPFSLASPVTGRRKVRF 60
Query: 108 NPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNL 167
NPSFARD EFGD VE +ET ETRLYSL+PFPLLF+AALPGAGTVRSLFGPFVELVKSWNL
Sbjct: 61 NPSFARDREFGDLVETRETRETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWNL 120
Query: 168 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFAL 227
PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFAL
Sbjct: 121 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSNDVEEKANAKDLHPKLLGGMFFFFAL 180
Query: 228 GATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIM 287
GATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE+NPGLRN+HGILGSGIM
Sbjct: 181 GATGGITSLLTSDKPIFESPHAVTGFIGLALLTLQSLLPSLFEDNPGLRNIHGILGSGIM 240
Query: 288 TLFLIHAALGLQLGLS 304
TLFLIHAALGLQLGLS
Sbjct: 241 TLFLIHAALGLQLGLS 254
BLAST of Bhi03G001198 vs. ExPASy TrEMBL
Match:
A0A1S3CHJ9 (uncharacterized protein LOC103500827 OS=Cucumis melo OX=3656 GN=LOC103500827 PE=4 SV=1)
HSP 1 Score: 448.4 bits (1152), Expect = 2.5e-122
Identity = 235/261 (90.04%), Postives = 241/261 (92.34%), Query Frame = 0
Query: 48 MATLTAA--SSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRD-SSFSLASPVTSRTS 107
MATLTAA SSSSSSYMCLTKV SPPLPSTSFPI FP PK PR+ SSFS ASP+ SRTS
Sbjct: 1 MATLTAASFSSSSSSYMCLTKV-SPPLPSTSFPIPFPTLPKTPRNSSSFSFASPLPSRTS 60
Query: 108 VRFNPSFARDDEFGDFVEMK-ETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVK 167
+RFNP FAR+D FGDFVEMK ETSE RLYSLSPFPLLFIAALPG GTVRSLFGPFVELVK
Sbjct: 61 LRFNPCFARNDHFGDFVEMKEETSEMRLYSLSPFPLLFIAALPGGGTVRSLFGPFVELVK 120
Query: 168 SWNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFF 227
S NLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFF
Sbjct: 121 SLNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAYAKDLHPKLLGGMFF 180
Query: 228 FFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILG 287
FFALGATGGI SLLTSDKPIFESPHAVTGFIGL LLTVQTLLPSLFE+NPGLRNVHGILG
Sbjct: 181 FFALGATGGIISLLTSDKPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILG 240
Query: 288 SGIMTLFLIHAALGLQLGLSY 305
SGIMTLFLIHAA GLQLGLSY
Sbjct: 241 SGIMTLFLIHAAFGLQLGLSY 260
BLAST of Bhi03G001198 vs. ExPASy TrEMBL
Match:
A0A6J1EDG7 (uncharacterized protein LOC111433189 OS=Cucurbita moschata OX=3662 GN=LOC111433189 PE=4 SV=1)
HSP 1 Score: 448.0 bits (1151), Expect = 3.2e-122
Identity = 227/256 (88.67%), Postives = 238/256 (92.97%), Query Frame = 0
Query: 48 MATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRDSSFSLASPVTSRTSVRF 107
MATLT A SSSSSYMCLTKV PP STSFPI P K+PRDSSFSLASPVT R VRF
Sbjct: 1 MATLTGA-SSSSSYMCLTKV-LPPFSSTSFPILLPNLSKVPRDSSFSLASPVTGRRKVRF 60
Query: 108 NPSFARDDEFGDFVEMKETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNL 167
+PSFARD EFGD VEM+ET ETRLYSL+PFPLLF+AALPGAGTVRSLFGPFVELVKSWNL
Sbjct: 61 SPSFARDREFGDLVEMRETRETRLYSLAPFPLLFVAALPGAGTVRSLFGPFVELVKSWNL 120
Query: 168 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFAL 227
PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYS++VEEKANAKDLHPKLLGGMFFFFAL
Sbjct: 121 PEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSNDVEEKANAKDLHPKLLGGMFFFFAL 180
Query: 228 GATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIM 287
GATGGITSLLTSDKPIFESPHAVTGFIGL LLT+Q+LLPSLFE+NPGLRN+HGILGSGIM
Sbjct: 181 GATGGITSLLTSDKPIFESPHAVTGFIGLALLTLQSLLPSLFEDNPGLRNIHGILGSGIM 240
Query: 288 TLFLIHAALGLQLGLS 304
TLFLIHAALGLQLGLS
Sbjct: 241 TLFLIHAALGLQLGLS 254
BLAST of Bhi03G001198 vs. ExPASy TrEMBL
Match:
A0A0A0KIF5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G446520 PE=4 SV=1)
HSP 1 Score: 434.1 bits (1115), Expect = 4.8e-118
Identity = 229/262 (87.40%), Postives = 238/262 (90.84%), Query Frame = 0
Query: 48 MATLTAASSSSSSYMCLTKVSSPPLPSTSFPISFPYHPKLPRD----SSFSLASPVTSRT 107
MATLT A SSS SY+CLTKV SPPLPSTS + PK+PR+ SSFS ASP+ RT
Sbjct: 1 MATLT-APSSSFSYICLTKV-SPPLPSTSLNL-----PKIPRNSSSSSSFSFASPLPLRT 60
Query: 108 SVRFNPSFARDDEFGDFVEMK-ETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELV 167
SVRFNPSFAR+DEFGDF E K ETSE RLYSLSPFPLLFIAALPGAGTVRSLFGPFVELV
Sbjct: 61 SVRFNPSFARNDEFGDFEETKEETSEMRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELV 120
Query: 168 KSWNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMF 227
KSWNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMF
Sbjct: 121 KSWNLPEWLVHWGHPGNMAVVLFAMGGYGTYLGFRIRYSDDVEEKAYAKDLHPKLLGGMF 180
Query: 228 FFFALGATGGITSLLTSDKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGIL 287
FFFALGATGG+TSLLTSDKPI ESPHAVTGFIGLTLLTVQTLLPSLFE+NPGLRNVHGIL
Sbjct: 181 FFFALGATGGVTSLLTSDKPILESPHAVTGFIGLTLLTVQTLLPSLFEDNPGLRNVHGIL 240
Query: 288 GSGIMTLFLIHAALGLQLGLSY 305
GSGIMTLFLIHAALGLQLGLSY
Sbjct: 241 GSGIMTLFLIHAALGLQLGLSY 255
BLAST of Bhi03G001198 vs. ExPASy TrEMBL
Match:
A0A5D3C7L2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold202G00350 PE=4 SV=1)
HSP 1 Score: 430.6 bits (1106), Expect = 5.3e-117
Identity = 220/245 (89.80%), Postives = 226/245 (92.24%), Query Frame = 0
Query: 62 MCLTKVSSPPLPSTSFPISFPYHPKLPRD-SSFSLASPVTSRTSVRFNPSFARDDEFGDF 121
MCLTKV SPPLPSTSFP FP PK PR+ SSFS ASP+ SRTS+RFNP FAR+D FGDF
Sbjct: 1 MCLTKV-SPPLPSTSFPFPFPTLPKTPRNSSSFSFASPLPSRTSLRFNPCFARNDHFGDF 60
Query: 122 VEMK-ETSETRLYSLSPFPLLFIAALPGAGTVRSLFGPFVELVKSWNLPEWLVHWGHPGN 181
VEMK ETSE RLYSLSPFPLLFIAALPG GTVRSLFGPFVELVKS NLPEWLVHWGHPGN
Sbjct: 61 VEMKEETSEMRLYSLSPFPLLFIAALPGGGTVRSLFGPFVELVKSLNLPEWLVHWGHPGN 120
Query: 182 MAVVLFAMGGYGTYLGFRIRYSDNVEEKANAKDLHPKLLGGMFFFFALGATGGITSLLTS 241
MAVVLFAMGGYGTYLGFRIRYSD+VEEKA AKDLHPKLLGGMFFFFALGATGGI SLLTS
Sbjct: 121 MAVVLFAMGGYGTYLGFRIRYSDDVEEKAYAKDLHPKLLGGMFFFFALGATGGIISLLTS 180
Query: 242 DKPIFESPHAVTGFIGLTLLTVQTLLPSLFENNPGLRNVHGILGSGIMTLFLIHAALGLQ 301
DKPIFESPHAVTGFIGL LLTVQTLLPSLFE+NPGLRNVHGILGSGIMTLFLIHAA GLQ
Sbjct: 181 DKPIFESPHAVTGFIGLALLTVQTLLPSLFEDNPGLRNVHGILGSGIMTLFLIHAAFGLQ 240
Query: 302 LGLSY 305
LGLSY
Sbjct: 241 LGLSY 244
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
AT2G36885.1 | 1.9e-82 | 63.26 | unknown protein; FUNCTIONS IN: molecular_function unknown; LOCATED IN: chloropla... | [more] |
AT2G36885.2 | 4.6e-81 | 63.26 | unknown protein; FUNCTIONS IN: molecular_function unknown; LOCATED IN: chloropla... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_038880983.1 | 3.4e-142 | 100.00 | uncharacterized protein LOC120072635 [Benincasa hispida] | [more] |
XP_022977650.1 | 3.0e-122 | 88.67 | uncharacterized protein LOC111477900 [Cucurbita maxima] | [more] |
XP_008462484.1 | 5.1e-122 | 90.04 | PREDICTED: uncharacterized protein LOC103500827 [Cucumis melo] | [more] |
KAG6604398.1 | 5.1e-122 | 88.03 | hypothetical protein SDJN03_05007, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022925916.1 | 6.6e-122 | 88.67 | uncharacterized protein LOC111433189 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1IMY7 | 1.4e-122 | 88.67 | uncharacterized protein LOC111477900 OS=Cucurbita maxima OX=3661 GN=LOC111477900... | [more] |
A0A1S3CHJ9 | 2.5e-122 | 90.04 | uncharacterized protein LOC103500827 OS=Cucumis melo OX=3656 GN=LOC103500827 PE=... | [more] |
A0A6J1EDG7 | 3.2e-122 | 88.67 | uncharacterized protein LOC111433189 OS=Cucurbita moschata OX=3662 GN=LOC1114331... | [more] |
A0A0A0KIF5 | 4.8e-118 | 87.40 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G446520 PE=4 SV=1 | [more] |
A0A5D3C7L2 | 5.3e-117 | 89.80 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |