Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GTCACATCCGAACCAAAATCAAATTCATTCATCCCCTTTTTTAAAAACTCCAAACTCTTCGCTCCATCTCCATCTTCATCTCCTTCCAATTCAAGAAAACAAAACCCTAGAAATCCGCTGAATGAATCAAATCAAACACTGCAGAATCTTAGTTCTTCTTCCAAGCTCATTGCGTGATTGAACATCTTTTTGTTTTTAAGATCTTCAAGATCTCCAGTTTGTCCATCTATGGCGAAAGTCATGAAGCCTTCTTCGCGCTACAGTTCCTACGATGTCCGATCTTCTACTTCCTCCCATTTCTCCGACCCTTCTTCTTCCTGTGAGTTCAATCTCAAGTCTCCACTGCCAGCTAATTCGTCTTCTTCGCGTGCTCTTGTTAAAACCAAGCCTTCTGATCTAGCTAGGGCTAAGGCTAAGCCGTCGGATCAGAATTTGACGGCGATGGTGAAGAAATTCATGGAGAAGCGATCTGGTTCGAAGCCGAAGACGGTGAAGCAGGCGGCTGGATTGGTGATTCCGTCGGATTTGATTGCGGAGGATTTGAAGAAGACGGCGAGGAAAGGGACGAACTTCGGTGGTCTCCATAAGAAGTTGTTTGGGAAGGGAACGGTGGAGAAGAAGGAGGTGAAGGAAGTCAAGGCGTTGACGGAGGTGAAAGGGAACACGAGGACATTGGCAATGGTGCTGAGGAGTGAAAGAGAGCTTTTGAGTTTGAATAAGGAGCAGGAGTTGGAGATCACTGAGCTCAAATTAATTCTGGAGGAGAAGTACAGAGAGGTGAGGAATGAGGATACATTTCTTAATTTTACTTTACCTTTTACTTTCAGTTCTAAATTATGTAGCAAATTTGAATAGATGGTAAACTTAATGCACTTGTGTTAAATTTTCATTTTTTTCCCTGTTAGATTTATGACTAGTTAGGATAAGGAATATGAATATTATTGTACTTGAATGTTTTTGGTCATGTTTGATGTTCTTTAGCTGAATATGGCATTCTCATTATTAAGTTCGGTCATTTTTCAATTATGTTTGATTTTGTGTCGAGGTTTGAATGTTCTTTGAATAGATTTCAGTACTTATATAGTTGGATTTTGTCGTTTGTTTAGATTAAAAAATAAAAAAATTATGATTAGTGGATGATATGACTGTAATTTTCCACATATAGTTTGTTAACACAACATTGGTACTTATTCTTGTTGCTATCAGATTGAGAAGTTGAAAGACTTGTGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAGAATGCAATATTGTTCCCAGATGTTATGAACTCTCAGCTTCAAAATATGCTTGAAAAGCAGGACTCAGAGTTGAAGCAAGCCAAACAAATAATCCCCACTCTACAAAAGCAGGTCACCACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTTGCTGAGGTATGAATCAATGTAGTGTTCTTTACAATGATACATATATGAATCCTTGAGAAGCATAAGAAGAAAGACCTGAAACGTAGGAAACCTCTGATTGGCCGATCACCGAGAGGCTTTGTTGAGTTGAACCTAATGCTTTTAACAACTGGATGACTCAACTTAGCTCTAGAACTTGAAATCTTTCATTTCTTTGATATGAACCATGAAAGTAATACTGCTTAGAACACAGCCTTGAATGAAATTTAGACTTTCCTTTATATGTTTGTTGTACCGAATAGCTGTTATTGGATGAGTTTTCGTTCGAATAGCTGTTATTGGATGAGTTTTCGTTGCTTTCTCCTTGGTGATTGATTTATCTCTTATCACAACTGACATTTTATTTTGAGCAGGTGAAGGCTGATAAATATTCAGGAAAGTCTTGGTTACAAGGTAGTATTTCTCCTCACACACCAACATATGATCAGGAGGATGCTTCTAACTCATTGGTAAGCACAAAAGCTATCCTTTGATCTTCATGTCCATAATTGTAACGTTGCTACGTACCACTCCTCTTGGCTTGCCCTGCTTGCTCAAATGAG
AAAAACTTGAGATTATGGTTAAATTACTACTGAAAATTCAGTTCAGGAAGATAACAAGTAGGGACATAGTCTATTTATCTCCTAGATGTATAATTACTCATCCAAACTTCATGTCATTCAGGAGTTCAGTGCCTGTGATCCAACATCCCCTGGCAGTCCAGATGACTTTTTGCTGAAGGATGTAAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGTATAACTTTTTATTTCTGCTGTTAGTTCCTCACATCTCAAGATTGTTCATTCAGCAAACTGATCAACTGAACTCATTCCTACAGGAGTTTGAGGCAATGGGATATGATTCTCCTCGAGACGAAATTCTATCCCATAACAGAATGGAATTTGGTTTTAAATCTTGTTCCAGGAAGTTGTCCAAAAGTTCTGATTGCAGACAGAATTCCGACAAAGCAAACACTACAAAGACAGCCCGACGATCTGATGAGGCCAAATACATGTATGGAAAGCCAATGCATAAATTTTAGTGAACCTCATTTTTTTTTTCAAGCTTGGGTTAATATTCTTGTTCCATCTGCATACCTTCTTGTACTTTGTCACCCACTGTCACTGTATGAACCGTAGTTCATATTGCATCTGATCCAACAACTGAGGCTGTAATGCGAACCGTTGTTCTTTGTTCCTTGGTTGAGAATGGTCCATTTGAGCTTTAGGGGAAGTAATACCATTGTATACATGCTACATTTGGAAAGCTCATCAATAATAGTAAATGGTCCTATTCTTTGAGAATTGTTGCCACTGTGCGTTGCGTTTCACTCCTATGCTCCATTTACTTCCCTG
mRNA sequence
GTCACATCCGAACCAAAATCAAATTCATTCATCCCCTTTTTTAAAAACTCCAAACTCTTCGCTCCATCTCCATCTTCATCTCCTTCCAATTCAAGAAAACAAAACCCTAGAAATCCGCTGAATGAATCAAATCAAACACTGCAGAATCTTAGTTCTTCTTCCAAGCTCATTGCGTGATTGAACATCTTTTTGTTTTTAAGATCTTCAAGATCTCCAGTTTGTCCATCTATGGCGAAAGTCATGAAGCCTTCTTCGCGCTACAGTTCCTACGATGTCCGATCTTCTACTTCCTCCCATTTCTCCGACCCTTCTTCTTCCTGTGAGTTCAATCTCAAGTCTCCACTGCCAGCTAATTCGTCTTCTTCGCGTGCTCTTGTTAAAACCAAGCCTTCTGATCTAGCTAGGGCTAAGGCTAAGCCGTCGGATCAGAATTTGACGGCGATGGTGAAGAAATTCATGGAGAAGCGATCTGGTTCGAAGCCGAAGACGGTGAAGCAGGCGGCTGGATTGGTGATTCCGTCGGATTTGATTGCGGAGGATTTGAAGAAGACGGCGAGGAAAGGGACGAACTTCGGTGGTCTCCATAAGAAGTTGTTTGGGAAGGGAACGGTGGAGAAGAAGGAGGTGAAGGAAGTCAAGGCGTTGACGGAGGTGAAAGGGAACACGAGGACATTGGCAATGGTGCTGAGGAGTGAAAGAGAGCTTTTGAGTTTGAATAAGGAGCAGGAGTTGGAGATCACTGAGCTCAAATTAATTCTGGAGGAGAAGTACAGAGAGATTGAGAAGTTGAAAGACTTGTGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAGAATGCAATATTGTTCCCAGATGTTATGAACTCTCAGCTTCAAAATATGCTTGAAAAGCAGGACTCAGAGTTGAAGCAAGCCAAACAAATAATCCCCACTCTACAAAAGCAGGTCACCACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTTGCTGAGGTGAAGGCTGATAAATATTCAGGAAAGTCTTGGTTACAAGGTAGTATTTCTCCTCACACACCAACATATGATCAGGAGGATGCTTCTAACTCATTGGAGTTCAGTGCCTGTGATCCAACATCCCCTGGCAGTCCAGATGACTTTTTGCTGAAGGATGTAAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGAGTTTGAGGCAATGGGATATGATTCTCCTCGAGACGAAATTCTATCCCATAACAGAATGGAATTTGGTTTTAAATCTTGTTCCAGGAAGTTGTCCAAAAGTTCTGATTGCAGACAGAATTCCGACAAAGCAAACACTACAAAGACAGCCCGACGATCTGATGAGGCCAAATACATGTATGGAAAGCCAATGCATAAATTTTAGTGAACCTCATTTTTTTTTTCAAGCTTGGGTTAATATTCTTGTTCCATCTGCATACCTTCTTGTACTTTGTCACCCACTGTCACTGTATGAACCGTAGTTCATATTGCATCTGATCCAACAACTGAGGCTGTAATGCGAACCGTTGTTCTTTGTTCCTTGGTTGAGAATGGTCCATTTGAGCTTTAGGGGAAGTAATACCATTGTATACATGCTACATTTGGAAAGCTCATCAATAATAGTAAATGGTCCTATTCTTTGAGAATTGTTGCCACTGTGCGTTGCGTTTCACTCCTATGCTCCATTTACTTCCCTG
Coding sequence (CDS)
ATGGCGAAAGTCATGAAGCCTTCTTCGCGCTACAGTTCCTACGATGTCCGATCTTCTACTTCCTCCCATTTCTCCGACCCTTCTTCTTCCTGTGAGTTCAATCTCAAGTCTCCACTGCCAGCTAATTCGTCTTCTTCGCGTGCTCTTGTTAAAACCAAGCCTTCTGATCTAGCTAGGGCTAAGGCTAAGCCGTCGGATCAGAATTTGACGGCGATGGTGAAGAAATTCATGGAGAAGCGATCTGGTTCGAAGCCGAAGACGGTGAAGCAGGCGGCTGGATTGGTGATTCCGTCGGATTTGATTGCGGAGGATTTGAAGAAGACGGCGAGGAAAGGGACGAACTTCGGTGGTCTCCATAAGAAGTTGTTTGGGAAGGGAACGGTGGAGAAGAAGGAGGTGAAGGAAGTCAAGGCGTTGACGGAGGTGAAAGGGAACACGAGGACATTGGCAATGGTGCTGAGGAGTGAAAGAGAGCTTTTGAGTTTGAATAAGGAGCAGGAGTTGGAGATCACTGAGCTCAAATTAATTCTGGAGGAGAAGTACAGAGAGATTGAGAAGTTGAAAGACTTGTGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAGAATGCAATATTGTTCCCAGATGTTATGAACTCTCAGCTTCAAAATATGCTTGAAAAGCAGGACTCAGAGTTGAAGCAAGCCAAACAAATAATCCCCACTCTACAAAAGCAGGTCACCACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTTGCTGAGGTGAAGGCTGATAAATATTCAGGAAAGTCTTGGTTACAAGGTAGTATTTCTCCTCACACACCAACATATGATCAGGAGGATGCTTCTAACTCATTGGAGTTCAGTGCCTGTGATCCAACATCCCCTGGCAGTCCAGATGACTTTTTGCTGAAGGATGTAAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGAGTTTGAGGCAATGGGATATGATTCTCCTCGAGACGAAATTCTATCCCATAACAGAATGGAATTTGGTTTTAAATCTTGTTCCAGGAAGTTGTCCAAAAGTTCTGATTGCAGACAGAATTCCGACAAAGCAAACACTACAAAGACAGCCCGACGATCTGATGAGGCCAAATACATGTATGGAAAGCCAATGCATAAATTTTAG
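Because the CDS is a contiguous stretch of the spliced mRNA, the two untranslated regions named in the legend can be recovered by locating the CDS inside the mRNA. The following is a minimal sketch; the function name `split_utrs` is hypothetical and is not part of any database tooling, and it assumes the CDS occurs exactly as given in the mRNA.

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str]:
    """Return (five_prime_UTR, three_prime_UTR) flanking the CDS in the mRNA.

    Hypothetical helper: assumes the CDS is an exact substring of the mRNA.
    """
    mrna, cds = mrna.upper(), cds.upper()
    start = mrna.find(cds)  # index of the first base of the start codon
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)  # index just past the stop codon
    return mrna[:start], mrna[end:]


# Toy example; paste the full mRNA and CDS strings from above to apply it here.
five_utr, three_utr = split_utrs("GGATGAAATAGCC", "ATGAAATAG")
print(len(five_utr), len(three_utr))
```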
Protein sequence
MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARAKAKPSDQNLTAMVKKFMEKRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHKKLFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLILEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQVTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDQEDASNSLEFSACDPTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKSSDCRQNSDKANTTKTARRSDEAKYMYGKPMHKF
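The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and do not come from the source page.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first nine codons of the CDS above give the first nine residues.
print(translate("ATGGCGAAAGTCATGAAGCCTTCTTCG"))  # MAKVMKPSS
```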
Homology
BLAST of Bhi08G000626 vs. TAIR 10
Match:
AT4G17240.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 8 growth stages; Has 1142 Blast hits to 1055 proteins in 252 species: Archae - 22; Bacteria - 318; Metazoa - 248; Fungi - 96; Plants - 59; Viruses - 3; Other Eukaryotes - 396 (source: NCBI BLink). )
HSP 1 Score: 250.0 bits (637), Expect = 3.2e-66
Identity = 188/383 (49.09%), Positives = 237/383 (61.88%), Query Frame = 0
Query: 8 SSRYSSYDVRSS-TSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARA----KA 67
+SRY+SYD RSS TSS SD SSS EF P+ SS+A+V++K S L + K
Sbjct: 2 ASRYNSYDSRSSVTSSIHSDLSSSAEFKSNKPI-----SSKAIVRSKSSYLTKTTKPIKP 61
Query: 68 KPSDQNLTAMVKKFME-KRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHKK 127
+ NLT M+KK ME K+S SK K V+ LVIP +L D K K T G L +K
Sbjct: 62 DSNPGNLTNMMKKLMEMKKSNSKSKRVE----LVIPEELKKIDTGKGGGKST-LGTLQRK 121
Query: 128 LFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLILEEKY 187
LFGK ++VKALTEVK NTRTL+MVLRSERELL +NK+QE+EI ELK LEEK
Sbjct: 122 LFGK--------EKVKALTEVKSNTRTLSMVLRSERELLGMNKDQEVEIAELKFQLEEKN 181
Query: 188 REIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQVT 247
RE+EKLKDLCLKQREEIKSLK+A+LFPD MNSQ+ M EL QA++IIP LQKQV
Sbjct: 182 REVEKLKDLCLKQREEIKSLKSAVLFPDSMNSQINQM-----QELNQAREIIPNLQKQVI 241
Query: 248 TLTGQLHSLAEDLAEVKADKYSGKS--WLQGSISPHTPTYDQEDASNSLEFSACDPTSPG 307
+L GQL +A+DLAEVKA+KY +S W T +YD SLEFS+ G
Sbjct: 242 SLNGQLQCIAQDLAEVKANKYLSESCYW-----QAQTSSYD------SLEFSS------G 301
Query: 308 SPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKSSDC 367
SPD L+D+NPCLTPY K KE+E + DS + + + + + + KSS
Sbjct: 302 SPDGLALEDLNPCLTPYTKKKPKEYERV--DSAEESLSGRSTI-----TTTGGKVKSSSR 337
Query: 368 RQNSDKANTTKTARRSDEAKYMY 383
+++ K +RS+E+K Y
Sbjct: 362 SVKMSRSSEGKAGQRSEESKGWY 337
BLAST of Bhi08G000626 vs. TAIR 10
Match:
AT4G17240.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 8 growth stages. )
HSP 1 Score: 201.1 bits (510), Expect = 1.7e-51
Identity = 167/383 (43.60%), Positives = 219/383 (57.18%), Query Frame = 0
Query: 8 SSRYSSYDVRSS-TSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARA----KA 67
+SRY+SYD RSS TSS SD SSS EF P+ SS+A+V++K S L + K
Sbjct: 2 ASRYNSYDSRSSVTSSIHSDLSSSAEFKSNKPI-----SSKAIVRSKSSYLTKTTKPIKP 61
Query: 68 KPSDQNLTAMVKKFME-KRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHKK 127
+ NLT M+KK ME K+S SK K V+ LVIP +L D K K T G L +K
Sbjct: 62 DSNPGNLTNMMKKLMEMKKSNSKSKRVE----LVIPEELKKIDTGKGGGKST-LGTLQRK 121
Query: 128 LFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLILEEKY 187
LFGK ++VKALTEVK NTRTL+M+ E+ ++K+ L
Sbjct: 122 LFGK--------EKVKALTEVKSNTRTLSMI-----------HERLAVCNQIKVFL---- 181
Query: 188 REIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQVT 247
++EKLKDLCLKQREEIKSLK+A+LFPD MNSQ+ M EL QA++IIP LQKQV
Sbjct: 182 -QVEKLKDLCLKQREEIKSLKSAVLFPDSMNSQINQM-----QELNQAREIIPNLQKQVI 241
Query: 248 TLTGQLHSLAEDLAEVKADKYSGKS--WLQGSISPHTPTYDQEDASNSLEFSACDPTSPG 307
+L GQL +A+DLAEVKA+KY +S W T +YD SLEFS+ G
Sbjct: 242 SLNGQLQCIAQDLAEVKANKYLSESCYW-----QAQTSSYD------SLEFSS------G 301
Query: 308 SPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKSSDC 367
SPD L+D+NPCLTPY K KE+E + DS + + + + + + KSS
Sbjct: 302 SPDGLALEDLNPCLTPYTKKKPKEYERV--DSAEESLSGRSTI-----TTTGGKVKSSSR 321
Query: 368 RQNSDKANTTKTARRSDEAKYMY 383
+++ K +RS+E+K Y
Sbjct: 362 SVKMSRSSEGKAGQRSEESKGWY 321
BLAST of Bhi08G000626 vs. ExPASy TrEMBL
Match:
A0A1S3CL74 (uncharacterized protein LOC103501712 OS=Cucumis melo OX=3656 GN=LOC103501712 PE=4 SV=1)
HSP 1 Score: 690.6 bits (1781), Expect = 3.6e-195
Identity = 365/389 (93.83%), Positives = 375/389 (96.40%), Query Frame = 0
Query: 1 MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARA 60
MAKVMKPSSRYSSYDVRSSTSSHFSDPSSS +F +KSPLPANSSSSRALVKTKP+DLARA
Sbjct: 1 MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSSDFKIKSPLPANSSSSRALVKTKPTDLARA 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
K KPSDQNLTAMVKKFMEKRSGSKPK VK AAGLVIPSDLIAEDLKKTARKGT+FGGLHK
Sbjct: 61 KMKPSDQNLTAMVKKFMEKRSGSKPKAVKHAAGLVIPSDLIAEDLKKTARKGTSFGGLHK 120
Query: 121 KLFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLILEEK 180
KLFGKGT+EKK+ KEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL+LEEK
Sbjct: 121 KLFGKGTMEKKDAKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVLEEK 180
Query: 181 YREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQV 240
YREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQV
Sbjct: 181 YREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQV 240
Query: 241 TTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDQEDASNSLEFSACDPTSPGS 300
TTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYD EDASNSLEFS CDPTSPGS
Sbjct: 241 TTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDHEDASNSLEFSVCDPTSPGS 300
Query: 301 PDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKSSDCR 360
PDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPR E +S NRME GFKSCSRKLSKSSDCR
Sbjct: 301 PDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRGETVSQNRMESGFKSCSRKLSKSSDCR 360
Query: 361 QNSDKANTTKTARRSDEAKYMYGKPMHKF 390
QNS+KANTTKT R+SDEAKY YGKPMHKF
Sbjct: 361 QNSNKANTTKTGRQSDEAKYTYGKPMHKF 389
BLAST of Bhi08G000626 vs. ExPASy TrEMBL
Match:
A0A0A0LTE0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G181410 PE=4 SV=1)
HSP 1 Score: 686.8 bits (1771), Expect = 5.3e-194
Identity = 363/389 (93.32%), Positives = 375/389 (96.40%), Query Frame = 0
Query: 1 MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARA 60
MAKVMKPSSRY+SYD+RSSTSSHFSDPSSS +FN+KSPLP NSSSSRALVKTKPSDLARA
Sbjct: 1 MAKVMKPSSRYTSYDIRSSTSSHFSDPSSSSDFNIKSPLPPNSSSSRALVKTKPSDLARA 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
K KPSDQNLTAMVKKFMEKRSGSKPKT+K AAGLVI SDLIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KVKPSDQNLTAMVKKFMEKRSGSKPKTLKHAAGLVISSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLILEEK 180
KLFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNK+QELEITELKL+LEEK
Sbjct: 121 KLFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNKDQELEITELKLVLEEK 180
Query: 181 YREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQV 240
YREIEKLKDLCLKQREEIKSLKNA+LFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQV
Sbjct: 181 YREIEKLKDLCLKQREEIKSLKNAVLFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQV 240
Query: 241 TTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDQEDASNSLEFSACDPTSPGS 300
TTLTGQL+SLAEDLAEVKADKYSGKSWLQGSISPHTPTYD EDASNSLEFS CDPTSPGS
Sbjct: 241 TTLTGQLYSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDHEDASNSLEFSVCDPTSPGS 300
Query: 301 PDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKSSDCR 360
PDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEIL NRME GFKSCSRKLSKSSDC+
Sbjct: 301 PDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILPQNRMESGFKSCSRKLSKSSDCK 360
Query: 361 QNSDKANTTKTARRSDEAKYMYGKPMHKF 390
Q S+KANTTKT R+SDEAKY YGKPM KF
Sbjct: 361 QISNKANTTKTGRQSDEAKYTYGKPMRKF 389
BLAST of Bhi08G000626 vs. ExPASy TrEMBL
Match:
A0A6J1EHN1 (uncharacterized protein LOC111432624 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432624 PE=4 SV=1)
HSP 1 Score: 656.8 bits (1693), Expect = 5.8e-185
Identity = 352/389 (90.49%), Positives = 364/389 (93.57%), Query Frame = 0
Query: 1 MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARA 60
MAKV+ PSSRYSSYDVRSS SSHFSDPSSS EF LKSP+ A+SSSSRA+VK+K +DL RA
Sbjct: 1 MAKVINPSSRYSSYDVRSSGSSHFSDPSSSSEFKLKSPMKADSSSSRAIVKSKAADLPRA 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
K KPSDQNLTAMVKKFMEKRSG KPKTVK A GLVIPSDLIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KTKPSDQNLTAMVKKFMEKRSGLKPKTVKHATGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLILEEK 180
KLFGKG VEKKE KEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL+LEEK
Sbjct: 121 KLFGKGMVEKKE-KEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVLEEK 180
Query: 181 YREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQV 240
Y EIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQIIPTLQKQV
Sbjct: 181 YGEIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQGILEKQDSELKQAKQIIPTLQKQV 240
Query: 241 TTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDQEDASNSLEFSACDPTSPGS 300
TTLTGQL+SLAEDLAEVKADKYSGK WLQGS SPHTPTYD EDASN LEFSACDPTSP
Sbjct: 241 TTLTGQLYSLAEDLAEVKADKYSGKGWLQGSSSPHTPTYDHEDASNPLEFSACDPTSPSR 300
Query: 301 PDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKSSDCR 360
PDD+LLKDVNPCLTPYYATKSK+FEAMGYDSPRDEILSHNRME GF SCSRKLSKSSDCR
Sbjct: 301 PDDYLLKDVNPCLTPYYATKSKDFEAMGYDSPRDEILSHNRMESGFTSCSRKLSKSSDCR 360
Query: 361 QNSDKANTTKTARRSDEAKYMYGKPMHKF 390
QNS+KA TTKTARRSDEAKY YGKPMHKF
Sbjct: 361 QNSNKAKTTKTARRSDEAKYTYGKPMHKF 388
BLAST of Bhi08G000626 vs. ExPASy TrEMBL
Match:
A0A6J1CNL5 (inner centromere protein A OS=Momordica charantia OX=3673 GN=LOC111012662 PE=4 SV=1)
HSP 1 Score: 654.4 bits (1687), Expect = 2.9e-184
Identity = 357/395 (90.38%), Positives = 370/395 (93.67%), Query Frame = 0
Query: 1 MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSCEFNLKSPLPAN--SSSSRALVKTKPSDLA 60
MA V+KPSSRYSSYDVRSSTSSHFSDPS+S EF LKSP+ AN SSSSRALVK+K SDLA
Sbjct: 1 MASVIKPSSRYSSYDVRSSTSSHFSDPSTSSEFKLKSPMAANSSSSSSRALVKSKASDLA 60
Query: 61 RAKAKPSDQNLTAMVKKFMEKRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGL 120
RAK+KPSDQNLTAMVKKFMEKRS SKPKT K A GLVIPSDLIAEDLKKTARKGTNFGGL
Sbjct: 61 RAKSKPSDQNLTAMVKKFMEKRSASKPKTAKHATGLVIPSDLIAEDLKKTARKGTNFGGL 120
Query: 121 HKKLFGKGT--VEKKEVK-EVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL 180
HKKLFGKG+ VEKKE K EVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL
Sbjct: 121 HKKLFGKGSAAVEKKEKKEEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL 180
Query: 181 ILEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPT 240
+LEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ MLEKQDSELKQAKQIIPT
Sbjct: 181 VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQEMLEKQDSELKQAKQIIPT 240
Query: 241 LQKQVTTLTGQLHSLAEDLAEVKADKYSGKSWLQ-GSISPHTPTYDQEDASNSLEFSACD 300
LQKQVT LTGQLHSLAEDLAEVKADKYSGK+WLQ S SPHTPTYD EDASNSLEFSACD
Sbjct: 241 LQKQVTXLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDDEDASNSLEFSACD 300
Query: 301 PTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLS 360
PTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNR E GF+SCSRKLS
Sbjct: 301 PTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRKESGFESCSRKLS 360
Query: 361 KSSDCRQNSDKANTTKTARRSDEAKYMYGKPMHKF 390
+SSDCRQ S++ NTT+TARRSDEAKYMYGKPMHKF
Sbjct: 361 RSSDCRQKSNETNTTRTARRSDEAKYMYGKPMHKF 395
BLAST of Bhi08G000626 vs. ExPASy TrEMBL
Match:
A0A6J1HTF1 (uncharacterized protein LOC111466593 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466593 PE=4 SV=1)
HSP 1 Score: 651.7 bits (1680), Expect = 1.9e-183
Identity = 350/389 (89.97%), Positives = 361/389 (92.80%), Query Frame = 0
Query: 1 MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARA 60
MAKV+ PSSRYSSYDVRSS SSHFSDPSSS EF LKSP+ A+SSSSR +VK+K DLARA
Sbjct: 1 MAKVINPSSRYSSYDVRSSNSSHFSDPSSSSEFKLKSPMKADSSSSRTIVKSKAVDLARA 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
K KP DQNLTAMVKKFMEKRSG KPKTVK A GLVIPSDLIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KTKPLDQNLTAMVKKFMEKRSGLKPKTVKHATGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGTVEKKEVKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLILEEK 180
KLFGKG VEKKE KEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL+LEEK
Sbjct: 121 KLFGKGMVEKKE-KEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVLEEK 180
Query: 181 YREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQV 240
Y EIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ +LEKQDSELKQAKQIIPTLQKQV
Sbjct: 181 YGEIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQGILEKQDSELKQAKQIIPTLQKQV 240
Query: 241 TTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDQEDASNSLEFSACDPTSPGS 300
TTLTGQL+SLAEDLAEVKADKYSGK WLQGS SPHTPTYD EDASN LEFSACDPTSP
Sbjct: 241 TTLTGQLYSLAEDLAEVKADKYSGKGWLQGSSSPHTPTYDHEDASNPLEFSACDPTSPSR 300
Query: 301 PDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKSSDCR 360
PDD+LLKDVNPCLTPYYATKSK+FEAMGYDSPRDEILSHNRME F SCSRKLSKSSDCR
Sbjct: 301 PDDYLLKDVNPCLTPYYATKSKDFEAMGYDSPRDEILSHNRMESDFTSCSRKLSKSSDCR 360
Query: 361 QNSDKANTTKTARRSDEAKYMYGKPMHKF 390
QNS+KA TTKTARRSDEAKY YGKPMHKF
Sbjct: 361 QNSNKAKTTKTARRSDEAKYTYGKPMHKF 388
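The percentages on the Identity and Positives lines above are the match counts divided by the alignment length. In a BLAST midline, a letter marks an identical residue and `+` marks a conservative substitution, so both counts can be tallied from the midline text. The sketch below illustrates this; the helper names `percent` and `midline_counts` are hypothetical. Since midline spacing is collapsed in this text rendering, column positions cannot be recovered reliably, but the counts do not depend on them.

```python
def percent(matches: int, length: int) -> float:
    """Percentage of matching columns, rounded to two decimals as in the report."""
    return round(100.0 * matches / length, 2)


def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST midline.

    Letters are identical residues; '+' marks a conservative substitution.
    Positives include identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# First TAIR hit: 188 identities and 237 positives over 383 aligned columns.
print(percent(188, 383), percent(237, 383))  # 49.09 61.88
```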
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
AT4G17240.1 | 3.2e-66 | 49.09 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G17240.2 | 1.7e-51 | 43.60 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
Match Name | E-value | Identity | Description
A0A1S3CL74 | 3.6e-195 | 93.83 | uncharacterized protein LOC103501712 OS=Cucumis melo OX=3656 GN=LOC103501712 PE=...
A0A0A0LTE0 | 5.3e-194 | 93.32 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G181410 PE=4 SV=1
A0A6J1EHN1 | 5.8e-185 | 90.49 | uncharacterized protein LOC111432624 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1CNL5 | 2.9e-184 | 90.38 | inner centromere protein A OS=Momordica charantia OX=3673 GN=LOC111012662 PE=4 S...
A0A6J1HTF1 | 1.9e-183 | 89.97 | uncharacterized protein LOC111466593 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
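For downstream filtering or sorting, the pipe-delimited summary rows can be read into simple records. This is a minimal sketch under the assumption that each row follows the `Match Name | E-value | Identity | Description` layout shown above; the `Hit` class and `parse_hits` function are hypothetical names, not part of any database API.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    name: str
    evalue: float
    identity: float  # percent identity
    description: str


def parse_hits(lines: list[str]) -> list[Hit]:
    """Parse pipe-delimited summary rows, skipping header rows."""
    hits = []
    for line in lines:
        cells = [cell.strip() for cell in line.split("|")]
        # Drop empty cells and link residue such as "[more]".
        cells = [cell for cell in cells if cell and cell != "[more]"]
        if len(cells) < 4 or cells[0] == "Match Name":
            continue
        name, evalue, identity = cells[0], float(cells[1]), float(cells[2])
        hits.append(Hit(name, evalue, identity, " | ".join(cells[3:])))
    return hits


rows = parse_hits([
    "Match Name | E-value | Identity | Description",
    "AT4G17240.1 | 3.2e-66 | 49.09 | unknown protein",
    "A0A1S3CL74 | 3.6e-195 | 93.83 | uncharacterized protein LOC103501712",
])
best = min(rows, key=lambda hit: hit.evalue)  # smallest E-value is the strongest hit
print(best.name)  # A0A1S3CL74
```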