Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGACCATTAACTATTTTTGAAACAAAATAAATCATTATATTAAATGCAGTTCTCATCGTAAGGTATCTTTGGAAGGGCATATTGGTCAATTCGTCTCTGCCATTGCTCCACGAGCTCGCTCAGAAAACTGGAAAATCGATTCACACACTGCGATTTTCTCAATCCATCTTCCAACCGGATGGAGACGACGATGAAGCTGCCGCCGCCGGCGAAGTCTCATCAGGTTCATACATTCACCTCTTCACTCTATCCAAAGTCGGTGAACCAATCGCCGGAATTGGATCTTCAGCAGACGCCAACTTCACGCAAGGATTCTCGGCGGAGGATCCGAAACCTTTCGTTGATTAAGAGGAAACTGGCGCCGTCCGGCCGGAGGAGTCGGCCACAGACTCCGCTTCTGAAGTGGAAGGTCGAGGAGAGAGTCGATGGTGGAGGTGAAGAGGACCAGGACGAGAAGAAGTCGGAATCGGAGAACGGAGGAAAAGATCTCCGGCAGGCGAGTGGGGAAAGAGATGTGATCGTATCAGCAAGGAAACTCGCTGCTGGTTTTTGGCGGTTTCAGAAGCCGGAGGTTAGCGCTGATGGAGGAAGGAATGGTTTGAGACGCAAGCAGGAGCAGGGGATCGGTTTTCAGGTAAATGTTACTTTGGAAATCTCGTTTCGAATTGAATTTGGAGAAATTTGATGGAACTTGAATTGTTCTATTTGTATGATTTGCAATTGGTGAATATTAAGTGTGCAGTTTTCTCTTTACTTAGAATCATAATTTTGGAAACCGCTGCGTTGCTTGGGATTGGATATTTCGTTTCTGTTTCTGACATTGCTGTTAAGAATTGTGATTAGATGAGTTTTAGGAATTAAAATTTCGCTTTCATTTCTACAATTCGGTTCATTGCTGTGATCATTTATCATTCATCTTGCAGCCTGTTGCTGGCCATGTTCGGGTCCCAATTCTCCGTCATCACAATAACAACATATTTAGTAATGAAACAAGGGATCTGTTACAGGGCCAACCCTCGTCTGGTATGAGAAATGGTGTTCTGTGCAAGGTAATGACACTACCTTCAGTTCCTCTTTCATATATATATCTTTTTTCATGAAAATACTTGTTGGTATAAAATTTGAAGTGTTTTCTTCAATCCCCCTGCCATCAGCTTGAGCCGTTCTTTCAATTCTCCAACTCAGTTATGGAGGGAGCAACTAAGTGGGATCCTGTTGGCTCGAAAATTTCAGATGAAAGGGGCCATATTTACAACCAAAGAGAGCTTCTTGACCAGCAAGTGAGCCTGGTTTCTGTTATATCTTCCCTTGAAGCCGAACTAAAGCAGGCACGAGTGCGCATTTTGGAACTTGAAACTGAACGTCATGTATCGAAAAAGAAGCTTGAGAGCTTCTTGAGAAAGGTTGATGAGGAAAAGGCTGTATGGCGTATGCGGGAACATGAGAAAGTACGTGTATTTATAGAAAGCATCAGAACGGAGTTGAACCATGAAAGGAAAAATCGACGAAGAGTAGAGCATTTCAATTCAAAACTGGTTCGTGAGCTGGCTGATGCCAAGTCATTGGTGAAACAGCTGATGCAGGACTATGAAGAAGAAAGGAAGGAAAGAGTATTGATTGAACAAGTGTGTGAAGAGCTTGCTAAAGAAATTGGAGATGACAAAGCAGAAATAGAGGCGTCGAAGAGAGAATCTGCCAGACTTAGAGAGGAAGTGGAAGGAGAGAGAAAGATGTTGCAGTTGGCAGAAGTATGGCGTGAAGAACGCGTTCAAATGAAGCTGGTCGATGCCAAAGTAGCTGTAGAAGAGAAATACTCTCAGATGAATAGGCTTGTTGCAGATCTTGAAAATTTCCTAAGATTAAGGGGAGCAATCTCAGACATTAAGGAGATGAAAGAAGCTGTAATACTCGGAAAGACTGCTTCTGCAGTGGACATTCAAGACATAAAGCAGTTATCTTACCAACATCCTAAACCAGACGATATTTTCTCCATCTTTGAAGAAGTTAATTTTGATGAAAACCATGAGAGGGAGGTTAAGCCATATGGTTCTTACAGTCCAGCTACCGAAATCTCTAAAGTTGGAACAACGAGTCCTGAAGTAAACGTGGATGCGGCTAAACGAGTGGATGGCACTCTGATTGCGTCACATCCATGCATCAATCAGAATGGTGACATAGACGATGAGAGTGGATGGGAAACAGTGAGCCAAGTTGAGGATCAGGATTCGAGTTCTTCACCGGAAGGAAGCATGATACCACCTGCTAATAAGAATTGTGGAAAAAGTAGCAGCACCTCAGGCTCAGGTAGTGTAACAGACTGGGAAGAATACGGAGGAAATGGTGAAACAACAATCAACATCAGTGAAGTCTACTCAGAACTTGTAAAGAAATCAAAGAAAGTATCAAACTTAACAAAGAAGCTTTGGAAATCTGGCCATCATAATGGAGGAGACAGCAACAAGATGATACCAGTAAAGGAGTCTCATAGGATAATAACATCATCACCAGAAGCAGAATCAGGGAATGGTGGCTCTAGTCCAGATTTTATAGGTCAATGGAGTTCCTTCGACTTAAGCGACGCTCAAATAGCTCGACAGAGGAAAGTTCAGATAAATGTGAAGGAAAGCCAGAAGCTACAATTGCGGCATGTCCTTAAACAGAAGATATAGAGGCAGTGAGTGATAGCTTCTTGAGATTTGTGATCCAACCATCAAGCTGTAATGCTAATTTTACCTTTCTTGGCTGTCTATGGCTTTGCAACTGGGAACGGAAACAATGATTGAGAGCAAGCCTGGACTATTTTCCGCGGCTGTAAAGATCGATTTTCGTCAAGAAGGCATCGCAATGTTATATATGTACTGTACTGTAAGATTCAAACTAATGAGGCTTGGAAGATAATTTCCGAATTCTGCCTCTATTTTCTCCAGATGTATCGAATATTCGACTTATCAATGAAATAATAACCTTCCAAACATCCCTTAGGAGGACTCCCTTGAAGAGTAGGGCTTATATTTTGGAAAGTACAATAGTTTATAAATTTTATAAAAGAATATAATATGTTGGGATATATCACTGTTATCGACAAGCAAGAACAAAGGAAGAGAAGAAGCTCAGTCACTTGGGGAAAACAGCTAGAATGCAAAATGATTCTGCACCATATGTAAAGAGAGATCACAACATCTTATTCTGTGACAATTTAATCAATTACACAACACTCTAGTCAGTAAATGTTACACTTTCCCGGGTGATCAAGATTTTGAAATGGTAATATAACAATTCAACAAGCTGGAGGAGTCGATAAAGCGGAAACCGATCAGATCAACTATTTTGCCCAACATTCTTCATGTCACCTATACTCTTGAAAGTTGACATGGCCAGT
mRNA sequence
GGACCATTAACTATTTTTGAAACAAAATAAATCATTATATTAAATGCAGTTCTCATCGTAAGGTATCTTTGGAAGGGCATATTGGTCAATTCGTCTCTGCCATTGCTCCACGAGCTCGCTCAGAAAACTGGAAAATCGATTCACACACTGCGATTTTCTCAATCCATCTTCCAACCGGATGGAGACGACGATGAAGCTGCCGCCGCCGGCGAAGTCTCATCAGGTTCATACATTCACCTCTTCACTCTATCCAAAGTCGGTGAACCAATCGCCGGAATTGGATCTTCAGCAGACGCCAACTTCACGCAAGGATTCTCGGCGGAGGATCCGAAACCTTTCGTTGATTAAGAGGAAACTGGCGCCGTCCGGCCGGAGGAGTCGGCCACAGACTCCGCTTCTGAAGTGGAAGGTCGAGGAGAGAGTCGATGGTGGAGGTGAAGAGGACCAGGACGAGAAGAAGTCGGAATCGGAGAACGGAGGAAAAGATCTCCGGCAGGCGAGTGGGGAAAGAGATGTGATCGTATCAGCAAGGAAACTCGCTGCTGGTTTTTGGCGGTTTCAGAAGCCGGAGGTTAGCGCTGATGGAGGAAGGAATGGTTTGAGACGCAAGCAGGAGCAGGGGATCGGTTTTCAGCCTGTTGCTGGCCATGTTCGGGTCCCAATTCTCCGTCATCACAATAACAACATATTTAGTAATGAAACAAGGGATCTGTTACAGGGCCAACCCTCGTCTGGTATGAGAAATGGTGTTCTGTGCAAGCTTGAGCCGTTCTTTCAATTCTCCAACTCAGTTATGGAGGGAGCAACTAAGTGGGATCCTGTTGGCTCGAAAATTTCAGATGAAAGGGGCCATATTTACAACCAAAGAGAGCTTCTTGACCAGCAAGTGAGCCTGGTTTCTGTTATATCTTCCCTTGAAGCCGAACTAAAGCAGGCACGAGTGCGCATTTTGGAACTTGAAACTGAACGTCATGTATCGAAAAAGAAGCTTGAGAGCTTCTTGAGAAAGGTTGATGAGGAAAAGGCTGTATGGCGTATGCGGGAACATGAGAAAGTACGTGTATTTATAGAAAGCATCAGAACGGAGTTGAACCATGAAAGGAAAAATCGACGAAGAGTAGAGCATTTCAATTCAAAACTGGTTCGTGAGCTGGCTGATGCCAAGTCATTGGTGAAACAGCTGATGCAGGACTATGAAGAAGAAAGGAAGGAAAGAGTATTGATTGAACAAGTGTGTGAAGAGCTTGCTAAAGAAATTGGAGATGACAAAGCAGAAATAGAGGCGTCGAAGAGAGAATCTGCCAGACTTAGAGAGGAAGTGGAAGGAGAGAGAAAGATGTTGCAGTTGGCAGAAGTATGGCGTGAAGAACGCGTTCAAATGAAGCTGGTCGATGCCAAAGTAGCTGTAGAAGAGAAATACTCTCAGATGAATAGGCTTGTTGCAGATCTTGAAAATTTCCTAAGATTAAGGGGAGCAATCTCAGACATTAAGGAGATGAAAGAAGCTGTAATACTCGGAAAGACTGCTTCTGCAGTGGACATTCAAGACATAAAGCAGTTATCTTACCAACATCCTAAACCAGACGATATTTTCTCCATCTTTGAAGAAGTTAATTTTGATGAAAACCATGAGAGGGAGGTTAAGCCATATGGTTCTTACAGTCCAGCTACCGAAATCTCTAAAGTTGGAACAACGAGTCCTGAAGTAAACGTGGATGCGGCTAAACGAGTGGATGGCACTCTGATTGCGTCACATCCATGCATCAATCAGAATGGTGACATAGACGATGAGAGTGGATGGGAAACAGTGAGCCAAGTTGAGGATCAGGATTCGAGTTCTTCACCGGAAGGAAGCATGATACCACCTGCTAATAAGAATTGTGGAAAAAGTAGCAGCACCTCAGGCTCAGGTAGTGTAACAGACTGGGAAGAATACGGAGGAAATGGTGAAACAACAATCAACATCAGTGAAGTCTACTCAGAACTTGTAAAGAAATCAAAGAAAGTATCAAACTTAACAAAGAAGCTTTGGAAATCTGGCCATCATAATGGAGGAGACAGCAACAAGATGATACCAGTAAAGGAGTCTCATAGGATAATAACATCATCACCAGAAGCAGAATCAGGGAATGGTGGCTCTAGTCCAGATTTTATAGGTCAATGGAGTTCCTTCGACTTAAGCGACGCTCAAATAGCTCGACAGAGGAAAGTTCAGATAAATGTGAAGGAAAGCCAGAAGCTACAATTGCGGCATGTCCTTAAACAGAAGATATAGAGGCAGTGAGTGATAGCTTCTTGAGATTTGTGATCCAACCATCAAGCTGTAATGCTAATTTTACCTTTCTTGGCTGTCTATGGCTTTGCAACTGGGAACGGAAACAATGATTGAGAGCAAGCCTGGACTATTTTCCGCGGCTGTAAAGATCGATTTTCGTCAAGAAGGCATCGCAATGTTATATATGTACTGTACTGTAAGATTCAAACTAATGAGGCTTGGAAGATAATTTCCGAATTCTGCCTCTATTTTCTCCAGATGTATCGAATATTCGACTTATCAATGAAATAATAACCTTCCAAACATCCCTTAGGAGGACTCCCTTGAAGAGTAGGGCTTATATTTTGGAAAGTACAATAGTTTATAAATTTTATAAAAGAATATAATATGTTGGGATATATCACTGTTATCGACAAGCAAGAACAAAGGAAGAGAAGAAGCTCAGTCACTTGGGGAAAACAGCTAGAATGCAAAATGATTCTGCACCATATGTAAAGAGAGATCACAACATCTTATTCTGTGACAATTTAATCAATTACACAACACTCTAGTCAGTAAATGTTACACTTTCCCGGGTGATCAAGATTTTGAAATGGTAATATAACAATTCAACAAGCTGGAGGAGTCGATAAAGCGGAAACCGATCAGATCAACTATTTTGCCCAACATTCTTCATGTCACCTATACTCTTGAAAGTTGACATGGCCAGT
Coding sequence (CDS)
ATGGAGACGACGATGAAGCTGCCGCCGCCGGCGAAGTCTCATCAGGTTCATACATTCACCTCTTCACTCTATCCAAAGTCGGTGAACCAATCGCCGGAATTGGATCTTCAGCAGACGCCAACTTCACGCAAGGATTCTCGGCGGAGGATCCGAAACCTTTCGTTGATTAAGAGGAAACTGGCGCCGTCCGGCCGGAGGAGTCGGCCACAGACTCCGCTTCTGAAGTGGAAGGTCGAGGAGAGAGTCGATGGTGGAGGTGAAGAGGACCAGGACGAGAAGAAGTCGGAATCGGAGAACGGAGGAAAAGATCTCCGGCAGGCGAGTGGGGAAAGAGATGTGATCGTATCAGCAAGGAAACTCGCTGCTGGTTTTTGGCGGTTTCAGAAGCCGGAGGTTAGCGCTGATGGAGGAAGGAATGGTTTGAGACGCAAGCAGGAGCAGGGGATCGGTTTTCAGCCTGTTGCTGGCCATGTTCGGGTCCCAATTCTCCGTCATCACAATAACAACATATTTAGTAATGAAACAAGGGATCTGTTACAGGGCCAACCCTCGTCTGGTATGAGAAATGGTGTTCTGTGCAAGCTTGAGCCGTTCTTTCAATTCTCCAACTCAGTTATGGAGGGAGCAACTAAGTGGGATCCTGTTGGCTCGAAAATTTCAGATGAAAGGGGCCATATTTACAACCAAAGAGAGCTTCTTGACCAGCAAGTGAGCCTGGTTTCTGTTATATCTTCCCTTGAAGCCGAACTAAAGCAGGCACGAGTGCGCATTTTGGAACTTGAAACTGAACGTCATGTATCGAAAAAGAAGCTTGAGAGCTTCTTGAGAAAGGTTGATGAGGAAAAGGCTGTATGGCGTATGCGGGAACATGAGAAAGTACGTGTATTTATAGAAAGCATCAGAACGGAGTTGAACCATGAAAGGAAAAATCGACGAAGAGTAGAGCATTTCAATTCAAAACTGGTTCGTGAGCTGGCTGATGCCAAGTCATTGGTGAAACAGCTGATGCAGGACTATGAAGAAGAAAGGAAGGAAAGAGTATTGATTGAACAAGTGTGTGAAGAGCTTGCTAAAGAAATTGGAGATGACAAAGCAGAAATAGAGGCGTCGAAGAGAGAATCTGCCAGACTTAGAGAGGAAGTGGAAGGAGAGAGAAAGATGTTGCAGTTGGCAGAAGTATGGCGTGAAGAACGCGTTCAAATGAAGCTGGTCGATGCCAAAGTAGCTGTAGAAGAGAAATACTCTCAGATGAATAGGCTTGTTGCAGATCTTGAAAATTTCCTAAGATTAAGGGGAGCAATCTCAGACATTAAGGAGATGAAAGAAGCTGTAATACTCGGAAAGACTGCTTCTGCAGTGGACATTCAAGACATAAAGCAGTTATCTTACCAACATCCTAAACCAGACGATATTTTCTCCATCTTTGAAGAAGTTAATTTTGATGAAAACCATGAGAGGGAGGTTAAGCCATATGGTTCTTACAGTCCAGCTACCGAAATCTCTAAAGTTGGAACAACGAGTCCTGAAGTAAACGTGGATGCGGCTAAACGAGTGGATGGCACTCTGATTGCGTCACATCCATGCATCAATCAGAATGGTGACATAGACGATGAGAGTGGATGGGAAACAGTGAGCCAAGTTGAGGATCAGGATTCGAGTTCTTCACCGGAAGGAAGCATGATACCACCTGCTAATAAGAATTGTGGAAAAAGTAGCAGCACCTCAGGCTCAGGTAGTGTAACAGACTGGGAAGAATACGGAGGAAATGGTGAAACAACAATCAACATCAGTGAAGTCTACTCAGAACTTGTAAAGAAATCAAAGAAAGTATCAAACTTAACAAAGAAGCTTTGGAAATCTGGCCATCATAATGGAGGAGACAGCAACAAGATGATACCAGTAAAGGAGTCTCATAGGATAATAACATCATCACCAGAAGCAGAATCAGGGAATGGTGGCTCTAGTCCAGATTTTATAGGTCAATGGAGTTCCTTCGACTTAAGCGACGCTCAAATAGCTCGACAGAGGAAAGTTCAGATAAATGTGAAGGAAAGCCAGAAGCTACAATTGCGGCATGTCCTTAAACAGAAGATATAG
Protein sequence
METTMKLPPPAKSHQVHTFTSSLYPKSVNQSPELDLQQTPTSRKDSRRRIRNLSLIKRKLAPSGRRSRPQTPLLKWKVEERVDGGGEEDQDEKKSESENGGKDLRQASGERDVIVSARKLAAGFWRFQKPEVSADGGRNGLRRKQEQGIGFQPVAGHVRVPILRHHNNNIFSNETRDLLQGQPSSGMRNGVLCKLEPFFQFSNSVMEGATKWDPVGSKISDERGHIYNQRELLDQQVSLVSVISSLEAELKQARVRILELETERHVSKKKLESFLRKVDEEKAVWRMREHEKVRVFIESIRTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQVCEELAKEIGDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNRLVADLENFLRLRGAISDIKEMKEAVILGKTASAVDIQDIKQLSYQHPKPDDIFSIFEEVNFDENHEREVKPYGSYSPATEISKVGTTSPEVNVDAAKRVDGTLIASHPCINQNGDIDDESGWETVSQVEDQDSSSSPEGSMIPPANKNCGKSSSTSGSGSVTDWEEYGGNGETTINISEVYSELVKKSKKVSNLTKKLWKSGHHNGGDSNKMIPVKESHRIITSSPEAESGNGGSSPDFIGQWSSFDLSDAQIARQRKVQINVKESQKLQLRHVLKQKI
Homology
BLAST of Bhi09G001374 vs. TAIR 10
Match:
AT1G50660.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast hits to 15134 proteins in 1325 species: Archae - 461; Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants - 1035; Viruses - 42; Other Eukaryotes - 4809 (source: NCBI BLink). )
HSP 1 Score: 501.9 bits (1291), Expect = 8.3e-142
Identity = 337/710 (47.46%), Postives = 452/710 (63.66%), Query Frame = 0
Query: 35 DLQQTPTSRKDSRRRIRNLSLIK-RKLAPS-GRRSRPQTPLLKWKVEER--VDGGGEEDQ 94
DL+ + ++RR RN SL + R+ PS GRRSRP+TPLLKWKVE+R G ED
Sbjct: 28 DLRAIQRATTVTKRRARNPSLTRQRRSGPSGGRRSRPETPLLKWKVEDRNKERSGVVEDD 87
Query: 95 DEKKSESENGGKDLRQASGERDVI--VSARKLAAGFWRFQKPEVSADGGRNGLRRKQEQG 154
D + + + + R + VS RKLAAG WR Q P+ S+ GG RK ++G
Sbjct: 88 DYEDDNHQVARSETTRRKDRRKIARPVSVRKLAAGLWRLQVPDASSSGG----ERKGKEG 147
Query: 155 IGFQPVAGHVRVPILRHHNNNIFSNETRDLLQGQPS-SGMRNGVLCKLEPFFQFSNSVME 214
+GFQ G++ VP L HH++ ++ + Q + + +NG LCKLEP F +S ME
Sbjct: 148 LGFQGNGGYMGVPYLYHHSDKPSGGQSNKIRQNPSTIATTKNGFLCKLEPSMPFPHSAME 207
Query: 215 GATKWDPVGSKISDERGHIYNQRELLDQQVSLVSVISSLEAELKQARVRILELETERHVS 274
GATKWDPV +E IY+ + +DQQV+ VS++SSLEAEL++A RI +LE+E+
Sbjct: 208 GATKWDPVCLDTMEEVHQIYSNMKRIDQQVNAVSLVSSLEAELEEAHARIEDLESEKRSH 267
Query: 275 KKKLESFLRKVDEEKAVWRMREHEKVRVFIESIRTELNHERKNRRRVEHFNSKLVRELAD 334
KKKLE FLRKV EE+A WR REHEKVR I+ ++T++N E+K R+R+E N KLV ELAD
Sbjct: 268 KKKLEQFLRKVSEERAAWRSREHEKVRAIIDDMKTDMNREKKTRQRLEIVNHKLVNELAD 327
Query: 335 AKSLVKQLMQDYEEERKERVLIEQVCEELAKEIGDDKAEIEASKRESARLREEVEGERKM 394
+K VK+ MQDYE+ERK R LIE+VC+ELAKEIG+DKAEIEA KRES LREEV+ ER+M
Sbjct: 328 SKLAVKRYMQDYEKERKARELIEEVCDELAKEIGEDKAEIEALKRESMSLREEVDDERRM 387
Query: 395 LQLAEVWREERVQMKLVDAKVAVEEKYSQMNRLVADLENFLRLRGAISDIKEMKEAVILG 454
LQ+AEVWREERVQMKL+DAKVA+EE+YSQMN+LV DLE+FLR R ++D+KE++EA +L
Sbjct: 388 LQMAEVWREERVQMKLIDAKVALEERYSQMNKLVGDLESFLRSRDIVTDVKEVREAELLR 447
Query: 455 KTASAVDIQDIKQLSYQHPKPDDIFSIFEEVNFDENHEREVKPYGSYSPATEISKVGTTS 514
+TA++V+IQ+IK+ +Y PDDI+++FEE+N E H+RE++ +YSP + SKV T S
Sbjct: 448 ETAASVNIQEIKEFTYVPANPDDIYAVFEEMNLGEAHDREMEKSVAYSPISHDSKVHTVS 507
Query: 515 PEVNVDAAKRVDGTLIASHPCINQNGDI-DDESGWETVSQVEDQDSSSSPEGSMIPPANK 574
+ N+ K S +QNGDI +D+SGWETVS +E+Q SS SP+GS+ NK
Sbjct: 508 LDANMMNKKGRH-----SDAYTHQNGDIEEDDSGWETVSHLEEQGSSYSPDGSIPSVNNK 567
Query: 575 NCGKSSSTSGSGSVTDWEEYGGNGET-TINISEVYSELVKKSKKVSNLTKKLWKS-GHHN 634
N S + SG + + T T ISEV S + SKKVS++ KLW+S G N
Sbjct: 568 NHNHRHSNASSGGTESLGKVWDDTMTPTTEISEVCSIPRRSSKKVSSIA-KLWRSTGASN 627
Query: 635 GG-DSN-KMIPV------------KESHRIITSSPEAESGNGGSSP--DFIGQWSSFDLS 694
G DSN K+I + K S ++ SP+ S GG SP D +GQW+S S
Sbjct: 628 GDRDSNYKVISMEGMNGGRVSNGRKSSAGMV--SPDRVSSKGGFSPMMDLVGQWNSSPES 687
Query: 695 -------------------DAQIARQRKVQINVK-ESQKLQLRHVLKQKI 699
AQ + + I + ESQK+QL+HVLKQ+I
Sbjct: 688 ANHPHVNRGGMKGCIEWPRGAQKSSLKSKLIEARIESQKVQLKHVLKQRI 725
BLAST of Bhi09G001374 vs. TAIR 10
Match:
AT3G20350.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G50660.1); Has 15095 Blast hits to 11224 proteins in 1051 species: Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi - 1255; Plants - 746; Viruses - 40; Other Eukaryotes - 4245 (source: NCBI BLink). )
HSP 1 Score: 399.1 bits (1024), Expect = 7.5e-111
Identity = 295/693 (42.57%), Postives = 409/693 (59.02%), Query Frame = 0
Query: 40 PTSRKDSRRRIRNLSLIK-RKLAPSGRR-SRPQTPLLKWKVEER-------VDGGGEEDQ 99
P R RRR R S + R+ S RR SRP+TP LK KVE++ V+ G ED
Sbjct: 18 PNIRDIHRRRARKPSFTRQRRSGVSVRRLSRPETPQLKSKVEDQNIERCGGVEDGDNEDD 77
Query: 100 DEKKSESENGGKDLRQASGERDVIVSARKLAAGFWRFQKPEVSADGGRNGLRRKQEQGIG 159
D K + + +R + RKLAAG WR + P+ + GG ++ + +
Sbjct: 78 DCNKMRCQERSRSVRPD--------TVRKLAAGVWRLRVPDAVSSGG----DKRSKDRLR 137
Query: 160 FQPVAGHV--RVPILRHHNNNIFSNETRDLLQGQPSSGMRNGVLCKLEPFFQFSNSVMEG 219
FQ AG P+ +H++ ++ Q S + LCK EP F + MEG
Sbjct: 138 FQETAGPAGNLGPLFYYHHH----DDKHSGFQSNNSRNKHSRFLCKHEPSVPFPHCAMEG 197
Query: 220 ATKWDPVGSKISDERGHIYNQRELLDQQVSLVSVISSLEAELKQARVRILELETERHVSK 279
ATKWDP+ D+ IY + +QQV+ VS+ SS+E +L++AR I +LE+E+ K
Sbjct: 198 ATKWDPICLDTRDDVHQIYTNVKWNNQQVNDVSLASSIELKLQEARACIKDLESEKRSQK 257
Query: 280 KKLESFLRKVDEEKAVWRMREHEKVRVFIESIRTELNHERKNRRRVEHFNSKLVRELADA 339
KKLE FL+KV EE+A WR REHEKVR I+ ++ ++N E+K R+R+E NSKLV ELAD+
Sbjct: 258 KKLEQFLKKVSEERAAWRSREHEKVRAIIDDMKADMNQEKKTRQRLEIVNSKLVNELADS 317
Query: 340 KSLVKQLMQDYEEERKERVLIEQVCEELAKEIGDDKAEIEASKRESARLREEVEGERKML 399
K VK+ M DY++ERK R LIE+VC+ELAKEI +DKAEIEA K ES LREEV+ ER+ML
Sbjct: 318 KLAVKRYMHDYQQERKARELIEEVCDELAKEIEEDKAEIEALKSESMNLREEVDDERRML 377
Query: 400 QLAEVWREERVQMKLVDAKVAVEEKYSQMNRLVADLENFLRLRGAISDIKEMKEAVILGK 459
Q+AEVWREERVQMKL+DAKV +EEKYSQMN+LV D+E FL R + +KE++ A +L +
Sbjct: 378 QMAEVWREERVQMKLIDAKVTLEEKYSQMNKLVGDMEAFLSSRNT-TGVKEVRVAELLRE 437
Query: 460 TASAVD-IQDIKQLSYQHPKPDDIFSIFEEVNFDENHEREVKPYGSYSPATEISKVGTTS 519
TA++VD IQ+IK+ +Y+ KPDDI +FE++N EN +RE + Y +YSP + SK T S
Sbjct: 438 TAASVDNIQEIKEFTYEPAKPDDILMLFEQMNMGENQDRESEQYVAYSPVSHASKAHTVS 497
Query: 520 PEVNVDAAKRVDGTLIASHPCINQNGDI-DDESGWETVSQVEDQDSSSSPEGSMIPPANK 579
P+VN+ R S+ +QNG+ +D+SGWETVS E+ SS SP+ S IP +
Sbjct: 498 PDVNLINKGR------HSNAFTDQNGEFEEDDSGWETVSHSEEHGSSYSPDES-IPNISN 557
Query: 580 NCGKSSSTSGSGSVTDWEEYGGNGETTINISEVYSELVKKSKKVSNLTKKLWKS-GHHNG 639
++S+ S +G T++E+ I EV S ++SKK+ ++ KLW S NG
Sbjct: 558 THHRNSNVSMNG--TEYEK-----TLLREIKEVCSVPRRQSKKLPSMA-KLWSSLEGMNG 617
Query: 640 GDSNKMIPVKESHRIITSSPEAESGNGG-SSPDFIGQWSSF-DLSDAQIAR--------- 699
SN + SPE S GG ++ D +GQWSS D ++A + R
Sbjct: 618 RVSN-----ARKSTVEMVSPETGSNKGGFNTLDLVGQWSSSPDSANANLNRGGRKGCIEW 673
BLAST of Bhi09G001374 vs. TAIR 10
Match:
AT3G11590.1 (unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G22310.1); Has 22320 Blast hits to 15179 proteins in 1213 species: Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi - 1700; Plants - 1146; Viruses - 65; Other Eukaryotes - 5824 (source: NCBI BLink). )
HSP 1 Score: 149.1 bits (375), Expect = 1.4e-35
Identity = 88/198 (44.44%), Postives = 138/198 (69.70%), Query Frame = 0
Query: 234 DQQVSLVSVISSLEAELKQARVRILELETERHVSKKKLESFLRKVDEEKAVWRMREHEKV 293
D+ S +S++S+L +EL++AR+++ +L E + +++ EEKAVW+ E E V
Sbjct: 248 DRPSSSMSLVSALHSELERARLQVNQLIHEHKPENNDISYLMKRFAEEKAVWKSNEQEVV 307
Query: 294 RVFIESIRTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQVC 353
IES+ EL ERK RRR E N KL +ELA+ KS + + +++ E E++ RV++E+VC
Sbjct: 308 EAAIESVAGELEVERKLRRRFESLNKKLGKELAETKSALMKAVKEIENEKRARVMVEKVC 367
Query: 354 EELAKEIGDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKLVDAKVAVEEK 413
+ELA++I +DKAE+E KRES +++EEVE ER+MLQLA+ REERVQMKL +AK +EEK
Sbjct: 368 DELARDISEDKAEVEELKRESFKVKEEVEKEREMLQLADALREERVQMKLSEAKHQLEEK 427
Query: 414 YSQMNRLVADLENFLRLR 432
+ +++L L+ +L+ +
Sbjct: 428 NAAVDKLRNQLQTYLKAK 445
BLAST of Bhi09G001374 vs. TAIR 10
Match:
AT1G11690.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20350.1); Has 5959 Blast hits to 4807 proteins in 476 species: Archae - 156; Bacteria - 436; Metazoa - 2789; Fungi - 309; Plants - 336; Viruses - 9; Other Eukaryotes - 1924 (source: NCBI BLink). )
HSP 1 Score: 112.1 bits (279), Expect = 1.8e-24
Identity = 86/258 (33.33%), Postives = 144/258 (55.81%), Query Frame = 0
Query: 206 MEGATKWD--PVGSKISDERGHIYNQRELLDQQVSLVSVISSLEAELKQARVRILELETE 265
ME T+WD + + S E + + E LD +++ L+ EL +A+ RI ELE E
Sbjct: 1 MESITEWDLGSLRTYYSVEPSENFQEDEFLD-----FNLVPCLQTELWKAQTRIKELEAE 60
Query: 266 RHVSKKKLESFLRKVDEEKAVWRMREHEKVRVFIESIRTELNHERKNRRRVEHFNSKLVR 325
+ S++ + +R EK E F++ ++ +L+ ER+ ++RV+ NS+L +
Sbjct: 61 KFKSEETIRCLIRNQRNEK-------EETTNPFVDYLKEKLSKEREEKKRVKAENSRLKK 120
Query: 326 ELADAKSLVKQLMQDYEEERKERVLIEQVCEELAKEIGDDKAEIEASKRESARLREEVEG 385
++ D +S V +L R+ER +E+VCEEL I+ K + R+ +E E
Sbjct: 121 KILDMESSVNRL-------RRERDTMEKVCEELV-------TRIDELKVNTRRVWDETEE 180
Query: 386 ERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNRLVADLENFLRLRGAISDIKE--MK 445
ER+MLQ+AE+WREERV++K +DAK+A++EKY +MN V +LE L + I+E ++
Sbjct: 181 ERQMLQMAEMWREERVRVKFMDAKLALQEKYEEMNLFVVELEKCLETAREVGGIEEKRLR 232
Query: 446 EAVILGKTASAVDIQDIK 460
L K A ++++ D K
Sbjct: 241 HGEGLIKMAKSMEVVDSK 232
BLAST of Bhi09G001374 vs. TAIR 10
Match:
AT5G22310.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11590.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 89.7 bits (221), Expect = 9.8e-18
Identity = 78/229 (34.06%), Postives = 128/229 (55.90%), Query Frame = 0
Query: 237 VSLVSVISSLEAELKQARVRILE-LETERHVSKKKLESFLRKVDEEKAVWRMREHEKVRV 296
V ++ I L + K A R++ L E ++ L+ + ++DEE+ E+ R
Sbjct: 193 VKVLKRIGELGDDHKTASNRLISALLCELDRARSSLKHLMSELDEEE--------EEKRR 252
Query: 297 FIESIRTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQVCEE 356
IES++ E ERK RRR E N +L REL +AK +++ ++ + E++ + ++E+VC+E
Sbjct: 253 LIESLQEEAMVERKLRRRTEKMNRRLGRELTEAKETERKMKEEMKREKRAKDVLEEVCDE 312
Query: 357 LAKEIGDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKLVDAKVAVEEKYS 416
L K IGDDK +E+E ER+M+ +A+V REERVQMKL +AK E+KY+
Sbjct: 313 LTKGIGDDK--------------KEMEKEREMMHIADVLREERVQMKLTEAKFEFEDKYA 372
Query: 417 QMNRLVADLENFL---RLRGAISDIKEMKEAVILGKTASAVDIQDIKQL 462
+ RL +L L +G+ S+I+ + E VI G + + D+K +
Sbjct: 373 AVERLKKELRRVLDGEEGKGS-SEIRRILE-VIDGSGSDDDEESDLKSI 397
BLAST of Bhi09G001374 vs. ExPASy Swiss-Prot
Match:
Q66GQ2 (Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 PE=2 SV=2)
HSP 1 Score: 85.5 bits (210), Expect = 2.6e-15
Identity = 62/200 (31.00%), Postives = 116/200 (58.00%), Query Frame = 0
Query: 232 LLDQQVSLVSVISSLEAELKQARVRILELETERHVSKKKLESFLRKVDEEKAVWRMREHE 291
L +Q VS +S+I +L+ E+ +RVRI EL + + +L+S ++++ EEK + + +E E
Sbjct: 209 LEEQHVSNISLIKALKTEVAHSRVRIKELLRYQQADRHELDSVVKQLAEEKLLSKNKEVE 268
Query: 292 KVRVFIESIRTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQ 351
++ ++S+R L ERK R+R E + K+ REL++ KS + +++ E K ++E
Sbjct: 269 RMSSAVQSVRKALEDERKLRKRSESLHRKMARELSEVKSSLSNCVKELERGSKSNKMMEL 328
Query: 352 VCEELAKEIGDDKAEIEASKRES--ARLREEVEGERKMLQLAEVWREERVQMKLVDAKVA 411
+C+E AK I + EI K+++ G++ +L +AE W +ER+QM+L
Sbjct: 329 LCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGGGDQLVLHIAESWLDERMQMRLEGGDTL 388
Query: 412 VEEKYSQMNRLVADLENFLR 430
+ S +++L ++E FL+
Sbjct: 389 NGKNRSVLDKLEVEIETFLQ 408
BLAST of Bhi09G001374 vs. ExPASy Swiss-Prot
Match:
F4I878 (Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1)
HSP 1 Score: 65.9 bits (159), Expect = 2.1e-09
Identity = 51/134 (38.06%), Postives = 80/134 (59.70%), Query Frame = 0
Query: 284 VWRMREHEKVRVFIESIRTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEER 343
V+ E K + I+ ++ EL++ERK RRR A+ ++K+L +D EEER
Sbjct: 71 VFMESELGKAQDEIKELKAELDYERKARRR--------------AELMIKKLAKDVEEER 130
Query: 344 KERVLIEQVCEELAKEIGDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKL 403
R E + L KE+ +K+E+ R++ ++E ER+M +LAEV REERVQMKL
Sbjct: 131 MAREAEEMQNKRLFKELSSEKSEM-------VRMKRDLEEERQMHRLAEVLREERVQMKL 183
Query: 404 VDAKVAVEEKYSQM 418
+DA++ +EEK S++
Sbjct: 191 MDARLFLEEKLSEL 183
BLAST of Bhi09G001374 vs. ExPASy TrEMBL
Match:
A0A5D3DHI0 (Putative F11F12.2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G001150 PE=4 SV=1)
HSP 1 Score: 1173.7 bits (3035), Expect = 0.0e+00
Identity = 631/701 (90.01%), Postives = 658/701 (93.87%), Query Frame = 0
Query: 1 METTMKLPPPAKSHQVHTFTSSLYPKSVNQSPELDLQQTPTSRKDSRRRIRNLSLIKRKL 60
METTMKLP PAKS Q+ TFT+SLYPKSVN+SPELDLQQTP+SRKDSRRRIRNLSLIKRKL
Sbjct: 1 METTMKLPLPAKSQQIPTFTTSLYPKSVNRSPELDLQQTPSSRKDSRRRIRNLSLIKRKL 60
Query: 61 APSGRRSRPQTPLLKWKVEERVDGGGEEDQDEKKSESENGGKDLRQASGERDVIVSARKL 120
APS RRSRPQTPLLKWKVEERVDGGGE D+DE KSE ENGGKDL++ SGERDVIVSARKL
Sbjct: 61 APSSRRSRPQTPLLKWKVEERVDGGGEVDEDETKSELENGGKDLQRVSGERDVIVSARKL 120
Query: 121 AAGFWRFQKPEVSADGGRNGLRRKQEQGIGFQPVAGHVRVPILRHHNNNIFSNETRDLLQ 180
AAGFWRFQKPEVSADGGR+GL+R QEQGIG QPVAGHVRVPILRHHN+NIFSNETRDL+Q
Sbjct: 121 AAGFWRFQKPEVSADGGRSGLKRTQEQGIGSQPVAGHVRVPILRHHNSNIFSNETRDLIQ 180
Query: 181 GQPS-SGMRNGVLCKLEPFFQFSNSVMEGATKWDPVGSKISDERGHIYNQRELLDQQVSL 240
GQPS SGMRNGVLCKLEPFFQFSNSVMEGATKWDP+GSKISD+RGHIY QRELLDQQVSL
Sbjct: 181 GQPSTSGMRNGVLCKLEPFFQFSNSVMEGATKWDPIGSKISDDRGHIYIQRELLDQQVSL 240
Query: 241 VSVISSLEAELKQARVRILELETERHVSKKKLESFLRKVDEEKAVWRMREHEKVRVFIES 300
VSVISSLEAELKQAR RILELETERH SKKKLESFLRKV EEKA+WRMREHEKVRVFIES
Sbjct: 241 VSVISSLEAELKQARGRILELETERHASKKKLESFLRKVGEEKALWRMREHEKVRVFIES 300
Query: 301 IRTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQVCEELAKE 360
IRTELNHERKNRRRVEHFNSKLV ELADAKSLVKQLMQDYEEERKERVLIEQVCEELAKE
Sbjct: 301 IRTELNHERKNRRRVEHFNSKLVHELADAKSLVKQLMQDYEEERKERVLIEQVCEELAKE 360
Query: 361 IGDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNR 420
IGD+KA+IEASKRESARLREEVE ERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNR
Sbjct: 361 IGDNKADIEASKRESARLREEVEEERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNR 420
Query: 421 LVADLENFLRLRGAISDIKEMKEAVILGKTASAVDIQDIKQLSYQHPKPDDIFSIFEEVN 480
LVADLENFLRLRGAISDIKEMKEAVILGKTASAV+IQDIKQLSYQH KPDDIFSIFEEVN
Sbjct: 421 LVADLENFLRLRGAISDIKEMKEAVILGKTASAVNIQDIKQLSYQHSKPDDIFSIFEEVN 480
Query: 481 FDENHEREVKPYGSYSPATEISKVGTTSPEVNVDAAKRVDGTLIASHPCINQNGDIDDES 540
FDENHEREVKPYGSYSPATEISKV T SPEVNVD AKR DGTL+ S CI+QNG+IDDES
Sbjct: 481 FDENHEREVKPYGSYSPATEISKVRTVSPEVNVDTAKRADGTLMVSRTCIDQNGEIDDES 540
Query: 541 GWETVSQVEDQDSSSSPEGSMIPPANKNCGKSSSTSGSGSVTDWEEY--GGNGETTINIS 600
GWETVSQVEDQDSSSSPEGS + PANKNCGKSSSTSGS SVTDWEEY GG GE+TINIS
Sbjct: 541 GWETVSQVEDQDSSSSPEGSTVLPANKNCGKSSSTSGS-SVTDWEEYGGGGGGESTINIS 600
Query: 601 EVYSELVKKSKKVSNLTKKLWKSGHHNGGDSNKMIPVKESHRIITSSPEAESGNGGSSPD 660
EVYSELVKKSKKVSNLTK+LWKSGHHNGGDSNKM+PVKE H I +S PEAESGNG SSPD
Sbjct: 601 EVYSELVKKSKKVSNLTKRLWKSGHHNGGDSNKMMPVKEPHGITSSPPEAESGNGESSPD 660
Query: 661 FIGQWSSFDLSDAQIARQRKVQINVKESQKLQLRHVLKQKI 699
F GQWSSFDL IARQRKVQIN K++QKLQLRHVL QKI
Sbjct: 661 FTGQWSSFDL----IARQRKVQINAKDNQKLQLRHVLNQKI 696
BLAST of Bhi09G001374 vs. ExPASy TrEMBL
Match:
A0A0A0K768 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G209530 PE=4 SV=1)
HSP 1 Score: 1169.5 bits (3024), Expect = 0.0e+00
Identity = 634/709 (89.42%), Postives = 662/709 (93.37%), Query Frame = 0
Query: 1 METTMK----LPPPAKSHQVHTFTSSLYPKSVNQSPELDLQQTPTSRKDSRRRIRNLSLI 60
ME+TMK LP PAKS + TFTSSLYPKSVN+SPELDLQQTP+SRKDSRRRIRNLS I
Sbjct: 1 MESTMKLPLPLPLPAKSQHIPTFTSSLYPKSVNRSPELDLQQTPSSRKDSRRRIRNLSFI 60
Query: 61 KRKLAPSGRRSRPQTPLLKWKVEERVDGGGEEDQDEKKSESENGGKDLRQASGERDVIVS 120
KRKLAPSGRRSRPQTPLLKWKVEERVDGGGE D+DEKKSESENGGKDL++ SGERDVIVS
Sbjct: 61 KRKLAPSGRRSRPQTPLLKWKVEERVDGGGEMDEDEKKSESENGGKDLQRVSGERDVIVS 120
Query: 121 ARKLAAGFWRFQKPEVSADGGRNGLRRKQEQGIGFQPVAGHVRVPILRHHNNNIFSNETR 180
ARKLAAGFWRFQKPEVS DGG++GL+R QEQGIG QPVAGHVRVPILRHHN+NIFSNETR
Sbjct: 121 ARKLAAGFWRFQKPEVSVDGGKSGLKRTQEQGIGSQPVAGHVRVPILRHHNSNIFSNETR 180
Query: 181 DLLQGQPS-SGMRNGVLCKLEPFFQFSNSVMEGATKWDPVGSKISDERGH-IYNQRELLD 240
DL+QGQPS SG+RNGVL KLEPFFQFSNSVMEGATKWDP+GSKISD+RG IYNQRELLD
Sbjct: 181 DLIQGQPSTSGVRNGVLRKLEPFFQFSNSVMEGATKWDPIGSKISDDRGGLIYNQRELLD 240
Query: 241 QQVSLVSVISSLEAELKQARVRILELETERHVSKKKLESFLRKVDEEKAVWRMREHEKVR 300
QQVSLVSVISSLEAELKQ RVRILELETERH SKKKLESFLRKVDEEKAVWRMREHEKVR
Sbjct: 241 QQVSLVSVISSLEAELKQTRVRILELETERHASKKKLESFLRKVDEEKAVWRMREHEKVR 300
Query: 301 VFIESIRTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQVCE 360
VFIESIRTELNHERKNRRRVEHFNSKLV ELADAKSLVK+LMQDYEEERKERVLIEQVCE
Sbjct: 301 VFIESIRTELNHERKNRRRVEHFNSKLVHELADAKSLVKRLMQDYEEERKERVLIEQVCE 360
Query: 361 ELAKEIGDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKLVDAKVAVEEKY 420
ELAKEIGDDKAEIEASKRESARLREEVE ERKMLQLAEVWREERVQMKLVDAKVAVEEKY
Sbjct: 361 ELAKEIGDDKAEIEASKRESARLREEVEEERKMLQLAEVWREERVQMKLVDAKVAVEEKY 420
Query: 421 SQMNRLVADLENFLRLRGAISDIKEMKEAVILGKTASAVDIQDIKQLSYQHPKPDDIFSI 480
SQMNRLVADLENFLRLRGAISDIKEMKEAVILGKTASA++IQDIKQLSYQH KPDDIFSI
Sbjct: 421 SQMNRLVADLENFLRLRGAISDIKEMKEAVILGKTASALNIQDIKQLSYQHSKPDDIFSI 480
Query: 481 FEEVNFDENHEREVKPYGSYSPATEISKVGTTSPEVNVDAAKRVDGTLIASHPCINQNGD 540
FEEVNFDENHEREVKPYGS+SPAT ISKVGTTSPEVNVD AKRVDGTL+AS CINQNG+
Sbjct: 481 FEEVNFDENHEREVKPYGSFSPATVISKVGTTSPEVNVDTAKRVDGTLMASRTCINQNGE 540
Query: 541 IDDESGWETVSQVEDQDSSSSPEGSMIPPANKNCGKSSSTSGSGSVTDWEEY----GGNG 600
IDDESGWETVSQVEDQDSSSSPEGS I PANKNCGKSSSTSGS SVTDWEEY GG G
Sbjct: 541 IDDESGWETVSQVEDQDSSSSPEGSTILPANKNCGKSSSTSGS-SVTDWEEYGHGGGGGG 600
Query: 601 ETTINISEVYSELVKKSKKVSNLTKKLWKSGHHNGGDSNKMIPVKE-SHRIITSSPEAES 660
E+TIN+SEVYSELVKKSKKVSNLTK+LWKSGHHNGGDSNKMI VKE H I +SSP+AES
Sbjct: 601 ESTINVSEVYSELVKKSKKVSNLTKRLWKSGHHNGGDSNKMITVKEPPHGITSSSPDAES 660
Query: 661 GNGGSSPDFIGQWSSFDLSDAQIARQRKVQINVKESQKLQLRHVLKQKI 699
GNG SPDF GQW SFD+SD QIARQRKVQIN KE+QKLQLRHVL QKI
Sbjct: 661 GNGEYSPDFTGQWGSFDISDGQIARQRKVQINAKENQKLQLRHVLNQKI 708
BLAST of Bhi09G001374 vs. ExPASy TrEMBL
Match:
A0A6J1CCS7 (uncharacterized protein LOC111010326 OS=Momordica charantia OX=3673 GN=LOC111010326 PE=4 SV=1)
HSP 1 Score: 1083.9 bits (2802), Expect = 0.0e+00
Identity = 592/705 (83.97%), Postives = 638/705 (90.50%), Query Frame = 0
Query: 1 METTMKL--PPPAKSHQVHTFTSSLYPKSVNQSPELDLQQTPTSRKDSRRRIRNLSLIKR 60
METT KL PPPAK + TFTSSLY KSVNQSPELDLQQTP+SRKD+RRRIRNLSLIKR
Sbjct: 1 METTTKLLPPPPAKFQLLPTFTSSLYSKSVNQSPELDLQQTPSSRKDARRRIRNLSLIKR 60
Query: 61 KLAPSGRRSRPQTPLLKWKVEERVDGGGE--EDQDEKKSESENGGKDLRQASGERDVIVS 120
K APSGRRSRPQTPLLKWKVEER DGG + ED+DEKKS SEN GKDLR+ S ERDVIVS
Sbjct: 61 KAAPSGRRSRPQTPLLKWKVEERSDGGDDRAEDEDEKKSVSEN-GKDLRRISRERDVIVS 120
Query: 121 ARKLAAGFWRFQKPEVSADGGRNGLRRKQEQGIGFQPVAGHVRVPILRHHNNNIFSNETR 180
ARKLAAGFWRFQKPEVSADGGR L R QEQ GFQ VAGHVR+PILRHHNNNIFSNETR
Sbjct: 121 ARKLAAGFWRFQKPEVSADGGRRNLIRTQEQQPGFQSVAGHVRLPILRHHNNNIFSNETR 180
Query: 181 DLLQGQPS-SGMRNGVLCKLEPFFQFSNSVMEGATKWDPVGSKISDERGHIYNQRELLDQ 240
DLLQGQPS SG+RNG+LCKLEPFFQFSNSVMEGATKWDP+GSKI+DERG+IYNQ ELLD+
Sbjct: 181 DLLQGQPSTSGVRNGLLCKLEPFFQFSNSVMEGATKWDPIGSKIADERGNIYNQTELLDR 240
Query: 241 QVSLVSVISSLEAELKQARVRILELETERHVSKKKLESFLRKVDEEKAVWRMREHEKVRV 300
Q+SLVSV+++LEAELKQARVRILELETERH SKKKLE+FLRKVDEEK VWRMREHEK+RV
Sbjct: 241 QMSLVSVVTALEAELKQARVRILELETERHASKKKLENFLRKVDEEKTVWRMREHEKIRV 300
Query: 301 FIESIRTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQVCEE 360
FIES+RTELNHERKNRRR EHF+SKLV EL DAKSLVK+LMQDYEEERKER LIEQVCEE
Sbjct: 301 FIESMRTELNHERKNRRRAEHFSSKLVHELTDAKSLVKRLMQDYEEERKERELIEQVCEE 360
Query: 361 LAKEIGDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKLVDAKVAVEEKYS 420
LAKEIGDDKAE+EASKRESA+LREEVE ERKMLQLAEVWREERVQMKLVDAKVAVEEKYS
Sbjct: 361 LAKEIGDDKAEMEASKRESAKLREEVEEERKMLQLAEVWREERVQMKLVDAKVAVEEKYS 420
Query: 421 QMNRLVADLENFLRLRGAISDIKEMKEAVILGKTASAVDIQDIKQLSYQHPKPDDIFSIF 480
QMN+L ADL+NFL+ RGAISDIKEM+EAV+LG AS+V IQDI+Q +YQ KPDDIFSIF
Sbjct: 421 QMNKLAADLQNFLKSRGAISDIKEMREAVLLGNAASSVSIQDIRQFTYQPSKPDDIFSIF 480
Query: 481 EEVNFDENHEREVKPYGSYSPATEISKVGTTSPEVNVDAAKRVDGTLIASHPCINQNGDI 540
EEVNFDENHEREVKPYGS SPATE SKVG+TSP+VNVDAAKR DG LI SH I+QNGDI
Sbjct: 481 EEVNFDENHEREVKPYGSDSPATETSKVGSTSPDVNVDAAKRGDGALICSHALIDQNGDI 540
Query: 541 DDESGWETVSQVEDQDSSSSPEGSMI-PPANKNCGKSSSTSGSGSVTDWEEYGGNGETTI 600
DDESGWETVSQVEDQDSS SPEGSM PPANKNC KSSSTSG+ S TDWE YGG GETTI
Sbjct: 541 DDESGWETVSQVEDQDSSYSPEGSMTKPPANKNCKKSSSTSGTESRTDWEAYGG-GETTI 600
Query: 601 NISEVYSELVKKSKKVSNLTKKLWKSGHHNGGDSNK-MIPVKESHRIITSSPEAESGNGG 660
NISEVYSELVKKSKKVS+LTK+LWKSGH+NGGDSNK M+PVKE + +S E ES NGG
Sbjct: 601 NISEVYSELVKKSKKVSSLTKRLWKSGHNNGGDSNKMMVPVKECNGRASSPEEGESVNGG 660
Query: 661 SSPDFIGQWSSFDLSDAQIARQRKVQINVKESQKLQLRHVLKQKI 699
SSPDF+GQW+SFDL +AQIARQRK QI+VKESQKLQLRHVLKQKI
Sbjct: 661 SSPDFMGQWNSFDLGEAQIARQRKGQISVKESQKLQLRHVLKQKI 703
BLAST of Bhi09G001374 vs. ExPASy TrEMBL
Match:
A0A6J1FIU3 (uncharacterized protein LOC111445867 OS=Cucurbita moschata OX=3662 GN=LOC111445867 PE=4 SV=1)
HSP 1 Score: 1082.8 bits (2799), Expect = 0.0e+00
Identity = 593/699 (84.84%), Postives = 630/699 (90.13%), Query Frame = 0
Query: 2 ETTMKLPPPAKSHQVHTFTSSLYPKSVNQSPELDLQQTPTSRKDSRRRIRNLSLIKRKLA 61
+TT+KL AKS ++ TF SSL PKS NQSPELDL+QT +SR+DSRRRIRNLSLIKRKLA
Sbjct: 3 KTTVKLSVLAKSQRIPTFASSLNPKSGNQSPELDLRQTLSSRRDSRRRIRNLSLIKRKLA 62
Query: 62 PSGRRSRPQTPLLKWKVEERVDGGGEEDQDEKKSESENGGKDLRQASGERDVIVSARKLA 121
PSG RSRPQTPLLKWKVE RVDG GE D+DEKKSESENGGKDLR+ S ERDV VSARKLA
Sbjct: 63 PSGPRSRPQTPLLKWKVEVRVDGEGEGDEDEKKSESENGGKDLRRMSRERDVNVSARKLA 122
Query: 122 AGFWRFQKPEVSADGGRNGLRRKQEQGIGFQPVAGHVRVPILRHHNNNIFSNETRDLLQG 181
AGFWRFQKPEVSADGGR GLRR EQGIGFQPVAGHVRVPILRHHNNNI SNETRDLLQ
Sbjct: 123 AGFWRFQKPEVSADGGRRGLRRTLEQGIGFQPVAGHVRVPILRHHNNNILSNETRDLLQS 182
Query: 182 QPS-SGMRNGVLCKLEPFFQFSNSVMEGATKWDPVGSKISDERGHIYNQRELLDQQVSLV 241
QPS SGMRNGVLCKLEPFFQFSNSVMEGATKWDP+GSKISDERGHIYNQ ELLDQQ+SLV
Sbjct: 183 QPSTSGMRNGVLCKLEPFFQFSNSVMEGATKWDPIGSKISDERGHIYNQTELLDQQMSLV 242
Query: 242 SVISSLEAELKQARVRILELETERHVSKKKLESFLRKVDEEKAVWRMREHEKVRVFIESI 301
SVI +L+AELKQA+V ILELETERHVSKKKLESFLRKVD+EK WRMREH+K+RVF+ESI
Sbjct: 243 SVICALQAELKQAQVHILELETERHVSKKKLESFLRKVDKEKTAWRMREHDKIRVFMESI 302
Query: 302 RTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQVCEELAKEI 361
RTELN+ERKNRR EHFNSKLV ELADAKSLVKQLM+DYEEERKERVLIEQVCEELAKEI
Sbjct: 303 RTELNYERKNRRGAEHFNSKLVHELADAKSLVKQLMRDYEEERKERVLIEQVCEELAKEI 362
Query: 362 GDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNRL 421
GDDKAEIEASKRESA+LREE E ERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNRL
Sbjct: 363 GDDKAEIEASKRESAKLREEAEEERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNRL 422
Query: 422 VADLENFLRLRGAISDIKEMKEAVILGKTASAVDIQDIKQLSYQHPKPDDIFSIFEEVNF 481
V+DLENFLR RGAISDIKEM+EA++LG+ ASAV+IQDIKQLSYQ KPDDIFSI E VNF
Sbjct: 423 VSDLENFLRSRGAISDIKEMREAILLGQAASAVNIQDIKQLSYQPSKPDDIFSILEGVNF 482
Query: 482 DENHEREVKPYGSYSPATEISKVGTTSPEVNVDAAKRVDGTLIASHPCINQNGDIDDESG 541
DEN E+EV PYGSYSPATEI K GTTSP++ VDAAKRVDGTL+ASH CI+QNGDIDDESG
Sbjct: 483 DENQEKEVNPYGSYSPATEIPKAGTTSPDLTVDAAKRVDGTLMASHACIDQNGDIDDESG 542
Query: 542 WETVSQVEDQDSSSSPEGSMIPP-ANKNCGKSSSTSGSGSVTDWEEYGGNGETTINISEV 601
WETVSQVEDQDSS S EG IPP ANKNC K SS SGSGS TDW ETTINISEV
Sbjct: 543 WETVSQVEDQDSSYSLEGCTIPPAANKNC-KKSSISGSGSGTDW-------ETTINISEV 602
Query: 602 YSELVKKSKKVSNLTKKLWKSGHHNGGDSNKMIPVKESHRIITSSPEAESGNGGSSPDFI 661
YSELVKKSKKVSNLTK+LWKSGH+NG K IPVKES+ I SSPEAESGNGGSSPDFI
Sbjct: 603 YSELVKKSKKVSNLTKRLWKSGHNNGRGDIKTIPVKESNG-IASSPEAESGNGGSSPDFI 662
Query: 662 GQWSSFDLSDAQIARQRKVQINVKESQKLQLRHVLKQKI 699
G+WSSFDLSDA+IARQRKVQINVKESQKLQLRH LKQKI
Sbjct: 663 GRWSSFDLSDARIARQRKVQINVKESQKLQLRHALKQKI 692
BLAST of Bhi09G001374 vs. ExPASy TrEMBL
Match:
A0A6J1J201 (uncharacterized protein LOC111480535 OS=Cucurbita maxima OX=3661 GN=LOC111480535 PE=4 SV=1)
HSP 1 Score: 1069.3 bits (2764), Expect = 6.7e-309
Identity = 588/699 (84.12%), Postives = 625/699 (89.41%), Query Frame = 0
Query: 2 ETTMKLPPPAKSHQVHTFTSSLYPKSVNQSPELDLQQTPTSRKDSRRRIRNLSLIKRKLA 61
+TT+KL AKS + TF SSL PKS NQSPELDL+QT +SR+DSRRRIRNLSLIK+KLA
Sbjct: 3 KTTVKLSVLAKSQGIPTFDSSLNPKSGNQSPELDLRQTLSSRRDSRRRIRNLSLIKKKLA 62
Query: 62 PSGRRSRPQTPLLKWKVEERVDGGGEEDQDEKKSESENGGKDLRQASGERDVIVSARKLA 121
PSG RSRPQTPLLKWKVEERVDG GE D+DEKKSE ENGGKDLR+ S ERDV VSARKLA
Sbjct: 63 PSGTRSRPQTPLLKWKVEERVDGEGEGDEDEKKSELENGGKDLRRMSRERDVNVSARKLA 122
Query: 122 AGFWRFQKPEVSADGGRNGLRRKQEQGIGFQPVAGHVRVPILRHHNNNIFSNETRDLLQG 181
AGFWRFQKPEVSADGGR GLRR QEQGIGFQPVAGHVR+PILRHH NNI SNETRDLLQ
Sbjct: 123 AGFWRFQKPEVSADGGRRGLRRTQEQGIGFQPVAGHVRLPILRHH-NNILSNETRDLLQS 182
Query: 182 QPS-SGMRNGVLCKLEPFFQFSNSVMEGATKWDPVGSKISDERGHIYNQRELLDQQVSLV 241
QPS SGMRNGVLCKLEPFFQFSNSVMEGATKWDP+ SKISDERGHIYNQ ELLDQQ+SLV
Sbjct: 183 QPSTSGMRNGVLCKLEPFFQFSNSVMEGATKWDPIRSKISDERGHIYNQTELLDQQMSLV 242
Query: 242 SVISSLEAELKQARVRILELETERHVSKKKLESFLRKVDEEKAVWRMREHEKVRVFIESI 301
SVI +L+AELKQARV ILELETERHVSKKKLESFLRKVD+EK WRMREH+K+RVF+ESI
Sbjct: 243 SVICALQAELKQARVHILELETERHVSKKKLESFLRKVDKEKTAWRMREHDKIRVFMESI 302
Query: 302 RTELNHERKNRRRVEHFNSKLVRELADAKSLVKQLMQDYEEERKERVLIEQVCEELAKEI 361
RTELN+ERKNRR EHFNSKLV ELADAKSLVKQLMQDYEEERKER LIEQVCEELAKEI
Sbjct: 303 RTELNYERKNRRGAEHFNSKLVHELADAKSLVKQLMQDYEEERKERALIEQVCEELAKEI 362
Query: 362 GDDKAEIEASKRESARLREEVEGERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNRL 421
G+DKAEIE SKRESA+LREE E ERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNRL
Sbjct: 363 GNDKAEIEVSKRESAKLREEAEEERKMLQLAEVWREERVQMKLVDAKVAVEEKYSQMNRL 422
Query: 422 VADLENFLRLRGAISDIKEMKEAVILGKTASAVDIQDIKQLSYQHPKPDDIFSIFEEVNF 481
V+DLENFLR RGAISDIKEM+EA++LG+ ASAV+IQDIKQLSYQ KPDDIFSI E VNF
Sbjct: 423 VSDLENFLRSRGAISDIKEMREAILLGQAASAVNIQDIKQLSYQPSKPDDIFSILEGVNF 482
Query: 482 DENHEREVKPYGSYSPATEISKVGTTSPEVNVDAAKRVDGTLIASHPCINQNGDIDDESG 541
DEN EREV PYGSYSPATEI K GTTSP++NVDAAKRVDG L+ASH CI+QNGDIDDESG
Sbjct: 483 DENQEREVNPYGSYSPATEIPKAGTTSPDLNVDAAKRVDGILMASHACIDQNGDIDDESG 542
Query: 542 WETVSQVEDQDSSSSPEGSMIPP-ANKNCGKSSSTSGSGSVTDWEEYGGNGETTINISEV 601
WETVSQVEDQDSS S EG IPP ANKN KSS SGSGS TDW ETTINISEV
Sbjct: 543 WETVSQVEDQDSSYSLEGCTIPPAANKNSKKSSIISGSGSGTDW-------ETTINISEV 602
Query: 602 YSELVKKSKKVSNLTKKLWKSGHHNGGDSNKMIPVKESHRIITSSPEAESGNGGSSPDFI 661
YSELVKKSKKVSNLTK+LWKSGH+NG K IPVKES+ + SSPEAESGNGGSSPDFI
Sbjct: 603 YSELVKKSKKVSNLTKRLWKSGHNNGRGDIKTIPVKESNG-MASSPEAESGNGGSSPDFI 662
Query: 662 GQWSSFDLSDAQIARQRKVQINVKESQKLQLRHVLKQKI 699
G+WSSFDLSDA+IARQRKVQINVKESQKLQLRH LKQKI
Sbjct: 663 GRWSSFDLSDARIARQRKVQINVKESQKLQLRHALKQKI 692
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
AT1G50660.1 | 8.3e-142 | 47.46 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas... | [more] |
AT3G20350.1 | 7.5e-111 | 42.57 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma mem... | [more] |
AT3G11590.1 | 1.4e-35 | 44.44 | unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures;... | [more] |
AT1G11690.1 | 1.8e-24 | 33.33 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT5G22310.1 | 9.8e-18 | 34.06 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
Match Name | E-value | Identity | Description | |
Q66GQ2 | 2.6e-15 | 31.00 | Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 P... | [more] |
F4I878 | 2.1e-09 | 38.06 | Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A5D3DHI0 | 0.0e+00 | 90.01 | Putative F11F12.2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffo... | [more] |
A0A0A0K768 | 0.0e+00 | 89.42 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G209530 PE=4 SV=1 | [more] |
A0A6J1CCS7 | 0.0e+00 | 83.97 | uncharacterized protein LOC111010326 OS=Momordica charantia OX=3673 GN=LOC111010... | [more] |
A0A6J1FIU3 | 0.0e+00 | 84.84 | uncharacterized protein LOC111445867 OS=Cucurbita moschata OX=3662 GN=LOC1114458... | [more] |
A0A6J1J201 | 6.7e-309 | 84.12 | uncharacterized protein LOC111480535 OS=Cucurbita maxima OX=3661 GN=LOC111480535... | [more] |