Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTGAGTTAAAGTAAGTGGTAGAGAAGTTTTATCAAAATGAGTCATTTTATAAAATTTTCCAAGTTTATAGGCATTTTTTACCATTTAGAAAAAAAAAAAAACCAGCGTCCATCATCCTTTTCTTCCTCTCCATTGCGCCGACCGACCACCCCCTTCTTCAAACCTCTTCACTGGCGACCAAACCCGCGAGCAGCTGCGTTACCACGGACTCAGCGACCAGTCCTTGCCATCCGACGATTCAGATCGACGTGCTCTTGCTTCACGTTCAATGGCGGAGGTAGTCTGCGACATGAATACCTATTCGAAGCATGTTTCTTTTCCATTTCAGATTTCGGCGCGAACATCAAGTTATTCAACACGTTTTCGTCGACTATGAGAATTTTCCAGCAACGTTTTTTACTTGGGTGAGCTTTTGACGAGTTTCGGCGTTAACCCACTCTCCAGTTGGTTTCAAAGAAGTTTACCCATAACGTTTAAGGTAAGGACGTTTTCGGACCCCAACCAGCAAGGTTCGAAACCTTTTCGGTAAGTTTTGATATGGGTTATTGCTTTATTTTTCCTTTTCGAATTGGATGTGAGCGTTGGAATATTGTTCTGAACTCTCGAAATTTGTTTGTTGTTATGCTTGGATAAATTATTTAGTATTTGATTAAGGTTCAGCAAGGTTGAACACGTTTCGGACTAGTTCAGACTCAGTTAAAGCATGTTAGAGCAATCTTTGGTTTGTGAAAGTGTGTTGGTTGTATTTTTGGACTAATTCGGGGTGATTAATTAAGGTTTATGGTGGATTTTGGTTAAGAATATGTGTTTTGGAAAACCTCCCAATGTTTATTGAGGTTTTCGAACTAAGTTCTAACACTTAAACATGATGTTTATTATTTAGGCGTTGTAAGAGTTGATGTTCGTTCAATGAAGTGGTTTTTCGGATTGGGAATTATAGGATTGTTGATCGAGTAAGTAGTTTCCACATTCTTCCACAATGTAAATTTTGATAAAGATATATTTTGACAGCTATGTTTTGATATACGATTGAAAGTTTGAAAATAGGTTTTGCACCATTTAAAACAAACTAAACCTGTTTTGGATGTTTTAAATAGTTTCTTATTAGAAAACATGTTTTATGTGATTTGAGAATCTACTGAAAGCATTTTGTATACTTTAGTTGATTTTGTCAAATAATGCTTTGAATGATGGCTGGAAGTTTTTTGACTGATCTTGAAGTTTTCATTTTTGTTTAGCAAACTTTTGAAAGGTTTGAATGGTTTGATTTGAGGGTTTCTGCTGCTGAAATTGCTATTTGAAACTATAGGTTTTGTTTCAAATGTTTTTGAATGTTATGATTGGGGCTATTTGAATATGTAGTTTTGTGTTTTGGGGATTGTATGGTATTTTGTTGTAGTCAAATTTGGAACCGACGGTTTCTGTTTTGATTGAAGCTTATTTGTGAAAAGATACATTTTTGATGTGGTTTGGCATTCTTCAGAATGAAGATTCAAGTGTGTGTTCACTCACATAGCCTTACACTTGAGTAGTTTATTCTGTTCTGGTTGGTTGTTAACCAATATTTTGCTGTTCTAGTCGGTTGTTAAACGACATTTCTATTGTTTTGGTTGATTTTATCGATATTTTTGCTGGTGTTACCGATGGGTAATCATTTCTACAGGTGTTACTAATGCATAACCATTTTTGTTGGGTTACTTCAAATTTAAGCATCTTTCCTGTTCTGTATTTGAAAGTTATGAATGATTGAAAGCAGTATCATAAAAAGTATAAACTAATTGAAAGTTTTGTTCTGTATTTGAAAATATTGATTGGTTTGAAAACTTTGTATTTGAAAGATTTGTGTTTTAGAAAAGTATGAATGTTCTGTTTATTAAAAGTTTTGAAAGCTTATTTGAACGTTTTGATTTTGTTGAAAGACTTTTTTGAAAGTTTTAGCTTGAAACGTTTAGTTTGAGAGATTATTTGCTTGAAAGATTTTGATTTCGAAAGTATCATTTTGACTGAAAGTTCTGAAATGACTTGTCTTTTGATTTTCAAAAGTTTTGAAAGAAAATGGAGTTCAAAACTTGATTTAGGAATTGCTTATTGAGTATTTTTGTACTCATCTTTGTTTTTCCAAAATGTTTTCAAATGAAGGATCAAGCGATGGTTTTGAAAGAAGTGAAGTTTGGTGGTTTGGCGTAGGACATCAGAGGATGTTTACATTTGGATTTGATAGTTTGACTTATTACTTGTATTGTAATAATTTGCTTGCACTGAGTTGTGTAAGTTTTAAAAGTCATAATTGTTATTTTTTCTGCATTTTTTGTTATACTAGTTGTATTGAAAAATACGGGCTGTTACAAATCCGTTGAAAGAGAAGCTGATTCCAACAGAGTTGTTACAATGTCGAAGTCCCTTCAACTTCTTCATGAGCTTTGTATTCAGTAAGTTCCTCTTGTGCACACGCTCCTTCTGTCTCTCTCTACCTCTCAAATTGCTTGTATGTAATTTATCCTTTTTGTGATTTTAGGTTTTCGGAACCAATCATCAAATCCCTTTCGAAAATTTACGATAAACCTTCAGAGGGGTCGAATGTCTCTGTTACAGCCATTCTCGAATCCCTTCTACCTCGAAAAACCTCACTTCCAAATAATCCTTCTGAGGACGATATTTACTCCTCCATCAAAGACTTCACACTAGCATGTGGTTTAATATTGTCTTCTTGTTCCTCGACCTTTGACCTTCTTTCATGGATTCCGAAAGACCTCTCACTTACTGCCGAGTCGGCGTTTCGAATGCTCTCGAAGGCGTATGCCTCTGCTTCTTGCGATGGGTTTTCGAAGTACATTGAGGAGCTGGGTTTAGATTTGAGTTTGATACCAAAGGAGAAAAGATTAGTGGTGGAAATCATTCCTAAGGTGCTTCCGCTGTTGAAGGAGAGTATTAAGGAGAGCTCGATAGATAAATCTGACGAGGTTGATGAGGTCTCCGCTGCATCAGCGAGGGTGCCTGTGGGGTTTGCTATTGTTGCGGCCCATCAACTTGGATGGTTCATTACTCAGGTTTGAGGTTTTGTTCTTGTTAGATATAGTGCAGTTCAGTGAATGGGTGTGTTGCCAGTATTTCAAGTTGATTTGGGGTTTGTTTATGGAAAAATTGTTGCAGATTGATTATCCACATTTGGGGAAGTTGTGTAATTTGGTGATTCCTTGTGCATTGACTGCTCTTGATCATTGGTCACCAGAAGTCAAAGTAAGCATTCTGATTTCCTTGTTTGCTATCTCTGATGCTTGCCAATTATGTTGGATTTTATGCTCATATTGGTAATGGAACTTAGTAATCTGTTTCAGATTTATGTTGAGTGAACGAGAAGCTTATTTCTTAATTTCATTGTTGGAAGTTGCTCATGATACTGAATGAAAGGCTTTAACTTCTTTGCCTATTTTTAGGGGCAGGGTATGGTTAGCTTTATTCATCTTGCAAAAAATGTCAATGCTGCAGAGCTTGGTTGGTATGAAGATGCGATTCTTGATGCATGCTGCTCAAATGTCCCTTCAAGTGATGAGATATGGCCTTATGTTGTTGAAATGTCAGTTCTTCTTGCAACTAGCATTCACAAAATGAATCCTCGTAGCTCATGGTAATTTTTTGTTGTAATGCATTATCCCATTCTTTTCAACATTTGGTAATAAGTGGCAACGCTTCTAGAAGTTGTGCTTATTCGCCCTAGTCATTTTAACTGTATGGGTTTTGTCGTGTCAAATATATATATATATATATATATATATATATATAGGATTGAAAGGATGGTTAATGAGATGCTGGGTCACTTGGAGCGACAGCCAAGGAACAAGGAACGGCGTATTGCATGGCTTCAACACATTGAACCTCTTTTCAACTGTATGGGTTTGGTGCTGTTGGCTCATACAAGGCGCATATTTCCACTTTTCTTTCAGTGGATGAATGCTGAAGATGATGAAACTACTTTACTGGTAAGAGATATATGATTTGTCATTTAAAATATACAGGATAACATTATATGATGAATAATACTTCCTGATCGCTTCTCCTGAGGAGCATTATATTTTTGACAACTAGAAGAGAAGAATGGTGAATAGGAGTTAGAGAAGAGAGATCTATCTCCTTTTTTCAAATCAATAAATAACTGTTAAATTTTTAATGGTGTTAGTAGGTAGATTAATTATTGTTCATATTATATCTATTCTCTAAATTCTTAAGGTGTTAAGACTCTTCTACTTCCATTAGGAAGAAGACGTTCTTGTTTGAGAATACATCAGAGGATTTTAAAGTTAACTTTCTTGTGTCAATTCCTATTGAAAACTTTCTCGTGTCAACTTCCTGATTTGCAGTCTTTGAGTTCAAGCATCAAAGGAATAGCTTTTGTTTAACCTCTGACTTCATTTTGGAATGCAAATGCAAGGGCAAGGGCAAGGGCAAGGGCAAGGGGAGGGACCTTAAATTTCTTACAAATGTTTTCTTATGCGTAGTAAAGAGCAGTCCAATTTATACTTCTACCTTCTATTTTCTGTATGCGTAGTTGGGATCCGTATCAACCATACCTTTAGACTGTAAATATTCAAAGAAACATTCATGTCCATAGTGAATTTTGTGAACTCACTTTTTCTCTGTTGAGAAGAAGTCACTTCAAAAACCTCTAAATAACGTAACTCTTAATTGCAAAACTAATTTCCTGATCAAACGGGGTTTTTGATTCTTGACACTGCTCTGCATTTGAGTCCTATAATAAATCGTTTTGTAACATGTGCACCTGTTTTGCTGTGATTATAACACCTTATCTTGGTGTCGTTTCTTGACAGTGAGAAAGCTTTTATGTGATTATTTATATTTGTGATTTAAGTCCCTTGGGAGTTGGGCTGATGAGCATCAAGGTTTCACACCTTGGCTAGAGTTTCACAATGCTCACTCGTGGCTTCTTGAGTTCTTTCAAATGTGAACATGTTATGTTGCCTTTTTCTGCCGGACACTATTGCATAAGTTCAATCCCTTTTGTGTGTACTACCTGGATTTATTATAAATTTGTCTTGTTATCATTGGGCAGGTTCTACAGAGAATACAAGCAGTTGTGAGGTTAACATGGATAAAGAATACAACATATGTTGAAAGGTTATTTAAGTTTTATGTTTTACGTTTCTTTGAATTTGGTGATATCTTAATTTTTTTTCCTTTTACAATTTTCAGTTTACGTAGTCTGTGTTGTTTACTTTCACAATTTCATTTCATTCAAATTCTGCTGTGCTTTCGTCACTTTCACCTGTTTTCAAGTGCTTAATAAAGCTATCCTAGTTCAAATCAAGCAATTGTGCTTTCGTCCCTTACTTTCACCTGTTTCTTGTGCTTGATCAAGCAATGGTTCTATAGGATTAGTCACATACCCATGGTTATTTAAGAAGAAAACATGGGAGAGAATCTGTGTGGGAATACAGGGCATATATCAAGCTATTCTATCCAGCCATATGGGACTGAAATTCAGTTATTAAACTAGTTAGCACAGGGAAGAGATTAGAAAGTGGTAGGAAATAAAGAAAGTTAGATCGTACAGAAACCCTTTATCTGAATTTGGAACTAGGAAATCCATCAATTGAGTAATGAAGAGGACTAGAGGCATCCCTAATTCTGTAATTTTCTAGTGTTTGATGAAGTTCAACCAGAGCTTTTTCAGGATCATAATACATGGCATGACATTTAGACGGATTAATTTCTGATGTCCATGAAGCTCTCTCCAAAAAATGGGATATAACCATTAACAATAATAACAGGACATTACTAATTTCTCACACCATTTTAATGGTCATCTTCATATTTGTGCCACACGTTGCAGCATTGATCAATTGCTTTTGAACTTCCACTCATATGGTTGTATACTGGTGACAGATTGGTGGATGAGCTTGCACTGTTATATGAAAAGGCTGCATCAAGAAGTTCTGGTGATGCAATTAGAAAGCATGTTGTTGATACACTCATCCTACTCCAGAAGTAAGAATAGGGCGTTCTTATACCATGTTTGCTTGTCCTGATTGAATCTTGAGGTTCTAATTTTGGTTGTTTTCAGGAGCAAAGGCCTACAGTTTAAGGCAGCTTGGAACAAGCACAAGGATCATCAAAACTTGGTTTCGCTTACTACATCTTTAACAGGATTGAACTTGGCAGATAATGTTGATTGTTAAGCCTTGGCATATTATAGATAGTGGGAACTCTATAATTTGAAGACTGACTGCAAATTCTTCTCATGAATCGAGAGTTGGTTAGTGTTGCTAAAGCATTTTTGGGCGGAGAAGTTTTTGCTCTTAACTCTAGTTGGTCGATCTGTCATCGCCACCAATTTGTTTGTATATCAGCTCATGTGCATAGGCCTATCTCTTTTGTAATTATTTGCTTTATCTCATTTTTCTTCACAGATTTTTTTCCCTTTCTTTTTGGCTTTGATTGCCTTTTTGTTCTTTTCTTTTTCTTTATGAAAGTTTAGTTTTTTATATTAAAAAAACTCGGGTACAGAATATATTAGCTACGTTTTAGTGCTTTATTATTATTATTATTATTTAAATTTTATTTTTAAATTTATTAGCTTGTCTCAATTGGGTCCTTCCATAAAAAAGAGAAAGTTCGAAGTAGTTATGAAATTTGAACAGCCACATAGAATATCAAGTTGGTCCACCAAAGTGTATTGAGGTTGAAAAGGTTAATTTTACAGATATATGGGTAAAATATTACTATTGACATATGTATTTGAAGGGAGTTATAATTTTTTAAT
mRNA sequence
TTTGAGTTAAAGTAAGTGGTAGAGAAGTTTTATCAAAATGAGTCATTTTATAAAATTTTCCAAGTTTATAGGCATTTTTTACCATTTAGAAAAAAAAAAAAACCAGCGTCCATCATCCTTTTCTTCCTCTCCATTGCGCCGACCGACCACCCCCTTCTTCAAACCTCTTCACTGGCGACCAAACCCGCGAGCAGCTGCGTTACCACGGACTCAGCGACCAGTCCTTGCCATCCGACGATTCAGATCGACGTGCTCTTGCTTCACGTTCAATGGCGGAGGTAGTCTGCGACATGAATACCTATTCGAAGCATGTTTCTTTTCCATTTCAGATTTCGGCGCGAACATCAAGTTATTCAACACGTTTTCGTCGACTATGAGAATTTTCCAGCAACGTTTTTTACTTGGGTGAGCTTTTGACGAGTTTCGGCGTTAACCCACTCTCCAGTTGGTTTCAAAGAAGTTTACCCATAACGTTTAAGGTAAGGACGTTTTCGGACCCCAACCAGCAAGGTTCGAAACCTTTTCGGTAAGTTTTGATATGGGTTATTGCTTTATTTTTCCTTTTCGAATTGGATGTGAGCGTTGGAATATTGTTCTGAACTCTCGAAATTTGTTTGTTGTTATGCTTGGATAAATTATTTAGTATTTGATTAAGGTTCAGCAAGGTTGAACACGTTTCGGACTAGTTCAGACTCAGTTAAAGCATGTTAGAGCAATCTTTGGTTTGTGAAAGTGTGTTGGTTGTATTTTTGGACTAATTCGGGGTGATTAATTAAGGTTTATGGTGGATTTTGGTTAAGAATATGTGTTTTGGAAAACCTCCCAATGTTTATTGAGGTTTTCGAACTAAGTTCTAACACTTAAACATGATGTTTATTATTTAGGCGTTGTAAGAGTTGATGTTCGTTCAATGAAGTGGTTTTTCGGATTGGGAATTATAGGATTGTTGATCGATTGTATTGAAAAATACGGGCTGTTACAAATCCGTTGAAAGAGAAGCTGATTCCAACAGAGTTGTTACAATGTCGAAGTCCCTTCAACTTCTTCATGAGCTTTGTATTCAGTTTTCGGAACCAATCATCAAATCCCTTTCGAAAATTTACGATAAACCTTCAGAGGGGTCGAATGTCTCTGTTACAGCCATTCTCGAATCCCTTCTACCTCGAAAAACCTCACTTCCAAATAATCCTTCTGAGGACGATATTTACTCCTCCATCAAAGACTTCACACTAGCATGTGGTTTAATATTGTCTTCTTGTTCCTCGACCTTTGACCTTCTTTCATGGATTCCGAAAGACCTCTCACTTACTGCCGAGTCGGCGTTTCGAATGCTCTCGAAGGCGTATGCCTCTGCTTCTTGCGATGGGTTTTCGAAGTACATTGAGGAGCTGGGTTTAGATTTGAGTTTGATACCAAAGGAGAAAAGATTAGTGGTGGAAATCATTCCTAAGGTGCTTCCGCTGTTGAAGGAGAGTATTAAGGAGAGCTCGATAGATAAATCTGACGAGGTTGATGAGGTCTCCGCTGCATCAGCGAGGGTGCCTGTGGGGTTTGCTATTGTTGCGGCCCATCAACTTGGATGGTTCATTACTCAGTTCAGTGAATGGGTGTGTTGCCAGTATTTCAAGTTGATTTGGGGTTTGTTTATGGAAAAATTGTTGCAGATTGATTATCCACATTTGGGGAAGTTGTGTAATTTGGTGATTCCTTGTGCATTGACTGCTCTTGATCATTGGTCACCAGAAGTCAAAGGGCAGGGTATGGTTAGCTTTATTCATCTTGCAAAAAATGTCAATGCTGCAGAGCTTGGTTGGTATGAAGATGCGATTCTTGATGCATGCTGCTCAAATGTCCCTTCAAGTGATGAGATATGGCCTTATGTTGTTGAAATGTCAGTTCTTCTTGCAACTAGCATTCACAAAATGAATCCTCGTAGCTCATGGATTGAAAGGATGGTTAATGAGATGCTGGGTCACTTGGAGCGACAGCCAAGGAACAAGGAACGGCGTATTGCATGGCTTCAACACATTGAACCTCTTTTCAACTGTATGGGTTTGGTGCTGTTGGCTCATACAAGGCGCATATTTCCACTTTTCTTTCAGTGGATGAATGCTGAAGATGATGAAACTACTTTACTGGTTCTACAGAGAATACAAGCAGTTGTGAGGTTAACATGGATAAAGAATACAACATATGTTGAAAGATTGGTGGATGAGCTTGCACTGTTATATGAAAAGGCTGCATCAAGAAGTTCTGGTGATGCAATTAGAAAGCATGTTGTTGATACACTCATCCTACTCCAGAAGAGCAAAGGCCTACAGTTTAAGGCAGCTTGGAACAAGCACAAGGATCATCAAAACTTGGTTTCGCTTACTACATCTTTAACAGGATTGAACTTGGCAGATAATGTTGATTGTTAAGCCTTGGCATATTATAGATAGTGGGAACTCTATAATTTGAAGACTGACTGCAAATTCTTCTCATGAATCGAGAGTTGGTTAGTGTTGCTAAAGCATTTTTGGGCGGAGAAGTTTTTGCTCTTAACTCTAGTTGGTCGATCTGTCATCGCCACCAATTTGTTTGTATATCAGCTCATGTGCATAGGCCTATCTCTTTTGTAATTATTTGCTTTATCTCATTTTTCTTCACAGATTTTTTTCCCTTTCTTTTTGGCTTTGATTGCCTTTTTGTTCTTTTCTTTTTCTTTATGAAAGTTTAGTTTTTTATATTAAAAAAACTCGGGTACAGAATATATTAGCTACGTTTTAGTGCTTTATTATTATTATTATTATTTAAATTTTATTTTTAAATTTATTAGCTTGTCTCAATTGGGTCCTTCCATAAAAAAGAGAAAGTTCGAAGTAGTTATGAAATTTGAACAGCCACATAGAATATCAAGTTGGTCCACCAAAGTGTATTGAGGTTGAAAAGGTTAATTTTACAGATATATGGGTAAAATATTACTATTGACATATGTATTTGAAGGGAGTTATAATTTTTTAAT
Coding sequence (CDS)
ATGTCGAAGTCCCTTCAACTTCTTCATGAGCTTTGTATTCAGTTTTCGGAACCAATCATCAAATCCCTTTCGAAAATTTACGATAAACCTTCAGAGGGGTCGAATGTCTCTGTTACAGCCATTCTCGAATCCCTTCTACCTCGAAAAACCTCACTTCCAAATAATCCTTCTGAGGACGATATTTACTCCTCCATCAAAGACTTCACACTAGCATGTGGTTTAATATTGTCTTCTTGTTCCTCGACCTTTGACCTTCTTTCATGGATTCCGAAAGACCTCTCACTTACTGCCGAGTCGGCGTTTCGAATGCTCTCGAAGGCGTATGCCTCTGCTTCTTGCGATGGGTTTTCGAAGTACATTGAGGAGCTGGGTTTAGATTTGAGTTTGATACCAAAGGAGAAAAGATTAGTGGTGGAAATCATTCCTAAGGTGCTTCCGCTGTTGAAGGAGAGTATTAAGGAGAGCTCGATAGATAAATCTGACGAGGTTGATGAGGTCTCCGCTGCATCAGCGAGGGTGCCTGTGGGGTTTGCTATTGTTGCGGCCCATCAACTTGGATGGTTCATTACTCAGTTCAGTGAATGGGTGTGTTGCCAGTATTTCAAGTTGATTTGGGGTTTGTTTATGGAAAAATTGTTGCAGATTGATTATCCACATTTGGGGAAGTTGTGTAATTTGGTGATTCCTTGTGCATTGACTGCTCTTGATCATTGGTCACCAGAAGTCAAAGGGCAGGGTATGGTTAGCTTTATTCATCTTGCAAAAAATGTCAATGCTGCAGAGCTTGGTTGGTATGAAGATGCGATTCTTGATGCATGCTGCTCAAATGTCCCTTCAAGTGATGAGATATGGCCTTATGTTGTTGAAATGTCAGTTCTTCTTGCAACTAGCATTCACAAAATGAATCCTCGTAGCTCATGGATTGAAAGGATGGTTAATGAGATGCTGGGTCACTTGGAGCGACAGCCAAGGAACAAGGAACGGCGTATTGCATGGCTTCAACACATTGAACCTCTTTTCAACTGTATGGGTTTGGTGCTGTTGGCTCATACAAGGCGCATATTTCCACTTTTCTTTCAGTGGATGAATGCTGAAGATGATGAAACTACTTTACTGGTTCTACAGAGAATACAAGCAGTTGTGAGGTTAACATGGATAAAGAATACAACATATGTTGAAAGATTGGTGGATGAGCTTGCACTGTTATATGAAAAGGCTGCATCAAGAAGTTCTGGTGATGCAATTAGAAAGCATGTTGTTGATACACTCATCCTACTCCAGAAGAGCAAAGGCCTACAGTTTAAGGCAGCTTGGAACAAGCACAAGGATCATCAAAACTTGGTTTCGCTTACTACATCTTTAACAGGATTGAACTTGGCAGATAATGTTGATTGTTAA
Protein sequence
MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDDIYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYIEELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIVAAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSPEVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHKMNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQWMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVVDTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNLADNVDC
Homology
BLAST of Lsi11G008870 vs. ExPASy Swiss-Prot
Match:
Q8GXP4 (Uncharacterized protein At2g39910 OS=Arabidopsis thaliana OX=3702 GN=At2g39910 PE=2 SV=2)
HSP 1 Score: 422.2 bits (1084), Expect = 7.8e-117
Identity = 223/444 (50.23%), Postives = 300/444 (67.57%), Query Frame = 0
Query: 8 LHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDDIYSSIKD 67
LH ++ SEPI + L + P E S VS IL SLLP TS +E+ SIK
Sbjct: 20 LHGRLLRLSEPIAEILRRTQYTPQESSKVSTKDILLSLLP-NTSSSRLANEE----SIKS 79
Query: 68 FTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYIEELGLDL 127
LAC L+ SS SST +LLSWIP++LS+ ES F +S+ D FS + +
Sbjct: 80 LALACALLASSRSSTHELLSWIPENLSVMGESTFWEISR-------DCFSDFSSNSNAEK 139
Query: 128 SLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIVAAHQLGW 187
+ E +E++P VLP LK+ I++SS+ K + ++VSAA AR PVG+AI+AAHQL W
Sbjct: 140 LVELVEDSEKIEMLPIVLPELKDGIEKSSLGKGSDAEDVSAAMARTPVGYAILAAHQLRW 199
Query: 188 FITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSPEVKGQGM 247
F+T Q+ P+L K CNLV+PCALTALDHWSPEVKGQGM
Sbjct: 200 FVT-----------------------QVKKPNLVKFCNLVVPCALTALDHWSPEVKGQGM 259
Query: 248 VSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHKMNPRSSW 307
++F+HLAKNV++ +LG Y D +LDACC N+ S DEIW +VVE+SVLL T IH NPRS W
Sbjct: 260 ITFVHLAKNVSSGDLGLYGDVVLDACCQNIASDDEIWIHVVELSVLLVTKIHPNNPRSPW 319
Query: 308 IERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQWMNAEDD 367
E+++NEMLGHLERQPRNKERRI WL+ +EPL N +GL LLAH RRIFPLFFQWM+++D
Sbjct: 320 YEKIMNEMLGHLERQPRNKERRITWLRFVEPLLNSLGLFLLAHFRRIFPLFFQWMHSDDA 379
Query: 368 ETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVVDTLILLQ 427
ET LLVL+R++ VVRLTWI+++ RLVDEL LY++++ R D IR ++ L+LL+
Sbjct: 380 ETVLLVLERLETVVRLTWIRHSPVFPRLVDELVSLYKESSMRKDRDDIRPLILRILMLLR 428
Query: 428 KSKGLQFKAAWNKHKDHQNLVSLT 452
+ KGL+F++AW+++++ NL +++
Sbjct: 440 QCKGLRFESAWSQYQEDPNLSTVS 428
BLAST of Lsi11G008870 vs. ExPASy TrEMBL
Match:
A0A5A7T3G7 (Putative ARM repeat superfamily protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold85G00930 PE=4 SV=1)
HSP 1 Score: 778.5 bits (2009), Expect = 1.6e-221
Identity = 400/465 (86.02%), Postives = 418/465 (89.89%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQLLH+LCIQFSEPIIKSLS I DKPSEGSNVSV ILESLLPRKTSL +PSEDD
Sbjct: 1 MSNSLQLLHDLCIQFSEPIIKSLSNICDKPSEGSNVSVKPILESLLPRKTSLRISPSEDD 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDFTLAC L+LSS SSTFDLLSWI +DL+LTAESAFRMLSKAYASASC GFSK I
Sbjct: 61 IYSSIKDFTLACALVLSSRSSTFDLLSWITEDLALTAESAFRMLSKAYASASCHGFSKNI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EELGLD SLIP+EKRLVVEIIPKVLPLLK+SIKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EELGLDFSLIPEEKRLVVEIIPKVLPLLKDSIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQLGWFIT QIDYPHLGKLCNLVIPC LTALDHWSP
Sbjct: 181 AAHQLGWFIT-----------------------QIDYPHLGKLCNLVIPCGLTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGM+SFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIWPYVVEMSVLLATSIH
Sbjct: 241 EVKGQGMLSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWPYVVEMSVLLATSIHN 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLF+CMGLVLLAHTRRIFPLFFQ
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFHCMGLVLLAHTRRIFPLFFQ 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELA+LYEKAA+RSSGDAIRKH+V
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELAMLYEKAATRSSGDAIRKHIV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNLADNVDC 466
D L+LLQ+SKG QFKAAWNK KDHQNLVSL+TSLT L++ D VDC
Sbjct: 421 DALMLLQESKGQQFKAAWNKLKDHQNLVSLSTSLTRLDITDCVDC 442
BLAST of Lsi11G008870 vs. ExPASy TrEMBL
Match:
A0A1S3CK85 (uncharacterized protein At2g39910 OS=Cucumis melo OX=3656 GN=LOC103501414 PE=4 SV=1)
HSP 1 Score: 778.5 bits (2009), Expect = 1.6e-221
Identity = 400/465 (86.02%), Postives = 418/465 (89.89%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQLLH+LCIQFSEPIIKSLS I DKPSEGSNVSV ILESLLPRKTSL +PSEDD
Sbjct: 1 MSNSLQLLHDLCIQFSEPIIKSLSNICDKPSEGSNVSVKPILESLLPRKTSLRISPSEDD 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDFTLAC L+LSS SSTFDLLSWI +DL+LTAESAFRMLSKAYASASC GFSK I
Sbjct: 61 IYSSIKDFTLACALVLSSRSSTFDLLSWITEDLALTAESAFRMLSKAYASASCHGFSKNI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EELGLD SLIP+EKRLVVEIIPKVLPLLK+SIKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EELGLDFSLIPEEKRLVVEIIPKVLPLLKDSIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQLGWFIT QIDYPHLGKLCNLVIPC LTALDHWSP
Sbjct: 181 AAHQLGWFIT-----------------------QIDYPHLGKLCNLVIPCGLTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGM+SFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIWPYVVEMSVLLATSIH
Sbjct: 241 EVKGQGMLSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWPYVVEMSVLLATSIHN 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLF+CMGLVLLAHTRRIFPLFFQ
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFHCMGLVLLAHTRRIFPLFFQ 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELA+LYEKAA+RSSGDAIRKH+V
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELAMLYEKAATRSSGDAIRKHIV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNLADNVDC 466
D L+LLQ+SKG QFKAAWNK KDHQNLVSL+TSLT L++ D VDC
Sbjct: 421 DALMLLQESKGQQFKAAWNKLKDHQNLVSLSTSLTRLDITDCVDC 442
BLAST of Lsi11G008870 vs. ExPASy TrEMBL
Match:
A0A0A0KM16 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G198180 PE=4 SV=1)
HSP 1 Score: 768.8 bits (1984), Expect = 1.3e-218
Identity = 398/465 (85.59%), Postives = 414/465 (89.03%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQLLHELCI+FSEPIIKSLS I DKPSEGSNVSV ILESLLPRKTSL +PSEDD
Sbjct: 1 MSNSLQLLHELCIEFSEPIIKSLSNICDKPSEGSNVSVKPILESLLPRKTSLRISPSEDD 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDFTLAC LILSS SSTFDLLSWI +DL+LTAESAFRMLSKAYASASCDGFSK I
Sbjct: 61 IYSSIKDFTLACALILSSRSSTFDLLSWITEDLALTAESAFRMLSKAYASASCDGFSKNI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EELGLD SLIP+EKRLVVEIIPKVLPLLK+SIKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EELGLDFSLIPEEKRLVVEIIPKVLPLLKDSIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQL WFIT QIDYPHLGKLCNLVIPC LTALDHWSP
Sbjct: 181 AAHQLRWFIT-----------------------QIDYPHLGKLCNLVIPCGLTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGM+SFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIWP VVEMSVLLATSIH
Sbjct: 241 EVKGQGMLSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWPCVVEMSVLLATSIHN 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKER IAWLQHIEPLFNCMGLVLLAHTRRIFPLFF+
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERCIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFK 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELA+LYEKAA+R SGDAIRKHVV
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELAMLYEKAATRRSGDAIRKHVV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNLADNVDC 466
D L+LLQ+SKG QFKAAW+KHKD QNLV L+TSLT LN+ D VDC
Sbjct: 421 DALMLLQESKGQQFKAAWSKHKDLQNLVPLSTSLTRLNITDCVDC 442
BLAST of Lsi11G008870 vs. ExPASy TrEMBL
Match:
A0A6J1H048 (uncharacterized protein At2g39910 OS=Cucurbita moschata OX=3662 GN=LOC111458735 PE=4 SV=1)
HSP 1 Score: 765.0 bits (1974), Expect = 1.8e-217
Identity = 395/459 (86.06%), Postives = 410/459 (89.32%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQ+L +LCIQFSEPII+SLSKI D+PSEGSNVSV AILESLLPRKTSLP +++D
Sbjct: 1 MSDSLQVLRDLCIQFSEPIIQSLSKICDEPSEGSNVSVKAILESLLPRKTSLPITLTDED 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDF LAC LILSS SSTFDL SWIP+DLSL AESAFRMLSKAY SA CDGFSK I
Sbjct: 61 IYSSIKDFALACALILSSRSSTFDLFSWIPEDLSLAAESAFRMLSKAYVSAFCDGFSKDI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EELGLD SLIP+EKRLVVEIIPKVLPLLKE+IKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EELGLDFSLIPEEKRLVVEIIPKVLPLLKENIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQL WFIT QIDYPHLGKLCNLVIPCALTALDHWSP
Sbjct: 181 AAHQLAWFIT-----------------------QIDYPHLGKLCNLVIPCALTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
E+KGQGMVSFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIWPYVVEMSVLL TSIHK
Sbjct: 241 ELKGQGMVSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWPYVVEMSVLLVTSIHK 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELALLYEKAASRSS DAIRKHVV
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELALLYEKAASRSSRDAIRKHVV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNL 460
D LILLQ+SKG QFKAAWNKHKD QNLV LTTSLTG+N+
Sbjct: 421 DALILLQESKGQQFKAAWNKHKDDQNLVWLTTSLTGMNI 436
BLAST of Lsi11G008870 vs. ExPASy TrEMBL
Match:
A0A6J1JJB6 (uncharacterized protein At2g39910 OS=Cucurbita maxima OX=3661 GN=LOC111484954 PE=4 SV=1)
HSP 1 Score: 758.8 bits (1958), Expect = 1.3e-215
Identity = 390/459 (84.97%), Postives = 407/459 (88.67%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQ++ +LCIQFS+PII+SLSKI D+PSEGSN SV AILESLLPRKTSLP P+E+D
Sbjct: 1 MSDSLQVIRDLCIQFSKPIIQSLSKICDEPSEGSNFSVKAILESLLPRKTSLPITPTEED 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDF LAC LI+SS SSTF LLSWIP+DLSL AESAFRMLSKAY SA CDGFSK I
Sbjct: 61 IYSSIKDFALACALIMSSRSSTFGLLSWIPEDLSLAAESAFRMLSKAYVSAFCDGFSKDI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EE+GLD SLIP+EKRLVVEIIPKVLPLLKE+IKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EEVGLDFSLIPEEKRLVVEIIPKVLPLLKENIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
A HQL WFIT QIDYPHLGKLCNLVIPCALTALDHWSP
Sbjct: 181 AGHQLAWFIT-----------------------QIDYPHLGKLCNLVIPCALTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGM SFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIWPYVVEMSVLL TSIHK
Sbjct: 241 EVKGQGMFSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWPYVVEMSVLLVTSIHK 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELALLYEKA SRSS DAIRKHVV
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELALLYEKATSRSSRDAIRKHVV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNL 460
D LILLQ+SKG QFKAAWNKHKD QNLV LTTSLTG+N+
Sbjct: 421 DALILLQESKGQQFKAAWNKHKDDQNLVWLTTSLTGMNI 436
BLAST of Lsi11G008870 vs. NCBI nr
Match:
XP_038891492.1 (uncharacterized protein At2g39910 [Benincasa hispida])
HSP 1 Score: 793.9 bits (2049), Expect = 7.5e-226
Identity = 410/465 (88.17%), Postives = 420/465 (90.32%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
M SLQLLHELCIQFSEPIIKSLSKI DKPSEGSNVSV AILESLLPRKTS+ NPSEDD
Sbjct: 1 MPNSLQLLHELCIQFSEPIIKSLSKICDKPSEGSNVSVKAILESLLPRKTSVSVNPSEDD 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDF LAC LILSS SSTFDLLSWIP+DLSL AESAFRMLSKAYASASCDGFSK I
Sbjct: 61 IYSSIKDFALACALILSSRSSTFDLLSWIPEDLSLAAESAFRMLSKAYASASCDGFSKNI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EELGLD SLIP+EKRLVVEIIPKVLPLLKE IKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EELGLDFSLIPEEKRLVVEIIPKVLPLLKERIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQLGWFIT QIDYPHLGKLCNLVIPCALTALDHWSP
Sbjct: 181 AAHQLGWFIT-----------------------QIDYPHLGKLCNLVIPCALTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGMVSFIHLAKNVNAAELG Y+D ILDACCSNVPSSDEIWPYVVEMSVLLATSIHK
Sbjct: 241 EVKGQGMVSFIHLAKNVNAAELGRYDDVILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELALLYEKA SR+ GDAIRKHVV
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELALLYEKAESRTFGDAIRKHVV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNLADNVDC 466
D LILLQ+SKG+QF+AAWNKHKDHQNLVSLTTSLTGLN+ D VDC
Sbjct: 421 DALILLQQSKGVQFQAAWNKHKDHQNLVSLTTSLTGLNITDCVDC 442
BLAST of Lsi11G008870 vs. NCBI nr
Match:
XP_008463207.1 (PREDICTED: uncharacterized protein At2g39910 [Cucumis melo] >KAA0037338.1 putative ARM repeat superfamily protein [Cucumis melo var. makuwa] >TYJ97569.1 putative ARM repeat superfamily protein [Cucumis melo var. makuwa])
HSP 1 Score: 778.5 bits (2009), Expect = 3.3e-221
Identity = 400/465 (86.02%), Postives = 418/465 (89.89%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQLLH+LCIQFSEPIIKSLS I DKPSEGSNVSV ILESLLPRKTSL +PSEDD
Sbjct: 1 MSNSLQLLHDLCIQFSEPIIKSLSNICDKPSEGSNVSVKPILESLLPRKTSLRISPSEDD 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDFTLAC L+LSS SSTFDLLSWI +DL+LTAESAFRMLSKAYASASC GFSK I
Sbjct: 61 IYSSIKDFTLACALVLSSRSSTFDLLSWITEDLALTAESAFRMLSKAYASASCHGFSKNI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EELGLD SLIP+EKRLVVEIIPKVLPLLK+SIKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EELGLDFSLIPEEKRLVVEIIPKVLPLLKDSIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQLGWFIT QIDYPHLGKLCNLVIPC LTALDHWSP
Sbjct: 181 AAHQLGWFIT-----------------------QIDYPHLGKLCNLVIPCGLTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGM+SFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIWPYVVEMSVLLATSIH
Sbjct: 241 EVKGQGMLSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWPYVVEMSVLLATSIHN 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLF+CMGLVLLAHTRRIFPLFFQ
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFHCMGLVLLAHTRRIFPLFFQ 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELA+LYEKAA+RSSGDAIRKH+V
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELAMLYEKAATRSSGDAIRKHIV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNLADNVDC 466
D L+LLQ+SKG QFKAAWNK KDHQNLVSL+TSLT L++ D VDC
Sbjct: 421 DALMLLQESKGQQFKAAWNKLKDHQNLVSLSTSLTRLDITDCVDC 442
BLAST of Lsi11G008870 vs. NCBI nr
Match:
XP_023549283.1 (uncharacterized protein At2g39910 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 768.8 bits (1984), Expect = 2.6e-218
Identity = 396/459 (86.27%), Postives = 411/459 (89.54%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQ+L +LCIQFSEPII+SLSK D+PSEGSNVSV AILESLLPRKTSLP P+E+D
Sbjct: 1 MSDSLQVLRDLCIQFSEPIIQSLSKFCDEPSEGSNVSVKAILESLLPRKTSLPITPTEED 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDF LAC LILSS SSTFDLLSWIP+DLSL AESAFRMLSKAY SA CDGFSK I
Sbjct: 61 IYSSIKDFALACALILSSRSSTFDLLSWIPEDLSLAAESAFRMLSKAYVSAFCDGFSKDI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EE+GLD SLIP+EKRLVVEIIPKVLPLLKE+IKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EEVGLDFSLIPEEKRLVVEIIPKVLPLLKENIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQL WFIT QIDYPHLGKLCNLVIPCALTALDHWSP
Sbjct: 181 AAHQLAWFIT-----------------------QIDYPHLGKLCNLVIPCALTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGMVSFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIWPYVVEMSVLL TSIHK
Sbjct: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWPYVVEMSVLLVTSIHK 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELALLYEKAASRSS DAIRKHVV
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELALLYEKAASRSSRDAIRKHVV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNL 460
D LILLQ+SKG QFKAAWNKHKD QNLV LTTS+TG+N+
Sbjct: 421 DALILLQESKGQQFKAAWNKHKDDQNLVWLTTSVTGMNI 436
BLAST of Lsi11G008870 vs. NCBI nr
Match:
XP_011654982.1 (uncharacterized protein At2g39910 [Cucumis sativus] >KGN50628.1 hypothetical protein Csa_021487 [Cucumis sativus])
HSP 1 Score: 768.8 bits (1984), Expect = 2.6e-218
Identity = 398/465 (85.59%), Postives = 414/465 (89.03%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQLLHELCI+FSEPIIKSLS I DKPSEGSNVSV ILESLLPRKTSL +PSEDD
Sbjct: 1 MSNSLQLLHELCIEFSEPIIKSLSNICDKPSEGSNVSVKPILESLLPRKTSLRISPSEDD 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDFTLAC LILSS SSTFDLLSWI +DL+LTAESAFRMLSKAYASASCDGFSK I
Sbjct: 61 IYSSIKDFTLACALILSSRSSTFDLLSWITEDLALTAESAFRMLSKAYASASCDGFSKNI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EELGLD SLIP+EKRLVVEIIPKVLPLLK+SIKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EELGLDFSLIPEEKRLVVEIIPKVLPLLKDSIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQL WFIT QIDYPHLGKLCNLVIPC LTALDHWSP
Sbjct: 181 AAHQLRWFIT-----------------------QIDYPHLGKLCNLVIPCGLTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGM+SFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIWP VVEMSVLLATSIH
Sbjct: 241 EVKGQGMLSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWPCVVEMSVLLATSIHN 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKER IAWLQHIEPLFNCMGLVLLAHTRRIFPLFF+
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERCIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFK 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELA+LYEKAA+R SGDAIRKHVV
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELAMLYEKAATRRSGDAIRKHVV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNLADNVDC 466
D L+LLQ+SKG QFKAAW+KHKD QNLV L+TSLT LN+ D VDC
Sbjct: 421 DALMLLQESKGQQFKAAWSKHKDLQNLVPLSTSLTRLNITDCVDC 442
BLAST of Lsi11G008870 vs. NCBI nr
Match:
KAG7031582.1 (hypothetical protein SDJN02_05623 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 766.1 bits (1977), Expect = 1.7e-217
Identity = 396/459 (86.27%), Postives = 410/459 (89.32%), Query Frame = 0
Query: 1 MSKSLQLLHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDD 60
MS SLQ+L +LCIQFSEPII+SLSKI D+PSEGSNVSV AILESLLPRKTSLP P+E+D
Sbjct: 1 MSDSLQVLRDLCIQFSEPIIQSLSKICDEPSEGSNVSVKAILESLLPRKTSLPITPTEED 60
Query: 61 IYSSIKDFTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYI 120
IYSSIKDF LAC LILSS SSTFDL SWIP+DLSL AESAFRMLSKAY SA CDGFSK I
Sbjct: 61 IYSSIKDFALACALILSSRSSTFDLFSWIPEDLSLAAESAFRMLSKAYVSAFCDGFSKDI 120
Query: 121 EELGLDLSLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIV 180
EELGLD SLIP+EKRLVVEIIPKVLPLLKE+IKESSIDKSDEVDEVSAASARVPVGFAIV
Sbjct: 121 EELGLDFSLIPEEKRLVVEIIPKVLPLLKENIKESSIDKSDEVDEVSAASARVPVGFAIV 180
Query: 181 AAHQLGWFITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSP 240
AAHQL WFIT QIDYPHLGKLCNLVIPCALTALDHWSP
Sbjct: 181 AAHQLAWFIT-----------------------QIDYPHLGKLCNLVIPCALTALDHWSP 240
Query: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHK 300
EVKGQGMVSFIHLAKNVNAAELGWYED ILDACCSNVPSSDEIW YVVEMSVLL TSIHK
Sbjct: 241 EVKGQGMVSFIHLAKNVNAAELGWYEDVILDACCSNVPSSDEIWSYVVEMSVLLVTSIHK 300
Query: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ
Sbjct: 301 MNPRSSWIERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQ 360
Query: 361 WMNAEDDETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVV 420
WMNAEDDETTLLVLQRIQ VVRLTWI+NT YVERLVDELALLYEKAASRSS DAIRKHVV
Sbjct: 361 WMNAEDDETTLLVLQRIQTVVRLTWIRNTPYVERLVDELALLYEKAASRSSRDAIRKHVV 420
Query: 421 DTLILLQKSKGLQFKAAWNKHKDHQNLVSLTTSLTGLNL 460
D LILLQ+SKG QFKAAWNKHK+ QNLV LTTSLTG+N+
Sbjct: 421 DALILLQESKGQQFKAAWNKHKNDQNLVWLTTSLTGMNI 436
BLAST of Lsi11G008870 vs. TAIR 10
Match:
AT2G39910.1 (ARM repeat superfamily protein )
HSP 1 Score: 422.2 bits (1084), Expect = 5.5e-118
Identity = 223/444 (50.23%), Postives = 300/444 (67.57%), Query Frame = 0
Query: 8 LHELCIQFSEPIIKSLSKIYDKPSEGSNVSVTAILESLLPRKTSLPNNPSEDDIYSSIKD 67
LH ++ SEPI + L + P E S VS IL SLLP TS +E+ SIK
Sbjct: 20 LHGRLLRLSEPIAEILRRTQYTPQESSKVSTKDILLSLLP-NTSSSRLANEE----SIKS 79
Query: 68 FTLACGLILSSCSSTFDLLSWIPKDLSLTAESAFRMLSKAYASASCDGFSKYIEELGLDL 127
LAC L+ SS SST +LLSWIP++LS+ ES F +S+ D FS + +
Sbjct: 80 LALACALLASSRSSTHELLSWIPENLSVMGESTFWEISR-------DCFSDFSSNSNAEK 139
Query: 128 SLIPKEKRLVVEIIPKVLPLLKESIKESSIDKSDEVDEVSAASARVPVGFAIVAAHQLGW 187
+ E +E++P VLP LK+ I++SS+ K + ++VSAA AR PVG+AI+AAHQL W
Sbjct: 140 LVELVEDSEKIEMLPIVLPELKDGIEKSSLGKGSDAEDVSAAMARTPVGYAILAAHQLRW 199
Query: 188 FITQFSEWVCCQYFKLIWGLFMEKLLQIDYPHLGKLCNLVIPCALTALDHWSPEVKGQGM 247
F+T Q+ P+L K CNLV+PCALTALDHWSPEVKGQGM
Sbjct: 200 FVT-----------------------QVKKPNLVKFCNLVVPCALTALDHWSPEVKGQGM 259
Query: 248 VSFIHLAKNVNAAELGWYEDAILDACCSNVPSSDEIWPYVVEMSVLLATSIHKMNPRSSW 307
++F+HLAKNV++ +LG Y D +LDACC N+ S DEIW +VVE+SVLL T IH NPRS W
Sbjct: 260 ITFVHLAKNVSSGDLGLYGDVVLDACCQNIASDDEIWIHVVELSVLLVTKIHPNNPRSPW 319
Query: 308 IERMVNEMLGHLERQPRNKERRIAWLQHIEPLFNCMGLVLLAHTRRIFPLFFQWMNAEDD 367
E+++NEMLGHLERQPRNKERRI WL+ +EPL N +GL LLAH RRIFPLFFQWM+++D
Sbjct: 320 YEKIMNEMLGHLERQPRNKERRITWLRFVEPLLNSLGLFLLAHFRRIFPLFFQWMHSDDA 379
Query: 368 ETTLLVLQRIQAVVRLTWIKNTTYVERLVDELALLYEKAASRSSGDAIRKHVVDTLILLQ 427
ET LLVL+R++ VVRLTWI+++ RLVDEL LY++++ R D IR ++ L+LL+
Sbjct: 380 ETVLLVLERLETVVRLTWIRHSPVFPRLVDELVSLYKESSMRKDRDDIRPLILRILMLLR 428
Query: 428 KSKGLQFKAAWNKHKDHQNLVSLT 452
+ KGL+F++AW+++++ NL +++
Sbjct: 440 QCKGLRFESAWSQYQEDPNLSTVS 428
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q8GXP4 | 7.8e-117 | 50.23 | Uncharacterized protein At2g39910 OS=Arabidopsis thaliana OX=3702 GN=At2g39910 P... | [more] |
Match Name | E-value | Identity | Description | |
A0A5A7T3G7 | 1.6e-221 | 86.02 | Putative ARM repeat superfamily protein OS=Cucumis melo var. makuwa OX=1194695 G... | [more] |
A0A1S3CK85 | 1.6e-221 | 86.02 | uncharacterized protein At2g39910 OS=Cucumis melo OX=3656 GN=LOC103501414 PE=4 S... | [more] |
A0A0A0KM16 | 1.3e-218 | 85.59 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G198180 PE=4 SV=1 | [more] |
A0A6J1H048 | 1.8e-217 | 86.06 | uncharacterized protein At2g39910 OS=Cucurbita moschata OX=3662 GN=LOC111458735 ... | [more] |
A0A6J1JJB6 | 1.3e-215 | 84.97 | uncharacterized protein At2g39910 OS=Cucurbita maxima OX=3661 GN=LOC111484954 PE... | [more] |
Match Name | E-value | Identity | Description | |
XP_038891492.1 | 7.5e-226 | 88.17 | uncharacterized protein At2g39910 [Benincasa hispida] | [more] |
XP_008463207.1 | 3.3e-221 | 86.02 | PREDICTED: uncharacterized protein At2g39910 [Cucumis melo] >KAA0037338.1 putati... | [more] |
XP_023549283.1 | 2.6e-218 | 86.27 | uncharacterized protein At2g39910 [Cucurbita pepo subsp. pepo] | [more] |
XP_011654982.1 | 2.6e-218 | 85.59 | uncharacterized protein At2g39910 [Cucumis sativus] >KGN50628.1 hypothetical pro... | [more] |
KAG7031582.1 | 1.7e-217 | 86.27 | hypothetical protein SDJN02_05623 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
AT2G39910.1 | 5.5e-118 | 50.23 | ARM repeat superfamily protein | [more] |