Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CTCACATCTTTACAACAAACTTCCAATTCTCTTCATCAGAAAAGACAGAGACCCAAACTGCCGGCCATAATCTATGCGAAAAAAATCTCCCCAAAAAAATGATAATTCCAATGTTCGAATTAACAAATTAAATCCCTTCTTCAAGTACAAATTAATAACCTTCTCTACCGAAAATCAACTCGCAATCTCTAAATTCAAACTCCAATTAGCAAGATACAGTTCTAAATTAAGGTTTCAGAGCAATGGAATTGAGATCGAGCAGAGAGAATTCCAGAGAAATTGCAAAATTAGGGCACAAAATCGTTCAAAAGAGACTAGCCAATTCTCGGCCCAAGGCTCAGCAGCAGGCTCCCGATCTGACCGATTTTATGAACGATATGTTCTTTGGAACCGTAAACAAGGACAAGAAAACCTATAATTTGACTGGTGATGAAGATGATGAAGATGAAGAAGAATTGTTTGATCGGAGCAACAGAAGTAGAAACAGCCAATTGACGGAGGAGTGGCTGGATGAGGCGCGCCGGTTGGTGACTTCTTCTCCTTCGCGGTCTAATTCTCCGGCTCGGCTGGGTGGATCGCCGAGGTTTGCGGCCGTCAACGGCATATCGCCGGCGTCGATTAATGATCGGAGAGATCCACTCTCACGCTCCGCTAGAAGGTACTTCACAGTTCACACCAAAGGAGGGACATTTAAATCCTAATTTTCAAATTTTCTATTATTCTCTCATTTTATAATATTAGAGTGCGTTTCGGGGAAAAAAAATATATTGAATTTAGGATTCAATGAAATGAAAATAAGTTAATATATGAAGCTTTGACCTAAAATAAAAAGAATTACATGTCAATAACTTATTGAATTGTATTTATTTTGACAAACATATTATGTAGGGTACTTTATTAACCAATTTTATATTATAATTTTTTTATATTTACATTTTATAATATTAGATAATTTATAATTATTATATTATATATGTTTTCATTTTTACAAAAGTAAAATGAGTTTATAAACTGATTTTTTTTCGAGTACAAATATAGGTTGGGTCTTGGGAGATTTGAACCACATACCTCATACTCATAGTTATTAACACATTTGATATGTTAGTTGAACTATTTTTTTTTTTTTTTTTGCTATAACATGAGATAATATATTCATAATAATGTAATTTTTAATTTCTAACCTTACAATTTAAAAACATTACAAACATAAAAAATTTGTAAATTTAAAAATAATTTTAAGAACAAACTCATAAAAAAAACTAAATTAAAAAAACAAGAAACAAAATATGAATTGTATAACAAATAAATCCTTGTTATTCTCATTTCTAGGATGATGGAAATCCAAGCGTCTTTATGTTTTTTAAAATGATATAAGGTTAGATAAACTATTAGTTGAAAAAATAAAACGTGAACTTAGAGATAAAATTTTAAAATATACTTTAATTGAAAAGTAGAAATAGAGTTTAGGATTGATTCGGAGAGTGCATAACATTCAAAAGGTAGATTTGTGCATGAATAGTTGATATAAAAAACTATAATTTAGATAATGTGTTCTATTTTAGTATAGGGACGAGAAATATTCGTAAGACTTATTTTTATCCTAAATAAAAAGCAAAATTGTGGTATTTTGTTGTGTTTATGTAATTTGTATTAGTATGTATTTGGGAGTTAGAATTCACAAATGTTATTTCAAAAATAATAATAATAATAATAATGAGTGGCAAGTTAGTCATGATCGTTTTTTGATAAATTAATATATCATTTTTTTTATTTCAACGTCATTTAGGGGTAGAAGATTCAAATATTTGACCTCTTAATTGAAGGTACATGTTGATTACTGATGAACTAAACTCACTTTGACACTAATATATCATTTTATACCTTGATCAAATATATAAGTAATATTTTTAATATAAATCAAATAATAGATTTGTTGTTTTTGTACTATATCAATAATATATGTAATATAAAATACATCACTTTCATACAAGTTATTACAT
TTGCATGACATTTTTACAAAATATTTTATTACACATCTTTTATTACTTAGGCCATAACAATTCACAAGAAGTAGGAAAAGTGTTGGTAATTATATTGGGTAGCAAATTAGAGTGAATTATTTGCAAATATAACAACTTTTAAAAAGATTTGTAAATATAACAAATCTTTTAAATAAAGTTTTGTTATTCCCTTTTTATCCTTATGGTTGCATATGAGAGTGAAAAAATGACCACCCAAAACATGTTGTGATTTTTTAATGTCTCCGAACAACTGTGGCATGAGATTTTTTTTGAATTTTTTATTTTCTTTGCCTTCCTCTTCTCCTCCCCTCTCTCCACATGTGTGCATTTCATACACATGTTTTTCATTCCAACTTTTTATGCAATTTAGAAATTTTATGATTATAATATTACTTAAATCTTCTTGAAAATTACAATTAACAAAAAAAAAAGTCTTACTATACTAGATATTATACTGCGAGAGATTAAAACCTATAACAATTTCGTCGTTGATATATTAATTAAATATCACCAAATGGCGATAGAGAAAGAAACAAAATAAAAAGATGAAGTATTAATTGAAAAATATCATTTGATAGAATGTGTAATTAAAAAACAGTTTTACATTTGAACAAAATTCTATATAAATGATTAATCTTTAATGAGAAAAACTTTTGGGTTTAATCTTTAAATCTCCAATTTAAAACTAGTAGTTAACACTTACCAAGCTTCTAAATTATTGAATTTGGAACTTTGAAATAATTTTCGTCCAATAACCCCAAAAATTCAGCTATCTCAATTTAGTCACTTAATTCTAAAGACACTCCAAAAGACGAAATTAGTCGGATCTGAAAACGCGGGACGTTACCCAAGGAGTCTATCAGTGATAGTGATAAACTTCTATCACTGATAGAATTCAATCAACACATAAAAATAATACCAAACAAGAAATCTTTCAGTGATAGTGATAGACTTCTATCACAGATAAACTTCAATCAGAAGAGATAAACAATAGAAATAGAAATCTATCAGTGATAGTGATAGACTCTAATAAAAAGAGATAAACAACACAAATAGAAGTCTATTGATAATAGCGATAGATTTCTATCACTGATCGGTTCCAATTAGAACAGATAAAACACACACATATAGAAGTCTATTAGTGATAATGGTAGACTTTTATCACTGATAGACTCTAATCAGAAGAGATAAACAACAAAAATAGAAGTCTATCGGTGATAGTGATATATTTCAATCAGAACAAATAAAAATCTCAAACAAGAAGTCTATCAGTGATAATGATAGACTTCAATCAGAACAGATAAAAATCTCAAATAAGAAAACTATCATTGATAATGATAGACTTCAATCAGAACATACAAAAATCTCAACGAAAAAGTCTATCAGTGATAGTGATAAACTTCTATCACTGATAGACTTCAATCAGAACAGATAAAAAATCTCAAACAAGAAGTGTATCAGTAATAGTGATAGTAATAAATTTCTATCATTGACAGACTTTAATCAAAATAGATATACAATCAAACCAAGAAGTTCAGTGATAGACTTCTATCATTGATATGAATTGTGGGGTTTTTTTTTGTAATTTAAATAGGTTGAGCAACCAATTTTGCTATATTTGCAATATTTTTTTATATTTTGCTCTATGTGCTATTTTTTACCCTCATTTTGCCATATGCCCAATCGTCCCAAAAAAATATCCTTAATTACAAATTTGGTCTATGTGATTTAATTTAAAGTTAATTTTTATGGTTTTAAAAGATTCTATTTCAATTTGACGGAGGTAGTTTTAAAAGTTTCAATTTGGTCTTATAATTTGAGTAAAACTAACCGATATTCATCTTACAACCACTTTACCAACGTATTTTCCCCCTTTATTTTTACTCTACCTTTAACATATTAATATATCCAATTTAATATTAATAGTAAAAAAGAATATATCAAGTAACTACCAAAATGAGCTTAGCATAACTACAATT
GACATATATATTCAACCATTAAGTCACATGTTTGAATCTCACATTTCTAAATTTTGTTAAACTAGAAAAAAAACATATCAAGTAGCTACTACGTGAATAAAAATATGTCGTGCCTATAAGATTTGATTAGAGCAAAAGGACGAAGTTGAACTTTTGAACATATACAAACCAAATTGAAACTAAGTTTAAATTCTATGGTCTAAATTAAAACCTTAAAAATTATATGTTAAGTATGCCCAAACAATTAGAACTAAATTTGTAATTTAACTTTTATTTTCAATAGATAAAAAACTATGAAATTCACTTTTTCGTTTATCACACATTCTCGGTTACTCAACCTCTTTGAAGAATAAAACTTACCTCTTAACTGACGAGCAAGAATTCATCGTTAAAATATTGACAATTATTTATTATTCATTCTATTTGTCAAAAACATAATGGTGGCAAATAGAAGAAAGCTATCCACTGATTAAATTTAAGATTTAAGTTTTGTCTCTCTCGGATCTCTGGAGAAATTATATTGTTTCTTCTTTTGAGTTCAACAACAAATGGGGTAGACTTTCAAATTCAAACCTCTTAATCAAAGATATATACCTTAATCAATTGAGTTATGTTTACATTGGTAAATAAAATTTTTTAATACTAAAATAATATAGATATTGATCTCATTAATTATGATGTCAAATCTACCTGTATTAAATAATTATTTATATAAAAAAAACCATAACATGACGCATGATTTATTTTTTTTTAAAATGAAAAAAAAAAATTCTAGTGCCTATTCTTCTGCTCATTCATAATTTTAGAGAGTCATATATTAATTTAATACTAGATTCTTTTGCTGTCCCTACCAAAATTTCTGTCACATAAAAGAAAACGTATAAAAAGTTTCTGATTGTTTTGTTAATGGCAACAACAGGCACAGAGCAGTGGACAATTTCAGTGGTGAGATCCTCTCCAAAGTCGTACGCCACAGTCGCAATAAATCCGAATCCGCCGCCGCTTCCGAGGACGAGCCGATAAACCCCACTTCCGCCGTTCAAAAATGGATATCCAACATCCTCAAACCCTCCAATCCCCCCATTTCCGCCCCTCTTCCAATCTCCGATCCTCTTCCCACCCCTCGCAAATCTCGATTCCACACCGATCTTCCTCCTTCTCGTCTCCCAATTCCTCCGCCAGATGTCGCCCTTCTTTCTCCACCCAAGACCCTTACCGATCATCCTCCGAGAAGAACGGTCTCATCGCCGGCCTGTTCGCTTCAATCGATTCGACCCAAATCCAATCTAAATGGGTTCTCGAGGGATGATTCTGGAGACCTGGAATTTGGCCTTAATGGGTTTCTCAAAGAGCAGCGGAGCAAGATTCAACAAATCTCGAATGGGGAGCTTGATGTCGAGGTTAAAATCATTCTATCCGGACCTTCAAACAGTTGAGTATACTCCTAAAATGTAGTATTTGATGCATCTTTTTGAAGAATTGATGGTAGTTTTGATGAGAATGAAGACATTGTGAGTGTTTTGTGCCCTCAATGCATAGAATGAAGTTGAAGTGCATAATTGTAGAGACTAGAGATGGGGTTACTTGCTTGTTTCTGTATTGAAAAAGCTTTTGACATTTTGGGATTGCAGGCACTAGTTCAATGGTGGCGGCAGTTTGCTATGCTTGGCTACTAGAGAACAAAATGAGGCAGCAGAATGCCGATAGCCGTCGAGAGTGTCATGTGGTGCCTGTGATGAATATGCCGAGGGTAAAGATGTGGAACCAACGCCAGGTTGCTTGGCTCTTTTATCATCTTGGACTTGATGCTTCCTCAATTCTTTTCACTGATGAGGTACTTGTTTTCCTTGTGATTTGGTTGTCATGCTTGATAGATCGTTTTCGAATCTGAACCTTATGTGTTACCATCAGCAAAAAGAAATGGTAGCTGAAGAATATTATGTGTTTAGTAATATCTGTTAATCATGTTAAAAAAATGTAGCATGATACAGAGTAATACA
TAGATTTCAGTTGTATTGTAGTAAAGTTAATAAATCTAGAGAACATTGATTGATATGCTACAAATGCACCATAACCATTGACAATGAGTAGTGTACTGATCTTAGTTATTGCTATGTGGTTCAGGTGGACTTAGAAAGTCTGATGATGGCTGGTCGAACTAGCATCCTTGTCGTAGGTCAAGATGTCCTTAAGATGAGTGATGGAGTAAGTCTATAATGATGAAAATGAATTTCACTTGTAAATTATCTTTTGAGAATCCCTTCATAGTTGTCGTATTTTGTGTGATCATTCTTTACTATTTGTTCCTTAGGTTGGATCTCAGTGTACCATTCTTACAGATAACTATTGTGAGGATGCCTATCATTTACTTCAAACCCCTTTGTTGAAGAATCTCTTGGTAACTTTTACTATACAGAACATTTGGTCTTACTTGTCCCCAGTTTTAATTGTACATTCGAGCAACTGATTCTGATAGAAGCAATATTTGGGACTTATTTTCTCATGTGCATCTCATGTCTTCCAGCTCGCAGGAATTCTGTTGGACACAAGAAATCTCGATGCATCGGCTCAATCATCCATGACAAGAGATGCGGAGGCTGTTCAGTTGCTATCGGTTGGCTCAGCCCCAAATTGTAGGAATGGACTTTATGATCAGTGTATGCTTCAGCAATTTAGTTTTACCATCTCTTTATGCGTTTACCAGCACATTCTCTGTTAAAAAATATCAATCTTGCAACCAATTTTTTTCTGGTTTTGTTTCATATTAGTGATGCGAGTTCAAAAGGAGCGTCCATTTTTGGATGCCTTACAACAAAGTTATGGAAAGCCTCCTAATGATGGTCAGCTTCCTATTATTTTATCTTGAAAAATTTATGCATAACCTTTCTCATGTTCTCTCAGTTGCATCAGTTCAAGTACTTTGATCATAGTTCCTCTCCTCTCTTGAACTTCTTGTATCAACTTCGTGTCTATACGTATGACTTTGTAATCTTTCTATAGGTAGTAACAATGGCGTGGGGCGTGTAGAGCGCATCACGGAGAGGAATCAAACATCCATCTCACCCCATGGTGATGCAATTAACCAACAAAAGAAGCGTAATGAGTTTGGAACTGCCAAAACTTGCAGGGTTTCGCCGAAATCCGGTACGTGTAACAGCATACAATTACACTATAACCCTTTTGCATTCTACATTGATGAGAGATTTTATTAACATGGAAACTGTCTCTAATTTTTCTCTTGTAGCTAAACCCAGTTCGTTACCTATTCAAACACCAGCTAGAGAAGCACCCAACACATCTCGTGGAAAGAACAAAAATTTCCTGGCAAAATGGTTTGGTTTTGGGTCAAAATGAGATAAGATTGGCTGGGTTCGATCGAGCTTTTAGCCGGATTCAGAGATCCCCAGAATTTGAACATGAATCATATTGGTTCAAGTCCTTGAGTAGTTGATCGGTTCATCATGATCTGAGTTTTGTCCCACATTTTCACAAATATATGTTTTTTTTTTGTTCTTTGCAGTGTATTAGGCATTGTAGTAATAGTAGCTCTGAGCCTTTTGATGATTCTTCTGAACAGAAGATTCATAAAAGTTATTATTTATGTTGGTGTGTTGTAACGATGCTGGAAATTCAGATTCTTGTAATTTGGGATTGGTATTCCGAATGGTG
mRNA sequence
CTCACATCTTTACAACAAACTTCCAATTCTCTTCATCAGAAAAGACAGAGACCCAAACTGCCGGCCATAATCTATGCGAAAAAAATCTCCCCAAAAAAATGATAATTCCAATGTTCGAATTAACAAATTAAATCCCTTCTTCAAGTACAAATTAATAACCTTCTCTACCGAAAATCAACTCGCAATCTCTAAATTCAAACTCCAATTAGCAAGATACAGTTCTAAATTAAGGTTTCAGAGCAATGGAATTGAGATCGAGCAGAGAGAATTCCAGAGAAATTGCAAAATTAGGGCACAAAATCGTTCAAAAGAGACTAGCCAATTCTCGGCCCAAGGCTCAGCAGCAGGCTCCCGATCTGACCGATTTTATGAACGATATGTTCTTTGGAACCGTAAACAAGGACAAGAAAACCTATAATTTGACTGGTGATGAAGATGATGAAGATGAAGAAGAATTGTTTGATCGGAGCAACAGAAGTAGAAACAGCCAATTGACGGAGGAGTGGCTGGATGAGGCGCGCCGGTTGGTGACTTCTTCTCCTTCGCGGTCTAATTCTCCGGCTCGGCTGGGTGGATCGCCGAGGTTTGCGGCCGTCAACGGCATATCGCCGGCGTCGATTAATGATCGGAGAGATCCACTCTCACGCTCCGCTAGAAGGCACAGAGCAGTGGACAATTTCAGTGGTGAGATCCTCTCCAAAGTCGTACGCCACAGTCGCAATAAATCCGAATCCGCCGCCGCTTCCGAGGACGAGCCGATAAACCCCACTTCCGCCGTTCAAAAATGGATATCCAACATCCTCAAACCCTCCAATCCCCCCATTTCCGCCCCTCTTCCAATCTCCGATCCTCTTCCCACCCCTCGCAAATCTCGATTCCACACCGATCTTCCTCCTTCTCGTCTCCCAATTCCTCCGCCAGATGTCGCCCTTCTTTCTCCACCCAAGACCCTTACCGATCATCCTCCGAGAAGAACGGTCTCATCGCCGGCCTGTTCGCTTCAATCGATTCGACCCAAATCCAATCTAAATGGGTTCTCGAGGGATGATTCTGGAGACCTGGAATTTGGCCTTAATGGGTTTCTCAAAGAGCAGCGGAGCAAGATTCAACAAATCTCGAATGGGGAGCTTGATGTCGAGGTTAAAATCATTCTATCCGGACCTTCAAACAGCACTAGTTCAATGGTGGCGGCAGTTTGCTATGCTTGGCTACTAGAGAACAAAATGAGGCAGCAGAATGCCGATAGCCGTCGAGAGTGTCATGTGGTGCCTGTGATGAATATGCCGAGGGTAAAGATGTGGAACCAACGCCAGGTTGCTTGGCTCTTTTATCATCTTGGACTTGATGCTTCCTCAATTCTTTTCACTGATGAGGTGGACTTAGAAAGTCTGATGATGGCTGGTCGAACTAGCATCCTTGTCGTAGGTCAAGATGTCCTTAAGATGAGTGATGGAGTTGGATCTCAGTGTACCATTCTTACAGATAACTATTGTGAGGATGCCTATCATTTACTTCAAACCCCTTTGTTGAAGAATCTCTTGCTCGCAGGAATTCTGTTGGACACAAGAAATCTCGATGCATCGGCTCAATCATCCATGACAAGAGATGCGGAGGCTGTTCAGTTGCTATCGGTTGGCTCAGCCCCAAATTGTAGGAATGGACTTTATGATCAGTTGATGCGAGTTCAAAAGGAGCGTCCATTTTTGGATGCCTTACAACAAAGTTATGGAAAGCCTCCTAATGATGGTAGTAACAATGGCGTGGGGCGTGTAGAGCGCATCACGGAGAGGAATCAAACATCCATCTCACCCCATGGTGATGCAATTAACCAACAAAAGAAGCGTAATGAGTTTGGAACTGCCAAAACTTGCAGGGTTTCGCCGAAATCCGGTACGTGTAACAGCATACAATTACACTATAACCCTTTTGCATTCTACATTGATGAGAGATTTTATTAACATGGAAACTGTCTCTAATTTTTCTCTTGTAGCTAAACCCAG
TTCGTTACCTATTCAAACACCAGCTAGAGAAGCACCCAACACATCTCGTGGAAAGAACAAAAATTTCCTGGCAAAATGGTTTGGTTTTGGGTCAAAATGAGATAAGATTGGCTGGGTTCGATCGAGCTTTTAGCCGGATTCAGAGATCCCCAGAATTTGAACATGAATCATATTGGTTCAAGTCCTTGAGTAGTTGATCGGTTCATCATGATCTGAGTTTTGTCCCACATTTTCACAAATATATGTTTTTTTTTTGTTCTTTGCAGTGTATTAGGCATTGTAGTAATAGTAGCTCTGAGCCTTTTGATGATTCTTCTGAACAGAAGATTCATAAAAGTTATTATTTATGTTGGTGTGTTGTAACGATGCTGGAAATTCAGATTCTTGTAATTTGGGATTGGTATTCCGAATGGTG
Coding sequence (CDS)
ATGGAATTGAGATCGAGCAGAGAGAATTCCAGAGAAATTGCAAAATTAGGGCACAAAATCGTTCAAAAGAGACTAGCCAATTCTCGGCCCAAGGCTCAGCAGCAGGCTCCCGATCTGACCGATTTTATGAACGATATGTTCTTTGGAACCGTAAACAAGGACAAGAAAACCTATAATTTGACTGGTGATGAAGATGATGAAGATGAAGAAGAATTGTTTGATCGGAGCAACAGAAGTAGAAACAGCCAATTGACGGAGGAGTGGCTGGATGAGGCGCGCCGGTTGGTGACTTCTTCTCCTTCGCGGTCTAATTCTCCGGCTCGGCTGGGTGGATCGCCGAGGTTTGCGGCCGTCAACGGCATATCGCCGGCGTCGATTAATGATCGGAGAGATCCACTCTCACGCTCCGCTAGAAGGCACAGAGCAGTGGACAATTTCAGTGGTGAGATCCTCTCCAAAGTCGTACGCCACAGTCGCAATAAATCCGAATCCGCCGCCGCTTCCGAGGACGAGCCGATAAACCCCACTTCCGCCGTTCAAAAATGGATATCCAACATCCTCAAACCCTCCAATCCCCCCATTTCCGCCCCTCTTCCAATCTCCGATCCTCTTCCCACCCCTCGCAAATCTCGATTCCACACCGATCTTCCTCCTTCTCGTCTCCCAATTCCTCCGCCAGATGTCGCCCTTCTTTCTCCACCCAAGACCCTTACCGATCATCCTCCGAGAAGAACGGTCTCATCGCCGGCCTGTTCGCTTCAATCGATTCGACCCAAATCCAATCTAAATGGGTTCTCGAGGGATGATTCTGGAGACCTGGAATTTGGCCTTAATGGGTTTCTCAAAGAGCAGCGGAGCAAGATTCAACAAATCTCGAATGGGGAGCTTGATGTCGAGGTTAAAATCATTCTATCCGGACCTTCAAACAGCACTAGTTCAATGGTGGCGGCAGTTTGCTATGCTTGGCTACTAGAGAACAAAATGAGGCAGCAGAATGCCGATAGCCGTCGAGAGTGTCATGTGGTGCCTGTGATGAATATGCCGAGGGTAAAGATGTGGAACCAACGCCAGGTTGCTTGGCTCTTTTATCATCTTGGACTTGATGCTTCCTCAATTCTTTTCACTGATGAGGTGGACTTAGAAAGTCTGATGATGGCTGGTCGAACTAGCATCCTTGTCGTAGGTCAAGATGTCCTTAAGATGAGTGATGGAGTTGGATCTCAGTGTACCATTCTTACAGATAACTATTGTGAGGATGCCTATCATTTACTTCAAACCCCTTTGTTGAAGAATCTCTTGCTCGCAGGAATTCTGTTGGACACAAGAAATCTCGATGCATCGGCTCAATCATCCATGACAAGAGATGCGGAGGCTGTTCAGTTGCTATCGGTTGGCTCAGCCCCAAATTGTAGGAATGGACTTTATGATCAGTTGATGCGAGTTCAAAAGGAGCGTCCATTTTTGGATGCCTTACAACAAAGTTATGGAAAGCCTCCTAATGATGGTAGTAACAATGGCGTGGGGCGTGTAGAGCGCATCACGGAGAGGAATCAAACATCCATCTCACCCCATGGTGATGCAATTAACCAACAAAAGAAGCGTAATGAGTTTGGAACTGCCAAAACTTGCAGGGTTTCGCCGAAATCCGGTACGTGTAACAGCATACAATTACACTATAACCCTTTTGCATTCTACATTGATGAGAGATTTTATTAA
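The mRNA and CDS records above are related by simple containment: the CDS is the stretch of the mRNA between the five_prime_UTR and the three_prime_UTR. A minimal Python sketch of that split follows, assuming the CDS occurs exactly once as a substring of the mRNA and that any line wrapping in the mRNA record has been joined first. The function name `split_transcript` is hypothetical, and the demo uses a toy sequence rather than the Tan0015821 data.

```python
def split_transcript(mrna, cds):
    """Split an mRNA into 5' UTR, CDS and 3' UTR by exact substring match.

    Assumption: the CDS appears exactly once, unmodified, inside the mRNA.
    """
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return {
        "five_prime_UTR": mrna[:start],
        "CDS": mrna[start:end],
        "three_prime_UTR": mrna[end:],
    }


# Toy example (not the real Tan0015821 sequences).
parts = split_transcript("GGCATGAAATAACCT", "ATGAAATAA")
for name, seq in parts.items():
    print(name, len(seq))
```

For the real record, pass the concatenated mRNA lines and the CDS line; the lengths of the two UTRs then follow directly.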
Protein sequence
MELRSSRENSREIAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNLTGDEDDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFAAVNGISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSESAAASEDEPINPTSAVQKWISNILKPSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSPPKTLTDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISNGELDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMWNQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILTDNYCEDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNGLYDQLMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKKRNEFGTAKTCRVSPKSGTCNSIQLHYNPFAFYIDERFY
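The protein sequence corresponds to the CDS read three bases at a time. The sketch below shows that step in Python, assuming the standard nuclear genetic code and a CDS that starts in frame; the helper name `translate` is hypothetical and this is not necessarily how the database generated its protein record. The two demo inputs are the first 18 and last 15 bases of the CDS above.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGAATTGAGATCGAGC"))  # first six codons of the CDS
print(translate("GAGAGATTTTATTAA"))     # last five codons, including the stop
```

The first call yields the leading residues MELRSS and the second yields ERFY, matching the start and end of the protein sequence listed above.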
Homology
BLAST of Tan0015821 vs. NCBI nr
Match:
XP_023542616.1 (uncharacterized protein LOC111802467 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 921.8 bits (2381), Expect = 2.9e-264
Identity = 482/552 (87.32%), Positives = 508/552 (92.03%), Query Frame = 0
Query: 1 MELRSSRENSREIAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNL 60
ME R SREN+RE+AKLGH IVQKRLANSRPK+QQQAPDLTDFMNDMFFG+VNK+KK YNL
Sbjct: 1 MESRISRENTREMAKLGHIIVQKRLANSRPKSQQQAPDLTDFMNDMFFGSVNKEKKAYNL 60
Query: 61 TGDEDDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFAAVNG 120
TGDED+E+EEE FDRSNRSRNS LTEEWLDEARRLV SSPSR NSPAR GSPRFAA NG
Sbjct: 61 TGDEDEEEEEESFDRSNRSRNSLLTEEWLDEARRLVASSPSRPNSPARFVGSPRFAAANG 120
Query: 121 ISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES---AAASEDEPINPTS 180
S A INDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES ++A+ +EPINP+S
Sbjct: 121 RSSALINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSESFSVSSAAVEEPINPSS 180
Query: 181 AVQKWISNILKPSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSPPKTL 240
AVQKWISN+LKPSNP PISDP PT RKSRFHTDLPPSRLPIPPPDV LLSPPKTL
Sbjct: 181 AVQKWISNVLKPSNP------PISDPPPTTRKSRFHTDLPPSRLPIPPPDV-LLSPPKTL 240
Query: 241 TDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISNGELD 300
T+ PPRRTVSSPACS+QSIRPKSNLNGFSRDDS DLEFGLNGFLKEQRSKIQQIS+G+LD
Sbjct: 241 TNPPPRRTVSSPACSIQSIRPKSNLNGFSRDDSEDLEFGLNGFLKEQRSKIQQISDGQLD 300
Query: 301 VEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMWNQRQ 360
VEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQ NA+S REC VVPVMNM R MWNQRQ
Sbjct: 301 VEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQSNAESGRECLVVPVMNMQRGNMWNQRQ 360
Query: 361 VAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILTDNYC 420
VAWLFYHLGLDASSILFTDEVDLESLM+AGRTS+L+VGQDVLKMSDGVGSQCTILTDNYC
Sbjct: 361 VAWLFYHLGLDASSILFTDEVDLESLMVAGRTSVLIVGQDVLKMSDGVGSQCTILTDNYC 420
Query: 421 EDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNGLYDQ 480
EDAYHLLQTPLLKNLLLAGILLDT+NLD SAQSSMTRDAEAVQLLSVGSAPNCRNGLYDQ
Sbjct: 421 EDAYHLLQTPLLKNLLLAGILLDTKNLDGSAQSSMTRDAEAVQLLSVGSAPNCRNGLYDQ 480
Query: 481 LMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKKRNEF 540
LMRVQKERPFLDALQQSYGKPP+DGSN+G G VERI ERN+TSISPH D INQQKK N+F
Sbjct: 481 LMRVQKERPFLDALQQSYGKPPDDGSNDGTGHVERIMERNRTSISPHDDTINQQKKPNDF 540
Query: 541 GTAKTCRVSPKS 550
GTAK CR SPKS
Sbjct: 541 GTAKICRASPKS 545
BLAST of Tan0015821 vs. NCBI nr
Match:
KAG6573088.1 (50S ribosomal protein L19, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 913.3 bits (2359), Expect = 1.0e-261
Identity = 480/556 (86.33%), Positives = 508/556 (91.37%), Query Frame = 0
Query: 1 MELRSSRENSREIAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNL 60
ME R SREN+RE+AKLGH IVQKRLANSRPKAQQQAPDLTDFMNDMFFG+VNK+KK YNL
Sbjct: 1 MESRISRENTREMAKLGHIIVQKRLANSRPKAQQQAPDLTDFMNDMFFGSVNKEKKAYNL 60
Query: 61 TGD----EDDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFA 120
TGD ED+E++E+ FDRSNRSRNS LTEEWLDEARRLV SSPSR NSPAR GSPRFA
Sbjct: 61 TGDENEEEDEEEQEDSFDRSNRSRNSLLTEEWLDEARRLVASSPSRPNSPARFVGSPRFA 120
Query: 121 AVNGISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES---AAASEDEPI 180
A NG S A INDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES ++A+ +EPI
Sbjct: 121 AANGRSSALINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSESFSVSSAAVEEPI 180
Query: 181 NPTSAVQKWISNILKPSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSP 240
NP+SAVQKWISN+LKPSNP +SDP PT RKSRFHTDLPPSRLPIPPPDV LLSP
Sbjct: 181 NPSSAVQKWISNVLKPSNP------TLSDPPPTTRKSRFHTDLPPSRLPIPPPDV-LLSP 240
Query: 241 PKTLTDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISN 300
PKTLT+ PPRRTVSSPACS+QSIRPKSNLNGFSRDDS DLEFGLNGFLKEQRSKIQQIS+
Sbjct: 241 PKTLTNPPPRRTVSSPACSIQSIRPKSNLNGFSRDDSEDLEFGLNGFLKEQRSKIQQISD 300
Query: 301 GELDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMW 360
G+LDVEVKIILSGPSNSTSSMVAAVCYAWLLENKM+Q NA+S REC VVPVMNM R MW
Sbjct: 301 GQLDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMKQSNAESGRECLVVPVMNMQRGNMW 360
Query: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILT 420
NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTS+LVVGQDVLKMSDGVGSQCTILT
Sbjct: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSVLVVGQDVLKMSDGVGSQCTILT 420
Query: 421 DNYCEDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNG 480
DNYCEDAYHLLQTPLLKNL+LAGILLDT+NLD SAQSSMTRDAEAVQLLSVGSAPNCRNG
Sbjct: 421 DNYCEDAYHLLQTPLLKNLMLAGILLDTKNLDGSAQSSMTRDAEAVQLLSVGSAPNCRNG 480
Query: 481 LYDQLMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKK 540
LYDQLMRVQKERPFLDALQQSYGKPP+DGSN+G G VERI ERN+TSISPH D INQQKK
Sbjct: 481 LYDQLMRVQKERPFLDALQQSYGKPPDDGSNDGAGHVERIMERNRTSISPHDDTINQQKK 540
Query: 541 RNEFGTAKTCRVSPKS 550
N+FGTAKTCR SPKS
Sbjct: 541 PNDFGTAKTCRASPKS 549
BLAST of Tan0015821 vs. NCBI nr
Match:
KAG7012275.1 (hypothetical protein SDJN02_25027, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 902.5 bits (2331), Expect = 1.8e-258
Identity = 480/562 (85.41%), Positives = 506/562 (90.04%), Query Frame = 0
Query: 1 MELRSSRENSREIAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNL 60
ME R SREN+RE+AKLGH IVQKRLANSR KAQQQAPDLTDFMNDMFFG+VNK+KK YNL
Sbjct: 1 MESRISRENTREMAKLGHIIVQKRLANSRSKAQQQAPDLTDFMNDMFFGSVNKEKKAYNL 60
Query: 61 TGD----EDDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFA 120
TGD ED+E+EEE FDRSNRSRNS LTEEWLDEARRLV SSPSR NSPAR GSPRFA
Sbjct: 61 TGDENEEEDEEEEEESFDRSNRSRNSLLTEEWLDEARRLVASSPSRPNSPARFVGSPRFA 120
Query: 121 AVNGISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES---AAASEDEPI 180
A NG S A INDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES ++A+ +EPI
Sbjct: 121 AANGRSSALINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSESFSVSSAAVEEPI 180
Query: 181 NPTSAVQKWISNILKPSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSP 240
NP+SAVQKWISN+LKPSNP PISDP PT RKSRFHTDLPPSRLPIPPPD ALLSP
Sbjct: 181 NPSSAVQKWISNVLKPSNP------PISDPPPTTRKSRFHTDLPPSRLPIPPPD-ALLSP 240
Query: 241 PKTLTDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISN 300
PKTLT+ PPRRTVSSPACS+QSIRPKS LN FSRDDS DLEFGLNGFLKEQRSKIQQIS+
Sbjct: 241 PKTLTNPPPRRTVSSPACSIQSIRPKSILNEFSRDDSEDLEFGLNGFLKEQRSKIQQISD 300
Query: 301 GELDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMW 360
G+LDVEVKIILSGPSNSTSSMVAAVCYAWLLENKM+Q NA+S REC VVPVMNM R MW
Sbjct: 301 GQLDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMKQSNAESGRECLVVPVMNMQRGNMW 360
Query: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILT 420
NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTS+LVVGQDVLKMSDGVGSQCTILT
Sbjct: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSVLVVGQDVLKMSDGVGSQCTILT 420
Query: 421 DNYCEDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNG 480
DNYCEDAYHLLQTPLLKNL+LAGILLDT+NLD SAQSSMTRDAEAVQLLSVGSAPNCRNG
Sbjct: 421 DNYCEDAYHLLQTPLLKNLMLAGILLDTKNLDGSAQSSMTRDAEAVQLLSVGSAPNCRNG 480
Query: 481 LYDQ------LMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDA 540
LYDQ +MRVQKERPFLDALQQSYGKPP+DGSN+G G VERI ERN+TSISPH D
Sbjct: 481 LYDQCTLHRSMMRVQKERPFLDALQQSYGKPPDDGSNDGAGHVERIMERNRTSISPHDDT 540
Query: 541 INQQKKRNEFGTAKTCRVSPKS 550
INQQKK N+FGTAKTCR SPKS
Sbjct: 541 INQQKKPNDFGTAKTCRASPKS 555
BLAST of Tan0015821 vs. NCBI nr
Match:
XP_022954466.1 (uncharacterized protein LOC111456732 [Cucurbita moschata])
HSP 1 Score: 901.7 bits (2329), Expect = 3.1e-258
Identity = 475/556 (85.43%), Positives = 504/556 (90.65%), Query Frame = 0
Query: 1 MELRSSRENSREIAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNL 60
ME R SREN+RE+AKLGH IVQKRLANSRPKAQQQAPDLTDFMNDMFFG+VNK+KK YNL
Sbjct: 1 MESRISRENTREMAKLGHIIVQKRLANSRPKAQQQAPDLTDFMNDMFFGSVNKEKKAYNL 60
Query: 61 TGD----EDDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFA 120
TGD ED+E++E+ FDRSNRSRNS LTEEWLDEARRLV SSPSR NSPAR GSPRFA
Sbjct: 61 TGDENEEEDEEEQEDSFDRSNRSRNSLLTEEWLDEARRLVASSPSRPNSPARFVGSPRFA 120
Query: 121 AVNGISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES---AAASEDEPI 180
A NG S A INDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES ++A+ +EPI
Sbjct: 121 AANGRSSALINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSESFSVSSAAVEEPI 180
Query: 181 NPTSAVQKWISNILKPSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSP 240
NP+SAVQKWISN+LKPSNP +SDP PT RKSRFHTDLPPSRLPIPPPDV LLSP
Sbjct: 181 NPSSAVQKWISNVLKPSNP------TLSDPPPTTRKSRFHTDLPPSRLPIPPPDV-LLSP 240
Query: 241 PKTLTDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISN 300
PKTLT+ PPRRTVSS ACS+QSIRPKSNLN FSRDDS DLEFGLNGFLKEQRSKIQQIS+
Sbjct: 241 PKTLTNPPPRRTVSSSACSIQSIRPKSNLNEFSRDDSEDLEFGLNGFLKEQRSKIQQISD 300
Query: 301 GELDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMW 360
G+LDVEVKIILSGPSNSTSSMVAAVCYAWLLENKM+Q NA+S REC VVPVMNM R MW
Sbjct: 301 GQLDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMKQSNAESGRECLVVPVMNMQRGNMW 360
Query: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILT 420
NQRQVAWLFYHLGLDASSILFTDEVDLESLM+ GRTS+LVVGQDVLKMSDGVGSQCTILT
Sbjct: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMVTGRTSVLVVGQDVLKMSDGVGSQCTILT 420
Query: 421 DNYCEDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNG 480
DNYCEDAYHLLQTPLLKNL+LAGILLDT+NLD SAQ SMTRDAEAVQLLSVGSAPNCRNG
Sbjct: 421 DNYCEDAYHLLQTPLLKNLMLAGILLDTKNLDGSAQLSMTRDAEAVQLLSVGSAPNCRNG 480
Query: 481 LYDQLMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKK 540
LYDQLMRVQKERPFLDALQQSYGKPP+DGSN+G G VERI ERN+TSISPH D INQQKK
Sbjct: 481 LYDQLMRVQKERPFLDALQQSYGKPPDDGSNDGAGHVERIMERNRTSISPHDDTINQQKK 540
Query: 541 RNEFGTAKTCRVSPKS 550
N+FGTAKTCR SPKS
Sbjct: 541 PNDFGTAKTCRASPKS 549
BLAST of Tan0015821 vs. NCBI nr
Match:
XP_008446103.1 (PREDICTED: uncharacterized protein LOC103488927 [Cucumis melo])
HSP 1 Score: 840.9 bits (2171), Expect = 6.6e-240
Identity = 453/563 (80.46%), Positives = 491/563 (87.21%), Query Frame = 0
Query: 1 MELRSSRENSREIAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNL 60
MELR SRE SRE+AKLG QKR+ NSRPK QQQAPDLTDFMNDMFFG VNKDKK YNL
Sbjct: 1 MELRHSREKSREMAKLG----QKRVTNSRPKTQQQAPDLTDFMNDMFFGAVNKDKKAYNL 60
Query: 61 TGDE--DDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFAAV 120
TG+E DD+D+EE FDRSNRSRN QLTEEWLDEARRLV SSPSR NSPARL GSPRFAA
Sbjct: 61 TGNEENDDDDDEEWFDRSNRSRNEQLTEEWLDEARRLVASSPSRCNSPARLVGSPRFAAA 120
Query: 121 NGISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES---AAASEDEPINP 180
NG SPAS DRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES ++A+E++ INP
Sbjct: 121 NGRSPASNIDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSESFSTSSAAEEDLINP 180
Query: 181 TSAVQKWISNILK-PSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSPP 240
SAVQKWISNILK PSNP IS P P P TPRKSRFHT LPPSRLP P D ALLSPP
Sbjct: 181 ASAVQKWISNILKPPSNPAISIPDP---PPSTPRKSRFHTHLPPSRLPNTPSD-ALLSPP 240
Query: 241 KTLTDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISNG 300
K LTD PPRRTVSSPA S+Q++R KSNLNGFSRDDSGDLEFGLNGFLKEQR+KI++ISNG
Sbjct: 241 KILTDPPPRRTVSSPAFSIQTVRSKSNLNGFSRDDSGDLEFGLNGFLKEQRNKIKKISNG 300
Query: 301 ELDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMWN 360
ELD EVKIILSGP+NSTSSMVAA+CYAWLLENK+RQ N ++ +EC VVPVMNM R KMWN
Sbjct: 301 ELDAEVKIILSGPTNSTSSMVAAICYAWLLENKLRQTNVETGQECVVVPVMNMQRGKMWN 360
Query: 361 QRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILTD 420
QRQVAWLFYHLGLDASSILFTDEVDLESLM+ G+TSILVVGQDVLKM+DGVGSQCTILTD
Sbjct: 361 QRQVAWLFYHLGLDASSILFTDEVDLESLMITGQTSILVVGQDVLKMNDGVGSQCTILTD 420
Query: 421 NYCEDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNGL 480
NYCEDAYHLLQTPLLKNLLLAGILLDT+NLD S+QSSMTRDAEAVQLLSVGSAP +NGL
Sbjct: 421 NYCEDAYHLLQTPLLKNLLLAGILLDTKNLDTSSQSSMTRDAEAVQLLSVGSAPISKNGL 480
Query: 481 YDQLMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKKR 540
YDQLMRVQKE FLDAL Q+YGKPP+DGSN+G GR E I ERNQ S PHG+AINQQKK
Sbjct: 481 YDQLMRVQKESSFLDALIQNYGKPPSDGSNDGEGRSEHIKERNQPSSPPHGEAINQQKKS 540
Query: 541 NEFGTAKTCRVSPKSGTCNSIQL 558
++ GTAKT +VSPKS +S+ +
Sbjct: 541 SDIGTAKTSKVSPKSAKPSSLPI 555
BLAST of Tan0015821 vs. ExPASy TrEMBL
Match:
A0A6J1GQZ5 (uncharacterized protein LOC111456732 OS=Cucurbita moschata OX=3662 GN=LOC111456732 PE=4 SV=1)
HSP 1 Score: 901.7 bits (2329), Expect = 1.5e-258
Identity = 475/556 (85.43%), Positives = 504/556 (90.65%), Query Frame = 0
Query: 1 MELRSSRENSREIAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNL 60
ME R SREN+RE+AKLGH IVQKRLANSRPKAQQQAPDLTDFMNDMFFG+VNK+KK YNL
Sbjct: 1 MESRISRENTREMAKLGHIIVQKRLANSRPKAQQQAPDLTDFMNDMFFGSVNKEKKAYNL 60
Query: 61 TGD----EDDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFA 120
TGD ED+E++E+ FDRSNRSRNS LTEEWLDEARRLV SSPSR NSPAR GSPRFA
Sbjct: 61 TGDENEEEDEEEQEDSFDRSNRSRNSLLTEEWLDEARRLVASSPSRPNSPARFVGSPRFA 120
Query: 121 AVNGISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES---AAASEDEPI 180
A NG S A INDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES ++A+ +EPI
Sbjct: 121 AANGRSSALINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSESFSVSSAAVEEPI 180
Query: 181 NPTSAVQKWISNILKPSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSP 240
NP+SAVQKWISN+LKPSNP +SDP PT RKSRFHTDLPPSRLPIPPPDV LLSP
Sbjct: 181 NPSSAVQKWISNVLKPSNP------TLSDPPPTTRKSRFHTDLPPSRLPIPPPDV-LLSP 240
Query: 241 PKTLTDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISN 300
PKTLT+ PPRRTVSS ACS+QSIRPKSNLN FSRDDS DLEFGLNGFLKEQRSKIQQIS+
Sbjct: 241 PKTLTNPPPRRTVSSSACSIQSIRPKSNLNEFSRDDSEDLEFGLNGFLKEQRSKIQQISD 300
Query: 301 GELDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMW 360
G+LDVEVKIILSGPSNSTSSMVAAVCYAWLLENKM+Q NA+S REC VVPVMNM R MW
Sbjct: 301 GQLDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMKQSNAESGRECLVVPVMNMQRGNMW 360
Query: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILT 420
NQRQVAWLFYHLGLDASSILFTDEVDLESLM+ GRTS+LVVGQDVLKMSDGVGSQCTILT
Sbjct: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMVTGRTSVLVVGQDVLKMSDGVGSQCTILT 420
Query: 421 DNYCEDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNG 480
DNYCEDAYHLLQTPLLKNL+LAGILLDT+NLD SAQ SMTRDAEAVQLLSVGSAPNCRNG
Sbjct: 421 DNYCEDAYHLLQTPLLKNLMLAGILLDTKNLDGSAQLSMTRDAEAVQLLSVGSAPNCRNG 480
Query: 481 LYDQLMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKK 540
LYDQLMRVQKERPFLDALQQSYGKPP+DGSN+G G VERI ERN+TSISPH D INQQKK
Sbjct: 481 LYDQLMRVQKERPFLDALQQSYGKPPDDGSNDGAGHVERIMERNRTSISPHDDTINQQKK 540
Query: 541 RNEFGTAKTCRVSPKS 550
N+FGTAKTCR SPKS
Sbjct: 541 PNDFGTAKTCRASPKS 549
BLAST of Tan0015821 vs. ExPASy TrEMBL
Match:
A0A1S3BEY4 (uncharacterized protein LOC103488927 OS=Cucumis melo OX=3656 GN=LOC103488927 PE=4 SV=1)
HSP 1 Score: 840.9 bits (2171), Expect = 3.2e-240
Identity = 453/563 (80.46%), Positives = 491/563 (87.21%), Query Frame = 0
Query: 1 MELRSSRENSREIAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNL 60
MELR SRE SRE+AKLG QKR+ NSRPK QQQAPDLTDFMNDMFFG VNKDKK YNL
Sbjct: 1 MELRHSREKSREMAKLG----QKRVTNSRPKTQQQAPDLTDFMNDMFFGAVNKDKKAYNL 60
Query: 61 TGDE--DDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFAAV 120
TG+E DD+D+EE FDRSNRSRN QLTEEWLDEARRLV SSPSR NSPARL GSPRFAA
Sbjct: 61 TGNEENDDDDDEEWFDRSNRSRNEQLTEEWLDEARRLVASSPSRCNSPARLVGSPRFAAA 120
Query: 121 NGISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES---AAASEDEPINP 180
NG SPAS DRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES ++A+E++ INP
Sbjct: 121 NGRSPASNIDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSESFSTSSAAEEDLINP 180
Query: 181 TSAVQKWISNILK-PSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSPP 240
SAVQKWISNILK PSNP IS P P P TPRKSRFHT LPPSRLP P D ALLSPP
Sbjct: 181 ASAVQKWISNILKPPSNPAISIPDP---PPSTPRKSRFHTHLPPSRLPNTPSD-ALLSPP 240
Query: 241 KTLTDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISNG 300
K LTD PPRRTVSSPA S+Q++R KSNLNGFSRDDSGDLEFGLNGFLKEQR+KI++ISNG
Sbjct: 241 KILTDPPPRRTVSSPAFSIQTVRSKSNLNGFSRDDSGDLEFGLNGFLKEQRNKIKKISNG 300
Query: 301 ELDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMWN 360
ELD EVKIILSGP+NSTSSMVAA+CYAWLLENK+RQ N ++ +EC VVPVMNM R KMWN
Sbjct: 301 ELDAEVKIILSGPTNSTSSMVAAICYAWLLENKLRQTNVETGQECVVVPVMNMQRGKMWN 360
Query: 361 QRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILTD 420
QRQVAWLFYHLGLDASSILFTDEVDLESLM+ G+TSILVVGQDVLKM+DGVGSQCTILTD
Sbjct: 361 QRQVAWLFYHLGLDASSILFTDEVDLESLMITGQTSILVVGQDVLKMNDGVGSQCTILTD 420
Query: 421 NYCEDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNGL 480
NYCEDAYHLLQTPLLKNLLLAGILLDT+NLD S+QSSMTRDAEAVQLLSVGSAP +NGL
Sbjct: 421 NYCEDAYHLLQTPLLKNLLLAGILLDTKNLDTSSQSSMTRDAEAVQLLSVGSAPISKNGL 480
Query: 481 YDQLMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKKR 540
YDQLMRVQKE FLDAL Q+YGKPP+DGSN+G GR E I ERNQ S PHG+AINQQKK
Sbjct: 481 YDQLMRVQKESSFLDALIQNYGKPPSDGSNDGEGRSEHIKERNQPSSPPHGEAINQQKKS 540
Query: 541 NEFGTAKTCRVSPKSGTCNSIQL 558
++ GTAKT +VSPKS +S+ +
Sbjct: 541 SDIGTAKTSKVSPKSAKPSSLPI 555
BLAST of Tan0015821 vs. ExPASy TrEMBL
Match:
A0A6J1C6V5 (uncharacterized protein LOC111008802 OS=Momordica charantia OX=3673 GN=LOC111008802 PE=4 SV=1)
HSP 1 Score: 820.1 bits (2117), Expect = 5.8e-234
Identity = 448/564 (79.43%), Positives = 485/564 (85.99%), Query Frame = 0
Query: 1 MELRSSRENSREIAKLGHKIVQ-KRLAN-SRPKAQQQAPDLTDFMNDMFFGTVNKDKKTY 60
ME + SRENSR+ AKLGHK V KRLAN SRP QQAPDLTDFMNDMFFG VN D++ Y
Sbjct: 1 METKQSRENSRDAAKLGHKNVPIKRLANHSRP---QQAPDLTDFMNDMFFGAVNADRRAY 60
Query: 61 NLTGDEDDEDEEELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFAAV 120
NLTG +EE+ FDRS SR+SQLTEEWLDEARRLV SSPSR +SPARL GSPRFAA
Sbjct: 61 NLTGTA--TEEEDSFDRS--SRSSQLTEEWLDEARRLVASSPSRCSSPARLVGSPRFAAA 120
Query: 121 NGISPASINDRRDPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES-----AAASEDEPI 180
NG SPA++ DRRDPLSRSARR RAVDNFSGEILSKVVRHSRNKSES AAA E+E I
Sbjct: 121 NGRSPAALIDRRDPLSRSARRRRAVDNFSGEILSKVVRHSRNKSESFSTSAAAAEEEEHI 180
Query: 181 NPTSAVQKWISNILKPSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSP 240
NP AVQKWISNIL PSNPP++ P+PISDP TPRKSRFHT+LP SRL IPP D ALLSP
Sbjct: 181 NPALAVQKWISNILNPSNPPVATPIPISDPPSTPRKSRFHTNLPSSRLAIPPSD-ALLSP 240
Query: 241 PKTLTDHPPRRTVSSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISN 300
PK LT+ PPRRT+SSPACSLQ+IRPKS+LNGF+R DSGDLEFGLNGFL+EQR KIQ ISN
Sbjct: 241 PKALTESPPRRTLSSPACSLQAIRPKSSLNGFARADSGDLEFGLNGFLEEQRRKIQNISN 300
Query: 301 GELDVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMW 360
GEL+VEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQ A+S +EC VVPV+NM R KMW
Sbjct: 301 GELNVEVKIILSGPSNSTSSMVAAVCYAWLLENKMRQSEAESCQECLVVPVINMQRGKMW 360
Query: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILT 420
NQRQVAWLFYHLGLDASSILFTDEVDLESL+MAG+TSILVVGQDVLKMSDGVGSQCTILT
Sbjct: 361 NQRQVAWLFYHLGLDASSILFTDEVDLESLLMAGQTSILVVGQDVLKMSDGVGSQCTILT 420
Query: 421 DNYCEDAYHLLQTPLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNG 480
DNYCEDAYHLLQTPLLKNLLLAGILLDT+NLDAS+QSSMTRDAEAV+LL VGSAPN RNG
Sbjct: 421 DNYCEDAYHLLQTPLLKNLLLAGILLDTKNLDASSQSSMTRDAEAVRLLLVGSAPNYRNG 480
Query: 481 LYDQLMRVQKERPFLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKK 540
LYDQLMRVQKERPFLDALQQSYGKPPN+ E ITERNQTSISP D+INQ+KK
Sbjct: 481 LYDQLMRVQKERPFLDALQQSYGKPPNN---------EHITERNQTSISPRDDSINQRKK 540
Query: 541 RNEFGTAKTCRVSPKSGTCNSIQL 558
N+ GTAKT R SP+S S+ +
Sbjct: 541 FNDLGTAKTSRASPQSAKPGSLPI 547
BLAST of Tan0015821 vs. ExPASy TrEMBL
Match:
A0A5D3BJR5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G003990 PE=4 SV=1)
HSP 1 Score: 802.4 bits (2071), Expect = 1.3e-228
Identity = 433/539 (80.33%), Positives = 469/539 (87.01%), Query Frame = 0
Query: 13 IAKLGHKIVQKRLANSRPKAQQQAPDLTDFMNDMFFGTVNKDKKTYNLTGDE--DDEDEE 72
+AKLG QKR+ NSRPK QQQAPDLTDFMNDMFFG VNKDKK YNLTG+E DD+D+E
Sbjct: 1 MAKLG----QKRVTNSRPKTQQQAPDLTDFMNDMFFGAVNKDKKAYNLTGNEENDDDDDE 60
Query: 73 ELFDRSNRSRNSQLTEEWLDEARRLVTSSPSRSNSPARLGGSPRFAAVNGISPASINDRR 132
E FDRSNRSRN QLTEEWLDEARRLV SSPSR NSPARL GSPRFAA NG SPAS DRR
Sbjct: 61 EWFDRSNRSRNEQLTEEWLDEARRLVASSPSRCNSPARLVGSPRFAAANGRSPASNIDRR 120
Query: 133 DPLSRSARRHRAVDNFSGEILSKVVRHSRNKSES---AAASEDEPINPTSAVQKWISNIL 192
DPLS RHRAVDNFSGEILSKVVRHSRNKSES ++A+E++ INP SAVQKWISNIL
Sbjct: 121 DPLS----RHRAVDNFSGEILSKVVRHSRNKSESFSTSSAAEEDLINPASAVQKWISNIL 180
Query: 193 K-PSNPPISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSPPKTLTDHPPRRTV 252
K PSNP IS P P P TPRKSRFHT LPPSRLP P D ALLSPPK LTD PPRRTV
Sbjct: 181 KPPSNPAISIPDP---PPSTPRKSRFHTHLPPSRLPNTPSD-ALLSPPKILTDPPPRRTV 240
Query: 253 SSPACSLQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQISNGELDVEVKIILSG 312
SSPA S+Q++R KSNLNGFSRDDSGDLEFGLNGFLKEQR+KI++ISNGELD EVKIILSG
Sbjct: 241 SSPAFSIQTVRSKSNLNGFSRDDSGDLEFGLNGFLKEQRNKIKKISNGELDAEVKIILSG 300
Query: 313 PSNSTSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMWNQRQVAWLFYHLG 372
P+NSTSSMVAA+CYAWLLENK+RQ N ++ +EC VVPVMNM R KMWNQRQVAWLFYHLG
Sbjct: 301 PTNSTSSMVAAICYAWLLENKLRQTNVETGQECVVVPVMNMQRGKMWNQRQVAWLFYHLG 360
Query: 373 LDASSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILTDNYCEDAYHLLQT 432
LDASSILFTDEVDLESLM+ G+TSILVVGQDVLKM+DGVGSQCTILTDNYCEDAYHLLQT
Sbjct: 361 LDASSILFTDEVDLESLMITGQTSILVVGQDVLKMNDGVGSQCTILTDNYCEDAYHLLQT 420
Query: 433 PLLKNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNGLYDQLMRVQKERP 492
PLLKNLLLAGILLDT+NLD S+QSSMTRDAEAVQLLSVGSAP +NGLYDQLMRVQKE
Sbjct: 421 PLLKNLLLAGILLDTKNLDTSSQSSMTRDAEAVQLLSVGSAPISKNGLYDQLMRVQKESS 480
Query: 493 FLDALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKKRNEFGTAKTCRV 546
FLDAL Q+YGKPP+DGSN+G GR E I ERNQ S PHG+AINQQKK ++ GTAKT ++
Sbjct: 481 FLDALIQNYGKPPSDGSNDGEGRSEHIKERNQPSSPPHGEAINQQKKSSDIGTAKTSKI 527
BLAST of Tan0015821 vs. ExPASy TrEMBL
Match:
A0A0A0LRB9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G046920 PE=4 SV=1)
HSP 1 Score: 596.7 bits (1537), Expect = 1.0e-166
Identity = 325/422 (77.01%), Positives = 355/422 (84.12%), Query Frame = 0
Query: 138 RRHRAVDNFSGEILSKVVRHSRNKSE----SAAASEDEPINPTSAVQKWISNILK-PSNP 197
+RHRAVDNFSGEILSKVVRHSRNKSE S+AA E+E NP SAVQKWISNILK P NP
Sbjct: 14 KRHRAVDNFSGEILSKVVRHSRNKSESYSTSSAAEEEELTNPASAVQKWISNILKPPPNP 73
Query: 198 PISAPLPISDPLPTPRKSRFHTDLPPSRLPIPPPDVALLSPPKTLTDHPPRRTVSSPACS 257
IS P P P TPRKSRFH LPPSRLP P D ALLSPPKTLTD PPRRTVSSPA S
Sbjct: 74 AISIPDP---PPSTPRKSRFHAHLPPSRLPNTPSD-ALLSPPKTLTDPPPRRTVSSPAFS 133
Query: 258 LQSIRPKSNLNGFSRDDSGDLEFGLNGFLKEQRSKIQQI--SNGELDVEV-KIILSGPSN 317
LQ++R KSNLNGFS++D GDLEFGLNGFLKEQR K++++ G L V + +L +
Sbjct: 134 LQTVRSKSNLNGFSQNDYGDLEFGLNGFLKEQRMKLKRVVLEMGILHCSVLRNLLIFENA 193
Query: 318 STSSMVAAVCYAWLLENKMRQQNADSRRECHVVPVMNMPRVKMWNQRQVAWLFYHLGLDA 377
TSSMVAA+CYAWLLENK+RQ N ++ REC VVPVMNM R KMWNQRQVAWLFYHLGLDA
Sbjct: 194 GTSSMVAAICYAWLLENKLRQTNVETGRECLVVPVMNMQRGKMWNQRQVAWLFYHLGLDA 253
Query: 378 SSILFTDEVDLESLMMAGRTSILVVGQDVLKMSDGVGSQCTILTDNYCEDAYHLLQTPLL 437
SSILFTDEVDLESLM+AG+TSI VVGQDVLKM+DGVGSQCTILTDNYCEDAYHLLQTPLL
Sbjct: 254 SSILFTDEVDLESLMIAGQTSISVVGQDVLKMNDGVGSQCTILTDNYCEDAYHLLQTPLL 313
Query: 438 KNLLLAGILLDTRNLDASAQSSMTRDAEAVQLLSVGSAPNCRNGLYDQLMRVQKERPFLD 497
KNLLLAGILLDT+NLDAS+QSSMTRDAEAVQLLSVGSAPN +NGLYDQLMRVQKE FLD
Sbjct: 314 KNLLLAGILLDTKNLDASSQSSMTRDAEAVQLLSVGSAPNSKNGLYDQLMRVQKESSFLD 373
Query: 498 ALQQSYGKPPNDGSNNGVGRVERITERNQTSISPHGDAINQQKKRNEFGTAKTCRVSPKS 552
AL Q+YGKPP+DGSNN VG I ERNQ S PHG+AINQQKK ++ GTAKT +VSPKS
Sbjct: 374 ALIQNYGKPPSDGSNNHVGNTNHIKERNQPSSPPHGNAINQQKKSSDIGTAKTSKVSPKS 431
The following BLAST results are available for this feature:
BLAST of Tan0015821 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023542616.1 | 2.9e-264 | 87.32 | uncharacterized protein LOC111802467 [Cucurbita pepo subsp. pepo]
KAG6573088.1 | 1.0e-261 | 86.33 | 50S ribosomal protein L19, chloroplastic, partial [Cucurbita argyrosperma subsp....
KAG7012275.1 | 1.8e-258 | 85.41 | hypothetical protein SDJN02_25027, partial [Cucurbita argyrosperma subsp. argyro...
XP_022954466.1 | 3.1e-258 | 85.43 | uncharacterized protein LOC111456732 [Cucurbita moschata]
XP_008446103.1 | 6.6e-240 | 80.46 | PREDICTED: uncharacterized protein LOC103488927 [Cucumis melo]
BLAST of Tan0015821 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GQZ5 | 1.5e-258 | 85.43 | uncharacterized protein LOC111456732 OS=Cucurbita moschata OX=3662 GN=LOC1114567...
A0A1S3BEY4 | 3.2e-240 | 80.46 | uncharacterized protein LOC103488927 OS=Cucumis melo OX=3656 GN=LOC103488927 PE=...
A0A6J1C6V5 | 5.8e-234 | 79.43 | uncharacterized protein LOC111008802 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A5D3BJR5 | 1.3e-228 | 80.33 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0LRB9 | 1.0e-166 | 77.01 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G046920 PE=4 SV=1
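The summary rows above are pipe-delimited, so they can be loaded into records for sorting or filtering. A minimal Python sketch follows, assuming the first four fields are match name, E-value, percent identity and description, and that header rows can be recognised because their E-value field is not numeric. The function name `parse_hits` is hypothetical.

```python
def parse_hits(text):
    """Parse pipe-delimited BLAST summary rows into a list of dicts.

    Header rows and any line without a numeric E-value are skipped.
    Trailing fields after the description are ignored.
    """
    hits = []
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 4:
            continue
        try:
            evalue = float(fields[1])
            identity = float(fields[2])
        except ValueError:
            continue  # header row such as "Match Name | E-value | ..."
        hits.append({
            "match": fields[0],
            "evalue": evalue,
            "identity": identity,
            "description": fields[3],
        })
    return hits


rows = """Match Name | E-value | Identity | Description
XP_023542616.1 | 2.9e-264 | 87.32 | uncharacterized protein LOC111802467 [Cucurbita pepo subsp. pepo]
XP_008446103.1 | 6.6e-240 | 80.46 | PREDICTED: uncharacterized protein LOC103488927 [Cucumis melo]"""
best = min(parse_hits(rows), key=lambda hit: hit["evalue"])
print(best["match"], best["identity"])
```

Taking the minimum E-value picks the most significant hit, which for these two rows is XP_023542616.1.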