Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS, three_prime_UTR
ATGAGACATGTTTAATAGTAAAGGATAATTTATAGGAGGTATGTGATTGTTTTTTTTTTACTCTTAAAAAGATTTATCTCTTAATTTTTCACTCGGAAAGAAATTAAGAATTATTTGCTTGTGCGTTACCAGGTGCTAATGGAGAGGCAGGGTAAAGATGACATCCACGACTGCACAATTAAGCTGGTTTGTATTTCCATTCTTAATTTCTTATTAATTTTTTGTAAACAAGTTGATGAAATGATTGTTAGAACTAAGGTGTTGGTGTCTGTTGACAGAGAGTAAATCCTCAAAAACGTAGAGACAAGGTGTACATTGGCTGTGGTGCTGGATTTGGAGGCGATAGGCCAACAGCAGCTCTTAAATTGCTTCAGAGGGTCAAAAACCTAAACTATCTCGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTATCAAGTTATGTTGTCTGGTGGTGATGGTTATGATTCAAGGAGTATGTCCATGTTTAATTATCATATTTATCCTACTATTTTTCTATCAATTATATTTTTATCATTTCTTTCTCATATTCATATTTCTTGAAAGTTCCTAAGATTGTTGAGATGATAGTTTTTTTTTGTCAAGAAGCTGGCAAATGGTTCCTTATATTTGTTAGTAGCTATTAGTAATTCTAAAAATCTTGAAGCTCCTTTTCATACATTGACATTTGTATACGGTTATTTTGGTAGAAACTTGTTCTCTCTCTTGTGCAACAAGATTGTTAGTTCATGCTGTACTCAGATTGCAATTTGACTCCTCATTTTGCCTGTGTGCAGTTCTATATTTGCTTAGAGTCCACACTGCCAATTGAGCAAAAATGAGTGGATGGTTAGAAGAGAACCACGAGTGGATGGTTAGAAGAGAACCACCAAGATATACTAAGGTTTTGTAGGAAATTTCTGCAAAATGCATAATATAAAAAGAATACAAATTGGGGCAAGTATTTTGAAATGATGCAAAGACCAATTACCTTGCTGTAATGGTATTTCATACACTGGTTTTAAGACGTCTTCTCTTCACATTCAATTGCTGGATCTGATGTTATGGAACCGCCAAGAAGTTTGTCGGTTTTTTATGGCTAGTTACTGACCGAGTTTTCTTGTTCGTTAAGTTAATTGCCCTGGAAGTTATCTTGTAGTTCTAACCTATTCTACTAACTATAATGGGTTGAGCAAAAGGGAAGCATTGAATTGAAAAAGAAAAAAAAAATTGAAATATACGAGAAATTTACCCAAGAAGGTCTTCTTCATGATCAGTTATCTCAGTATACATTTTCTTGAACTCTATTTCCATTAATACCTTTCATGTTAAGTCAGTTTTTGCAAGCTGTGCGGTGCTTTGCTCATGACTATTATTTGAATTACTCTATAGTTGCAGATTGGATGAAATTGCTTCTTCCTTTGGCTATGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAAGTAAGATATTAAAGTCCAAATAAGTTTCAAATTCTTATTCTGCTTAGTTATGAGAGTTGGTTTGATCTGGTGGTGGTGGTTAATTACTTATTTATCATCTCTTCTTTTCTTTCCTGGGCAGTGGACCCTCCTGGGGCCCAGCGAAATGTTATAGAAATAGCAGGCAGTCTGGGGTTGAATGTTTCAGTTGCAGTTGCTTATGAGGTTTCAGTAAAAGAACCAGGTAATTGTTCTTTATTATTTTTAGAGGTTTATATGGATTATCTGTCCAATAAGGATTGACTTCCAAGAATTGTTGTTGTGCCCTCCCAAATGGTGGTCAAGTGTACAAGTGAATTAGATATTCCGTATTGGTACTCATACCAAATTTTGAGGACAAGGGTCGTGTTAGAGACACTGTTGTTTATTGTAGACTCAGGTTTAGGATCCTATGAAGTGCATTTTTTTAGCTTTAATGTCAGTTGAAAATAATGATATTCCCCTCCTGCATATTAGGAGCAAAATTAAGTTCTTTTTCCCAGTTCATAT
GTGTAACATGACTTCACAAAATATAAATTACAACCAACGCTGGTAGTTTTGGTATATGGCAGCCACTCAAGACACTTTTCTTTTACTCTTTTTTGTTCATTATCGGTATCTTTTAAAGGTGAGCTCATCATTGGTTTGATCAATGAAAGGCATTAGCACGTATCTGGGAGCAGCTCCAATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTAGGAAGTGATTTCTACATGATGCTGGCACAATCTTTTAGAAGCGATTCATACTTATGTGTCTTTTTTCAATACAACATGTGTTACCTTTAGAAAAAAATGCTGTTTTTAAGGAGAATATCAATCACTTTAGAGAAGTTGCTTTGTGCCTCCTTTGGGGGAGGGAAAACTTTATGGATTTTAGGGATTTCAGCCTCTTTTTTGGGCCTCAACCAAGGAGTTACAGAAAATATATCCAGTTGATTGTGATAGAAGGAATGTTGTTAGAATTTGTTAGACAGCATCTTAAGATTATATGATTTTTATTTTATTTTCCTTTGGAAAAAAGAACATCTAGGATTATATGTAGTCGGGACAAAGATAAAGAATATTTGACAGATGGATTGCATTCACTGTTTTAGTGAAAATAGCAAACTTAATGGGGATCATATAGGTAGTCTGACTTTATACTTCATGCTATCATGAATTATGATTTATCATAGCAAAAATTTTAACTCCTTCGTTAGGTTGTTTCAAAGTTTCTCAGAGAACAACTCCATAACAATGTGCATACATTTAGGTCTATGAACTTGGTTGGAACTGGGATGATCTTCCACGGCTAGCACAGGGAATACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGTCTACTTCTCCTTCATACCCACCCAAATCCCAATTCTTTGTATGTGGAGCATGGAACTCATAGTTGTAGTTTGGCTTATGATTCAAACGATCTGCAGGAGACAAATATAGAAGCATGTCTTCCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAGTGTGATGGAAAAACCACTGTAGCCAAGGCAGAAGAGACTGGAGGTCTTTTGAATTTCAGTACATGTGCCGAACAACTTCTGTATGAGGTTGGTGATCCATCGGCTTATATCACCCCTGATATGGTAAGTTAAATGTATTATAGGACTACTTTGTGGATTGAATACTAAGGCAAAATACAAATATAACTGGTCGTGATTGCAGGTGGTTGACTTCAGCAATGTTTCATTTTGCTCTATATCCAGCTCTAGGGTTTTATGCTCCAGAGCAAAACCATCTATTCAAGGAGTGCCTGAGAAACTCTTGCAGTTGGCTCCAAAGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGTCATGAATATCTAAACATTTTAACAGATAGATAAATGAGTCATTTGCTTCCGAGTCTATGAGTGATAAATACCTGGAAGATCAAGAGATTAACCTTAATTTGAACGTCGGGGATTGGTTTAAAATAAACTATGGAATCAATAGGAAGACCTACAGGTTTTGTTTTGTTTTTTTCTTAAATAATATATGTGTGTGTGTGTGTGGAGTTACACACAAGTCTATTAGAGTAAAAAGTTCCCAACTAGGAAAGCTAACTTCTCAGGTAATTCCAAAATGTTGTTGATGACTAGCTGGAAAACATGCAGAATATATGCTTGTCCATTAATCTTTATTCATGAAAGAGCAGGCTCTTTGTTTCCTACGCATGACTAGAGCCAATATTGCCTATTCATCCTCGGGTTAACTAGAGTGCTTTTAGCATTGCGAATGAGATTACCCTGGAGAAGATCCTTCCTTTTATTTTCTATCTGTGCAAGCTAATCCTGGATGTGACATATCAGGACTGTGGATGGAAAGGATGGGGAGAGATATCCTATGGGGGACGTGAATGTGTTCTGCGTGCTAAAGCTGCAGAATATCTGGTAATTCAATAGAGAGCTTTCACGTTGTTTCACTATAATTTTCTATCTTTGATATTTTTTTCCTTTTATATAATATTAGTTCATATTGGCTTCTTGAAATGACTGTTGGTGCCATGTTTAAGTAAATTAAGATTCCTTAACGTGACCAGTTTTATCACACTTTTCTTGTTCTTCCTTTCATAGGTTCGGTCGTGGATGGAAGAACAGTTGATTGGTATTAATCAGCATATAGTGTCTTACATAATTGGACTCGACAGCCTTAAAGCATCCAGCAACAGTAGCAATAGTGTTGAAGATATTAGGTTGCGCATGGATGGACTCTTCAAGCAGAAGGAGCACGCTCTCCTGTTTGTTAGAGAATTTACAGCTTTATACACAAATGGGCCAGCTGGCGGCGGCGGCATCAGGTTGATGCACACTTGTAATTGCCCTCTCTAGCTGTGACAGTCTGAAATTTGTTTATGTTAGTTATTTCCTTCATGTTCATGATTGTTTTAATTAACTGATTTTTTCTTCTTTTTTTTTTTTTTTTCGTATTCGTGAGTGCTCCATACTAGATTATAGACAAGCCACCTATCATAGAAGTCAATTTATTTTTTTATTAACAAGAAACCAAGCTTTTATTGGTTAAGATGAAAGGATATACGAGGCAAACAAAAAAGAAGCCCAAAACAAACAACCCTCCACCCAATTACAAGAAGGGTCCCAATCCAAAAGAATGATGCTAAGCTGATAATTACAAAACACCCTATTAACAAGCGCCCATAAAGAAGCATTATATCTAACCAAATCCCAAATATCCTCCCCTGACCTCTCTACCTACGAAAATTTTTACCGTTCCTCTCAAGCCAAATGCCTTCACAACACTCCAAAAGTCTTAAAGAACCAATTTCACAAAGAGCAGACAAAATCACAATCCCAAAGCAAATGATCAAGATCCTCTTTTTGTCTACAAAGAACACACCATTGAGCTCGTAACATCATGGAGGAAAGCTTCTGGGTACGTTCTTGAGTATATACTTTCTCAAGCAAAATATGTGGCAAAAAAATTCATTCTCGTAGGAATTTTAACTTTCCAATAAGAGAAAAATATAAGGGTTCCCTATGGGGGAGAAGGGGCACAAAGAATCTGAAGAATATAATAGTAATAATAATAATAATAATATATTGGAAAAACCTTTAGAGGGGATTCGGAATTCAAGATTTAAAATATCTTCTCCCTACACAAGTAAAATGATTAGAAATAACAGAAAGGAGTTTGACCATCTCACCAGCCTCATAGTCTGTG
AGGGGACAACAAAAACCCAAGAAAAGAGAGCAAAACCTATCAGACGAAGTCAAATTAGAAGTCCAACCTCTTATCCGGCAGATGATAAAAAGACGAAGGAACAGAGCACACAAGGGTTTCCCTCTGACCCGATATCCTCCCAAAAAATAAGTATTCAACTTGTCTCCGATAGAGCATTTAACAAACTGAGATAACATAGGAATGCTTGAGACCCACTCAAAAAGGGTGAGGTCCATATTTCCTGACAATAATCCTCTGCCACAAAAAGTCAGGTTCCAAATGGCTATCATAGAAGATTATCAAAAATCTTCTGGTAGATTAGGATGATTACACAAACTGAGAGTAATATGCACAGGCAGCCATGTTTATTTACGTAGTAGTACAAACATCCAAACCATTTTTATTGACCAGACTTGGTTGAAATATATATACGGATCAAACTTGAAGTTGGACGGTAGCTTTTGTTTCAACCTCAGACTTGTTCTTTGATGTATATCCTCTGTTGATCTGTTACTTGATAGAAAGCATGTAACTTTCAGAAACTTTTGGGAATATTATTCTTGTGGTCCATCTTTGTGCTAATGCATTTTACGTGACGATGACATGATTTCTGTCCTATTTTGAATTGCAGCACTGGCTACAAGAAAGAAATTGTGCTTGAAAAACAACTGGTACTACCTCTTTCTTTCTCTCTTACTGAAGTTGATATTTGGATTAACAGTATTTCATTTTCTCCATCGAAATTCTATTGGTAACTTACTGATGTTCTTGAGTTATGATTCTTGAATATTGCCATAATCACCCGTTTTCATTTGTTTTCTCTTCTTCTTCATTAGTCCGCCAATGTCTTTTTGTTCCATAATGAGCTTTTCTCTTATCCTTCATTCTTCAACGTCTTTTCCACTTGATGGTTTTCAAATTAGGTTGGGCGTGAAAATATTTTCTGGCAAACAGGAGTGAAGTGTACTGTAGCAGTAAAATTAGACAGTCAACCAACAGATCTTCGAAAGGATCCGGCAGAGGAATGTTCTTCGCCCCGAGTAACGTTGCCATGTCCGATATCTGCGTATGCAGAGAAACCTTGTACAGGCTCCTTTCCACCAGAAACGGGTCATTCCCCTATTCCATCTGGCCAGGAGATTGCTCTTTACAATGTAGCCCATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTGTCATTCCTCATTATCCTTCTGATATTGAGCGATTGAGAATGATCATCACGCCCGAATGGGTGATGAGAGTTCTCTCGGTTCTGCATAATTCGACTCTGTTTCCTTCTTCGGATGCTGATAAGAAGAGAGACGATTGGGTAGATGAACATGTGAAGGTTGAAATATACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTAGATGGTGGCGTAAATTGCTCACGGAGAATCGATCGCCATGGAAAGACCTTATCGGATCTCATCTTGAACCAGCAAATTGTTTTGCCACCATAGTTTGGTGGTTCAAACAGGAAAGCTAAAATTTCTGAACCAGACGGCTCTTCTGGACAATAAAGTATCTTGCCGCTGTTGGATTGCTTTTTTTTTCTTCTTCTTTTGTGGAGTGGGAGAATCTAACCTCTAATTTCAAGTTCGATTGTAAAGGTTTCCCTATTAGTTGAACTAATGCTCATTTCGGCCATTACCATAAAACTTTGAGGTCAATTAAATGTCTTAATCATTCTTACATAATTCCATATATTCTTACATTATCAAGTATGGATTATCCTTAAATTCCATAAGCTTGGATTAATAATTCCATATATTCTTACATT
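The gene sequence above contains a long run of N characters, which in IUPAC nucleotide notation stand for unknown bases and often mark an unresolved stretch of the assembly. A minimal sketch for locating such runs follows. It assumes the wrapped sequence lines have already been joined into one string; the function name and the `min_len` threshold are illustrative and not part of this page.

```python
import re

def find_gap_runs(seq, min_len=10):
    """Return (start, end, length) for each run of N at least min_len long.

    Coordinates are 0-based and end-exclusive, relative to the given string.
    """
    pattern = re.compile(r"N{%d,}" % min_len, re.IGNORECASE)
    return [(m.start(), m.end(), m.end() - m.start()) for m in pattern.finditer(seq)]

# Toy input (not the real gene): one 12-base gap and one 3-base run below the threshold.
toy = "ACGT" + "N" * 12 + "GGGTCATG" + "NNN" + "AC"
print(find_gap_runs(toy))  # [(4, 16, 12)]
```

Runs shorter than `min_len` are ignored so that isolated ambiguous bases are not reported as gaps.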
mRNA sequence
ATGAGACATGTGCTAATGGAGAGGCAGGGTAAAGATGACATCCACGACTGCACAATTAAGCTGAGAGTAAATCCTCAAAAACGTAGAGACAAGGTGTACATTGGCTGTGGTGCTGGATTTGGAGGCGATAGGCCAACAGCAGCTCTTAAATTGCTTCAGAGGGTCAAAAACCTAAACTATCTCGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTATCAAGTTATGTTGTCTGGTGGTGATGGTTATGATTCAAGGAATTGGATGAAATTGCTTCTTCCTTTGGCTATGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGACCCTCCTGGGGCCCAGCGAAATGTTATAGAAATAGCAGGCAGTCTGGGGTTGAATGTTTCAGTTGCAGTTGCTTATGAGGTTTCAGTAAAAGAACCAGGCATTAGCACGTATCTGGGAGCAGCTCCAATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTAGGAAGTGATTTCTACATGATGCTGGCACAATCTTTTAGAAGCGATTCATACTTATGTGAGAATATCAATCACTTTAGAGAAGTTGCTTTGTGCCTCCTTTGGGGGAGGGAAAACTTTATGGATTTTAGGGATTTCAGCCTCTTTTTTGGGCCTCAACCAAGGAGTTACAGAAAATATATCCAGTTGATTGTGATAGAAGGAATGTTGTTAGAATTTGTCTATGAACTTGGTTGGAACTGGGATGATCTTCCACGGCTAGCACAGGGAATACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGTCTACTTCTCCTTCATACCCACCCAAATCCCAATTCTTTGTATGTGGAGCATGGAACTCATAGAGACAAATATAGAAGCATGTCTTCCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAGTGTGATGGAAAAACCACTGTAGCCAAGGCAGAAGAGACTGGAGGTCTTTTGAATTTCAGTACATGTGCCGAACAACTTCTGTATGAGGTTGGTGATCCATCGGCTTATATCACCCCTGATATGCTAATCCTGGATGTGACATATCAGGACTGTGGATGGAAAGGATGGGGAGAGATATCCTATGGGGGACGTGAATGTGTTCTGCGTGCTAAAGCTGCAGAATATCTGGTTCGGTCGTGGATGGAAGAACAGTTGATTGGTATTAATCAGCATATAGTGTCTTACATAATTGGACTCGACAGCCTTAAAGCATCCAGCAACAGTAGCAATAGTGTTGAAGATATTAGGTTGCGCATGGATGGACTCTTCAAGCAGAAGGAGCACGCTCTCCTGTTTGTTAGAGAATTTACAGCTTTATACACAAATGGGCCAGCTGGCGGCGGCGGCATCAGTTGGACGGTAGCTTTTGTTTCAACCTCAGACTTGTTCTTTGATGTATATCCTCTGTTGATCTGTTACTTGATAGAAAGCATCACTGGCTACAAGAAAGAAATTGTGCTTGAAAAACAACTGTCCGCCAATGTCTTTTTGTTCCATAATGAGCTTTTCTCTTATCCTTCATTCTTCAACGTTGGGCGTGAAAATATTTTCTGGCAAACAGGAGTGAAGTGTACTGTAGCAGTAAAATTAGACAGTCAACCAACAGATCTTCGAAAGGATCCGGCAGAGGAATGTTCTTCGCCCCGAGTAACGTTGCCATGTCCGATATCTGCGTATGCAGAGAAACCTTGTACAGGCTCCTTTCCACCAGAAACGGGTCATTCCCCTATTCCATCTGGCCAGGAGATTGCTCTTTACAATGTAGCCCATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTGTCATTCCTCATTATCCTTCTGATATTGAGCGATTGAGAATGATCATCACGCCCGAATGGGTGATGAGAGTTCTCTC
GGTTCTGCATAATTCGACTCTGTTTCCTTCTTCGGATGCTGATAAGAAGAGAGACGATTGGGTAGATGAACATGTGAAGGTTGAAATATACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTAGATGGTGGCGTAAATTGCTCACGGAGAATCGATCGCCATGGAAAGACCTTATCGGATCTCATCTTGAACCAGCAAATTGTTTTGCCACCATAGTTTGGTGGTTCAAACAGGAAAGCTAAAATTTCTGAACCAGACGGCTCTTCTGGACAATAAAGTATCTTGCCGCTGTTGGATTGCTTTTTTTTTCTTCTTCTTTTGTGGAGTGGGAGAATCTAACCTCTAATTTCAAGTTCGATTGTAAAGGTTTCCCTATTAGTTGAACTAATGCTCATTTCGGCCATTACCATAAAACTTTGAGGTCAATTAAATGTCTTAATCATTCTTACATAATTCCATATATTCTTACATTATCAAGTATGGATTATCCTTAAATTCCATAAGCTTGGATTAATAATTCCATATATTCTTACATT
Coding sequence (CDS)
ATGAGACATGTGCTAATGGAGAGGCAGGGTAAAGATGACATCCACGACTGCACAATTAAGCTGAGAGTAAATCCTCAAAAACGTAGAGACAAGGTGTACATTGGCTGTGGTGCTGGATTTGGAGGCGATAGGCCAACAGCAGCTCTTAAATTGCTTCAGAGGGTCAAAAACCTAAACTATCTCGTACTTGAATGCCTAGCAGAACGCACTCTTGCAGATCGCTATCAAGTTATGTTGTCTGGTGGTGATGGTTATGATTCAAGGAATTGGATGAAATTGCTTCTTCCTTTGGCTATGAAGAGAAATATTTGCATAATTACCAACATGGGTGCAATGGACCCTCCTGGGGCCCAGCGAAATGTTATAGAAATAGCAGGCAGTCTGGGGTTGAATGTTTCAGTTGCAGTTGCTTATGAGGTTTCAGTAAAAGAACCAGGCATTAGCACGTATCTGGGAGCAGCTCCAATTGTTGAGTGTCTGGAAAAGTACCATCCAAATGTCATAATTACTTCACGTGTTGCAGATGCTGCCCTATTCTTGGCTCCAATGGTAGGAAGTGATTTCTACATGATGCTGGCACAATCTTTTAGAAGCGATTCATACTTATGTGAGAATATCAATCACTTTAGAGAAGTTGCTTTGTGCCTCCTTTGGGGGAGGGAAAACTTTATGGATTTTAGGGATTTCAGCCTCTTTTTTGGGCCTCAACCAAGGAGTTACAGAAAATATATCCAGTTGATTGTGATAGAAGGAATGTTGTTAGAATTTGTCTATGAACTTGGTTGGAACTGGGATGATCTTCCACGGCTAGCACAGGGAATACTGGCTGGTCATCTTCTGGAATGTGGCTGTCAACTTACAGGGGGATACTTTATGCATCCAGGTCTACTTCTCCTTCATACCCACCCAAATCCCAATTCTTTGTATGTGGAGCATGGAACTCATAGAGACAAATATAGAAGCATGTCTTCCCAACAGCTTCTGAATATATCACTGCCTTATGCGGAAGTTGAGTGTGATGGAAAAACCACTGTAGCCAAGGCAGAAGAGACTGGAGGTCTTTTGAATTTCAGTACATGTGCCGAACAACTTCTGTATGAGGTTGGTGATCCATCGGCTTATATCACCCCTGATATGCTAATCCTGGATGTGACATATCAGGACTGTGGATGGAAAGGATGGGGAGAGATATCCTATGGGGGACGTGAATGTGTTCTGCGTGCTAAAGCTGCAGAATATCTGGTTCGGTCGTGGATGGAAGAACAGTTGATTGGTATTAATCAGCATATAGTGTCTTACATAATTGGACTCGACAGCCTTAAAGCATCCAGCAACAGTAGCAATAGTGTTGAAGATATTAGGTTGCGCATGGATGGACTCTTCAAGCAGAAGGAGCACGCTCTCCTGTTTGTTAGAGAATTTACAGCTTTATACACAAATGGGCCAGCTGGCGGCGGCGGCATCAGTTGGACGGTAGCTTTTGTTTCAACCTCAGACTTGTTCTTTGATGTATATCCTCTGTTGATCTGTTACTTGATAGAAAGCATCACTGGCTACAAGAAAGAAATTGTGCTTGAAAAACAACTGTCCGCCAATGTCTTTTTGTTCCATAATGAGCTTTTCTCTTATCCTTCATTCTTCAACGTTGGGCGTGAAAATATTTTCTGGCAAACAGGAGTGAAGTGTACTGTAGCAGTAAAATTAGACAGTCAACCAACAGATCTTCGAAAGGATCCGGCAGAGGAATGTTCTTCGCCCCGAGTAACGTTGCCATGTCCGATATCTGCGTATGCAGAGAAACCTTGTACAGGCTCCTTTCCACCAGAAACGGGTCATTCCCCTATTCCATCTGGCCAGGAGATTGCTCTTTACAATGTAGCCCATAGCAGAGCTGGAGACAAAGGGAATGACTTGAACTTCTCTGTCATTCCTCATTATCCTTCTGATATTGAGCGATTGAGAATGATCATCACGCCCGAATGGGTGATGAGAGTTCTCTC
GGTTCTGCATAATTCGACTCTGTTTCCTTCTTCGGATGCTGATAAGAAGAGAGACGATTGGGTAGATGAACATGTGAAGGTTGAAATATACGAAGTTAAAGGTATCCATTCTTTGAATGTTGTTGTTCGTAACATTCTAGATGGTGGCGTAAATTGCTCACGGAGAATCGATCGCCATGGAAAGACCTTATCGGATCTCATCTTGAACCAGCAAATTGTTTTGCCACCATAG
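On this page the mRNA and the CDS start with the same bases, and the mRNA continues past the CDS stop codon, so the three_prime_UTR named in the legend appears to be the part of the mRNA that follows the CDS. A minimal sketch of that split is shown here, under the assumption that the mRNA has no 5' UTR. The function name is hypothetical and the input is a toy example built from the last bases of the CDS and the first bases that follow it.

```python
def split_three_prime_utr(mrna, cds):
    """Split an mRNA into (cds, three_prime_utr), assuming the mRNA starts with the CDS."""
    mrna, cds = mrna.upper(), cds.upper()
    if not mrna.startswith(cds):
        raise ValueError("mRNA does not begin with the CDS; a 5' UTR may be present")
    return cds, mrna[len(cds):]

# Toy example: a short CDS ending in the TAG stop codon, followed by UTR bases.
toy_cds = "ATGCCACCATAG"
toy_mrna = toy_cds + "TTTGGTGGTTCAAACAGG"
_, utr = split_three_prime_utr(toy_mrna, toy_cds)
print(utr)  # TTTGGTGGTTCAAACAGG
```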
Protein sequence
MRHVLMERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERTLADRYQVMLSGGDGYDSRNWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVIEIAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPMVGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYRKYIQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHTHPNPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCAEQLLYEVGDPSAYITPDMLILDVTYQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSVEDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLICYLIESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDSQPTDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP
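The protein sequence should correspond to the CDS read codon by codon with the standard genetic code. The sketch below shows one way to check this. It assumes the standard code applies to this nuclear gene; the table is built from the conventional TCAG ordering, and the helper name is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAGACATGTGCTAATGGAGAGG"))  # first 8 codons of the CDS: MRHVLMER
print(translate("GTTTTGCCACCATAG"))           # last 5 codons of the CDS: VLPP
```

Both fragments agree with the start (MRHVLMER) and end (VLPP) of the protein sequence listed above.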
Homology
BLAST of Lsi05G020240 vs. ExPASy TrEMBL
Match:
A0A0A0L7H7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G122560 PE=4 SV=1)
HSP 1 Score: 1009.2 bits (2608), Expect = 8.8e-291
Identity = 535/773 (69.21%), Positives = 563/773 (72.83%), Query Frame = 0
Query: 4 VLMERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVL 63
VLME G+ DIHDCTIKLRVNPQK+RDKV IGCGAGFGGDRPTAALKLLQRVKNLNYLVL
Sbjct: 583 VLMEGHGQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVL 642
Query: 64 ECLAERTLADRYQVMLSGGDGYDSR--NWMKLLLPLAMKRNICIITNMGAMDPPGAQRNV 123
ECLAERTLAD YQVMLSGGDGYD R +WMKLLLPLAMKRNICIITNMGAMDPP AQ+NV
Sbjct: 643 ECLAERTLADHYQVMLSGGDGYDPRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQNV 702
Query: 124 IEIAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLA 183
IE+AGSLGLNVSVAVAYE SVKE GISTY+G APIVECLEKYHPNVIITSRVADAALFLA
Sbjct: 703 IEVAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLA 762
Query: 184 PMVGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYR 243
PM
Sbjct: 763 PM---------------------------------------------------------- 822
Query: 244 KYIQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHT 303
VYELGWNWDD P LAQGILAGHLLECGCQLTGGYFMHPG
Sbjct: 823 ---------------VYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPG------ 882
Query: 304 HPNPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCA 363
DKYRSMS QQLLNISLPYAEVECDGK TVAK EE+GGLLNFSTCA
Sbjct: 883 ---------------DKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCA 942
Query: 364 EQLLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCG 423
EQLLYE+G+PSAYITPD+++ L + +DCG
Sbjct: 943 EQLLYEIGNPSAYITPDLVVDFSNVSFCSISSSRVLCSGAKPSIQGVPEKLLQLAPKDCG 1002
Query: 424 WKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSV 483
WKGWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSY IGLDSLKASSN SN V
Sbjct: 1003 WKGWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINRHIVSYTIGLDSLKASSNGSNCV 1062
Query: 484 EDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLIC 543
EDIRLRMDGLF+QKEHALLFV+EFTALYTNGPAGGGGIS
Sbjct: 1063 EDIRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS--------------------- 1122
Query: 544 YLIESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDS 603
TGYKKEIVLEKQL VGRENIFWQT V CT AVKLDS
Sbjct: 1123 ------TGYKKEIVLEKQL-------------------VGRENIFWQTEVTCTEAVKLDS 1182
Query: 604 QPTDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSR 663
Q TDL+KDPAE CSSPRVTLPCPIS +A++ CTGS PPE GHSPIPSGQEIALYNVAHSR
Sbjct: 1183 QSTDLQKDPAEACSSPRVTLPCPISDHADELCTGSLPPEMGHSPIPSGQEIALYNVAHSR 1215
Query: 664 AGDKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDE 723
AGDKGNDLNFS+IPH PSDIERL+MIITPEWVMRVLSVLHNST F SS+AD+KR++WV E
Sbjct: 1243 AGDKGNDLNFSLIPHCPSDIERLKMIITPEWVMRVLSVLHNSTRFHSSNADEKRNEWVSE 1215
Query: 724 HVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
VKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKT+SDLILNQ IVLPP
Sbjct: 1303 DVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQLIVLPP 1215
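Each HSP summary line reports identical and positive positions as a count over the alignment length, followed by a percentage. The percentage is simply count divided by length, which the sketch below recomputes from the first hit's line. The regular expression and the dictionary keys are illustrative choices, not a format defined by BLAST.

```python
import re

# Matches "Identity = 535/773 (69.21%)" and the corresponding positives field;
# the optional 'i' tolerates a misspelled field label.
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    stats = {}
    for name, matched, length, reported in STAT_RE.findall(line):
        key = "identity" if name == "Identity" else "positives"
        stats[key] = {
            "matched": int(matched),
            "aligned_length": int(length),
            "reported_pct": float(reported),
            "computed_pct": round(100 * int(matched) / int(length), 2),
        }
    return stats

line = "Identity = 535/773 (69.21%), Positives = 563/773 (72.83%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity"]["computed_pct"], stats["positives"]["computed_pct"])  # 69.21 72.83
```

Note that the denominator (773) is the alignment length including gap columns, which is why it exceeds the query length of 744 residues.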
BLAST of Lsi05G020240 vs. ExPASy TrEMBL
Match:
A0A1S3AV50 (uncharacterized protein LOC103483286 OS=Cucumis melo OX=3656 GN=LOC103483286 PE=4 SV=1)
HSP 1 Score: 1003.8 bits (2594), Expect = 3.7e-289
Identity = 531/771 (68.87%), Positives = 562/771 (72.89%), Query Frame = 0
Query: 6 MERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC 65
MER + DIHDCTIKLRVNP+K+RDKV IGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC
Sbjct: 1 MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC 60
Query: 66 LAERTLADRYQVMLSGGDGYDSR--NWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVIE 125
LAERTLAD YQVMLSGGDGYDSR WMKLLLPL+MKRNICIITNMGAMDP AQ+ VIE
Sbjct: 61 LAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIE 120
Query: 126 IAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 185
+AGSLGLNVSVAVAYE SVKE GISTY+G APIVECLEKYHPNVIITSRVADAALFLAPM
Sbjct: 121 VAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPM 180
Query: 186 VGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYRKY 245
Sbjct: 181 ------------------------------------------------------------ 240
Query: 246 IQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHTHP 305
VYELGWNWDD P LAQGILAGHLLECGCQLTGGYFMHPG
Sbjct: 241 -------------VYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPG-------- 300
Query: 306 NPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCAEQ 365
DKYRSMS QQLLNISLPYAEVECDGK TVAK EE+GGLLNFSTCAEQ
Sbjct: 301 -------------DKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQ 360
Query: 366 LLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCGWK 425
LLYE+GDPSAYITPD+++ L + +DCGWK
Sbjct: 361 LLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLLQLAPKDCGWK 420
Query: 426 GWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSVED 485
GWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSY IGLDSLKASSNSSN +ED
Sbjct: 421 GWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIED 480
Query: 486 IRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLICYL 545
IRLRMDGLF+QKEHALLFV+EFTALYTNGPAGGGGIS
Sbjct: 481 IRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS----------------------- 540
Query: 546 IESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDSQP 605
TGYKKEIVLEKQL VGRENIFWQT VKC+ AVKLDSQ
Sbjct: 541 ----TGYKKEIVLEKQL-------------------VGRENIFWQTEVKCSEAVKLDSQS 600
Query: 606 TDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSRAG 665
TDL+KDPAE CSSPRVTLPCPIS++AEK CTGSFPPETGHSPIPSGQEIALY+VAHSRAG
Sbjct: 601 TDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIALYDVAHSRAG 631
Query: 666 DKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDEHV 725
DKGNDLNFS+IPHYPSDIERL+MIITPEWVMRVLS LHN T F SS+A +KR++WV+E V
Sbjct: 661 DKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDV 631
Query: 726 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
KVEIYEVK IHSLNVVVRNILDGGVNCSRRIDRHGKT+SDLILNQ IVLPP
Sbjct: 721 KVEIYEVKSIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQLIVLPP 631
BLAST of Lsi05G020240 vs. ExPASy TrEMBL
Match:
A0A6J1IE50 (uncharacterized protein LOC111474742 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111474742 PE=4 SV=1)
HSP 1 Score: 1000.3 bits (2585), Expect = 4.1e-288
Identity = 521/771 (67.57%), Positives = 564/771 (73.15%), Query Frame = 0
Query: 4 VLMERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVL 63
+LMERQG+DD+HDCTIKLRVNP+KRRDKVYIGCGAGFGGDRPTAALKLLQRVK+LNYLVL
Sbjct: 19 MLMERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVL 78
Query: 64 ECLAERTLADRYQVMLSGGDGYDSRNWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVIE 123
ECLAERTLADR+Q M SGGDGYDSRNWMKLLLPLA+KRNICIITNMGAMDPPGAQ+NVIE
Sbjct: 79 ECLAERTLADRHQAMSSGGDGYDSRNWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 138
Query: 124 IAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 183
IA SLGL+VSVAVAYEVSVKE GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+APM
Sbjct: 139 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMAPM 198
Query: 184 VGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYRKY 243
Sbjct: 199 ------------------------------------------------------------ 258
Query: 244 IQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHTHP 303
VYELGWNWDD PRL+QG LAGHLLECGCQLTGGYFMHPG
Sbjct: 259 -------------VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPG-------- 318
Query: 304 NPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCAEQ 363
DK+RSM QQLL+ISLPYAE++CDGK VAKAEETGGLLNFSTCAEQ
Sbjct: 319 -------------DKHRSMPFQQLLDISLPYAEIDCDGKVYVAKAEETGGLLNFSTCAEQ 378
Query: 364 LLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCGWK 423
LLYEVGDPSAYITPD+++ L + +DCGWK
Sbjct: 379 LLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVFCSGAKPSIQVVPEKLLQLAPKDCGWK 438
Query: 424 GWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSVED 483
GWGEISYGGRECVLRAKAAEYLVRSWMEE L G+NQHIVSYIIGLDSLKAS NSS SVED
Sbjct: 439 GWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGVNQHIVSYIIGLDSLKASINSS-SVED 498
Query: 484 IRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLICYL 543
IRLRMDGLF+ KEHALLFVREFTALYTNGPAGGGGIS
Sbjct: 499 IRLRMDGLFETKEHALLFVREFTALYTNGPAGGGGIS----------------------- 558
Query: 544 IESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDSQP 603
TGYKKEIVLEKQL VGRE++FW+ GVKCT AV+LDS+P
Sbjct: 559 ----TGYKKEIVLEKQL-------------------VGREHVFWRMGVKCTKAVELDSRP 618
Query: 604 TDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSRAG 663
TDLR+DPA+ +SPRVTLPC I AYA+ PC S PETGHSPIPSGQ++ALYNVAHSRAG
Sbjct: 619 TDLREDPAKARTSPRVTLPCSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAG 648
Query: 664 DKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDEHV 723
DKGND+NFSV+PHYPSDIERL+MIITPEWV RVLS L NS+ F DADKKRD+WV+EHV
Sbjct: 679 DKGNDMNFSVVPHYPSDIERLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHV 648
Query: 724 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKT+SDL+LNQQ+VLPP
Sbjct: 739 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 648
BLAST of Lsi05G020240 vs. ExPASy TrEMBL
Match:
A0A6J1IJ63 (uncharacterized protein LOC111474742 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111474742 PE=4 SV=1)
HSP 1 Score: 993.4 bits (2567), Expect = 5.0e-286
Identity = 520/773 (67.27%), Postives = 564/773 (72.96%), Query Frame = 0
Query: 4 VLMERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVL 63
+LMERQG+DD+HDCTIKLRVNP+KRRDKVYIGCGAGFGGDRPTAALKLLQRVK+LNYLVL
Sbjct: 1 MLMERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVL 60
Query: 64 ECLAERTLADRYQVMLSGGDGYDSR--NWMKLLLPLAMKRNICIITNMGAMDPPGAQRNV 123
ECLAERTLADR+Q M SGGDGYDSR +WMKLLLPLA+KRNICIITNMGAMDPPGAQ+NV
Sbjct: 61 ECLAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNV 120
Query: 124 IEIAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLA 183
IEIA SLGL+VSVAVAYEVSVKE GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+A
Sbjct: 121 IEIASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMA 180
Query: 184 PMVGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYR 243
PM
Sbjct: 181 PM---------------------------------------------------------- 240
Query: 244 KYIQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHT 303
VYELGWNWDD PRL+QG LAGHLLECGCQLTGGYFMHPG
Sbjct: 241 ---------------VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPG------ 300
Query: 304 HPNPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCA 363
DK+RSM QQLL+ISLPYAE++CDGK VAKAEETGGLLNFSTCA
Sbjct: 301 ---------------DKHRSMPFQQLLDISLPYAEIDCDGKVYVAKAEETGGLLNFSTCA 360
Query: 364 EQLLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCG 423
EQLLYEVGDPSAYITPD+++ L + +DCG
Sbjct: 361 EQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVFCSGAKPSIQVVPEKLLQLAPKDCG 420
Query: 424 WKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSV 483
WKGWGEISYGGRECVLRAKAAEYLVRSWMEE L G+NQHIVSYIIGLDSLKAS NSS SV
Sbjct: 421 WKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGVNQHIVSYIIGLDSLKASINSS-SV 480
Query: 484 EDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLIC 543
EDIRLRMDGLF+ KEHALLFVREFTALYTNGPAGGGGIS
Sbjct: 481 EDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGGGIS--------------------- 540
Query: 544 YLIESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDS 603
TGYKKEIVLEKQL VGRE++FW+ GVKCT AV+LDS
Sbjct: 541 ------TGYKKEIVLEKQL-------------------VGREHVFWRMGVKCTKAVELDS 600
Query: 604 QPTDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSR 663
+PTDLR+DPA+ +SPRVTLPC I AYA+ PC S PETGHSPIPSGQ++ALYNVAHSR
Sbjct: 601 RPTDLREDPAKARTSPRVTLPCSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSR 632
Query: 664 AGDKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDE 723
AGDKGND+NFSV+PHYPSDIERL+MIITPEWV RVLS L NS+ F DADKKRD+WV+E
Sbjct: 661 AGDKGNDMNFSVVPHYPSDIERLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNE 632
Query: 724 HVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
HVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKT+SDL+LNQQ+VLPP
Sbjct: 721 HVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 632
BLAST of Lsi05G020240 vs. ExPASy TrEMBL
Match:
A0A6J1IGN9 (uncharacterized protein LOC111474742 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111474742 PE=4 SV=1)
HSP 1 Score: 993.4 bits (2567), Expect = 5.0e-286
Identity = 520/773 (67.27%), Postives = 564/773 (72.96%), Query Frame = 0
Query: 4 VLMERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVL 63
+LMERQG+DD+HDCTIKLRVNP+KRRDKVYIGCGAGFGGDRPTAALKLLQRVK+LNYLVL
Sbjct: 19 MLMERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVL 78
Query: 64 ECLAERTLADRYQVMLSGGDGYDSR--NWMKLLLPLAMKRNICIITNMGAMDPPGAQRNV 123
ECLAERTLADR+Q M SGGDGYDSR +WMKLLLPLA+KRNICIITNMGAMDPPGAQ+NV
Sbjct: 79 ECLAERTLADRHQAMSSGGDGYDSRIADWMKLLLPLAVKRNICIITNMGAMDPPGAQQNV 138
Query: 124 IEIAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLA 183
IEIA SLGL+VSVAVAYEVSVKE GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+A
Sbjct: 139 IEIASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMA 198
Query: 184 PMVGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYR 243
PM
Sbjct: 199 PM---------------------------------------------------------- 258
Query: 244 KYIQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHT 303
VYELGWNWDD PRL+QG LAGHLLECGCQLTGGYFMHPG
Sbjct: 259 ---------------VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPG------ 318
Query: 304 HPNPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCA 363
DK+RSM QQLL+ISLPYAE++CDGK VAKAEETGGLLNFSTCA
Sbjct: 319 ---------------DKHRSMPFQQLLDISLPYAEIDCDGKVYVAKAEETGGLLNFSTCA 378
Query: 364 EQLLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCG 423
EQLLYEVGDPSAYITPD+++ L + +DCG
Sbjct: 379 EQLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVFCSGAKPSIQVVPEKLLQLAPKDCG 438
Query: 424 WKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSV 483
WKGWGEISYGGRECVLRAKAAEYLVRSWMEE L G+NQHIVSYIIGLDSLKAS NSS SV
Sbjct: 439 WKGWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGVNQHIVSYIIGLDSLKASINSS-SV 498
Query: 484 EDIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLIC 543
EDIRLRMDGLF+ KEHALLFVREFTALYTNGPAGGGGIS
Sbjct: 499 EDIRLRMDGLFETKEHALLFVREFTALYTNGPAGGGGIS--------------------- 558
Query: 544 YLIESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDS 603
TGYKKEIVLEKQL VGRE++FW+ GVKCT AV+LDS
Sbjct: 559 ------TGYKKEIVLEKQL-------------------VGREHVFWRMGVKCTKAVELDS 618
Query: 604 QPTDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSR 663
+PTDLR+DPA+ +SPRVTLPC I AYA+ PC S PETGHSPIPSGQ++ALYNVAHSR
Sbjct: 619 RPTDLREDPAKARTSPRVTLPCSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSR 650
Query: 664 AGDKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDE 723
AGDKGND+NFSV+PHYPSDIERL+MIITPEWV RVLS L NS+ F DADKKRD+WV+E
Sbjct: 679 AGDKGNDMNFSVVPHYPSDIERLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNE 650
Query: 724 HVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
HVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKT+SDL+LNQQ+VLPP
Sbjct: 739 HVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 650
BLAST of Lsi05G020240 vs. NCBI nr
Match:
XP_038900159.1 (uncharacterized protein LOC120087281 isoform X1 [Benincasa hispida] >XP_038900167.1 uncharacterized protein LOC120087281 isoform X1 [Benincasa hispida])
HSP 1 Score: 1038.1 bits (2683), Expect = 3.7e-299
Identity = 546/771 (70.82%), Positives = 570/771 (73.93%), Query Frame = 0
Query: 6 MERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC 65
MERQGKDDIHDCTIKLRVNPQK+RDKVYIGCGAGFGGDRPTAALKLLQRVK+LNYL+LEC
Sbjct: 1 MERQGKDDIHDCTIKLRVNPQKQRDKVYIGCGAGFGGDRPTAALKLLQRVKSLNYLILEC 60
Query: 66 LAERTLADRYQVMLSGGDGYDSR--NWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVIE 125
LAERTLADRYQVMLSGGDGYDSR +WMKLLLPLAMKRNICIITNMGAMDPP AQRNVIE
Sbjct: 61 LAERTLADRYQVMLSGGDGYDSRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQRNVIE 120
Query: 126 IAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 185
IAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSR+ADAALFLAPM
Sbjct: 121 IAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRIADAALFLAPM 180
Query: 186 VGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYRKY 245
Sbjct: 181 ------------------------------------------------------------ 240
Query: 246 IQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHTHP 305
VYELGWNWDD PRLAQGILAGHLLECGCQLTGGYFMHPG
Sbjct: 241 -------------VYELGWNWDDFPRLAQGILAGHLLECGCQLTGGYFMHPG-------- 300
Query: 306 NPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCAEQ 365
DKYRSMS QQLLNISLPYAE+ECDG+ TVAKAEETGGLLNFSTCAEQ
Sbjct: 301 -------------DKYRSMSLQQLLNISLPYAEIECDGRITVAKAEETGGLLNFSTCAEQ 360
Query: 366 LLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCGWK 425
LLYEVGDPSAYITPDM++ L + +DCGWK
Sbjct: 361 LLYEVGDPSAYITPDMVVDFSNVSFCSISSSRVFCSGAKPSIQGTPEKLLQLAPKDCGWK 420
Query: 426 GWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSVED 485
GWGEISYGGRECVLRAKAA+YLVRSW+EE LIG+NQ IVSY IGLDSLKAS+NSS SVED
Sbjct: 421 GWGEISYGGRECVLRAKAADYLVRSWIEELLIGVNQDIVSYTIGLDSLKASNNSSTSVED 480
Query: 486 IRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLICYL 545
IRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGIS
Sbjct: 481 IRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGIS----------------------- 540
Query: 546 IESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDSQP 605
TGYKKEIVLEKQL VGRENIFWQTGVKCT AVKLDSQP
Sbjct: 541 ----TGYKKEIVLEKQL-------------------VGRENIFWQTGVKCTEAVKLDSQP 600
Query: 606 TDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSRAG 665
D R+DPAE SSP+V LPCPISA AEKP G FPPE GHSP+PS QEIALYNVAHSRAG
Sbjct: 601 IDARQDPAEARSSPQVMLPCPISASAEKPFMGFFPPEPGHSPVPSAQEIALYNVAHSRAG 631
Query: 666 DKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDEHV 725
DKGNDLNFSVIPHYPSDIERL+M+ITPEWV RVLSVLHNST FPSSDA+KKRD+WVDE V
Sbjct: 661 DKGNDLNFSVIPHYPSDIERLKMVITPEWVTRVLSVLHNSTQFPSSDANKKRDEWVDEDV 631
Query: 726 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
KVEIYEV+GIHSLNVVVRNILDGGVNCSRRIDRHGK +SDLILNQ IVLPP
Sbjct: 721 KVEIYEVEGIHSLNVVVRNILDGGVNCSRRIDRHGKAISDLILNQHIVLPP 631
BLAST of Lsi05G020240 vs. NCBI nr
Match:
XP_023539935.1 (uncharacterized protein LOC111800461 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1007.3 bits (2603), Expect = 6.9e-290
Identity = 524/772 (67.88%), Positives = 568/772 (73.58%), Query Frame = 0
Query: 4 VLMERQGK-DDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLV 63
+LMERQG+ DD+HDCTIKLR+NP+KRRDKVYIGCGAGFGGDRPTAALKLLQRVK+LNYLV
Sbjct: 1 MLMERQGEDDDVHDCTIKLRLNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLV 60
Query: 64 LECLAERTLADRYQVMLSGGDGYDSRNWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVI 123
LECLAERTLADR+Q M SGGDGYDSRNWMKLLLPLA++RNICIITNMGAMDPPGAQ+NVI
Sbjct: 61 LECLAERTLADRHQAMSSGGDGYDSRNWMKLLLPLAVERNICIITNMGAMDPPGAQQNVI 120
Query: 124 EIAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAP 183
EIA SLGL+VSVAVAYEVSVKE GISTYLGAAPIVECLEKYHPNVIITSRVADAALF+AP
Sbjct: 121 EIASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVECLEKYHPNVIITSRVADAALFMAP 180
Query: 184 MVGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYRK 243
M
Sbjct: 181 M----------------------------------------------------------- 240
Query: 244 YIQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHTH 303
VYELGWNWDD PRL+QG LAGHLLECGCQLTGGYFMHPG
Sbjct: 241 --------------VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPG------- 300
Query: 304 PNPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCAE 363
DK+RSM QQLL+ISLPYAE++CDGK VAKAEETGGLLNFSTCAE
Sbjct: 301 --------------DKHRSMHFQQLLDISLPYAEIDCDGKVYVAKAEETGGLLNFSTCAE 360
Query: 364 QLLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCGW 423
QLLYEVGDPSAYITPD+++ L + +DCGW
Sbjct: 361 QLLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVFCSGAKPSIQVVPEKLLQLAPKDCGW 420
Query: 424 KGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSVE 483
KGWGEISYGGRECVLRAKAAEYLVRSWMEE L G+NQHIVSYIIGLDSLKAS NSS SVE
Sbjct: 421 KGWGEISYGGRECVLRAKAAEYLVRSWMEEVLNGVNQHIVSYIIGLDSLKASINSS-SVE 480
Query: 484 DIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLICY 543
DIRLRMDGLF+ KEH LLFVREFTALYTNGPAGGGGIS
Sbjct: 481 DIRLRMDGLFETKEHTLLFVREFTALYTNGPAGGGGIS---------------------- 540
Query: 544 LIESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDSQ 603
TGYKKEI+LEKQL VGRE++FW+TGVKCT AV+LDS+
Sbjct: 541 -----TGYKKEILLEKQL-------------------VGREHVFWRTGVKCTKAVELDSR 600
Query: 604 PTDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSRA 663
PTDLR+DPA+ +SPRVTLPCPI AYA+ PC GS PPETGHSPIPSGQ++ALYNVAHSRA
Sbjct: 601 PTDLREDPAKARTSPRVTLPCPIFAYADNPCAGSSPPETGHSPIPSGQKVALYNVAHSRA 631
Query: 664 GDKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDEH 723
GDKGND+NFSVIPHYPSDIERL+MIITPEWV RVLS L NS+ FP DADKKRD+W+DEH
Sbjct: 661 GDKGNDMNFSVIPHYPSDIERLKMIITPEWVKRVLSSLQNSSTFPDLDADKKRDEWIDEH 631
Query: 724 VKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
VKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKT+SDL+LNQQ+VLPP
Sbjct: 721 VKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 631
BLAST of Lsi05G020240 vs. NCBI nr
Match:
XP_004134329.1 (uncharacterized protein LOC101212841 isoform X2 [Cucumis sativus] >XP_011650759.1 uncharacterized protein LOC101212841 isoform X2 [Cucumis sativus])
HSP 1 Score: 1006.1 bits (2600), Expect = 1.5e-289
Identity = 533/771 (69.13%), Positives = 561/771 (72.76%), Query Frame = 0
Query: 6 MERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC 65
ME G+ DIHDCTIKLRVNPQK+RDKV IGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC
Sbjct: 1 MEGHGQADIHDCTIKLRVNPQKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC 60
Query: 66 LAERTLADRYQVMLSGGDGYDSR--NWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVIE 125
LAERTLAD YQVMLSGGDGYD R +WMKLLLPLAMKRNICIITNMGAMDPP AQ+NVIE
Sbjct: 61 LAERTLADHYQVMLSGGDGYDPRIADWMKLLLPLAMKRNICIITNMGAMDPPAAQQNVIE 120
Query: 126 IAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 185
+AGSLGLNVSVAVAYE SVKE GISTY+G APIVECLEKYHPNVIITSRVADAALFLAPM
Sbjct: 121 VAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPM 180
Query: 186 VGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYRKY 245
Sbjct: 181 ------------------------------------------------------------ 240
Query: 246 IQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHTHP 305
VYELGWNWDD P LAQGILAGHLLECGCQLTGGYFMHPG
Sbjct: 241 -------------VYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPG-------- 300
Query: 306 NPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCAEQ 365
DKYRSMS QQLLNISLPYAEVECDGK TVAK EE+GGLLNFSTCAEQ
Sbjct: 301 -------------DKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQ 360
Query: 366 LLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCGWK 425
LLYE+G+PSAYITPD+++ L + +DCGWK
Sbjct: 361 LLYEIGNPSAYITPDLVVDFSNVSFCSISSSRVLCSGAKPSIQGVPEKLLQLAPKDCGWK 420
Query: 426 GWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSVED 485
GWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSY IGLDSLKASSN SN VED
Sbjct: 421 GWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINRHIVSYTIGLDSLKASSNGSNCVED 480
Query: 486 IRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLICYL 545
IRLRMDGLF+QKEHALLFV+EFTALYTNGPAGGGGIS
Sbjct: 481 IRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS----------------------- 540
Query: 546 IESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDSQP 605
TGYKKEIVLEKQL VGRENIFWQT V CT AVKLDSQ
Sbjct: 541 ----TGYKKEIVLEKQL-------------------VGRENIFWQTEVTCTEAVKLDSQS 600
Query: 606 TDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSRAG 665
TDL+KDPAE CSSPRVTLPCPIS +A++ CTGS PPE GHSPIPSGQEIALYNVAHSRAG
Sbjct: 601 TDLQKDPAEACSSPRVTLPCPISDHADELCTGSLPPEMGHSPIPSGQEIALYNVAHSRAG 631
Query: 666 DKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDEHV 725
DKGNDLNFS+IPH PSDIERL+MIITPEWVMRVLSVLHNST F SS+AD+KR++WV E V
Sbjct: 661 DKGNDLNFSLIPHCPSDIERLKMIITPEWVMRVLSVLHNSTRFHSSNADEKRNEWVSEDV 631
Query: 726 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKT+SDLILNQ IVLPP
Sbjct: 721 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQLIVLPP 631
BLAST of Lsi05G020240 vs. NCBI nr
Match:
XP_008438065.1 (PREDICTED: uncharacterized protein LOC103483286 [Cucumis melo] >XP_008438066.1 PREDICTED: uncharacterized protein LOC103483286 [Cucumis melo])
HSP 1 Score: 1003.8 bits (2594), Expect = 7.7e-289
Identity = 531/771 (68.87%), Positives = 562/771 (72.89%), Query Frame = 0
Query: 6 MERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC 65
MER + DIHDCTIKLRVNP+K+RDKV IGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC
Sbjct: 1 MERHSQADIHDCTIKLRVNPKKQRDKVCIGCGAGFGGDRPTAALKLLQRVKNLNYLVLEC 60
Query: 66 LAERTLADRYQVMLSGGDGYDSR--NWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVIE 125
LAERTLAD YQVMLSGGDGYDSR WMKLLLPL+MKRNICIITNMGAMDP AQ+ VIE
Sbjct: 61 LAERTLADHYQVMLSGGDGYDSRIAEWMKLLLPLSMKRNICIITNMGAMDPLAAQQKVIE 120
Query: 126 IAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 185
+AGSLGLNVSVAVAYE SVKE GISTY+G APIVECLEKYHPNVIITSRVADAALFLAPM
Sbjct: 121 VAGSLGLNVSVAVAYEGSVKESGISTYMGGAPIVECLEKYHPNVIITSRVADAALFLAPM 180
Query: 186 VGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYRKY 245
Sbjct: 181 ------------------------------------------------------------ 240
Query: 246 IQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHTHP 305
VYELGWNWDD P LAQGILAGHLLECGCQLTGGYFMHPG
Sbjct: 241 -------------VYELGWNWDDFPLLAQGILAGHLLECGCQLTGGYFMHPG-------- 300
Query: 306 NPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCAEQ 365
DKYRSMS QQLLNISLPYAEVECDGK TVAK EE+GGLLNFSTCAEQ
Sbjct: 301 -------------DKYRSMSFQQLLNISLPYAEVECDGKLTVAKPEESGGLLNFSTCAEQ 360
Query: 366 LLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCGWK 425
LLYE+GDPSAYITPD+++ L + +DCGWK
Sbjct: 361 LLYEIGDPSAYITPDLVVDFSNVSFCSISSSRVVCSGAKPSIQGVPEKLLQLAPKDCGWK 420
Query: 426 GWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSVED 485
GWGEISYGGRECVLRAKAAEYLVRSWMEE LIGIN+HIVSY IGLDSLKASSNSSN +ED
Sbjct: 421 GWGEISYGGRECVLRAKAAEYLVRSWMEELLIGINEHIVSYTIGLDSLKASSNSSNCIED 480
Query: 486 IRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLICYL 545
IRLRMDGLF+QKEHALLFV+EFTALYTNGPAGGGGIS
Sbjct: 481 IRLRMDGLFEQKEHALLFVKEFTALYTNGPAGGGGIS----------------------- 540
Query: 546 IESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDSQP 605
TGYKKEIVLEKQL VGRENIFWQT VKC+ AVKLDSQ
Sbjct: 541 ----TGYKKEIVLEKQL-------------------VGRENIFWQTEVKCSEAVKLDSQS 600
Query: 606 TDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSRAG 665
TDL+KDPAE CSSPRVTLPCPIS++AEK CTGSFPPETGHSPIPSGQEIALY+VAHSRAG
Sbjct: 601 TDLQKDPAEACSSPRVTLPCPISSHAEKLCTGSFPPETGHSPIPSGQEIALYDVAHSRAG 631
Query: 666 DKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDEHV 725
DKGNDLNFS+IPHYPSDIERL+MIITPEWVMRVLS LHN T F SS+A +KR++WV+E V
Sbjct: 661 DKGNDLNFSLIPHYPSDIERLKMIITPEWVMRVLSGLHNLTRFHSSNAGEKRNEWVNEDV 631
Query: 726 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
KVEIYEVK IHSLNVVVRNILDGGVNCSRRIDRHGKT+SDLILNQ IVLPP
Sbjct: 721 KVEIYEVKSIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLILNQLIVLPP 631
BLAST of Lsi05G020240 vs. NCBI nr
Match:
XP_022975426.1 (uncharacterized protein LOC111474742 isoform X2 [Cucurbita maxima])
HSP 1 Score: 1000.3 bits (2585), Expect = 8.5e-288
Identity = 521/771 (67.57%), Positives = 564/771 (73.15%), Query Frame = 0
Query: 4 VLMERQGKDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVL 63
+LMERQG+DD+HDCTIKLRVNP+KRRDKVYIGCGAGFGGDRPTAALKLLQRVK+LNYLVL
Sbjct: 19 MLMERQGEDDVHDCTIKLRVNPKKRRDKVYIGCGAGFGGDRPTAALKLLQRVKDLNYLVL 78
Query: 64 ECLAERTLADRYQVMLSGGDGYDSRNWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVIE 123
ECLAERTLADR+Q M SGGDGYDSRNWMKLLLPLA+KRNICIITNMGAMDPPGAQ+NVIE
Sbjct: 79 ECLAERTLADRHQAMSSGGDGYDSRNWMKLLLPLAVKRNICIITNMGAMDPPGAQQNVIE 138
Query: 124 IAGSLGLNVSVAVAYEVSVKEPGISTYLGAAPIVECLEKYHPNVIITSRVADAALFLAPM 183
IA SLGL+VSVAVAYEVSVKE GISTYLGAAPIV+CLEKYHPNVIITSRVADAALF+APM
Sbjct: 139 IASSLGLSVSVAVAYEVSVKESGISTYLGAAPIVKCLEKYHPNVIITSRVADAALFMAPM 198
Query: 184 VGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGPQPRSYRKY 243
Sbjct: 199 ------------------------------------------------------------ 258
Query: 244 IQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPGLLLLHTHP 303
VYELGWNWDD PRL+QG LAGHLLECGCQLTGGYFMHPG
Sbjct: 259 -------------VYELGWNWDDFPRLSQGTLAGHLLECGCQLTGGYFMHPG-------- 318
Query: 304 NPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLLNFSTCAEQ 363
DK+RSM QQLL+ISLPYAE++CDGK VAKAEETGGLLNFSTCAEQ
Sbjct: 319 -------------DKHRSMPFQQLLDISLPYAEIDCDGKVYVAKAEETGGLLNFSTCAEQ 378
Query: 364 LLYEVGDPSAYITPDMLI-------------------------------LDVTYQDCGWK 423
LLYEVGDPSAYITPD+++ L + +DCGWK
Sbjct: 379 LLYEVGDPSAYITPDLVVDLSNVSFCSISSSKVFCSGAKPSIQVVPEKLLQLAPKDCGWK 438
Query: 424 GWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKASSNSSNSVED 483
GWGEISYGGRECVLRAKAAEYLVRSWMEE L G+NQHIVSYIIGLDSLKAS NSS SVED
Sbjct: 439 GWGEISYGGRECVLRAKAAEYLVRSWMEEVLYGVNQHIVSYIIGLDSLKASINSS-SVED 498
Query: 484 IRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDLFFDVYPLLICYL 543
IRLRMDGLF+ KEHALLFVREFTALYTNGPAGGGGIS
Sbjct: 499 IRLRMDGLFETKEHALLFVREFTALYTNGPAGGGGIS----------------------- 558
Query: 544 IESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGVKCTVAVKLDSQP 603
TGYKKEIVLEKQL VGRE++FW+ GVKCT AV+LDS+P
Sbjct: 559 ----TGYKKEIVLEKQL-------------------VGREHVFWRMGVKCTKAVELDSRP 618
Query: 604 TDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQEIALYNVAHSRAG 663
TDLR+DPA+ +SPRVTLPC I AYA+ PC S PETGHSPIPSGQ++ALYNVAHSRAG
Sbjct: 619 TDLREDPAKARTSPRVTLPCSIFAYADNPCASSSTPETGHSPIPSGQKVALYNVAHSRAG 648
Query: 664 DKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDADKKRDDWVDEHV 723
DKGND+NFSV+PHYPSDIERL+MIITPEWV RVLS L NS+ F DADKKRD+WV+EHV
Sbjct: 679 DKGNDMNFSVVPHYPSDIERLKMIITPEWVKRVLSSLQNSSTFHDLDADKKRDEWVNEHV 648
Query: 724 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIVLPP 744
KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKT+SDL+LNQQ+VLPP
Sbjct: 739 KVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTISDLVLNQQVVLPP 648
BLAST of Lsi05G020240 vs. TAIR 10
Match:
AT1G01770.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1446 (InterPro:IPR010839); Has 1597 Blast hits to 1509 proteins in 306 species: Archae - 4; Bacteria - 843; Metazoa - 22; Fungi - 131; Plants - 31; Viruses - 0; Other Eukaryotes - 566 (source: NCBI BLink). )
HSP 1 Score: 687.6 bits (1773), Expect = 1.1e-197
Identity = 392/781 (50.19%), Positives = 467/781 (59.80%), Query Frame = 0
Query: 11 KDDIHDCTIKLRVNPQKRRDKVYIGCGAGFGGDRPTAALKLLQRVKNLNYLVLECLAERT 70
K+ + DC I LR NP++RR+ VY+GCGAGFGGDRP AALKLLQRV+ LNYLVLECLAERT
Sbjct: 7 KEILCDCVINLRENPKRRRETVYVGCGAGFGGDRPLAALKLLQRVEELNYLVLECLAERT 66
Query: 71 LADRYQVMLSGGDGYDSR--NWMKLLLPLAMKRNICIITNMGAMDPPGAQRNVIEIAGSL 130
LADR+ M SGG GYD R WM+LLLPLA++R CIITNMGA+DP GAQ+ V+E+AG L
Sbjct: 67 LADRWLSMASGGLGYDPRVSEWMQLLLPLAVERGTCIITNMGAIDPSGAQKKVLEVAGEL 126
Query: 131 GLNVSVAVAYEVSVK-------------EPGISTYLGAAPIVECLEKYHPNVIITSRVAD 190
GL +SVAVA+EV + G STYLGAAPIVECLEKY PNVIITSRVAD
Sbjct: 127 GLTISVAVAHEVHFETGSGSSFGGQYCSAGGTSTYLGAAPIVECLEKYQPNVIITSRVAD 186
Query: 191 AALFLAPMVGSDFYMMLAQSFRSDSYLCENINHFREVALCLLWGRENFMDFRDFSLFFGP 250
AALFLAPM
Sbjct: 187 AALFLAPM---------------------------------------------------- 246
Query: 251 QPRSYRKYIQLIVIEGMLLEFVYELGWNWDDLPRLAQGILAGHLLECGCQLTGGYFMHPG 310
VYELGWNW+DL LAQG LAGHLLECGCQLTGGYFMHPG
Sbjct: 247 ---------------------VYELGWNWNDLELLAQGTLAGHLLECGCQLTGGYFMHPG 306
Query: 311 LLLLHTHPNPNSLYVEHGTHRDKYRSMSSQQLLNISLPYAEVECDGKTTVAKAEETGGLL 370
D+YR M+ L ++SLPYAE+ DGK V+K E +GG+L
Sbjct: 307 ---------------------DQYRDMAFPLLQDLSLPYAEIGYDGKVCVSKVEGSGGIL 366
Query: 371 NFSTCAEQLLYEVGDPSAYITPDMLI--------------------------------LD 430
N STCAEQLLYE+ DPSAYITPD++I L
Sbjct: 367 NTSTCAEQLLYEIADPSAYITPDVVIDIRGVSFLPLSDCKVQCSGAKPSSNTSVPEKLLR 426
Query: 431 VTYQDCGWKGWGEISYGGRECVLRAKAAEYLVRSWMEEQLIGINQHIVSYIIGLDSLKAS 490
+ ++CGWKGWGEISYGG + RAKA+E+LVRSWMEE + G+N I+SY+IG+DSLKA+
Sbjct: 427 LIPKECGWKGWGEISYGGNGSIQRAKASEFLVRSWMEETIPGVNHCILSYVIGVDSLKAT 486
Query: 491 SNSSNSVE---DIRLRMDGLFKQKEHALLFVREFTALYTNGPAGGGGISWTVAFVSTSDL 550
SN + S + DIRLRMDGLFK KEHA+ +EFTALYTNGPAGGGGIS
Sbjct: 487 SNGTESWQSCGDIRLRMDGLFKLKEHAVQLTKEFTALYTNGPAGGGGIS----------- 546
Query: 551 FFDVYPLLICYLIESITGYKKEIVLEKQLSANVFLFHNELFSYPSFFNVGRENIFWQTGV 610
TG+K EIVLEK+L V RE++ W+TG+
Sbjct: 547 ----------------TGHKMEIVLEKRL-------------------VSRESVMWKTGL 606
Query: 611 KCTVAVKLDSQPTDLRKDPAEECSSPRVTLPCPISAYAEKPCTGSFPPETGHSPIPSGQE 670
Q T+ + E SP P G + HSP PSGQ+
Sbjct: 607 ----------QHTNTSEPETSEHHSPEKMPKLPKENPKNLTMRG-YQSGFHHSPAPSGQK 631
Query: 671 IALYNVAHSRAGDKGNDLNFSVIPHYPSDIERLRMIITPEWVMRVLSVLHNSTLFPSSDA 730
I LY+VAHSRAGDKGND+NFS+IPHY D+ERL++IITP+WV V+SVL +++ F DA
Sbjct: 667 IPLYSVAHSRAGDKGNDINFSIIPHYSPDVERLKLIITPQWVKHVMSVLLSTSSFLELDA 631
Query: 731 DKKRDDWVDEHVKVEIYEVKGIHSLNVVVRNILDGGVNCSRRIDRHGKTLSDLILNQQIV 742
+DE+V VEIY+V+GIH++NVVVRNILDGGVNCSRRIDRHGKT+SDLIL QQ+V
Sbjct: 727 KP-----MDENVSVEIYDVEGIHAMNVVVRNILDGGVNCSRRIDRHGKTISDLILCQQVV 631
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A0A0L7H7 | 8.8e-291 | 69.21 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G122560 PE=4 SV=1
A0A1S3AV50 | 3.7e-289 | 68.87 | uncharacterized protein LOC103483286 OS=Cucumis melo OX=3656 GN=LOC103483286 PE=...
A0A6J1IE50 | 4.1e-288 | 67.57 | uncharacterized protein LOC111474742 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1IJ63 | 5.0e-286 | 67.27 | uncharacterized protein LOC111474742 isoform X3 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1IGN9 | 5.0e-286 | 67.27 | uncharacterized protein LOC111474742 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity | Description
XP_038900159.1 | 3.7e-299 | 70.82 | uncharacterized protein LOC120087281 isoform X1 [Benincasa hispida] >XP_03890016...
XP_023539935.1 | 6.9e-290 | 67.88 | uncharacterized protein LOC111800461 isoform X2 [Cucurbita pepo subsp. pepo]
XP_004134329.1 | 1.5e-289 | 69.13 | uncharacterized protein LOC101212841 isoform X2 [Cucumis sativus] >XP_011650759....
XP_008438065.1 | 7.7e-289 | 68.87 | PREDICTED: uncharacterized protein LOC103483286 [Cucumis melo] >XP_008438066.1 P...
XP_022975426.1 | 8.5e-288 | 67.57 | uncharacterized protein LOC111474742 isoform X2 [Cucurbita maxima]
Match Name | E-value | Identity | Description
AT1G01770.1 | 1.1e-197 | 50.19 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1446...
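The Identity column in these tables matches the percentage printed on each hit's HSP statistics line (for example `Identity = 524/772 (67.88%)` for XP_023539935.1). As a minimal sketch, assuming the statistics line keeps that exact `Identity = matches/length (percent%)` layout, the percentage can be recomputed from the two counts and compared with the reported value. The function name `check_identity` and the regular expression are illustrative assumptions, not part of the database or of BLAST itself.

```python
import re

# Hypothetical pattern for the "Identity = 524/772 (67.88%)" field of an
# HSP statistics line, as it appears in the alignments on this page.
IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")


def check_identity(stats_line):
    """Return (recomputed_percent, reported_percent) for one HSP line."""
    match = IDENTITY_RE.search(stats_line)
    if match is None:
        raise ValueError("no Identity field found in line")
    identical = int(match.group(1))      # identical aligned positions
    aligned_length = int(match.group(2))  # alignment length, gaps included
    reported = float(match.group(3))      # percentage printed by BLAST
    recomputed = round(100.0 * identical / aligned_length, 2)
    return recomputed, reported


if __name__ == "__main__":
    line = "Identity = 524/772 (67.88%), Query Frame = 0"
    print(check_identity(line))
```

A mismatch between the two returned values would suggest that the line was damaged during extraction rather than that BLAST reported an inconsistent score.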