Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGGAAGATTCTCCCGTCGATAGTCCGTTTCAAACGGAACACATTTTCCAGAAAACTATATATACTTTCCATTGTTCCCAATTCGTGCCCTAACTATCTGATTTCTCTCACTCTTAGCCGCCATTTCGTTTTCTCCGATCAAATTCTCGCCGGCACGATCTTCAAATTCTCTTCTGTAATTGCAGATTTGCGGCTTCTTTCGCCGGCGATGTAGGGAGATGTGTAGCGGCGGCGGTGAATCGAGTTCTGCTTTAGGGCTTCGGTGATGGAAGCTCTTGAATTGACTTTTCCGGTGGATGTGGCGGCGGCGGCGCCGAAGCTCATGGGATCTGACGGCTCTGCTAGAACCGAGGTCGAGTTGTGCGGTTTTGTGCCTTCTTCTAGCTGTTCTTTTGCATTTCAGCATTTTAGGTCGTACGGTAGGGGAAAAGGTGAATTTGCTTTACTTTTTTGCTGTAATTGTTCTGTTTAATTTTGGATTTGTTCCTGGTTTTTTTTTTAATTTTGAACTGCGTGTTGTTTGTTTAATTTTGGAGATTGTTGGCTTAGTGCTTGGAATTTTAAGCATTGCTTGTTTCTGAGCTGATTCTGTGTAGTAGGATGGAACTTATAGGCGCTGTTTGTTTGGCATATTTAGGTAAGATTTAGTTTTAGGATTTATTAACTTCTCATCGAAAATCTACATCTAAATGAGTTTTTATCAACTTCTTATCATTTTTACTTTTAAAAACAATTCTCATTTCTAATCATTTATTTTTAATCATTCATTTATAACTACTCAATCATTTTTAACCACTCATTTCTAATCATTTCTTTCAAATCAATCTAAATAACTCATTTTTTACCATTCATTTTTAATCATTCATTTTTTAACTATTAAAAATAACTACTAATTTTAACTCTACATCCAAACAAAGATTTGAATCTAACCCCTCTATACAAAAACTACAACCAAACGAGTTTTTGACACAACTCTCAATTTTTAACAACTCAATATAACTCCTCAACATAACTCTTTAACCTAAATCCTACATAAATATGTCAAACAAACGGCCCCGTAGAGATATTTGAGGCACAGACACTTATTGTTGAAGACCAATGAGTTTCAGTAGCCCTGTGAAGTCAGTCGCCACTTTTGTTATACGTTTTTCAATAATTATGCTCCCACTGATTTTTCTAACTCAAGTGCTTTTTATGAACCCTAGGCGTTTGTGAGGATGTTTTATCTTCTCTCGTTGTGCTGTGTCATTATTTATGAAAGATTTCTCTAATTAAATGTGTTCCAGTGGATTTTGAGTGAGGCCGACTCGGGATCCAGCAAGCTGGCTACTTAAAATGTGGCAATCATTTGGTTTTGTATTGTATTATTTTGAAAAGGAAATGTGATAAAGAAAAAAAATCTGCTGCAGTGATGTGATAGCGGAAATATCAAACATTTTGCTTAAGAAACATTGTGTCCAATCTTATTCTTACATAGAGGGTTTATAACATTTGTCTACTGAAATTCTGAACATGGAATAAGATCTATATATGGTCTTTTTAAGATCTTTATAAGGAATAACATATATATTTTCTGGCTCATTTGTCAAAGGATTGTCATATATTAGACTTATTTGGACTTTTGGAGTGTTCTTTTTTTACTCTGCCACATACCATCTATGTAATTTTAGTCCAAGTTGAATTTTGTAGATTATAGCATTTCATCTCCTTTAACTGGTATTTTTATTTCATAATTTTCCATACATTAATAAACTAATGCTCTATAGAAGTCGGTATGCCAGCTGGGACGAGCTCTATCAACAATCTGGGTTCTGTTTCTTTGGGTAAGAAGATCCCTGATGGAGCAGTTTCTAAGGGTGGTGAAGATCCATCTGAAGATTTTGAAAGTAGGAACAAAAGAAGTCATTTGTCCACCTTAAGTCCGGGAGTACAGCTACGTAAATCATTAAAGGTGTCAAGGAGTAGTAGTAGTAGTTTATGTTCTAAAAGGCGTGTGGTTCAGTGGGAAGATTCTTTATTATTAAGTGGAGCTGATGAAGTAAGAGATTCATCTGATAAGCTTGGATCGTATCTTAAAAAGTGCGGTTCTCATGGTAATTCACCAGTTCCCAATTTAATGCTATAAATGTTCTTTATTATTCTTTTTGGATCCTAATGAAGCCATGCAGTCCATCTATTTCATATTTTTGGATTGGAGATAATAACTTTTCTTGTAAAAAGAGTTAGATCATCCAACTGTTTACCTATAACTAGAACTAAAAAAAAAACATGAGCCAAGTACTGTCCTTAAAAGTAGGAATGGGTAAAATTTACAGAATAAAGGTAGCAATAAGACCCATTAGTAGAGGCCGTGAAAGGCAATTCAGTGAACTTTGCTCACTCCCCATTTATTCAAATTGTGGTTTTTTCCATCTTCGTTCTTTTATTTGAATGCTGATGATTCTTTCATTTATCGTCGTAAATTTATGAGAGAAGGATATTTGATAAAAAGGTATGGTTATTTTATTTATTGTTTTTTTTTAGTACATAGTGGGGGAAGGGGGACCAACAAACCTTACATAGAGGGCACAACTACTCTCAAATAATATGAGAGCCAAATACCGATTTCCGGGCTTGAACTCGTGACCGTGGAGTTGAAATGAATTAACACCAGACAAGATAACCAACTGTACCACCGACCCCAGTGTTATTTCATTTATTGTGAATGTATGAGAGGATGATATTTTGGAATTTTTTTGGTCGCTTTTGTAATGTTATTAATACACTTTACACTCTCTCTTCTACAGTTAAGAAGTCATTATATGTTCATATTTTAATGACTATGTATTGACTCTTGAGAAACTGTTGCTATGATAGCAATTTTACTTTTATATATGTTTGTGTATGTATATATATATATATTAAATCTTTATTGCTGTGCAAGCTTAGAATGGGCAACATCTGTTGCATATACACGATTGAGTTATAGAATATTTGTTTTTCATTTTCAATACTAGATTCCCATGCTTTTTCCTATTGTCAGTTGATGGGAAATCAGATATTTACTGATATATCCACCCCAAAAAAAATCAGAGAAAACTCAATTGCTGAAACAGAAGAGTGGCCTAAGCAGCAAGCGGGGTGAAAAGAGAAATCTGAAGGTGTCAATGAAGACAAAATTTGATTCACTCTCCATAAATTATGGAAATGGCTCGGCTGCAGCAGGGAGCAGTTTTTTTGGTATGATGATGTCCTTTGAATTATTAATTAATATGCTCAGGAACTTGTATTTATATGGTCTTTTGAGGATCTTATAACATGTATTGCAAACAGTCTCCTCTGGAAATGCAGATGCTTCAGATTGCTAAGGTGTAGATATTCTTGGTTTGTTGTGAGGAAAATTTGGACCATTGTTGTCACAGAACCTAAATTGTGGGACAACTGATAAGTATAACCAGATCGGGACAGTTCTAATTATATTATGTCTTTATTTGTTTGTTACTTCGATATCTTTTGTAAAGACGAGTTGATTTGTCCTTGTAATTAAGATATCTTGGAAAGATATAATCACGTGACTTGCCTTTCTAATAAGAGGTACGACCATTCTAGCTGGCTACCCATCTCATGAAACATAGAATGTGACTCTCTTAACATCAGCTATTGTATTTTATCTTCCAAGAAAAATCACTCCTTGAGTATTTGGTCTATAAAAATGACTTTACAAAGCTATTGAATTCTTATACAGGACTATATGGGCTGAAATCAGATGTTCACGATTTTACAAAGCTTACGGACGATCCACCGCTGAATGATCTTCTTGATGGCAGTTATGACTGTACTAGTTCAAGTATAGACAGAGGTAAAATAGATGCAACTGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCTTGGCCTATCCGCACACAAAATACTGAAGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTATCAAGCATAGAAGAAGGGGTGATTTTGGATGTAAAAGGAACTAGTGCAACAGATGCACTTTCATTAAACAAGGTCTGTGAGTAAGAACTTGACCATCCATCTCTCTACCACGCTCTCAAACACGTACTGTACCTTAGTGTTTGTTGGTTTTTATATTTTATATATATACTTTTTTGATATCCGTGACTGTCCCACCTCGAGTCCTCGACTAATGTCACGGGACATACCGCCGAACCAGCCCGCTCTCAAACACGTACTGTACCTGAGTGTTTGTTGGTTTTTAACTTTATGTTCTATACACTACTATTCATTTTTCTTCAATTGCTTCAAATTATTTTGGCATCTCTATTATAGACTTATAGTTATGTGAGATCTGTCCACAACCTGATCGAATTTATTTTTGTTGATTCTTTTGTTGTGATAATTGCTGTGGCAGAAATCTCAAATCTTCATTGAGAGGATTTTTATTCGTGTTCTAAATTTTAATCATAAATTACAGAGCTCTTATCCTTTATTACATTCAGATTTAGACCTTAATTTCTTTTTCTCTCCTGTTTTTCCATGAAGGTGCAAGATGTCTGTAGCAATTCTGAAACTTTGACTAAAATACTTGATTTTAAGTTGTGCAAACCTGGTGAGATATTCGTGATATTGGGCCTTCCTCGACCAAAGGACTTAGAATCTTTGCTCCAGGATGCCAGCAAGTCTGTATCTTCAAAGAATGCCACAGATTTGCGTTTAGCAAAGCATCAAACTCATAGAGCGATCTTGCAACCATTTCAATGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGGACCACATGTCCTGGTAGGTGCTGGCGAGTTGGAAACTTTTCCAATATTCCCAGTGCTTCCTCTGATTGTTTTACAAAAGACTTGGAGTCATTGACTTTTAACCAGAGTTTATTTCCTTCAACCATAAGAGTTGGTCCTGAAGATGGAAAATCCTGTGCATCTGTTAATTATCATCAATGTGGATGGGATTCCCTTCCTTCTGCAACTTGTTCGAAAGCTTCCTCTGTGCTTGCAGGTAAGGATTTTATTGGAAGGCATTTATGTCAGCGAAATTGACATACGGTACTCATCTTGGAATCATTTTGCTGTTTTTTTTTTGTTTAACTTAAATTGTGACCGCTTCTTTTGGTGTACATACAGAATCTCGCGAACAGATGAATCAAGAGGCAAATGGTATGATAAGTTTCAACTTTCTTATATGTTGTGTACCTTTATATATTTGATAGCTGCGGGTGGGGCTAAAACTTGCACTGCAGCACTACTTTTTTGGCTATAATATTAATAATGGGATAAACTGTAAACATGAGAAGGCTTTAATGCTTTGTGTATTTTGTTTTTGAACTCCTTATTTTTCTTCTATGAATCATATTAAACCTTATTTTTGAATTTTTCTAAGTTAGTAGTTAAAGATCCAATAAATTATAAAAGGTAAAAGTAAGATCCATCAGTAAATGACTTACCAATGGCACCATAAGAACTTCTCAGATTGCCTACCCACTGAATTTCTTTTATCACATAAATGACTTTTCTTATGAAATACCAACAGTAACCCTACATGAAAGGAGTCTTAATGGGTTTCCTTTCTCCAAAGGAACCATAAAAATTCCACTAAAGGAAGAGAGCGAAGAGAAAGATGAGGAAGAGAATGATAGGATTTAGATTGGGGGTATATTAGTAAATACGTAAAAGAATTTGCTGGATATTTAGTTATAAATAGAGGAGGTAGATATGGAAGAGTGTGGCAATTATATTTTGATAGAATTCTAACTCTATTACCATATAGAAATAAAAGAAATACAAGATAAATCAAGTATTTGATAGGTACGATACATGAACTTCCCTCTCTTGCAAAACTCTTGAGCTCAAAAGACTTCACCATATAATTGCCTCTCTCTTCCATACCCACCTCCTCTATATATAACTAACTCTTTATGTATTTACTAATGTACCCCTGATGTAATTCCTATCAAAGTCATTATGCAGTTTTAAACGAAGAGTTTTTCAGATGTTCAGTGGTTGAAGAAAAGGCTTTGCTAGTTTGAGAATTTTGGTAGGTAGAGAGAAGCCTACTCTGTAAAACATTAAAACGTTGGAGAGAAATCTAAATTAAATCCTCAGCAATTACTTGTTGTGCATGATTCAGAAATGTTGAGAACTTGGATTCTTAATCTCATACGATACGTCTCTCACTACCTTCCTAATTCATTCATTACAAACTTCTAATTTCAATCTATGAAATTTTTTTGTCATCAACTAAGAGTATAATTGAATTTTGTCTTCGGTTATTGATATGATATTGCTTCTTTACAGAACAGCAGTGTCCAAGAGTAATGACTGCTGCACGAACTCTCTACGATATTGTATCTACTGCATCGAGGCAACACATAGATGGGATAGTAAAGTGGCCAAAAAAGTCTTCACAAAAGTCCATGAAAGCTCGCAAGTTGAAATCAGAAGAAACTGAAGAGTTATACGCTGCCCCTACAGCGTACACGACCGAGGGCCATATACATACCTCAAAGAAGCCAAAGCTAGGAGCAGCAGAGAGCAGAAGAGACATTACTCAAACAAGTAGAAGAAAAGAACCATTGAATTTGGCAACACCCAGATCAAGTAGATCGTCCCCGAGTAAGTTCGTCAAAGATTCGGTTTCGGAAGCTAAACATTCAGCCTCCAGCATTGTAAAACTATCATCAATGATGCCTCCTCCAGCAACTCTTTTGTCCAAGGCTGGCGAGTGTCGACAAAAGACACGAAAGTTAATGCTGATGGATTGGAGAAGAGGAGGAGCACCGGGTTAGATTTTTCAAGAAACTCTCTAAGATTATATCCATGTCCAACAAAATGCTGTTATTAAATTAGAATCACCCCTCTGTTTCCAATGGTTGAAGACCTGGGCTTTGAGGGTATGCTCCCCTCAAGGTTCCAGGTTCGAGACTCAGCTGTGACATTACTCCTTCAATGTCTTTCGGTGCCTGACCTAGGGACGATCGTGGTTACTTTCGTTTCAAAAAAAAAAAAAAAAGAATAACTGCTTTGGGCTAGTAGTGTCTAATCAGGTTGGTTGATTTGGTTGTGTACATAATACTGTTGCAAGTGATGGATTGATGTCTTGAATAGTGACAAGCTTTCTAATTTTTTTAACATCATGGTAGGAGTTCAATCCCACCACAATGTAGTTTATTTATTGAAAGTATAGCTCAATTGACATGAAGTATGCTTTTATAATTAATGTTAAAAAATTGTTGTAACATCAACTTCATTCCAGAATAGCAATATAAATTTTAGTTCTTG
mRNA sequence
CGGAAGATTCTCCCGTCGATAGTCCGTTTCAAACGGAACACATTTTCCAGAAAACTATATATACTTTCCATTGTTCCCAATTCGTGCCCTAACTATCTGATTTCTCTCACTCTTAGCCGCCATTTCGTTTTCTCCGATCAAATTCTCGCCGGCACGATCTTCAAATTCTCTTCTGTAATTGCAGATTTGCGGCTTCTTTCGCCGGCGATGTAGGGAGATGTGTAGCGGCGGCGGTGAATCGAGTTCTGCTTTAGGGCTTCGGTGATGGAAGCTCTTGAATTGACTTTTCCGGTGGATGTGGCGGCGGCGGCGCCGAAGCTCATGGGATCTGACGGCTCTGCTAGAACCGAGGTCGAGTTGTGCGGTTTTGTGCCTTCTTCTAGCTGTTCTTTTGCATTTCAGCATTTTAGGTCGTACGGTAGGGGAAAAGCTGGGACGAGCTCTATCAACAATCTGGGTTCTGTTTCTTTGGGTAAGAAGATCCCTGATGGAGCAGTTTCTAAGGGTGGTGAAGATCCATCTGAAGATTTTGAAAGTAGGAACAAAAGAAGTCATTTGTCCACCTTAAGTCCGGGAGTACAGCTACGTAAATCATTAAAGGTGTCAAGGAGTAGTAGTAGTAGTTTATGTTCTAAAAGGCGTGTGGTTCAGTGGGAAGATTCTTTATTATTAAGTGGAGCTGATGAAGTAAGAGATTCATCTGATAAGCTTGGATCGTATCTTAAAAAGTGCGGTTCTCATGAGAAAACTCAATTGCTGAAACAGAAGAGTGGCCTAAGCAGCAAGCGGGGTGAAAAGAGAAATCTGAAGGTGTCAATGAAGACAAAATTTGATTCACTCTCCATAAATTATGGAAATGGCTCGGCTGCAGCAGGGAGCAGTTTTTTTGGACTATATGGGCTGAAATCAGATGTTCACGATTTTACAAAGCTTACGGACGATCCACCGCTGAATGATCTTCTTGATGGCAGTTATGACTGTACTAGTTCAAGTATAGACAGAGGTAAAATAGATGCAACTGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCTTGGCCTATCCGCACACAAAATACTGAAGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTATCAAGCATAGAAGAAGGGGTGATTTTGGATGTAAAAGGAACTAGTGCAACAGATGCACTTTCATTAAACAAGGTGCAAGATGTCTGTAGCAATTCTGAAACTTTGACTAAAATACTTGATTTTAAGTTGTGCAAACCTGGTGAGATATTCGTGATATTGGGCCTTCCTCGACCAAAGGACTTAGAATCTTTGCTCCAGGATGCCAGCAAGTCTGTATCTTCAAAGAATGCCACAGATTTGCGTTTAGCAAAGCATCAAACTCATAGAGCGATCTTGCAACCATTTCAATGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGGACCACATGTCCTGGTAGGTGCTGGCGAGTTGGAAACTTTTCCAATATTCCCAGTGCTTCCTCTGATTGTTTTACAAAAGACTTGGAGTCATTGACTTTTAACCAGAGTTTATTTCCTTCAACCATAAGAGTTGGTCCTGAAGATGGAAAATCCTGTGCATCTGTTAATTATCATCAATGTGGATGGGATTCCCTTCCTTCTGCAACTTGTTCGAAAGCTTCCTCTGTGCTTGCAGAATCTCGCGAACAGATGAATCAAGAGGCAAATGAACAGCAGTGTCCAAGAGTAATGACTGCTGCACGAACTCTCTACGATATTGTATCTACTGCATCGAGGCAACACATAGATGGGATAGTAAAGTGGCCAAAAAAGTCTTCACAAAAGTCCATGAAAGCTCGCAAGTTGAAATCAGAAGAAACTGAAGAGTTATACGCTGCCCCTACAGCGTACACGACCGAGGGCCATATACATACCTCAAAGAAGCCAAAGCTAGGAGCAGCAGAGAGCAGAAGAGACATTACTCAAACAAGTAGAAGAAAAGAACCATTGAATTTGGCAACACCCAGATCAAGTAGATCGTCCCCGAGTAAGTTCGTCAAAGATTCGGTTTCGGAAGCTAAACATTCAGCCTCCAGCATTGTAAAACTATCATCAATGATGCCTCCTCCAGCAACTCTTTTGTCCAAGGCTGGCGAGTGTCGACAAAAGACACGAAAGTTAATGCTGATGGATTGGAGAAGAGGAGGAGCACCGGGTTAGATTTTTCAAGAAACTCTCTAAGATTATATCCATGTCCAACAAAATGCTGTTATTAAATTAGAATCACCCCTCTGTTTCCAATGGTTGAAGACCTGGGCTTTGAGGGTATGCTCCCCTCAAGGTTCCAGGTTCGAGACTCAGCTGTGACATTACTCCTTCAATGTCTTTCGGTGCCTGACCTAGGGACGATCGTGGTTACTTTCGTTTCAAAAAAAAAAAAAAAAGAATAACTGCTTTGGGCTAGTAGTGTCTAATCAGGTTGGTTGATTTGGTTGTGTACATAATACTGTTGCAAGTGATGGATTGATGTCTTGAATAGTGACAAGCTTTCTAATTTTTTTAACATCATGGTAGGAGTTCAATCCCACCACAATGTAGTTTATTTATTGAAAGTATAGCTCAATTGACATGAAGTATGCTTTTATAATTAATGTTAAAAAATTGTTGTAACATCAACTTCATTCCAGAATAGCAATATAAATTTTAGTTCTTG
Coding sequence (CDS)
ATGGAAGCTCTTGAATTGACTTTTCCGGTGGATGTGGCGGCGGCGGCGCCGAAGCTCATGGGATCTGACGGCTCTGCTAGAACCGAGGTCGAGTTGTGCGGTTTTGTGCCTTCTTCTAGCTGTTCTTTTGCATTTCAGCATTTTAGGTCGTACGGTAGGGGAAAAGCTGGGACGAGCTCTATCAACAATCTGGGTTCTGTTTCTTTGGGTAAGAAGATCCCTGATGGAGCAGTTTCTAAGGGTGGTGAAGATCCATCTGAAGATTTTGAAAGTAGGAACAAAAGAAGTCATTTGTCCACCTTAAGTCCGGGAGTACAGCTACGTAAATCATTAAAGGTGTCAAGGAGTAGTAGTAGTAGTTTATGTTCTAAAAGGCGTGTGGTTCAGTGGGAAGATTCTTTATTATTAAGTGGAGCTGATGAAGTAAGAGATTCATCTGATAAGCTTGGATCGTATCTTAAAAAGTGCGGTTCTCATGAGAAAACTCAATTGCTGAAACAGAAGAGTGGCCTAAGCAGCAAGCGGGGTGAAAAGAGAAATCTGAAGGTGTCAATGAAGACAAAATTTGATTCACTCTCCATAAATTATGGAAATGGCTCGGCTGCAGCAGGGAGCAGTTTTTTTGGACTATATGGGCTGAAATCAGATGTTCACGATTTTACAAAGCTTACGGACGATCCACCGCTGAATGATCTTCTTGATGGCAGTTATGACTGTACTAGTTCAAGTATAGACAGAGGTAAAATAGATGCAACTGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCTTGGCCTATCCGCACACAAAATACTGAAGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTATCAAGCATAGAAGAAGGGGTGATTTTGGATGTAAAAGGAACTAGTGCAACAGATGCACTTTCATTAAACAAGGTGCAAGATGTCTGTAGCAATTCTGAAACTTTGACTAAAATACTTGATTTTAAGTTGTGCAAACCTGGTGAGATATTCGTGATATTGGGCCTTCCTCGACCAAAGGACTTAGAATCTTTGCTCCAGGATGCCAGCAAGTCTGTATCTTCAAAGAATGCCACAGATTTGCGTTTAGCAAAGCATCAAACTCATAGAGCGATCTTGCAACCATTTCAATGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGGACCACATGTCCTGGTAGGTGCTGGCGAGTTGGAAACTTTTCCAATATTCCCAGTGCTTCCTCTGATTGTTTTACAAAAGACTTGGAGTCATTGACTTTTAACCAGAGTTTATTTCCTTCAACCATAAGAGTTGGTCCTGAAGATGGAAAATCCTGTGCATCTGTTAATTATCATCAATGTGGATGGGATTCCCTTCCTTCTGCAACTTGTTCGAAAGCTTCCTCTGTGCTTGCAGAATCTCGCGAACAGATGAATCAAGAGGCAAATGAACAGCAGTGTCCAAGAGTAATGACTGCTGCACGAACTCTCTACGATATTGTATCTACTGCATCGAGGCAACACATAGATGGGATAGTAAAGTGGCCAAAAAAGTCTTCACAAAAGTCCATGAAAGCTCGCAAGTTGAAATCAGAAGAAACTGAAGAGTTATACGCTGCCCCTACAGCGTACACGACCGAGGGCCATATACATACCTCAAAGAAGCCAAAGCTAGGAGCAGCAGAGAGCAGAAGAGACATTACTCAAACAAGTAGAAGAAAAGAACCATTGAATTTGGCAACACCCAGATCAAGTAGATCGTCCCCGAGTAAGTTCGTCAAAGATTCGGTTTCGGAAGCTAAACATTCAGCCTCCAGCATTGTAAAACTATCATCAATGATGCCTCCTCCAGCAACTCTTTTGTCCAAGGCTGGCGAGTGTCGACAAAAGACACGAAAGTTAATGCTGATGGATTGGAGAAGAGGAGGAGCACCGGGTTAG
Protein sequence
MEALELTFPVDVAAAAPKLMGSDGSARTEVELCGFVPSSSCSFAFQHFRSYGRGKAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVSRSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSSKRGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPSTSLVSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVILGLPRPKDLESLLQDASKSVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDSSKFSANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIRVGPEDGKSCASVNYHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVSTASRQHIDGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAYTTEGHIHTSKKPKLGAAESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKLSSMMPPPATLLSKAGECRQKTRKLMLMDWRRGGAPG
Homology
BLAST of Sed0002932 vs. NCBI nr
Match:
KAG6600547.1 (hypothetical protein SDJN03_05780, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 969.5 bits (2505), Expect = 1.4e-278
Identity = 533/685 (77.81%), Postives = 579/685 (84.53%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M+ALEL +PVDV AAPKLMG DGS RT EVELC S+ S++FQHF SYG
Sbjct: 15 MDALELNYPVDV--AAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQ 74
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
K GTSSIN+LGSVSL +IPDGAVSK GED EDFESRNKRSHLST SPGVQ RKSLKVS
Sbjct: 75 KVGTSSINDLGSVSL-DEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 134
Query: 121 RSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSSK 180
RSSSSSLCSKRR+VQ EDSLLLSGADEV+D+SDKLGSYLKKCGSHEK QL+KQKS +SSK
Sbjct: 135 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 194
Query: 181 RGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
RG+KRNLKVS+KTKFDSL IN GNGSA AG FFGLYGLKSDVHDFTKLTDDPPLN LLD
Sbjct: 195 RGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNGLLD 254
Query: 241 GSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPSTSL 300
GSYD +S S +GK D VNECFLQSIRKACSVLQLPWP+R QNT ESESCSNSKPSTSL
Sbjct: 255 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSL 314
Query: 301 VSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVILGLP 360
VSSVSS+EEGV D K SATDA SL+KVQD CSNSETLT +LDFKL KP ++FV LGLP
Sbjct: 315 VSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 374
Query: 361 RPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDSSKFS 420
PKDLESLLQDASK SVSSKNATDLR AK Q+ RAILQPF WSHSFNGHSKANSDSSKFS
Sbjct: 375 LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS 434
Query: 421 ANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIRVGPEDGKSCA-SVN 480
ANRTTC GR WRVGNF+NIP+A++DCFTK+LESLTFNQSLFPST+ VGP+DGKS + SVN
Sbjct: 435 ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMGVGPDDGKSSSVSVN 494
Query: 481 YHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVS-TASRQHI 540
+HQ GWDSL SATCSKASS+L +SR +MN+EANEQ CPRVM AA+TLYDI + ASRQ+I
Sbjct: 495 HHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAASRQNI 554
Query: 541 DGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAY--------TTEGHIHTSKKPKLGAA 600
DGIVKWPKK SQKSMKARKLKSEETEELYAAPT Y EGH+H SKKPK G
Sbjct: 555 DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTV 614
Query: 601 ESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKLSSMMPPPATLL 660
ESRRDITQT+ RK PLN AT +SSRSSPSKF +DSVSEAKHS S +VK SSMMPPPAT L
Sbjct: 615 ESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDSVSEAKHSTSGLVKQSSMMPPPATHL 674
Query: 661 SKAGECRQKTRKLMLMDWRRGGAPG 669
SKA E +QKTRKLMLMDW+RGG G
Sbjct: 675 SKASEGQQKTRKLMLMDWKRGGGTG 696
BLAST of Sed0002932 vs. NCBI nr
Match:
XP_023535222.1 (uncharacterized protein LOC111796709 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 968.8 bits (2503), Expect = 2.5e-278
Identity = 531/685 (77.52%), Postives = 579/685 (84.53%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M+ALEL +PVDV AAPKLMG DGS RT EVELC S+ S++FQHF SYG
Sbjct: 1 MDALELNYPVDV--AAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQ 60
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
K GTSSIN+LGSVSL +IPDGAVSK GED EDFESRNKRSHLST SPGVQ RKSLKVS
Sbjct: 61 KVGTSSINDLGSVSL-DEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 120
Query: 121 RSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSSK 180
RSSSSSLCSKRR+VQ EDSLLLSGADEV+D+SDKLGSYLKKCGSHEK QL+KQKS +SSK
Sbjct: 121 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 180
Query: 181 RGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
RG+KRNLKVS+KTKFDSL IN GNGSA AG FFGLYGLKSDVHDFTKLTDDPPLN LLD
Sbjct: 181 RGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNGLLD 240
Query: 241 GSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPSTSL 300
GSYD +S S +GK D VNECFLQSIRKACSVLQLPWP+R QNT ESESCSNSKPSTSL
Sbjct: 241 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSL 300
Query: 301 VSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVILGLP 360
VSSVSS+EEGV D K S TDA SL+KVQD CSNSETLT +LDFKL KP ++FV LGLP
Sbjct: 301 VSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 360
Query: 361 RPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDSSKFS 420
PKDLESLLQDASK SVSSKNATDLR AK Q+ RAILQPF WSHSFNGHSKANSDSSKFS
Sbjct: 361 LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS 420
Query: 421 ANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIRVGPEDGKSCA-SVN 480
ANRTTC GR WRVGNF+NIP+A++DCFTK+LESLTFNQSLFPST+RVGP+DGKS + SVN
Sbjct: 421 ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVN 480
Query: 481 YHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVSTAS-RQHI 540
+HQ GWDSL SATCSKASS+L + R +MN+EANEQ CPRVM AA+TLYDI + A+ RQ+I
Sbjct: 481 HHQSGWDSLSSATCSKASSMLVDFRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNI 540
Query: 541 DGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAY--------TTEGHIHTSKKPKLGAA 600
DGIVKWPKK SQKSMKARKLKSEETEELYAAPT Y EGH+H SKKPK G
Sbjct: 541 DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTV 600
Query: 601 ESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKLSSMMPPPATLL 660
ESRRDITQT+ RK PLN AT +SSRSSPSKF++DSVSEAKHS S +VK SSMMPPPAT L
Sbjct: 601 ESRRDITQTNNRKGPLNWATTKSSRSSPSKFIRDSVSEAKHSTSGVVKQSSMMPPPATHL 660
Query: 661 SKAGECRQKTRKLMLMDWRRGGAPG 669
SKA E +QKTRKLMLMDW+RGG G
Sbjct: 661 SKASEGQQKTRKLMLMDWKRGGGTG 682
BLAST of Sed0002932 vs. NCBI nr
Match:
XP_022942381.1 (uncharacterized protein LOC111447442 isoform X1 [Cucurbita moschata])
HSP 1 Score: 966.5 bits (2497), Expect = 1.2e-277
Identity = 531/685 (77.52%), Postives = 578/685 (84.38%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M+ALEL +PVDV AAPKLMG DGS RT EVELC S+ S++FQHF SYG
Sbjct: 1 MDALELNYPVDV--AAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQ 60
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
K GTSSIN+LGSVSL +IPDGAVSK GED EDFESRNKRSHLST SPGVQ RKSLKVS
Sbjct: 61 KVGTSSINDLGSVSL-DEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 120
Query: 121 RSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSSK 180
RSSSSSLCSKRR+VQ EDSLLLSGADEV+D+SDKLGSYLKKCGSHEK QL+KQKS +SSK
Sbjct: 121 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 180
Query: 181 RGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
RG+KRNL VS+KTKFDSL IN GNGSA AG FFGLYGLKSDVHDFTKLTDDP LN LLD
Sbjct: 181 RGDKRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLD 240
Query: 241 GSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPSTSL 300
GSYD +S S +GK D VNECFLQSIRKACSVLQLPWP+R QNT ESESCSNSKPSTSL
Sbjct: 241 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSL 300
Query: 301 VSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVILGLP 360
VSSVSS+EEGV D K S TDA SL+KVQD CSNSETLT +LDFKL KP ++FV LGLP
Sbjct: 301 VSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 360
Query: 361 RPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDSSKFS 420
PKDLESLLQDASK SVSSKNATDLR AK Q+ RAILQPF WSHSFNGHSKANSDSSKFS
Sbjct: 361 LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS 420
Query: 421 ANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIRVGPEDGKSCA-SVN 480
ANRTTC GR WRVGNF+NIP+A++DCFTK+LESLTFNQSLFPST+RVGP+DGKS + SVN
Sbjct: 421 ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVN 480
Query: 481 YHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVSTAS-RQHI 540
+HQ GWDSL SATCSKASS+L +SR +MN+EANEQ CPRVM AA+TLYDI + A+ RQ+I
Sbjct: 481 HHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNI 540
Query: 541 DGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAY--------TTEGHIHTSKKPKLGAA 600
DGIVKWPKK SQKSMKARKLKSEETEELYAAPT Y EGH+H SKKPK G
Sbjct: 541 DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTV 600
Query: 601 ESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKLSSMMPPPATLL 660
ESRRDITQT+ RK PLN AT +SSRSSPSKFV+DSVSEAKHS S +VK SSMMPPPAT L
Sbjct: 601 ESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHL 660
Query: 661 SKAGECRQKTRKLMLMDWRRGGAPG 669
SKA E +QKTRKLMLMDW+RGG G
Sbjct: 661 SKASEGQQKTRKLMLMDWKRGGGTG 682
BLAST of Sed0002932 vs. NCBI nr
Match:
XP_022979382.1 (uncharacterized protein LOC111479123 isoform X1 [Cucurbita maxima])
HSP 1 Score: 953.7 bits (2464), Expect = 8.2e-274
Identity = 525/682 (76.98%), Postives = 573/682 (84.02%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M+ALEL +PVDV AAPKLMG DGS RT EV+LC S+ S++FQHF SYG
Sbjct: 1 MDALELNYPVDV--AAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQ 60
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
K GTSSIN+LGSVSL +I DGAVSK GED EDFESRNKRSHLST SPGVQ RKSLKVS
Sbjct: 61 KVGTSSINDLGSVSL-DEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 120
Query: 121 RSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSSK 180
RSSSSSLCSKRR+VQ EDSLLLSGADEV+D+SDKLGSYLKKCGSHEK QL+KQKS +SSK
Sbjct: 121 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 180
Query: 181 RGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
RG+KRNLKVS+KTKFDSL IN GNGSA AG FFGLYGLKSDVHDFTKLTDDPPLNDLLD
Sbjct: 181 RGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
Query: 241 GSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPSTSL 300
GSYD +S S +GK D VNECFLQSIRKACSVLQLPWP+ QNT ESESCSNSKPSTSL
Sbjct: 241 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSL 300
Query: 301 VSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVILGLP 360
VSSVSS+EEGV D K S TDA L+KVQD CSNSETLT +LDFKL KP ++FV LGLP
Sbjct: 301 VSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 360
Query: 361 RPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDSSKFS 420
PKDLESLLQDASK SVSSKNATDLR AK Q+ RAILQPF WSHSFNGHSKANSDSSKFS
Sbjct: 361 LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS 420
Query: 421 ANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIRVGPEDGKSCA-SVN 480
ANRTTC GR WRVGNF+NIP+A++DCFTK+LESL FNQSLFPST+RVGP+DGKS + SVN
Sbjct: 421 ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVN 480
Query: 481 YHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVSTAS-RQHI 540
+HQ GWDSL SATCSKASS+L +SR +MN+EANE CPRVM AA+TLYDI + A+ RQ+I
Sbjct: 481 HHQSGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNI 540
Query: 541 DGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAY--------TTEGHIHTSKKPKLGAA 600
DGIVKWPKK SQKSMKARKLKSEETEELYAAPT Y EG +H SKKPK G
Sbjct: 541 DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTV 600
Query: 601 ESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKLSSMMPPPATLL 660
ESRRDITQT+ RK PLN A +SSRSSPSKFV+DSVSEAKHS S +VK SSMMPPPAT L
Sbjct: 601 ESRRDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHL 660
Query: 661 SKAGECRQKTRKLMLMDWRRGG 666
SKA E +QKTRKLMLMDW+RGG
Sbjct: 661 SKASEGQQKTRKLMLMDWKRGG 679
BLAST of Sed0002932 vs. NCBI nr
Match:
KAG7031186.1 (hypothetical protein SDJN02_05226 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 953.0 bits (2462), Expect = 1.4e-273
Identity = 532/713 (74.61%), Postives = 579/713 (81.21%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M+ALEL +PVDV AAPKLMG DGS RT EVELC S+ S++FQHF SYG
Sbjct: 1 MDALELNYPVDV--AAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQ 60
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
K GTSSIN+LGSVSL +IPDGAVSK GED EDFESRNKRSHLST SPGVQ RKSLKVS
Sbjct: 61 KVGTSSINDLGSVSL-DEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 120
Query: 121 RSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSSK 180
RSSSSSLCSKRR+VQ EDSLLLSGADEV+D+SDKLGSYLKKCGSHEK QL+KQKS +SSK
Sbjct: 121 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 180
Query: 181 RGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFF-------------------------- 240
RG+KRNLKVS+KTKFDSL IN GNGSA AG FF
Sbjct: 181 RGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEM 240
Query: 241 --GLYGLKSDVHDFTKLTDDPPLNDLLDGSYDCTSSSIDRGKIDATVNECFLQSIRKACS 300
GLYGLKSDVHDFTKLTDDPPLN LLDGSYD +S S +GK D VNECFLQSIRKACS
Sbjct: 241 FTGLYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACS 300
Query: 301 VLQLPWPIRTQNTEESESCSNSKPSTSLVSSVSSIEEGVILDVKGTSATDALSLNKVQDV 360
VLQLPWP+R QNT ESESCSNSKPSTSLVSSVSS+EEGV D K SATDA SL+KVQD
Sbjct: 301 VLQLPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDA 360
Query: 361 CSNSETLTKILDFKLCKPGEIFVILGLPRPKDLESLLQDASK-SVSSKNATDLRLAKHQT 420
CSNSETLT +LDFKL KP ++FV LGLP PKDLESLLQDASK SVSSKNATDLR AK Q+
Sbjct: 361 CSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQS 420
Query: 421 HRAILQPFQWSHSFNGHSKANSDSSKFSANRTTCPGRCWRVGNFSNIPSASSDCFTKDLE 480
RAILQPF WSHSFNGHSKANSDSSKFSANRTTC GR WRVGNF+NIP+A++DCFTK+LE
Sbjct: 421 RRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLE 480
Query: 481 SLTFNQSLFPSTIRVGPEDGKSCA-SVNYHQCGWDSLPSATCSKASSVLAESREQMNQEA 540
SLTFNQSLFPST+ VGP+DGKS + SVN+HQ GWDSL SATCSKASS+L +SR +MN+EA
Sbjct: 481 SLTFNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREA 540
Query: 541 NEQQCPRVMTAARTLYDIVSTAS-RQHIDGIVKWPKKSSQKSMKARKLKSEETEELYAAP 600
NEQ CPRVM AA+TLYDI + A+ RQ+IDGIVKWPKK SQKSMKARKLKSEETEELYAAP
Sbjct: 541 NEQHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAP 600
Query: 601 TAY--------TTEGHIHTSKKPKLGAAESRRDITQTSRRKEPLNLATPRSSRSSPSKFV 660
T Y EGH+H SKKPK G ESRRDITQT+ RK PLN AT +SSRSSPSKF
Sbjct: 601 TTYGLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFF 660
Query: 661 KDSVSEAKHSASSIVKLSSMMPPPATLLSKAGECRQKTRKLMLMDWRRGGAPG 669
+DSVSEAKHS S +VK SSMMPPPAT LSKA E +QKTRKLMLMDW+RGG G
Sbjct: 661 RDSVSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 710
BLAST of Sed0002932 vs. ExPASy TrEMBL
Match:
A0A6J1FNQ1 (uncharacterized protein LOC111447442 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447442 PE=4 SV=1)
HSP 1 Score: 966.5 bits (2497), Expect = 5.9e-278
Identity = 531/685 (77.52%), Postives = 578/685 (84.38%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M+ALEL +PVDV AAPKLMG DGS RT EVELC S+ S++FQHF SYG
Sbjct: 1 MDALELNYPVDV--AAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQ 60
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
K GTSSIN+LGSVSL +IPDGAVSK GED EDFESRNKRSHLST SPGVQ RKSLKVS
Sbjct: 61 KVGTSSINDLGSVSL-DEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 120
Query: 121 RSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSSK 180
RSSSSSLCSKRR+VQ EDSLLLSGADEV+D+SDKLGSYLKKCGSHEK QL+KQKS +SSK
Sbjct: 121 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 180
Query: 181 RGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
RG+KRNL VS+KTKFDSL IN GNGSA AG FFGLYGLKSDVHDFTKLTDDP LN LLD
Sbjct: 181 RGDKRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLD 240
Query: 241 GSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPSTSL 300
GSYD +S S +GK D VNECFLQSIRKACSVLQLPWP+R QNT ESESCSNSKPSTSL
Sbjct: 241 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSL 300
Query: 301 VSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVILGLP 360
VSSVSS+EEGV D K S TDA SL+KVQD CSNSETLT +LDFKL KP ++FV LGLP
Sbjct: 301 VSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 360
Query: 361 RPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDSSKFS 420
PKDLESLLQDASK SVSSKNATDLR AK Q+ RAILQPF WSHSFNGHSKANSDSSKFS
Sbjct: 361 LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS 420
Query: 421 ANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIRVGPEDGKSCA-SVN 480
ANRTTC GR WRVGNF+NIP+A++DCFTK+LESLTFNQSLFPST+RVGP+DGKS + SVN
Sbjct: 421 ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVN 480
Query: 481 YHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVSTAS-RQHI 540
+HQ GWDSL SATCSKASS+L +SR +MN+EANEQ CPRVM AA+TLYDI + A+ RQ+I
Sbjct: 481 HHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNI 540
Query: 541 DGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAY--------TTEGHIHTSKKPKLGAA 600
DGIVKWPKK SQKSMKARKLKSEETEELYAAPT Y EGH+H SKKPK G
Sbjct: 541 DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTV 600
Query: 601 ESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKLSSMMPPPATLL 660
ESRRDITQT+ RK PLN AT +SSRSSPSKFV+DSVSEAKHS S +VK SSMMPPPAT L
Sbjct: 601 ESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHL 660
Query: 661 SKAGECRQKTRKLMLMDWRRGGAPG 669
SKA E +QKTRKLMLMDW+RGG G
Sbjct: 661 SKASEGQQKTRKLMLMDWKRGGGTG 682
BLAST of Sed0002932 vs. ExPASy TrEMBL
Match:
A0A6J1INL3 (uncharacterized protein LOC111479123 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111479123 PE=4 SV=1)
HSP 1 Score: 953.7 bits (2464), Expect = 4.0e-274
Identity = 525/682 (76.98%), Postives = 573/682 (84.02%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M+ALEL +PVDV AAPKLMG DGS RT EV+LC S+ S++FQHF SYG
Sbjct: 1 MDALELNYPVDV--AAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQ 60
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
K GTSSIN+LGSVSL +I DGAVSK GED EDFESRNKRSHLST SPGVQ RKSLKVS
Sbjct: 61 KVGTSSINDLGSVSL-DEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 120
Query: 121 RSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSSK 180
RSSSSSLCSKRR+VQ EDSLLLSGADEV+D+SDKLGSYLKKCGSHEK QL+KQKS +SSK
Sbjct: 121 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 180
Query: 181 RGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
RG+KRNLKVS+KTKFDSL IN GNGSA AG FFGLYGLKSDVHDFTKLTDDPPLNDLLD
Sbjct: 181 RGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
Query: 241 GSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPSTSL 300
GSYD +S S +GK D VNECFLQSIRKACSVLQLPWP+ QNT ESESCSNSKPSTSL
Sbjct: 241 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSL 300
Query: 301 VSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVILGLP 360
VSSVSS+EEGV D K S TDA L+KVQD CSNSETLT +LDFKL KP ++FV LGLP
Sbjct: 301 VSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 360
Query: 361 RPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDSSKFS 420
PKDLESLLQDASK SVSSKNATDLR AK Q+ RAILQPF WSHSFNGHSKANSDSSKFS
Sbjct: 361 LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS 420
Query: 421 ANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIRVGPEDGKSCA-SVN 480
ANRTTC GR WRVGNF+NIP+A++DCFTK+LESL FNQSLFPST+RVGP+DGKS + SVN
Sbjct: 421 ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVN 480
Query: 481 YHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVSTAS-RQHI 540
+HQ GWDSL SATCSKASS+L +SR +MN+EANE CPRVM AA+TLYDI + A+ RQ+I
Sbjct: 481 HHQSGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNI 540
Query: 541 DGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAY--------TTEGHIHTSKKPKLGAA 600
DGIVKWPKK SQKSMKARKLKSEETEELYAAPT Y EG +H SKKPK G
Sbjct: 541 DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTV 600
Query: 601 ESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKLSSMMPPPATLL 660
ESRRDITQT+ RK PLN A +SSRSSPSKFV+DSVSEAKHS S +VK SSMMPPPAT L
Sbjct: 601 ESRRDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHL 660
Query: 661 SKAGECRQKTRKLMLMDWRRGG 666
SKA E +QKTRKLMLMDW+RGG
Sbjct: 661 SKASEGQQKTRKLMLMDWKRGG 679
BLAST of Sed0002932 vs. ExPASy TrEMBL
Match:
A0A6J1C5T9 (uncharacterized protein LOC111008234 OS=Momordica charantia OX=3673 GN=LOC111008234 PE=4 SV=1)
HSP 1 Score: 930.6 bits (2404), Expect = 3.6e-267
Identity = 518/690 (75.07%), Postives = 574/690 (83.19%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M+A+ELT+PVDV AAPKLMG DGS RT EVELC S+ S++FQHF SYG
Sbjct: 1 MDAVELTYPVDV--AAPKLMGPDGSVRTGVTIEEVELCESDRVSAPPSYSFQHFSSYGSQ 60
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
KAGTSSIN++GSVSL KIPDGAVSK GE SED ESRNKRS L T SPGVQ RKSLKVS
Sbjct: 61 KAGTSSINDVGSVSL-DKIPDGAVSKDGEGTSEDLESRNKRSLLFTSSPGVQQRKSLKVS 120
Query: 121 RSSSSSLCSKR-RVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSH--EKTQLLKQKSGL 180
RSSSSSLCSKR RVV+ EDSL LSGAD+V+D+SDKLGSYLKKC SH EK QLLKQKS L
Sbjct: 121 RSSSSSLCSKRPRVVRLEDSLFLSGADDVKDTSDKLGSYLKKCSSHETEKAQLLKQKSSL 180
Query: 181 SSKRGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLND 240
SSKRG+KRNLKVS+KTKFDSLSIN GNGSAAAGSSF LYGLKSDVHDFTKL DDPPLND
Sbjct: 181 SSKRGDKRNLKVSLKTKFDSLSINAGNGSAAAGSSFLALYGLKSDVHDFTKLVDDPPLND 240
Query: 241 LLDGSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPS 300
LLDGSYD S SID+GK D VNECFLQS+RKACSVLQLPWP+ QN ESE CSNSKPS
Sbjct: 241 LLDGSYDSASLSIDKGKKDTNVNECFLQSVRKACSVLQLPWPVHPQNIAESEGCSNSKPS 300
Query: 301 TSLVSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVIL 360
TS+VS VSS+EEGV DVK ATD+ SLNKV+D CSNSETLT LDFKL KP ++FV +
Sbjct: 301 TSIVSYVSSMEEGVNFDVKEPIATDSPSLNKVRDACSNSETLTNPLDFKLYKPDDMFVKM 360
Query: 361 GLPRPKDLESLLQDASKS--VSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDS 420
GLP PKDLESLLQDASKS SSKN TDLR AK Q+ RA+LQPF WSHSFNGHSK+NSDS
Sbjct: 361 GLPLPKDLESLLQDASKSSVSSSKNVTDLRSAKQQSRRAMLQPFPWSHSFNGHSKSNSDS 420
Query: 421 SKFSANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIR-VGPEDGKSC 480
SKFSANRTTCPGR WR+GNFS+IPSA++DCFTKDLESLTFNQSLFPST+R VGP+D +S
Sbjct: 421 SKFSANRTTCPGRWWRIGNFSSIPSATADCFTKDLESLTFNQSLFPSTMRVVGPDDRRSS 480
Query: 481 A-SVNYHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVS-TA 540
+ SVN+HQCGWDSL SA CSKASSVL ESR + N EAN+QQCP+V+ AA+TLYDI + A
Sbjct: 481 SVSVNHHQCGWDSLSSAICSKASSVLVESRGKTNYEANDQQCPKVIAAAKTLYDIATYAA 540
Query: 541 SRQHIDGIVKWPKKSSQKSMKARKLKSEETEELYAAPT-------AYTTEGHIHTSKKPK 600
SRQ+IDGIV+WPKK SQKSM+ARKLKSEETEELYAAPT + +EGH+H+SKKPK
Sbjct: 541 SRQNIDGIVRWPKKPSQKSMRARKLKSEETEELYAAPTYGLWSDNPFKSEGHMHSSKKPK 600
Query: 601 LGAAESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKLSSMMPPP 660
LG ESRRD+ T+ R+ PLN ATPRSSRSSPSKFV+DS S+AKHS S IVK SSMMPPP
Sbjct: 601 LGTTESRRDLAHTNCRRGPLNWATPRSSRSSPSKFVRDSASDAKHSTSGIVKPSSMMPPP 660
Query: 661 A-TLLSKAGECRQKTRKLMLMDWRRGGAPG 669
A TLL K GE +QKTRKLMLMDW+RGG G
Sbjct: 661 ATTLLCKGGEGQQKTRKLMLMDWKRGGGTG 687
BLAST of Sed0002932 vs. ExPASy TrEMBL
Match:
A0A6J1EYC2 (uncharacterized protein LOC111437570 OS=Cucurbita moschata OX=3662 GN=LOC111437570 PE=4 SV=1)
HSP 1 Score: 894.8 bits (2311), Expect = 2.2e-256
Identity = 507/685 (74.01%), Postives = 556/685 (81.17%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC-GFVPSSSCSFAFQHFRSYGRG 60
M ALELT PVDV + KLMG DGS RT EVELC S+ S++FQHF SYG
Sbjct: 1 MYALELTCPVDVVVS--KLMGPDGSVRTGVTIEEVELCEADRGSAPPSYSFQHFSSYGCK 60
Query: 61 KAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSLKVS 120
K GTSSIN+LG VSL K+PDGAV K GE+ SEDFESRNKRSHLST S GVQ RK LKVS
Sbjct: 61 KDGTSSINDLGPVSL-DKVPDGAVFKDGENTSEDFESRNKRSHLSTSSLGVQPRKPLKVS 120
Query: 121 RSSSSSLCSKR-RVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSGLSS 180
R SSSLCSKR RVVQ ED L LSGAD+V SDKLGSYLKKC SHEKTQLLKQKS LSS
Sbjct: 121 R-GSSSLCSKRPRVVQLEDPLFLSGADDV---SDKLGSYLKKCNSHEKTQLLKQKSSLSS 180
Query: 181 KRGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLNDLL 240
KRG+KRNLKVS+KTKFDS S N GNGSAAAGSSF GLYGLKS DFTKLTDDPPLND+L
Sbjct: 181 KRGDKRNLKVSLKTKFDSFSTNAGNGSAAAGSSFHGLYGLKSGARDFTKLTDDPPLNDIL 240
Query: 241 DGSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKPSTS 300
DGSYDC + S D+GK D VNECFLQSIRKACSVLQLPWP+R QN ESESCSNSKP TS
Sbjct: 241 DGSYDCANLSKDKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNMAESESCSNSKPDTS 300
Query: 301 LVSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTKILDFKLCKPGEIFVILGL 360
LVSSVSS+EE V DVK SATD+ SLNKV+D C+NSE LT LDFKL KP +F+ LGL
Sbjct: 301 LVSSVSSMEEKVNFDVKELSATDSPSLNKVEDACNNSEPLTNALDFKLYKPDHMFMKLGL 360
Query: 361 PRPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKANSDSSKF 420
P PKDL SLLQDASK SVSS NATDLR AK Q+ RA+LQPF WSHSFNGHSKANSDSSKF
Sbjct: 361 PIPKDLNSLLQDASKSSVSSNNATDLRSAKQQSRRAMLQPFAWSHSFNGHSKANSDSSKF 420
Query: 421 SANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIRV-GPEDGKSCASV 480
SANRTTC GR WRV NFSNIPSA++DCFTKDLESLTFNQSLFPST+RV GP+DG+S SV
Sbjct: 421 SANRTTCLGRWWRVRNFSNIPSATADCFTKDLESLTFNQSLFPSTMRVIGPDDGRSSISV 480
Query: 481 NYHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVSTAS-RQH 540
N+HQCGWDSL SATCSK SSVL ESR +MN E+ EQQCPRVM AA+TLYDI ++A+ RQ+
Sbjct: 481 NHHQCGWDSLSSATCSKTSSVLVESRGKMNSESYEQQCPRVMAAAQTLYDIATSAALRQN 540
Query: 541 IDGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAY--------TTEG-HIHTSKKPKLG 600
IDG+V+WPKK+SQKSM+ARKLKSEETEELY PT Y EG H H SKKPKLG
Sbjct: 541 IDGMVRWPKKASQKSMRARKLKSEETEELYTTPTTYGLWSNNSIKNEGHHAHPSKKPKLG 600
Query: 601 AAESRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHS-ASSIVKLSSMMPPPA 660
ESRRD+ QT+ ++ PLN TPRSSRSSPSKF++DSVSEAK S A +I + SSMMPPPA
Sbjct: 601 TTESRRDVAQTNCKRGPLNWTTPRSSRSSPSKFIRDSVSEAKPSTAGAIKQSSSMMPPPA 660
Query: 661 TLLSKAGECRQKTRKLMLMDWRRGG 666
TLL KAGE +QKTRKLMLMDW+RGG
Sbjct: 661 TLLCKAGEGQQKTRKLMLMDWKRGG 678
BLAST of Sed0002932 vs. ExPASy TrEMBL
Match:
A0A5A7TNS5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold74G001420 PE=4 SV=1)
HSP 1 Score: 891.7 bits (2303), Expect = 1.8e-255
Identity = 514/697 (73.74%), Postives = 559/697 (80.20%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSART-----EVELC----GFVPSSSCSFAFQHFRSY 60
M+ALELTFP VA KLMG DGS RT EVELC G PS SF+FQHF SY
Sbjct: 1 MDALELTFPAVVAPL--KLMGPDGSVRTEVTIEEVELCEADRGSAPS---SFSFQHFSSY 60
Query: 61 GRGKAGTSSINNLGSVSLGKKIPDGAVSKGGEDPSEDFESRNKRSHLSTLSPGVQLRKSL 120
G KAGTSSIN+LGSV L KIPDGAVS+ GED SEDFESRNK S LST SPGV RKSL
Sbjct: 61 GSLKAGTSSINDLGSVPL-DKIPDGAVSRDGEDASEDFESRNKGSQLSTSSPGVHPRKSL 120
Query: 121 KVSRSSSSSLCSKR-RVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGSHEKTQLLKQKSG 180
KV RSSSSSLCSKR RVVQ EDSL LSGAD+ +D+SDKLGSYLKKC SHEKTQLLKQKS
Sbjct: 121 KVPRSSSSSLCSKRPRVVQLEDSLFLSGADDAKDASDKLGSYLKKCNSHEKTQLLKQKSS 180
Query: 181 LSSKRGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVHDFTKLTDDPPLN 240
LSSKRG+KRNLKVS+KTK +SLS N GN SAA GSSF GLYGLKSDVHDFTKLTDDPPL+
Sbjct: 181 LSSKRGDKRNLKVSLKTKLESLSTNAGNCSAAPGSSFSGLYGLKSDVHDFTKLTDDPPLS 240
Query: 241 DLLDGSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQNTEESESCSNSKP 300
LLDGSYDC + S D+G+ DA VNECFLQSIRKACSVLQLP P+ QN ESESCSNSKP
Sbjct: 241 GLLDGSYDCANLSKDKGRKDANVNECFLQSIRKACSVLQLPLPVHPQNVPESESCSNSKP 300
Query: 301 STSLVSSVSSIEEGVILDVKGTSA---TDALSLNKVQDVCSNSETLTKILDFKLCKPGEI 360
STSLV+ VSS+EE D KGTSA TD+ SLNKVQD CSNSE L +LDF+L KP +I
Sbjct: 301 STSLVTPVSSMEEQANFDAKGTSASWVTDSPSLNKVQDACSNSEPLANVLDFELHKPDDI 360
Query: 361 FVILGLPRPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQWSHSFNGHSKAN 420
FV LGLP PKDLESLLQDASK S+ SKNATDLR AK Q RA+LQPF WSHSFNGHSKAN
Sbjct: 361 FVKLGLPLPKDLESLLQDASKSSIPSKNATDLRSAKQQFRRAMLQPFPWSHSFNGHSKAN 420
Query: 421 SDSSKFSANRTTCPGRCWRVGNFSNIPSASSDCFTKDLESLTFNQSLFPSTIR-VGPEDG 480
SDSSK SANRTTCPGR WRVGNFSNIP A++DCFTKDLESLTFNQSLFPST+R VG +DG
Sbjct: 421 SDSSKLSANRTTCPGRWWRVGNFSNIPCAATDCFTKDLESLTFNQSLFPSTMRVVGSKDG 480
Query: 481 KSCASVNYHQCGWDSLPSATCSKASSVLAESREQMNQEANEQQCPRVMTAARTLYDIVST 540
S SVN+HQCGWDSL SATCSK SSVL ESR ++NQEANEQQCPRVM AA+TL DI ++
Sbjct: 481 GSFVSVNHHQCGWDSLSSATCSKTSSVLVESRGKINQEANEQQCPRVMAAAQTLCDIATS 540
Query: 541 AS-RQHIDGIVKWPKKSSQKSMKARKLKSEETEELYAAPTAY--------TTEGH--IHT 600
AS RQ+IDGIV+WPKK SQKSMKARKLKSEETEELY PT Y EGH H
Sbjct: 541 ASLRQNIDGIVRWPKKPSQKSMKARKLKSEETEELYTTPTTYGLWSNNSFKNEGHQTPHP 600
Query: 601 SKKPKLG-AAESRRD-ITQTSRRKEPLNLATPRSSRSSPSKFVKDSVSEAKHSASSIVKL 660
KKPKLG E+RRD I QT+ R+ PLN +TPRSSRSSPSKF++DSVS+ KHS VK
Sbjct: 601 LKKPKLGTTTENRRDNIAQTNCRR-PLNWSTPRSSRSSPSKFIEDSVSDIKHSTVGTVKQ 660
Query: 661 SSMMPPPA-TLLSKAGECRQKTRKLMLMDWRRGGAPG 669
SSMMPPPA TLL KAG+ +QKTRKLMLMDW+RGG G
Sbjct: 661 SSMMPPPATTLLCKAGDGQQKTRKLMLMDWKRGGGTG 690
BLAST of Sed0002932 vs. TAIR 10
Match:
AT1G64050.1 (unknown protein; Has 524 Blast hits to 342 proteins in 101 species: Archae - 0; Bacteria - 106; Metazoa - 106; Fungi - 24; Plants - 25; Viruses - 0; Other Eukaryotes - 263 (source: NCBI BLink). )
HSP 1 Score: 213.4 bits (542), Expect = 5.6e-55
Identity = 223/702 (31.77%), Postives = 336/702 (47.86%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSARTEVELCGFVPSSSCSFAFQHFRSYGRGKAGTSS 60
M+ L+++ PVDV+ A KLMGS+G CG S + A + SS
Sbjct: 1 MDGLKISCPVDVSLPA-KLMGSEG--------CGGGVRVSSNKADNNCDKARVSIGVNSS 60
Query: 61 INNLGSVSLGKKIPDGAVSKGGEDP---------SEDF--------------ESRNKRSH 120
I S S+ KK GA S G S DF E +N S
Sbjct: 61 IERCSSASINKK---GAGSSSGASDSSLWRKLMHSHDFVHDRLTKLRVDNSSEPQNGYSP 120
Query: 121 LSTLSPGVQLRKSLKVSRSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCG 180
+++ RK K+SRSSS + +R + D + + D D+ + G C
Sbjct: 121 IASPESAESPRKRGKLSRSSSGNGTPRRTKLILLDETVRTQRD--NDTKEICGQGSTSC- 180
Query: 181 SHEKTQLLKQKSGLSSKRGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDV 240
+K ++KQ+S + KRG+KR KV ++T +IN SA ++FFG YGLK +
Sbjct: 181 -LDKPFVVKQRSSYNGKRGDKRISKVPVRT---LSTIN----SATGENAFFGAYGLKPAI 240
Query: 241 HDFTKLTDDPPLNDLLDGSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQ 300
+D TKL +D L LL+GSY+C S D+ K N L ++ S+L P+++Q
Sbjct: 241 NDVTKLVEDFSLKSLLEGSYECPSLGKDKMKKSENTNNTLLSVVKNVWSILPTKRPVQSQ 300
Query: 301 NTEESESCSN---SKPSTSLVSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLT 360
++ E ++C + P +S+ +++ + E ++ +A D + +D C NSE +
Sbjct: 301 SSTELDTCLSRTLGSPPSSISATLPNSE-----NIDKVNALDGDLSSSSKDHCINSEIPS 360
Query: 361 KILDFKLCKPGEIFVILGLPRPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPF 420
L F LC G++ LGLP KDL+SLLQDASK S +SKN D + + H L F
Sbjct: 361 TPLSFPLCDAGDVLKRLGLPPSKDLDSLLQDASKPSHNSKNNLDQQRSAKPPHSG-LPHF 420
Query: 421 QWSHSFNGHSKANSDSSKFSANRTTCPGRCWRVGNFS-NIPSASSDCFTKDLESLTFNQS 480
WS FNG S+ NS+++K +T C GR R+ + S + P +D F +LESLTFNQ+
Sbjct: 421 PWSQPFNGSSRTNSEAAKLVTCKTLCQGRWLRIADTSMSSPEGITDNFA-NLESLTFNQN 480
Query: 481 LFPSTIRVGPEDGKSCASVNYHQCGWDSLPSATCSKAS--------SVLAESREQMNQEA 540
L P ++ ++ V Q + + S C++AS V E + E
Sbjct: 481 LVPPLLK------QTITGVKTSQTKFANTISCQCAEASVSTLQNSFFVPKEPEGSPDVED 540
Query: 541 NEQQCPRVMTAARTLYDI-VSTASRQHIDGIVKWPKKSSQKSMKARKLK-----SEETEE 600
+ CP+++ AARTL DI V +A+ + +GI++WPKK SQKSMKARK K E
Sbjct: 541 DALSCPQLLEAARTLCDIAVQSANHDNPNGILRWPKKLSQKSMKARKSKLIEKPLERHRT 600
Query: 601 LYAAPTAYTTEGHIHTSKKPKLGAAE-SRRDITQTSRRKEPLNLATPRSSRSSPSKFVKD 656
++ ++ + + + K AAE + + + L L+T + + S
Sbjct: 601 TVSSIDLNSSNNNNNKNHVRKDSAAEHNHHHHHHHPKPSKRLKLSTMENKKRSFPSSSSP 660
BLAST of Sed0002932 vs. TAIR 10
Match:
AT1G64050.2 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages. )
HSP 1 Score: 213.0 bits (541), Expect = 7.3e-55
Identity = 222/701 (31.67%), Postives = 335/701 (47.79%), Query Frame = 0
Query: 1 MEALELTFPVDVAAAAPKLMGSDGSARTEVELCGFVPSSSCSFAFQHFRSYGRGKAGTSS 60
M+ L+++ PVDV+ A KLMGS+G CG S + A + SS
Sbjct: 1 MDGLKISCPVDVSLPA-KLMGSEG--------CGGGVRVSSNKADNNCDKARVSIGVNSS 60
Query: 61 INNLGSVSLGKKIPDGAVSKGGEDP--------SEDF--------------ESRNKRSHL 120
I S S+ KK S G D S DF E +N S +
Sbjct: 61 IERCSSASINKK----GSSSGASDSSLWRKLMHSHDFVHDRLTKLRVDNSSEPQNGYSPI 120
Query: 121 STLSPGVQLRKSLKVSRSSSSSLCSKRRVVQWEDSLLLSGADEVRDSSDKLGSYLKKCGS 180
++ RK K+SRSSS + +R + D + + D D+ + G C
Sbjct: 121 ASPESAESPRKRGKLSRSSSGNGTPRRTKLILLDETVRTQRD--NDTKEICGQGSTSC-- 180
Query: 181 HEKTQLLKQKSGLSSKRGEKRNLKVSMKTKFDSLSINYGNGSAAAGSSFFGLYGLKSDVH 240
+K ++KQ+S + KRG+KR KV ++T +IN SA ++FFG YGLK ++
Sbjct: 181 LDKPFVVKQRSSYNGKRGDKRISKVPVRT---LSTIN----SATGENAFFGAYGLKPAIN 240
Query: 241 DFTKLTDDPPLNDLLDGSYDCTSSSIDRGKIDATVNECFLQSIRKACSVLQLPWPIRTQN 300
D TKL +D L LL+GSY+C S D+ K N L ++ S+L P+++Q+
Sbjct: 241 DVTKLVEDFSLKSLLEGSYECPSLGKDKMKKSENTNNTLLSVVKNVWSILPTKRPVQSQS 300
Query: 301 TEESESCSN---SKPSTSLVSSVSSIEEGVILDVKGTSATDALSLNKVQDVCSNSETLTK 360
+ E ++C + P +S+ +++ + E ++ +A D + +D C NSE +
Sbjct: 301 STELDTCLSRTLGSPPSSISATLPNSE-----NIDKVNALDGDLSSSSKDHCINSEIPST 360
Query: 361 ILDFKLCKPGEIFVILGLPRPKDLESLLQDASK-SVSSKNATDLRLAKHQTHRAILQPFQ 420
L F LC G++ LGLP KDL+SLLQDASK S +SKN D + + H L F
Sbjct: 361 PLSFPLCDAGDVLKRLGLPPSKDLDSLLQDASKPSHNSKNNLDQQRSAKPPHSG-LPHFP 420
Query: 421 WSHSFNGHSKANSDSSKFSANRTTCPGRCWRVGNFS-NIPSASSDCFTKDLESLTFNQSL 480
WS FNG S+ NS+++K +T C GR R+ + S + P +D F +LESLTFNQ+L
Sbjct: 421 WSQPFNGSSRTNSEAAKLVTCKTLCQGRWLRIADTSMSSPEGITDNFA-NLESLTFNQNL 480
Query: 481 FPSTIRVGPEDGKSCASVNYHQCGWDSLPSATCSKAS--------SVLAESREQMNQEAN 540
P ++ ++ V Q + + S C++AS V E + E +
Sbjct: 481 VPPLLK------QTITGVKTSQTKFANTISCQCAEASVSTLQNSFFVPKEPEGSPDVEDD 540
Query: 541 EQQCPRVMTAARTLYDI-VSTASRQHIDGIVKWPKKSSQKSMKARKLK-----SEETEEL 600
CP+++ AARTL DI V +A+ + +GI++WPKK SQKSMKARK K E
Sbjct: 541 ALSCPQLLEAARTLCDIAVQSANHDNPNGILRWPKKLSQKSMKARKSKLIEKPLERHRTT 600
Query: 601 YAAPTAYTTEGHIHTSKKPKLGAAE-SRRDITQTSRRKEPLNLATPRSSRSSPSKFVKDS 656
++ ++ + + + K AAE + + + L L+T + + S
Sbjct: 601 VSSIDLNSSNNNNNKNHVRKDSAAEHNHHHHHHHPKPSKRLKLSTMENKKRSFPSSSSPI 660
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAG6600547.1 | 1.4e-278 | 77.81 | hypothetical protein SDJN03_05780, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023535222.1 | 2.5e-278 | 77.52 | uncharacterized protein LOC111796709 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022942381.1 | 1.2e-277 | 77.52 | uncharacterized protein LOC111447442 isoform X1 [Cucurbita moschata] | [more] |
XP_022979382.1 | 8.2e-274 | 76.98 | uncharacterized protein LOC111479123 isoform X1 [Cucurbita maxima] | [more] |
KAG7031186.1 | 1.4e-273 | 74.61 | hypothetical protein SDJN02_05226 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FNQ1 | 5.9e-278 | 77.52 | uncharacterized protein LOC111447442 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1INL3 | 4.0e-274 | 76.98 | uncharacterized protein LOC111479123 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1C5T9 | 3.6e-267 | 75.07 | uncharacterized protein LOC111008234 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1EYC2 | 2.2e-256 | 74.01 | uncharacterized protein LOC111437570 OS=Cucurbita moschata OX=3662 GN=LOC1114375... | [more] |
A0A5A7TNS5 | 1.8e-255 | 73.74 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... | [more] |
Match Name | E-value | Identity | Description | |
AT1G64050.1 | 5.6e-55 | 31.77 | unknown protein; Has 524 Blast hits to 342 proteins in 101 species: Archae - 0; ... | [more] |
AT1G64050.2 | 7.3e-55 | 31.67 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_c... | [more] |