Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
ATATACTTACCTTTGTTCCCCAATTCGTGCCCTAAAAGCTCTGATTTCTCCTACTCTTAGCCGCAACATGCTAAACACTCCCGTTTTTCTTGCTCTCTCCTTTCGTTTTCTCCCACCTAATTCACTTTCTTGACTCACTCTCCGATCCCAGTCTTTCCGGTGCGATGTCCAAATGGTCTTCTGTAAGTTCAGATTGGTATCTGTTTTCGACGGCGTTTTAGGTTCCTTTGAGATGTTCGGCGGCTGCAGTGAATCGTCTTCTTTTTATAGTTCGCTGTTGTTAGGGCTTCGGTGATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGATGAGGTTGAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGGTGAAACAGCCTTTTTTTTTTTGCTGTAATACTGCTGTTTAATTTTTTGATTTGTTGCTGGTTTCGTATTTTTAACTGCATCCGATGGCTTAGTAACCTCCTCACATTCCAATTGGCCAATACTGGAGGCTATGGAGAGTGATATTTGTTTCGAAAAAGAGATTATAATGAATATAGTGCTTGGAGTTTGATGCATTGCTTGTTCACGAACTGACTCTATGTAGTAGGATGAAAGTTACAGAGCGTTTTGAGTCACTTATTGTTGAAGACCAATGCGTTTCAGTAGCCCCATGAAGACAGCCACCACTTTGTAATAACATTTTTTCAATAAATGGGTTCCCGCGGATTTTGCGAATTGAAGTGTTTTATATAAACCCTAGGCGTTTGTGAGGATGTCTTATCCTTTTTTCTTTTTGTGCTTTTTCATTATTAATAAAGGACTCTTCTTATTAAATGTGTTCCAGTGGATTTTGAGTGAGGTTGACTTGGGATCCAGCATGCAGATTACTTAAATGTGGCAGTCGTTTGGTTTTGTATTTTATTATTTTGAAAGGGAAATGTGATAAAGAAAAAATTATGCTCTTAGGATATGGTGGCGGAAACATCAAACATTTTGCATAAGAAGAAACACTGTGTTTTTTTTTAGTATTCTTTCATAGGATGGTTTATATCATCTGTCTAGTGAAATTCTGAACATGGAATAAGGTCTATATGATGGATTTAGACAATCTTGAAGAAAAAATTTGCATCTTTTTTTTAACATACTTTCTATAGATGGATGACGATGATAGTTGGTTCTGGCTGCTGCATTTGTGAAAGGATTGTCATATATTTTGGTTTATTTGGACTTTTTTAAGTCTTCTACTTTTTTATTCTTTGCCACATATCATCTGTGTATGTGATTCCACTTCGAGTTGATTTTTGTAAAATTTATATATCACAAGTCTCTAGCATATCATCTCCTTTATCTGTCATTTTTTCTTTCTTCATTTTCTATATTTTAATGAAATAATGGTCTATAAATGTCGGTGTGCCAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATCCCTGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCATTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGGTAATGTTCAGTTCCCAATTTAATGTTATAAATGTTCATCTTTTGCTATTACTTTAGACTATTCTTATCATTTCCAAATAACGCCATGCAATACATCTATTTCATATTTTCAGATAGGATCATTGTGCAAGAAATCTGATTAAAAAATAATCCAAATGTTTAGCTATAACTAGAAGCAATAG
GAACGGATAAAGAATCTAAAAACTAAAAGTAGCCTCAGTTTAAAATTATTACATGTGAATTGTTTATAAGATCCACGTGTAGAGGACCATGAAAGGCAATTCAATGAACTTTGATCCCAATTATTCAGGATGTGGTTATTTTCTTCTTTGCTATTTGAATGTTATGATTCTTTCATTTATCGTAAATTTATGACAGAGGAGGATATTTGGGAAATTTTTGGTTACATTGGTAATGTTATCAATTCATTGATCTTACTATGTACTTTCTATTTCTAGAGTTAAGAAATCATTGTATTCATATTTTCTTGACTTAATGTATTGAAAAACTGGTGCTACGGTGGTGATTTGAATATTGTGCAAGCTTAGAATGGGCAATATCTGTTGCATATATGTGATTGAGTTAGGATATCCTTTCGCATTTTCTATACTTCCTATGCTTTCCTCATTTAAGTTGTTACTGATGTATACACCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAAAAAAAACAGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAAAGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGAAATGGCTCAGCTACTGCAGGGTGCATTTTTTTCGGTATGATGATGTCCTGTGAATTATTAATTAATGTGCTCAGAAACTTGCATTTATGTGGTACTTATGGTATTTTGAGGACCTTATGACATGTATTGAAGAAAGACTCCACTGGATAATGCAGACGTTTTAAATTGTGTAGATATACTTAGTTTGTTGTGAAGAAAATTTATATCATTGTTGTGACAGAACCTAAATTGAGGGATAACTAATAAGTCTAACCAAATTAGAACAGTTCTAATTCTATCATGTCTTGATTTGTGTGTTACTTCAATATTTTTCTGTAAAGATATTAATTGGATAGTCCTTGTAATTAAGATATCTCTGGAAAGACATTATCACATGAATAGCCTTTTGAATAAGAGGTATAGCCATTGCACCTATCTCGCGAAAAGATAGAAAGTGAAATGTCGGCTATTTTATTTTGACTTCCAACAAGAGTTATTCCTTGAGTATGTCTTGATGTACAAAAATGAGTTGCGTCCCAGTTTTTACAAGTGTTATAATGAAAGCAATTTCTCTTTACCTTCTTTGATTTTTTTTTCCTTCCATCACTATAGGTCGCTACTGGATCTTGTTTTCAGTTTTGCAAAGAGTTTAATTGCTCAGTGCTCAATTCCTTTAGTTGTTACTTCCTATTATGACATAAGTAAAAAAACAAATCACTTGTTTATAATCTATTTATAATAAAGGTTCTCATTGGTTTTTACTTTTGTGCACCTTACTTTGCATTACACAGCATAAAATCATTTATGGGCCTTGTGAAATGTTTACAGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCACCGCTGAATGGTCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCCGTCCGTCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAATTTTGATGCAAAAGAACTTAGTGCAACAGATGCACCTTCATTAAGCAAGGTCTGTGAGTAAGAATTTGGCTATATCTCTCTGGCCTATGTGCACATGCACTCACTCTAACACATACTGTACCTTAGTGTTTGTTGTTTTTAACTTCTTGATCTGTACGCTGACTATACATTTTTATTCAATTGGTTTAAATTATTTTGACATCTCTTTTATAGTTATTTGAGATCTGTCTACAACCTGATTGAATCTATTTTTGCTGATTCTCTTATGTGGTAATT
GCCGTGGCAAAATACTCAATTCTTCATCGAAAGGATATTTTTTAGTGTCCTAAACTTCAATCATAAATTCGGAGTTTCTTTCCATTATTACATTTAGATTTAGACCTTAATTTCGTTCTCTCTTGGTTTTCCATGAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTGCCAAAGGATTTGGAATCTTTGCTTCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCGACAGATTTGCGTTCAGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGACTTTTAACCAGAGTTTATTTCCTTCAACAATGGGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGGTAAGGATTTATTGGAAGGCATTTATGTCGCGTAATTGAGATACAGTATCCACCTTTGAACCATTCTGCTATTATTGTTTATTTTACTTAAATCGTGACCACTGCTTTTGGTCTACATACAGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGGTATGACCTCCTTATTTTTCTGCTATGAATCAACTAAATCTTATTCTTGATTATTATTATTTTTTCTTTTAAAGTTAGTAGTTAAGATCCAATAAATTATAAAAAAGTTGAACTGTGTCCGTAACTGACTTTTCAATGGCACCATAAGAAATACCCACTTAATTTCTTATATCACATAAATGACTTCTCTTAATGGGGTCCTTTCTCTGAAGGAACCAAAAAGAATCCACTGAAGGAAGATGCTTATTGAAGAGCTGAGGGAGAGAAAGATTGATCTTGAGAGAGAAAAGTAAGAGAGAGGAAAGTGAGATGAGGAAGAGAAGTCCTTGAGGAGTAATCTGGGTCTGTAGTTATAAACGAGAAGTTTTTTCAGAAGTTCAGTAGTTGAAGAAAGTGCTTTGCTAGTCTGAGAATTTTTGGCAAGTACAGAGAAGGCTGCTCTAGAAAACATTAAGCGTTTTAGAGGAAACTAAATCAAATCCCTTGCATTTATTTGTTATGCATAATTCAGAAATGTTTAAAACTTGGATTCTTAATCTCCCTAGAGCTATGAGTACGAACTCCTTAGTTTGGCTAGTGTGTCCATGTCTAATACGTTTTAGACACTTCAACACTTCAACACTTCGACATTTGTTGGATATGTATCAGATACTCTTTGGTACAAAATCAATATGAGTTCTATATTTGTCCGACACTTATCGAACACTTATTAAGTATACTAAATAGACACATAGGACAAAAATCATTAAGTTTGAGAGTAAGACATATCAAACTCATTTATTTAATCATATAAATGCCTTAACTTATTGACTTCGGATTTCTTTAGACACCTATCGTACACTTCTAAAGTATACTAAATACACATATAAGACGATAATCATTAAATTTGAGAGTGAAATACATCTAACTCATTTATTTAATCTTATAAATGTCTTAACTTGTTGACTTTGGATTTCTTTATATAAACGTATCCTTGATGTGTCTATGTCATAATTTTTTTAAATAATCATGTGTTTCGTGTGATGTCATATCCACGTCCTGTTCTTCTTAGCGATAGAGAACTGTAGATATAATGTAGGTCTCTCAAACATTAAGCCACTTTCCTAATCTTTTTTGCGAATGCTATCTCGCTGCAGCTTGAAACTTTATGAATTTTGTTGTGTCATTTGCATGCAACTAAAGCC
TAATTGAATTTTTTGTCTTCGGCTATTGATATTGCTTCTCTACAGAGCAGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGTAAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAACCGAAGAATTATATGCCGCACCGACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCATTTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGAAGAGACATTACCCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTACGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCTTTAGAGATTCGGTTTCGGAAGCTAAACATTCAACCTCCGGCCTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACCCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGACACGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAGGAACAGGTTAGATTGATCAAGAAACTCTACATTATTATATCCCTTTCCAACAAAACGCTGCTATTAAATTAGAACCACTCCTCTGGGCTTAGGTGTGGCTAAACTGGTAGGTTGTTTGTGTACATAATATTGTTGTAAGTGGCTGGTTGATATGATTATTGTAATGGTCCGTCCTGAATAGTGACACGCTTTCTAATTGTTGTAACATCACATCAACTTCATTCCAGAACGCAGTAGCAAATTCCTTGCCACTGTGTGAAAAAGTATTTGACGAGTCAATAGAAATTTTAGTTCTTATTACTGTTCCCTTGT
mRNA sequence
ATATACTTACCTTTGTTCCCCAATTCGTGCCCTAAAAGCTCTGATTTCTCCTACTCTTAGCCGCAACATGCTAAACACTCCCGTTTTTCTTGCTCTCTCCTTTCGTTTTCTCCCACCTAATTCACTTTCTTGACTCACTCTCCGATCCCAGTCTTTCCGGTGCGATGTCCAAATGGTCTTCTGTAAGTTCAGATTGGTATCTGTTTTCGACGGCGTTTTAGGTTCCTTTGAGATGTTCGGCGGCTGCAGTGAATCGTCTTCTTTTTATAGTTCGCTGTTGTTAGGGCTTCGGTGATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGATGAGGTTGAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATCCCTGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCATTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAAAGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGAAATGGCTCAGCTACTGCAGGGTGCATTTTTTTCGGTTCTCATTGGTTTTTACTTTTGTGCACCTTACTTTGCATTACACAGCATAAAATCATTTATGGGCCTTGTGAAATGTTTACAGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCACCGCTGAATGGTCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCCGTCCGTCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAATTTTGATGCAAAAGAACTTAGTGCAACAGATGCACCTTCATTAAGCAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTGCCAAAGGATTTGGAATCTTTGCTTCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCGACAGATTTGCGTTCAGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGACTTTTAACCAGAGTTTATTTCCTTCAACAATGGGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGAGCAGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGT
AAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAACCGAAGAATTATATGCCGCACCGACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCATTTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGAAGAGACATTACCCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTACGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCTTTAGAGATTCGGTTTCGGAAGCTAAACATTCAACCTCCGGCCTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACCCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGACACGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAGGAACAGGTTAGATTGATCAAGAAACTCTACATTATTATATCCCTTTCCAACAAAACGCTGCTATTAAATTAGAACCACTCCTCTGGGCTTAGGTGTGGCTAAACTGGTAGGTTGTTTGTGTACATAATATTGTTGTAAGTGGCTGGTTGATATGATTATTGTAATGGTCCGTCCTGAATAGTGACACGCTTTCTAATTGTTGTAACATCACATCAACTTCATTCCAGAACGCAGTAGCAAATTCCTTGCCACTGTGTGAAAAAGTATTTGACGAGTCAATAGAAATTTTAGTTCTTATTACTGTTCCCTTGT
Coding sequence (CDS)
ATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGATGAGGTTGAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATCCCTGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCATTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAAAGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGAAATGGCTCAGCTACTGCAGGGTGCATTTTTTTCGGTTCTCATTGGTTTTTACTTTTGTGCACCTTACTTTGCATTACACAGCATAAAATCATTTATGGGCCTTGTGAAATGTTTACAGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCACCGCTGAATGGTCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCCGTCCGTCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAATTTTGATGCAAAAGAACTTAGTGCAACAGATGCACCTTCATTAAGCAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTGCCAAAGGATTTGGAATCTTTGCTTCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCGACAGATTTGCGTTCAGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGACTTTTAACCAGAGTTTATTTCCTTCAACAATGGGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGAGCAGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGTAAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAACCGAAGAATTATATGCCGCACCGACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCATTTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGAAGAGACATTACCCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTACGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCTTTAGAGATTCGGTTTCGGAAGCTAAACATTC
AACCTCCGGCCTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACCCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGACACGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAGGAACAGGTTAG
Protein sequence
MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKVGTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTGLYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDSVSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG
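As a cross-check, the protein sequence above can be reproduced by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and a sequence that starts in frame 0. The function name `translate` and the table layout are illustrative choices, not part of this database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS listed above.
print(translate("ATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTG"))  # prints MDALELNYPVDV
```

Passing the full CDS string to `translate` should return the protein sequence shown above, provided the CDS is complete and in frame.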
Homology
BLAST of Carg14118 vs. NCBI nr
Match:
KAG7031186.1 (hypothetical protein SDJN02_05226 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1388.2 bits (3592), Expect = 0.0e+00
Identity = 710/710 (100.00%), Positives = 710/710 (100.00%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
Query: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
Query: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN
Sbjct: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
Query: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
Query: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT
Sbjct: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
Query: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ
Sbjct: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
Query: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
Query: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS
Sbjct: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
Query: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 711
VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 710
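The percentages in each HSP header follow directly from the match counts, for example 681/710 = 95.92%. The sketch below is one way to parse a header line of the form used in this report and recompute both figures. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the exact header layout shown here, accepting either spelling of the positives label.

```python
import re

# Matches header lines such as:
#   Identity = 681/710 (95.92%), Positives = 681/710 (95.92%), Query Frame = 0
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return match counts and recomputed percentages for one HSP header."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + line)
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned_length": ident_len,
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 676/710 (95.21%), Positives = 678/710 (95.49%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])  # prints 95.21 95.49
```

Note that the denominator is the alignment length including gaps, which is why hits with a 28-residue gap still report 710 aligned positions.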
BLAST of Carg14118 vs. NCBI nr
Match:
KAG6600547.1 (hypothetical protein SDJN03_05780, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1308.1 bits (3384), Expect = 0.0e+00
Identity = 681/710 (95.92%), Positives = 681/710 (95.92%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 15 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 74
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 75 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 134
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 135 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 194
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFF                            G
Sbjct: 195 KRNLKVSLKTKFDSLPINAGNGSATAGCIFF----------------------------G 254
Query: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 255 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 314
Query: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN
Sbjct: 315 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 374
Query: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 375 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 434
Query: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT
Sbjct: 435 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 494
Query: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ
Sbjct: 495 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 554
Query: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
HCPRVMAAAQTLYDIATRAA RQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 555 HCPRVMAAAQTLYDIATRAASRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 614
Query: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS
Sbjct: 615 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 674
Query: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 711
VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG
Sbjct: 675 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 696
BLAST of Carg14118 vs. NCBI nr
Match:
XP_023535222.1 (uncharacterized protein LOC111796709 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1298.5 bits (3359), Expect = 0.0e+00
Identity = 676/710 (95.21%), Positives = 678/710 (95.49%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIII+EVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFF                            G
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFF----------------------------G 240
Query: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
Query: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS TDAPSLSKVQDACSN
Sbjct: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSN 360
Query: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
Query: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT
Sbjct: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
Query: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
FNQSLFPSTM VGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVD RGKMNREANEQ
Sbjct: 481 FNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDFRGKMNREANEQ 540
Query: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
Query: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKF RDS
Sbjct: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFIRDS 660
Query: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 711
VSEAKHSTSG+VKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 VSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 682
BLAST of Carg14118 vs. NCBI nr
Match:
XP_022942381.1 (uncharacterized protein LOC111447442 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1295.4 bits (3351), Expect = 0.0e+00
Identity = 675/710 (95.07%), Positives = 677/710 (95.35%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIII+EVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
KRNL VSLKTKFDSLPINAGNGSATAGCIFF                            G
Sbjct: 181 KRNLSVSLKTKFDSLPINAGNGSATAGCIFF----------------------------G 240
Query: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
LYGLKSDVHDFTKLTDDP LNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 241 LYGLKSDVHDFTKLTDDPTLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
Query: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS TDAPSLSKVQDACSN
Sbjct: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSN 360
Query: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
Query: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT
Sbjct: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
Query: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
FNQSLFPSTM VGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ
Sbjct: 481 FNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
Query: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
Query: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKF RDS
Sbjct: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDS 660
Query: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 711
VSEAKHSTSG+VKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 VSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 682
BLAST of Carg14118 vs. NCBI nr
Match:
XP_022979382.1 (uncharacterized protein LOC111479123 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1273.1 bits (3293), Expect = 0.0e+00
Identity = 665/707 (94.06%), Positives = 668/707 (94.48%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIII+EV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFF                            G
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFF----------------------------G 240
Query: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
LYGLKSDVHDFTKLTDDPPLN LLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 241 LYGLKSDVHDFTKLTDDPPLNDLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
Query: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
LPWPV PQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS TDAP LSKVQDACSN
Sbjct: 301 LPWPVHPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSN 360
Query: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
Query: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESL
Sbjct: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLI 480
Query: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
FNQSLFPSTM VGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANE
Sbjct: 481 FNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEP 540
Query: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
Query: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
GLWSNNSFKNEG LHPSKKPKPGTVESRRDITQTNNRKGPLNWA TKSSRSSPSKF RDS
Sbjct: 601 GLWSNNSFKNEGQLHPSKKPKPGTVESRRDITQTNNRKGPLNWAATKSSRSSPSKFVRDS 660
Query: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG 708
VSEAKHSTSG+VKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG
Sbjct: 661 VSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG 679
BLAST of Carg14118 vs. ExPASy TrEMBL
Match:
A0A6J1FNQ1 (uncharacterized protein LOC111447442 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447442 PE=4 SV=1)
HSP 1 Score: 1295.4 bits (3351), Expect = 0.0e+00
Identity = 675/710 (95.07%), Positives = 677/710 (95.35%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIII+EVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
KRNL VSLKTKFDSLPINAGNGSATAGCIFF                            G
Sbjct: 181 KRNLSVSLKTKFDSLPINAGNGSATAGCIFF----------------------------G 240
Query: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
LYGLKSDVHDFTKLTDDP LNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 241 LYGLKSDVHDFTKLTDDPTLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
Query: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS TDAPSLSKVQDACSN
Sbjct: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSN 360
Query: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
Query: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT
Sbjct: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
Query: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
FNQSLFPSTM VGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ
Sbjct: 481 FNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
Query: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
Query: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKF RDS
Sbjct: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDS 660
Query: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 711
VSEAKHSTSG+VKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 VSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 682
BLAST of Carg14118 vs. ExPASy TrEMBL
Match:
A0A6J1INL3 (uncharacterized protein LOC111479123 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111479123 PE=4 SV=1)
HSP 1 Score: 1273.1 bits (3293), Expect = 0.0e+00
Identity = 665/707 (94.06%), Positives = 668/707 (94.48%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIII+EV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFF                            G
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFF----------------------------G 240
Query: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
LYGLKSDVHDFTKLTDDPPLN LLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 241 LYGLKSDVHDFTKLTDDPPLNDLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
Query: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
LPWPV PQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS TDAP LSKVQDACSN
Sbjct: 301 LPWPVHPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSN 360
Query: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
Query: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESL
Sbjct: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLI 480
Query: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
FNQSLFPSTM VGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANE
Sbjct: 481 FNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEP 540
Query: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
Query: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
GLWSNNSFKNEG LHPSKKPKPGTVESRRDITQTNNRKGPLNWA TKSSRSSPSKF RDS
Sbjct: 601 GLWSNNSFKNEGQLHPSKKPKPGTVESRRDITQTNNRKGPLNWAATKSSRSSPSKFVRDS 660
Query: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG 708
VSEAKHSTSG+VKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG
Sbjct: 661 VSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG 679
BLAST of Carg14118 vs. ExPASy TrEMBL
Match:
A0A6J1C5T9 (uncharacterized protein LOC111008234 OS=Momordica charantia OX=3673 GN=LOC111008234 PE=4 SV=1)
HSP 1 Score: 1090.9 bits (2820), Expect = 0.0e+00
Identity = 582/716 (81.28%), Positives = 624/716 (87.15%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
MDA+EL YPVDVAAPKLMGPDGSVRTG+ I+EVELCE+DR SAPPSYSFQHFSSYGSQK
Sbjct: 1 MDAVELTYPVDVAAPKLMGPDGSVRTGVTIEEVELCESDRVSAPPSYSFQHFSSYGSQKA 60
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSIND+GSVSLD+IPDGAVSKDGE T ED ESRNKRS L TSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDVGSVSLDKIPDGAVSKDGEGTSEDLESRNKRSLLFTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKR-RLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSH--EKPQLMKQKSSVSSK 180
SSSLCSKR R+V+LEDSL LSGAD+VKDTSDKLGSYLKKC SH EK QL+KQKSS+SSK
Sbjct: 121 SSSLCSKRPRVVRLEDSLFLSGADDVKDTSDKLGSYLKKCSSHETEKAQLLKQKSSLSSK 180
Query: 181 RGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEM 240
RGDKRNLKVSLKTKFDSL INAGNGSA AG
Sbjct: 181 RGDKRNLKVSLKTKFDSLSINAGNGSAAAG----------------------------SS 240
Query: 241 FTGLYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACS 300
F LYGLKSDVHDFTKL DDPPLN LLDGSYD +SLS KGKKDTNVNECFLQS+RKACS
Sbjct: 241 FLALYGLKSDVHDFTKLVDDPPLNDLLDGSYDSASLSIDKGKKDTNVNECFLQSVRKACS 300
Query: 301 VLQLPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDA 360
VLQLPWPV PQN AESE CSNSKPSTS+VS VSSMEEGVNFD KE ATD+PSL+KV+DA
Sbjct: 301 VLQLPWPVHPQNIAESEGCSNSKPSTSIVSYVSSMEEGVNFDVKEPIATDSPSLNKVRDA 360
Query: 361 CSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSV-SSKNATDLRSAKQQ 420
CSNSETLTN LDFKLYKPDDMFVK+GLPLPKDLESLLQDASKSSV SSKN TDLRSAKQQ
Sbjct: 361 CSNSETLTNPLDFKLYKPDDMFVKMGLPLPKDLESLLQDASKSSVSSSKNVTDLRSAKQQ 420
Query: 421 SRRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNL 480
SRRA+LQPFPWSHSFNGHSK+NSDSSKFSANRTTC GRWWR+GNF++IP+ATADCFTK+L
Sbjct: 421 SRRAMLQPFPWSHSFNGHSKSNSDSSKFSANRTTCPGRWWRIGNFSSIPSATADCFTKDL 480
Query: 481 ESLTFNQSLFPSTMG-VGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNR 540
ESLTFNQSLFPSTM VGPDD +SSSVSVNHHQ GWDSLSSA CSKASS+LV+SRGK N
Sbjct: 481 ESLTFNQSLFPSTMRVVGPDDRRSSSVSVNHHQCGWDSLSSAICSKASSVLVESRGKTNY 540
Query: 541 EANEQHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYA 600
EAN+Q CP+V+AAA+TLYDIAT AA RQNIDGIV+WPKKPSQKSM+ARKLKSEETEELYA
Sbjct: 541 EANDQQCPKVIAAAKTLYDIATYAASRQNIDGIVRWPKKPSQKSMRARKLKSEETEELYA 600
Query: 601 APTTYGLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSK 660
AP TYGLWS+N FK+EGH+H SKKPK GT ESRRD+ TN R+GPLNWAT +SSRSSPSK
Sbjct: 601 AP-TYGLWSDNPFKSEGHMHSSKKPKLGTTESRRDLAHTNCRRGPLNWATPRSSRSSPSK 660
Query: 661 FFRDSVSEAKHSTSGLVKQSSMMPPPATH-LSKASEGQQKTRKLMLMDWKRGGGTG 711
F RDS S+AKHSTSG+VK SSMMPPPAT L K EGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 FVRDSASDAKHSTSGIVKPSSMMPPPATTLLCKGGEGQQKTRKLMLMDWKRGGGTG 687
BLAST of Carg14118 vs. ExPASy TrEMBL
Match:
A0A6J1EYC2 (uncharacterized protein LOC111437570 OS=Cucurbita moschata OX=3662 GN=LOC111437570 PE=4 SV=1)
HSP 1 Score: 1055.4 bits (2728), Expect = 1.0e-304
Identity = 568/711 (79.89%), Positives = 604/711 (84.95%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
M ALEL PVDV KLMGPDGSVRTG+ I+EVELCEADRGSAPPSYSFQHFSSYG +K
Sbjct: 1 MYALELTCPVDVVVSKLMGPDGSVRTGVTIEEVELCEADRGSAPPSYSFQHFSSYGCKKD 60
Query: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLG VSLD++PDGAV KDGE+T EDFESRNKRSHLSTSS GVQ RK LKVSR
Sbjct: 61 GTSSINDLGPVSLDKVPDGAVFKDGENTSEDFESRNKRSHLSTSSLGVQPRKPLKVSR-G 120
Query: 121 SSSLCSKR-RLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRG 180
SSSLCSKR R+VQLED L LSGAD D SDKLGSYLKKC SHEK QL+KQKSS+SSKRG
Sbjct: 121 SSSLCSKRPRVVQLEDPLFLSGAD---DVSDKLGSYLKKCNSHEKTQLLKQKSSLSSKRG 180
Query: 181 DKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFT 240
DKRNLKVSLKTKFDS NAGNGSA AG F
Sbjct: 181 DKRNLKVSLKTKFDSFSTNAGNGSAAAG----------------------------SSFH 240
Query: 241 GLYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVL 300
GLYGLKS DFTKLTDDPPLN +LDGSYD ++LSK KGKKDTNVNECFLQSIRKACSVL
Sbjct: 241 GLYGLKSGARDFTKLTDDPPLNDILDGSYDCANLSKDKGKKDTNVNECFLQSIRKACSVL 300
Query: 301 QLPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACS 360
QLPWPVRPQN AESESCSNSKP TSLVSSVSSMEE VNFD KELSATD+PSL+KV+DAC+
Sbjct: 301 QLPWPVRPQNMAESESCSNSKPDTSLVSSVSSMEEKVNFDVKELSATDSPSLNKVEDACN 360
Query: 361 NSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRR 420
NSE LTN LDFKLYKPD MF+KLGLP+PKDL SLLQDASKSSVSS NATDLRSAKQQSRR
Sbjct: 361 NSEPLTNALDFKLYKPDHMFMKLGLPIPKDLNSLLQDASKSSVSSNNATDLRSAKQQSRR 420
Query: 421 AILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESL 480
A+LQPF WSHSFNGHSKANSDSSKFSANRTTC GRWWRV NF+NIP+ATADCFTK+LESL
Sbjct: 421 AMLQPFAWSHSFNGHSKANSDSSKFSANRTTCLGRWWRVRNFSNIPSATADCFTKDLESL 480
Query: 481 TFNQSLFPSTMGV-GPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREAN 540
TFNQSLFPSTM V GPDDG+ SS+SVNHHQ GWDSLSSATCSK SS+LV+SRGKMN E+
Sbjct: 481 TFNQSLFPSTMRVIGPDDGR-SSISVNHHQCGWDSLSSATCSKTSSVLVESRGKMNSESY 540
Query: 541 EQHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPT 600
EQ CPRVMAAAQTLYDIAT AALRQNIDG+V+WPKK SQKSM+ARKLKSEETEELY PT
Sbjct: 541 EQQCPRVMAAAQTLYDIATSAALRQNIDGMVRWPKKASQKSMRARKLKSEETEELYTTPT 600
Query: 601 TYGLWSNNSFKNEG-HLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFF 660
TYGLWSNNS KNEG H HPSKKPK GT ESRRD+ QTN ++GPLNW T +SSRSSPSKF
Sbjct: 601 TYGLWSNNSIKNEGHHAHPSKKPKLGTTESRRDVAQTNCKRGPLNWTTPRSSRSSPSKFI 660
Query: 661 RDSVSEAKHSTSGLVKQ-SSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG 708
RDSVSEAK ST+G +KQ SSMMPPPAT L KA EGQQKTRKLMLMDWKRGG
Sbjct: 661 RDSVSEAKPSTAGAIKQSSSMMPPPATLLCKAGEGQQKTRKLMLMDWKRGG 678
BLAST of Carg14118 vs. ExPASy TrEMBL
Match:
A0A6J1JD46 (uncharacterized protein LOC111483405 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483405 PE=4 SV=1)
HSP 1 Score: 1045.4 bits (2702), Expect = 1.1e-301
Identity = 559/694 (80.55%), Positives = 595/694 (85.73%), Query Frame = 0
Query: 18 MGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKVGTSSINDLGSVSLDEIP 77
MGPDGSVRTG+ I+EVELCEADRGSAPPSYSFQHFSSYGS+K GTSSINDLG VSLD++P
Sbjct: 1 MGPDGSVRTGVTIEEVELCEADRGSAPPSYSFQHFSSYGSKKDGTSSINDLGPVSLDKVP 60
Query: 78 DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSSSSSLCSKR-RLVQLEDS 137
DGAV KDGE+T EDFESRNKRSHLSTSSPGVQ RK LKVSR SSSLCSKR R+VQLED
Sbjct: 61 DGAVFKDGENTYEDFESRNKRSHLSTSSPGVQPRKPLKVSR-GSSSLCSKRPRVVQLEDP 120
Query: 138 LLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGDKRNLKVSLKTKFDSLP 197
L LSGAD D SDKLGSYLKKC SHEK QL+KQKSS+SSKRGDKRNLKVSLKTKFDS
Sbjct: 121 LFLSGAD---DVSDKLGSYLKKCNSHEKTQLLKQKSSLSSKRGDKRNLKVSLKTKFDSFS 180
Query: 198 INAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTGLYGLKSDVHDFTKLTD 257
NAGNGSA AG F GLYGLKS DFTKLTD
Sbjct: 181 TNAGNGSAAAG----------------------------SSFHGLYGLKSGARDFTKLTD 240
Query: 258 DPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESC 317
DPPLN +L+GSYDY++LSK KGKKDTNVNECFLQSIRKACSVLQLPWPVRPQN AESESC
Sbjct: 241 DPPLNDILNGSYDYANLSKDKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNMAESESC 300
Query: 318 SNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSNSETLTNVLDFKLYKPD 377
SNSKP TSLVSSVSSMEE VNFD KELSATD+PSL+KV+DAC NSE LTN LDFKLYKPD
Sbjct: 301 SNSKPDTSLVSSVSSMEEKVNFDVKELSATDSPSLNKVEDACDNSEPLTNALDFKLYKPD 360
Query: 378 DMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSK 437
MF+KLGLP+PKDL SLLQDASKSSVSS NATDLRSAKQQSRRA+LQPF WSHSFNGHSK
Sbjct: 361 HMFMKLGLPIPKDLNSLLQDASKSSVSSNNATDLRSAKQQSRRAMLQPFAWSHSFNGHSK 420
Query: 438 ANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMG-VGPD 497
ANSDSSKFSANRTTC GRWWRV NF+NIP+ATADCFTK+LESLTFNQSLFPSTM VGPD
Sbjct: 421 ANSDSSKFSANRTTCLGRWWRVRNFSNIPSATADCFTKDLESLTFNQSLFPSTMRVVGPD 480
Query: 498 DGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDI 557
DG+ SS+SVNHHQ GWDSLSSATCSK SS+LV+SRGKMN EANEQ CPRVMAAAQTLYDI
Sbjct: 481 DGR-SSISVNHHQCGWDSLSSATCSKTSSVLVESRGKMNNEANEQQCPRVMAAAQTLYDI 540
Query: 558 ATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG-HL 617
AT AALRQNIDG+V+WPKK SQKSM+ARKLKSEETEELY PTTYGLWSNNS KNEG H
Sbjct: 541 ATSAALRQNIDGMVRWPKKASQKSMRARKLKSEETEELYTTPTTYGLWSNNSIKNEGHHA 600
Query: 618 HPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDSVSEAKHSTSGLVKQ 677
HPSKKPK GT ESRRD+ QTN ++GPLNW T +SSRSSPSKF RDS+SEAK ST+G +KQ
Sbjct: 601 HPSKKPKLGTTESRRDVAQTNCKRGPLNWTTPRSSRSSPSKFIRDSISEAKPSTAGAIKQ 660
Query: 678 -SSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG 708
SSMMPPPAT L KA EGQQ TRKLMLMDWKRGG
Sbjct: 661 SSSMMPPPATLLCKAGEGQQNTRKLMLMDWKRGG 661
BLAST of Carg14118 vs. TAIR 10
Match:
AT1G64050.1 (unknown protein; Has 524 Blast hits to 342 proteins in 101 species: Archae - 0; Bacteria - 106; Metazoa - 106; Fungi - 24; Plants - 25; Viruses - 0; Other Eukaryotes - 263 (source: NCBI BLink). )
HSP 1 Score: 243.4 bits (620), Expect = 5.4e-64
Identity = 242/736 (32.88%), Positives = 349/736 (47.42%), Query Frame = 0
Query: 1 MDALELNYPVDVAAP-KLMGPDGSVRTGIIIDEVEL---CEADRGSAPPSYSFQHFSSYG 60
MD L+++ PVDV+ P KLMG +G G+ + + C+ R S + S + SS
Sbjct: 1 MDGLKISCPVDVSLPAKLMGSEG-CGGGVRVSSNKADNNCDKARVSIGVNSSIERCSSAS 60
Query: 61 SQKVGTSSINDLGSVSL--------DEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGV 120
K G S + SL D + D +++ E +N S +++
Sbjct: 61 INKKGAGSSSGASDSSLWRKLMHSHDFVHDRLTKLRVDNSS---EPQNGYSPIASPESAE 120
Query: 121 QQRKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLM 180
RK K+SRSSS + +R + L D + + D DT + G C +KP ++
Sbjct: 121 SPRKRGKLSRSSSGNGTPRRTKLILLDETVRTQRD--NDTKEICGQGSTSC--LDKPFVV 180
Query: 181 KQKSSVSSKRGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQH 240
KQ+SS + KRGDKR KV ++T IN+ G
Sbjct: 181 KQRSSYNGKRGDKRISKVPVRT---LSTINSATGE------------------------- 240
Query: 241 KIIYGPCEMFTGLYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECF 300
F G YGLK ++D TKL +D L LL+GSY+ SL K K KK N N
Sbjct: 241 -------NAFFGAYGLKPAINDVTKLVEDFSLKSLLEGSYECPSLGKDKMKKSENTNNTL 300
Query: 301 LQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDA 360
L ++ S+L PV+ Q++ E ++C + + S +++ N D +++A D
Sbjct: 301 LSVVKNVWSILPTKRPVQSQSSTELDTCLSRTLGSPPSSISATLPNSENID--KVNALDG 360
Query: 361 PSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNAT 420
S +D C NSE + L F L D+ +LGLP KDL+SLLQDASK S +SKN
Sbjct: 361 DLSSSSKDHCINSEIPSTPLSFPLCDAGDVLKRLGLPPSKDLDSLLQDASKPSHNSKNNL 420
Query: 421 D-LRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFA-NIPT 480
D RSAK + L FPWS FNG S+ NS+++K +T C GRW R+ + + + P
Sbjct: 421 DQQRSAKPP--HSGLPHFPWSQPFNGSSRTNSEAAKLVTCKTLCQGRWLRIADTSMSSPE 480
Query: 481 ATADCFTKNLESLTFNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKAS-SM 540
D F NLESLTFNQ+L P + K + V Q+ + + S C++AS S
Sbjct: 481 GITDNFA-NLESLTFNQNLVPPLL-------KQTITGVKTSQTKFANTISCQCAEASVST 540
Query: 541 LVDS-------RGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQK 600
L +S G + E + CP+++ AA+TL DIA ++A N +GI++WPKK SQK
Sbjct: 541 LQNSFFVPKEPEGSPDVEDDALSCPQLLEAARTLCDIAVQSANHDNPNGILRWPKKLSQK 600
Query: 601 SMKARKLK--SEETEELYAAPTTYGLWSNNSFKNEGHL-----------HPSKKPKPGTV 660
SMKARK K + E ++ L S+N+ N+ H+ H PKP
Sbjct: 601 SMKARKSKLIEKPLERHRTTVSSIDLNSSNNNNNKNHVRKDSAAEHNHHHHHHHPKP--- 660
Query: 661 ESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDSVSEAKHSTSGLVKQSSMM----PPP 698
R ++ N+K + S SSP + S+ KHS+S K S M PPP
Sbjct: 661 SKRLKLSTMENKK------RSFPSSSSPIE------SDRKHSSSSKFKNHSRMMLPPPPP 666
BLAST of Carg14118 vs. TAIR 10
Match:
AT1G64050.2 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages. )
HSP 1 Score: 243.4 bits (620), Expect = 5.4e-64
Identity = 243/734 (33.11%), Positives = 351/734 (47.82%), Query Frame = 0
Query: 1 MDALELNYPVDVAAP-KLMGPDGSVRTGIIIDEVEL---CEADRGSAPPSYSFQHFSSYG 60
MD L+++ PVDV+ P KLMG +G G+ + + C+ R S + S + SS
Sbjct: 1 MDGLKISCPVDVSLPAKLMGSEG-CGGGVRVSSNKADNNCDKARVSIGVNSSIERCSSAS 60
Query: 61 SQKVGTSSINDLGSV------SLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQ 120
K G+SS S+ S D + D +++ E +N S +++
Sbjct: 61 INKKGSSSGASDSSLWRKLMHSHDFVHDRLTKLRVDNSS---EPQNGYSPIASPESAESP 120
Query: 121 RKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQ 180
RK K+SRSSS + +R + L D + + D DT + G C +KP ++KQ
Sbjct: 121 RKRGKLSRSSSGNGTPRRTKLILLDETVRTQRD--NDTKEICGQGSTSC--LDKPFVVKQ 180
Query: 181 KSSVSSKRGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKI 240
+SS + KRGDKR KV ++T IN+ G
Sbjct: 181 RSSYNGKRGDKRISKVPVRT---LSTINSATGE--------------------------- 240
Query: 241 IYGPCEMFTGLYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQ 300
F G YGLK ++D TKL +D L LL+GSY+ SL K K KK N N L
Sbjct: 241 -----NAFFGAYGLKPAINDVTKLVEDFSLKSLLEGSYECPSLGKDKMKKSENTNNTLLS 300
Query: 301 SIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPS 360
++ S+L PV+ Q++ E ++C + + S +++ N D +++A D
Sbjct: 301 VVKNVWSILPTKRPVQSQSSTELDTCLSRTLGSPPSSISATLPNSENID--KVNALDGDL 360
Query: 361 LSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATD- 420
S +D C NSE + L F L D+ +LGLP KDL+SLLQDASK S +SKN D
Sbjct: 361 SSSSKDHCINSEIPSTPLSFPLCDAGDVLKRLGLPPSKDLDSLLQDASKPSHNSKNNLDQ 420
Query: 421 LRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFA-NIPTAT 480
RSAK + L FPWS FNG S+ NS+++K +T C GRW R+ + + + P
Sbjct: 421 QRSAKPP--HSGLPHFPWSQPFNGSSRTNSEAAKLVTCKTLCQGRWLRIADTSMSSPEGI 480
Query: 481 ADCFTKNLESLTFNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKAS-SMLV 540
D F NLESLTFNQ+L P + K + V Q+ + + S C++AS S L
Sbjct: 481 TDNFA-NLESLTFNQNLVPPLL-------KQTITGVKTSQTKFANTISCQCAEASVSTLQ 540
Query: 541 DS-------RGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSM 600
+S G + E + CP+++ AA+TL DIA ++A N +GI++WPKK SQKSM
Sbjct: 541 NSFFVPKEPEGSPDVEDDALSCPQLLEAARTLCDIAVQSANHDNPNGILRWPKKLSQKSM 600
Query: 601 KARKLK--SEETEELYAAPTTYGLWSNNSFKNEGHL-----------HPSKKPKPGTVES 660
KARK K + E ++ L S+N+ N+ H+ H PKP
Sbjct: 601 KARKSKLIEKPLERHRTTVSSIDLNSSNNNNNKNHVRKDSAAEHNHHHHHHHPKP---SK 660
Query: 661 RRDITQTNNRKGPLNWATTKSSRSSPSKFFRDSVSEAKHSTSGLVKQSSMM----PPPAT 698
R ++ N+K + S SSP + S+ KHS+S K S M PPP
Sbjct: 661 RLKLSTMENKK------RSFPSSSSPIE------SDRKHSSSSKFKNHSRMMLPPPPPTR 664
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAG7031186.1 | 0.0e+00 | 100.00 | hypothetical protein SDJN02_05226 [Cucurbita argyrosperma subsp. argyrosperma]
KAG6600547.1 | 0.0e+00 | 95.92 | hypothetical protein SDJN03_05780, partial [Cucurbita argyrosperma subsp. sorori...
XP_023535222.1 | 0.0e+00 | 95.21 | uncharacterized protein LOC111796709 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022942381.1 | 0.0e+00 | 95.07 | uncharacterized protein LOC111447442 isoform X1 [Cucurbita moschata]
XP_022979382.1 | 0.0e+00 | 94.06 | uncharacterized protein LOC111479123 isoform X1 [Cucurbita maxima]
Match Name | E-value | Identity | Description
A0A6J1FNQ1 | 0.0e+00 | 95.07 | uncharacterized protein LOC111447442 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1INL3 | 0.0e+00 | 94.06 | uncharacterized protein LOC111479123 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1C5T9 | 0.0e+00 | 81.28 | uncharacterized protein LOC111008234 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1EYC2 | 1.0e-304 | 79.89 | uncharacterized protein LOC111437570 OS=Cucurbita moschata OX=3662 GN=LOC1114375...
A0A6J1JD46 | 1.1e-301 | 80.55 | uncharacterized protein LOC111483405 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity | Description
AT1G64050.1 | 5.4e-64 | 32.88 | unknown protein; Has 524 Blast hits to 342 proteins in 101 species: Archae - 0; ...
AT1G64050.2 | 5.4e-64 | 33.11 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_c...
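The Identity column in the tables above matches the percentage in each HSP summary line, which is the number of identical aligned positions divided by the alignment length (for example 582/716 for A0A6J1C5T9). The following is a minimal Python sketch of that arithmetic. It is not part of the database's own tooling: the names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression assumes the summary-line layout shown on this page.

```python
import re

# Pattern for the HSP summary line as it appears on this page.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (matches, alignment_length, reported_percent) for identity and positives."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(id_n), int(id_len), float(id_pct)),
        "positives": (int(pos_n), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


if __name__ == "__main__":
    line = "Identity = 582/716 (81.28%), Positives = 624/716 (87.15%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    for key, (n, length, reported) in stats.items():
        # Show the reported value next to the value recomputed from the counts.
        print(key, reported, percent(n, length))
```

Note that the denominator is the alignment length including gap columns, not the query or subject length, so the same pair of proteins can report different percentages for alignments of different extent.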