Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGACTGTTCTACAGAGAGAAAAAAAGGCAAGTAAAACTCAAGAACAAGCTCATCACCCACCAACTCCAAAAGAATTCAGAATCAAATGGGCAAAAAAGGTGGATGGAAATTAAAAAGGTAACCCAATTCATTTACGGTTTCTCGAAAGGCAGCCACCCATTTGTTCTTCCTTTTCTCTCTCCTTCGTTTCCCACAGCTCATTGATTCTCGCCCACGTTTGAACAAACAAGCAAAGAATCGTTCAAAATTTGGATACCCCATAAATTTTTTAGGATACCCATAAACGTTTTAGGTTTATCTCTACATACCCAGTAACCCCATGATGATTTTTTAGGTTATAAACAGATCTTCTATCAATTTCGAAGTGTTGGGGCTTTGAATCTCGAAAACCCCACATCCCATTTCAGCTTTTCGAGGTTGAATTTCTGGGTTGTTCTAAAGGGTTGCGAATGTCATGTCCAAGCAGTAGCTTCCGCTACAATGGCAGCCACTGTGCGTGCCCACCAGGCCAGCTTCTCAATCGAAGCAACAATAGCTGTGTTATCTTCAATAGCCCTTCGGTTATCACTACAGGGCGGTTTGAGAGCTATGCTGTTAGCTTCCCTGAGACCATTTTCTCCTTTGATTCAATCAGGAAGTTCACTCAGTCTCAGGCTGTGTTCCTTGAAGCTACTCTGTTCTTGCTGCTTTCTTGGCTCTTTTTCTGTATGTTCCTTAGGTTCATGAAGCTTGGAGATGGGAGAAATATTTGGTTCAGGATGAGATGGTGGGTTAGCAGATTGGATGTTTGCTTTGCCACAAGACATTGGCTGGTTAGCTCTTAGCTTTGCCTCTTCTGTTCATATTTGGAATGAAATCCTTGCTTTCTTTTGTTTTTGAATGGCTTTAATTATGACCGTTTTGTTTTGAGTTGTTTATTTAATGTTCTTCACTGTGCATTATGCGAGATGTTCCCCTGTGTTGACTGTTTCAATTCTTGTGTTAACTGATATTGATATGGTTTCAGTTATGGGACTTGTAAATCCTCAACATGAACCATTAATTAGATAAAGCGGGTTGGGAAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAAACGTGTTTTAAAAACTTTGAGGGGAAGCCCGAAAGGGAAAGCTCAAAGAAGACAATATTTGCTAGTGGCGGGCTTGGACTGTTACAAATGGTATTAGAGCTAGACATTGGGCGATGTGCCAACGAGGAGGCTGAGCCCCAAAGGGGACGGTCACGACGGTGTGCTAGCAAGGACGCTGAGCCCTGAAGAGGGTAGATTGGGGGGTTCCACATCGATTGAAGAAGGGAACGAGTGCCAACGAGGACGCTGGGCCTCAAAGGGGGTGGATTGTGCCAACGAGGAGGCTGAGCCCCAAAGGGGATGGTCACGACGGTGTGCTAGCAAGGACGCTGAGCCCTGAAGAGGGTAGATTGGGGGGTTCCACATCGATTGAAGAAGGGAACGAGTGCCAACGAGGACGCTGGGCCTCAAAGGGGGTGGATTGTGAGATCTCACATTGATTGGAGAGGAGAACGAAACATTTTTTAAAAGGGTGTGGAAACCTCTCTCTAACTGACGCGTTTTAAAAACCTTGAGGGGAAGCCCAAAAGGGAAAGCTCAAAGAGGAGAATATCTGCTAGCGGTGGGGTTGGGCTGTTACAAATGGTATTGCTGAGCCCCAAATGGAATGTGCATACTTTTTATATGTTTGGGATACATAAATGTTCTTACTTTTAATTAGCTTGCTACCTTTATTTATGTTGAAGGACGACCAAAAGGTAGTTACGAAACGTAAAACCGAACTTGGTGGAACGTTCTCAATAGCAAGTTGGATTCTTTTCATCGGCTTGTTTGCTGCGTAAGTACACATTTTCTCGTATACTAAAAACAATTCTCGTGTTCGTTTCATATTGACTTGTTCTTTCAGTGTTATCTCAGGTTGCTTTACCAAATCATATCGAAGAGAAGTATCGAAGTGCATAACGTGAAAGCAGCGAACGCACCAGACTTGGTTTCCTTTGTGAATGATATGGAATTTAATATAACCACGGTCTCGACTATGAGTTGTGCGAATATACGCGGTCTTGGTACCGCTGTATTTGGAAATCCTGGTTTTCTGGAACAGTCAGTAATGCCTCTTTCAAAGTTTGCAAACTACTCGTGTCAAAACACGAGTGAAGGGCCAACTATAAGTGTTAAGTGCGAACGATGTCGTTTCATTCAGGACGATCTTTACGTCTCGTGGCAGTTTGTTGATCTTCCAAATAACCCTGCAAGTGCAGTTGGATTTCAGTTTAACTTCTCTGCTAAGGATCATGTTCAAAAAAATCAGGAAAGTTTCGTTAGTGGTACGTTAAAAAATCGAAGCAATTTCGATGATACGCCAGTTACGTTCCGAGGGAAGAATGCAAATATAATGCAATTTAACCTATTTCCAAGAATATACCGCAGTAAACGTGATTCTAAGCTCATGCAGCCTTTATTTCACGAGTTCGTTTCGGGTTCATCCTTTCAAAATACTAATGAGCTCCAACTATCCCTTGAAAATGCCAATGATGGACTTATCAACATCACCTTGTACATCAATCTTCTCTCATCCTACATTGTTGAGGTGGAGACTCAAAATATTTTGGGCCCTGGTAAGTGTTTTGCTTTATTGACATATGTACTCGTCTTCACACACGCTTCGTTTTTTGTATGTTTGCACAAACAAAATGAATTGTTCGGATCTTGAGATAGGGATGGTTGTTCATTTAGACTTTGTATATATCGAAATAATGTGTCCTTGTCCTAGATTTTCCGAAAGTAATGTGTCGGTGTGAGATCCCACGTCGGTTGGAGAGGAGAATGAAACATTCCTTATAAGGGTGTGAAAACCTCTCCCTAGCAGACGTGTTTTAAAAACCTTGAGAGGAAGCCCAAAAGGGAAAGCCGAAATAGTACAATATCTGCTAGCAGTGGGCTTGAGTCATTACAAATGGTATCAGAGTCAGACACTAGGCGATGTGCTAGCGAGGAGGCTGAGCCCCAGAGGGGGATGGACACGAGGCGGTGTGCCAGCAAGGACGCTGGGCTCTTAAGGGTGGATTTGGGGGTCCCACATCTATTGGAGAAGTGAACGAGTGTCAGCGAGGACGTTGGGCCCTGAAGGGGGGTGGATTGTGAGATTCCACATCGGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGACGTGTTTTAAAAACTTTGAGGGGAAGCCCGAAAGGGAAAGCTCAAAGAGGATAATATCTGCTAGCAGTGGGCTTGAGCCATTACAAATGGTATCAGAGTCAGACATCAGGTGATGTGCTAGCGAGGAGGCTGAGCTCCAGAGGGGGATGGACACGAGGCGGTGTGCCAGCAAGGATGCTGGACCCTGAAGGGGGGTGGATTGGGGGGTCCCACATCGATTGGAAAAGTGAACGAGTGCCAACGAGAACGCTGGGCCCTGAAGGGGGGAGGATTATGAGATCCCACATCGGTTGGGGAGGAGAACGAAACATTCTTTATAAGGGTGTAGAAACCTCTCTCTAGCAAACGTGTTTTAAAAAGCTTGAGGGGAAGCTCGAGAGGGAAAGCCCACAGAGTACAATATATGCTAACAGTGGGCTTAGGCCGTTACCGAATATGTCTAGATTCTCCTGTTGATCGATCTAAAAGTTAATGGTTCGCTTCTATAAAAGATTAATAAATTATGCATTTAATAGAACTTCTCTTGTAACGTCTCTCTTTTGAGTTTTCTAATGTTTGTAATGGTATTTGACTGACACTGATTGGCCCTACAATATCTTTGTTATGCAGTTAGCTTTCTTGCTGATCTTGGTGGCCTATATTGCATTAGTGTTGGGATTTTCTTCTACCTTCTAGTGCAGGTATTACTTTTTTTTTGTAATGGATTTAATTATTACACGTTAGTCCACTGCTAGCAGATGTTGTCCTCTCTGTGTTTTTCCTTTCGGGTTTCCCTTCAAGGTTTTTAAAACGCGTTTGCTAGGGAGGTATTACTNATTGGCCCTACAATATCTTTGTTATGCAGTTAGCTTTCTTGCTGATCTTGGTGGCCTATATTGCATTAGTGTTGGGATTTTCTTCTACCTTCTAGTGCAGGTATTACTTTTTTTTTGTAATGGATTTAATTATTACACGTTAGTCCACTGCTAGCAGATGTTGTCCTCTCTGTGTTTTTCCTTTCGGGTTTCCCTTCAAGGTTTTTAAAATGCGTCTGCTAGGGAGAGGTTTCCATACCCTTATAAAGAATGTTTCGTTCTCCTCTCCAACCGATGTGGGATCTCACAATCCACCCTTTTCAGGGCCCAGCATCCTCGCTGGCACTCGTTCACTTCTCCAATTGATGTGGGACCCCCTAATCCACCCCCCATAGGGCTCAACGTCCTTGCTGGCACACTGCCTCGTGTCCACCCCCCTTCGGGACTCATCCTCCTCGTTGGCACATTTTTCGGTGTCTGGATCTGATACCATTAGTAATAGCCCAAGCCCACCACTAGCAGATATTGTCCTCTTTGGGTTTTCCCTTTCGAACTTCCCCTAAAGGTTCTTAAAACGTGTCTGCTAGGGAGAGGTTTCCACACTCTTCTAAAAAATGTTTCATTCTCCTCCCCAACTGATGTGGGATCTCACAATTCACCTCCCTTCAGGGCCCAGCTTCCTCGCTAGCACTCGTTCACTTCTCCAATTGATGTGGGACCCCCTAATCCACCCCCCTTTAGGGCCCAACGTCCTTGTTGGCACACTGCCTCGTGTCCACCCCCCTTCGGGACTCATCCTCCTCGTTGGCACATTTCCCGGTGTCTGGATCTGATACCATTAGTAACAGCCCAAGCCCACCACTAGCAGATATTGTCCTCTTTGGGTTTTCCCTTTCAAACTTCTCCTAAAGGTTCTTAAAACGTGTCTGCTAGGGGGAGGTTTCCACACTCTTCTAAAAAATGTTTCATTCTCCTCCCCAACTGATGTGGGATCTCACAATTCACCTCCCTTCAGGGCCCAGCTTCCTCGCTAGCACTCGTTCACTTCTCCAATTGATGTGGGACCCCCTAATCCACCCCCCATAGGGCCCAACGTCCTTGTTGGCACACTGCCTCGTGTCCACCCCCTTCGAGGCTCAGCCTCTTCGTTGGCACATCGCCCCGTGTCCACCCCCCTTCGAGGCTCAACCTCTTCGTTGGCACATCGCCCCGTGTCTGGCTCCGATACCATTTGTAACGGCCTAACCCCACCACTAGTAGATATTATCCTCTTTGGGTTTTTCCTTTCAAGCTTCTCCTCAAAGTTTCTAAAACGTGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTCGTTCTCCTCCCCAACCGATGCGGGATCTCACATTTTACGGTGCCATAATGTTCCTCTTCTAGTTTCTTATTTTGCAAGAACTATATATACACACACAGATATTTTCTAATAAAAATGTAACATCTGTTACAAGTCGTCATCATAGCACGAGATGTTATTTCACAAGTGAGCTTTGCACTGAAACCGAGTATCGTTCTTTCTCCAAGATTGAGTAATTTATTCACTTCCCTAATTTGGTTTGTTTTGTAGTTCGAGTACCGAATCAAAAAGCTCCGCAACGAAGACAGCGTTATGCGTAAAATTAGGAATCGAAGAAAAGCACAAGAACATTGGAATAAGGTGATTCTTCTGCATCTTTTTCTCTCAAGTTATTTGATATACCAGTCAAATGGGTATCGGTGGGCGTAGGGTAGGACAGGTTTCGAGCGATATAGTCGATTTCCGAAACTAAGTTCTTGAATTACAACGGGTACCGAACTGACTTATTTTATTTTCTAGTTGAGGAAATATGTAATGTATACATGGGACTGCAGTGCAATGTATGACAATTGTAACGATCCATCAAAAACGTCAAATTGCGCGAACTGCATTGGTCAACCGGCTCGTAAGGATGAATCGTTGCGCAAGCGGAGGTTAAAGAATGGAAGTAGTACTGCTATCAGTTTTAAGTTAGATGTTAATGGATCTGCCAAGAAGGTAATGTTCTTGTTGTCTTACATCTGCATGAGCTAATGTCTTCTGGTATTGTCTCTTCAAGTCCTAATTGATTAGACTTATCATATGCAGTCTTCTAAAGATGAGAAATCTCCAAAGGCAAGAGCTACTGACCAGGAAATGGGAATGATAACAACCAAACAAGAGCCGGTAAGTTTTGGGGTCGTTTAGCACGGAAATTTTGACCGTGCAATTCTAATGGATTTGCATCTAAAATGTGAGATCCCACGTCGATTGGAGAGGGGAACGAAAAATTCCTTATTAGGGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTAAAACTGTGAGGTTGACGGTGATACGTAACGGGCCGAAGCGGACAATGTTTGCTAGCGGTGAGGTTGGACTGTTACAAATGGTATCAGAGCTAGACACTGAGCGGTGTGGCAACGAGGACGCTGGGCTCCCTAGGAGGGTGGATTGTGAGATCCTACATCGATTGGAGAAGGGAACGAAACATTTCTTATAAGGGTGTGGAAACCTCTCCCTAGCAGACGCGATTTAAAACCGTGAGGCTAACGGCGATATATAATGGGTCGAAGTGGACAATATCTACTAGCGGTGGGCTTGGATTGTACAAATGGTATCAGAGCTAGACACTGGGTTGTGTGCTAGCGAGGACGCTGGACCCAAGGGGGATGAACTGTGAGATCCCACATCGATTGGAGAGGAAAATGGAACATTCCTTATAAGGGTGTGAAAACCTCTCCCTAGTAGATGCGTTTTAAAACTCTGAGGCTCATAGCGATGTGCAACGAGCTGAAGCGGACAATATCTGCTAGTGGTGGGCTTGAGTTGTTACAAATGGTATCAGAGCCAAACACCGGGTGGTGTGCCAACGAGGACGTTGGGCTCCCTAGAGGGGTGGATTGTGAGATCCCGCATCGATTGGAGAGGGGAACGAAACATTCCTTACAAGGGTGTGGAAATCTCTTCCTAGCAGACGCGTTTTAAAATTGTGAGGCTAACGACGATACATAATGGGTCGAAGTGGACAATATCTACTAGCGGTGGGCTTGGGCGGTGTGCCAGCAAGCAAGGACATTGGGCCCCCAAGAGAGGTGGATTGTGAGATCCCACATCGATTGGAGAGGGGAACAAAACATTCCTTATAAAGGTGTGGAAACCTCTCCCTAGTAAACGTGTTTTAAAACCGTGAGGCTAACGGCGATACATAACAGGCCAAAACGGACAATATCTGCTAGCGGTGGGCTTATCCTACCTGCCACTAAACCACAGTCCATTTTGCTGATGTAAAACTTGCCAAAAGTATTGGAAATACATGTTGAATCACTAACCATTCCATTTTCATATTCTCCATCCAGCCTCTGCAACATCAAGTGCTTGGTTCTACCCATGAGACGAAACAAAGTTCAACCGTTCCATTCGAGGGAGATTCTTCACAACCCGGAGAATTTTCTCGTCCCGAAGATATCATACCTCCGCCACCGTTGATAGGTAACTATCGCTGTTTTTTACGGGCAATATCCGTCATTCCCACTCTTCCCTCCATTGACCAGAAGAGGAGGGGATGGATTAGAATACTTAGATCATGAAATGACCTGAATTTTCTGATGCAGACTTCAAACACAGTTCTGATATTGACATGTCCGATGTCTTGAAGAATATAAAAAGTTTGTACGAGTATAACGTATTTCTTAGAGAAAAGCTATTGTCCACTCAATCCGAGGTTCGTGCTTTATCAGCCAAGTCTGCACCGTGAACGGAGCGCCAAACCAAACGTAGAGAACATGTACATCTAGTAAGTAAACAGTTTACTTCTGATACTGCAGTTCCAAGATTCTCCGCTAAGTAGACGACTCGTAAGATGTAGGATTACTGCTACCATTGTGTGCTCGTGGTTCTTTGCAGGCGCTCGGATGGCAGAGAGACACCTTGCCAATGTACTTTGTTGTAATATTGTTAACATTTGTTTGTACTATAGAGTAGAACATGGATGGATTGCTGAGAAAAATGTATAGTTTTGTATCTGGTGTTGTACTCTGCTGCACATTACTTGTTGCATTACTGTATGAACTCAAGGACTTTGATTCTTACTACTTGTGAATAACACTAGTTGGTTGAATTTTAGGAGCTTTTAGATTAAGTATTTTCACATTA
mRNA sequence
TGACTGTTCTACAGAGAGAAAAAAAGGCAAGTAAAACTCAAGAACAAGCTCATCACCCACCAACTCCAAAAGAATTCAGAATCAAATGGGCAAAAAAGGTGGATGGAAATTAAAAAGGTAACCCAATTCATTTACGGTTTCTCGAAAGGCAGCCACCCATTTGTTCTTCCTTTTCTCTCTCCTTCGTTTCCCACAGCTCATTGATTCTCGCCCACGTTTGAACAAACAAGCAAAGAATCGTTCAAAATTTGGATACCCCATAAATTTTTTAGGATACCCATAAACGTTTTAGGTTTATCTCTACATACCCAGTAACCCCATGATGATTTTTTAGGTTATAAACAGATCTTCTATCAATTTCGAAGTGTTGGGGCTTTGAATCTCGAAAACCCCACATCCCATTTCAGCTTTTCGAGGTTGAATTTCTGGGTTGTTCTAAAGGGTTGCGAATGTCATGTCCAAGCAGTAGCTTCCGCTACAATGGCAGCCACTGTGCGTGCCCACCAGGCCAGCTTCTCAATCGAAGCAACAATAGCTGTGTTATCTTCAATAGCCCTTCGGTTATCACTACAGGGCGGTTTGAGAGCTATGCTGTTAGCTTCCCTGAGACCATTTTCTCCTTTGATTCAATCAGGAAGTTCACTCAGTCTCAGGCTGTGTTCCTTGAAGCTACTCTGTTCTTGCTGCTTTCTTGGCTCTTTTTCTGTATGTTCCTTAGGTTCATGAAGCTTGGAGATGGGAGAAATATTTGGTTCAGGATGAGATGGTGGGTTAGCAGATTGGATGTTTGCTTTGCCACAAGACATTGGCTGGACGACCAAAAGGTAGTTACGAAACGTAAAACCGAACTTGGTGGAACGTTCTCAATAGCAAGTTGGATTCTTTTCATCGGCTTGTTTGCTGCGTTGCTTTACCAAATCATATCGAAGAGAAGTATCGAAGTGCATAACGTGAAAGCAGCGAACGCACCAGACTTGGTTTCCTTTGTGAATGATATGGAATTTAATATAACCACGGTCTCGACTATGAGTTGTGCGAATATACGCGGTCTTGGTACCGCTGTATTTGGAAATCCTGGTTTTCTGGAACAGTCAGTAATGCCTCTTTCAAAGTTTGCAAACTACTCGTGTCAAAACACGAGTGAAGGGCCAACTATAAGTGTTAAGTGCGAACGATGTCGTTTCATTCAGGACGATCTTTACGTCTCGTGGCAGTTTGTTGATCTTCCAAATAACCCTGCAAGTGCAGTTGGATTTCAGTTTAACTTCTCTGCTAAGGATCATGTTCAAAAAAATCAGGAAAGTTTCGTTAGTGGTACGTTAAAAAATCGAAGCAATTTCGATGATACGCCAGTTACGTTCCGAGGGAAGAATGCAAATATAATGCAATTTAACCTATTTCCAAGAATATACCGCAGTAAACGTGATTCTAAGCTCATGCAGCCTTTATTTCACGAGTTCGTTTCGGGTTCATCCTTTCAAAATACTAATGAGCTCCAACTATCCCTTGAAAATGCCAATGATGGACTTATCAACATCACCTTGTACATCAATCTTCTCTCATCCTACATTGTTGAGGTGGAGACTCAAAATATTTTGGGCCCTGGTAAGTGTTTTGCTTTATTGACATATTTCGAGTACCGAATCAAAAAGCTCCGCAACGAAGACAGCGTTATGCGTAAAATTAGGAATCGAAGAAAAGCACAAGAACATTGGAATAAGTTGAGGAAATATGTAATGTATACATGGGACTGCAGTGCAATGTATGACAATTGTAACGATCCATCAAAAACGTCAAATTGCGCGAACTGCATTGGTCAACCGGCTCGTAAGGATGAATCGTTGCGCAAGCGGAGGTTAAAGAATGGAAGTAGTACTGCTATCAGTTTTAAGTTAGATGTTAATGGATCTGCCAAGAAGTCTTCTAAAGATGAGAAATCTCCAAAGGCAAGAGCTACTGACCAGGAAATGGGAATGATAACAACCAAACAAGAGCCGCCTCTGCAACATCAAGTGCTTGGTTCTACCCATGAGACGAAACAAAGTTCAACCGTTCCATTCGAGGGAGATTCTTCACAACCCGGAGAATTTTCTCGTCCCGAAGATATCATACCTCCGCCACCGTTGATAGACTTCAAACACAGTTCTGATATTGACATGTCCGATGTCTTGAAGAATATAAAAAGTTTGTACGAGTATAACGTATTTCTTAGAGAAAAGCTATTGTCCACTCAATCCGAGGTTCGTGCTTTATCAGCCAAGTCTGCACCGTGAACGGAGCGCCAAACCAAACGTAGAGAACATGTACATCTAGTAAGTAAACAGTTTACTTCTGATACTGCAGTTCCAAGATTCTCCGCTAAGTAGACGACTCGTAAGATGTAGGATTACTGCTACCATTGTGTGCTCGTGGTTCTTTGCAGGCGCTCGGATGGCAGAGAGACACCTTGCCAATGTACTTTGTTGTAATATTGTTAACATTTGTTTGTACTATAGAGTAGAACATGGATGGATTGCTGAGAAAAATGTATAGTTTTGTATCTGGTGTTGTACTCTGCTGCACATTACTTGTTGCATTACTGTATGAACTCAAGGACTTTGATTCTTACTACTTGTGAATAACACTAGTTGGTTGAATTTTAGGAGCTTTTAGATTAAGTATTTTCACATTA
Coding sequence (CDS)
ATGTCATGTCCAAGCAGTAGCTTCCGCTACAATGGCAGCCACTGTGCGTGCCCACCAGGCCAGCTTCTCAATCGAAGCAACAATAGCTGTGTTATCTTCAATAGCCCTTCGGTTATCACTACAGGGCGGTTTGAGAGCTATGCTGTTAGCTTCCCTGAGACCATTTTCTCCTTTGATTCAATCAGGAAGTTCACTCAGTCTCAGGCTGTGTTCCTTGAAGCTACTCTGTTCTTGCTGCTTTCTTGGCTCTTTTTCTGTATGTTCCTTAGGTTCATGAAGCTTGGAGATGGGAGAAATATTTGGTTCAGGATGAGATGGTGGGTTAGCAGATTGGATGTTTGCTTTGCCACAAGACATTGGCTGGACGACCAAAAGGTAGTTACGAAACGTAAAACCGAACTTGGTGGAACGTTCTCAATAGCAAGTTGGATTCTTTTCATCGGCTTGTTTGCTGCGTTGCTTTACCAAATCATATCGAAGAGAAGTATCGAAGTGCATAACGTGAAAGCAGCGAACGCACCAGACTTGGTTTCCTTTGTGAATGATATGGAATTTAATATAACCACGGTCTCGACTATGAGTTGTGCGAATATACGCGGTCTTGGTACCGCTGTATTTGGAAATCCTGGTTTTCTGGAACAGTCAGTAATGCCTCTTTCAAAGTTTGCAAACTACTCGTGTCAAAACACGAGTGAAGGGCCAACTATAAGTGTTAAGTGCGAACGATGTCGTTTCATTCAGGACGATCTTTACGTCTCGTGGCAGTTTGTTGATCTTCCAAATAACCCTGCAAGTGCAGTTGGATTTCAGTTTAACTTCTCTGCTAAGGATCATGTTCAAAAAAATCAGGAAAGTTTCGTTAGTGGTACGTTAAAAAATCGAAGCAATTTCGATGATACGCCAGTTACGTTCCGAGGGAAGAATGCAAATATAATGCAATTTAACCTATTTCCAAGAATATACCGCAGTAAACGTGATTCTAAGCTCATGCAGCCTTTATTTCACGAGTTCGTTTCGGGTTCATCCTTTCAAAATACTAATGAGCTCCAACTATCCCTTGAAAATGCCAATGATGGACTTATCAACATCACCTTGTACATCAATCTTCTCTCATCCTACATTGTTGAGGTGGAGACTCAAAATATTTTGGGCCCTGGTAAGTGTTTTGCTTTATTGACATATTTCGAGTACCGAATCAAAAAGCTCCGCAACGAAGACAGCGTTATGCGTAAAATTAGGAATCGAAGAAAAGCACAAGAACATTGGAATAAGTTGAGGAAATATGTAATGTATACATGGGACTGCAGTGCAATGTATGACAATTGTAACGATCCATCAAAAACGTCAAATTGCGCGAACTGCATTGGTCAACCGGCTCGTAAGGATGAATCGTTGCGCAAGCGGAGGTTAAAGAATGGAAGTAGTACTGCTATCAGTTTTAAGTTAGATGTTAATGGATCTGCCAAGAAGTCTTCTAAAGATGAGAAATCTCCAAAGGCAAGAGCTACTGACCAGGAAATGGGAATGATAACAACCAAACAAGAGCCGCCTCTGCAACATCAAGTGCTTGGTTCTACCCATGAGACGAAACAAAGTTCAACCGTTCCATTCGAGGGAGATTCTTCACAACCCGGAGAATTTTCTCGTCCCGAAGATATCATACCTCCGCCACCGTTGATAGACTTCAAACACAGTTCTGATATTGACATGTCCGATGTCTTGAAGAATATAAAAAGTTTGTACGAGTATAACGTATTTCTTAGAGAAAAGCTATTGTCCACTCAATCCGAGGTTCGTGCTTTATCAGCCAAGTCTGCACCGTGA
Protein sequence
MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDSIRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHWLDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFVNDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKCERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDTPVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGLINITLYINLLSSYIVEVETQNILGPGKCFALLTYFEYRIKKLRNEDSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESLRKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGSTHETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNVFLREKLLSTQSEVRALSAKSAP
Homology
BLAST of Cp4.1LG07g07860 vs. NCBI nr
Match:
XP_023538020.1 (uncharacterized protein LOC111798904 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1186 bits (3068), Expect = 0.0
Identity = 602/622 (96.78%), Postives = 602/622 (96.78%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC
Sbjct: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL
Sbjct: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
INITLYINLLSSYIVEVETQNILGP G F LL FEYRIKKLRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCISVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS 540
RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS
Sbjct: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS 540
Query: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV
Sbjct: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
Query: 601 FLREKLLSTQSEVRALSAKSAP 607
FLREKLLSTQSEVRALSAKSAP
Sbjct: 601 FLREKLLSTQSEVRALSAKSAP 622
BLAST of Cp4.1LG07g07860 vs. NCBI nr
Match:
KAG7020961.1 (hypothetical protein SDJN02_17649 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1173 bits (3035), Expect = 0.0
Identity = 594/622 (95.50%), Postives = 597/622 (95.98%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQKVVTKRKTELGGTFSIASWILFIG+FAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGMFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
NDMEFNITTVSTMSCA+IRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC
Sbjct: 181 NDMEFNITTVSTMSCAHIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGKN NIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL
Sbjct: 301 PVTFRGKNTNIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
INITLYINLLSSYIVEVETQNILGP G F LL FEYRIKKLRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCISVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCS +YDNCNDPSKTSNCANCIGQP RKDESL
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSTLYDNCNDPSKTSNCANCIGQPTRKDESL 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS 540
RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMI TKQEPP QHQVLGS
Sbjct: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMIATKQEPPQQHQVLGS 540
Query: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV
Sbjct: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
Query: 601 FLREKLLSTQSEVRALSAKSAP 607
FLREKLLSTQSEVRALSAKSAP
Sbjct: 601 FLREKLLSTQSEVRALSAKSAP 622
BLAST of Cp4.1LG07g07860 vs. NCBI nr
Match:
XP_022937645.1 (uncharacterized protein LOC111443988 [Cucurbita moschata] >XP_022937646.1 uncharacterized protein LOC111443988 [Cucurbita moschata] >XP_022937647.1 uncharacterized protein LOC111443988 [Cucurbita moschata])
HSP 1 Score: 1170 bits (3026), Expect = 0.0
Identity = 595/622 (95.66%), Postives = 597/622 (95.98%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC
Sbjct: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL
Sbjct: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
INITLYINLLSSYIVEVETQNILGP G F LL FEYRIKKLRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCVSVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
D+VMRKIRNRRKAQEHWNKLRKYVMYTWDCS +YDNCNDPSKTSNCANCIGQPARKDESL
Sbjct: 421 DTVMRKIRNRRKAQEHWNKLRKYVMYTWDCSTLYDNCNDPSKTSNCANCIGQPARKDESL 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS 540
RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMI TKQEPP QH VLGS
Sbjct: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMIATKQEPP-QHHVLGS 540
Query: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDM DVLKNIKSLYEYNV
Sbjct: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMFDVLKNIKSLYEYNV 600
Query: 601 FLREKLLSTQSEVRALSAKSAP 607
FLREKLLSTQSEVRALSAKSAP
Sbjct: 601 FLREKLLSTQSEVRALSAKSAP 621
BLAST of Cp4.1LG07g07860 vs. NCBI nr
Match:
KAG6586139.1 (hypothetical protein SDJN03_18872, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1161 bits (3004), Expect = 0.0
Identity = 588/622 (94.53%), Postives = 591/622 (95.02%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC
Sbjct: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL
Sbjct: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
INITLYINLLSSYIVEVETQNILGP G F LL FEYRIKKLRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCVSVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
D+VMRKIRNRRKAQEHWNKLRKYVMYTWDCS +YDNCNDPSKTSNCANCIGQPARKDESL
Sbjct: 421 DTVMRKIRNRRKAQEHWNKLRKYVMYTWDCSTLYDNCNDPSKTSNCANCIGQPARKDESL 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS 540
RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMI TKQEPPLQHQVLGS
Sbjct: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMIATKQEPPLQHQVLGS 540
Query: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
THE KQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV
Sbjct: 541 THEAKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
Query: 601 FLREKLLSTQSEVRALSAKSAP 607
FLREKLLSTQSE + P
Sbjct: 601 FLREKLLSTQSEALGWQRDALP 622
BLAST of Cp4.1LG07g07860 vs. NCBI nr
Match:
XP_022965476.1 (uncharacterized protein LOC111465369 [Cucurbita maxima] >XP_022965477.1 uncharacterized protein LOC111465369 [Cucurbita maxima])
HSP 1 Score: 1155 bits (2989), Expect = 0.0
Identity = 585/622 (94.05%), Postives = 595/622 (95.66%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPET+FSFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETVFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRKFT+SQAVFLEATLFLLLSWLFFCMFLRFM LGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTRSQAVFLEATLFLLLSWLFFCMFLRFMNLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC
Sbjct: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGKNANI+QFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL
Sbjct: 301 PVTFRGKNANILQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
INITLYINLLSSYIVEVETQNILGP G F LL FEYRIKKLRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCISVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
D+VMRKIRNRRKAQEHWNKLRKYVMYTW+CSA++DNCND SKTSNCANCIGQPARKDESL
Sbjct: 421 DTVMRKIRNRRKAQEHWNKLRKYVMYTWNCSALHDNCNDSSKTSNCANCIGQPARKDESL 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS 540
R+RRLKNGSSTAISFKLDVNGSAKKSSKDEK PKARATDQE+ M+ TKQEPPLQHQVLGS
Sbjct: 481 RQRRLKNGSSTAISFKLDVNGSAKKSSKDEKCPKARATDQELKMLATKQEPPLQHQVLGS 540
Query: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
THE KQSSTVPFEGDSSQPGEFSRPEDIIP PPLIDFKHSSDIDMSDVLKNIKSLYEYNV
Sbjct: 541 THEAKQSSTVPFEGDSSQPGEFSRPEDIIPLPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
Query: 601 FLREKLLSTQSEVRALSAKSAP 607
FLREKLLSTQSEVRALSAKSAP
Sbjct: 601 FLREKLLSTQSEVRALSAKSAP 622
BLAST of Cp4.1LG07g07860 vs. ExPASy TrEMBL
Match:
A0A6J1FBT5 (uncharacterized protein LOC111443988 OS=Cucurbita moschata OX=3662 GN=LOC111443988 PE=4 SV=1)
HSP 1 Score: 1170 bits (3026), Expect = 0.0
Identity = 595/622 (95.66%), Postives = 597/622 (95.98%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC
Sbjct: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL
Sbjct: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
INITLYINLLSSYIVEVETQNILGP G F LL FEYRIKKLRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCVSVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
D+VMRKIRNRRKAQEHWNKLRKYVMYTWDCS +YDNCNDPSKTSNCANCIGQPARKDESL
Sbjct: 421 DTVMRKIRNRRKAQEHWNKLRKYVMYTWDCSTLYDNCNDPSKTSNCANCIGQPARKDESL 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS 540
RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMI TKQEPP QH VLGS
Sbjct: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMIATKQEPP-QHHVLGS 540
Query: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDM DVLKNIKSLYEYNV
Sbjct: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMFDVLKNIKSLYEYNV 600
Query: 601 FLREKLLSTQSEVRALSAKSAP 607
FLREKLLSTQSEVRALSAKSAP
Sbjct: 601 FLREKLLSTQSEVRALSAKSAP 621
BLAST of Cp4.1LG07g07860 vs. ExPASy TrEMBL
Match:
A0A6J1HLS7 (uncharacterized protein LOC111465369 OS=Cucurbita maxima OX=3661 GN=LOC111465369 PE=4 SV=1)
HSP 1 Score: 1155 bits (2989), Expect = 0.0
Identity = 585/622 (94.05%), Postives = 595/622 (95.66%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPET+FSFDS
Sbjct: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETVFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRKFT+SQAVFLEATLFLLLSWLFFCMFLRFM LGDGRNIWFRMRWWVSRLDVCFATRHW
Sbjct: 61 IRKFTRSQAVFLEATLFLLLSWLFFCMFLRFMNLGDGRNIWFRMRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC
Sbjct: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGKNANI+QFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL
Sbjct: 301 PVTFRGKNANILQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
INITLYINLLSSYIVEVETQNILGP G F LL FEYRIKKLRNE
Sbjct: 361 INITLYINLLSSYIVEVETQNILGPVSFLADLGGLYCISVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
D+VMRKIRNRRKAQEHWNKLRKYVMYTW+CSA++DNCND SKTSNCANCIGQPARKDESL
Sbjct: 421 DTVMRKIRNRRKAQEHWNKLRKYVMYTWNCSALHDNCNDSSKTSNCANCIGQPARKDESL 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLGS 540
R+RRLKNGSSTAISFKLDVNGSAKKSSKDEK PKARATDQE+ M+ TKQEPPLQHQVLGS
Sbjct: 481 RQRRLKNGSSTAISFKLDVNGSAKKSSKDEKCPKARATDQELKMLATKQEPPLQHQVLGS 540
Query: 541 THETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
THE KQSSTVPFEGDSSQPGEFSRPEDIIP PPLIDFKHSSDIDMSDVLKNIKSLYEYNV
Sbjct: 541 THEAKQSSTVPFEGDSSQPGEFSRPEDIIPLPPLIDFKHSSDIDMSDVLKNIKSLYEYNV 600
Query: 601 FLREKLLSTQSEVRALSAKSAP 607
FLREKLLSTQSEVRALSAKSAP
Sbjct: 601 FLREKLLSTQSEVRALSAKSAP 622
BLAST of Cp4.1LG07g07860 vs. ExPASy TrEMBL
Match:
A0A0A0LJH8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074060 PE=4 SV=1)
HSP 1 Score: 1005 bits (2598), Expect = 0.0
Identity = 513/623 (82.34%), Postives = 559/623 (89.73%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPS+SFRYNGS CACPPGQLLNR+NNSCV+F+ S ITTGR ++YAVSFPETIFSFDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLNRANNSCVLFSRTSAITTGRLQNYAVSFPETIFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRK TQSQAVFLEATL +LLSWLFFC+FLRFMKLGDGRNIWFR+RWWVSRLDVCFATRHW
Sbjct: 61 IRKITQSQAVFLEATLVMLLSWLFFCIFLRFMKLGDGRNIWFRIRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQ++VTKRKTELGG FSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQRIVTKRKTELGGMFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
ND+EFNITTVSTMSCANIRGL T VFGNPGFLEQ VMPLS FAN+SCQN SEGPTIS+KC
Sbjct: 181 NDIEFNITTVSTMSCANIRGLDTVVFGNPGFLEQKVMPLSSFANFSCQNRSEGPTISLKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDD+Y+SWQFVDLPNNPASAVGF+FN SAKD VQ++QESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDVYISWQFVDLPNNPASAVGFEFNISAKDQVQRSQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGK+ANI+QFNLFPRIY +K+DSKLMQPLFHEFVSGSSFQNTN+LQLSLEN NDGL
Sbjct: 301 PVTFRGKSANIVQFNLFPRIYSNKQDSKLMQPLFHEFVSGSSFQNTNDLQLSLENTNDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
+NITLYINLLSSYIVEVE+QNILGP G F LL FEYRIK+LRNE
Sbjct: 361 LNITLYINLLSSYIVEVESQNILGPVSFLADLGGLYCISFGIFFYLLVQFEYRIKRLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYD-NCNDPSKTSNCANCIGQPARKDES 480
DSVMRKIRNRRKAQEHWNKLRKYVMYTW CSA+ D + NDPSKTS+C NCIGQP+ K+ S
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWGCSALLDGDYNDPSKTSSCPNCIGQPSHKNGS 480
Query: 481 LRKRRLKNGSSTAISFKLDVNGSAKKS-SKDEKSPKARATDQEMGMITTKQEPPLQHQVL 540
RKRRLK+GSSTAISF +DVNG+ ++ ++D KSPKA ATDQEM MI TKQE PL HQVL
Sbjct: 481 SRKRRLKSGSSTAISFNIDVNGATNRTVNQDMKSPKATATDQEMRMIATKQEQPLHHQVL 540
Query: 541 GSTHETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEY 600
GST+E KQ TVPF+GDSSQP +FSR EDI PPPPLIDF SSDIDMS++LKN+KSLYEY
Sbjct: 541 GSTYEEKQR-TVPFKGDSSQPVDFSRSEDI-PPPPLIDFNDSSDIDMSNILKNMKSLYEY 600
Query: 601 NVFLREKLLSTQSEVRALSAKSA 606
NVFLREKLLSTQSEVRAL+ KSA
Sbjct: 601 NVFLREKLLSTQSEVRALATKSA 621
BLAST of Cp4.1LG07g07860 vs. ExPASy TrEMBL
Match:
A0A1S3B545 (uncharacterized protein LOC103485904 OS=Cucumis melo OX=3656 GN=LOC103485904 PE=4 SV=1)
HSP 1 Score: 997 bits (2577), Expect = 0.0
Identity = 502/613 (81.89%), Postives = 551/613 (89.89%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPS+SFRYNGS CACPPGQLL+R+NNSC++F+ S ITTGR ++YAVSFPETIFSFDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLSRTNNSCILFSRTSAITTGRLQNYAVSFPETIFSFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRK TQSQAVFLEATL +LLSWLFFC+FLRFMKLGDGRNIWFR+RWWVSRLDVCFATRHW
Sbjct: 61 IRKITQSQAVFLEATLVMLLSWLFFCIFLRFMKLGDGRNIWFRIRWWVSRLDVCFATRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQ+ VTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV
Sbjct: 121 LDDQRTVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
ND+EFNITTVSTMSCANIRGL T VFGNPGFLEQ VMPLS FAN+SCQN SEGPTIS+KC
Sbjct: 181 NDIEFNITTVSTMSCANIRGLDTIVFGNPGFLEQKVMPLSSFANFSCQNRSEGPTISLKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
ERCRFIQDD+Y+SWQFVDLPNNPASAVGF+FN SAKDHVQKNQESFVSGTLKNRSNFDDT
Sbjct: 241 ERCRFIQDDVYISWQFVDLPNNPASAVGFEFNISAKDHVQKNQESFVSGTLKNRSNFDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGK+ANI+QFNLFPRIY +K+DSKLMQPLFHEFVSGSSFQNTN+LQLSLENANDGL
Sbjct: 301 PVTFRGKSANIVQFNLFPRIYSNKQDSKLMQPLFHEFVSGSSFQNTNDLQLSLENANDGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRNE 420
+NITLYINLLSSYI+EVE+QNILGP G F LL FEYRIKKLRNE
Sbjct: 361 LNITLYINLLSSYIIEVESQNILGPVSFLADLGGLYCISVGIFFYLLVQFEYRIKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
DSVMRKIRNRRKAQEHWNKLRKYVMYTW CSA+ + ND S+TS+C NCIGQP+ K+ S
Sbjct: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWGCSALVGDYNDQSETSSCPNCIGQPSHKNGSS 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSS-KDEKSPKARATDQEMGMITTKQEPPLQHQVLG 540
RKR L++GSSTAI+F +DVNG+ K+++ +D K+PKA ATDQEM MI TKQE PL HQVLG
Sbjct: 481 RKRHLRSGSSTAINFNIDVNGATKRTANQDMKTPKATATDQEMRMIATKQEQPLHHQVLG 540
Query: 541 STHETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYN 597
ST+E KQ TVPF+GDSSQP +FSRPEDIIP PPLIDF SD+DMS++LKN+KSLYEYN
Sbjct: 541 STYEEKQR-TVPFKGDSSQPVDFSRPEDIIPLPPLIDFNDCSDVDMSNILKNMKSLYEYN 600
BLAST of Cp4.1LG07g07860 vs. ExPASy TrEMBL
Match:
A0A6J1ENC3 (uncharacterized protein LOC111436176 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436176 PE=4 SV=1)
HSP 1 Score: 963 bits (2489), Expect = 0.0
Identity = 491/623 (78.81%), Postives = 540/623 (86.68%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAVSFPETIFSFDS 60
MSCPS+SFRYNGS CACPPGQLLNR++NSCV+F+ S ITTGR E+ AVSFPETIF+FDS
Sbjct: 1 MSCPSNSFRYNGSLCACPPGQLLNRTSNSCVVFSGTSAITTGRLENSAVSFPETIFAFDS 60
Query: 61 IRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRHW 120
IRK TQSQAVFL+ATL +LL WLFFC+FLRFMKL DGRNIWFRMRWWVSRLDVCF+TRHW
Sbjct: 61 IRKITQSQAVFLKATLVMLLCWLFFCIFLRFMKLEDGRNIWFRMRWWVSRLDVCFSTRHW 120
Query: 121 LDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSFV 180
LDDQKVVTKRKTELGGTFS+ASWI+F GLFAALLYQIISKRSIEVHN+KAANAPDLVSFV
Sbjct: 121 LDDQKVVTKRKTELGGTFSMASWIVFSGLFAALLYQIISKRSIEVHNMKAANAPDLVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVKC 240
NDMEFNITTVSTMSC+NIRGL T VFGNPGFL Q VMPLS FAN+SCQN SEGPTISVKC
Sbjct: 181 NDMEFNITTVSTMSCSNIRGLDTIVFGNPGFLAQKVMPLSNFANFSCQNRSEGPTISVKC 240
Query: 241 ERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDDT 300
E+CRFIQDD+Y+SWQF+DLPNNPASAVGFQFNFS+KDHVQKNQESFVSGTLKNRSN DDT
Sbjct: 241 EKCRFIQDDIYISWQFIDLPNNPASAVGFQFNFSSKDHVQKNQESFVSGTLKNRSNLDDT 300
Query: 301 PVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDGL 360
PVTFRGKNANI+QFNLFPRI+R+ +DSKLMQPLFHEFVSGSSFQNTNELQLSLENAN+GL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIFRNNQDSKLMQPLFHEFVSGSSFQNTNELQLSLENANEGL 360
Query: 361 INITLYINLLSSYIVEVETQNILGPGKCFA---------------LLTYFEYRIKKLRNE 420
+NITLYINLLSSYIVEVE QNI GP A LL EYR+KKLRNE
Sbjct: 361 LNITLYINLLSSYIVEVERQNIFGPVSFLADLGGLYCITFSIFFYLLVQLEYRVKKLRNE 420
Query: 421 DSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDESL 480
DSVM K+RNRRKAQEHWNKLRKYVMYTW SA+ ++ +DPSK S+C NCIG +RK+ S
Sbjct: 421 DSVMLKVRNRRKAQEHWNKLRKYVMYTWGYSALDNDYSDPSKGSSCTNCIGPSSRKNGSS 480
Query: 481 RKRRLKNGSSTAISFKLDVNGSAKKSSK-DEKSPKARATDQEMGMITTKQEPPLQHQVLG 540
R R L++GSSTAISF +DVNG K+++K D SPKA ATD+EM I TKQE PL HQVLG
Sbjct: 481 RPRGLRSGSSTAISFHVDVNGYTKETAKHDMISPKATATDREMRTIATKQERPLHHQVLG 540
Query: 541 STHETKQSSTVPFEGDSSQPGEFSRPEDIIPPPPLIDFKHSSDIDMSDVLKNIKSLYEYN 600
STHE KQ TVPF+GD FS PEDIIPPPP IDFK SSDI MSD+L+++KSLYEYN
Sbjct: 541 STHEGKQRLTVPFKGD------FSHPEDIIPPPPSIDFKDSSDIGMSDILRSMKSLYEYN 600
Query: 601 VFLREKLLSTQSEVRALSAKSAP 607
VFLREKLLSTQSEVRAL+ KS P
Sbjct: 601 VFLREKLLSTQSEVRALATKSTP 617
BLAST of Cp4.1LG07g07860 vs. TAIR 10
Match:
AT5G16520.1 (unknown protein; Has 25 Blast hits to 25 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 25; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 620.5 bits (1599), Expect = 1.4e-177
Identity = 329/624 (52.72%), Postives = 422/624 (67.63%), Query Frame = 0
Query: 1 MSCPSSSFRYNGSHCACPPGQLLNRSNNSCVIFNSPSVITTGRFESYAV-SFPETIFSFD 60
M+CP +S YN + CAC GQLLNRS+ SC IF PS I+T + +Y+V SF ET+F+FD
Sbjct: 1 MACPRNSITYNATRCACGIGQLLNRSSGSCEIFGWPSTISTDKDVNYSVISFAETLFAFD 60
Query: 61 SIRKFTQSQAVFLEATLFLLLSWLFFCMFLRFMKLGDGRNIWFRMRWWVSRLDVCFATRH 120
IRKFTQSQA+FLEATL +LLSWL FC FLRF KLGDGRN+WF +RWW++RLDV F+TRH
Sbjct: 61 RIRKFTQSQAIFLEATLVMLLSWLVFCFFLRFTKLGDGRNVWFNLRWWITRLDVFFSTRH 120
Query: 121 WLDDQKVVTKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNVKAANAPDLVSF 180
WLDDQ++V KRKTELGGTFS+ASWI+FIGLFAALLYQII+KR+IEVHNV+A +PDL+SF
Sbjct: 121 WLDDQQIVKKRKTELGGTFSVASWIVFIGLFAALLYQIITKRTIEVHNVRATGSPDLISF 180
Query: 181 VNDMEFNITTVSTMSCANIRGLGTAVFGNPGFLEQSVMPLSKFANYSCQNTSEGPTISVK 240
ND+EFNIT VS MSC+N+RG+G V GNPGF E V LS +Y+C+NT+ GPT++ K
Sbjct: 181 ENDLEFNITAVSDMSCSNLRGIGNVVMGNPGFSEFKVAALSSLGSYTCKNTTSGPTVNFK 240
Query: 241 CERCRFIQDDLYVSWQFVDLPNNPASAVGFQFNFSAKDHVQKNQESFVSGTLKNRSNFDD 300
C +CR D +Y+SW FVDLP++PA+AVGFQFNF++K+ + SFVSGTL+N S D+
Sbjct: 241 CTKCRLTNDYIYISWHFVDLPDSPAAAVGFQFNFTSKNGPNEKHMSFVSGTLRNGSILDE 300
Query: 301 TPVTFRGKNANIMQFNLFPRIYRSKRDSKLMQPLFHEFVSGSSFQNTNELQLSLENANDG 360
+PVTFRG NI++FNLFPRIY D KL+QPLFHEF+ GS +++T +LQ S+ + DG
Sbjct: 301 SPVTFRGTEGNILKFNLFPRIYHHLHDLKLIQPLFHEFIPGSVYRDTTQLQASMGRSTDG 360
Query: 361 LINITLYINLLSSYIVEVETQNILGP---------------GKCFALLTYFEYRIKKLRN 420
++N TL+IN LS+YIVE++ +NILGP G F LL EYRIKKLRN
Sbjct: 361 ILNTTLFINYLSAYIVEIDHENILGPVSFLADLGGLYCISIGIFFYLLVQCEYRIKKLRN 420
Query: 421 EDSVMRKIRNRRKAQEHWNKLRKYVMYTWDCSAMYDNCNDPSKTSNCANCIGQPARKDES 480
ED+V RKIRNRRKA +HW+KLR+YV YTWDCS + D+ +K S P + S
Sbjct: 421 EDTVFRKIRNRRKALDHWDKLRRYVAYTWDCSILVDDAIKTTKVSGMCGLTRPPTSSNSS 480
Query: 481 LRKRRLKNGSSTAISFKLDVNGSAKKSSKDEKSPKARATDQEMGMITTKQEPPLQHQVLG 540
++G S + + K + EK+ ++ E+ + L H G
Sbjct: 481 ------EHGES--------IMANKKPNLGIEKNVISQPASLELSSFDSASS--LAH---G 540
Query: 541 STHETKQSSTVPFEGDSSQPGEFSRPEDI-IPPPP---LIDFKHSSDIDMSDVLKNIKSL 600
K+S T P S ED+ IPPPP ID S++D D+ + L
Sbjct: 541 DNFSNKKSITHP----------ISHSEDVSIPPPPPMEFIDGSSGSEVDAMDIKNKFQLL 595
Query: 601 YEYNVFLREKLLSTQSEVRALSAK 605
Y+YNV LREKLL TQS + L+ K
Sbjct: 601 YDYNVLLREKLLETQSLLNTLAPK 595
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023538020.1 | 0.0 | 96.78 | uncharacterized protein LOC111798904 [Cucurbita pepo subsp. pepo] | [more] |
KAG7020961.1 | 0.0 | 95.50 | hypothetical protein SDJN02_17649 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022937645.1 | 0.0 | 95.66 | uncharacterized protein LOC111443988 [Cucurbita moschata] >XP_022937646.1 unchar... | [more] |
KAG6586139.1 | 0.0 | 94.53 | hypothetical protein SDJN03_18872, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022965476.1 | 0.0 | 94.05 | uncharacterized protein LOC111465369 [Cucurbita maxima] >XP_022965477.1 uncharac... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1FBT5 | 0.0 | 95.66 | uncharacterized protein LOC111443988 OS=Cucurbita moschata OX=3662 GN=LOC1114439... | [more] |
A0A6J1HLS7 | 0.0 | 94.05 | uncharacterized protein LOC111465369 OS=Cucurbita maxima OX=3661 GN=LOC111465369... | [more] |
A0A0A0LJH8 | 0.0 | 82.34 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074060 PE=4 SV=1 | [more] |
A0A1S3B545 | 0.0 | 81.89 | uncharacterized protein LOC103485904 OS=Cucumis melo OX=3656 GN=LOC103485904 PE=... | [more] |
A0A6J1ENC3 | 0.0 | 78.81 | uncharacterized protein LOC111436176 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT5G16520.1 | 1.4e-177 | 52.72 | unknown protein; Has 25 Blast hits to 25 proteins in 9 species: Archae - 0; Bact... | [more] |