Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTGTGATTTGTTGTTGTGATTGCATTTTGGTTTCTGTTGTTTTGATTGATGGCTTCCACGAATTCACCGCCCAACATTGATGCTTCGGCGTTGACGGATGATTTAGTGACGAAAGCTTTGAATAAACGGTATGAGTGCCTTGTAACTGTTCGAACGAAGGCAATTAAGGGGAAAGGGGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTTATACGAAATCCTAGTAATAGTCTTCCGAAAGCGGTGAAGCTCAAGTGTTCTTTGTGCGATTCTGTTTTCTCGGCTTCGAACCCCTCGCGGACTGCATCTGAGCATTTGAAACGAGGCACTTGTCCTAATTTGAGCTCCATTTCAAGGTCTAATGCTACGGCGTCACCGTTGCCGATATCGTCCATTCCTTCTCCGACATTGCACAACCACAAGAAGCGAAGCTCTCAAATGAATGCTCCGATTCTCACTGCTTCCTATCAGGTACATTCTCTTGCCATGATTGAGCCGACGCGTTCCTATGCTCCGTTAATTTCCTCGCCGCCGACGCCGGTGGCTCAAAATCCGGTTGGGATGGCGAGTAAGATGGAGGTGAATCAGCATCAGTTGGTGTTATCAGGTGGGAAAGATGATTTAGGTGCACTAGAAATGCTGGAAAACAGTGTCAAGAAACTGAGGAGTCCACATGCCTCACCTGGACCAAGGTTAAGTAAGGAACAAATTGATTCTGCTATCGAATTACTGACTGATTGGTTTATCGAGTCGTGTGGGTCAGTATCTCTTTCCTGCCTTGAGCATCCGAAGTTTAAAGCCTTGCTTAGTCAGATGGGCTTGCCTTCATTACTTCGAACCGACATTTTAGGAGCTCGGCTCGATTCCAAGTTTGAGGAGGCCAAAGCTGATTCAGAAGCCAGGATTAGAGATGCTTCGTTTTTCCAGATCGCTTCGGATGGGTGGAAGAATAAGAACTGCTGTGGCTATTACTGTGGCGAAGAGAGTGTAGTTAAATTTATGGTTAATCTTCCAAATGGTTCTACGATGTTCCAAAAAGCATTGTTTACAGGGGGATTGGTGTCACCCAAGTATGCGGAAGAGGTTATTTTAGATACGGTCAATGAGATTTGTGGGAATAGTCTGCAGAGATGTGTGGGGATAATAGCAGATAGGTATAAGGGCAAGGCATTGAGGAATTTGGAGATAAAGAATCATTGGATGGTAAATCTCTCTTGCCAGCTTCAGGGTTTTATTTGTTTGATAAAGGATTTTAACAAAGAGCTTCCACTTTTCAGGGTAGTCACTGAAAATTGCTTGAAGGTTGCAAACTTTGTAAACACCAAATCTCAAGTTAGGAATTGTTTAAACAAGTATAAGGTGCAGGAGCTAGAAGGTCGATGGTTGCTTCACGTTCCTTCGCCAAATTGTGACACATCCAAAAACTTCTCACCTGTTTATGCAATGCTTGATGATATGCTTAGCTCTGCTCATGTCCTTCAAATGGTTGTGTTAGACGAATCTTATAGGTTAGTATGCATGGAGGATCCACTTGCTTCTGAGGTTTCAAGTCTGATACAAAATGAACGCTTTTGGGATGAAATGGAGGGAGTTCATTCACTTGTGAAAATGATCCGAGGGATGGCTCAAGAGATTGAAGCCGAAAGGCCACTGATTGGTCAATGCTTGCCTCTCTGGGAGGAGCTGAGAACAAAAGTGAAGGAATGGTGTGCTAAGTTCAGCATAGCTGAAGGGCCAGTGGAGAAAATTTTAGAAAAGCGGTTTAGGAAAAATTATCATCCAGCATGGTCTGCTGCATTTATACTGGACCCGCTTTACTTGAGGAGGGACATAAATGGGAAATATCTTCCACCCTTCAAGTGCCTTTCACAAGAGCAAGAAAAGGATGTTGATTCGCTTATTAACCGGTTGGTGTCCAGGGAAGAAGCTCATGTCGCATTCATGGAGCTTATGAAATGGAGATCCGAGGGGCTAGATCCACTTTATGCTCAGGCAGTTCAGGTCAAACAACTAGACCCTTTAACCGGAAAGATGAAAATTGCCAACCCACAGAGTAGGCGACTTGTCTGGGAAACTTGCCTAAGTGAGTTCAAGACCCTTGGTAAGGTTGCACTGAGGCTTATTTTCCTTCATTCAACATCTTGTGGCTACAAGTGTAAGTGTTCTATCATGAATTTGGTTTGCTCACATCGGCACTCGAGGATCGGCTTGGAGAGAGCTCAGAAGATGGTATTTGTTGCAGCTCATGCCAAGCTTGAAAGGAAAGACTTTTCTAATGAGGATGACAAAGATGCAGAACTATTTGCAATGGCGGATGGTGAAAATGACATGCTCAATGAGGTCTTTTCTGATGCACCCTCAATGTAATGTCTTCTTTTGAACTTCTCTAGAATTATGAAAATAGGGTATATTAAATTCTTTTGTTGTTTTAATTCTTCTTCCTTTTCCTTTTTGTTCTTTTTTCTTTTCAATTTTCGTGTTGCCCGTGTAGAGTAGATTGAGCTATGTACTGATAGAATCTACGCAGCTGGTCAGGAATTGAGTTATGAGCACTCTTGTAATGTTTTTCTGGATCAACGTGGGACTTGAAGTGCTACTAACTTGTCACCCCTTTAAATTGACGATTATTATCTTTGGACTGTGGAATTAACTTGTTCTGACTAAGAAATTTCTGGTATACAATCCTTGTCTTTGTATGCATCAAGAACACGTTATGCCTTTGACAGGCACCCGAATGATCTTCATGGTTAATCTTTCTGATTAAGTTTAACCATGTAAGCTCCTTCAAAGACTACTTGAGATAAAATATAAGTTTTATGTCGCCATTTTCCTTAGGATAAGGAAATTCAGTTCTAGTAAATGAAGCGAGTCCATGTTCCCTGGATTGTCACAATTTTGAATACCTCTTTGACACTGCTAGGCTAGCTAGTAAATGTACCCAACTGACTTACCACACTACAAAGTGTGCCGCATTGTACGAAGTTTCTGAAATAAAAAGCGGTGAAGGTTATTAGATTCTTGCATTCAAACAAAATTTCCCCCTTTAGTAACTATATCATGCTAGCATGACCTCATTTTAAGGTCATGAACCTCCTAAACATTAACATTTAAGATTTCAATTTTTACTAGTACAGATGGTTAGAGATTGCTGCTCTGTTTAGCTTCTTCATTTTTATGTGCTTAGTACGTTAAAAGTCTATTATAGCATTAAACATTGCTACATTTTTCTGAAAAGAGAACATGAAAATATTTTTTCTTTATCTGAATAGTTGTGTTTTCATGTTATTTATTCAAGCATCAAGGTAGAAGATGATTAAACTAATTTTCATGACCAGAAAGCCCAAATGTTTAATTAATTTGGGCATTATGAAGATCTAATCAATATTTCTTATTTCATAGATAAAGATGATCAAAGCAAGCAAGAGCCTTGATTTTTCAAGGCTTGGTGGATAAAGAAATTGGAATTTTCTGCAACTTCTTTTATGCAGTAACGTTGGCTTGAAATTTTTCTGCAACTTCTTTTATGCAGTACAGGGCTGGATGTGTTCGATCAAATTGAACCAGAGTTGTCGAAGTCAACCTCGGACTAGGTACTATCTTTGCAGCTGAAGATGCACTTCTCCGATAGAATCCCAGATATATCTCCTGTTGTGCCTACTTTTGTTGCTTTTTGAGCCCAACTGTGATGAGTGGATGTCCATCTCAATGCTGTTGTAGATGTCATAGTCAAAGAAAAGACATTTACAAGGTGACATATCCCCTTCTCTCTCTCTCTCCCCTTCTCTATGTTTATCCGTGTGGTGGGGGAGGGCACACACCTTCATGCACGAGCCTTTGAGCATGCATTTTCTTTATTCATTTGTTGCTCTCTTCACTCCTGTCACCATTTTTTAGTAGCTTGTTATTGNTGCCTACTTTTGTTGCTTTTTGAGCCCAACTGTGATGAGTGGATGTCCATCTCAATGCTGTTGTAGATGTCATAGTCAAAGAAAAGACATTTACAAGGTGACATATCCCCTTCTCTCTCTCTCTCCCCTTCTCTATGTTTATCCGTGTGGTGGGGGAGGGCACACACCTTCATGCACGAGCCTTTGAGCATGCATTTTCTTTATTCATTTGTTGCTCTCTTCACTCCTGTCACCATTTTTTAGTAGCTTGTTATTGAAATCTTCCTGTTCTTATGACATTTTTACTCTTTGAATGGGCCCTTAGTGAAATTTCCACCTGAAAGTTGAACTTTGAAGTTTCGTAGATTATTGGTTAGCCCTTGGCATGATTGCTAAAAAGTCTTGAGCAAAGGGCGAGCAAGGAGGCTGTCGGCATGAGTATGCACATAGCAAGGCCGAAGGGGATGTGGCAGCGAACTTGAATTGATGTTGCATGTATGGTTCAGAAAACATCTTTGACAACACAAAGCAGGTTGGTAGTGCTTTTGAGGCGACATGTTTTGTTTATTATTTTTTCACTGTTCTTTTCAACCTCTCGGCTCCATCCAAGAAAGTTAATATTCATGGAATATCTTTTGTTATAACAATAATTTTAATGTTTGACGAGGATGTCAATAGAAATAATAATTGTCGCTATATAATATAATAGAGAACATTATAGTCAATTTAGGATGTATGCTATGATTTACCGTAGATTACCATTTACCTGTCTACATTTTTACAAACCTGGACTACGTATTCTTTTCCTAGGTTTCTCCTTCTTGCTTTTTAGTGAAGTTTCCTTTTATACTTGATTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAAGTACTTCATTGCATTTATTATATTTCAGGTGGATATTGATTTAAGAGAATGGAGGAGGTTCTAGATCTCAAAGGATCCAGTATGCTAAATTTGCCAATTTATCTTTTATGATGGTAAGAACAAAATCTTTTTATTAGTTTATCTTGACCTATCTTTCTCATAACGTTTCATTGAGCATCCTCTGCCTAACTTTTCCTTGCTAAAATGGAAAGTTAGATGCCATTTAGGTGGAGTGGAGTCCTTAGAGAGAGAGAGAGAGAGAGAGGGGGGGGGTTACTTGGATAGGCTTCCCAAACTCATACTGACGTGTTTTGGCTTAACCCGGAGTTACTGTTCGTATTGAGAATAATATCATAGTTAGCTGTTAGATTGGCCGAGTTGTATACCACTGGCATTCCATTGGCGACTGTCTAAACCAACTTGCATGCACCTTGAACAGACGACCCTACTACATTTGGTTGACATCAAAACACGTAAGATATTAAATTTTAAGTAGATGAAACCATAGATTGAACTCATCCTCTCTTATTCTTCCAGTGCCCTTATGAATACTGACCCTAACCCATGATACTGGCCCTTATGAATGCTTAAAAAATTTATAGGTACAGCCAATTAACTTTTTAATACGATAATTAGATCTAGTTGTAATGGAAGGCATAGGATAGAGCAGTAATGAGGTGAAAACAGTTTCAAAGCTGAGAGAGGATTACATGAGCAAGTGTAAAAAATTGAAGATATTTGTAGATATGATATGTTATTAGGTCCAGAATGCTCTTAATCACAATAAATTTTTCTTCAGGGTTTACCAATGAAAGAAAGAGTGTCCACTAGGTCTTCTTCCATTAAAGAATAAACCAAAAACCACGTGGCTTCCCTAATTCCTAGGGATATCAATAAATTTCATTGACCCACTTCCACTTTCTCTCCTCTTAACTATGGGTTTCAGGATTTACCTCAGTAGTACACACACAGCCACTGCAAATTGACCTTACCATTGAATAGATAGGGTGTGAAAAATGATACTTAAAATTAGGTTACCCTTGATTATTTTGAAGCGCTAGGTCTTAAATTTACTCTGATTTGGCTTTGAGATTCAATAATGGTGGAATGTATGGATTATGTCGGCAATGTCAGACAATGGTGACTGCTCACCAACTAGAGAAGAATGCTTTTATTATTATGATTGTCATTATACTTTTATCATTCACTTTTTAGTATGCTTAGTAATATGTCATTCCACTCATTGCTGTTGTTGTTAAAGGTAAAGTAGATATAATTATTTGTACAGGACTCACTCTTTTATACAATGATTCATGTTTTCAAATTTTTATTTTTCAAGTTTGATCACTTTTGAAATTGATACTCATCCACATTTTTATATGCCATGATGTGAAAACAATGGGTAGAATTCACTGCTTTCTGACCTTTTCTTTACTTTATTCTTCTATTTGCATAGTTTGCAAATTACTTTTTGATTTTCAAGTAGAAGTTTAGCATTGATTCTCCTTAGTTTCTGTGGGACTAAGTTGTTGAGCGTTGTTACTAGAAGAAAGATGTAGGAAACTGTCTTTCATTTCTAGAAATGAAGGACAAACTAGAGGAAAACTAATGAAATCAAATATGTTTTTTTAACCTATGACAAACTAGGGGATAGTCAACCCCTCCTTTGTTCCCTTAAAAATAGGGCCTTGAAGGTTGTTTGAGAACTTGCTAGGGTACTTTCTTGAAAAAGGCCTGCATTCTGACTCAATGTTCTTCGCTCTCAGGGTGTAACGGCTCAAATCCTCCAACGCGAGCAGATATTGTCCTCTCTCGACTTTTCCTTTTAGACTTCTCCTGAAGGTTTAAAACGTGTTTGCTAGAGAGAGGTTTCCACACCCTTATAAAGAATGCTTTGTTCTCCCCTCCAACCAACTTGGGATCTCACACAGGGTGTTGTCTCTAGTTTAACACCCAATGAAAAGCCACCACTAGTAGATATTGTCTTCTTTGAGTTTTCTCTTTCGAGTTCCCCTCAAAGTTTTTAAAACGTGTCTACTAGGGAGAAGAAGTTTTCATACCCTTACAAAGAATGCTTCATTCTACTCTCCAACGACGTGGGATCTCACACACAGGGTATTGTCTCTAGAAAAACTCACTATATACATCTATATTTTCCCATCTGCCGTAGTAAAACTTGTATGGTGGGAATGAAGTTGGTTGAATTAGTGCACTGAAATCTCACCTAAAAGGAGAATATAGTGGTGAGATTATGTGAATGATTTAGAGTAAGAAAAAATAAAATACTTTATCTTTTTGTTTGAATGACATTTTACTGTTGATTTCACATGCTAATCTACTTTTTTTGGTATGCACTTTTGGCCCTTTCCCTATGAACTTTGGTACAATGGAATTTCACTATGCATTTTTTCCCATCCCACCAAACCTTCATTTAATTGGTTCAGTGGCCCCCAGGAAGTGTATAAAAAAGCTAGGTAGAGAGATTCAGACTTTTCTAATAGGGAATTAATTATTCCTGTATTGCCAATACAATTAT
mRNA sequence
ATGACTTGCCTTGTAACTGTTCGAACGAAGGCAATTAAGGGGAAAGGGGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTTATACGAAATCCTAGTAATAGTCTTCCGAAAGCGGTGAAGCTCAAGTGTTCTTTGTGCGATTCTGTTTTCTCGGCTTCGAACCCCTCGCGGACTGCATCTGAGCATTTGAAACGAGGCACTTGTCCTAATTTGAGCTCCATTTCAAGGTCTAATGCTACGGCGTCACCGTTGCCGATATCGTCCATTCCTTCTCCGACATTGCACAACCACAAGAAGCGAAGCTCTCAAATGAATGCTCCGATTCTCACTGCTTCCTATCAGGTACATTCTCTTGCCATGATTGAGCCGACGCGTTCCTATGCTCCGTTAATTTCCTCGCCGCCGACGCCGGTGGCTCAAAATCCGGTTGGGATGGCGAGTAAGATGGAGGTGAATCAGCATCAGTTGGTGTTATCAGGTGGGAAAGATGATTTAGGTGCACTAGAAATGCTGGAAAACAGTGTCAAGAAACTGAGGAGTCCACATGCCTCACCTGGACCAAGGTTAAGTAAGGAACAAATTGATTCTGCTATCGAATTACTGACTGATTGGTTTATCGAGTCGTGTGGGTCAGTATCTCTTTCCTGCCTTGAGCATCCGAAGTTTAAAGCCTTGCTTAGTCAGATGGGCTTGCCTTCATTACTTCGAACCGACATTTTAGGAGCTCGGCTCGATTCCAAGTTTGAGGAGGCCAAAGCTGATTCAGAAGCCAGGATTAGAGATGCTTCGTTTTTCCAGATCGCTTCGGATGGGTGGAAGAATAAGAACTGCTGTGGCTATTACTGTGGCGAAGAGAGTGTAGTTAAATTTATGGTTAATCTTCCAAATGGTTCTACGATGTTCCAAAAAGCATTGTTTACAGGGGGATTGGTGTCACCCAAGTATGCGGAAGAGGTTATTTTAGATACGGTCAATGAGATTTGTGGGAATAGTCTGCAGAGATGTGTGGGGATAATAGCAGATAGGTATAAGGGCAAGGCATTGAGGAATTTGGAGATAAAGAATCATTGGATGGTAAATCTCTCTTGCCAGCTTCAGGGTTTTATTTGTTTGATAAAGGATTTTAACAAAGAGCTTCCACTTTTCAGGGTAGTCACTGAAAATTGCTTGAAGGTTGCAAACTTTGTAAACACCAAATCTCAAGTTAGGAATTGTTTAAACAAGTATAAGGTGCAGGAGCTAGAAGGTCGATGGTTGCTTCACGTTCCTTCGCCAAATTGTGACACATCCAAAAACTTCTCACCTGTTTATGCAATGCTTGATGATATGCTTAGCTCTGCTCATGTCCTTCAAATGGTTGTGTTAGACGAATCTTATAGGTTAGTATGCATGGAGGATCCACTTGCTTCTGAGGTTTCAAGTCTGATACAAAATGAACGCTTTTGGGATGAAATGGAGGGAGTTCATTCACTTGTGAAAATGATCCGAGGGATGGCTCAAGAGATTGAAGCCGAAAGGCCACTGATTGGTCAATGCTTGCCTCTCTGGGAGGAGCTGAGAACAAAAGTGAAGGAATGGTGTGCTAAGTTCAGCATAGCTGAAGGGCCAGTGGAGAAAATTTTAGAAAAGCGGTTTAGGAAAAATTATCATCCAGCATGGTCTGCTGCATTTATACTGGACCCGCTTTACTTGAGGAGGGACATAAATGGGAAATATCTTCCACCCTTCAAGTGCCTTTCACAAGAGCAAGAAAAGGATGTTGATTCGCTTATTAACCGGTTGGTGTCCAGGGAAGAAGCTCATGTCGCATTCATGGAGCTTATGAAATGGAGATCCGAGGGGCTAGATCCACTTTATGCTCAGGCAGTTCAGGTCAAACAACTAGACCCTTTAACCGGAAAGATGAAAATTGCCAACCCACAGAGTAGGCGACTTGTCTGGGAAACTTGCCTAAGTGAGTTCAAGACCCTTGGTAAGGTTGCACTGAGGCTTATTTTCCTTCATTCAACATCTTGTGGCTACAAGTGTAAGTGTTCTATCATGAATTTGGTTTGCTCACATCGGCACTCGAGGATCGGCTTGGAGAGAGCTCAGAAGATGGTATTTGTTGCAGCTCATGCCAAGCTTGAAAGGAAAGACTTTTCTAATGAGGATGACAAAGATGCAGAACTATTTGCAATGGCGGATGGTGAAAATGACATGCTCAATGAGGTCTTTTCTGATGCACCCTCAATTACAGGGCTGGATGTGTTCGATCAAATTGAACCAGAGTTGTCGAAGTCAACCTCGGACTAGGTACTATCTTTGCAGCTGAAGATGCACTTCTCCGATAGAATCCCAGATATATCTCCTGTTGTGCCTACTTTTGTTGCTTTTTGAGCCCAACTGTGATGAGTGGATGTCCATCTCAATGCTGTTGTAGATGTCATAGTCAAAGAAAAGACATTTACAAGTTTCGTAGATTATTGGTTAGCCCTTGGCATGATTGCTAAAAAGTCTTGAGCAAAGGGCGAGCAAGGAGGCTGTCGGCATGAGTATGCACATAGCAAGGCCGAAGGGGATGTGGCAGCGAACTTGAATTGATGTTGCATGTATGGTTCAGAAAACATCTTTGACAACACAAAGCAGGTGGATATTGATTTAAGAGAATGGAGGAGGTTCTAGATCTCAAAGGATCCAGTATGCTAAATTTGCCAATTTATCTTTTATGATGGGTGTAACGGCTCAAATCCTCCAACGCGAGCAGATATTGTCCTCTCTCGACTTTTCCTTTTAGACTTCTCCTGAAGGTTTAAAACGTGTTTGCTAGAGAGAGGTTTCCACACCCTTATAAAGAATGCTTTGTTCTCCCCTCCAACCAACTTGGGATCTCACACAGGGTGTTGTCTCTAGTTTAACACCCAATGAAAAGCCACCACTAGTAGATATTGTCTTCTTTGAGTTTTCTCTTTCGAGTTCCCCTCAAAGTTTTTAAAACGTGTCTACTAGGGAGAAGAAGTTTTCATACCCTTACAAAGAATGCTTCATTCTACTCTCCAACGACGTGGGATCTCACACACAGGGTATTGTCTCTAGAAAAACTCACTATATACATCTATATTTTCCCATCTGCCGTAGTAAAACTTGTATGGTGGGAATGAAGTTGGTTGAATTAGTGCACTGAAATCTCACCTAAAAGGAGAATATAGTGGTGAGATTATGTGAATGATTTAGAGTAAGAAAAAATAAAATACTTTATCTTTTTGTTTGAATGACATTTTACTGTTGATTTCACATGCTAATCTACTTTTTTTGGTATGCACTTTTGGCCCTTTCCCTATGAACTTTGGTACAATGGAATTTCACTATGCATTTTTTCCCATCCCACCAAACCTTCATTTAATTGGTTCAGTGGCCCCCAGGAAGTGTATAAAAAAGCTAGGTAGAGAGATTCAGACTTTTCTAATAGGGAATTAATTATTCCTGTATTGCCAATACAATTAT
Coding sequence (CDS)
ATGACTTGCCTTGTAACTGTTCGAACGAAGGCAATTAAGGGGAAAGGGGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTTATACGAAATCCTAGTAATAGTCTTCCGAAAGCGGTGAAGCTCAAGTGTTCTTTGTGCGATTCTGTTTTCTCGGCTTCGAACCCCTCGCGGACTGCATCTGAGCATTTGAAACGAGGCACTTGTCCTAATTTGAGCTCCATTTCAAGGTCTAATGCTACGGCGTCACCGTTGCCGATATCGTCCATTCCTTCTCCGACATTGCACAACCACAAGAAGCGAAGCTCTCAAATGAATGCTCCGATTCTCACTGCTTCCTATCAGGTACATTCTCTTGCCATGATTGAGCCGACGCGTTCCTATGCTCCGTTAATTTCCTCGCCGCCGACGCCGGTGGCTCAAAATCCGGTTGGGATGGCGAGTAAGATGGAGGTGAATCAGCATCAGTTGGTGTTATCAGGTGGGAAAGATGATTTAGGTGCACTAGAAATGCTGGAAAACAGTGTCAAGAAACTGAGGAGTCCACATGCCTCACCTGGACCAAGGTTAAGTAAGGAACAAATTGATTCTGCTATCGAATTACTGACTGATTGGTTTATCGAGTCGTGTGGGTCAGTATCTCTTTCCTGCCTTGAGCATCCGAAGTTTAAAGCCTTGCTTAGTCAGATGGGCTTGCCTTCATTACTTCGAACCGACATTTTAGGAGCTCGGCTCGATTCCAAGTTTGAGGAGGCCAAAGCTGATTCAGAAGCCAGGATTAGAGATGCTTCGTTTTTCCAGATCGCTTCGGATGGGTGGAAGAATAAGAACTGCTGTGGCTATTACTGTGGCGAAGAGAGTGTAGTTAAATTTATGGTTAATCTTCCAAATGGTTCTACGATGTTCCAAAAAGCATTGTTTACAGGGGGATTGGTGTCACCCAAGTATGCGGAAGAGGTTATTTTAGATACGGTCAATGAGATTTGTGGGAATAGTCTGCAGAGATGTGTGGGGATAATAGCAGATAGGTATAAGGGCAAGGCATTGAGGAATTTGGAGATAAAGAATCATTGGATGGTAAATCTCTCTTGCCAGCTTCAGGGTTTTATTTGTTTGATAAAGGATTTTAACAAAGAGCTTCCACTTTTCAGGGTAGTCACTGAAAATTGCTTGAAGGTTGCAAACTTTGTAAACACCAAATCTCAAGTTAGGAATTGTTTAAACAAGTATAAGGTGCAGGAGCTAGAAGGTCGATGGTTGCTTCACGTTCCTTCGCCAAATTGTGACACATCCAAAAACTTCTCACCTGTTTATGCAATGCTTGATGATATGCTTAGCTCTGCTCATGTCCTTCAAATGGTTGTGTTAGACGAATCTTATAGGTTAGTATGCATGGAGGATCCACTTGCTTCTGAGGTTTCAAGTCTGATACAAAATGAACGCTTTTGGGATGAAATGGAGGGAGTTCATTCACTTGTGAAAATGATCCGAGGGATGGCTCAAGAGATTGAAGCCGAAAGGCCACTGATTGGTCAATGCTTGCCTCTCTGGGAGGAGCTGAGAACAAAAGTGAAGGAATGGTGTGCTAAGTTCAGCATAGCTGAAGGGCCAGTGGAGAAAATTTTAGAAAAGCGGTTTAGGAAAAATTATCATCCAGCATGGTCTGCTGCATTTATACTGGACCCGCTTTACTTGAGGAGGGACATAAATGGGAAATATCTTCCACCCTTCAAGTGCCTTTCACAAGAGCAAGAAAAGGATGTTGATTCGCTTATTAACCGGTTGGTGTCCAGGGAAGAAGCTCATGTCGCATTCATGGAGCTTATGAAATGGAGATCCGAGGGGCTAGATCCACTTTATGCTCAGGCAGTTCAGGTCAAACAACTAGACCCTTTAACCGGAAAGATGAAAATTGCCAACCCACAGAGTAGGCGACTTGTCTGGGAAACTTGCCTAAGTGAGTTCAAGACCCTTGGTAAGGTTGCACTGAGGCTTATTTTCCTTCATTCAACATCTTGTGGCTACAAGTGTAAGTGTTCTATCATGAATTTGGTTTGCTCACATCGGCACTCGAGGATCGGCTTGGAGAGAGCTCAGAAGATGGTATTTGTTGCAGCTCATGCCAAGCTTGAAAGGAAAGACTTTTCTAATGAGGATGACAAAGATGCAGAACTATTTGCAATGGCGGATGGTGAAAATGACATGCTCAATGAGGTCTTTTCTGATGCACCCTCAATTACAGGGCTGGATGTGTTCGATCAAATTGAACCAGAGTTGTCGAAGTCAACCTCGGACTAG
Protein sequence
MTCLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEHLKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGARLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQKALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLSCQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHVPSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFWDEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILEKRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVALRLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDAELFAMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD
Homology
BLAST of Cp4.1LG05g07210 vs. NCBI nr
Match:
XP_023533563.1 (uncharacterized protein LOC111795394 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023533564.1 uncharacterized protein LOC111795394 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1529 bits (3960), Expect = 0.0
Identity = 762/762 (100.00%), Postives = 762/762 (100.00%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE
Sbjct: 510 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD 764
ELFAMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD
Sbjct: 750 ELFAMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD 791
BLAST of Cp4.1LG05g07210 vs. NCBI nr
Match:
XP_022958247.1 (uncharacterized protein LOC111459531 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1523 bits (3944), Expect = 0.0
Identity = 758/762 (99.48%), Postives = 759/762 (99.61%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSS MNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSHMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRL CMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLACMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKI+E
Sbjct: 510 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKIVE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD 764
ELF MADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD
Sbjct: 750 ELFVMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD 791
BLAST of Cp4.1LG05g07210 vs. NCBI nr
Match:
XP_022995714.1 (uncharacterized protein LOC111491170 isoform X1 [Cucurbita maxima] >XP_022995715.1 uncharacterized protein LOC111491170 isoform X1 [Cucurbita maxima] >XP_022995716.1 uncharacterized protein LOC111491170 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1509 bits (3906), Expect = 0.0
Identity = 750/762 (98.43%), Postives = 756/762 (99.21%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAK DSEARIRDASFFQIASDGWKNKNCCGYYC EESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKVDSEARIRDASFFQIASDGWKNKNCCGYYCSEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLE+KNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEMKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRL CMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLACMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DE+EGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKI+E
Sbjct: 510 DEVEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKIVE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLI+RLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLIDRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGY CKCSI+NLVCSHRHSRIGLERAQKMVFVAAH KLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYTCKCSILNLVCSHRHSRIGLERAQKMVFVAAHTKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD 764
ELFAMADGENDMLNEVFSDAPSIT LDVFD+IEPELSKSTSD
Sbjct: 750 ELFAMADGENDMLNEVFSDAPSITRLDVFDRIEPELSKSTSD 791
BLAST of Cp4.1LG05g07210 vs. NCBI nr
Match:
XP_023533567.1 (uncharacterized protein LOC111795394 isoform X3 [Cucurbita pepo subsp. pepo] >XP_023533568.1 uncharacterized protein LOC111795394 isoform X3 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1495 bits (3870), Expect = 0.0
Identity = 743/745 (99.73%), Postives = 744/745 (99.87%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE
Sbjct: 510 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSITG 747
ELFAMADGENDMLNEVFSDAPS+ G
Sbjct: 750 ELFAMADGENDMLNEVFSDAPSMAG 774
BLAST of Cp4.1LG05g07210 vs. NCBI nr
Match:
XP_023533570.1 (uncharacterized protein LOC111795394 isoform X5 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1494 bits (3867), Expect = 0.0
Identity = 743/743 (100.00%), Postives = 743/743 (100.00%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE
Sbjct: 510 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSI 745
ELFAMADGENDMLNEVFSDAPSI
Sbjct: 750 ELFAMADGENDMLNEVFSDAPSI 772
BLAST of Cp4.1LG05g07210 vs. ExPASy TrEMBL
Match:
A0A6J1H1B8 (uncharacterized protein LOC111459531 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459531 PE=4 SV=1)
HSP 1 Score: 1523 bits (3944), Expect = 0.0
Identity = 758/762 (99.48%), Postives = 759/762 (99.61%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSS MNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSHMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRL CMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLACMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKI+E
Sbjct: 510 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKIVE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD 764
ELF MADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD
Sbjct: 750 ELFVMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD 791
BLAST of Cp4.1LG05g07210 vs. ExPASy TrEMBL
Match:
A0A6J1K2P3 (uncharacterized protein LOC111491170 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111491170 PE=4 SV=1)
HSP 1 Score: 1509 bits (3906), Expect = 0.0
Identity = 750/762 (98.43%), Postives = 756/762 (99.21%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAK DSEARIRDASFFQIASDGWKNKNCCGYYC EESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKVDSEARIRDASFFQIASDGWKNKNCCGYYCSEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLE+KNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEMKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRL CMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLACMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DE+EGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKI+E
Sbjct: 510 DEVEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKIVE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLI+RLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLIDRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGY CKCSI+NLVCSHRHSRIGLERAQKMVFVAAH KLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYTCKCSILNLVCSHRHSRIGLERAQKMVFVAAHTKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSITGLDVFDQIEPELSKSTSD 764
ELFAMADGENDMLNEVFSDAPSIT LDVFD+IEPELSKSTSD
Sbjct: 750 ELFAMADGENDMLNEVFSDAPSITRLDVFDRIEPELSKSTSD 791
BLAST of Cp4.1LG05g07210 vs. ExPASy TrEMBL
Match:
A0A6J1H4J7 (uncharacterized protein LOC111459531 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111459531 PE=4 SV=1)
HSP 1 Score: 1488 bits (3851), Expect = 0.0
Identity = 739/743 (99.46%), Postives = 740/743 (99.60%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSS MNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSHMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRL CMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLACMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKI+E
Sbjct: 510 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKIVE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSI 745
ELF MADGENDMLNEVFSDAPSI
Sbjct: 750 ELFVMADGENDMLNEVFSDAPSI 772
BLAST of Cp4.1LG05g07210 vs. ExPASy TrEMBL
Match:
A0A6J1K8T9 (uncharacterized protein LOC111491170 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111491170 PE=4 SV=1)
HSP 1 Score: 1479 bits (3828), Expect = 0.0
Identity = 733/745 (98.39%), Postives = 739/745 (99.19%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAK DSEARIRDASFFQIASDGWKNKNCCGYYC EESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKVDSEARIRDASFFQIASDGWKNKNCCGYYCSEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLE+KNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEMKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRL CMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLACMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DE+EGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKI+E
Sbjct: 510 DEVEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKIVE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLI+RLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLIDRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGY CKCSI+NLVCSHRHSRIGLERAQKMVFVAAH KLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYTCKCSILNLVCSHRHSRIGLERAQKMVFVAAHTKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSITG 747
ELFAMADGENDMLNEVFSDAPS+ G
Sbjct: 750 ELFAMADGENDMLNEVFSDAPSMAG 774
BLAST of Cp4.1LG05g07210 vs. ExPASy TrEMBL
Match:
A0A6J1K6Q8 (uncharacterized protein LOC111491170 isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111491170 PE=4 SV=1)
HSP 1 Score: 1477 bits (3825), Expect = 0.0
Identity = 733/743 (98.65%), Postives = 738/743 (99.33%), Query Frame = 0
Query: 3 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 62
CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH
Sbjct: 30 CLVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEH 89
Query: 63 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 122
LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE
Sbjct: 90 LKRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIE 149
Query: 123 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 182
PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH
Sbjct: 150 PTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLENSVKKLRSPH 209
Query: 183 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 242
ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA
Sbjct: 210 ASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPSLLRTDILGA 269
Query: 243 RLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVNLPNGSTMFQ 302
RLDSKFEEAK DSEARIRDASFFQIASDGWKNKNCCGYYC EESVVKFMVNLPNGSTMFQ
Sbjct: 270 RLDSKFEEAKVDSEARIRDASFFQIASDGWKNKNCCGYYCSEESVVKFMVNLPNGSTMFQ 329
Query: 303 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEIKNHWMVNLS 362
KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLE+KNHWMVNLS
Sbjct: 330 KALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEMKNHWMVNLS 389
Query: 363 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 422
CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV
Sbjct: 390 CQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQELEGRWLLHV 449
Query: 423 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVSSLIQNERFW 482
PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRL CMEDPLASEVSSLIQNERFW
Sbjct: 450 PSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLACMEDPLASEVSSLIQNERFW 509
Query: 483 DEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKILE 542
DE+EGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKI+E
Sbjct: 510 DEVEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIAEGPVEKIVE 569
Query: 543 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLINRLVSREEAHV 602
KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLI+RLVSREEAHV
Sbjct: 570 KRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLIDRLVSREEAHV 629
Query: 603 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 662
AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL
Sbjct: 630 AFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLSEFKTLGKVAL 689
Query: 663 RLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERKDFSNEDDKDA 722
RLIFLHSTSCGY CKCSI+NLVCSHRHSRIGLERAQKMVFVAAH KLERKDFSNEDDKDA
Sbjct: 690 RLIFLHSTSCGYTCKCSILNLVCSHRHSRIGLERAQKMVFVAAHTKLERKDFSNEDDKDA 749
Query: 723 ELFAMADGENDMLNEVFSDAPSI 745
ELFAMADGENDMLNEVFSDAPSI
Sbjct: 750 ELFAMADGENDMLNEVFSDAPSI 772
BLAST of Cp4.1LG05g07210 vs. TAIR 10
Match:
AT1G12380.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G62870.1); Has 173 Blast hits to 170 proteins in 34 species: Archae - 0; Bacteria - 4; Metazoa - 25; Fungi - 8; Plants - 123; Viruses - 7; Other Eukaryotes - 6 (source: NCBI BLink). )
HSP 1 Score: 851.7 bits (2199), Expect = 4.7e-247
Identity = 425/768 (55.34%), Postives = 573/768 (74.61%), Query Frame = 0
Query: 4 LVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEHL 63
L+TVRTKA+KGKGAWYW HLEP+L+RN LPKAVKL+CSLCD+VFSASNPSRTASEHL
Sbjct: 49 LMTVRTKAVKGKGAWYWTHLEPILVRNTDTGLPKAVKLRCSLCDAVFSASNPSRTASEHL 108
Query: 64 KRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRS---------SQMNAPILTASYQ 123
KRGTCPN +S++ +T +P P SS SP H+ K+ S S++N P + SY
Sbjct: 109 KRGTCPNFNSVT-PISTITPSPTSSSSSPQTHHRKRNSSGAVTTAIPSRLNPPPIGGSYH 168
Query: 124 VHSLAMIEPTRSYAPLI--SSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLE 183
V + +++P+R + S+PP P QH L+LSGGKDDLG L MLE
Sbjct: 169 VTPITVVDPSRFCGGELHYSTPPPP---------------QH-LMLSGGKDDLGPLAMLE 228
Query: 184 NSVKKLRSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLP 243
+SVKKL+SP S L++ QI+SA++ L+DW ESCGSVSLS LEHPKF+A L+Q+GLP
Sbjct: 229 DSVKKLKSPKPSQTQSLTRSQIESALDSLSDWVFESCGSVSLSGLEHPKFRAFLTQVGLP 288
Query: 244 SLLRTDILGARLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMV 303
+ + D RLD K EEA+A++E+RIRDA FFQI+SDGWK ES+V +V
Sbjct: 289 IISKRDFATTRLDLKHEEARAEAESRIRDAMFFQISSDGWKPGE------SGESLVNLIV 348
Query: 304 NLPNGSTMFQKALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLE 363
NLPNG++++++A+ G V YAEEV+L+TV ICGNS QRCVGI++D++K KALRNLE
Sbjct: 349 NLPNGTSLYRRAVLVNGAVPSNYAEEVLLETVKGICGNSPQRCVGIVSDKFKTKALRNLE 408
Query: 364 IKNHWMVNLSCQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQ 423
++ WMVNLSCQ QG LIKDF KELPLF+ V++NC+++A F+N +Q+RN KY++Q
Sbjct: 409 SQHQWMVNLSCQFQGLNSLIKDFVKELPLFKSVSQNCVRLAKFINNTAQIRNAHCKYQLQ 468
Query: 424 ELEGRWLLHVP--------SPNCDTSKN-------FSPVYAMLDDMLSSAHVLQMVVLDE 483
E +L +P +C +S + + P++ +L+D+LSSA +Q+VV D+
Sbjct: 469 EHGESIMLRLPLHCYYDDERRSCSSSSSGSNKVCFYEPLFNLLEDVLSSARAIQLVVHDD 528
Query: 484 SYRLVCMEDPLASEVSSLIQNERFWDEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWE 543
+ ++V MED +A EV ++ +E FW+E+E VH+L+K+++ MA+ IE E+ L+GQCLPLW+
Sbjct: 529 ACKVVLMEDHMAREVREMVGDEGFWNEVEAVHALIKLVKEMARRIEEEKLLVGQCLPLWD 588
Query: 544 ELRTKVKEWCAKFSIAEGPVEKILEKRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFK 603
ELR KVK+W +KF++ EG VEK++E+RF+K+YHPAW+AAFILDPLYL RD +GKYLPPFK
Sbjct: 589 ELRAKVKDWDSKFNVGEGHVEKVVERRFKKSYHPAWAAAFILDPLYLIRDSSGKYLPPFK 648
Query: 604 CLSQEQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKI 663
CLS EQEKDVD LI RLVSR+EAH+A MELMKWR+EGLDP+YA+AVQ+K+ DP++GKM+I
Sbjct: 649 CLSPEQEKDVDKLITRLVSRDEAHIALMELMKWRTEGLDPMYARAVQMKERDPVSGKMRI 708
Query: 664 ANPQSRRLVWETCLSEFKTLGKVALRLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERA 723
ANPQS RLVWET LSEF++LGKVA+RLIFLH+T+ G+KC S++ V S+ S ++RA
Sbjct: 709 ANPQSSRLVWETYLSEFRSLGKVAVRLIFLHATTGGFKCNSSLLKWVNSNGRSHAAVDRA 768
Query: 724 QKMVFVAAHAKLERKDFSNEDDKDAELFAMADGENDMLNEVFSDAPSI 746
QK++F++A++K ER+DFSNE+D+DAEL AMA+G++ MLN+V D S+
Sbjct: 769 QKLIFISANSKFERRDFSNEEDRDAELLAMANGDDHMLNDVLVDTSSV 793
BLAST of Cp4.1LG05g07210 vs. TAIR 10
Match:
AT1G62870.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G12380.1); Has 351 Blast hits to 343 proteins in 42 species: Archae - 2; Bacteria - 0; Metazoa - 27; Fungi - 5; Plants - 299; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )
HSP 1 Score: 841.6 bits (2173), Expect = 4.8e-244
Identity = 418/753 (55.51%), Postives = 561/753 (74.50%), Query Frame = 0
Query: 4 LVTVRTKAIKGKGAWYWAHLEPVLIRNPSNSLPKAVKLKCSLCDSVFSASNPSRTASEHL 63
L+ VRTKA+KGKGAWYW+HLEP+L+ N PKAVKL+CSLCD+VFSASNPSRTASEHL
Sbjct: 42 LMMVRTKAVKGKGAWYWSHLEPILLHNTDTGFPKAVKLRCSLCDAVFSASNPSRTASEHL 101
Query: 64 KRGTCPNLSSISRSNATASPLPISSIPSPTLHNHKKRSSQMNAPI----------LTASY 123
KRGTCPN +S+ + +T SP P P P +H+KR+S + SY
Sbjct: 102 KRGTCPNFNSLPKPISTISPSP----PPPPSSSHRKRNSSAVEALNHHHHHPHHHHQGSY 161
Query: 124 QVHSLAMIEPTRSYAPLISSPPTPVAQNPVGMASKMEVNQHQLVLSGGKDDLGALEMLEN 183
V L++++P+R PV Q P L+LSGGKDDLG L MLE+
Sbjct: 162 NVTPLSVVDPSRFCGQF------PVTQQP------------HLMLSGGKDDLGPLAMLED 221
Query: 184 SVKKLRSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLSQMGLPS 243
SVKKL+SP S L+K QIDSA++ L+DW ESCGSVSLS LEHPK +A L+Q+GLP
Sbjct: 222 SVKKLKSPKTSQTRNLTKAQIDSALDSLSDWVFESCGSVSLSGLEHPKLRAFLTQVGLPI 281
Query: 244 LLRTDILGARLDSKFEEAKADSEARIRDASFFQIASDGWKNKNCCGYYCGEESVVKFMVN 303
+ R D + RLD K+E+++A++E+RI DA FFQIASDGWK + E++V +VN
Sbjct: 282 ISRRDFVTGRLDLKYEDSRAEAESRIHDAMFFQIASDGWK------FDSSGENLVNLIVN 341
Query: 304 LPNGSTMFQKALFTGGLVSPKYAEEVILDTVNEICGNSLQRCVGIIADRYKGKALRNLEI 363
LPNG++++++A+F G V YAEEV+ +TV ICGNS QRCVGI++DR+ KALRNLE
Sbjct: 342 LPNGTSLYRRAVFVNGAVPSNYAEEVLWETVRGICGNSPQRCVGIVSDRFMSKALRNLES 401
Query: 364 KNHWMVNLSCQLQGFICLIKDFNKELPLFRVVTENCLKVANFVNTKSQVRNCLNKYKVQE 423
++ WMVNLSCQ QGF LI+DF KELPLF+ V+++C ++ NFVN+ +Q+RN + KY++QE
Sbjct: 402 QHQWMVNLSCQFQGFNSLIRDFVKELPLFKSVSQSCSRLVNFVNSTAQIRNAVCKYQLQE 461
Query: 424 LEGRWLLHVPSPNCDTSKNFSPVYAMLDDMLSSAHVLQMVVLDESYRLVCMEDPLASEVS 483
+LH+P S F P+Y +L+D+LS A +Q+V+ D+ + V MED +A EV
Sbjct: 462 QGETRMLHLPL----DSSLFEPLYNLLEDVLSFARAIQLVMHDDVCKAVLMEDHMAREVG 521
Query: 484 SLIQNERFWDEMEGVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFSIA 543
++ + FW+E+E V+ L+K+++ MA+ IE ERPL+GQCLPLW+ELR+K+K+W AKF++
Sbjct: 522 EMVGDVGFWNEVEAVYLLLKLVKEMARRIEEERPLVGQCLPLWDELRSKIKDWYAKFNVV 581
Query: 544 -EGPVEKILEKRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQEQEKDVDSLIN 603
E VEKI+E+RF+K+YHPAW+AAFILDPLYL +D +GKYLPPFKCLS EQEKDVD LI
Sbjct: 582 EERQVEKIVERRFKKSYHPAWAAAFILDPLYLIKDSSGKYLPPFKCLSPEQEKDVDKLIT 641
Query: 604 RLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQLDPLTGKMKIANPQSRRLVWETCLS 663
RLVSR+EAH+A MELMKWR+EGLDP+YA+AVQ+K+ DP++GKM+IANPQS RLVWET LS
Sbjct: 642 RLVSRDEAHIAMMELMKWRTEGLDPVYARAVQMKERDPVSGKMRIANPQSSRLVWETYLS 701
Query: 664 EFKTLGKVALRLIFLHSTSCGYKCKCSIMNLVCSHRHSRIGLERAQKMVFVAAHAKLERK 723
EF++LG+VA+RLIFLH+TSCG+KC S++ V S+ SR ++RAQK++F++A++K ER+
Sbjct: 702 EFRSLGRVAVRLIFLHATSCGFKCNSSVLRWVNSNGRSRAAVDRAQKLIFISANSKFERR 761
Query: 724 DFSNEDDKDAELFAMADGENDMLNEVFSDAPSI 746
DFSNE+++DAEL AMA+GE+D+LN+V D S+
Sbjct: 762 DFSNEEERDAELLAMANGEDDVLNDVLIDTSSV 762
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023533563.1 | 0.0 | 100.00 | uncharacterized protein LOC111795394 isoform X1 [Cucurbita pepo subsp. pepo] >XP... | [more] |
XP_022958247.1 | 0.0 | 99.48 | uncharacterized protein LOC111459531 isoform X1 [Cucurbita moschata] | [more] |
XP_022995714.1 | 0.0 | 98.43 | uncharacterized protein LOC111491170 isoform X1 [Cucurbita maxima] >XP_022995715... | [more] |
XP_023533567.1 | 0.0 | 99.73 | uncharacterized protein LOC111795394 isoform X3 [Cucurbita pepo subsp. pepo] >XP... | [more] |
XP_023533570.1 | 0.0 | 100.00 | uncharacterized protein LOC111795394 isoform X5 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1H1B8 | 0.0 | 99.48 | uncharacterized protein LOC111459531 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1K2P3 | 0.0 | 98.43 | uncharacterized protein LOC111491170 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1H4J7 | 0.0 | 99.46 | uncharacterized protein LOC111459531 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1K8T9 | 0.0 | 98.39 | uncharacterized protein LOC111491170 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1K6Q8 | 0.0 | 98.65 | uncharacterized protein LOC111491170 isoform X4 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
AT1G12380.1 | 4.7e-247 | 55.34 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT1G62870.1 | 4.8e-244 | 55.51 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |