Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GTAGTAGTGTAAGATCAAATAATGGATAAAGAAGTCTCTCCAATCCAAAATATAATGCTCGTAAACAACTAAAATTTAAATTAATCGAATATAGATATATAGATGAAATAATGTTCTAGTGCATGACAAATATTATGAGAATATTGCTTATTCGTCATATAGTTGTAGTGTCGTCGGAGGCCTCCTTGCATCTCTACATATTGATTGAAAAACACAAAAGTAACTCTTGCTTTACACCAAATTCTCCAATATATAGATTTATTTTATTGCTCACAATTTAATTAGATTTTATGTTACATAAAAACACTAAGAGAGAAAATAATTGAGAGTGTGAACATGAGCCACAAGGAAGATGAGGGAGAATGTGCAAGTGATAGAAGGAAAGAAAGAAATTTACATAGCTTTACAACAGAGCAAATGAATGGTCATACATTTTTTACTAATTTAATTTTGTGTGAAGACAGTTTTAAGATACTAAAAAGAAATTAATTAATTAAATTAATTGAGAGAATGATAGATGTAACCGTTAGTTGAGAAAGAGTAGATGCATTTATTTATTGAAAACTCATAGAACAAAGTTAAAATTGCAATTATAATAAAAGGAACGTAGTAAATAAAGAATCAAACTCTGAAAATGGCAAAATGGGAAGAAGTTTCTTATCAAAATATAATGGGAAGCCGCCGTAACTTTTTCTTACATTTATCCTCTCCCCATACCAATTAGTATTGTATGAGTATCAACATTTCACGTTTCAGAAAACCACTAGCCCAATGACATATCTAATAGCATAAAAAGCGTAAAACAAATATCACTTAGAATCTCCATCAAATTTCAAACACATTCTTATGTGTGTCCACAAACTTGCACACATGCATACATAATATATAAATGAAAATAAGATTTTGGAAGGCTCGTATCTTTTTCCATGAACAATAATAAAATACTTCAGGCTGTGTTTTTTAATTAAAAATAAACAAATCAAAATCAATCACCCAAAATCCTGCCCTCTTTAGTGGAAGCCCACTGACAAACTGCAAGGGTCAACAAAATAGCCCAGAAAATTTAGTTGGGGAAGGCATCACTTAAAAACCTTATCATTTATTTCTTACTATAGAAAAGTGGGAAAAAATGTCAGGAAAGAAAGATTTTTTGCTTTTACCACATTTACACTTCATAAGAGTAACCCAAGACCAAAATTCAAATAGGAAGATTATTACAACTCATCACTCTCACTGTATTTAAATTTCTGACTTCTGAGTTCATATCAGGAAACATGTTTTACAACAAATCACTTCTGTTTTGCTCTTTAAAATACTTTGTTTAAACCCTTTCGAAATCAAATCGCCACGGATAATCTTCAGGTTTCGTGAAGAAGAAGGGTTCTTCTGTCTAGATTTGACCCTCCTGCTACTTGGTTTTTGTTTCTTTTTTAAAAAAAACAGAGTTACTGTTTTAGCACTCTGCAGAAACACGTATGGAAAAAACCAAGTGTCCCCCATTTATGACAACTATGATCTTCCATTGATTTTCTTGTCTTACAGAAAAAAATAAAATTAAATATAAACAAAACCCATTTAATTGAAAATTAAATTTGAGCTCCAAATTTTGGACTTCTTTTCTCTCTCTGTTATGAATCCTCTCCGTTTTAAGCTGCCACGAAGTCAATGCACCGACTTCTCTGTTTCTATTTCGTTCAGTGTTCAGTTTCTTCACATGGGGTTTGGTATTTCTCTACAAATTCTGTGTGTCTTTTCTCTTTGTTGTTTGGATTTTCATTTTTCCACCTATTTCTTTTCTGGGTTTGGTAAGTTCTCTCTGTTTAAGTCATTTTTCTCTTCTGGGCTTGATCTTCATTTACAGAATCTTGCAAGAAGATCAAAGGAAGATAGCTTTGTTTTTCATATCAACTGTGTTCAAATCTTCTTGATTATGTTGTAAAGTTTGAAAATTTCAACCTTCTCCCCACGTTGTTGTTTATTTTGGCTCATCTGAAGAACCG
TTTTGCTTTGTTTTCATTTGAACACTGCCGTCCGAGCATTAGAGGATTGTTTATTCTTTAAATTAAAACCGTGATATTATAAGGGAAAGGGGATGAGTAGAAACTGATTTTGTTGATGTAATTTTTCCATTGAATAAAAATGCCTCTAGACAGTGTAAAATCAGTAGTTTATAGATCATTTATCACTTGTGATGATCCAAAAGGGGTTGTTGATTGCAGTTTAATGAAGATATCCAAAATGAATTCTCGGAAACTAGAACAAAAGATTAGAGCCCATAGGACAAGTAGGAATTCAAGTAAGGGTTTAGTATCTGATTTAGAGAAAGAGGAGCTGATCTCTAAAAAAATGAGAGAGAGAATTCATGGCCAATCTTCTATTCCATTCATGGAGGTTTGTCAGGGAGCTGAGAAGTTGAATCATATGGTTGGTTCATGGTCTAAGGGGATGAGATCTGAGAGCAAAACTGAGAAGATTGCTGAGGATTTGTTGGAAGAAACTTCAAGTTTAAGAGATTCACTGATTATGTTGGCTAAGTTGCAAGAAGCTTCGAATAAGTCGATTCGGTTGAAGAGGACATATCCAAGAAGTTTTTCTTCTCATCTTGAAGATGAGTGTTTTCCAGTTGAGGTTCAAAGATCAAAGCTTTCTACACATGGTTCTTCCAGAACTGGTGCTGATGAGGTTAAGAAGATGATTGGAAACAGCCCAGTGAAGCGAGATTCAGTGCGTAATGTTACAGTTGGTGAACATAAATCTTGTTTTTGTGATATAAATTCCAACTTGGATTCGGAAATTTCGTTGACTAGTTCCAGCCAATCTTCAATGATTGATGATAATGTCAACTGTTCTCATGGTACAACATCTCAACAGAATTTGAAACGTAATAACTTGATTGCTAAGCTTATGGGGCTTGAAGAAATTCCATCAAGATCAATGCAAATTACTCAGAAGAAAGAGTTTGAATTAAAAAAAGTCTGTGGTTACAAAGCATCTCTCTTTGGCGTCGATGCAACGTTGAATATGCCGAAGTCTAAATCTGTCATCAACAAGGAGGATCATCGAAAAGGAACCTTGAGAGAAATACTTGAAAAAATGCCTGTCAACAGGCTTAGAGAGAGTGATTCTGATATAGAGTTTAAGATTCATTGCTCAAATTCCTACAATAATGGTTCCAAACAGAGGTTGAAAGACGGACTGCCAATTGTATTGATAAAACACAAGCCTCTTCCATCTGAGAAATTTGAGGAACACCGACATGTCTCCTCAAAAGATGATGCTTTTGACCAAAAAACTAGGCTAAGAAGTACGAAAAAGAAAGAACTTTGGTCGGTTGAAGATTTTGATTTCCATGGTGGAATTGTGAGTTCAGATAAATTGCACAGCAAACAAAAGGGAGAAGGGACTCCAGTCAAACAGATTGCAGAGAAGTTGAAAATATCCAATCCAATGCCTGATATGCGCCATGAGAAAGAACCAATTGATAGGAAAGTTCTTACATCAAAGAAACTGACTAAACCAGTGGAAAAAGAATTTCCAAAAGAAAAAGTGGTGTCAAGACCAAAACATCAAGAAAAAGTGACATCCACTAATCCAAGGAAGAACAGAACTCACAAACAACGTAGCTCCATTCAAGATTCTGTGCCAGGACAAGCAGTGAGAGCGATATCTAACAATCGCGACTGTCAGAAAAAGGAGGAACTAGTACTTCCCCATTCAGAAGTTAATTCATTTGTGAGTTCTGAACTATCCTTTAAATTTTGGTACATCAAACTCTTTTTCTTTTATGCATCAGTATCCTTTTTCTCCCCTTGTAAAAGGAGAGTGAAGCAGTTTTTCTCTTACAGACTCACATGGTTGAAGTCAAGAAAGATGATGAGATTACTGATACGAATGAAAGTGTTGATCTTCAGATAAATCGAAACACTACTACCCTTATGGCTTTAATTACTATGGAGAACGAAATGGACAAATGTGATACAAAGATTATAGGTAAGTTTTC
CATCACAAATAGGATTAGTTGCAATGTCTACTGTTATTGACCAATTCATCTTAAAGGGTTGGAAATGCATTCATTCTCTATCCTCATGCTTGTATGAATCTCATATATCTTCATCCAAGAGAGATTTTTAGTGAAATTTAATAACTTTGTCTTCCTAAGTAGAACAAGTACATGAAGTTGTCTAAATGTTTGTGATTTATACTTAGTTCTCTCATCAATCTTGTAGAAGGCTGCCATGAGAACCCAAACTCTCTGTCGCCATTGAGCCCCAAACTCGATATCAATACAAGTACTGTTGAAGAAATTGATCACAATGGTCATACTGAAGCAGATACCAAAAGCTGCAATCAAGGGACCAATCTGAAAGCATTACTTTTGAAAAGTTCATCATTCCTCTGTCATGCAGAGGAGCTTTTTGATCTCCATCTAAATGGTAGAACAATGCCGCAGGCAGCATCTCGCTGCAATGATCCTGAATCCCTGAATACGAAACTTTTTGTAGATTGTGCAATAGAACTTGTCGACCGTAAAGGCCATTATAACTTACCAGTAGGCAATTCTTTAGTACTAGGAGATAAGAGCAATACAAAGATAGAGATTTCCATAGAAAAACTTGTTGAAGAAGTGAACGACGACATCGAAACTCTAACGAGTTACCAAACAATATGTGGCAATAATCTTATTGTAGATACTCTATATGCAGTGTTGAGTAGAGACCTATGGTGCAAAGAAGTGATGAATGGAATGTGGGATATTGGTTGGAAGAATGAATTTTCAAGCAGTGAAAGTGAGGAAGTTGTAAATGATATAGAAATGATGATTTTGAGTGGACTGATTGAAGAGTCCTTTACATAACATCAGAAAATTGTATCTTCTGTATTGTATATACAAGAGGAAAAAAAAACTTTCAATTACTACACCTATAATCTTTATTCCAATGCATATTTGAGAAATAGGTTGACGTGACACACCAAAAAAGGTGCTTTACTTAGCCAACAGAATAACTCGACAAAAGTCTATTTGAGATACAAATTACTTTAGTTTGTGTAATGATACTATAGTCAGTCAGAGAATGGTTAACAGAATAATAAACATTTTTCAAATAATTATACTGTGTAAAAAGTGTTAATTATACTGTGTAACAAGTGTCAGATTAGTTGGAAACATTAATTATTTGGACGAGAAACTAAAACGATAGTGAATTATAAGTTAATTTTAATACCCAGAGGTGTCATAAAAGAAGTTTAAAAAGAGATGTTAAGCACTGAGACATTTCAAGGTAGTCGAATGAAGGGTGTGGAAGTTTGATAGCATTTGTGATCATTTTTTGGGAAAATAAAATTGAAGGATTTATGAATGCAAATTGAAATGATTGATTTGAGATTGCAAAAATACAAAAAGTTCACATACTCAAACTGACCCAATGATTTGTGACGTTTGGAGAACAACAATGATTCCCTATTTCCCATCCCACACGTTAAAGAAGGCGACCTTTGCTCTGTGGTTCTCTAAAAAGAAAAGTATAAGTGTTATCCTTTGTTTTTTAAGTCATTGTCTACTTAATCATCTAGCGTTATAGAATTATCAAATAGATTTTAGATTAAAAGGAAAAGATAGCTTTATTTGATAAGTGTGACAAATAGAGATTACTTGGGTTTTTTTTTAAGTTAAATATAGTTTATACTACACTTAATGTTGTCACACGGAAGATTTTTATGTTGGACTGTATAGGGGGTTGCAAGAGGGAAAACTCAAAGTCAACTTGATGGTTGTAAGAATAGGGCATAACCACTATGTCACAGTTCTTTTATTTAAAATAAAACATCATTTAATATATATTGTAATTTAACACTTTAAAATTAATATTAAAATACAAAAATCGATCATGTAAAACAGGACATGTGCTCCCGCTGGCACTTTGGTAAATCCACCCGTGAACTAATTTGACTAGAAACTTATTTATACATATTTTCATAATTTATAAATATAAAAAAAATGAT
CCAAAAAATTTATAAAATACAGCAAAAACAATAAACATGATAAATATCAATAAACTTCTATTAATATTTTGATATTTAACAACGTGCATTGTTGATGAAATTTTGCTATATTTTATAAATTATTATATTTTTTACTATTTCACTTATATTTTTTAAAATTCATGGTTTGTTTGATGATTATTTATTTTTAAAACAAGTATACACGAAGGGATGTTACTTTCTGGTGGGGATGGAGCTCAACGTAGAGCCATTTAGGAAATGAGAGATGGGGTGGGGAAAAGAATTCTTTGAGAATTAAATGGGGACGGGGAGGAGACCCGTCCCTACCCCGTTAGGCTTGTTTCTTATATTTATTATAATAAGAAATATAAGTAGGTTAGTGTTATATTTAAAGGCATTCTTAATTAAGTTTGGTTATTTTATTATCAGACGTATTTACCCTTAATGAAAAATTATATAAAAAAAAATATCATAGGTATTAATTTTTTGTTTCTGACAAGTATAAATAGTTTAGGTTGCACAAAAACTTTTTAAAATCA
mRNA sequence
GTAGTAGTGTAAGATCAAATAATGGATAAAGAAGTCTCTCCAATCCAAAATATAATGCTCGTAAACAACTAAAATTTAAATTAATCGAATATAGATATATAGATGAAATAATGTTCTAGTGCATGACAAATATTATGAGAATATTGCTTATTCGTCATATAGTTGTAGTGTCGTCGGAGGCCTCCTTGCATCTCTACATATTGATTGAAAAACACAAAAGTAACTCTTGCTTTACACCAAATTCTCCAATATATAGATTTATTTTATTGCTCACAATTTAATTAGATTTTATGTTACATAAAAACACTAAGAGAGAAAATAATTGAGAGTGTGAACATGAGCCACAAGGAAGATGAGGGAGAATGTGCAAGTGATAGAAGGAAAGAAAGAAATTTACATAGCTTTACAACAGAGCAAATGAATGGTCATACATTTTTTACTAATTTAATTTTGTGTGAAGACAGTTTTAAGATACTAAAAAGAAATTAATTAATTAAATTAATTGAGAGAATGATAGATGTAACCGTTAGTTGAGAAAGAGTAGATGCATTTATTTATTGAAAACTCATAGAACAAAGTTAAAATTGCAATTATAATAAAAGGAACGTAGTAAATAAAGAATCAAACTCTGAAAATGGCAAAATGGGAAGAAGTTTCTTATCAAAATATAATGGGAAGCCGCCGTAACTTTTTCTTACATTTATCCTCTCCCCATACCAATTAGTATTGTATGAGTATCAACATTTCACGTTTCAGAAAACCACTAGCCCAATGACATATCTAATAGCATAAAAAGCGTAAAACAAATATCACTTAGAATCTCCATCAAATTTCAAACACATTCTTATGTGTGTCCACAAACTTGCACACATGCATACATAATATATAAATGAAAATAAGATTTTGGAAGGCTCGTATCTTTTTCCATGAACAATAATAAAATACTTCAGGCTGTGTTTTTTAATTAAAAATAAACAAATCAAAATCAATCACCCAAAATCCTGCCCTCTTTAGTGGAAGCCCACTGACAAACTGCAAGGGTCAACAAAATAGCCCAGAAAATTTAGTTGGGGAAGGCATCACTTAAAAACCTTATCATTTATTTCTTACTATAGAAAAGTGGGAAAAAATGTCAGGAAAGAAAGATTTTTTGCTTTTACCACATTTACACTTCATAAGAGTAACCCAAGACCAAAATTCAAATAGGAAGATTATTACAACTCATCACTCTCACTGTATTTAAATTTCTGACTTCTGAGTTCATATCAGGAAACATGTTTTACAACAAATCACTTCTGTTTTGCTCTTTAAAATACTTTGTTTAAACCCTTTCGAAATCAAATCGCCACGGATAATCTTCAGGTTTCGTGAAGAAGAAGGGTTCTTCTGTCTAGATTTGACCCTCCTGCTACTTGGTTTTTGTTTCTTTTTTAAAAAAAACAGAGTTACTGTTTTAGCACTCTGCAGAAACACGTATGGAAAAAACCAAGTGTCCCCCATTTATGACAACTATGATCTTCCATTGATTTTCTTGTCTTACAGAAAAAAATAAAATTAAATATAAACAAAACCCATTTAATTGAAAATTAAATTTGAGCTCCAAATTTTGGACTTCTTTTCTCTCTCTGTTATGAATCCTCTCCGTTTTAAGCTGCCACGAAGTCAATGCACCGACTTCTCTGTTTCTATTTCGTTCAGTGTTCAGTTTCTTCACATGGGGTTTGGTATTTCTCTACAAATTCTGTGTGTCTTTTCTCTTTGTTGTTTGGATTTTCATTTTTCCACCTATTTCTTTTCTGGGTTTGGTAAGTTCTCTCTGTTTAAGTCATTTTTCTCTTCTGGGCTTGATCTTCATTTACAGAATCTTGCAAGAAGATCAAAGGAAGATAGCTTTGTTTTTCATATCAACTGTGTTCAAATCTTCTTGATTATGTTGTAAAGTTTGAAAATTTCAACCTTCTCCCCACGTTGTTGTTTATTTTGGCTCATCTGAAGAACCG
TTTTGCTTTGTTTTCATTTGAACACTGCCGTCCGAGCATTAGAGGATTGTTTATTCTTTAAATTAAAACCGTGATATTATAAGGGAAAGGGGATGAGTAGAAACTGATTTTGTTGATGTAATTTTTCCATTGAATAAAAATGCCTCTAGACAGTGTAAAATCAGTAGTTTATAGATCATTTATCACTTGTGATGATCCAAAAGGGGTTGTTGATTGCAGTTTAATGAAGATATCCAAAATGAATTCTCGGAAACTAGAACAAAAGATTAGAGCCCATAGGACAAGTAGGAATTCAAGTAAGGGTTTAGTATCTGATTTAGAGAAAGAGGAGCTGATCTCTAAAAAAATGAGAGAGAGAATTCATGGCCAATCTTCTATTCCATTCATGGAGGTTTGTCAGGGAGCTGAGAAGTTGAATCATATGGTTGGTTCATGGTCTAAGGGGATGAGATCTGAGAGCAAAACTGAGAAGATTGCTGAGGATTTGTTGGAAGAAACTTCAAGTTTAAGAGATTCACTGATTATGTTGGCTAAGTTGCAAGAAGCTTCGAATAAGTCGATTCGGTTGAAGAGGACATATCCAAGAAGTTTTTCTTCTCATCTTGAAGATGAGTGTTTTCCAGTTGAGGTTCAAAGATCAAAGCTTTCTACACATGGTTCTTCCAGAACTGGTGCTGATGAGGTTAAGAAGATGATTGGAAACAGCCCAGTGAAGCGAGATTCAGTGCGTAATGTTACAGTTGGTGAACATAAATCTTGTTTTTGTGATATAAATTCCAACTTGGATTCGGAAATTTCGTTGACTAGTTCCAGCCAATCTTCAATGATTGATGATAATGTCAACTGTTCTCATGGTACAACATCTCAACAGAATTTGAAACGTAATAACTTGATTGCTAAGCTTATGGGGCTTGAAGAAATTCCATCAAGATCAATGCAAATTACTCAGAAGAAAGAGTTTGAATTAAAAAAAGTCTGTGGTTACAAAGCATCTCTCTTTGGCGTCGATGCAACGTTGAATATGCCGAAGTCTAAATCTGTCATCAACAAGGAGGATCATCGAAAAGGAACCTTGAGAGAAATACTTGAAAAAATGCCTGTCAACAGGCTTAGAGAGAGTGATTCTGATATAGAGTTTAAGATTCATTGCTCAAATTCCTACAATAATGGTTCCAAACAGAGGTTGAAAGACGGACTGCCAATTGTATTGATAAAACACAAGCCTCTTCCATCTGAGAAATTTGAGGAACACCGACATGTCTCCTCAAAAGATGATGCTTTTGACCAAAAAACTAGGCTAAGAAGTACGAAAAAGAAAGAACTTTGGTCGGTTGAAGATTTTGATTTCCATGGTGGAATTGTGAGTTCAGATAAATTGCACAGCAAACAAAAGGGAGAAGGGACTCCAGTCAAACAGATTGCAGAGAAGTTGAAAATATCCAATCCAATGCCTGATATGCGCCATGAGAAAGAACCAATTGATAGGAAAGTTCTTACATCAAAGAAACTGACTAAACCAGTGGAAAAAGAATTTCCAAAAGAAAAAGTGGTGTCAAGACCAAAACATCAAGAAAAAGTGACATCCACTAATCCAAGGAAGAACAGAACTCACAAACAACGTAGCTCCATTCAAGATTCTGTGCCAGGACAAGCAGTGAGAGCGATATCTAACAATCGCGACTGTCAGAAAAAGGAGGAACTAGTACTTCCCCATTCAGAAGTTAATTCATTTACTCACATGGTTGAAGTCAAGAAAGATGATGAGATTACTGATACGAATGAAAGTGTTGATCTTCAGATAAATCGAAACACTACTACCCTTATGGCTTTAATTACTATGGAGAACGAAATGGACAAATGTGATACAAAGATTATAGAAGGCTGCCATGAGAACCCAAACTCTCTGTCGCCATTGAGCCCCAAACTCGATATCAATACAAGTACTGTTGAAGAAATTGATCACAATGGTCATACTGAAGCAGATACCAAAAGCTGCAATC
AAGGGACCAATCTGAAAGCATTACTTTTGAAAAGTTCATCATTCCTCTGTCATGCAGAGGAGCTTTTTGATCTCCATCTAAATGGTAGAACAATGCCGCAGGCAGCATCTCGCTGCAATGATCCTGAATCCCTGAATACGAAACTTTTTGTAGATTGTGCAATAGAACTTGTCGACCGTAAAGGCCATTATAACTTACCAGTAGGCAATTCTTTAGTACTAGGAGATAAGAGCAATACAAAGATAGAGATTTCCATAGAAAAACTTGTTGAAGAAGTGAACGACGACATCGAAACTCTAACGAGTTACCAAACAATATGTGGCAATAATCTTATTGTAGATACTCTATATGCAGTGTTGAGTAGAGACCTATGGTGCAAAGAAGTGATGAATGGAATGTGGGATATTGGTTGGAAGAATGAATTTTCAAGCAGTGAAAGTGAGGAAGTTGTAAATGATATAGAAATGATGATTTTGAGTGGACTGATTGAAGAGTCCTTTACATAACATCAGAAAATTGTATCTTCTGTATTGTATATACAAGAGGAAAAAAAAACTTTCAATTACTACACCTATAATCTTTATTCCAATGCATATTTGAGAAATAGGTTGACGTGACACACCAAAAAAGGTGCTTTACTTAGCCAACAGAATAACTCGACAAAAGTCTATTTGAGATACAAATTACTTTAGTTTGTGTAATGATACTATAGTCAGTCAGAGAATGGTTAACAGAATAATAAACATTTTTCAAATAATTATACTGTGTAAAAAGTGTTAATTATACTGTGTAACAAGTGTCAGATTAGTTGGAAACATTAATTATTTGGACGAGAAACTAAAACGATAGTGAATTATAAGTTAATTTTAATACCCAGAGGTGTCATAAAAGAAGTTTAAAAAGAGATGTTAAGCACTGAGACATTTCAAGGTAGTCGAATGAAGGGTGTGGAAGTTTGATAGCATTTGTGATCATTTTTTGGGAAAATAAAATTGAAGGATTTATGAATGCAAATTGAAATGATTGATTTGAGATTGCAAAAATACAAAAAGTTCACATACTCAAACTGACCCAATGATTTGTGACGTTTGGAGAACAACAATGATTCCCTATTTCCCATCCCACACGTTAAAGAAGGCGACCTTTGCTCTGTGGTTCTCTAAAAAGAAAAGTATAAGTGTTATCCTTTGTTTTTTAAGTCATTGTCTACTTAATCATCTAGCGTTATAGAATTATCAAATAGATTTTAGATTAAAAGGAAAAGATAGCTTTATTTGATAAGTGTGACAAATAGAGATTACTTGGGTTTTTTTTTAAGTTAAATATAGTTTATACTACACTTAATGTTGTCACACGGAAGATTTTTATGTTGGACTGTATAGGGGGTTGCAAGAGGGAAAACTCAAAGTCAACTTGATGGTTGTAAGAATAGGGCATAACCACTATGTCACAGTTCTTTTATTTAAAATAAAACATCATTTAATATATATTGTAATTTAACACTTTAAAATTAATATTAAAATACAAAAATCGATCATGTAAAACAGGACATGTGCTCCCGCTGGCACTTTGGTAAATCCACCCGTGAACTAATTTGACTAGAAACTTATTTATACATATTTTCATAATTTATAAATATAAAAAAAATGATCCAAAAAATTTATAAAATACAGCAAAAACAATAAACATGATAAATATCAATAAACTTCTATTAATATTTTGATATTTAACAACGTGCATTGTTGATGAAATTTTGCTATATTTTATAAATTATTATATTTTTTACTATTTCACTTATATTTTTTAAAATTCATGGTTTGTTTGATGATTATTTATTTTTAAAACAAGTATACACGAAGGGATGTTACTTTCTGGTGGGGATGGAGCTCAACGTAGAGCCATTTAGGAAATGAGAGATGGGGTGGGGAAAAGAATTCTTTGAGAATTAAATGGGGACGGGGAGGAGACCCGTCCCTACCCCGTTAGGCTTGTTTCTTATA
TTTATTATAATAAGAAATATAAGTAGGTTAGTGTTATATTTAAAGGCATTCTTAATTAAGTTTGGTTATTTTATTATCAGACGTATTTACCCTTAATGAAAAATTATATAAAAAAAAATATCATAGGTATTAATTTTTTGTTTCTGACAAGTATAAATAGTTTAGGTTGCACAAAAACTTTTTAAAATCA
Coding sequence (CDS)
ATGCCTCTAGACAGTGTAAAATCAGTAGTTTATAGATCATTTATCACTTGTGATGATCCAAAAGGGGTTGTTGATTGCAGTTTAATGAAGATATCCAAAATGAATTCTCGGAAACTAGAACAAAAGATTAGAGCCCATAGGACAAGTAGGAATTCAAGTAAGGGTTTAGTATCTGATTTAGAGAAAGAGGAGCTGATCTCTAAAAAAATGAGAGAGAGAATTCATGGCCAATCTTCTATTCCATTCATGGAGGTTTGTCAGGGAGCTGAGAAGTTGAATCATATGGTTGGTTCATGGTCTAAGGGGATGAGATCTGAGAGCAAAACTGAGAAGATTGCTGAGGATTTGTTGGAAGAAACTTCAAGTTTAAGAGATTCACTGATTATGTTGGCTAAGTTGCAAGAAGCTTCGAATAAGTCGATTCGGTTGAAGAGGACATATCCAAGAAGTTTTTCTTCTCATCTTGAAGATGAGTGTTTTCCAGTTGAGGTTCAAAGATCAAAGCTTTCTACACATGGTTCTTCCAGAACTGGTGCTGATGAGGTTAAGAAGATGATTGGAAACAGCCCAGTGAAGCGAGATTCAGTGCGTAATGTTACAGTTGGTGAACATAAATCTTGTTTTTGTGATATAAATTCCAACTTGGATTCGGAAATTTCGTTGACTAGTTCCAGCCAATCTTCAATGATTGATGATAATGTCAACTGTTCTCATGGTACAACATCTCAACAGAATTTGAAACGTAATAACTTGATTGCTAAGCTTATGGGGCTTGAAGAAATTCCATCAAGATCAATGCAAATTACTCAGAAGAAAGAGTTTGAATTAAAAAAAGTCTGTGGTTACAAAGCATCTCTCTTTGGCGTCGATGCAACGTTGAATATGCCGAAGTCTAAATCTGTCATCAACAAGGAGGATCATCGAAAAGGAACCTTGAGAGAAATACTTGAAAAAATGCCTGTCAACAGGCTTAGAGAGAGTGATTCTGATATAGAGTTTAAGATTCATTGCTCAAATTCCTACAATAATGGTTCCAAACAGAGGTTGAAAGACGGACTGCCAATTGTATTGATAAAACACAAGCCTCTTCCATCTGAGAAATTTGAGGAACACCGACATGTCTCCTCAAAAGATGATGCTTTTGACCAAAAAACTAGGCTAAGAAGTACGAAAAAGAAAGAACTTTGGTCGGTTGAAGATTTTGATTTCCATGGTGGAATTGTGAGTTCAGATAAATTGCACAGCAAACAAAAGGGAGAAGGGACTCCAGTCAAACAGATTGCAGAGAAGTTGAAAATATCCAATCCAATGCCTGATATGCGCCATGAGAAAGAACCAATTGATAGGAAAGTTCTTACATCAAAGAAACTGACTAAACCAGTGGAAAAAGAATTTCCAAAAGAAAAAGTGGTGTCAAGACCAAAACATCAAGAAAAAGTGACATCCACTAATCCAAGGAAGAACAGAACTCACAAACAACGTAGCTCCATTCAAGATTCTGTGCCAGGACAAGCAGTGAGAGCGATATCTAACAATCGCGACTGTCAGAAAAAGGAGGAACTAGTACTTCCCCATTCAGAAGTTAATTCATTTACTCACATGGTTGAAGTCAAGAAAGATGATGAGATTACTGATACGAATGAAAGTGTTGATCTTCAGATAAATCGAAACACTACTACCCTTATGGCTTTAATTACTATGGAGAACGAAATGGACAAATGTGATACAAAGATTATAGAAGGCTGCCATGAGAACCCAAACTCTCTGTCGCCATTGAGCCCCAAACTCGATATCAATACAAGTACTGTTGAAGAAATTGATCACAATGGTCATACTGAAGCAGATACCAAAAGCTGCAATCAAGGGACCAATCTGAAAGCATTACTTTTGAAAAGTTCATCATTCCTCTGTCATGCAGAGGAGCTTTTTGATCTCCATCTAAATGGTAGAACAATGCCGCAGGCAGCATCTCGCTGCAATGATCCTGAATCCCTGAATAC
GAAACTTTTTGTAGATTGTGCAATAGAACTTGTCGACCGTAAAGGCCATTATAACTTACCAGTAGGCAATTCTTTAGTACTAGGAGATAAGAGCAATACAAAGATAGAGATTTCCATAGAAAAACTTGTTGAAGAAGTGAACGACGACATCGAAACTCTAACGAGTTACCAAACAATATGTGGCAATAATCTTATTGTAGATACTCTATATGCAGTGTTGAGTAGAGACCTATGGTGCAAAGAAGTGATGAATGGAATGTGGGATATTGGTTGGAAGAATGAATTTTCAAGCAGTGAAAGTGAGGAAGTTGTAAATGATATAGAAATGATGATTTTGAGTGGACTGATTGAAGAGTCCTTTACATAA
Protein sequence
MPLDSVKSVVYRSFITCDDPKGVVDCSLMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQGAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRESDSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRLRSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPIDRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITMENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCKEVMNGMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT*
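The protein sequence above should correspond to a codon-by-codon translation of the coding sequence under the standard genetic code. A minimal Python sketch of that check follows; the function name `translate` and the compact codon-table construction are illustrative assumptions, not part of the database, and the two inputs are the first 30 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon.

    Codons containing anything other than A/C/G/T are rendered as 'X',
    and a trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


print(translate("ATGCCTCTAGACAGTGTAAAATCAGTAGTT"))  # start of the CDS
print(translate("TCCTTTACATAA"))                    # end of the CDS, including the stop codon
```

To compare the full sequences, the CDS lines would first need to be joined into a single string, since the listing wraps the sequence across lines.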
Homology
BLAST of CsGy1G016710 vs. NCBI nr
Match:
XP_011659831.1 (uncharacterized protein LOC101223218 isoform X3 [Cucumis sativus])
HSP 1 Score: 1538 bits (3982), Expect = 0.0
Identity = 787/788 (99.87%), Positives = 787/788 (99.87%), Query Frame = 0
Query: 1 MPLDSVKSVVYRSFITCDDPKGVVDCSLMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDL 60
MPLDSVKSVVYRSFITCDDPKGVVDCSLMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDL
Sbjct: 1 MPLDSVKSVVYRSFITCDDPKGVVDCSLMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDL 60
Query: 61 EKEELISKKMRERIHGQSSIPFMEVCQGAEKLNHMVGSWSKGMRSESKTEKIAEDLLEET 120
EKEELISKKMRERIHGQSSIPFMEVCQGAEKLNHMVGSWSKGMRSESKTEKIAEDLLEET
Sbjct: 61 EKEELISKKMRERIHGQSSIPFMEVCQGAEKLNHMVGSWSKGMRSESKTEKIAEDLLEET 120
Query: 121 SSLRDSLIMLAKLQEASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGAD 180
SSLRDSLIMLAKLQEASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGAD
Sbjct: 121 SSLRDSLIMLAKLQEASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGAD 180
Query: 181 EVKKMIGNSPVKRDSVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGT 240
EVKKMIGNSPVKRDSVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGT
Sbjct: 181 EVKKMIGNSPVKRDSVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGT 240
Query: 241 TSQQNLKRNNLIAKLMGLEEIPSRSMQITQKKEFELKKVCGYKASLFGVDATLNMPKSKS 300
TSQQNLKRNNLIAKLMGLEEIPSRSMQITQKKEFELKKVCGYKASLFGVDATLNMPKSKS
Sbjct: 241 TSQQNLKRNNLIAKLMGLEEIPSRSMQITQKKEFELKKVCGYKASLFGVDATLNMPKSKS 300
Query: 301 VINKEDHRKGTLREILEKMPVNRLRESDSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKH 360
VINKEDHRKGTLREILEKMPVNRLRESDSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKH
Sbjct: 301 VINKEDHRKGTLREILEKMPVNRLRESDSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKH 360
Query: 361 KPLPSEKFEEHRHVSSKDDAFDQKTRLRSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGE 420
KPLPSEKFEEHRHVSSKDDAFDQKTRLRSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGE
Sbjct: 361 KPLPSEKFEEHRHVSSKDDAFDQKTRLRSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGE 420
Query: 421 GTPVKQIAEKLKISNPMPDMRHEKEPIDRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKV 480
GTPVKQIAEKLKISNPMPDMRHEKEPIDRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKV
Sbjct: 421 GTPVKQIAEKLKISNPMPDMRHEKEPIDRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKV 480
Query: 481 TSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRDCQKKEELVLPHSEVNSFTHMVEVKKD 540
TSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRDCQKKEELVLPHSEVNSFTHMVEVKKD
Sbjct: 481 TSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRDCQKKEELVLPHSEVNSFTHMVEVKKD 540
Query: 541 DEITDTNESVDLQINRNTTTLMALITMENEMDKCDTKIIEGCHENPNSLSPLSPKLDINT 600
DEITDTNESVDLQINRNTTTLMALITMENEMDKCDTKIIEGCHENPNSLSPLSPKLDINT
Sbjct: 541 DEITDTNESVDLQINRNTTTLMALITMENEMDKCDTKIIEGCHENPNSLSPLSPKLDINT 600
Query: 601 STVEEIDHNGHTEADTKSCNQGTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQAASRCN 660
STVEEIDHNGHTEADTKSCNQGTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQAASRCN
Sbjct: 601 STVEEIDHNGHTEADTKSCNQGTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQAASRCN 660
Query: 661 DPESLNTKLFVDCAIELVDRKGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVNDDIETL 720
DPESLNTKLFVDCAIELVDRKGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVNDDIETL
Sbjct: 661 DPESLNTKLFVDCAIELVDRKGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVNDDIETL 720
Query: 721 TSYQTICGNNLIVDTLYAVLSRDLWCKEVMNGMWDIGWKNEFSSSESEEVVNDIEMMILS 780
TSYQTICGNNLIVDTLYAVLSRDLWCKEVMNGMW IGWKNEFSSSESEEVVNDIEMMILS
Sbjct: 721 TSYQTICGNNLIVDTLYAVLSRDLWCKEVMNGMWAIGWKNEFSSSESEEVVNDIEMMILS 780
Query: 781 GLIEESFT 788
GLIEESFT
Sbjct: 781 GLIEESFT 788
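The percentages in each HSP summary line follow directly from the match counts and the alignment length (for example, 787 of 788 columns). As a hedged illustration, the Python sketch below recomputes them from a summary line; the helper name `hsp_percentages` is hypothetical, and the regular expression only assumes the `Label = matches/length` layout seen in this report.

```python
import re


def hsp_percentages(stats_line: str) -> dict:
    """Recompute percentages from every 'Label = matches/length' pair in an HSP summary line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


# Summary line of the Cucumis melo (TrEMBL) hit reported further down this page.
line = "Identity = 692/817 (84.70%), Positives = 728/817 (89.11%), Query Frame = 0"
print(hsp_percentages(line))
```

Fields without a `matches/length` pair, such as `Query Frame = 0`, are skipped by the pattern.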
BLAST of CsGy1G016710 vs. NCBI nr
Match:
XP_031736143.1 (uncharacterized protein LOC101223218 isoform X4 [Cucumis sativus])
HSP 1 Score: 1482 bits (3836), Expect = 0.0
Identity = 760/761 (99.87%), Positives = 760/761 (99.87%), Query Frame = 0
Query: 28 LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ 87
LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ
Sbjct: 21 LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ 80
Query: 88 GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY 147
GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY
Sbjct: 81 GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY 140
Query: 148 PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC 207
PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC
Sbjct: 141 PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC 200
Query: 208 FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ 267
FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ
Sbjct: 201 FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ 260
Query: 268 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 327
ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES
Sbjct: 261 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 320
Query: 328 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL 387
DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL
Sbjct: 321 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL 380
Query: 388 RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI 447
RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI
Sbjct: 381 RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI 440
Query: 448 DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR 507
DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR
Sbjct: 441 DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR 500
Query: 508 AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM 567
AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM
Sbjct: 501 AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM 560
Query: 568 ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA 627
ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA
Sbjct: 561 ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA 620
Query: 628 LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP 687
LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP
Sbjct: 621 LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP 680
Query: 688 VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK 747
VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK
Sbjct: 681 VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK 740
Query: 748 EVMNGMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 788
EVMNGMW IGWKNEFSSSESEEVVNDIEMMILSGLIEESFT
Sbjct: 741 EVMNGMWAIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 781
BLAST of CsGy1G016710 vs. NCBI nr
Match:
XP_031736141.1 (uncharacterized protein LOC101223218 isoform X1 [Cucumis sativus])
HSP 1 Score: 1482 bits (3836), Expect = 0.0
Identity = 760/761 (99.87%), Positives = 760/761 (99.87%), Query Frame = 0
Query: 28 LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ 87
LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ
Sbjct: 48 LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ 107
Query: 88 GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY 147
GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY
Sbjct: 108 GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY 167
Query: 148 PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC 207
PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC
Sbjct: 168 PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC 227
Query: 208 FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ 267
FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ
Sbjct: 228 FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ 287
Query: 268 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 327
ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES
Sbjct: 288 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 347
Query: 328 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL 387
DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL
Sbjct: 348 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL 407
Query: 388 RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI 447
RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI
Sbjct: 408 RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI 467
Query: 448 DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR 507
DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR
Sbjct: 468 DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR 527
Query: 508 AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM 567
AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM
Sbjct: 528 AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM 587
Query: 568 ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA 627
ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA
Sbjct: 588 ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA 647
Query: 628 LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP 687
LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP
Sbjct: 648 LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP 707
Query: 688 VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK 747
VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK
Sbjct: 708 VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK 767
Query: 748 EVMNGMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 788
EVMNGMW IGWKNEFSSSESEEVVNDIEMMILSGLIEESFT
Sbjct: 768 EVMNGMWAIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 808
BLAST of CsGy1G016710 vs. NCBI nr
Match:
XP_031736142.1 (uncharacterized protein LOC101223218 isoform X2 [Cucumis sativus])
HSP 1 Score: 1475 bits (3819), Expect = 0.0
Identity = 759/761 (99.74%), Positives = 759/761 (99.74%), Query Frame = 0
Query: 28 LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ 87
LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ
Sbjct: 48 LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ 107
Query: 88 GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY 147
GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY
Sbjct: 108 GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY 167
Query: 148 PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC 207
PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC
Sbjct: 168 PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC 227
Query: 208 FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ 267
FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ
Sbjct: 228 FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ 287
Query: 268 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 327
ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES
Sbjct: 288 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 347
Query: 328 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL 387
DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL
Sbjct: 348 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL 407
Query: 388 RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI 447
RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI
Sbjct: 408 RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI 467
Query: 448 DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR 507
DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR
Sbjct: 468 DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR 527
Query: 508 AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM 567
AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM
Sbjct: 528 AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM 587
Query: 568 ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA 627
ENEMDKCDTKII GCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA
Sbjct: 588 ENEMDKCDTKII-GCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA 647
Query: 628 LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP 687
LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP
Sbjct: 648 LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP 707
Query: 688 VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK 747
VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK
Sbjct: 708 VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK 767
Query: 748 EVMNGMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 788
EVMNGMW IGWKNEFSSSESEEVVNDIEMMILSGLIEESFT
Sbjct: 768 EVMNGMWAIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 807
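In the alignment above, one row of the subject contains a gap character (`-`), which counts as an alignment column but not as an identity. The Python sketch below shows one way to tally identical and gapped columns for a single pair of aligned rows; the function name `compare_aligned` is an assumption made for illustration, and the two strings are the query and subject rows starting at query position 568.

```python
def compare_aligned(query: str, subject: str, gap: str = "-") -> dict:
    """Count alignment columns, identical residues, and gapped columns for two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(query, subject))
    identical = sum(q == s and q != gap for q, s in pairs)
    gaps = sum(q == gap or s == gap for q, s in pairs)
    return {"columns": len(pairs), "identical": identical, "gaps": gaps}


query_row = "ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA"
sbjct_row = "ENEMDKCDTKII-GCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA"
print(compare_aligned(query_row, sbjct_row))
```

Summing these counts over all rows of an HSP gives the numerator and denominator of the reported identity fraction.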
BLAST of CsGy1G016710 vs. NCBI nr
Match:
KAE8652964.1 (hypothetical protein Csa_017757 [Cucumis sativus])
HSP 1 Score: 1475 bits (3819), Expect = 0.0
Identity = 759/761 (99.74%), Positives = 759/761 (99.74%), Query Frame = 0
Query: 28 LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ 87
LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ
Sbjct: 21 LMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQ 80
Query: 88 GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY 147
GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY
Sbjct: 81 GAEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTY 140
Query: 148 PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC 207
PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC
Sbjct: 141 PRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSC 200
Query: 208 FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ 267
FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ
Sbjct: 201 FCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAKLMGLEEIPSRSMQ 260
Query: 268 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 327
ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES
Sbjct: 261 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 320
Query: 328 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL 387
DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL
Sbjct: 321 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRL 380
Query: 388 RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI 447
RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI
Sbjct: 381 RSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPI 440
Query: 448 DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR 507
DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR
Sbjct: 441 DRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVR 500
Query: 508 AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM 567
AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM
Sbjct: 501 AISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITM 560
Query: 568 ENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA 627
ENEMDKCDTKII GCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA
Sbjct: 561 ENEMDKCDTKII-GCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKA 620
Query: 628 LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP 687
LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP
Sbjct: 621 LLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLP 680
Query: 688 VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK 747
VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK
Sbjct: 681 VGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCK 740
Query: 748 EVMNGMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 788
EVMNGMW IGWKNEFSSSESEEVVNDIEMMILSGLIEESFT
Sbjct: 741 EVMNGMWAIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 780
BLAST of CsGy1G016710 vs. ExPASy TrEMBL
Match:
A0A1S3CHI1 (uncharacterized protein LOC103500989 OS=Cucumis melo OX=3656 GN=LOC103500989 PE=4 SV=1)
HSP 1 Score: 1311 bits (3392), Expect = 0.0
Identity = 692/817 (84.70%), Positives = 728/817 (89.11%), Query Frame = 0
Query: 1 MPLDSVKSVVYRSFITCDDPKGVVDCSLMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDL 60
MPLDSVKSVVYRSFITCDDPKGVVDC+LMKIS++NS+KLEQKIRAHRTSRNSSK LVSD+
Sbjct: 1 MPLDSVKSVVYRSFITCDDPKGVVDCNLMKISRVNSQKLEQKIRAHRTSRNSSKDLVSDV 60
Query: 61 EKEELISKKMRERIHGQSSIPFMEVCQGAEKLNHMVGSWSKGMRSESKTEKIAEDLLEET 120
EKEELISK+MRERIHGQSSI FMEVCQGAEKLNHMVGSWSKGMRSE KTEKIAEDLLEET
Sbjct: 61 EKEELISKEMRERIHGQSSISFMEVCQGAEKLNHMVGSWSKGMRSERKTEKIAEDLLEET 120
Query: 121 SSLRDSLIMLAKLQEASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGAD 180
SSLRDSLIMLAKLQEASN+S++LK TYP+SFS HLEDECFPVEVQRSKLSTHGSSRTGAD
Sbjct: 121 SSLRDSLIMLAKLQEASNESMQLKMTYPKSFSCHLEDECFPVEVQRSKLSTHGSSRTGAD 180
Query: 181 EVKKMIGNSPVKRDSVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGT 240
EVKKMIGNSPVKRDSVRNVTVGE KSCF DINSN SEISLT SSQSS+IDDNVNC HGT
Sbjct: 181 EVKKMIGNSPVKRDSVRNVTVGERKSCFRDINSNSGSEISLTCSSQSSLIDDNVNCCHGT 240
Query: 241 TSQQ-NLKRNNLIAKLMGLEEIPSRSMQITQKKEFELKKVCGYKASLFGVDATLNMPKSK 300
TSQQ NLKRNNLIAKLMGLEEIPSRSMQIT KKEFE KKV GYK SLFG++ATLNMPKSK
Sbjct: 241 TSQQKNLKRNNLIAKLMGLEEIPSRSMQITPKKEFEFKKVSGYKTSLFGINATLNMPKSK 300
Query: 301 SVINKEDHRKGTLREILEKMPVNRLRESDSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIK 360
SVINKEDH+KGTLREILEKMPVN+LRESDSDIEF IHCSNSYN+GSKQRLKDG PIVLIK
Sbjct: 301 SVINKEDHQKGTLREILEKMPVNKLRESDSDIEFNIHCSNSYNDGSKQRLKDGRPIVLIK 360
Query: 361 HKPLPSEKFEEHR-HVSSKDDAFDQKTRLRSTKKKELWSVEDFDFHGGIVSSDKLHSKQK 420
HKPLP ++FEEHR HVSSK+DAFDQKT+LRSTKKKEL S DFDFHGGI+SSDKLH KQK
Sbjct: 361 HKPLPPDEFEEHRAHVSSKNDAFDQKTKLRSTKKKELQSAGDFDFHGGIMSSDKLHRKQK 420
Query: 421 GEGTPVKQIAE---------------------------KLKISNPMPDMRHEKEPIDRKV 480
G+G+PVKQIAE KLKI +PMPDM HEKEPIDRKV
Sbjct: 421 GQGSPVKQIAEEGRKLKPKKEAKKLKECTVDTKKKTAEKLKIFSPMPDMPHEKEPIDRKV 480
Query: 481 LTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVRAISN 540
L+SKKLTKPVEKEF KEKVVSRP+HQEKVTSTNPRKNRTHKQRSSIQD VP +AVRAISN
Sbjct: 481 LSSKKLTKPVEKEFSKEKVVSRPQHQEKVTSTNPRKNRTHKQRSSIQDPVPERAVRAISN 540
Query: 541 NRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITMENEM 600
NRDCQKK+E VL HSEVNSF INRNTTTLMALITMENEM
Sbjct: 541 NRDCQKKDEPVLSHSEVNSF----------------------INRNTTTLMALITMENEM 600
Query: 601 DKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTEADTKSCNQGTNLKALLLK 660
D+CDTKIIE C+ENPNSL PLSPKLDINTSTVEEID NGHTEA TKSCNQGTNLKALLLK
Sbjct: 601 DECDTKIIECCNENPNSLLPLSPKLDINTSTVEEIDRNGHTEAHTKSCNQGTNLKALLLK 660
Query: 661 SSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLPVGNS 720
SSSFLCHA EL+DLHLNGRTM QAASRCNDPESLNTKLFVDCAIEL+DRKGH+NL VGNS
Sbjct: 661 SSSFLCHAGELYDLHLNGRTMLQAASRCNDPESLNTKLFVDCAIELMDRKGHHNLLVGNS 720
Query: 721 LVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCKEVMN 780
L+LGDKSNTKIEISIEKLVEEVNDDI+TLTSYQTICG+NLIVDTLYAVLSRDLWCKEVMN
Sbjct: 721 LLLGDKSNTKIEISIEKLVEEVNDDIDTLTSYQTICGDNLIVDTLYAVLSRDLWCKEVMN 780
Query: 781 GMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 788
GMWDIGWKN FS SESEEVVNDIEMMILSGLIEESFT
Sbjct: 781 GMWDIGWKNGFSRSESEEVVNDIEMMILSGLIEESFT 795
BLAST of CsGy1G016710 vs. ExPASy TrEMBL
Match:
A0A5A7TU01 (DUF4378 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold114G00510 PE=4 SV=1)
HSP 1 Score: 1275 bits (3299), Expect = 0.0
Identity = 672/789 (85.17%), Positives = 711/789 (90.11%), Query Frame = 0
Query: 29 MKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQG 88
MKIS++NS+KLEQKIRAHRTSRNSSK LVSD+EKEELISK+MRERIHGQSSI FMEVCQG
Sbjct: 1 MKISRVNSQKLEQKIRAHRTSRNSSKDLVSDVEKEELISKEMRERIHGQSSISFMEVCQG 60
Query: 89 AEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTYP 148
AEKLNHMVGSWSKGMRSE KTEKIAEDLLEETSSLRDSLIMLAKLQEASN+S++LK TYP
Sbjct: 61 AEKLNHMVGSWSKGMRSERKTEKIAEDLLEETSSLRDSLIMLAKLQEASNESMQLKMTYP 120
Query: 149 RSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSCF 208
+SFS HLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGE KSCF
Sbjct: 121 KSFSCHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGERKSCF 180
Query: 209 CDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQ-NLKRNNLIAKLMGLEEIPSRSMQ 268
DINSN SEISLT SSQSS+IDDNVNC HGTTSQQ NLKRNNLIAKLMGLEEIPSRSMQ
Sbjct: 181 RDINSNSGSEISLTCSSQSSLIDDNVNCCHGTTSQQKNLKRNNLIAKLMGLEEIPSRSMQ 240
Query: 269 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 328
IT KKEFE KKV GYK SLFG++ATLNMPKSKSVINKEDH+KGTLREILEKMPVN+LRES
Sbjct: 241 ITPKKEFEFKKVSGYKTSLFGINATLNMPKSKSVINKEDHQKGTLREILEKMPVNKLRES 300
Query: 329 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHR-HVSSKDDAFDQKTR 388
DSDIEF IHCSNSYN+GSKQRLKDG PIVLIKHKPLP ++FEEHR HVSSK+DAFDQKT+
Sbjct: 301 DSDIEFNIHCSNSYNDGSKQRLKDGRPIVLIKHKPLPPDEFEEHRAHVSSKNDAFDQKTK 360
Query: 389 LRSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAE----------------- 448
LRSTKKKEL S DFDFHGGI+SSDKLH KQKG+G+PVKQIAE
Sbjct: 361 LRSTKKKELQSAGDFDFHGGIMSSDKLHRKQKGQGSPVKQIAEEGRKLKPKKEAKKLKEC 420
Query: 449 ----------KLKISNPMPDMRHEKEPIDRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEK 508
KLKI +PMPDM HEKEPIDRKVL+SKKLTKPVEKEF KEKVVSRP+HQEK
Sbjct: 421 TVDTKKKTAEKLKIFSPMPDMPHEKEPIDRKVLSSKKLTKPVEKEFSKEKVVSRPQHQEK 480
Query: 509 VTSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRDCQKKEELVLPHSEVNSFTHMVEVKK 568
VTSTNPRKNRTHKQRSSIQD VP +AVRAISNNRDCQKK+E VL HSEVNSF+ V + +
Sbjct: 481 VTSTNPRKNRTHKQRSSIQDPVPERAVRAISNNRDCQKKDEPVLSHSEVNSFSEAVFILQ 540
Query: 569 DDEITDTNESVDLQINRNTTTLMALITMENEMDKCDTKIIEGCHENPNSLSPLSPKLDIN 628
+ TNE VD QINRNTTTLMALITMENEMD+CDTKIIE C+ENPNSL PLSPKLDIN
Sbjct: 541 AHMVCLTNEIVDFQINRNTTTLMALITMENEMDECDTKIIECCNENPNSLLPLSPKLDIN 600
Query: 629 TSTVEEIDHNGHTEADTKSCNQGTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQAASRC 688
TSTVEEID NGHTEA TKSCNQGTNLKALLLKSSSFLCHA EL+DLHLNGRTM QAASRC
Sbjct: 601 TSTVEEIDRNGHTEAHTKSCNQGTNLKALLLKSSSFLCHAGELYDLHLNGRTMLQAASRC 660
Query: 689 NDPESLNTKLFVDCAIELVDRKGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVNDDIET 748
NDPESLNTKLFVDCAIEL+DRKGH+NL VGNSL+LGDKSNTKIEISIEKLVEEVNDDI+T
Sbjct: 661 NDPESLNTKLFVDCAIELMDRKGHHNLLVGNSLLLGDKSNTKIEISIEKLVEEVNDDIDT 720
Query: 749 LTSYQTICGNNLIVDTLYAVLSRDLWCKEVMNGMWDIGWKNEFSSSESEEVVNDIEMMIL 788
LTSYQTICG+NLIVDTLYAVLSRDLWCKEVMNGMWDIGWKN FS SESEEVVNDIEMMIL
Sbjct: 721 LTSYQTICGDNLIVDTLYAVLSRDLWCKEVMNGMWDIGWKNGFSRSESEEVVNDIEMMIL 780
BLAST of CsGy1G016710 vs. ExPASy TrEMBL
Match:
A0A5D3DZ02 (DUF4378 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold403G00440 PE=4 SV=1)
HSP 1 Score: 1270 bits (3287), Expect = 0.0
Identity = 672/790 (85.06%), Positives = 711/790 (90.00%), Query Frame = 0
Query: 29 MKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEKEELISKKMRERIHGQSSIPFMEVCQG 88
MKIS++NS+KLEQKIRAHRTSRNSSK LVSD+EKEELISK+MRERIHGQSSI FMEVCQG
Sbjct: 1 MKISRVNSQKLEQKIRAHRTSRNSSKDLVSDVEKEELISKEMRERIHGQSSISFMEVCQG 60
Query: 89 AEKLNHMVGSWSKGMRSESKTEKIAEDLLEETSSLRDSLIMLAKLQEASNKSIRLKRTYP 148
AEKLNHMVGSWSKGMRSE KTEKIAEDLLEETSSLRDSLIMLAKLQEASN+S++LK TYP
Sbjct: 61 AEKLNHMVGSWSKGMRSERKTEKIAEDLLEETSSLRDSLIMLAKLQEASNESMQLKMTYP 120
Query: 149 RSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKSCF 208
+SFS HLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGE KSCF
Sbjct: 121 KSFSCHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRDSVRNVTVGERKSCF 180
Query: 209 CDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQ-NLKRNNLIAKLMGLEEIPSRSMQ 268
DINSN SEISLT SSQSS+IDDNVNC HGTTSQQ NLKRNNLIAKLMGLEEIPSRSMQ
Sbjct: 181 RDINSNSGSEISLTCSSQSSLIDDNVNCCHGTTSQQKNLKRNNLIAKLMGLEEIPSRSMQ 240
Query: 269 ITQKKEFELKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRES 328
IT KKEFE KKV GYK SLFG++ATLNMPKSKSVINKEDH+KGTLREILEKMPVN+LRES
Sbjct: 241 ITPKKEFEFKKVSGYKTSLFGINATLNMPKSKSVINKEDHQKGTLREILEKMPVNKLRES 300
Query: 329 DSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHR-HVSSKDDAFDQKTR 388
DSDIEF IHCSNSYN+GSKQRLKDG PIVLIKHKPLP ++FEEHR HVSSK+DAFDQKT+
Sbjct: 301 DSDIEFNIHCSNSYNDGSKQRLKDGRPIVLIKHKPLPPDEFEEHRAHVSSKNDAFDQKTK 360
Query: 389 LRSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAE----------------- 448
LRSTKKKEL S DFDFHGGI+SSDKLH KQKG+G+PVKQIAE
Sbjct: 361 LRSTKKKELQSAGDFDFHGGIMSSDKLHRKQKGQGSPVKQIAEEGRKLKPKKEAKKLKEC 420
Query: 449 ----------KLKISNPMPDMRHEKEPIDRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEK 508
KLKI +PMPDM HEKEPIDRKVL+SKKLTKPVEKEF KEKVVSRP+HQEK
Sbjct: 421 TVDTKKKTAEKLKIFSPMPDMPHEKEPIDRKVLSSKKLTKPVEKEFSKEKVVSRPQHQEK 480
Query: 509 VTSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRDCQKKEELVLPHSEVNSF-THMVEVK 568
VTSTNPRKNRTHKQRSSIQD VP +AVRAISNNRDCQKK+E VL HSEVNSF + V +
Sbjct: 481 VTSTNPRKNRTHKQRSSIQDPVPERAVRAISNNRDCQKKDEPVLSHSEVNSFESEAVFIL 540
Query: 569 KDDEITDTNESVDLQINRNTTTLMALITMENEMDKCDTKIIEGCHENPNSLSPLSPKLDI 628
+ + TNE VD QINRNTTTLMALITMENEMD+CDTKIIE C+ENPNSL PLSPKLDI
Sbjct: 541 QAHMVCLTNEIVDFQINRNTTTLMALITMENEMDECDTKIIECCNENPNSLLPLSPKLDI 600
Query: 629 NTSTVEEIDHNGHTEADTKSCNQGTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQAASR 688
NTSTVEEID NGHTEA TKSCNQGTNLKALLLKSSSFLCHA EL+DLHLNGRTM QAASR
Sbjct: 601 NTSTVEEIDRNGHTEAHTKSCNQGTNLKALLLKSSSFLCHAGELYDLHLNGRTMLQAASR 660
Query: 689 CNDPESLNTKLFVDCAIELVDRKGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVNDDIE 748
CNDPESLNTKLFVDCAIEL+DRKGH+NL VGNSL+LGDKSNTKIEISIEKLVEEVNDDI+
Sbjct: 661 CNDPESLNTKLFVDCAIELMDRKGHHNLLVGNSLLLGDKSNTKIEISIEKLVEEVNDDID 720
Query: 749 TLTSYQTICGNNLIVDTLYAVLSRDLWCKEVMNGMWDIGWKNEFSSSESEEVVNDIEMMI 788
TLTSYQTICG+NLIVDTLYAVLSRDLWCKEVMNGMWDIGWKN FS SESEEVVNDIEMMI
Sbjct: 721 TLTSYQTICGDNLIVDTLYAVLSRDLWCKEVMNGMWDIGWKNGFSRSESEEVVNDIEMMI 780
BLAST of CsGy1G016710 vs. ExPASy TrEMBL
Match:
A0A6J1JAT7 (uncharacterized protein LOC111485116 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485116 PE=4 SV=1)
HSP 1 Score: 933 bits (2412), Expect = 0.0
Identity = 527/828 (63.65%), Positives = 610/828 (73.67%), Query Frame = 0
Query: 1 MPLDSVKSVVYRSFITCDDPKGVVDCSLMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDL 60
MPLD VKSVVYRSFITCDDPKGVVDC++ KISK+ S+ LE K+R R SRN +K LVS +
Sbjct: 1 MPLDGVKSVVYRSFITCDDPKGVVDCNIFKISKVKSKNLEPKVRGRRRSRNPNKVLVSQV 60
Query: 61 EKEELISKKMRERIHGQSSIPFMEVCQGAEKLNHMVGSWSKGMRSESKTEKIAEDLLEET 120
EKEELIS + GQSS+PFMEVCQGAEKLNHMVGSWS GMRS+ KTE+IAE+LLE T
Sbjct: 61 EKEELISNEK----DGQSSLPFMEVCQGAEKLNHMVGSWSNGMRSDRKTEEIAEELLEGT 120
Query: 121 SSLRDSLIMLAKLQEASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGAD 180
SSLR+SLIMLAKLQE SN+S++LK TY +SFS HLEDE FPVEVQRSKLS HGSSR G D
Sbjct: 121 SSLRESLIMLAKLQETSNESVQLKTTYRKSFSCHLEDESFPVEVQRSKLSIHGSSRNGPD 180
Query: 181 EVKKMIGNSPVKRDSVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGT 240
EVKK+I ++ V+RD+ RNV VGE +SCF DIN + SEI TSSS+SS+I DNVNC H +
Sbjct: 181 EVKKVIRDNLVRRDTKRNVAVGE-ESCFHDINFDSGSEIPSTSSSKSSLISDNVNCCHVS 240
Query: 241 TS-QQNLKRNNLIAKLMGLEEIPSRSMQITQKKEFELKKVCGYKASLFGVDATLNMPKSK 300
TS Q+NLKRNNLIAKLMGLEEI SRS+Q K
Sbjct: 241 TSGQKNLKRNNLIAKLMGLEEISSRSLQTNPKA--------------------------- 300
Query: 301 SVINKEDHRKGTLREILEKMPVNRLRESDSDIEFKIHCSNSYNN-GSKQRLKDGLPIVLI 360
GT R+ILEKMP NRL ESD D EFK+ S+SYN+ GSKQRL++ LPIVLI
Sbjct: 301 ----------GTFRDILEKMPFNRLIESDPDKEFKLPDSHSYNHYGSKQRLENVLPIVLI 360
Query: 361 KHKPLPSEKFEEHR-HVSSKDDAFDQKTRLRSTKKKELWSVEDFDFHGGIVSSDKLHSKQ 420
KHKPLP F+EHR HVSS D F+Q+ +RS KKKELW + FD H G +SSDKL +Q
Sbjct: 361 KHKPLPPNVFKEHRAHVSSNKDVFNQQATIRSMKKKELWGFDRFDIHRGSLSSDKLCRRQ 420
Query: 421 KGEGT----------------PVKQIAEKLKISNPMPDMRHEKEPIDRKVLTSKKLTKPV 480
+ EG K+ AEKLK +PM DM HEKEPI +K+LTSKKLT
Sbjct: 421 EAEGKMPKHKEVKKLRKGTVDAKKKAAEKLKRCSPMSDMPHEKEPIHKKILTSKKLTT-- 480
Query: 481 EKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRDCQKKEEL 540
KEKV+SRP+H+EKV+STNPRKNRTHKQRSSI DS PG+AV+ ISN+RDCQKKE
Sbjct: 481 -----KEKVMSRPQHEEKVSSTNPRKNRTHKQRSSIPDSTPGRAVKPISNDRDCQKKEVA 540
Query: 541 VLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITMENEMDKCDTKIIEG 600
V SEVNSFTHMVE KKD + TDTNES +L IN+++ TLMAL ++E+E+DKCDTKIIE
Sbjct: 541 VPARSEVNSFTHMVEAKKDHDNTDTNESANLPINQHSATLMALTSIESEIDKCDTKIIEC 600
Query: 601 CHENPNSLSPLSPKLDINTSTVE--------------------EIDHNGHTEADTKSCNQ 660
C E+P+S S LSPKL+INTS VE +ID N HTE D +SC+Q
Sbjct: 601 CKESPSSHSLLSPKLEINTSIVEAIDPNSHTETDIEINTSIIEDIDPNSHTETDIESCDQ 660
Query: 661 GTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESL-NTKLFVDCAIELVDR 720
G NLKALLL+SSSFL H ELFDL+LNGRTM QAASRCNDPE NTK F+DCAIE++ R
Sbjct: 661 GINLKALLLRSSSFLFHVGELFDLNLNGRTMVQAASRCNDPEEAPNTKPFIDCAIEILKR 720
Query: 721 KGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVL 780
KGH L V NSL+LGD TK EIS+EKLV+EV+DDI+TLTSYQTI G N++VDT+YAVL
Sbjct: 721 KGHDELQVANSLLLGDGCKTKTEISVEKLVKEVSDDIDTLTSYQTIQGGNVVVDTVYAVL 779
Query: 781 SRDLWCKEVMNGMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEESFT 788
SRDLWCKEVMNGMW GWKN S SE EEVVNDIE +ILSGLIEESFT
Sbjct: 781 SRDLWCKEVMNGMWGFGWKNGSSRSEREEVVNDIEKLILSGLIEESFT 779
BLAST of CsGy1G016710 vs. ExPASy TrEMBL
Match:
A0A6J1CUY5 (uncharacterized protein LOC111014584 OS=Momordica charantia OX=3673 GN=LOC111014584 PE=4 SV=1)
HSP 1 Score: 929 bits (2401), Expect = 0.0
Identity = 533/817 (65.24%), Positives = 627/817 (76.74%), Query Frame = 0
Query: 1 MPLDSVKSVVYRSFITCDDPKGVVDCSLMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDL 60
MPLD VKSVVYRSFITCDDPKGVVDCS+++ SK+NS+++EQKI+ HRTSRN +K LVS +
Sbjct: 1 MPLDGVKSVVYRSFITCDDPKGVVDCSIIRKSKVNSQRMEQKIKTHRTSRNPNKALVSKV 60
Query: 61 EKEELISKKMRERIHGQSSIPFMEVCQGAEKLNHMVGSWSKGMRSESKTEKIAEDLLEET 120
EKEE I+K +R H QS +P +EV +G EKLN M SWSKG+RS+ K+E IAEDLLE T
Sbjct: 61 EKEEPITKGKSKRFHSQSPLPLVEVYRGNEKLNQMFDSWSKGVRSDRKSEDIAEDLLEGT 120
Query: 121 SSLRDSLIMLAKLQEASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGAD 180
SSL++SLIMLAKLQEASN+S++LK Y RS S HLE++ FPVEVQRSKLS +GSS GAD
Sbjct: 121 SSLKESLIMLAKLQEASNQSVQLKMKYQRSVSCHLEEQSFPVEVQRSKLSRYGSSTDGAD 180
Query: 181 EVKKMIGNSPVKRDSVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGT 240
E+KK+I +S V+RD + TVGE KSCF DINS+ EI+ TSSSQSSM +DNV+C H +
Sbjct: 181 EIKKVIKDSLVRRDVACDATVGE-KSCFRDINSDSRLEITSTSSSQSSMANDNVDCCHVS 240
Query: 241 TS-QQNLKRNNLIAKLMGLEEIPSRSMQITQKKEFELKKVCGYKASLFGVDATLNMPKSK 300
TS Q+NLK +NLIAKLMGLEEI SR Q T KKEFE K+ GY+ SLF +D TLN PKSK
Sbjct: 241 TSVQRNLKGSNLIAKLMGLEEISSRPEQTTLKKEFEFTKISGYRRSLFRID-TLNAPKSK 300
Query: 301 SVINKEDHRKGTLREILEKMPVNRLRESDSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIK 360
SV++K+D KGTLREILE MP NRL ESDSDIEFK+H NNGSKQRLKD PIVLIK
Sbjct: 301 SVVDKKDSEKGTLREILETMPFNRLTESDSDIEFKLH-----NNGSKQRLKDVPPIVLIK 360
Query: 361 HKPLPSEKFEEHR-HVSSKDDAFDQKTRLRSTKKKEL-WSVEDFDFHGGIVSSDKLHSKQ 420
PLPS + EEHR VS K++AF+QK LR KKKEL WS +D D HGGI+SSDK H KQ
Sbjct: 361 PMPLPSNELEEHRARVSLKEEAFNQKAILRKMKKKELCWSFDDSDLHGGILSSDKFHRKQ 420
Query: 421 KGEGTPVKQIAEKLKISNPMPDM-----------RHEKEPI-------DRKVLTSKKLT- 480
E P+KQIA++ +I ++ + + E + D+KVLTSKK+T
Sbjct: 421 AAERIPLKQIAQEERIPKRKEEVWKLRKGDVDTNKKDAEKLKPSSIMHDKKVLTSKKVTA 480
Query: 481 ---KPVEKEF-PKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRD 540
KPV+KEF KEKVVSR +HQEKVTSTNPRKNRTHK+ SSI DSV G+AVR S + D
Sbjct: 481 ATRKPVKKEFVAKEKVVSRSQHQEKVTSTNPRKNRTHKKCSSISDSVSGRAVRTTSIDCD 540
Query: 541 CQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDL--QINRNTTTLMALITMENEMD 600
C+KKE+ VL SE S T +VE K+DD TDTNE+V+L NRNT+TLMALITME E D
Sbjct: 541 CRKKEKPVLARSEAKSLTRIVEAKEDDRSTDTNENVELPKSKNRNTSTLMALITMEEETD 600
Query: 601 KCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEID-H-NGHTEADTKSCNQGTNLKALLL 660
+CDTKIIE C E+PNSLSPLSPKL+I+TST E ID H N TE DTKSCNQGTNLKAL L
Sbjct: 601 ECDTKIIECCKESPNSLSPLSPKLEIDTSTEEVIDLHLNTRTETDTKSCNQGTNLKALFL 660
Query: 661 KSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLNTKLFVDCAIELVDRKGHYNLPVGN 720
+SSSFL AEELFDL LNGRTM S CNDP++ N K +DCAIEL+ RK H ++ V N
Sbjct: 661 RSSSFLSQAEELFDLKLNGRTMLHT-SCCNDPKTPNAKHLIDCAIELMKRKCHPDIQVCN 720
Query: 721 SLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTICGNNLIVDTLYAVLSRDLWCKEVM 780
SL LG +SNTKIEIS+EKLVEEV DDI+TLTSYQTI G +DTL+AVL RD+WCKEV
Sbjct: 721 SLFLGYRSNTKIEISVEKLVEEVCDDIDTLTSYQTIRG----IDTLHAVLERDVWCKEVS 780
Query: 781 NGMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEESF 787
NGMWD+GWKN FS SESEEVVNDIE +IL+GLIEESF
Sbjct: 781 NGMWDLGWKNGFSRSESEEVVNDIEKLILNGLIEESF 805
BLAST of CsGy1G016710 vs. TAIR 10
Match:
AT3G24630.1 (unknown protein; Has 5348 Blast hits to 3182 proteins in 353 species: Archae - 0; Bacteria - 481; Metazoa - 1959; Fungi - 405; Plants - 180; Viruses - 10; Other Eukaryotes - 2313 (source: NCBI BLink). )
HSP 1 Score: 151.0 bits (380), Expect = 4.0e-36
Identity = 213/827 (25.76%), Positives = 369/827 (44.62%), Query Frame = 0
Query: 1 MPLDSVKSVVYRSFITCDDPKGVVDCSLMKISKMNSRKLEQKIRAHRTSRNSSKGLVSDL 60
MP ++S VYRSFI CDDP+ VV+C + K + K R+ T + + L
Sbjct: 1 MPEGKLRSGVYRSFIMCDDPRDVVECGAI--------KKQSKSRSSSTKQRCEEHLSKVK 60
Query: 61 EKEELI---SKKMRERIHGQSSIPFMEVCQGAEKLNHMVGSWSKGMRSE--SKTEKIAED 120
E+ E+ K SS+ + V +G +KLN + S SKG E S+ E IA+D
Sbjct: 61 ERSEMAVAPRKSSSTEDVPPSSLQLLRVSKGIQKLNVAIESLSKGFSFEAVSRPEDIAKD 120
Query: 121 LLEETSSLRDSLIMLAKLQEASNKSIRLKRTYPRS---FSSHLEDECFPVEVQRSKLSTH 180
LL L +SL ML+ +QE +K + R RS F + D +R + +
Sbjct: 121 LLRGALDLEESLAMLSSIQEDDSKRKPMIRNDGRSDLRFQRSMSDRFGERIEKRMMVQEN 180
Query: 181 GSSRTGADEVKKMIGNSPVKRDSVRNVTVGEHKS--CFCDINSNLDSEISLTSSSQSSMI 240
+S+ +E++K+I S ++++ V T E K D S+ + S TSSSQSSM+
Sbjct: 181 VASKDCYEELRKVIRESFLRQNLVSQTTTIETKKRVVRSDFASSSGAVSSSTSSSQSSMV 240
Query: 241 DDNVNCSHGTTSQQNLKRNNLIAKLMGLE---EIPSRS----------MQITQKKEFELK 300
+ S + Q + +LIA+LMGL+ + P +S ++++ +++ ++K
Sbjct: 241 SGSTKSSASSDVPQ--RAPSLIARLMGLDVSTQEPRKSSVNHIDKPDILKLSSERQEKVK 300
Query: 301 KVCGYKASLFGVDATLNMPKSKSVINKEDHRKGTLREILEKMPVNRLRESDSDIE-FKIH 360
K N +S ++ R+ L+ + E+ P E+ S I +
Sbjct: 301 K---------------NSKESPEIVRCNSTREAALQSLPEETP----SENPSTIVLIRPM 360
Query: 361 CSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEHRHVSSKDDAFDQKTRLRSTKKKELW 420
GSKQ + P + + P + ++H+ S K LR TKK +
Sbjct: 361 RVVKPEPGSKQPVVPKKPRMQGEVHPRMINQRKDHQANGSN----KMKLPLRMTKKDK-- 420
Query: 421 SVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKLKISNPMPDMRHEKEPIDRKVLTSKK 480
+ + ++ EG + KL + + +++P++ T+KK
Sbjct: 421 -----------EPKEMVPKVEENEGKVI-----KLMSPSNAKVLTRDRKPLE----TNKK 480
Query: 481 LTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTHKQRSSIQDSVPGQAVRAISNNRDCQ 540
L V+K+ E + +H+ +NP + S + + ++ R S++
Sbjct: 481 LV--VKKDDIAE---GKDRHRALKPPSNPVSQKISNNSSDVSRNKSRRSSRLSSSSSSGS 540
Query: 541 KKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVDLQINRNTTTLMALITMENEMDKCDT 600
++ KK E + N L+ N + +E + C +
Sbjct: 541 RE-------------------KKSGEASRPNAKKKLRQQDN--------DLGSENNSCSS 600
Query: 601 KIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGHTE-ADTKSC-----------NQGTN 660
+ E SL+ LS + +T E + GH + + SC +
Sbjct: 601 Q------ETHGSLNQLSTE----ETTSSEFHNQGHCDNGEVSSCAATIHHSHEPETSQIS 660
Query: 661 LKALLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESL-NTKLFVDCAIELVDRKGH 720
LK+ L SS F+ +AE+LFD + N ++ R D + + +L +D A E+V RK
Sbjct: 661 LKSFLSTSSDFISYAEDLFDFNTNTERSRESNFRRRDSIVISDQRLALDFAKEVVRRK-- 720
Query: 721 YNLPVGNSLVLGDKS-NTKIEISIEKLVEEVNDDIETLTSYQ-TICG-NNLIVDTLYAVL 780
SL+L + + +T+ + I++L+ EV D E+LTSY+ T G N+ + ++++ VL
Sbjct: 721 -------SLLLAEPTCHTRSSLDIDELLTEVCDGFESLTSYKDTFSGQNSFVKESIHLVL 721
Query: 781 SRDLWCK--EVMNGMWDIGWKNEFSSSESEEVVNDIEMMILSGLIEE 786
+DL K E+ +G+WD+GW++EF E+ E V D+E +ILSGLI+E
Sbjct: 781 EKDLKGKKTEMTSGVWDLGWRSEFQIDETYEAVADLEKLILSGLIQE 721
BLAST of CsGy1G016710 vs. TAIR 10
Match:
AT5G42710.2 (unknown protein; INVOLVED IN: biological_process unknown. )
HSP 1 Score: 71.2 bits (173), Expect = 4.1e-12
Identity = 172/793 (21.69%), Positives = 318/793 (40.10%), Query Frame = 0
Query: 29 MKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEK-------EELISKKMRERIHGQSSIP 88
+ +SK + LE +A R + S ++S L + E S+ ++ SS P
Sbjct: 85 LDLSKALAFALENAGKATRVDPSGSASIISFLHEVGRRSLGETRSSQVFVQQQQPSSSSP 144
Query: 89 FM-----EVCQGAEKLNHMVGSWSKGM--RSESKTEKIAEDLLEETSSLRDSLIMLAKLQ 148
+ E+ +GA+KLN ++ + S G+ R + + E L+E L SL +L +Q
Sbjct: 145 MIHVHIKEISKGAQKLNQIINACSNGLSFRKGRYSIQCGEQLMEGAIELEQSLRLLVDIQ 204
Query: 149 EASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRD 208
+AS ++SH +R K G D+ ++ N ++
Sbjct: 205 QAS------------EYTSH----------KRRKNRIKLLEENGDDDEEEDAHNQNYQK- 264
Query: 209 SVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAK 268
++ V + + +N D + Q+S +D +T Q + +++AK
Sbjct: 265 -IKQVAKADIEMRLLALNYQEDK--NNKHRKQTSYCEDT---EQRSTKPQKGRIPSVVAK 324
Query: 269 LMGLEEIPSRSMQITQKKEFE---LKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGT 328
LMGL E P + K + E ++V +L + VI+KE T
Sbjct: 325 LMGLGEFPQDEKETNIKHDGENLTRRRVMEASENLVELKTQRKSTSLDLVIHKETQ---T 384
Query: 329 LREILEKMPVNRLRESDSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEH 388
EI K + D + + SY + K+ +IK P P+E +H
Sbjct: 385 ANEINYKAKSQQKDREKDDSKSRKRSKASYKKDGETTTKN-----VIKRNPTPTE--NKH 444
Query: 389 RHVSSKDDAFDQKTRLRSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKL 448
+ V+ QK + + KKE E +G + HS++ P+ ++
Sbjct: 445 KVVARS----QQKPLHKLSNKKEKLQRERHRENGVTTN----HSQK-----PLSSEDLQM 504
Query: 449 KISNPMPDMRHEKEPIDRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTH 508
K+ + ++ + + + + K E E K K+ + K+Q+ S
Sbjct: 505 KVR-----LINKAKAVKKSFSHVEVAQKGKEGEVLKAKICEK-KNQDIYISNEALCKVMK 564
Query: 509 KQRSSIQDSVPGQAVRAISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVD 568
+ +D +++ +++ + + + + + S+V+ H E+K D + E V
Sbjct: 565 RPEIKKEDGKHDLLLKSYNDSNEKKAEVDTCIKSSQVSGVEHKKEIKDDSILLIAAERVP 624
Query: 569 LQINRNTTTLMALITMENEMDKCDTKIIEGCHENPNSLSPLSPKLDINTSTVEEIDHNGH 628
Q EN + T + +P+ PK D N+ + + + G
Sbjct: 625 CQ-----------APSENHHGRMFT-------NGMDQQAPI-PKSDGNSDILSKTVYKGE 684
Query: 629 TEA-----------------DTKSCNQGTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQ 688
EA +T S N+ NLK + +KS FL A+ F L++
Sbjct: 685 IEAGLPLLEKRQERRKRETTETLSENE-INLKKIFVKSQLFLDTAKAHFKLNIPQNVFHD 744
Query: 689 AASRCNDPESLNTKLFVDCAIELVDRKGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVN 748
S + + L ++CA EL+ RK + + V S++KI S++ L+ +++
Sbjct: 745 TTSGSYYYQE-DKNLTLECAFELMKRKRRFQELSVHPFVKVPISSSKIN-SLDHLIRQIS 795
Query: 749 DDIETLTSYQTICGNNLIVDTLYAVLSRDLWCKE-VMNGMWDIGWKNE-FSSSESEEVVN 786
++E L +Y C V+ VL RD+ K+ +N MWD+GW + + E ++V+
Sbjct: 805 KELEKLRAYGRDCHIGSHVEDY--VLERDVHYKDPYLNSMWDMGWNDSMLAFIEKDDVMR 795
BLAST of CsGy1G016710 vs. TAIR 10
Match:
AT5G42710.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 70.1 bits (170), Expect = 9.1e-12
Identity = 169/781 (21.64%), Positives = 312/781 (39.95%), Query Frame = 0
Query: 29 MKISKMNSRKLEQKIRAHRTSRNSSKGLVSDLEK-------EELISKKMRERIHGQSSIP 88
+ +SK + LE +A R + S ++S L + E S+ ++ SS P
Sbjct: 85 LDLSKALAFALENAGKATRVDPSGSASIISFLHEVGRRSLGETRSSQVFVQQQQPSSSSP 144
Query: 89 FM-----EVCQGAEKLNHMVGSWSKGM--RSESKTEKIAEDLLEETSSLRDSLIMLAKLQ 148
+ E+ +GA+KLN ++ + S G+ R + + E L+E L SL +L +Q
Sbjct: 145 MIHVHIKEISKGAQKLNQIINACSNGLSFRKGRYSIQCGEQLMEGAIELEQSLRLLVDIQ 204
Query: 149 EASNKSIRLKRTYPRSFSSHLEDECFPVEVQRSKLSTHGSSRTGADEVKKMIGNSPVKRD 208
+AS ++SH +R K G D+ ++ N ++
Sbjct: 205 QAS------------EYTSH----------KRRKNRIKLLEENGDDDEEEDAHNQNYQK- 264
Query: 209 SVRNVTVGEHKSCFCDINSNLDSEISLTSSSQSSMIDDNVNCSHGTTSQQNLKRNNLIAK 268
++ V + + +N D + Q+S +D +T Q + +++AK
Sbjct: 265 -IKQVAKADIEMRLLALNYQEDK--NNKHRKQTSYCEDT---EQRSTKPQKGRIPSVVAK 324
Query: 269 LMGLEEIPSRSMQITQKKEFE---LKKVCGYKASLFGVDATLNMPKSKSVINKEDHRKGT 328
LMGL E P + K + E ++V +L + VI+KE T
Sbjct: 325 LMGLGEFPQDEKETNIKHDGENLTRRRVMEASENLVELKTQRKSTSLDLVIHKETQ---T 384
Query: 329 LREILEKMPVNRLRESDSDIEFKIHCSNSYNNGSKQRLKDGLPIVLIKHKPLPSEKFEEH 388
EI K + D + + SY + K+ +IK P P+E +H
Sbjct: 385 ANEINYKAKSQQKDREKDDSKSRKRSKASYKKDGETTTKN-----VIKRNPTPTE--NKH 444
Query: 389 RHVSSKDDAFDQKTRLRSTKKKELWSVEDFDFHGGIVSSDKLHSKQKGEGTPVKQIAEKL 448
+ V+ QK + + KKE E +G + HS++ P+ ++
Sbjct: 445 KVVARS----QQKPLHKLSNKKEKLQRERHRENGVTTN----HSQK-----PLSSEDLQM 504
Query: 449 KISNPMPDMRHEKEPIDRKVLTSKKLTKPVEKEFPKEKVVSRPKHQEKVTSTNPRKNRTH 508
K+ + ++ + + + + K E E K K+ + K+Q+ S
Sbjct: 505 KVR-----LINKAKAVKKSFSHVEVAQKGKEGEVLKAKICEK-KNQDIYISNEALCKVMK 564
Query: 509 KQRSSIQDSVPGQAVRAISNNRDCQKKEELVLPHSEVNSFTHMVEVKKDDEITDTNESVD 568
+ +D +++ +++ + + + + + S+V+ H E+K D + E V
Sbjct: 565 RPEIKKEDGKHDLLLKSYNDSNEKKAEVDTCIKSSQVSGVEHKKEIKDDSILLIAAERVP 624
Query: 569 LQINRNTTTLMALITMENEMDKCDTKIIEGCHENPNSLSPLSPK-----LDINTSTVEEI 628
Q + T N MD+ I N + LS K ++ +E+
Sbjct: 625 CQAPSENQHHGRMFT--NGMDQ--QAPIPKSDGNSDILSKTVYKETKGEIEAGLPLLEKR 684
Query: 629 DHNGHTEADTKSCNQGTNLKALLLKSSSFLCHAEELFDLHLNGRTMPQAASRCNDPESLN 688
E NLK + +KS FL A+ F L++ S + +
Sbjct: 685 QERRKRETTETLSENEINLKKIFVKSQLFLDTAKAHFKLNIPQNVFHDTTSGSYYYQE-D 744
Query: 689 TKLFVDCAIELVDRKGHYNLPVGNSLVLGDKSNTKIEISIEKLVEEVNDDIETLTSYQTI 748
L ++CA EL+ RK + + V S++KI S++ L+ +++ ++E L +Y
Sbjct: 745 KNLTLECAFELMKRKRRFQELSVHPFVKVPISSSKIN-SLDHLIRQISKELEKLRAYGRD 799
Query: 749 CGNNLIVDTLYAVLSRDLWCKE-VMNGMWDIGWKNE-FSSSESEEVVNDIEMMILSGLIE 786
C V+ VL RD+ K+ +N MWD+GW + + E ++V+ DIE + SGL+E
Sbjct: 805 CHIGSHVEDY--VLERDVHYKDPYLNSMWDMGWNDSMLAFIEKDDVMRDIEREVFSGLLE 799
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A1S3CHI1 | 0.0 | 84.70 | uncharacterized protein LOC103500989 OS=Cucumis melo OX=3656 GN=LOC103500989 PE=...
A0A5A7TU01 | 0.0 | 85.17 | DUF4378 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A5D3DZ02 | 0.0 | 85.06 | DUF4378 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A6J1JAT7 | 0.0 | 63.65 | uncharacterized protein LOC111485116 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1CUY5 | 0.0 | 65.24 | uncharacterized protein LOC111014584 OS=Momordica charantia OX=3673 GN=LOC111014...
Match Name | E-value | Identity (%) | Description
AT3G24630.1 | 4.0e-36 | 25.76 | unknown protein; Has 5348 Blast hits to 3182 proteins in 353 species: Archae - 0...
AT5G42710.2 | 4.1e-12 | 21.69 | unknown protein; INVOLVED IN: biological_process unknown.
AT5G42710.1 | 9.1e-12 | 21.64 | unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0...
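The percentages reported on each HSP summary line are the identical and positive residue counts divided by the alignment length (for example, 692/817 gives 84.70%). The following is a minimal sketch of how such a line could be parsed and the percentages recomputed. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and are not part of any BLAST or database tooling; the pattern also tolerates the misspelling "Postives" that some result pages emit.

```python
import re

# Matches an HSP summary line such as:
#   "Identity = 692/817 (84.70%), Positives = 728/817 (89.11%), Query Frame = 0"
# "Posi?tives" also accepts the misspelled form "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/\d+\s*\([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    identical, length, positives = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed from the counts rather than trusted from the text.
        "pct_identity": round(100.0 * identical / length, 2),
        "pct_positives": round(100.0 * positives / length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 692/817 (84.70%), Positives = 728/817 (89.11%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Recomputing the percentages from the raw counts gives a quick consistency check against the values shown in the summary table.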