Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, polypeptide, CDS, three_prime_UTR
CTATCTAAAATTGGAATATTTGTGCGTAAATCGGAAAGATTACAATTTGAGAGAATCATTGCAATTTACCGCAAACTACAAAGATTCGCTTGCTTTCGCTTCCAATGATCCTTAGTTCTGTGCGTGCGTGTCTATCATTAAGCACTGCACCCCCACCGCCGTCGCCGATAAGTCCGCATTACTTTTGCAACCGTTAAAGCAAAGCCTTCGTTTTCCGGTTTCGTTTTCAAAAATAGCCGATTTCATTCTCGTCGGCGACGTTCACCGACGCACCGGTAAGTTCCTCCTCTTTCTGTTCAGCACTTCACTTTTTGATTAAAGTTCCATAAGCTTCTCAAGATTTTGCAGGGTTTAGGGTTTTGGATTTTGGTCCTTTTTGCTTCTAAAGGCTTCTCTTATTCTTGCTACTTTTCTTATGCTATTTGATTAGGAGCTCTATAATGATTGGGAAGAAAGAGAATTGAAGCACAGGACAATTAGTATGCAGGGTTTAGGGTTTTGGCTGACCTCCAATTAATGGATTTTGGTCCTTTTTGCTTGTAAAGGCTTCTATTATTCTTGCTACTTTTCTTATGCTATTTGATTACGAGCTCTATAATGATGGGGAAGAAAGAGAATCGAAGGACAGGGACGATTAGTATGGAGGATTGTTCCCCTCTATTGGAAAGGTATTAGGCCTTTCTATTCCTGCTTCCATGTTTTTGAATTTGATGAGTTTTTCCCCATTTTTTGTCTGTGGCTAACCGTTACTGCTGGGTTTTTGGTGCTGAGTTTTAGTTCTTCATGTATTTAGATGCTATTTAATGAGGGGTTTTTGGTTGTGGGACAGATATTCAGTGAGGACGATACTTACTTTGCTTCGGGAGGTGGCCCAGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGGAGAACACGTCCACTGGGATTTCTGATGCTCGGGAGTATCAGTTGTTATGGCGTCACTTGGCTTATCGTCAAACTTTACTGGAGGACATGCATTCTGTTACTGATTCACTGGTCAGTTTAGCAGTTTTCATCAAGCTTCTATCTATTGATGACATGACATAATACTTTTACCTTATTCTTAAGCACTAAAATTTGTGAGCTTAAGGGAGTAAAGTATGATAAACAAGGGGTTGAGAGGAATAGTAAGATCTCTCGGGCATAAGGAAGTATGAGAAATAAATCTAAACTAGAAGTGTTAGAGATTTTGAAATGAATATGATTAAGTTTCTTTTTTTCCTGACATTCATTCAAAACGAAGGAAAAGATGATTTTACTAATACAACCTTTTGTATCATAAATGTGGGTGTGAAAATATTTAATGGACGGGGAAATTCTGCGAATTCGAACTATTATAGTTTGTATTTTTCCTTCAGATCCGAAGATAAATAGACCAGAATTATTTATTTGTTTTCACAGCTAGATCATACATTGACTGTGTTTCTTTATGATTTGTACTATCCTTGACTTCAATGTAGGTTTTCCTATTTCATTAATGCGACGTGGTTGTTTTTCTTTACTCTGTTTTTTTGTTCACTGATATTGAGACTTCTCATTGTGGGACTATTGAGTTTATTTCTCTATATTTGTTTTATGATGGTATAAAAAATGTGTTTATATGGACTTTTGACATTTGATAGAGTGTTTGCCACTGTCCACAAGTAACTTAATTAACTTACTAGTTGAGTTTGAGGGCTTACAATTGATTGAATGAGTCTAAATTTGCTTAGGGGAGATACTTATGATTTGAACTTTTGCATTCCATTACTGATATTCTTATTTAGGAGTTTGATTTGCAGGATTATGATAGTGATTTGGATTTTGAGGTAGAGCCTTTTCCATCTGTCAGCAGTGAGTCCTCTAATGAAGCTTCAGCATGTGTGAAGGTGAGTCCATGTGTTCATATAACAGTTTTTTCCTTTAGTTATGTGGTTTTTCCTGTATTACGTGTTCTTTTTGTAGTTCTAGGTTAAACTTCACATTTTGTTTCAA
ATCGATGTCTTGACAGGTGCTTATTGCTAATAGTATACCGAATGAGTCAGATGTTCCAAATAGTTCTGCGGTTGAGGCCCCATTGACTATAGGTATATCCAATTGTCAACCATCTACCGACAATCTTGACCATCATCAATCTACCTATTTGCAAAGAATGTCGGTTACGATTCCACTCTCAATTCAGAGACAACCTATTCCAATGCCATCAGCAACTGAAGTTATTGATGTGAATGGAGCAACTTCTCGAAAGAGAAGAAAACCTTGGTCGAAGGCAGAGGATTTGGAGTTGATAGCGGCTGTCGAGAAGTGTGGTGAAGGAAATTGGGCGAATATCTTGAAAGGAGACTTCAAGGGAGATAGAACTGCATCACAACTTTCTCAGGTGATATTACGTACATCTTTTTCGATCCTATTCTCCACAAATTTTTAATAGGTTATCCTATACTATATAAATATCAAATATTTAATGTCCAGCTGTATTTTTTCTTTATATAAAATATAATCATTATATTATATAATTTCAAATATTTAATGTCTAATTTTCCTTCAAAACGTATTCATTCATTGAATCCTTGTTTATTGTCCAGGCCAGGCAAGTCTGGACATTTTTTTTATATGAATATATCTAAGGTGATATGATATTGATATACATTCTTTTAAATAAAGAAATTTATAAAAATGATTAAAATTGATAATTTATCCGTATTCAAATTCAGGTAGAGATTAATAGACAACTATTTGGTATGTTTTACTTGTATTTACAAAAGACATCAAAGATATTGATTAATTTATTTTCATTTTCTAGAGGTGGTCCGTTATTAGGAAGCGGCGATGTAATTTGAACATAGGAGCTAGCACCTCAAGTACTGCTCATAAAGCTCAGATTGATGCTGCACACCGTGCATTATCCTTTGCCCTTGATTTGCCGGTGAATAACTCAAAAACAGGTTGTTCTATTGACCTTGATTATTATCTATGTTATAAACTTTAGTTATTTAGCTTTTTAAGTTTTATTTTTTTTGGTGGAATGATCATTTAACTTTTCCTTCCGTTTGAGGTACTCTATATTAAAAAAAGATTAATCGATTAGTTGGTGTGATCATGTGATATCCCAAGTTTTAGAAGGATTGCAATGTATGTTCTTGCATGTATCTTAATAGTTTTGTTGCTCTCACTTCTGCAATTTTAATATTTTCTTTATTTACTTGTAGCAGCAAATTCAAACATAAACAGTAGCATTGTTTCTTCTGCAAGTGGTTCTGAATCTTCGATTCAAATGCAGAATCAGTCTCCACAGATCTCCATGCCTTCAAGAAGAATCAACACTCCCAAGAATTCGTTGATGATAAAGTCTACTCACGATTCTGATTCTATTGTTAGAGCAACTGCCGTAGCTGCAGGGGCCCGAATTGTTTCCCCATCCGATGCTGCATCTCTACTGAAAGCCACACAGACAAAAAATGCAATCCATATAAAGTCCAAATGCAAGCTCAAACTATGTAAGTGGTAAATCTACTATGGTGGGCAATAACACAACGAAGGCTGTCTCACCAAAATTTCTGCATCATCGTTCTACCGCTATTTCGACAAATCCACCATCAAACCAAGTAAGCCCAACAACTGAGTCTCCATTGAAGCAAGAGGTTAATAGTTCAAAAGAACGTAAAACTCCCGAGGCAATCATAACCACAAAAGAGGAGTTTCGAGAAAACAGCACGGGGAGTGATGTCAAGATTAGGGGCTGACTAAACATAAAAGAAAGTGAGAAAAAGTCCATACCATATATCATGGGGGCATTGTACAAATAATAAACAACTGAGATTGACAATCACGGCTATAGCAGGTGAATGTACAGAAACATCAGTAGGTGTAAGTAGTGAATGATATGGCACATTCTTTTATTTATGAATTTTGTTGTTATTTGGGGTGTGGGGTCAATGCAAATGCTCAACAACCACTGAAATTTGGAAGCTCATATTTTTCAACAAGAAAAGGAA
ATTGTATAGCTACAGTTTATCTGTAATTGAAGGTATAATTCATTCTTTAAATTAATAGATTTTCTTTCTTACATTACATTGGCGAAATTACACCCTACAAAGGAAAGCTCATCATTGAGGTAGGGGCCCTTTAGCTAGATCACTATATTGTTTCTCTAGTTGTGGACTTGCACATGAGAAAAGTGATGGCTTTTGCCTCTTTAACTTCACCCCAAACAAGAAAGATCGTAAAGGAATTCGTGTTCGGCTTAAGGATTTTGATTGTTTCATGCTTTTGGCTGCTGAAATACCAAATTCTTGATCAAGTCTCTCCAAAGCCTTCTCAACTTGAACTTGTAGTAGACTTACCCGATCTTGACCAACTTGTAACTCATCTGCTATCTTCCTGTTTTCTTGCTTCATGTTGAGTACTTCACCTTGAAACTTTCCAGATTGATAATCACTCAGTTCTGCCTTTTCTTCAGCTGAACCTTCATCTGTGATTCTTGATAGATCACTTTGTATGTCACATAAGGATGAGAATCTATTGCATAGTTCATCTTTTAATACTGCACTATGTTCCAACCAAAGTGATAATTCGGTTTGTATCTCTCTCAAATGTGTATATATCGGCCGACCATCGGATTCAGTTGCTCCTTGATGTTTTACACTTCCTTCTTGCTTGTTTTCCTTCAAATTTTGCACCTCTGATTGCAGATCCTGGATTGAAGTCTGGAATTTTTGAATCTGGTGGACTGCCGTGCTAAATCGCAACCAAAATTCGAGATTCATCTCTAGTTGTCCATCAATATGGGAGCGGAATCTTTCTTCCGTTGGTGACATGGTAATGAATTTATCACCCCCAATTGATTTCTTGTTTACATCCTCTTTCTTCTTCATACTTCTTGCATTTTTCATTGATCTGTAGCTTCCTTCTGTGGGCTCAATACTTTGTTCGCGATAGGAATCGGGAGTTGAAACTTGATCTATATATGGAGTTGAGGATTCAGAGTACAAATAGCTAGGAGCCTCATGAATGCTTTCTTGAGGTAATTCTCGGTCTGCATCTCTTGCATTCGTGTCTTCATCTGTTTCACCATTATTTACTAATGATTTTATAACATCATCTTTAGAGGAAATGGCATCTTTCAGTTCTTTAACTTGCATTGCCAATTCAAAGATGCTATCACGGTTTTTTTGTTCAACTTCACTTAACTTGTTTCTTACGTCTTTATAGTCTCTTAGAACTGAAGTATACTCTTCCAGTAAAATCTTTTCTCTATCTTCTATTCCTTTCAAAAACGTCTGCCTCAAGGTGGGACTTGTTTCTTCCCCATGAGATTCATTTGCCTCCGAATCAAGACAACTATTTTCTGTATGAAGTTGTGGTCTTCCATCGTTTTCCTCATGCCCCAATTCCTTTACCTCATCCACTGCAAGAGTAAAGCTCTTATCCTCGTCCATGAAAAAGTCACCAAGTTTCATAGTTTCTAATTTCCTCCCATCAAATCCTGAATTTGTTGAACATTCTGTTGTTTTCACATCAGGATCGACCATCATCACATCTTGTGAGAGGTCACAAGTTTCAACATCATCCATTTTCATGGTCTGCAATCTATCAGATAAGTGGTCGAGATTACTACTTGCTTTGGTGAATTGGGTTTGAAGGTTGTTATTTTGATTTTCTGCATTCTGATTTAGATTTTTCACTCTTGCAAGTTCGGCTTCCAATTCCTTTATCTTCTTCTTCATGGTTTCTGAACTCTCCACTAGAATTTCCTTGTCTTCTTCCAACTGTTGGACGTTCGCTTGGAGCACCTCTGTTTCTGATTTTAGTCGTTTTACAAGAGACGTTTGAGAAGAAACTGCAGCTTCAAGTGTCACAATTTTATTTACAAGTTTGTCAATTTTCTCAGCCAACTCAGAAATAGTGAACGAGCTGTTGGAATCCATTTCCAGATGTTCTCTGATCTTCTGATCAAGGAGTTCTATATCATGTTTGTCTTCTGCTGTACAATTGA
CTACTTGATCTGAGATATTCAATTCTGGTTCATTGCTTTGATCTTCATGTAAATCAGTGCACTCGTGATGGTCTATTGGCTTTGGAAGGAACTTAAATTTGAGGCTTTCGAACTTTGTAACTACATCTTTTATTCTACCTTTCTCTAATTTTGTTTCTTCAACTGTTTTCTCCTGTTCCTCTTGCAGTTTGGCCAGAGTTTCTCTGCAAGATTTTAGAGCAGTTGTTGCCATTAATGTTCGAGCTTCATTATCTTCAATAACAGTGCCAATCTCGAACTCGTCTTGCAAGTTGCTTACTCTCTTTTGCATTTTTGTGATACTGCTTTCCATCTCCCAATATTTCTCGCATTCACGTTCATACAAACTCTTCACGAATTCCATTTCTGTCTGTCGGGCCAGAATTTCTTTCTGTAGCATATCGATCTCTTCTAAGGCTTCGGTTTTATCAAGCCCTGATTTTGGAGTTAAAGCAACTCTATTTTTAGGAGTGCTTTCATTTCTCTTTAGTTGTGTCTTTCGTCTTATCATGGAAGGACTTCTAAAACTTCTCTCAGGAAACTTTGGAACTTCAGGGATTCCTGGTTTTGGGGAGCCATCCAATTCATTTGAAAATTCGGAAGGACTTTTCGCTGAAGTTGATTCTCGAGAAAAGAAATCAACTTCACAGTCATCATCATCAATTGTGTAATGAACTCGTTCGGGAAAAATCGAAGCTATTGTCCGATTTGCTCCTTGAAAGTCTTTTGATAGGTGATCATATCTCTCTGCTAAAGCTCGGTATGCTCGAAAAGACTCTTCTACATGTTCTACAAGTTCAGGTCTCTTCCTGTAATACATTTCAGCCCTCCTTGCAAAAGAATCTCCATCGCCTTCAATGATTCTCATCATGCTATCGACCTTCTCTTCCATATCTACAATTCCAAAAGCATTTGCAGTGTGAGTCCAAATATCATTTTAAAACACTCTTGGGTAATAAATTAATACATCAATTTGAGGAACATATAGCATTGATTATTACTTATATATCACTCCAGATTGGAATTCCTAAAACTATAATATACATAACATTAAAACTCTAATAAATATTTATGGTGTAAAACTAGAGGGGAAAACACCTAATGTGTGTATCTATCTATATGTGTATATACATGCAGGTCAATACAAATATCATTTCAAAACACTCTCAACTAATGAATTATTTTATGATTGGAGTGTGTAAGGAATGTTGGGGTTTTTCGTGGTAGGAAGCAGGATCACGAGGTTTGGTTTCTTATTAGATTTTATGTATCTCGTTGAGTTTTGATTTCAAAGACATTTTGTAACTATTCCATCACTAACATTTTATTTATCTGGAACTCGTTCTTCTAGTTCTGTTT
mRNA sequence
CTATCTAAAATTGGAATATTTGTGCGTAAATCGGAAAGATTACAATTTGAGAGAATCATTGCAATTTACCGCAAACTACAAAGATTCGCTTGCTTTCGCTTCCAATGATCCTTAGTTCTGTGCGTGCGTGTCTATCATTAAGCACTGCACCCCCACCGCCGTCGCCGATAAGTCCGCATTACTTTTGCAACCGTTAAAGCAAAGCCTTCGTTTTCCGGTTTCGTTTTCAAAAATAGCCGATTTCATTCTCGTCGGCGACGTTCACCGACGCACCGGAGCTCTATAATGATTGGGAAGAAAGAGAATTGAAGCACAGGACAATTAGTATGCAGGGTTTAGGGTTTTGGCTGACCTCCAATTAATGGATTTTGGTCCTTTTTGCTTGTAAAGGCTTCTATTATTCTTGCTACTTTTCTTATGCTATTTGATTACGAGCTCTATAATGATGGGGAAGAAAGAGAATCGAAGGACAGGGACGATTAGTATGGAGGATTGTTCCCCTCTATTGGAAAGATATTCAGTGAGGACGATACTTACTTTGCTTCGGGAGGTGGCCCAGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGGAGAACACGTCCACTGGGATTTCTGATGCTCGGGAGTATCAGTTGTTATGGCGTCACTTGGCTTATCGTCAAACTTTACTGGAGGACATGCATTCTGTTACTGATTCACTGGAGTTTGATTTGCAGGATTATGATAGTGATTTGGATTTTGAGGTAGAGCCTTTTCCATCTGTCAGCAGTGAGTCCTCTAATGAAGCTTCAGCATGTGTGAAGGTGCTTATTGCTAATAGTATACCGAATGAGTCAGATGTTCCAAATAGTTCTGCGGTTGAGGCCCCATTGACTATAGGTATATCCAATTGTCAACCATCTACCGACAATCTTGACCATCATCAATCTACCTATTTGCAAAGAATGTCGGTTACGATTCCACTCTCAATTCAGAGACAACCTATTCCAATGCCATCAGCAACTGAAGTTATTGATGTGAATGGAGCAACTTCTCGAAAGAGAAGAAAACCTTGGTCGAAGGCAGAGGATTTGGAGTTGATAGCGGCTGTCGAGAAGTGTGGTGAAGGAAATTGGGCGAATATCTTGAAAGGAGACTTCAAGGGAGATAGAACTGCATCACAACTTTCTCAGAGGTGGTCCGTTATTAGGAAGCGGCGATGTAATTTGAACATAGGAGCTAGCACCTCAAGTACTGCTCATAAAGCTCAGATTGATGCTGCACACCGTGCATTATCCTTTGCCCTTGATTTGCCGGTGAATAACTCAAAAACAGCAAATTCAAACATAAACAGTAGCATTGTTTCTTCTGCAAGTGGTTCTGAATCTTCGATTCAAATGCAGAATCAGTCTCCACAGATCTCCATGCCTTCAAGAAGAATCAACACTCCCAAGAATTCGTTGATGATAAAGTCTACTCACGATTCTGATTCTATTGTTAGAGCAACTGCCGTAGCTGCAGGGGCCCGAATTGTTTCCCCATCCGATGCTGCATCTCTACTGAAAGCCACACAGACAAAAAATGCAATCCATATAAAGTCCAAATGCAAGCTCAAACTATGTAAGTGGTAAATCTACTATGGTGGGCAATAACACAACGAAGGCTGTCTCACCAAAATTTCTGCATCATCGTTCTACCGCTATTTCGACAAATCCACCATCAAACCAAGTAAGCCCAACAACTGAGTCTCCATTGAAGCAAGAGGTTAATAGTTCAAAAGAACGTAAAACTCCCGAGGCAATCATAACCACAAAAGAGGAGTTTCGAGAAAACAGCACGGGGAGTGATGTCAAGATTAGGGGCTGACTAAACATAAAAGAAAGTGAGAAAAAGTCCATACCATATATCATGGGGGCATTGTACAAATAATAAACAACTGAGATTGACAATCACGGCTATAGCAGGTGAATGTACAGAAACATCAGTAGGTGTAAGTAGTGAATGATATGGCACATT
CTTTTATTTATGAATTTTGTTGTTATTTGGGGTGTGGGGTCAATGCAAATGCTCAACAACCACTGAAATTTGGAAGCTCATATTTTTCAACAAGAAAAGGAAATTGTATAGCTACAGTTTATCTGTAATTGAAGATCCTGGATTGAAGTCTGGAATTTTTGAATCTGGTGGACTGCCGTGCTAAATCGCAACCAAAATTCGAGATTCATCTCTAGTTGTCCATCAATATGGGAGCGGAATCTTTCTTCCGTTGGTGACATGGTAATGAATTTATCACCCCCAATTGATTTCTTGTTTACATCCTCTTTCTTCTTCATACTTCTTGCATTTTTCATTGATCTGTAGCTTCCTTCTGTGGGCTCAATACTTTGTTCGCGATAGGAATCGGGAGTTGAAACTTGATCTATATATGGAGTTGAGGATTCAGAGTACAAATAGCTAGGAGCCTCATGAATGCTTTCTTGAGGTCAATACAAATATCATTTCAAAACACTCTCAACTAATGAATTATTTTATGATTGGAGTGTGTAAGGAATGTTGGGGTTTTTCGTGGTAGGAAGCAGGATCACGAGGTTTGGTTTCTTATTAGATTTTATGTATCTCGTTGAGTTTTGATTTCAAAGACATTTTGTAACTATTCCATCACTAACATTTTATTTATCTGGAACTCGTTCTTCTAGTTCTGTTT
Coding sequence (CDS)
ATGATGGGGAAGAAAGAGAATCGAAGGACAGGGACGATTAGTATGGAGGATTGTTCCCCTCTATTGGAAAGATATTCAGTGAGGACGATACTTACTTTGCTTCGGGAGGTGGCCCAGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGGAGAACACGTCCACTGGGATTTCTGATGCTCGGGAGTATCAGTTGTTATGGCGTCACTTGGCTTATCGTCAAACTTTACTGGAGGACATGCATTCTGTTACTGATTCACTGGAGTTTGATTTGCAGGATTATGATAGTGATTTGGATTTTGAGGTAGAGCCTTTTCCATCTGTCAGCAGTGAGTCCTCTAATGAAGCTTCAGCATGTGTGAAGGTGCTTATTGCTAATAGTATACCGAATGAGTCAGATGTTCCAAATAGTTCTGCGGTTGAGGCCCCATTGACTATAGGTATATCCAATTGTCAACCATCTACCGACAATCTTGACCATCATCAATCTACCTATTTGCAAAGAATGTCGGTTACGATTCCACTCTCAATTCAGAGACAACCTATTCCAATGCCATCAGCAACTGAAGTTATTGATGTGAATGGAGCAACTTCTCGAAAGAGAAGAAAACCTTGGTCGAAGGCAGAGGATTTGGAGTTGATAGCGGCTGTCGAGAAGTGTGGTGAAGGAAATTGGGCGAATATCTTGAAAGGAGACTTCAAGGGAGATAGAACTGCATCACAACTTTCTCAGAGGTGGTCCGTTATTAGGAAGCGGCGATGTAATTTGAACATAGGAGCTAGCACCTCAAGTACTGCTCATAAAGCTCAGATTGATGCTGCACACCGTGCATTATCCTTTGCCCTTGATTTGCCGGTGAATAACTCAAAAACAGCAAATTCAAACATAAACAGTAGCATTGTTTCTTCTGCAAGTGGTTCTGAATCTTCGATTCAAATGCAGAATCAGTCTCCACAGATCTCCATGCCTTCAAGAAGAATCAACACTCCCAAGAATTCGTTGATGATAAAGTCTACTCACGATTCTGATTCTATTGTTAGAGCAACTGCCGTAGCTGCAGGGGCCCGAATTGTTTCCCCATCCGATGCTGCATCTCTACTGAAAGCCACACAGACAAAAAATGCAATCCATATAAAGTCCAAATGCAAGCTCAAACTATGTAAGTGGTAA
Protein sequence
MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISDAREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASACVKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQRQPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTASQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVSPSDAASLLKATQTKNAIHIKSKCKLKLCKW*
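The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below is an illustration only, assuming the standard genetic code; the function name `translate` and the short test fragments are chosen for this example, and the fragments are the first 24 and last 12 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight and last four codons of the CDS shown above.
cds_start = "ATGATGGGGAAGAAAGAGAATCGA"
cds_end = "TGTAAGTGGTAA"

print(translate(cds_start))  # MMGKKENR
print(translate(cds_end))    # CKW*
```

Both fragments reproduce the corresponding ends of the protein sequence (MMGKKENR... and ...CKW*), with the trailing `*` standing for the TAA stop codon.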
Homology
BLAST of CsGy3G043000 vs. NCBI nr
Match:
XP_011652579.1 (uncharacterized protein LOC101205013 isoform X2 [Cucumis sativus])
HSP 1 Score: 747 bits (1929), Expect = 1.48e-271
Identity = 390/390 (100.00%), Positives = 390/390 (100.00%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD
Sbjct: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC
Sbjct: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR
Sbjct: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA
Sbjct: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
Query: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS 300
SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS
Sbjct: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS 300
Query: 301 IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS 360
IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS
Sbjct: 301 IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS 360
Query: 361 PSDAASLLKATQTKNAIHIKSKCKLKLCKW 390
PSDAASLLKATQTKNAIHIKSKCKLKLCKW
Sbjct: 361 PSDAASLLKATQTKNAIHIKSKCKLKLCKW 390
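The percentage in each HSP statistics line is the number of identical positions divided by the alignment length. A minimal sketch of that arithmetic follows, assuming the "Identity = matches/length (percent%)" layout used in the reports on this page; the helper name `identity_percent` is hypothetical.

```python
import re

# Matches the "Identity = 390/390 (100.00%)" part of an HSP statistics line.
IDENTITY_RE = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")


def identity_percent(line):
    """Return (matches, alignment_length, percent) recomputed from the counts."""
    m = IDENTITY_RE.search(line)
    if m is None:
        raise ValueError("no Identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    return matches, length, round(100.0 * matches / length, 2)


print(identity_percent("Identity = 390/390 (100.00%), Query Frame = 0"))
print(identity_percent("Identity = 390/391 (99.74%), Query Frame = 0"))
```

The second call shows how a single gap column lengthens the alignment to 391 and lowers the identity to 99.74% even though all 390 query residues match.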
BLAST of CsGy3G043000 vs. NCBI nr
Match:
XP_011652578.1 (uncharacterized protein LOC101205013 isoform X1 [Cucumis sativus] >XP_031738636.1 uncharacterized protein LOC101205013 isoform X1 [Cucumis sativus] >XP_031738637.1 uncharacterized protein LOC101205013 isoform X1 [Cucumis sativus] >XP_031738638.1 uncharacterized protein LOC101205013 isoform X1 [Cucumis sativus] >XP_031738639.1 uncharacterized protein LOC101205013 isoform X1 [Cucumis sativus])
HSP 1 Score: 743 bits (1917), Expect = 1.03e-269
Identity = 390/391 (99.74%), Positives = 390/391 (99.74%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD
Sbjct: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC
Sbjct: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR
Sbjct: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA
Sbjct: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
Query: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTA-NSNINS 300
SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTA NSNINS
Sbjct: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTAANSNINS 300
Query: 301 SIVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIV 360
SIVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIV
Sbjct: 301 SIVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIV 360
Query: 361 SPSDAASLLKATQTKNAIHIKSKCKLKLCKW 390
SPSDAASLLKATQTKNAIHIKSKCKLKLCKW
Sbjct: 361 SPSDAASLLKATQTKNAIHIKSKCKLKLCKW 391
BLAST of CsGy3G043000 vs. NCBI nr
Match:
KGN60262.1 (hypothetical protein Csa_001375 [Cucumis sativus])
HSP 1 Score: 731 bits (1888), Expect = 2.17e-265
Identity = 385/390 (98.72%), Positives = 385/390 (98.72%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD
Sbjct: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYRQTLLEDMHSVTDSL DYDSDLDFEVEPFPSVSSESSNEASAC
Sbjct: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSL-----DYDSDLDFEVEPFPSVSSESSNEASAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR
Sbjct: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA
Sbjct: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
Query: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS 300
SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS
Sbjct: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS 300
Query: 301 IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS 360
IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS
Sbjct: 301 IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS 360
Query: 361 PSDAASLLKATQTKNAIHIKSKCKLKLCKW 390
PSDAASLLKATQTKNAIHIKSKCKLKLCKW
Sbjct: 361 PSDAASLLKATQTKNAIHIKSKCKLKLCKW 385
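The identity count can also be recovered directly from an aligned Query/Sbjct pair by comparing columns. The sketch below does this for the one row of the KGN60262.1 alignment that contains a gap (query positions 61 to 120); the function name `alignment_stats` is hypothetical, and the two strings are copied from that row.

```python
def alignment_stats(query, subject):
    """Count identical columns and gap columns in one aligned row pair."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(query, subject))
    identical = sum(q == s and q != "-" for q, s in pairs)
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return identical, gaps, len(pairs)


# Query and Sbjct rows for positions 61-120 of the KGN60262.1 HSP.
query = "AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC"
subject = "AREYQLLWRHLAYRQTLLEDMHSVTDSL-----DYDSDLDFEVEPFPSVSSESSNEASAC"

print(alignment_stats(query, subject))  # (55, 5, 60)
```

The five gap columns in this row account for the difference between the 385 identities reported for this HSP and the 390-residue query, since every other row of the alignment is identical.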
BLAST of CsGy3G043000 vs. NCBI nr
Match:
XP_011652580.1 (uncharacterized protein LOC101205013 isoform X3 [Cucumis sativus])
HSP 1 Score: 727 bits (1876), Expect = 1.52e-263
Identity = 385/391 (98.47%), Positives = 385/391 (98.47%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD
Sbjct: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYRQTLLEDMHSVTDSL DYDSDLDFEVEPFPSVSSESSNEASAC
Sbjct: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSL-----DYDSDLDFEVEPFPSVSSESSNEASAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR
Sbjct: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA
Sbjct: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
Query: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTA-NSNINS 300
SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTA NSNINS
Sbjct: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTAANSNINS 300
Query: 301 SIVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIV 360
SIVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIV
Sbjct: 301 SIVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIV 360
Query: 361 SPSDAASLLKATQTKNAIHIKSKCKLKLCKW 390
SPSDAASLLKATQTKNAIHIKSKCKLKLCKW
Sbjct: 361 SPSDAASLLKATQTKNAIHIKSKCKLKLCKW 386
BLAST of CsGy3G043000 vs. NCBI nr
Match:
XP_008466159.1 (PREDICTED: uncharacterized protein LOC103503656 isoform X1 [Cucumis melo] >XP_008466160.1 PREDICTED: uncharacterized protein LOC103503656 isoform X1 [Cucumis melo])
HSP 1 Score: 643 bits (1658), Expect = 2.20e-228
Identity = 348/402 (86.57%), Positives = 360/402 (89.55%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
M+GKKENRRTGTISMEDCS LL RYSVRTI TLLREVAQVSGVRIDWDKLV+NTSTGISD
Sbjct: 1 MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYR TLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSV SESSNEA+AC
Sbjct: 61 AREYQLLWRHLAYRHTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVGSESSNEAAAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIAN IPNESDVPNSSAVEAPLTI ISN QP TDN D+HQS LQ +SVTIPLSIQR
Sbjct: 121 VKVLIANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASLQGISVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGAT-----SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
QPIP+P A EV DVNGA SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK
Sbjct: 181 QPIPVPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
Query: 241 GDRTASQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANS 300
GDRTASQLSQRWSVIRKRRCNLN+GASTSST KAQIDAAHRAL+FALDLPVNN+KTANS
Sbjct: 241 GDRTASQLSQRWSVIRKRRCNLNLGASTSSTTQKAQIDAAHRALNFALDLPVNNTKTANS 300
Query: 301 NINSSIVSS-ASGSESSIQMQNQSPQISMPSRR-------------INTPKNSLMIKSTH 360
NINSSIVSS AS SESS+QMQNQSPQISMPSR INT KNSLMI STH
Sbjct: 301 NINSSIVSSSASASESSVQMQNQSPQISMPSRPLLVDPLPSAVKSGINTSKNSLMINSTH 360
Query: 361 DSDSIVRATAVAAGARIVSPSDAASLLKATQTKNAIHIKSKC 383
+SDSIVRATAVAAGARIVSPSDAASL+KATQTKNAIHIKSKC
Sbjct: 361 NSDSIVRATAVAAGARIVSPSDAASLMKATQTKNAIHIKSKC 402
BLAST of CsGy3G043000 vs. ExPASy TrEMBL
Match:
A0A0A0LHM8 (HTH myb-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G891670 PE=4 SV=1)
HSP 1 Score: 731 bits (1888), Expect = 1.05e-265
Identity = 385/390 (98.72%), Positives = 385/390 (98.72%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD
Sbjct: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYRQTLLEDMHSVTDSL DYDSDLDFEVEPFPSVSSESSNEASAC
Sbjct: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSL-----DYDSDLDFEVEPFPSVSSESSNEASAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR
Sbjct: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA
Sbjct: 181 QPIPMPSATEVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTA 240
Query: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS 300
SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS
Sbjct: 241 SQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSS 300
Query: 301 IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS 360
IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS
Sbjct: 301 IVSSASGSESSIQMQNQSPQISMPSRRINTPKNSLMIKSTHDSDSIVRATAVAAGARIVS 360
Query: 361 PSDAASLLKATQTKNAIHIKSKCKLKLCKW 390
PSDAASLLKATQTKNAIHIKSKCKLKLCKW
Sbjct: 361 PSDAASLLKATQTKNAIHIKSKCKLKLCKW 385
BLAST of CsGy3G043000 vs. ExPASy TrEMBL
Match:
A0A1S3CQJ6 (uncharacterized protein LOC103503656 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503656 PE=4 SV=1)
HSP 1 Score: 643 bits (1658), Expect = 1.06e-228
Identity = 348/402 (86.57%), Positives = 360/402 (89.55%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
M+GKKENRRTGTISMEDCS LL RYSVRTI TLLREVAQVSGVRIDWDKLV+NTSTGISD
Sbjct: 1 MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYR TLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSV SESSNEA+AC
Sbjct: 61 AREYQLLWRHLAYRHTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVGSESSNEAAAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIAN IPNESDVPNSSAVEAPLTI ISN QP TDN D+HQS LQ +SVTIPLSIQR
Sbjct: 121 VKVLIANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASLQGISVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGAT-----SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
QPIP+P A EV DVNGA SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK
Sbjct: 181 QPIPVPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
Query: 241 GDRTASQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANS 300
GDRTASQLSQRWSVIRKRRCNLN+GASTSST KAQIDAAHRAL+FALDLPVNN+KTANS
Sbjct: 241 GDRTASQLSQRWSVIRKRRCNLNLGASTSSTTQKAQIDAAHRALNFALDLPVNNTKTANS 300
Query: 301 NINSSIVSS-ASGSESSIQMQNQSPQISMPSRR-------------INTPKNSLMIKSTH 360
NINSSIVSS AS SESS+QMQNQSPQISMPSR INT KNSLMI STH
Sbjct: 301 NINSSIVSSSASASESSVQMQNQSPQISMPSRPLLVDPLPSAVKSGINTSKNSLMINSTH 360
Query: 361 DSDSIVRATAVAAGARIVSPSDAASLLKATQTKNAIHIKSKC 383
+SDSIVRATAVAAGARIVSPSDAASL+KATQTKNAIHIKSKC
Sbjct: 361 NSDSIVRATAVAAGARIVSPSDAASLMKATQTKNAIHIKSKC 402
BLAST of CsGy3G043000 vs. ExPASy TrEMBL
Match:
A0A5A7T5C8 (HTH myb-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold92G002020 PE=4 SV=1)
HSP 1 Score: 638 bits (1646), Expect = 7.37e-227
Identity = 348/403 (86.35%), Positives = 360/403 (89.33%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
M+GKKENRRTGTISMEDCS LL RYSVRTI TLLREVAQVSGVRIDWDKLV+NTSTGISD
Sbjct: 1 MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYR TLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSV SESSNEA+AC
Sbjct: 61 AREYQLLWRHLAYRHTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVGSESSNEAAAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIAN IPNESDVPNSSAVEAPLTI ISN QP TDN D+HQS LQ +SVTIPLSIQR
Sbjct: 121 VKVLIANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASLQGISVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGAT-----SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
QPIP+P A EV DVNGA SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK
Sbjct: 181 QPIPVPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
Query: 241 GDRTASQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTA-N 300
GDRTASQLSQRWSVIRKRRCNLN+GASTSST KAQIDAAHRAL+FALDLPVNN+KTA N
Sbjct: 241 GDRTASQLSQRWSVIRKRRCNLNLGASTSSTTQKAQIDAAHRALNFALDLPVNNTKTAAN 300
Query: 301 SNINSSIVSS-ASGSESSIQMQNQSPQISMPSRR-------------INTPKNSLMIKST 360
SNINSSIVSS AS SESS+QMQNQSPQISMPSR INT KNSLMI ST
Sbjct: 301 SNINSSIVSSSASASESSVQMQNQSPQISMPSRPLLVDPLPSAVKSGINTSKNSLMINST 360
Query: 361 HDSDSIVRATAVAAGARIVSPSDAASLLKATQTKNAIHIKSKC 383
H+SDSIVRATAVAAGARIVSPSDAASL+KATQTKNAIHIKSKC
Sbjct: 361 HNSDSIVRATAVAAGARIVSPSDAASLMKATQTKNAIHIKSKC 403
BLAST of CsGy3G043000 vs. ExPASy TrEMBL
Match:
A0A1S3CRZ4 (uncharacterized protein LOC103503656 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503656 PE=4 SV=1)
HSP 1 Score: 627 bits (1617), Expect = 1.53e-222
Identity = 343/402 (85.32%), Positives = 355/402 (88.31%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
M+GKKENRRTGTISMEDCS LL RYSVRTI TLLREVAQVSGVRIDWDKLV+NTSTGISD
Sbjct: 1 MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYR TLLEDMHSVTDSL DYDSDLDFEVEPFPSV SESSNEA+AC
Sbjct: 61 AREYQLLWRHLAYRHTLLEDMHSVTDSL-----DYDSDLDFEVEPFPSVGSESSNEAAAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIAN IPNESDVPNSSAVEAPLTI ISN QP TDN D+HQS LQ +SVTIPLSIQR
Sbjct: 121 VKVLIANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASLQGISVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGAT-----SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
QPIP+P A EV DVNGA SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK
Sbjct: 181 QPIPVPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
Query: 241 GDRTASQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANS 300
GDRTASQLSQRWSVIRKRRCNLN+GASTSST KAQIDAAHRAL+FALDLPVNN+KTANS
Sbjct: 241 GDRTASQLSQRWSVIRKRRCNLNLGASTSSTTQKAQIDAAHRALNFALDLPVNNTKTANS 300
Query: 301 NINSSIVSS-ASGSESSIQMQNQSPQISMPSRR-------------INTPKNSLMIKSTH 360
NINSSIVSS AS SESS+QMQNQSPQISMPSR INT KNSLMI STH
Sbjct: 301 NINSSIVSSSASASESSVQMQNQSPQISMPSRPLLVDPLPSAVKSGINTSKNSLMINSTH 360
Query: 361 DSDSIVRATAVAAGARIVSPSDAASLLKATQTKNAIHIKSKC 383
+SDSIVRATAVAAGARIVSPSDAASL+KATQTKNAIHIKSKC
Sbjct: 361 NSDSIVRATAVAAGARIVSPSDAASLMKATQTKNAIHIKSKC 397
BLAST of CsGy3G043000 vs. ExPASy TrEMBL
Match:
A0A5D3E5P5 (HTH myb-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G005670 PE=4 SV=1)
HSP 1 Score: 622 bits (1605), Expect = 1.06e-220
Identity = 343/403 (85.11%), Positives = 355/403 (88.09%), Query Frame = 0
Query: 1 MMGKKENRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISD 60
M+GKKENRRTGTISMEDCS LL RYSVRTI TLLREVAQVSGVRIDWDKLV+NTSTGISD
Sbjct: 1 MIGKKENRRTGTISMEDCSTLLGRYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISD 60
Query: 61 AREYQLLWRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASAC 120
AREYQLLWRHLAYR TLLEDMHSVTDSL DYDSDLDFEVEPFPSV SESSNEA+AC
Sbjct: 61 AREYQLLWRHLAYRHTLLEDMHSVTDSL-----DYDSDLDFEVEPFPSVGSESSNEAAAC 120
Query: 121 VKVLIANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQR 180
VKVLIAN IPNESDVPNSSAVEAPLTI ISN QP TDN D+HQS LQ +SVTIPLSIQR
Sbjct: 121 VKVLIANGIPNESDVPNSSAVEAPLTIRISNSQPPTDNFDNHQSASLQGISVTIPLSIQR 180
Query: 181 QPIPMPSATEVIDVNGAT-----SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
QPIP+P A EV DVNGA SRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK
Sbjct: 181 QPIPVPPAAEVFDVNGAAGASAASRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFK 240
Query: 241 GDRTASQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTA-N 300
GDRTASQLSQRWSVIRKRRCNLN+GASTSST KAQIDAAHRAL+FALDLPVNN+KTA N
Sbjct: 241 GDRTASQLSQRWSVIRKRRCNLNLGASTSSTTQKAQIDAAHRALNFALDLPVNNTKTAAN 300
Query: 301 SNINSSIVSS-ASGSESSIQMQNQSPQISMPSRR-------------INTPKNSLMIKST 360
SNINSSIVSS AS SESS+QMQNQSPQISMPSR INT KNSLMI ST
Sbjct: 301 SNINSSIVSSSASASESSVQMQNQSPQISMPSRPLLVDPLPSAVKSGINTSKNSLMINST 360
Query: 361 HDSDSIVRATAVAAGARIVSPSDAASLLKATQTKNAIHIKSKC 383
H+SDSIVRATAVAAGARIVSPSDAASL+KATQTKNAIHIKSKC
Sbjct: 361 HNSDSIVRATAVAAGARIVSPSDAASLMKATQTKNAIHIKSKC 398
BLAST of CsGy3G043000 vs. TAIR 10
Match:
AT1G09710.1 (Homeodomain-like superfamily protein)
HSP 1 Score: 208.0 bits (528), Expect = 1.4e-53
Identity = 155/384 (40.36%), Positives = 221/384 (57.55%), Query Frame = 0
Query: 7 NRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISDAREYQL 66
NRR I+ D + LL RY + TIL +L+E++ S ++DW+ LV+ T+TGI++AREYQL
Sbjct: 10 NRRKRIITEGDIATLLLRYDMETILRMLQEISYCSETKMDWNALVKKTTTGITNAREYQL 69
Query: 67 LWRHLAYRQTLL--EDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASACVKVL 126
LWRHL+YR LL ED D+L D DSD++ E+E P+VS E+S EA A VKV+
Sbjct: 70 LWRHLSYRHPLLPVED-----DALPL---DDDSDMECELEASPAVSHEASVEAIAHVKVM 129
Query: 127 IANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQRQPIP 186
A+ + +ESD+ + S VEAPLTI I P + M++ P+ +Q+
Sbjct: 130 AASYVLSESDILDDSTVEAPLTINIPYALPEGSQEPSESPWSSRGMNINFPVCLQK---- 189
Query: 187 MPSATEVIDVNGATS-----RKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRT 246
++TE ++ NG+ R++RK WS ED EL AAV++CGEGNWA+I+KGDF+G+RT
Sbjct: 190 -VTSTEGMNGNGSAGISMAFRRKRKRWSAEEDEELFAAVKRCGEGNWAHIVKGDFRGERT 249
Query: 247 ASQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTAN----- 306
ASQLSQRW++IRK RC+ STS + Q A A++ AL L + N +N
Sbjct: 250 ASQLSQRWALIRK-RCH----TSTSVSQCGLQGTEAKLAVNHALSLALGNRPPSNKLAIG 309
Query: 307 ---SNINSSIVSSASGSESSIQMQNQSPQI---------SMPSRRINTPKNSLMIKSTHD 366
+ + +I + + SS Q Q QS I S+P+ + K + ST
Sbjct: 310 LMPTTSSCTITETEANGGSSSQGQQQSKPIVQALPRAGTSLPAAKSRVVKKT-TASSTSR 369
BLAST of CsGy3G043000 vs. TAIR 10
Match:
AT1G58220.1 (Homeodomain-like superfamily protein )
HSP 1 Score: 204.1 bits (518), Expect = 2.0e-52
Identity = 156/392 (39.80%), Positives = 221/392 (56.38%), Query Frame = 0
Query: 8 RRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISDAREYQLL 67
+R IS D + LL+RY TIL LL+E+A + +++W++LV+ TSTGI+ AREYQLL
Sbjct: 9 KRKEFISEADIATLLQRYDTVTILKLLQEMAYYAEAKMNWNELVKKTSTGITSAREYQLL 68
Query: 68 WRHLAYRQTLLEDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASACVKVLIAN 127
WRHLAYR +L+ ++ + D DSD++ E+E P VS + EA A VKV+ A+
Sbjct: 69 WRHLAYRDSLVPVGNNAR------VLDDDSDMECELEASPGVSVDVVTEAVAHVKVMAAS 128
Query: 128 SIPNESDVPNSSAVEAPLTIGI--SNCQPSTDNLDHHQSTYLQRMSVTIPLSIQRQPIPM 187
+P+ESD+P S VEAPLTI I S + + D + S+ + M++T P+ +
Sbjct: 129 YVPSESDIPEDSTVEAPLTINIPYSLHRGPQEPSDSYWSS--RGMNITF-------PVFL 188
Query: 188 PSATEVIDVNGATS----RKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTAS 247
P A E + NG S RKRRK WS ED ELIAAV++ GEG+WA I K +F+G+RTAS
Sbjct: 189 PKAAEGHNGNGLASSLAPRKRRKKWSAEEDEELIAAVKRHGEGSWALISKEEFEGERTAS 248
Query: 248 QLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFAL-------DLPVNNSKTAN 307
QLSQRW IR+R N T +AQ+ AA+RALS A+ L V + +
Sbjct: 249 QLSQRWGAIRRRTDTSNTSTQTGLQRTEAQM-AANRALSLAVGNRLPSKKLAVGMTPMLS 308
Query: 308 SNINSSIVSSASGSESSIQMQNQ-SPQI--------SMPSRRINTPKNSLMIKSTHDSDS 367
S ++ + S S++Q Q Q PQI S+P + P ST +D
Sbjct: 309 SGTIKGAQANGASSGSTLQGQQQPQPQIQALSRATTSVPVAKSRVPVKKTTGNSTSRADL 368
Query: 368 IVRATAVAAGARIVSPSDAASLLKATQTKNAI 378
+V A +VAA A + + A ++ K KNA+
Sbjct: 369 MVTANSVAAAACMSGLATAVTVPKIEPGKNAV 384
BLAST of CsGy3G043000 vs. TAIR 10
Match:
AT1G09710.2 (Homeodomain-like superfamily protein)
HSP 1 Score: 201.4 bits (511), Expect = 1.3e-51
Identity = 136/330 (41.21%), Postives = 198/330 (60.00%), Query Frame = 0
Query: 7 NRRTGTISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISDAREYQL 66
NRR I+ D + LL RY + TIL +L+E++ S ++DW+ LV+ T+TGI++AREYQL
Sbjct: 10 NRRKRIITEGDIATLLLRYDMETILRMLQEISYCSETKMDWNALVKKTTTGITNAREYQL 69
Query: 67 LWRHLAYRQTLL--EDMHSVTDSLEFDLQDYDSDLDFEVEPFPSVSSESSNEASACVKVL 126
LWRHL+YR LL ED D+L D DSD++ E+E P+VS E+S EA A VKV+
Sbjct: 70 LWRHLSYRHPLLPVED-----DALPL---DDDSDMECELEASPAVSHEASVEAIAHVKVM 129
Query: 127 IANSIPNESDVPNSSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRMSVTIPLSIQRQPIP 186
A+ + +ESD+ + S VEAPLTI I P + M++ P+ +Q+
Sbjct: 130 AASYVLSESDILDDSTVEAPLTINIPYALPEGSQEPSESPWSSRGMNINFPVCLQK---- 189
Query: 187 MPSATEVIDVNGATS-----RKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRT 246
++TE ++ NG+ R++RK WS ED EL AAV++CGEGNWA+I+KGDF+G+RT
Sbjct: 190 -VTSTEGMNGNGSAGISMAFRRKRKRWSAEEDEELFAAVKRCGEGNWAHIVKGDFRGERT 249
Query: 247 ASQLSQRWSVIRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINS 306
ASQLSQRW++IRK RC+ STS + Q A A++ AL L + N +N
Sbjct: 250 ASQLSQRWALIRK-RCH----TSTSVSQCGLQGTEAKLAVNHALSLALGNRPPSNKLAIG 309
Query: 307 SIVSSASGSESSIQMQNQSPQISMPSRRIN 330
+ + + SSI + + + +P +N
Sbjct: 310 TSSRRSFPANSSIYVITEDALVWLPLACLN 321
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_011652579.1 | 1.48e-271 | 100.00 | uncharacterized protein LOC101205013 isoform X2 [Cucumis sativus]
XP_011652578.1 | 1.03e-269 | 99.74 | uncharacterized protein LOC101205013 isoform X1 [Cucumis sativus] >XP_031738636....
KGN60262.1 | 2.17e-265 | 98.72 | hypothetical protein Csa_001375 [Cucumis sativus]
XP_011652580.1 | 1.52e-263 | 98.47 | uncharacterized protein LOC101205013 isoform X3 [Cucumis sativus]
XP_008466159.1 | 2.20e-228 | 86.57 | PREDICTED: uncharacterized protein LOC103503656 isoform X1 [Cucumis melo] >XP_00...
Match Name | E-value | Identity | Description
A0A0A0LHM8 | 1.05e-265 | 98.72 | HTH myb-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G89167...
A0A1S3CQJ6 | 1.06e-228 | 86.57 | uncharacterized protein LOC103503656 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7T5C8 | 7.37e-227 | 86.35 | HTH myb-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A1S3CRZ4 | 1.53e-222 | 85.32 | uncharacterized protein LOC103503656 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3E5P5 | 1.06e-220 | 85.11 | HTH myb-type domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN...