Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCCATCTCCCGAAGGTAATCTCATTCGTTGCTTCGATCTCCGTTCTTCCTGCCGATTGCGAATCTAAATATACATATTCAATGCCATCGTCGGTTTGGGTTTCTCATGATTCTGCCTTGATTCCATTGTTACTCTCTCGCGGAATCATCAGCAATAGAAGCACGTATTAGATCTGCGTTTCGGTATTGTTCGCATCTCTGCCTTTAGGATTAGCAATGTCTTTGAATCGGATGAGTATGTTTCTTTCGAGGCAGAGGGGAGTTTTCTTCTTGAAGTTTGAGTTGTTGGTTGAGGATCAGTTGTTCAAATACTGGAATGTCTTCTAGCAATTCAATCCAATTTCCATATCAAGTGATTCTGGTGATTGTGGAATATGTTTGCGTAGATGGGGGGTTTATGCTTCTTTTATTTTTTTTTTTCTTCCTTTTAATTTTATCATCAATTTTGTCTTGTTCGTTAGGAATCTGGCATCTTTTGTTCTTTTTATGGGATGGATGGAGAAGTTTTTGCGTGGTTATGAGAGTAATAATTAGGGAGATCGGGCAAGTACTATGAGGTTGTGCATAGTTGCTCGGGGATTTAATAACATTATACCCAAAAGTTCACTGCTGGAGTTCGCTATAGGGTTCGTTGTGTTGGGTTTCCCCTGATCCAATCTCGAGGCGTGGATCATCTAAACTTTTGTGGTTAAAGCTTGAACACAAGAACTTCTAATATGCTAGGGGATGGGAATGCAGGATTTAAAGATTTGGATGCTGGTTTTATGATTTTTCACGATGGAACCGTTCTGCATCGTTAATTTCTTTAGTTTTCGCTTGCAGATTAATTGATTATTTCGACACAATGTCTGTCGTATGCTATCTTGCATATTCAATCTTTTTGAAGATTACTAATGTGTTAGTATCCATACGGTGCAGAATTTGAAAGTGACTTCTGAGAAGTTAGTGAGAATTTGGTGTTGGATTAAAGTGATAAAAGACCAAGAAACTGTATTAAGCCTCTGAGGTCTTGGAATAATGACCGTTCCAGAAAGTGAAGAGGTGCTTAATATTCTTCTTTGCTGTTTTCTGGATTTTGAAGATCCTGCAAGTTTCTTCTTATATGGTTATAATTATTGTCAGTAAATGTTCTATTGGCACTATGCAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTAGTGATTATGATGCAAGTCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGTTTCCTCCGTCTCCATCTAAAGATATATCTTCATTCCATTCAGATGGAAATTTATTGAAGGCTGAGCGGCCATCTCCACCTAAAGATGCATCTTCTTTTAATCGCAAAGAAAATTTAATGAAGACTGAGCAGCCGATTATATCTGTGACAATAGTTTCAAGTTCTAGTGCAGTCACAAGTTCTGGATTGTCAAACAAGAATCAGGACTGTGTTTCTGACGAGAACAAAGGAAAATCTGATACTGTTTCATGTTTTGTGGATACGGTCCAGAGTGATACTGGAATGCCACGAGTCAAGTTTCAGGAACCCGGTTTGGGAGAACATGCTTGTATTAATGATTTTGTTGAACATGATGATAAATCCTTGGTAACTGAAAAACATACTGTTCATGCATCACCAGAGATCTGTGGGGGGTTGGAGTTATCGTCAACTAGCCTTGACTCTGATCCTCTTGCTGGTAACAAAGAGGAAGAAATTGATGCAAAAATGCCTGAAGAAAAGTGCAGCTCTCCAATTTGTCAAGTTGAAGGAGGAGCTGGAGTATTGGTAGGTTTGAAGGGACACATGGATTTGAAATTAGTTCCTGAAAAGAGTGACTTGAATTTCCTGAAGCAGAATTCTTTGGAACCTGTGTTGCTGGACTTTCCATTAAACAAGCAAGGAAGTAGCACCCAATGTGTCAAAGGTAACGTAGGGTCTGATTGTGATGGGTCTCTTTTGCAGTCAAACAGGGAAAAATGGGATCTAAATACCTCAATGGAGTCATGGGAGGGTTGTACTAGTGGTGATGCACCTGTAGTTCAGATATCAGGCTCTCAGACAAGTACGGCTGTTGAAGCTTATGATTGCTCATCTGAAATGGTTGAAAGTGTTAGTCCATGCGGAAAACAAACCCTTTTAGATAGTGAACATAAAGGCAACTCTATTTATGCATGCATACCATCAAAAGAGCATCTTCATTTAAGTCTCGATTCATCTTATCCGAAGCCTATGCTTGAAGAAGATCCTTATATTTCTGAATATGAATCAGATGGTAACTGGGATATAGCTGAGGCTGTTGATGATAATGATAATAATATAGAAGAAGACTATGAAGATGGGGAGGTCCGGGAAACAATGCAGGAAACTGAAATAGAGGTCCATGTATGTGAGAAAAGAGAAATTGTGCCTTTGGATCATGCTGATTGTAATGATAAAAAGATCAATTCTGTTGGATTGCCCGATCATGAATGTGTCGCTTTAGGCCCTCTGGAACAGGAAACTAAAACAGAAAATCTGGATTACAGTAGCGGAGACGATGTTCGGACTACAACTAAAAGTATATCTTGTGAGCAAGAAAATGAAGATCTTTGTGTGAAAGAATTACATGCCGTAGAGAATACTAGTAGTGAGAAGGGCGCAGGAAGAAGCCAATTGTCTCAGTATGATAAAAAGGACAACTTTGAGAGCCAGGACACTGCTGACAGAATCGTCGATGAGGAACTGATTCCTACATTTTCTCAGGGCGAGGTGGAGAATGCTATAGCAGTAGATGTAGGGCAGAATAGGGATCTAACATTGCCTACTGTAAAGGAGTCTATAAGTGGTGATGATGCGAAGGATATCAATGGAGGCACTAGAAATAGTCGGATAATTAATCTTAATCGAGCATCTACTGATTCAACTCCTTGTAAGGAAAAATCTAGTTTTGTCAGGTCAGTTTTATCACATACGGATAGAGAGGGTGTACCCAGCATGGCAGTTGAAGGAGCGAATTTGCAACATCAAGAAAGGTGATTATTAGTTACTCGGTTTCTGCTATTTGCTTTTCTTCTCAAACTTCTTTTAGCTGCTCTTATGTCTTTGCTTATATTGTCACTATTTTTCTTTTTCAGAGACGATGCATACAGTAATATTACCAAGAAAATTTCAGTAGATAGACACCAGGATCAGTCACCATGGATGAATTATAGTCATAGAAGAGGGAGAAGTACCAATAGGTTGGATAACCGATCTGGGGAATGGGATTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGGTCTTGATCAAAACCGATATAAGATTATACCTGATGGTCCATTTGGTGGTGCTAACCGTCGTGGTAGAGAATTGCTAGCGGATGAGGGACCTTTTTTTTTCCATGGACCCTCAAGGAGGAAGTCACCTGGAAGAAGACATGGGCCCTGTGTACGAGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTTTAGTCCAGATAGATGCATGGATGAAGGTGGCTCTTTTGATCGACCACATGGTGAAAAGTTCACCAGGAATTTTGCTGATGACACTGTGGATCCGATGTATCCACGACCTCAACCTCCATACGACGTAGACAGACCTTTCTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAAAACTTTTCCAAGAATTGATTCTAAATCTCCAGTAAGATCCCGAGCTCGATCTCCTACCCAATGGTTCTCTTCTAAAAGATCTGAGAGGTTTTGTGGACGTCCCGACATGACACATCGAAGATCTCCAAATTATAGGACGGACAGGATGAGATCTCCTGATCAGCGTCCTATACGTGGGCATATGCCAGGCCGAAGACAAGGATTCCATTTCCTTTCACCATCTGATGAGTTGAGGGATGTGGGTCCTGCACCTGACCATGGCCATATGAGGTCTATTGTCCCTAATAGGAATCAGACTGAAAGATTACCGCTTAGAAACAGAAGTTATGATGCTATAGATCCTCGAGGAAGGATTGAGAACGACGAACTTTTTGATGGTCCCGTACGTTCGGGTCAATTGACTGGGTACAATGGTGGGGAACCGGATGACGATGAAAGAAGATTTAATGAGAGACATGAACCTCTTCATTCTTTTAAGCATCCATTTGATGATTCTGATGGTGAGAGATTTCGAAACAACGGGGAGGATTGTTCTAGGCCTTTTAGATTTTGTGCAGAGAATGACTCAAGAATTTCATGGAAGAGAAGGTAGCTTTTGGTGAGAAAGACCGATATAAGGATCTCTACTAAGGGAGAGGAGATTGCTTTAGACTAGGCCATTCTGTTAGTTTAACTTAGATGTTTTTGTAGGTAATATTTTATGGTGCAAAGGGGGACATTTTTTTGTTCTCTTTTGTTTTATTTCATAATTGCAATTGTAAAGAATTTCATTTTCTATGCAGAAGAGTATGCTTCTCAAGTGCTCTTGTGACTTCGAACTGACTAAACCACTGAAAATTTTGTTCTCTAAAACATTAGGATCTTTCTTATGATCTTGGCAGGCAAATTCAGATGAGAAGGGAAGTATATCAAGTGGGGTTGTTTCCTTTTCTAATCTCCAAAAATGGTTTGCAATTCTTTTAAAAATTGTTATGAATGTTGTTCTTTTTACTAGAAATATATTCACTGATTTACTGTTGAATTTGACAGTTATCTTAGAGCATGGAGAAGCGCAAGGGGTGTCTTTTAAGGTATTCTTTTCTTTGACTAAGTGACATTTCTTCTCTCTTCTATGCTAAATCTAAAGTTGTTATTATTTGGTTAAGTGAACAAAATTTAATTCTGGTTAAAATACCATTTTGGACCCACTATTTAAACTCCGTTCCATTTTGGCCTATGTAGTCAGTTCTCTGTTGATTTTTTTTTTTTTTAATCTTTATTACCAGTCCATTTATAAATTGTGAAAATATATTTACACAATGTCTCTTATTGATAAAAATTAGCATTGAAGGACTACATTTTAAGATTTGTTGAAAGTGCTGAGACTTAATCAGGATGGAACCAAAATAATGTTTAATTCCCTTTTTTTTCCCCTTTTGGTAGATTCTATTTGCAGATTTTTAATGATAGAGAAGTTTCAGAAGCAATGGTGTACTGAATGAAGATGACAAACAGCTCTTAGAGATCCTCTCCTCTCTCTAGCTATAAATCTTATTATCAGGCAGATTAGAGATTGTAGAAGC
mRNA sequence
CTCCATCTCCCGAAGGTAATCTCATTCGTTGCTTCGATCTCCGTTCTTCCTGCCGATTGCGAATCTAAATATACATATTCAATGCCATCGTCGGTTTGGGTTTCTCATGATTCTGCCTTGATTCCATTGTTACTCTCTCGCGGAATCATCAGCAATAGAAGCACGTATTAGATCTGCGTTTCGAATTTGAAAGTGACTTCTGAGAAGTTAGTGAGAATTTGGTGTTGGATTAAAGTGATAAAAGACCAAGAAACTGTATTAAGCCTCTGAGGTCTTGGAATAATGACCGTTCCAGAAAGTGAAGAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTAGTGATTATGATGCAAGTCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGTTTCCTCCGTCTCCATCTAAAGATATATCTTCATTCCATTCAGATGGAAATTTATTGAAGGCTGAGCGGCCATCTCCACCTAAAGATGCATCTTCTTTTAATCGCAAAGAAAATTTAATGAAGACTGAGCAGCCGATTATATCTGTGACAATAGTTTCAAGTTCTAGTGCAGTCACAAGTTCTGGATTGTCAAACAAGAATCAGGACTGTGTTTCTGACGAGAACAAAGGAAAATCTGATACTGTTTCATGTTTTGTGGATACGGTCCAGAGTGATACTGGAATGCCACGAGTCAAGTTTCAGGAACCCGGTTTGGGAGAACATGCTTGTATTAATGATTTTGTTGAACATGATGATAAATCCTTGGTAACTGAAAAACATACTGTTCATGCATCACCAGAGATCTGTGGGGGGTTGGAGTTATCGTCAACTAGCCTTGACTCTGATCCTCTTGCTGGTAACAAAGAGGAAGAAATTGATGCAAAAATGCCTGAAGAAAAGTGCAGCTCTCCAATTTGTCAAGTTGAAGGAGGAGCTGGAGTATTGGTAGGTTTGAAGGGACACATGGATTTGAAATTAGTTCCTGAAAAGAGTGACTTGAATTTCCTGAAGCAGAATTCTTTGGAACCTGTGTTGCTGGACTTTCCATTAAACAAGCAAGGAAGTAGCACCCAATGTGTCAAAGGTAACGTAGGGTCTGATTGTGATGGGTCTCTTTTGCAGTCAAACAGGGAAAAATGGGATCTAAATACCTCAATGGAGTCATGGGAGGGTTGTACTAGTGGTGATGCACCTGTAGTTCAGATATCAGGCTCTCAGACAAGTACGGCTGTTGAAGCTTATGATTGCTCATCTGAAATGGTTGAAAGTGTTAGTCCATGCGGAAAACAAACCCTTTTAGATAGTGAACATAAAGGCAACTCTATTTATGCATGCATACCATCAAAAGAGCATCTTCATTTAAGTCTCGATTCATCTTATCCGAAGCCTATGCTTGAAGAAGATCCTTATATTTCTGAATATGAATCAGATGGTAACTGGGATATAGCTGAGGCTGTTGATGATAATGATAATAATATAGAAGAAGACTATGAAGATGGGGAGGTCCGGGAAACAATGCAGGAAACTGAAATAGAGGTCCATGTATGTGAGAAAAGAGAAATTGTGCCTTTGGATCATGCTGATTGTAATGATAAAAAGATCAATTCTGTTGGATTGCCCGATCATGAATGTGTCGCTTTAGGCCCTCTGGAACAGGAAACTAAAACAGAAAATCTGGATTACAGTAGCGGAGACGATGTTCGGACTACAACTAAAAGTATATCTTGTGAGCAAGAAAATGAAGATCTTTGTGTGAAAGAATTACATGCCGTAGAGAATACTAGTAGTGAGAAGGGCGCAGGAAGAAGCCAATTGTCTCAGTATGATAAAAAGGACAACTTTGAGAGCCAGGACACTGCTGACAGAATCGTCGATGAGGAACTGATTCCTACATTTTCTCAGGGCGAGGTGGAGAATGCTATAGCAGTAGATGTAGGGCAGAATAGGGATCTAACATTGCCTACTGTAAAGGAGTCTATAAGTGGTGATGATGCGAAGGATATCAATGGAGGCACTAGAAATAGTCGGATAATTAATCTTAATCGAGCATCTACTGATTCAACTCCTTGTAAGGAAAAATCTAGTTTTGTCAGGTCAGTTTTATCACATACGGATAGAGAGGGTGTACCCAGCATGGCAGTTGAAGGAGCGAATTTGCAACATCAAGAAAGAGACGATGCATACAGTAATATTACCAAGAAAATTTCAGTAGATAGACACCAGGATCAGTCACCATGGATGAATTATAGTCATAGAAGAGGGAGAAGTACCAATAGGTTGGATAACCGATCTGGGGAATGGGATTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGGTCTTGATCAAAACCGATATAAGATTATACCTGATGGTCCATTTGGTGGTGCTAACCGTCGTGGTAGAGAATTGCTAGCGGATGAGGGACCTTTTTTTTTCCATGGACCCTCAAGGAGGAAGTCACCTGGAAGAAGACATGGGCCCTGTGTACGAGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTTTAGTCCAGATAGATGCATGGATGAAGGTGGCTCTTTTGATCGACCACATGGTGAAAAGTTCACCAGGAATTTTGCTGATGACACTGTGGATCCGATGTATCCACGACCTCAACCTCCATACGACGTAGACAGACCTTTCTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAAAACTTTTCCAAGAATTGATTCTAAATCTCCAGTAAGATCCCGAGCTCGATCTCCTACCCAATGGTTCTCTTCTAAAAGATCTGAGAGGTTTTGTGGACGTCCCGACATGACACATCGAAGATCTCCAAATTATAGGACGGACAGGATGAGATCTCCTGATCAGCGTCCTATACGTGGGCATATGCCAGGCCGAAGACAAGGATTCCATTTCCTTTCACCATCTGATGAGTTGAGGGATGTGGGTCCTGCACCTGACCATGGCCATATGAGGTCTATTGTCCCTAATAGGAATCAGACTGAAAGATTACCGCTTAGAAACAGAAGTTATGATGCTATAGATCCTCGAGGAAGGATTGAGAACGACGAACTTTTTGATGGTCCCGTACGTTCGGGTCAATTGACTGGGTACAATGGTGGGGAACCGGATGACGATGAAAGAAGATTTAATGAGAGACATGAACCTCTTCATTCTTTTAAGCATCCATTTGATGATTCTGATGGTGAGAGATTTCGAAACAACGGGGAGGATTGTTCTAGGCCTTTTAGATTTTGTGCAGAGAATGACTCAAGAATTTCATGGAAGAGAAGGTAGCTTTTGGTGAGAAAGACCGATATAAGGATCTCTACTAAGGGAGAGGAGATTGCTTTAGACTAGGCCATTCTGTTAGTTTAACTTAGATGTTTTTGTAGGTAATATTTTATGGTGCAAAGGGGGACATTTTTTTGTTCTCTTTTGTTTTATTTCATAATTGCAATTGTAAAGAATTTCATTTTCTATGCAGAAGAGTATGCTTCTCAAGTGCTCTTGTGACTTCGAACTGACTAAACCACTGAAAATTTTGTTCTCTAAAACATTAGGATCTTTCTTATGATCTTGGCAGGCAAATTCAGATGAGAAGGGAAGTATATCAAGTGGGGTTGTTTCCTTTTCTAATCTCCAAAAATGGTTTGCAATTCTTTTAAAAATTGTTATGAATGTTGTTCTTTTTACTAGAAATATATTCACTGATTTACTGTTGAATTTGACAGTTATCTTAGAGCATGGAGAAGCGCAAGGGGTGTCTTTTAAGATTCTATTTGCAGATTTTTAATGATAGAGAAGTTTCAGAAGCAATGGTGTACTGAATGAAGATGACAAACAGCTCTTAGAGATCCTCTCCTCTCTCTAGCTATAAATCTTATTATCAGGCAGATTAGAGATTGTAGAAGC
Coding sequence (CDS)
ATGACCGTTCCAGAAAGTGAAGAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTAGTGATTATGATGCAAGTCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGTTTCCTCCGTCTCCATCTAAAGATATATCTTCATTCCATTCAGATGGAAATTTATTGAAGGCTGAGCGGCCATCTCCACCTAAAGATGCATCTTCTTTTAATCGCAAAGAAAATTTAATGAAGACTGAGCAGCCGATTATATCTGTGACAATAGTTTCAAGTTCTAGTGCAGTCACAAGTTCTGGATTGTCAAACAAGAATCAGGACTGTGTTTCTGACGAGAACAAAGGAAAATCTGATACTGTTTCATGTTTTGTGGATACGGTCCAGAGTGATACTGGAATGCCACGAGTCAAGTTTCAGGAACCCGGTTTGGGAGAACATGCTTGTATTAATGATTTTGTTGAACATGATGATAAATCCTTGGTAACTGAAAAACATACTGTTCATGCATCACCAGAGATCTGTGGGGGGTTGGAGTTATCGTCAACTAGCCTTGACTCTGATCCTCTTGCTGGTAACAAAGAGGAAGAAATTGATGCAAAAATGCCTGAAGAAAAGTGCAGCTCTCCAATTTGTCAAGTTGAAGGAGGAGCTGGAGTATTGGTAGGTTTGAAGGGACACATGGATTTGAAATTAGTTCCTGAAAAGAGTGACTTGAATTTCCTGAAGCAGAATTCTTTGGAACCTGTGTTGCTGGACTTTCCATTAAACAAGCAAGGAAGTAGCACCCAATGTGTCAAAGGTAACGTAGGGTCTGATTGTGATGGGTCTCTTTTGCAGTCAAACAGGGAAAAATGGGATCTAAATACCTCAATGGAGTCATGGGAGGGTTGTACTAGTGGTGATGCACCTGTAGTTCAGATATCAGGCTCTCAGACAAGTACGGCTGTTGAAGCTTATGATTGCTCATCTGAAATGGTTGAAAGTGTTAGTCCATGCGGAAAACAAACCCTTTTAGATAGTGAACATAAAGGCAACTCTATTTATGCATGCATACCATCAAAAGAGCATCTTCATTTAAGTCTCGATTCATCTTATCCGAAGCCTATGCTTGAAGAAGATCCTTATATTTCTGAATATGAATCAGATGGTAACTGGGATATAGCTGAGGCTGTTGATGATAATGATAATAATATAGAAGAAGACTATGAAGATGGGGAGGTCCGGGAAACAATGCAGGAAACTGAAATAGAGGTCCATGTATGTGAGAAAAGAGAAATTGTGCCTTTGGATCATGCTGATTGTAATGATAAAAAGATCAATTCTGTTGGATTGCCCGATCATGAATGTGTCGCTTTAGGCCCTCTGGAACAGGAAACTAAAACAGAAAATCTGGATTACAGTAGCGGAGACGATGTTCGGACTACAACTAAAAGTATATCTTGTGAGCAAGAAAATGAAGATCTTTGTGTGAAAGAATTACATGCCGTAGAGAATACTAGTAGTGAGAAGGGCGCAGGAAGAAGCCAATTGTCTCAGTATGATAAAAAGGACAACTTTGAGAGCCAGGACACTGCTGACAGAATCGTCGATGAGGAACTGATTCCTACATTTTCTCAGGGCGAGGTGGAGAATGCTATAGCAGTAGATGTAGGGCAGAATAGGGATCTAACATTGCCTACTGTAAAGGAGTCTATAAGTGGTGATGATGCGAAGGATATCAATGGAGGCACTAGAAATAGTCGGATAATTAATCTTAATCGAGCATCTACTGATTCAACTCCTTGTAAGGAAAAATCTAGTTTTGTCAGGTCAGTTTTATCACATACGGATAGAGAGGGTGTACCCAGCATGGCAGTTGAAGGAGCGAATTTGCAACATCAAGAAAGAGACGATGCATACAGTAATATTACCAAGAAAATTTCAGTAGATAGACACCAGGATCAGTCACCATGGATGAATTATAGTCATAGAAGAGGGAGAAGTACCAATAGGTTGGATAACCGATCTGGGGAATGGGATTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGGTCTTGATCAAAACCGATATAAGATTATACCTGATGGTCCATTTGGTGGTGCTAACCGTCGTGGTAGAGAATTGCTAGCGGATGAGGGACCTTTTTTTTTCCATGGACCCTCAAGGAGGAAGTCACCTGGAAGAAGACATGGGCCCTGTGTACGAGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTTTAGTCCAGATAGATGCATGGATGAAGGTGGCTCTTTTGATCGACCACATGGTGAAAAGTTCACCAGGAATTTTGCTGATGACACTGTGGATCCGATGTATCCACGACCTCAACCTCCATACGACGTAGACAGACCTTTCTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAAAACTTTTCCAAGAATTGATTCTAAATCTCCAGTAAGATCCCGAGCTCGATCTCCTACCCAATGGTTCTCTTCTAAAAGATCTGAGAGGTTTTGTGGACGTCCCGACATGACACATCGAAGATCTCCAAATTATAGGACGGACAGGATGAGATCTCCTGATCAGCGTCCTATACGTGGGCATATGCCAGGCCGAAGACAAGGATTCCATTTCCTTTCACCATCTGATGAGTTGAGGGATGTGGGTCCTGCACCTGACCATGGCCATATGAGGTCTATTGTCCCTAATAGGAATCAGACTGAAAGATTACCGCTTAGAAACAGAAGTTATGATGCTATAGATCCTCGAGGAAGGATTGAGAACGACGAACTTTTTGATGGTCCCGTACGTTCGGGTCAATTGACTGGGTACAATGGTGGGGAACCGGATGACGATGAAAGAAGATTTAATGAGAGACATGAACCTCTTCATTCTTTTAAGCATCCATTTGATGATTCTGATGGTGAGAGATTTCGAAACAACGGGGAGGATTGTTCTAGGCCTTTTAGATTTTGTGCAGAGAATGACTCAAGAATTTCATGGAAGAGAAGGTAG
Protein sequence
MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQFPPSPSKDISSFHSDGNLLKAERPSPPKDASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDCVSDENKGKSDTVSCFVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVHASPEICGGLELSSTSLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMDLKLVPEKSDLNFLKQNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLNTSMESWEGCTSGDAPVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIYACIPSKEHLHLSLDSSYPKPMLEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETEIEVHVCEKREIVPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDYSSGDDVRTTTKSISCEQENEDLCVKELHAVENTSSEKGAGRSQLSQYDKKDNFESQDTADRIVDEELIPTFSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKDINGGTRNSRIINLNRASTDSTPCKEKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDAYSNITKKISVDRHQDQSPWMNYSHRRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRYKIIPDGPFGGANRRGRELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPDRCMDEGGSFDRPHGEKFTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSPTQWFSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPIRGHMPGRRQGFHFLSPSDELRDVGPAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGEPDDDERRFNERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR
Homology
BLAST of ClCG09G020330.1 vs. NCBI nr
Match:
XP_038889581.1 (uncharacterized protein LOC120079459 isoform X2 [Benincasa hispida] >XP_038889582.1 uncharacterized protein LOC120079459 isoform X2 [Benincasa hispida])
HSP 1 Score: 1770.0 bits (4583), Expect = 0.0e+00
Identity = 882/1009 (87.41%), Postives = 928/1009 (91.97%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQFPPSPSKDISSFHSDGNLLKAERP 60
MTVPESEEVGFK I LSASDYDASLPIKKRRF VVQFPPSPSKD SSFHSDGNLLKAERP
Sbjct: 1 MTVPESEEVGFKPIALSASDYDASLPIKKRRFLVVQFPPSPSKDTSSFHSDGNLLKAERP 60
Query: 61 SPPKDASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDCVSDENKGKSDTVSC 120
SP KD SS ENL+KTEQP +SVTIVSSSSAVTSS LSNKNQDCVSD+NKGK D+ SC
Sbjct: 61 SPSKDLSSLTHNENLIKTEQPTLSVTIVSSSSAVTSSALSNKNQDCVSDKNKGKPDSDSC 120
Query: 121 FVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVHASPEICGGLELSST 180
VD V++D G P VKFQEP +G HACIN VE++ KSLV KHTVHASPEICGGL+ SST
Sbjct: 121 CVDIVRNDIGTPGVKFQEPSVGGHACINGSVEYEGKSLVLVKHTVHASPEICGGLKSSST 180
Query: 181 SLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMDLKLVPEKSDLNFLK 240
SL+SDPLAGNKEEEID K PEEKCS PICQV GGAGV VGLKGHMD KLVPE+SDLNFLK
Sbjct: 181 SLNSDPLAGNKEEEIDVKTPEEKCSPPICQVGGGAGVSVGLKGHMDPKLVPEESDLNFLK 240
Query: 241 QNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLNTSMESWEGCTSGDA 300
NSLEPVLLDFPLNKQGSSTQCVKGNV SD DGSLLQSNREKWDLNTSMESWEGCT GDA
Sbjct: 241 HNSLEPVLLDFPLNKQGSSTQCVKGNVASDGDGSLLQSNREKWDLNTSMESWEGCTIGDA 300
Query: 301 PVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIYACIPSKEHLHLSLD 360
PVVQIS +QT+ A+E Y CSSEMVE VSPCGKQTLLDSEHKGNSIY+CIPSKEHLHLSLD
Sbjct: 301 PVVQISATQTNMAIETYACSSEMVEIVSPCGKQTLLDSEHKGNSIYSCIPSKEHLHLSLD 360
Query: 361 SSYPKPMLEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETEIEVHVCE 420
SSY +P LEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETE+EVHV E
Sbjct: 361 SSYQQPTLEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETEVEVHVYE 420
Query: 421 KREIVPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDYSSGDDVRTTTKSISCE 480
KREI PLDHA C+DKKINSVGLPDHE ALGPLEQE K ENLDY S DDV+ TKS S E
Sbjct: 421 KREIEPLDHACCSDKKINSVGLPDHEFFALGPLEQEAKPENLDYRSEDDVQAMTKSKSRE 480
Query: 481 QENEDLCVKELHAVENTSSEKGAGRSQLSQYDKKDNFESQDTADRIVDEELIPTFSQGEV 540
Q +EDLCVKELHAVENT SEK AGR+QLSQYDK+DNF DTAD+I+DEELIPTFSQGEV
Sbjct: 481 QVHEDLCVKELHAVENTISEKAAGRTQLSQYDKRDNFVGLDTADKIIDEELIPTFSQGEV 540
Query: 541 ENAIAVDVGQNRDLTLPTVKESISGDDAKDINGGTRNSRIINLNRASTDSTPCKEKSSFV 600
ENA+AVDV Q+RDLTLPTVKES++GDDAKDIN GTRNSRIINLNR STDST CK KSSFV
Sbjct: 541 ENAVAVDVVQSRDLTLPTVKESVNGDDAKDINEGTRNSRIINLNRTSTDSTSCKAKSSFV 600
Query: 601 RSVLSHTDREGVPSMAVEGANLQHQERDDAYSNITKKISVDRHQDQSPWMNYSHRRGRST 660
RS LSHTDRE VP+MAVEGA++Q QERDDAYSNITKKISVDRHQ QSPWMN+SHRRGRST
Sbjct: 601 RSGLSHTDREFVPNMAVEGADVQPQERDDAYSNITKKISVDRHQGQSPWMNFSHRRGRST 660
Query: 661 NRLDNRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRYKIIPDGPFGGANRRGRELLADE 720
NRLDNRS EWDFGPNFSPET++DQ+IDYHVPGLDQNRYKIIPDGPFGGAN RGRELL DE
Sbjct: 661 NRLDNRSEEWDFGPNFSPETFTDQRIDYHVPGLDQNRYKIIPDGPFGGANHRGRELLEDE 720
Query: 721 GPFFFHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPDRCMDEGGSFDRPHGEKFTRNFA 780
GPFFFHGPSRRKSPGRRHGP VRGGKMVNRMPRDFSP RCMDEGGSFDR HGEKFTRNFA
Sbjct: 721 GPFFFHGPSRRKSPGRRHGPGVRGGKMVNRMPRDFSPSRCMDEGGSFDRQHGEKFTRNFA 780
Query: 781 DDTVDPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSPTQWFSSKRS 840
DDT+DPMY RPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSP+QWFSSKRS
Sbjct: 781 DDTMDPMYARPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSPSQWFSSKRS 840
Query: 841 ERFCGRPDMTHRRSPNYRTDRMRSPDQRPIRGHMPGRRQGFHFLSPSDELRDVGPAPDHG 900
+RFCGRP++THRRSPNYRTDRMRSPDQRPIRG++PGRRQGFHFLSPSDELRDVGPAPDHG
Sbjct: 841 DRFCGRPELTHRRSPNYRTDRMRSPDQRPIRGYVPGRRQGFHFLSPSDELRDVGPAPDHG 900
Query: 901 HMRSIVPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGEPDDDERR 960
MRSI+PNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGE DDDERR
Sbjct: 901 PMRSIIPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGELDDDERR 960
Query: 961 FNERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1010
F+ERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFR+CAENDSRISWKRR
Sbjct: 961 FHERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFRYCAENDSRISWKRR 1009
BLAST of ClCG09G020330.1 vs. NCBI nr
Match:
XP_038889579.1 (uncharacterized protein LOC120079459 isoform X1 [Benincasa hispida] >XP_038889580.1 uncharacterized protein LOC120079459 isoform X1 [Benincasa hispida])
HSP 1 Score: 1755.3 bits (4545), Expect = 0.0e+00
Identity = 874/1005 (86.97%), Postives = 923/1005 (91.84%), Query Frame = 0
Query: 5 ESEEVGFKRIGLSASDYDASLPIKKRRFPVVQFPPSPSKDISSFHSDGNLLKAERPSPPK 64
++ +VGFK I LSASDYDASLPIKKRRF VVQFPPSPSKD SSFHSDGNLLKAERPSP K
Sbjct: 12 DTMQVGFKPIALSASDYDASLPIKKRRFLVVQFPPSPSKDTSSFHSDGNLLKAERPSPSK 71
Query: 65 DASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDCVSDENKGKSDTVSCFVDT 124
D SS ENL+KTEQP +SVTIVSSSSAVTSS LSNKNQDCVSD+NKGK D+ SC VD
Sbjct: 72 DLSSLTHNENLIKTEQPTLSVTIVSSSSAVTSSALSNKNQDCVSDKNKGKPDSDSCCVDI 131
Query: 125 VQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVHASPEICGGLELSSTSLDS 184
V++D G P VKFQEP +G HACIN VE++ KSLV KHTVHASPEICGGL+ SSTSL+S
Sbjct: 132 VRNDIGTPGVKFQEPSVGGHACINGSVEYEGKSLVLVKHTVHASPEICGGLKSSSTSLNS 191
Query: 185 DPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMDLKLVPEKSDLNFLKQNSL 244
DPLAGNKEEEID K PEEKCS PICQV GGAGV VGLKGHMD KLVPE+SDLNFLK NSL
Sbjct: 192 DPLAGNKEEEIDVKTPEEKCSPPICQVGGGAGVSVGLKGHMDPKLVPEESDLNFLKHNSL 251
Query: 245 EPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLNTSMESWEGCTSGDAPVVQ 304
EPVLLDFPLNKQGSSTQCVKGNV SD DGSLLQSNREKWDLNTSMESWEGCT GDAPVVQ
Sbjct: 252 EPVLLDFPLNKQGSSTQCVKGNVASDGDGSLLQSNREKWDLNTSMESWEGCTIGDAPVVQ 311
Query: 305 ISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIYACIPSKEHLHLSLDSSYP 364
IS +QT+ A+E Y CSSEMVE VSPCGKQTLLDSEHKGNSIY+CIPSKEHLHLSLDSSY
Sbjct: 312 ISATQTNMAIETYACSSEMVEIVSPCGKQTLLDSEHKGNSIYSCIPSKEHLHLSLDSSYQ 371
Query: 365 KPMLEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETEIEVHVCEKREI 424
+P LEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETE+EVHV EKREI
Sbjct: 372 QPTLEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETEVEVHVYEKREI 431
Query: 425 VPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDYSSGDDVRTTTKSISCEQENE 484
PLDHA C+DKKINSVGLPDHE ALGPLEQE K ENLDY S DDV+ TKS S EQ +E
Sbjct: 432 EPLDHACCSDKKINSVGLPDHEFFALGPLEQEAKPENLDYRSEDDVQAMTKSKSREQVHE 491
Query: 485 DLCVKELHAVENTSSEKGAGRSQLSQYDKKDNFESQDTADRIVDEELIPTFSQGEVENAI 544
DLCVKELHAVENT SEK AGR+QLSQYDK+DNF DTAD+I+DEELIPTFSQGEVENA+
Sbjct: 492 DLCVKELHAVENTISEKAAGRTQLSQYDKRDNFVGLDTADKIIDEELIPTFSQGEVENAV 551
Query: 545 AVDVGQNRDLTLPTVKESISGDDAKDINGGTRNSRIINLNRASTDSTPCKEKSSFVRSVL 604
AVDV Q+RDLTLPTVKES++GDDAKDIN GTRNSRIINLNR STDST CK KSSFVRS L
Sbjct: 552 AVDVVQSRDLTLPTVKESVNGDDAKDINEGTRNSRIINLNRTSTDSTSCKAKSSFVRSGL 611
Query: 605 SHTDREGVPSMAVEGANLQHQERDDAYSNITKKISVDRHQDQSPWMNYSHRRGRSTNRLD 664
SHTDRE VP+MAVEGA++Q QERDDAYSNITKKISVDRHQ QSPWMN+SHRRGRSTNRLD
Sbjct: 612 SHTDREFVPNMAVEGADVQPQERDDAYSNITKKISVDRHQGQSPWMNFSHRRGRSTNRLD 671
Query: 665 NRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRYKIIPDGPFGGANRRGRELLADEGPFF 724
NRS EWDFGPNFSPET++DQ+IDYHVPGLDQNRYKIIPDGPFGGAN RGRELL DEGPFF
Sbjct: 672 NRSEEWDFGPNFSPETFTDQRIDYHVPGLDQNRYKIIPDGPFGGANHRGRELLEDEGPFF 731
Query: 725 FHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPDRCMDEGGSFDRPHGEKFTRNFADDTV 784
FHGPSRRKSPGRRHGP VRGGKMVNRMPRDFSP RCMDEGGSFDR HGEKFTRNFADDT+
Sbjct: 732 FHGPSRRKSPGRRHGPGVRGGKMVNRMPRDFSPSRCMDEGGSFDRQHGEKFTRNFADDTM 791
Query: 785 DPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSPTQWFSSKRSERFC 844
DPMY RPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSP+QWFSSKRS+RFC
Sbjct: 792 DPMYARPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSPSQWFSSKRSDRFC 851
Query: 845 GRPDMTHRRSPNYRTDRMRSPDQRPIRGHMPGRRQGFHFLSPSDELRDVGPAPDHGHMRS 904
GRP++THRRSPNYRTDRMRSPDQRPIRG++PGRRQGFHFLSPSDELRDVGPAPDHG MRS
Sbjct: 852 GRPELTHRRSPNYRTDRMRSPDQRPIRGYVPGRRQGFHFLSPSDELRDVGPAPDHGPMRS 911
Query: 905 IVPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGEPDDDERRFNER 964
I+PNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGE DDDERRF+ER
Sbjct: 912 IIPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGELDDDERRFHER 971
Query: 965 HEPLHSFKHPFDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1010
HEPLHSFKHPFDDSDGERFRNNGEDCSRPFR+CAENDSRISWKRR
Sbjct: 972 HEPLHSFKHPFDDSDGERFRNNGEDCSRPFRYCAENDSRISWKRR 1016
BLAST of ClCG09G020330.1 vs. NCBI nr
Match:
XP_031742263.1 (uncharacterized protein LOC101204083 [Cucumis sativus])
HSP 1 Score: 1620.1 bits (4194), Expect = 0.0e+00
Identity = 822/1028 (79.96%), Postives = 885/1028 (86.09%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQFPPSPSKDISSFHSDGNLLKAERP 60
MT+ ESEEVGFKRIGLSASDY+A++PIKKRRFP VQ PSPSKDISSFHSDGNLLK E+P
Sbjct: 1 MTIAESEEVGFKRIGLSASDYEANIPIKKRRFPGVQLTPSPSKDISSFHSDGNLLKVEQP 60
Query: 61 SPPKDASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDCVSDENKGKSDTVSC 120
SPPKD SSFN ENL+K+E+PI+SVT VSSSS VTS LSN NQD VS+E KGKSDT SC
Sbjct: 61 SPPKDVSSFNHNENLIKSEEPILSVTTVSSSSVVTSCALSNNNQDSVSEEKKGKSDTDSC 120
Query: 121 FVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVHASPEICGGLELSST 180
VD VQS+ G VKFQEP LG HAC + FVE + KSLVT +HT HASP IC GL+L ST
Sbjct: 121 CVDIVQSNIGAAGVKFQEPSLGRHACTDGFVECEGKSLVTVEHTDHASPVICAGLKLLST 180
Query: 181 SLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMDLKLVPEKSDLNFLK 240
SLDSD AGNKEEEID KMPEE CS PICQ+ GGAGVLVGLKGHMDLKLV EKSDLNFLK
Sbjct: 181 SLDSDHFAGNKEEEIDVKMPEENCSPPICQL-GGAGVLVGLKGHMDLKLVSEKSDLNFLK 240
Query: 241 QNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLNTSMESWEGCTSGDA 300
QNS+EPVLL+F LNKQGSSTQCVKGNVG DCDGS LQSNREKWDLNTSMESWEGCTSGDA
Sbjct: 241 QNSMEPVLLNFALNKQGSSTQCVKGNVGFDCDGSFLQSNREKWDLNTSMESWEGCTSGDA 300
Query: 301 PVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIYACIPSKEHLHLSLD 360
PVVQIS ++T+T +E Y CSSEMVES SPCGKQTLLD+E KG+S +KEHLHLSLD
Sbjct: 301 PVVQISATRTNTTIETYSCSSEMVESDSPCGKQTLLDNEDKGDS------TKEHLHLSLD 360
Query: 361 SSYPKPMLEEDPYISEYESDGNWDIAEAV-----------DDNDNNIEEDYEDGEVRETM 420
SSY K +L+EDPYISEYESDGNWDIAE V DDNDNN+EEDYEDGEVRETM
Sbjct: 361 SSYLKSVLDEDPYISEYESDGNWDIAETVDDNDDNDDNDNDDNDNNVEEDYEDGEVRETM 420
Query: 421 QETEIEVHVCEKREIVPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDYSS--G 480
QETE+EVHV EKREI PLDHA CNDKKINSVGL DHE LGP +QETK ENLDY S
Sbjct: 421 QETEVEVHVYEKREIEPLDHAGCNDKKINSVGLLDHEFFTLGPKKQETKLENLDYRSEDE 480
Query: 481 DDVRTTTKSISCEQENEDLCVKELHAVENTSSE------KGAGRSQLSQYDKKDNFESQD 540
D+V+TTTKS S EQENEDLCVKELHAVEN E K RSQLSQYDKK NFE Q
Sbjct: 481 DEVQTTTKSNSYEQENEDLCVKELHAVENAIGEDVNISAKATERSQLSQYDKKGNFEGQG 540
Query: 541 TADRIVDEELIPTFSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKDINGGTRNSRII 600
TAD+I++EE +PTFSQ EVENA+AVDV QNRDLTLPTVKES++ D+AKDINGGTRNSRII
Sbjct: 541 TADKILNEEPVPTFSQNEVENAVAVDVVQNRDLTLPTVKESVNEDNAKDINGGTRNSRII 600
Query: 601 NLNRASTDSTPCKEKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDAYSNITKKISVD 660
N NR STDSTPCK KS+F + VLSH DRE VP+M VE AN++ QERDD YSNI+KKIS+D
Sbjct: 601 NFNRTSTDSTPCKAKSNFAKPVLSHKDREFVPNMVVERANMKPQERDDVYSNISKKISID 660
Query: 661 RHQDQSPWMNYSHRRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRYKII 720
+ Q P M +SHRRGR+TNRLDNRS EWDFGPNFSPETYS+QQIDYHV GLDQNRYKII
Sbjct: 661 KRQGPPPLMGFSHRRGRNTNRLDNRSEEWDFGPNFSPETYSEQQIDYHVTGLDQNRYKII 720
Query: 721 PDGPFGGANRRGRELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPDRCM 780
PDGPFGGANRRGREL+ DE PFFFHGPSRRKSPGRRHG VRGGKMVNRMPRDFSP RCM
Sbjct: 721 PDGPFGGANRRGRELVEDEEPFFFHGPSRRKSPGRRHGHSVRGGKMVNRMPRDFSPGRCM 780
Query: 781 DEGGSFDRPHGEKFTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRIDSK 840
DEGGSFDR HGEKFTRNFADDTVD MYPRPQPPYDVDRPFFRERRNFSFQRKTFP+IDSK
Sbjct: 781 DEGGSFDRQHGEKFTRNFADDTVDEMYPRPQPPYDVDRPFFRERRNFSFQRKTFPKIDSK 840
Query: 841 SPVRSRARSPTQWFSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPIRGHMPGRRQGF 900
SPVRSRARSP+QWFSSKRS+RFC RP+MTHRRSPNY TDRMRSPDQR IRG+MPG+RQGF
Sbjct: 841 SPVRSRARSPSQWFSSKRSDRFCERPNMTHRRSPNYMTDRMRSPDQRSIRGYMPGQRQGF 900
Query: 901 HFLSPSDELRDVGPAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPV 960
+LSP DELRDVGPAPDHGHMR +PNRNQT+RLPLRNRSYDAIDPRGRIEND LF GPV
Sbjct: 901 RYLSPPDELRDVGPAPDHGHMRPFIPNRNQTKRLPLRNRSYDAIDPRGRIENDGLFYGPV 960
Query: 961 RSGQLTGYNGGEPDDDERRFNERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFRFCAEND 1010
R GQLTGYNGGEPDDDERRFNERHEPLHSFKH F DSDGER+RN GEDCSRPFRFCAE+D
Sbjct: 961 RLGQLTGYNGGEPDDDERRFNERHEPLHSFKHGFRDSDGERYRNKGEDCSRPFRFCAEDD 1020
BLAST of ClCG09G020330.1 vs. NCBI nr
Match:
TYK05746.1 (uncharacterized protein E5676_scaffold98G002340 [Cucumis melo var. makuwa])
HSP 1 Score: 1603.2 bits (4150), Expect = 0.0e+00
Identity = 831/1101 (75.48%), Postives = 893/1101 (81.11%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQF----------------------- 60
MT+ ESEEVGFKR GLSASDYDA LPIKKRRFPVVQF
Sbjct: 1 MTLVESEEVGFKRTGLSASDYDAILPIKKRRFPVVQFPPSPSKDLPLSPSKDLPPSPSKD 60
Query: 61 -----------------PPSPSKDISSFHSDGNLLKAERPSPPK---------------- 120
PPSPSKD+ FHSDGNLLKAE+PSPPK
Sbjct: 61 LPPSPSKDLPPSPSKNLPPSPSKDLPPFHSDGNLLKAEQPSPPKDLSSLNRNESLIKTEQ 120
Query: 121 --------------------------DASSFNRKENLMKTEQPIISVTIVSSSSAVTSSG 180
D SSFNR ENL+KTEQPI+S TIVSSSS VTSS
Sbjct: 121 PSPPKEPSSLNRNESLIKTEQPSPSEDLSSFNRNENLIKTEQPILSRTIVSSSSVVTSSA 180
Query: 181 LSNKNQDCVSDENKGKSDTVSCFVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSL 240
L N NQD VS+E KGKSD+ SC VD VQSD G VKFQEP L HA IN F E++ KSL
Sbjct: 181 LLNNNQDNVSEEKKGKSDSDSCCVDLVQSDIGTAGVKFQEPNLRVHAYINCFDEYEGKSL 240
Query: 241 VTEKHTVHASPEICGGLELSSTSLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVL 300
VT KHT+ SPEI GG LSSTSLDSDPLA NKEEEID KMPEE CS PIC+V GGAGV
Sbjct: 241 VTVKHTIRTSPEIYGGSNLSSTSLDSDPLADNKEEEIDVKMPEENCSPPICEVGGGAGVS 300
Query: 301 VGLKGHMDLKLVPEKSDLNFLKQNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQS 360
VGL HMDLKLVPEKSDLNFLKQ+S+EPVLLDF LNK GSSTQCVK NVGSDCDG LLQ
Sbjct: 301 VGLNCHMDLKLVPEKSDLNFLKQDSVEPVLLDFSLNKHGSSTQCVKDNVGSDCDGPLLQL 360
Query: 361 NREKWDLNTSMESWEGCTSGDAPVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDS 420
NREKWDLNTSMESWEGCTSGD+PV ++S ++T+T +E Y CSSEMVES SPCGKQTLLDS
Sbjct: 361 NREKWDLNTSMESWEGCTSGDSPVAKMSATKTNTTIETYACSSEMVESDSPCGKQTLLDS 420
Query: 421 EHKGNSIYACIPSKEHLHLSLDSSYPKPMLEEDPYISEYESDGNWDIAEAV--DDNDNNI 480
E K NSIYAC+PSK HLHLSLDSSY KP++EEDPYISEYESDGNWDIAEAV DDNDN++
Sbjct: 421 EDKDNSIYACMPSKGHLHLSLDSSYLKPVVEEDPYISEYESDGNWDIAEAVDDDDNDNHL 480
Query: 481 EEDYEDGEVRETMQETEIEVHVCEKREIVPLDHADCNDKKINSVGLPDHECVALGPLEQE 540
EEDYEDGEVRET+QETE+EVH EKREI PLDHA C+DKKIN++ LPDHE +ALGPLEQE
Sbjct: 481 EEDYEDGEVRETLQETEVEVHAYEKREIEPLDHAGCDDKKINTIRLPDHELLALGPLEQE 540
Query: 541 TKTENLDYSSGDDVRTTTKSISCEQENEDLCVKELHAVENTSS------EKGAGRSQLSQ 600
TK ENLD+ S DDVRTTTKS S EQENEDLCVKELHAVEN+ S K GR QL Q
Sbjct: 541 TKPENLDHRSEDDVRTTTKSKSYEQENEDLCVKELHAVENSISGDVNRPVKATGRGQLFQ 600
Query: 601 YDKKDNFESQDTADRIVDEELIPTFSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKD 660
YDKK NFE+ DTAD IVDEELIPTFSQGE+ENA+AVDV QNRDLTLPTVKES++G+DAKD
Sbjct: 601 YDKKHNFEAHDTADEIVDEELIPTFSQGEMENAVAVDVVQNRDLTLPTVKESVNGNDAKD 660
Query: 661 INGGTRNSRIINLNRASTDSTPCKEKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDA 720
INGGTRNSRIIN NR STDSTPCKEKSSF RSVL H +RE VP+MAVEGAN+Q QERDDA
Sbjct: 661 INGGTRNSRIINFNRVSTDSTPCKEKSSFARSVLPHKEREFVPNMAVEGANMQPQERDDA 720
Query: 721 YSNITKKISVDRHQDQSPWMNYSHRRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHV 780
YSNITKKIS+D+ + Q P M +SHRRGRSTNRLDNRS EWDFGPNFSPETYS+QQIDYH
Sbjct: 721 YSNITKKISIDKREGQPPLMGFSHRRGRSTNRLDNRSEEWDFGPNFSPETYSEQQIDYHG 780
Query: 781 PGLDQNRYKIIPDGPFGGANRRGRELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNR 840
PGLDQNRYKI PDGPFGGANRRGRELL DE PFFFHGPSRRKS GRRHGP V GGKMV +
Sbjct: 781 PGLDQNRYKITPDGPFGGANRRGRELLEDEEPFFFHGPSRRKSFGRRHGPNVGGGKMVYK 840
Query: 841 MPRDFSPDRCMDEGGSFDRPHGEKFTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSF 900
+PRDFSP RCMDEGGSFDR HGEKF+RNFADDTVD MYPRPQPPYD+D+PFFRERRNFSF
Sbjct: 841 IPRDFSPGRCMDEGGSFDRQHGEKFSRNFADDTVDLMYPRPQPPYDIDKPFFRERRNFSF 900
Query: 901 QRKTFPRIDSKSPVRSRARSPTQWFSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPI 960
QRK+FPRIDSKSPVR+RARSP+QWFSSKRS+RFC R DMTHRRSPNYR++RMRSPD RPI
Sbjct: 901 QRKSFPRIDSKSPVRARARSPSQWFSSKRSDRFCERSDMTHRRSPNYRSERMRSPDHRPI 960
Query: 961 RGHM-PGRRQGFHFLSPSDELRDVGPAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRG 1010
RGHM PGRRQGFHFLS SDELRDVGPAPDHGHMRSI+P+RNQTERLPLRNRSYDAIDP+G
Sbjct: 961 RGHMPPGRRQGFHFLSASDELRDVGPAPDHGHMRSIIPDRNQTERLPLRNRSYDAIDPQG 1020
BLAST of ClCG09G020330.1 vs. NCBI nr
Match:
KAE8648526.1 (hypothetical protein Csa_009072 [Cucumis sativus])
HSP 1 Score: 1602.8 bits (4149), Expect = 0.0e+00
Identity = 818/1030 (79.42%), Postives = 885/1030 (85.92%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQFPPSPSKDISSFHSDGNLLKAERP 60
MT+ ESEEVGFKRIGLSASDY+A++PIKKRRFP VQ PSPSKDISSFHSDGNLLK E+P
Sbjct: 1 MTIAESEEVGFKRIGLSASDYEANIPIKKRRFPGVQLTPSPSKDISSFHSDGNLLKVEQP 60
Query: 61 SPPKDASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDCVSDENKGKSDTVSC 120
SPPKD SSFN ENL+K+E+PI+SVT VSSSS VTS LSN NQD VS+E KGKSDT SC
Sbjct: 61 SPPKDVSSFNHNENLIKSEEPILSVTTVSSSSVVTSCALSNNNQDSVSEEKKGKSDTDSC 120
Query: 121 FVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVHASPEICGGLELSST 180
VD VQS+ G VKFQEP LG HAC + FVE + KSLVT +HT HASP IC GL+L ST
Sbjct: 121 CVDIVQSNIGAAGVKFQEPSLGRHACTDGFVECEGKSLVTVEHTDHASPVICAGLKLLST 180
Query: 181 SLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMDLKLVPEKSDLNFLK 240
SLDSD AGNKEEEID KMPEE CS PICQ+ GGAGVLVGLKGHMDLKLV EKSDLNFLK
Sbjct: 181 SLDSDHFAGNKEEEIDVKMPEENCSPPICQL-GGAGVLVGLKGHMDLKLVSEKSDLNFLK 240
Query: 241 QNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLNTSMESWEGCTSGDA 300
QNS+EPVLL+F LNKQGSSTQCVKGNVG DCDGS LQSNREKWDLNTSMESWEGCTSGDA
Sbjct: 241 QNSMEPVLLNFALNKQGSSTQCVKGNVGFDCDGSFLQSNREKWDLNTSMESWEGCTSGDA 300
Query: 301 PVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIYACIPSKEHLHLSLD 360
PVVQIS ++T+T +E Y CSSEMVES SPCGKQTLLD+E KG+S +KEHLHLSLD
Sbjct: 301 PVVQISATRTNTTIETYSCSSEMVESDSPCGKQTLLDNEDKGDS------TKEHLHLSLD 360
Query: 361 SSYPKPMLEEDPYISEYESDGNWDIAEAV-----------DDNDNNIEEDYEDGEVRETM 420
SSY K +L+EDPYISEYESDGNWDIAE V DDNDNN+EEDYEDGEVRETM
Sbjct: 361 SSYLKSVLDEDPYISEYESDGNWDIAETVDDNDDNDDNDNDDNDNNVEEDYEDGEVRETM 420
Query: 421 QETEIEVHVCEKREIVPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDYSS--G 480
QETE+EVHV EKREI PLDHA CNDKKINSVGL DHE LGP +QETK ENLDY S
Sbjct: 421 QETEVEVHVYEKREIEPLDHAGCNDKKINSVGLLDHEFFTLGPKKQETKLENLDYRSEDE 480
Query: 481 DDVRTTTKSISCEQENEDLCVKELHAVENTSSE------KGAGRSQLSQYDKKDNFESQD 540
D+V+TTTKS S EQENEDLCVKELHAVEN E K RSQLSQYDKK NFE Q
Sbjct: 481 DEVQTTTKSNSYEQENEDLCVKELHAVENAIGEDVNISAKATERSQLSQYDKKGNFEGQG 540
Query: 541 TADRIVDEELIPTFSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKDINGGTRNSRII 600
TAD+I++EE +PTFSQ EVENA+AVDV QNRDLTLPTVKES++ D+AKDINGGTRNSRII
Sbjct: 541 TADKILNEEPVPTFSQNEVENAVAVDVVQNRDLTLPTVKESVNEDNAKDINGGTRNSRII 600
Query: 601 NLNRASTDSTPCKEKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDAYSNITKKISVD 660
N NR STDSTPCK KS+F + VLSH DRE VP+M VE AN++ QERDDAYSNITKKIS+D
Sbjct: 601 NFNRTSTDSTPCKAKSNFAKPVLSHKDREFVPNMVVERANMKPQERDDAYSNITKKISID 660
Query: 661 RHQDQSPWMNYSHRRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRYKII 720
+ + Q P M +SHRRGRS+NRLD+RS EWDFGPNFSPETYS+QQIDYHVPGLDQNRYKI
Sbjct: 661 KREGQPPLMGFSHRRGRSSNRLDHRSEEWDFGPNFSPETYSEQQIDYHVPGLDQNRYKIT 720
Query: 721 PDGPFGGANRRGRELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPDRCM 780
PDGPFGGANRRGRELL DE PFFFHGPSRRKS GRRHGP V GGKMV ++PRDFSP RCM
Sbjct: 721 PDGPFGGANRRGRELLEDEEPFFFHGPSRRKSLGRRHGPNVGGGKMVYKIPRDFSPGRCM 780
Query: 781 DEGGSFDRPHGEKFTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRIDSK 840
DEGGSFDR HGEKF+RNFADDTVD MYPRPQPPYD+D+PFFRERRNFSFQRK+FPRIDSK
Sbjct: 781 DEGGSFDRQHGEKFSRNFADDTVDLMYPRPQPPYDIDKPFFRERRNFSFQRKSFPRIDSK 840
Query: 841 SPVRSRARSPTQWFSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPIRGHM-PGRRQG 900
SPVRSRARSP QWFSSKRS+RFC R DMTHRRSPNYR++RMRSPDQRPIRGHM PGRRQG
Sbjct: 841 SPVRSRARSPGQWFSSKRSDRFCERSDMTHRRSPNYRSERMRSPDQRPIRGHMPPGRRQG 900
Query: 901 FHFLSPSDELRDVGPAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRGRIENDELFDG- 960
FHFLS SDE+RDVGPAPDHGHMRSI+P+RNQTERLPLRNRSYDAIDP+GRIEND+ F G
Sbjct: 901 FHFLSASDEMRDVGPAPDHGHMRSIIPDRNQTERLPLRNRSYDAIDPQGRIENDDFFYGP 960
Query: 961 PVRSGQLTGYNGGEPDDDERRFNERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFRFCAE 1010
PVR GQLTGYN G PDDDERRFNERHEPL+SFKHPF DSDGERFRNN EDCSRPFRFC
Sbjct: 961 PVRLGQLTGYNDGVPDDDERRFNERHEPLYSFKHPFGDSDGERFRNNREDCSRPFRFCPG 1020
BLAST of ClCG09G020330.1 vs. ExPASy TrEMBL
Match:
A0A0A0KU39 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G504130 PE=4 SV=1)
HSP 1 Score: 1622.1 bits (4199), Expect = 0.0e+00
Identity = 823/1028 (80.06%), Postives = 885/1028 (86.09%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQFPPSPSKDISSFHSDGNLLKAERP 60
MT+ ESEEVGFKRIGLSASDY+A++PIKKRRFP VQ PSPSKDISSFHSDGNLLK E+P
Sbjct: 1 MTIAESEEVGFKRIGLSASDYEANIPIKKRRFPGVQLTPSPSKDISSFHSDGNLLKVEQP 60
Query: 61 SPPKDASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDCVSDENKGKSDTVSC 120
SPPKD SSFN ENL+K+E+PI+SVT VSSSS VTS LSN NQD VS+E KGKSDT SC
Sbjct: 61 SPPKDVSSFNHNENLIKSEEPILSVTTVSSSSVVTSCALSNNNQDSVSEEKKGKSDTDSC 120
Query: 121 FVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVHASPEICGGLELSST 180
VD VQS+ G VKFQEP LG HAC + FVE + KSLVT +HT HASP IC GL+L ST
Sbjct: 121 CVDIVQSNIGAAGVKFQEPSLGRHACTDGFVECEGKSLVTVEHTDHASPVICAGLKLLST 180
Query: 181 SLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMDLKLVPEKSDLNFLK 240
SLDSD AGNKEEEID KMPEE CS PICQ+ GGAGVLVGLKGHMDLKLV EKSDLNFLK
Sbjct: 181 SLDSDHFAGNKEEEIDVKMPEENCSPPICQL-GGAGVLVGLKGHMDLKLVSEKSDLNFLK 240
Query: 241 QNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLNTSMESWEGCTSGDA 300
QNS+EPVLL+F LNKQGSSTQCVKGNVG DCDGS LQSNREKWDLNTSMESWEGCTSGDA
Sbjct: 241 QNSMEPVLLNFALNKQGSSTQCVKGNVGFDCDGSFLQSNREKWDLNTSMESWEGCTSGDA 300
Query: 301 PVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIYACIPSKEHLHLSLD 360
PVVQIS ++T+T +E Y CSSEMVES SPCGKQTLLD+E KG+S +KEHLHLSLD
Sbjct: 301 PVVQISATRTNTTIETYSCSSEMVESDSPCGKQTLLDNEDKGDS------TKEHLHLSLD 360
Query: 361 SSYPKPMLEEDPYISEYESDGNWDIAEAV-----------DDNDNNIEEDYEDGEVRETM 420
SSY K +L+EDPYISEYESDGNWDIAE V DDNDNN+EEDYEDGEVRETM
Sbjct: 361 SSYLKSVLDEDPYISEYESDGNWDIAETVDDNDDNDDNDNDDNDNNVEEDYEDGEVRETM 420
Query: 421 QETEIEVHVCEKREIVPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDYSS--G 480
QETE+EVHV EKREI PLDHA CNDKKINSVGL DHE LGP +QETK ENLDY S
Sbjct: 421 QETEVEVHVYEKREIEPLDHAGCNDKKINSVGLLDHEFFTLGPKKQETKLENLDYRSEDE 480
Query: 481 DDVRTTTKSISCEQENEDLCVKELHAVENTSSE------KGAGRSQLSQYDKKDNFESQD 540
D+V+TTTKS S EQENEDLCVKELHAVEN E K RSQLSQYDKK NFE Q
Sbjct: 481 DEVQTTTKSNSYEQENEDLCVKELHAVENAIGEDVNISAKATERSQLSQYDKKGNFEGQG 540
Query: 541 TADRIVDEELIPTFSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKDINGGTRNSRII 600
TAD+I++EE +PTFSQ EVENA+AVDV QNRDLTLPTVKES++ DDAKDINGGTRNSRII
Sbjct: 541 TADKILNEEPVPTFSQNEVENAVAVDVVQNRDLTLPTVKESVNEDDAKDINGGTRNSRII 600
Query: 601 NLNRASTDSTPCKEKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDAYSNITKKISVD 660
N NR STDSTPCK KS+F + VLSH DRE VP+M VE AN++ QERDD YSNI+KKIS+D
Sbjct: 601 NFNRTSTDSTPCKAKSNFAKPVLSHKDREFVPNMVVERANMKPQERDDVYSNISKKISID 660
Query: 661 RHQDQSPWMNYSHRRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRYKII 720
+ Q P M +SHRRGR+TNRLDNRS EWDFGPNFSPETYS+QQIDYHV GLDQNRYKII
Sbjct: 661 KRQGPPPLMGFSHRRGRNTNRLDNRSEEWDFGPNFSPETYSEQQIDYHVTGLDQNRYKII 720
Query: 721 PDGPFGGANRRGRELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPDRCM 780
PDGPFGGANRRGREL+ DE PFFFHGPSRRKSPGRRHG VRGGKMVNRMPRDFSP RCM
Sbjct: 721 PDGPFGGANRRGRELVEDEEPFFFHGPSRRKSPGRRHGHSVRGGKMVNRMPRDFSPGRCM 780
Query: 781 DEGGSFDRPHGEKFTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRIDSK 840
DEGGSFDR HGEKFTRNFADDTVD MYPRPQPPYDVDRPFFRERRNFSFQRKTFP+IDSK
Sbjct: 781 DEGGSFDRQHGEKFTRNFADDTVDEMYPRPQPPYDVDRPFFRERRNFSFQRKTFPKIDSK 840
Query: 841 SPVRSRARSPTQWFSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPIRGHMPGRRQGF 900
SPVRSRARSP+QWFSSKRS+RFC RP+MTHRRSPNY TDRMRSPDQR IRG+MPG+RQGF
Sbjct: 841 SPVRSRARSPSQWFSSKRSDRFCERPNMTHRRSPNYMTDRMRSPDQRSIRGYMPGQRQGF 900
Query: 901 HFLSPSDELRDVGPAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPV 960
+LSP DELRDVGPAPDHGHMR +PNRNQT+RLPLRNRSYDAIDPRGRIEND LF GPV
Sbjct: 901 RYLSPPDELRDVGPAPDHGHMRPFIPNRNQTKRLPLRNRSYDAIDPRGRIENDGLFYGPV 960
Query: 961 RSGQLTGYNGGEPDDDERRFNERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFRFCAEND 1010
R GQLTGYNGGEPDDDERRFNERHEPLHSFKH F DSDGER+RN GEDCSRPFRFCAE+D
Sbjct: 961 RLGQLTGYNGGEPDDDERRFNERHEPLHSFKHGFRDSDGERYRNKGEDCSRPFRFCAEDD 1020
BLAST of ClCG09G020330.1 vs. ExPASy TrEMBL
Match:
A0A5D3C5V8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G002340 PE=4 SV=1)
HSP 1 Score: 1603.2 bits (4150), Expect = 0.0e+00
Identity = 831/1101 (75.48%), Postives = 893/1101 (81.11%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQF----------------------- 60
MT+ ESEEVGFKR GLSASDYDA LPIKKRRFPVVQF
Sbjct: 1 MTLVESEEVGFKRTGLSASDYDAILPIKKRRFPVVQFPPSPSKDLPLSPSKDLPPSPSKD 60
Query: 61 -----------------PPSPSKDISSFHSDGNLLKAERPSPPK---------------- 120
PPSPSKD+ FHSDGNLLKAE+PSPPK
Sbjct: 61 LPPSPSKDLPPSPSKNLPPSPSKDLPPFHSDGNLLKAEQPSPPKDLSSLNRNESLIKTEQ 120
Query: 121 --------------------------DASSFNRKENLMKTEQPIISVTIVSSSSAVTSSG 180
D SSFNR ENL+KTEQPI+S TIVSSSS VTSS
Sbjct: 121 PSPPKEPSSLNRNESLIKTEQPSPSEDLSSFNRNENLIKTEQPILSRTIVSSSSVVTSSA 180
Query: 181 LSNKNQDCVSDENKGKSDTVSCFVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSL 240
L N NQD VS+E KGKSD+ SC VD VQSD G VKFQEP L HA IN F E++ KSL
Sbjct: 181 LLNNNQDNVSEEKKGKSDSDSCCVDLVQSDIGTAGVKFQEPNLRVHAYINCFDEYEGKSL 240
Query: 241 VTEKHTVHASPEICGGLELSSTSLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVL 300
VT KHT+ SPEI GG LSSTSLDSDPLA NKEEEID KMPEE CS PIC+V GGAGV
Sbjct: 241 VTVKHTIRTSPEIYGGSNLSSTSLDSDPLADNKEEEIDVKMPEENCSPPICEVGGGAGVS 300
Query: 301 VGLKGHMDLKLVPEKSDLNFLKQNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQS 360
VGL HMDLKLVPEKSDLNFLKQ+S+EPVLLDF LNK GSSTQCVK NVGSDCDG LLQ
Sbjct: 301 VGLNCHMDLKLVPEKSDLNFLKQDSVEPVLLDFSLNKHGSSTQCVKDNVGSDCDGPLLQL 360
Query: 361 NREKWDLNTSMESWEGCTSGDAPVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDS 420
NREKWDLNTSMESWEGCTSGD+PV ++S ++T+T +E Y CSSEMVES SPCGKQTLLDS
Sbjct: 361 NREKWDLNTSMESWEGCTSGDSPVAKMSATKTNTTIETYACSSEMVESDSPCGKQTLLDS 420
Query: 421 EHKGNSIYACIPSKEHLHLSLDSSYPKPMLEEDPYISEYESDGNWDIAEAV--DDNDNNI 480
E K NSIYAC+PSK HLHLSLDSSY KP++EEDPYISEYESDGNWDIAEAV DDNDN++
Sbjct: 421 EDKDNSIYACMPSKGHLHLSLDSSYLKPVVEEDPYISEYESDGNWDIAEAVDDDDNDNHL 480
Query: 481 EEDYEDGEVRETMQETEIEVHVCEKREIVPLDHADCNDKKINSVGLPDHECVALGPLEQE 540
EEDYEDGEVRET+QETE+EVH EKREI PLDHA C+DKKIN++ LPDHE +ALGPLEQE
Sbjct: 481 EEDYEDGEVRETLQETEVEVHAYEKREIEPLDHAGCDDKKINTIRLPDHELLALGPLEQE 540
Query: 541 TKTENLDYSSGDDVRTTTKSISCEQENEDLCVKELHAVENTSS------EKGAGRSQLSQ 600
TK ENLD+ S DDVRTTTKS S EQENEDLCVKELHAVEN+ S K GR QL Q
Sbjct: 541 TKPENLDHRSEDDVRTTTKSKSYEQENEDLCVKELHAVENSISGDVNRPVKATGRGQLFQ 600
Query: 601 YDKKDNFESQDTADRIVDEELIPTFSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKD 660
YDKK NFE+ DTAD IVDEELIPTFSQGE+ENA+AVDV QNRDLTLPTVKES++G+DAKD
Sbjct: 601 YDKKHNFEAHDTADEIVDEELIPTFSQGEMENAVAVDVVQNRDLTLPTVKESVNGNDAKD 660
Query: 661 INGGTRNSRIINLNRASTDSTPCKEKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDA 720
INGGTRNSRIIN NR STDSTPCKEKSSF RSVL H +RE VP+MAVEGAN+Q QERDDA
Sbjct: 661 INGGTRNSRIINFNRVSTDSTPCKEKSSFARSVLPHKEREFVPNMAVEGANMQPQERDDA 720
Query: 721 YSNITKKISVDRHQDQSPWMNYSHRRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHV 780
YSNITKKIS+D+ + Q P M +SHRRGRSTNRLDNRS EWDFGPNFSPETYS+QQIDYH
Sbjct: 721 YSNITKKISIDKREGQPPLMGFSHRRGRSTNRLDNRSEEWDFGPNFSPETYSEQQIDYHG 780
Query: 781 PGLDQNRYKIIPDGPFGGANRRGRELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNR 840
PGLDQNRYKI PDGPFGGANRRGRELL DE PFFFHGPSRRKS GRRHGP V GGKMV +
Sbjct: 781 PGLDQNRYKITPDGPFGGANRRGRELLEDEEPFFFHGPSRRKSFGRRHGPNVGGGKMVYK 840
Query: 841 MPRDFSPDRCMDEGGSFDRPHGEKFTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSF 900
+PRDFSP RCMDEGGSFDR HGEKF+RNFADDTVD MYPRPQPPYD+D+PFFRERRNFSF
Sbjct: 841 IPRDFSPGRCMDEGGSFDRQHGEKFSRNFADDTVDLMYPRPQPPYDIDKPFFRERRNFSF 900
Query: 901 QRKTFPRIDSKSPVRSRARSPTQWFSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPI 960
QRK+FPRIDSKSPVR+RARSP+QWFSSKRS+RFC R DMTHRRSPNYR++RMRSPD RPI
Sbjct: 901 QRKSFPRIDSKSPVRARARSPSQWFSSKRSDRFCERSDMTHRRSPNYRSERMRSPDHRPI 960
Query: 961 RGHM-PGRRQGFHFLSPSDELRDVGPAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRG 1010
RGHM PGRRQGFHFLS SDELRDVGPAPDHGHMRSI+P+RNQTERLPLRNRSYDAIDP+G
Sbjct: 961 RGHMPPGRRQGFHFLSASDELRDVGPAPDHGHMRSIIPDRNQTERLPLRNRSYDAIDPQG 1020
BLAST of ClCG09G020330.1 vs. ExPASy TrEMBL
Match:
A0A1S3CJJ9 (uncharacterized protein LOC103501674 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501674 PE=4 SV=1)
HSP 1 Score: 1601.3 bits (4145), Expect = 0.0e+00
Identity = 829/1093 (75.85%), Postives = 891/1093 (81.52%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQF----------------------- 60
MT+ ESEEVGFKR LSASDYDA LPIKKRRFPVVQF
Sbjct: 1 MTLVESEEVGFKRTELSASDYDAILPIKKRRFPVVQFPPSPSKDLPLSPSKDLPPSPSKD 60
Query: 61 ---------PPSPSKDISSFHSDGNLLKAERPSPPK------------------------ 120
PPSPSKD+ FHSDGNLLKAE+PSPPK
Sbjct: 61 LPPSPSKNLPPSPSKDLPPFHSDGNLLKAEQPSPPKDLSSLNRNESLIKTEQPSPPKEPS 120
Query: 121 ------------------DASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDC 180
D SSFNR ENL+KTEQPI+S+TIVSSSS VTSS L N NQD
Sbjct: 121 SLNRNESLIKTEQPSPSEDLSSFNRNENLIKTEQPILSMTIVSSSSVVTSSALLNNNQDN 180
Query: 181 VSDENKGKSDTVSCFVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVH 240
VS+E KGKSD+ SC VD VQSD G VKFQEP L HA IN F E+ KSLVT KHT+
Sbjct: 181 VSEEKKGKSDSDSCCVDLVQSDIGTAGVKFQEPNLRVHAYINCFDEYKGKSLVTVKHTIR 240
Query: 241 ASPEICGGLELSSTSLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMD 300
SPEI GG LSSTSLDSDPLA NKEEEID KMPEE CS PIC+V GGAGV VGL HMD
Sbjct: 241 TSPEIYGGSNLSSTSLDSDPLADNKEEEIDVKMPEENCSPPICEVGGGAGVSVGLNCHMD 300
Query: 301 LKLVPEKSDLNFLKQNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLN 360
LKLVPEKSDLNFLKQ+S+EPVLLDF LNK GSSTQCVK NVGSDCDG LLQ NREKWDLN
Sbjct: 301 LKLVPEKSDLNFLKQDSVEPVLLDFSLNKHGSSTQCVKDNVGSDCDGPLLQLNREKWDLN 360
Query: 361 TSMESWEGCTSGDAPVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIY 420
TSMESWEGCTSGD+PV ++S ++T+T +E Y CSS MVES SPCGKQTLLDSE K NSIY
Sbjct: 361 TSMESWEGCTSGDSPVAKMSATKTNTTIETYACSSAMVESDSPCGKQTLLDSEDKDNSIY 420
Query: 421 ACIPSKEHLHLSLDSSYPKPMLEEDPYISEYESDGNWDIAEAV--DDNDNNIEEDYEDGE 480
AC+PSK HLHLSLDSSY KP++EEDPYISEYESDGNWDIAEAV DDNDN++EEDYEDGE
Sbjct: 421 ACMPSKGHLHLSLDSSYLKPVVEEDPYISEYESDGNWDIAEAVDDDDNDNHLEEDYEDGE 480
Query: 481 VRETMQETEIEVHVCEKREIVPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDY 540
VRET+QETE+EVH EKREI PLDHA C+DKKIN++ LPDHE +ALGPLEQETK ENLD+
Sbjct: 481 VRETLQETEVEVHAYEKREIEPLDHAGCDDKKINTIRLPDHELLALGPLEQETKPENLDH 540
Query: 541 SSGDDVRTTTKSISCEQENEDLCVKELHAVENTSS------EKGAGRSQLSQYDKKDNFE 600
S DDVRTTTKS S EQENEDLCVKELHAVEN+ S K GR QL QYDKK NFE
Sbjct: 541 RSEDDVRTTTKSKSYEQENEDLCVKELHAVENSISGDVNRPVKATGRGQLFQYDKKHNFE 600
Query: 601 SQDTADRIVDEELIPTFSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKDINGGTRNS 660
+ DTAD IVDEELIPTFSQGE+ENA+AVDV QNRDLTLPTVKES++G+DAKDINGGTRNS
Sbjct: 601 AHDTADEIVDEELIPTFSQGEMENAVAVDVVQNRDLTLPTVKESVNGNDAKDINGGTRNS 660
Query: 661 RIINLNRASTDSTPCKEKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDAYSNITKKI 720
RIIN NR STDSTPCKEKSSF RSVL H +RE VP+MAVEGAN+Q QERDDAYSNITKKI
Sbjct: 661 RIINFNRVSTDSTPCKEKSSFARSVLPHKEREFVPNMAVEGANMQPQERDDAYSNITKKI 720
Query: 721 SVDRHQDQSPWMNYSHRRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRY 780
S+D+ + Q P M +SHRRGRSTNRLDNRS EWDFGPNFSPETYS+QQIDYH PGLDQNRY
Sbjct: 721 SIDKREGQPPLMGFSHRRGRSTNRLDNRSEEWDFGPNFSPETYSEQQIDYHGPGLDQNRY 780
Query: 781 KIIPDGPFGGANRRGRELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPD 840
KI PDGPFGGANRRGRELL DE PFFFHGPSRRKS GRRHGP V GGKMV ++PRDFSP
Sbjct: 781 KITPDGPFGGANRRGRELLEDEEPFFFHGPSRRKSFGRRHGPNVGGGKMVYKIPRDFSPG 840
Query: 841 RCMDEGGSFDRPHGEKFTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRI 900
RCMDEGGSFDR HGEKF+RNFADDTVD MYPRPQPPYD+D+PFFRERRNFSFQRK+FPRI
Sbjct: 841 RCMDEGGSFDRQHGEKFSRNFADDTVDLMYPRPQPPYDIDKPFFRERRNFSFQRKSFPRI 900
Query: 901 DSKSPVRSRARSPTQWFSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPIRGHM-PGR 960
DSKSPVR+RARSP+QWFSSKRS+RFC R DMTHRRSPNYR++RMRSPD RPIRGHM PGR
Sbjct: 901 DSKSPVRARARSPSQWFSSKRSDRFCERSDMTHRRSPNYRSERMRSPDHRPIRGHMPPGR 960
Query: 961 RQGFHFLSPSDELRDVGPAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRGRIENDELF 1010
RQGFHFLS SDELRDVGPAPDHGHMRSI+P+RNQTERLPLRNRSYDAIDP+GRIEND F
Sbjct: 961 RQGFHFLSASDELRDVGPAPDHGHMRSIIPDRNQTERLPLRNRSYDAIDPQGRIENDNFF 1020
BLAST of ClCG09G020330.1 vs. ExPASy TrEMBL
Match:
A0A5D3C339 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G002390 PE=4 SV=1)
HSP 1 Score: 1593.6 bits (4125), Expect = 0.0e+00
Identity = 809/1015 (79.70%), Postives = 873/1015 (86.01%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQFPPSPSKDISSFHSDGNLLKAERP 60
MTVPESEEV FKRIGLSASDYDA++PIKKRRF VQ PSPSKDISSFHSDG+LLK E+P
Sbjct: 1 MTVPESEEVDFKRIGLSASDYDANIPIKKRRFLGVQLTPSPSKDISSFHSDGDLLKVEQP 60
Query: 61 SPPKDASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDCVSDENKGKSDTVSC 120
SPPK SSFN ENL+K+E+PI+SV IVSSSSAVTS LSN NQD VS++ KGKSDT SC
Sbjct: 61 SPPKGVSSFNHNENLIKSEEPILSVIIVSSSSAVTSCALSNNNQDSVSEKKKGKSDTDSC 120
Query: 121 FVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVHASPEICGGLELSST 180
VD V+SDTG VKFQEPG G HAC + FVE + KS+ +HT HASP ICGGL+L ST
Sbjct: 121 CVDIVRSDTGTAGVKFQEPGFGRHACTDGFVECEGKSV---EHTDHASPVICGGLKL-ST 180
Query: 181 SLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMDLKLVPEKSDLNFLK 240
SLDSD AGNKEEEID KMPEE CS PICQ+ GGAG+ VGLKGHMDLKLVPEKSDLNFLK
Sbjct: 181 SLDSDHFAGNKEEEIDVKMPEENCSPPICQLGGGAGLSVGLKGHMDLKLVPEKSDLNFLK 240
Query: 241 QNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLNTSMESWEGCTSGDA 300
QNS+EPVLLDF LN QGSSTQCVKGNVG DCDGS LQSNREKWDLNTSMESWEGCTSGD
Sbjct: 241 QNSMEPVLLDFALNNQGSSTQCVKGNVGFDCDGSFLQSNREKWDLNTSMESWEGCTSGDD 300
Query: 301 PVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIYACIPSKEHLHLSLD 360
PVVQIS ++T+T E Y CSSEMVES SPC KQTLLDSE K +S +KEHLHLSLD
Sbjct: 301 PVVQISTTRTNTTTETYACSSEMVESDSPCRKQTLLDSEDKVDS------TKEHLHLSLD 360
Query: 361 SSYPKPMLEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETEIEVHVCE 420
SSY K +L+EDPYISEYESDGNWDIAE VDDNDNN+EEDYEDGEVRETMQE E+EVHV E
Sbjct: 361 SSYLKSVLDEDPYISEYESDGNWDIAETVDDNDNNVEEDYEDGEVRETMQENEVEVHVHE 420
Query: 421 KREIVPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDYSSGDDVRTTTKSISCE 480
KRE+ PLDHA CN++KINSVGL DHE LGP EQETK+ENLDY S D+V+TTTKS S E
Sbjct: 421 KREVEPLDHAGCNEEKINSVGLLDHEFFTLGPQEQETKSENLDYRSEDEVQTTTKSKSYE 480
Query: 481 QENEDLCVKELHAVENTSSE------KGAGRSQLSQYDKKDNFESQDTADRIVDEELIPT 540
QENEDLCVKELHAVEN SE K GR QLSQYDKK NFE Q TAD+I++EE IPT
Sbjct: 481 QENEDLCVKELHAVENAISEDVNISAKATGRIQLSQYDKKGNFEGQGTADKIINEEPIPT 540
Query: 541 FSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKDINGGTRNSRIINLNRASTDSTPCK 600
FSQ EVENA+AVDV QNRDLTLPTV ES++ DD KDINGGTRNSRIIN NR STDSTPCK
Sbjct: 541 FSQDEVENAVAVDVVQNRDLTLPTVNESVTRDDTKDINGGTRNSRIINFNRTSTDSTPCK 600
Query: 601 EKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDAYSNITKKISVDRHQDQSPWMNYSH 660
KSSFVR VLSH DRE VP+M VE AN++ QERDD YSNITKKIS+D+ Q P M +SH
Sbjct: 601 AKSSFVRPVLSHKDREFVPNMGVEEANMKPQERDDVYSNITKKISIDKRQGPPPLMGFSH 660
Query: 661 RRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRYKIIPDGPFGGANRRGR 720
RRGR TNRLDNRS EWDFG NFSPE YS+Q+IDYHV G DQNRYKIIPDGPFGGANRRGR
Sbjct: 661 RRGRYTNRLDNRSEEWDFGANFSPEIYSEQKIDYHVAGFDQNRYKIIPDGPFGGANRRGR 720
Query: 721 ELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPDRCMDEGGSFDRPHGEK 780
EL+ DE PFFFHGPSRRKSPGRRHG VRGGKMVNRMPRDFSP RCMDEGGSFDR HGEK
Sbjct: 721 ELVEDEEPFFFHGPSRRKSPGRRHGHSVRGGKMVNRMPRDFSPGRCMDEGGSFDRQHGEK 780
Query: 781 FTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSPTQW 840
FTR+FADDTVD MYPRPQPPYDVDRPFFRERRNFSFQRKTFP+IDSKSPVRSRARSP+QW
Sbjct: 781 FTRSFADDTVDGMYPRPQPPYDVDRPFFRERRNFSFQRKTFPKIDSKSPVRSRARSPSQW 840
Query: 841 FSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPIRGHMPGRRQGFHFLSPSDELRDVG 900
FSSKRS+RFC RP+MTHRRSPNY TDRMRSPDQ IRG+MPG+RQGF +LSP DELRDVG
Sbjct: 841 FSSKRSDRFCERPNMTHRRSPNYMTDRMRSPDQCSIRGYMPGQRQGFRYLSPPDELRDVG 900
Query: 901 PAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGEP 960
APDHGHMR +PNRNQT+RLPLRNRSYDAIDPRGRIE+D LF GPVR GQLTGYNGG+P
Sbjct: 901 SAPDHGHMRPFIPNRNQTKRLPLRNRSYDAIDPRGRIEDDGLFYGPVRLGQLTGYNGGKP 960
Query: 961 DDDERRFNERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1010
DDDERRFNERHEPLHSFKH F DSDG+R+RN GEDCSRPFRFCAE+D RISWKRR
Sbjct: 961 DDDERRFNERHEPLHSFKHGFRDSDGDRYRNKGEDCSRPFRFCAEDDPRISWKRR 1005
BLAST of ClCG09G020330.1 vs. ExPASy TrEMBL
Match:
A0A1S3CJI4 (uncharacterized protein LOC103501669 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501669 PE=4 SV=1)
HSP 1 Score: 1592.0 bits (4121), Expect = 0.0e+00
Identity = 809/1015 (79.70%), Postives = 872/1015 (85.91%), Query Frame = 0
Query: 1 MTVPESEEVGFKRIGLSASDYDASLPIKKRRFPVVQFPPSPSKDISSFHSDGNLLKAERP 60
MTVPESEEV FKRIGLSASDYDA++PIKKRRF VQ PSPSKDISSFHSDG+LLK E+P
Sbjct: 1 MTVPESEEVDFKRIGLSASDYDANIPIKKRRFLGVQLTPSPSKDISSFHSDGDLLKVEQP 60
Query: 61 SPPKDASSFNRKENLMKTEQPIISVTIVSSSSAVTSSGLSNKNQDCVSDENKGKSDTVSC 120
SPPK SSFN ENL+K+E+PI+SV IVSSSSAVTS LSN NQD VS++ KGKSDT SC
Sbjct: 61 SPPKGVSSFNHNENLIKSEEPILSVIIVSSSSAVTSCALSNNNQDSVSEKKKGKSDTDSC 120
Query: 121 FVDTVQSDTGMPRVKFQEPGLGEHACINDFVEHDDKSLVTEKHTVHASPEICGGLELSST 180
VD V+SDTG VKFQEPG G HAC + FVE + KS+ +HT HASP ICGGL+L ST
Sbjct: 121 CVDIVRSDTGTAGVKFQEPGFGRHACTDGFVECEGKSV---EHTDHASPVICGGLKL-ST 180
Query: 181 SLDSDPLAGNKEEEIDAKMPEEKCSSPICQVEGGAGVLVGLKGHMDLKLVPEKSDLNFLK 240
SLDSD AGNKEEEID KMPEE CS PICQ+ GG GV VGLKGHMDLKLVPEKSDLNFLK
Sbjct: 181 SLDSDHFAGNKEEEIDVKMPEENCSPPICQLGGGGGVSVGLKGHMDLKLVPEKSDLNFLK 240
Query: 241 QNSLEPVLLDFPLNKQGSSTQCVKGNVGSDCDGSLLQSNREKWDLNTSMESWEGCTSGDA 300
QNS+EPVLLDF LN QGSSTQCVKGNVG DCDGS LQSNREKWDLNTSMESWEGCTSGD
Sbjct: 241 QNSMEPVLLDFALNNQGSSTQCVKGNVGFDCDGSFLQSNREKWDLNTSMESWEGCTSGDD 300
Query: 301 PVVQISGSQTSTAVEAYDCSSEMVESVSPCGKQTLLDSEHKGNSIYACIPSKEHLHLSLD 360
PVVQIS ++T+T E Y CSSEMVES SPC KQTLLDSE K +S +KEHLHLSLD
Sbjct: 301 PVVQISTTRTNTTTETYACSSEMVESDSPCRKQTLLDSEDKVDS------TKEHLHLSLD 360
Query: 361 SSYPKPMLEEDPYISEYESDGNWDIAEAVDDNDNNIEEDYEDGEVRETMQETEIEVHVCE 420
SSY K +L+EDPYISEYESDGNWDIAE VDDNDNN+EEDYEDGEVRETMQE E+EVHV E
Sbjct: 361 SSYLKSVLDEDPYISEYESDGNWDIAETVDDNDNNVEEDYEDGEVRETMQENEVEVHVHE 420
Query: 421 KREIVPLDHADCNDKKINSVGLPDHECVALGPLEQETKTENLDYSSGDDVRTTTKSISCE 480
KREI PLDHA CN++KINSVGL DHE LGP EQETK+ENLDY S D+V+TTTKS S E
Sbjct: 421 KREIEPLDHAGCNEEKINSVGLLDHEFFTLGPQEQETKSENLDYRSEDEVQTTTKSKSYE 480
Query: 481 QENEDLCVKELHAVENTSSE------KGAGRSQLSQYDKKDNFESQDTADRIVDEELIPT 540
QENEDLCVKELHAVEN SE K GR QLSQYDKK NFE Q TAD+I++EE IPT
Sbjct: 481 QENEDLCVKELHAVENAISEDVNISAKATGRIQLSQYDKKGNFEGQGTADKIINEEPIPT 540
Query: 541 FSQGEVENAIAVDVGQNRDLTLPTVKESISGDDAKDINGGTRNSRIINLNRASTDSTPCK 600
FSQ EVENA+AVDV QNRDLTLPTV ES++ DD KDINGGTRNSRIIN NR STDSTPCK
Sbjct: 541 FSQDEVENAVAVDVVQNRDLTLPTVNESVTRDDTKDINGGTRNSRIINFNRTSTDSTPCK 600
Query: 601 EKSSFVRSVLSHTDREGVPSMAVEGANLQHQERDDAYSNITKKISVDRHQDQSPWMNYSH 660
KSSFVR VLSH DRE VP+M VE AN++ QERDD YSNITKKIS+D+ Q P M +SH
Sbjct: 601 AKSSFVRPVLSHKDREFVPNMGVEEANMKPQERDDVYSNITKKISIDKRQGPPPLMGFSH 660
Query: 661 RRGRSTNRLDNRSGEWDFGPNFSPETYSDQQIDYHVPGLDQNRYKIIPDGPFGGANRRGR 720
RRGR TNRLDNRS EWDFG NFSPE YS+QQIDYHV G D+NRYKIIPDGPFGGANRRGR
Sbjct: 661 RRGRYTNRLDNRSEEWDFGANFSPEIYSEQQIDYHVAGFDKNRYKIIPDGPFGGANRRGR 720
Query: 721 ELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKMVNRMPRDFSPDRCMDEGGSFDRPHGEK 780
EL+ DE PFFFHGPSRRKSPGRRHG VRGGKMVNRMPRDFSP RCMDEGGSFDR HGEK
Sbjct: 721 ELVEDEEPFFFHGPSRRKSPGRRHGHSVRGGKMVNRMPRDFSPGRCMDEGGSFDRQHGEK 780
Query: 781 FTRNFADDTVDPMYPRPQPPYDVDRPFFRERRNFSFQRKTFPRIDSKSPVRSRARSPTQW 840
FTR+FADDTVD MYPRPQPPYDVDRPFFRERRNFSFQRKTFP+IDSKSPVRSRARSP+QW
Sbjct: 781 FTRSFADDTVDGMYPRPQPPYDVDRPFFRERRNFSFQRKTFPKIDSKSPVRSRARSPSQW 840
Query: 841 FSSKRSERFCGRPDMTHRRSPNYRTDRMRSPDQRPIRGHMPGRRQGFHFLSPSDELRDVG 900
FSSKRS+RFC RP+MTH+RSPNY TDRMRSPDQ IRG+MPG+RQGF +LSP DELRDVG
Sbjct: 841 FSSKRSDRFCERPNMTHQRSPNYMTDRMRSPDQCSIRGYMPGQRQGFRYLSPPDELRDVG 900
Query: 901 PAPDHGHMRSIVPNRNQTERLPLRNRSYDAIDPRGRIENDELFDGPVRSGQLTGYNGGEP 960
APDHGHMR +PNRNQT+RLPLRNRSYDAIDPRGRIE+D LF GPVR GQLTGYNGG+P
Sbjct: 901 SAPDHGHMRPFIPNRNQTKRLPLRNRSYDAIDPRGRIEDDGLFYGPVRLGQLTGYNGGKP 960
Query: 961 DDDERRFNERHEPLHSFKHPFDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1010
DDDERRFNERHEPLHSFKH F DSDG+R+RN GEDCSRPFRFCAE+D RISWKRR
Sbjct: 961 DDDERRFNERHEPLHSFKHGFRDSDGDRYRNKGEDCSRPFRFCAEDDPRISWKRR 1005
BLAST of ClCG09G020330.1 vs. TAIR 10
Match:
AT5G13590.1 (unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; Bacteria - 8; Metazoa - 80; Fungi - 5; Plants - 17; Viruses - 0; Other Eukaryotes - 40 (source: NCBI BLink). )
HSP 1 Score: 84.3 bits (207), Expect = 5.9e-16
Identity = 95/332 (28.61%), Postives = 137/332 (41.27%), Query Frame = 0
Query: 688 YHVPGLDQNRYKIIPDGPFGGANRRGRELLADEGPFFFHGPSRRKSPGRRHGPCVRGGKM 747
YH + R IPD R L D H +K HG RGG
Sbjct: 779 YHGRIMRSPRLNFIPD----------RRRLPDNTESNLHDQDTKKFEFDNHGNTRRGGAF 838
Query: 748 VNRMPRDFSP--DRCMDEGGSFDR-----------------------PHGEKFTRNFADD 807
++ R P D SF R GEKFTR +
Sbjct: 839 MSNFQRGRRPANDGVTPYAHSFPRRSPSFSYNRGPTNKEDTSAFHGFRDGEKFTRGLQCN 898
Query: 808 TVDPMYPRPQPPYDVDRPFFRERRNF-SFQRKTFPRIDSKSPVRSRARS--PTQWFSSKR 867
+P++ Q PY F R R F + ++ FP S+SPVRSR RS + F ++
Sbjct: 899 NTEPLFMNHQRPYRGRSGFARGRTKFVNNPKRDFPGFRSRSPVRSRERSDGSSSSFRNRS 958
Query: 868 SERFCGRPDMTHRRSPN-YRTDRMRSPDQRPIRGHMPGRRQGFHFLS--PSDELRDVGPA 927
E F G D +HRRSP+ Y+ +RM SPD M RR S PS+ R G A
Sbjct: 959 QEEFSGHTDFSHRRSPSGYKVERMSSPDHSGYSREMVVRRHNSPPFSHRPSNAGRGRGYA 1018
Query: 928 PDHGHMRSIVPNRN------QTERLPLRNR-SYDAIDPRGRIE-NDELFDGPVRSGQLTG 981
G++R R+ ++ + RN + + +DPR R++ +D+ F+G + S +
Sbjct: 1019 RGRGYVRGRGYGRDGNSFRKPSDHVVHRNHGNMNNLDPRERVDYSDDFFEGQIHSERF-- 1078
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038889581.1 | 0.0e+00 | 87.41 | uncharacterized protein LOC120079459 isoform X2 [Benincasa hispida] >XP_03888958... | [more] |
XP_038889579.1 | 0.0e+00 | 86.97 | uncharacterized protein LOC120079459 isoform X1 [Benincasa hispida] >XP_03888958... | [more] |
XP_031742263.1 | 0.0e+00 | 79.96 | uncharacterized protein LOC101204083 [Cucumis sativus] | [more] |
TYK05746.1 | 0.0e+00 | 75.48 | uncharacterized protein E5676_scaffold98G002340 [Cucumis melo var. makuwa] | [more] |
KAE8648526.1 | 0.0e+00 | 79.42 | hypothetical protein Csa_009072 [Cucumis sativus] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0KU39 | 0.0e+00 | 80.06 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G504130 PE=4 SV=1 | [more] |
A0A5D3C5V8 | 0.0e+00 | 75.48 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3CJJ9 | 0.0e+00 | 75.85 | uncharacterized protein LOC103501674 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A5D3C339 | 0.0e+00 | 79.70 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3CJI4 | 0.0e+00 | 79.70 | uncharacterized protein LOC103501669 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
AT5G13590.1 | 5.9e-16 | 28.61 | unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; B... | [more] |
Relationships
This mRNA is a part of the following gene feature(s):
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
ClCG09G020330.1-exon | ClCG09G020330.1-exon-CG_Chr09:37372903..37373085 | exon |
ClCG09G020330.1-exon | ClCG09G020330.1-exon-CG_Chr09:37373821..37373943 | exon |
ClCG09G020330.1-exon | ClCG09G020330.1-exon-CG_Chr09:37374050..37375905 | exon |
ClCG09G020330.1-exon | ClCG09G020330.1-exon-CG_Chr09:37376008..37377635 | exon |
ClCG09G020330.1-exon | ClCG09G020330.1-exon-CG_Chr09:37377988..37378127 | exon |
The following five_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
ClCG09G020330.1-five_prime_utr | ClCG09G020330.1-five_prime_utr-CG_Chr09:37372903..37373085 | five_prime_UTR |
ClCG09G020330.1-five_prime_utr | ClCG09G020330.1-five_prime_utr-CG_Chr09:37373821..37373919 | five_prime_UTR |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
ClCG09G020330.1-cds | ClCG09G020330.1-cds-CG_Chr09:37373920..37373943 | CDS |
ClCG09G020330.1-cds | ClCG09G020330.1-cds-CG_Chr09:37374050..37375905 | CDS |
ClCG09G020330.1-cds | ClCG09G020330.1-cds-CG_Chr09:37376008..37377157 | CDS |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
ClCG09G020330.1-three_prime_utr | ClCG09G020330.1-three_prime_utr-CG_Chr09:37377158..37377635 | three_prime_UTR |
ClCG09G020330.1-three_prime_utr | ClCG09G020330.1-three_prime_utr-CG_Chr09:37377988..37378127 | three_prime_UTR |
The following polypeptide feature(s) derives from this mRNA:
Feature Name | Unique Name | Type |
ClCG09G020330.1 | ClCG09G020330.1-protein | polypeptide |