Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
CTTCAATTCGCTTCTTCCAATTCTCTCTTCGTTCTCGTCGTCTCTCGCATGGCGGACCAATAAGCATAAACTCAATTCACATTCCCTATCCGACGATAATTCGAATCCCCATTTCCCCCAACTTCTCCTCCGATTTCTTCTTCTATCTGCTCTTTTCTCCTACTCCCAACCTTATTGCATTCTTCTCAGACGCATTGCTTTGATACAGCCTACCCTTCAGCAGTTACCCCAACTCCTTTTGTTTTCCCTTCCTTCCAATTTCAATCGCTACTCTGAAGGTTAGTATTCGTCCTTAATCCCTCCGTATTGCTTATTTGCTAGGATTTACTTTGGTTTATTTCGTTTCTGAGATTTTTGTTTATGATTATAGGTCCTGGAATCAAAGTTTATGTTACGCACCATGCTCTAGGGTTTTATTTTTTGCACTTCTGTTATTGTTTTTTCCTGATGTTTTACTCCACTTTAGGGGTCTTCTGATTGAATTGTTTGCGGGGTTTCAAGTGTTGCTTCCAAATTCTGAATCTTCTTCCTCTATCGACGTAGATATGATCTTCATTCCTATTTCTACCGCCATTTACGCTGAGATGAGGGGCTTCTGACTTATTCACACTTGCTTCCAATCCGTTATTTGGGGAAGATGGATGAGGGAAAGATAGTTGCTCTTCTTCCTTCCGTTGCATTTTCTTTCGAATAGTCTTGATATCGTTGCCATTGCGTTCTGCGGCCTGGCTTCGAAACTTTGCGATTGCCACTTTTAGTGAAGGAGGTTTCAAGTAATATTGTTATTGTGAATTAGAGCTCAGTAATGGTTTCACTGTCATGCATGGATATGTTCAGTGATTATGATTTATAGGGTGGATTTTGTTGTCGGTGGAGAATAAATCTGCAATACAAGGGTCGAGTTTGCAAGTCTGTTAACGAGGATTAAATTTTTTATTTAGGTCGAGTCAAATCTTTCCTTTCCATCCTTCAAAGTAAATTGTAATATTGGCATCTACCCTGCTTGTTTCTTGTCTAACTCTTTTATTTCGTGGATGCCTTTATAAACATTTAGAGCACTTTTGTGCTCTTCGGCGGGCATTAGGAAGGACTATTACTTCTGGTGGTGTTTTGTCTTAATAATTATTCAAAGAATGTGGTAAAGGATGCAAGAAGTGATATGCTGTTATGTTTGATCATGTTTAATACTCATTATGGAAAGAAGTGAACCTACATTAGTTCCAGAGTGGTTGAGAAGTTCTGGAAGTCTTTCTGGGAGTGGAATTTCAGCTCAACAATTTGCATCGTCTTCTTCACACTCAGGTATTTTATAGTTTATTATATGTTGGAGACCAACCTATTGATTTTCAATACTATCTTATTTTATTTTAAGGATTATATGGTTGAATTGCTTTTCTTCTTTTGTGGTGCCAACTTCCCATCACTATTTCATTTACATTCAGTAATTTGGTATTACTCATTGTTTACTAGATATTTCTTCTCAAGCTCATTACTCGAGGAGTAGAACTTCAAAGAGCATTAGCGATATAGATAAACCACATTTTGATTTCTTGGATTGGTCATCTTCATCAACCTCGAGGAGGAGTTCTAGTAATGGCTCTGGAAAGAATGCTTACAGTAGTTTTAATAGGAATCATCGTGATAGAGACCGTGAGAAGGAGAAAGAAATGTCAAATCTTGGGGACCCCTGGGGTTATGACCTTTCTAGCCCTCTGGTAAACATATTTTCCAGTAGAGTTGAGAAGGAAACTTTGCGGCGTTCTCATTCAATGGTATCTAGGAAGCAAGGTGATTTATTTCCTCAAAGAGTTGCTGCTGACTTGAAAAGTGGAGGATATAATCATAAGGCTAACAGTAATGGCTTTCATTCAGGAAGTACCATTAACGGTATCACTGATAAGGCTGTTTTTGACAAGGATTTTCCATCACTTGGATCAGAAGAAAGGCAAGGAGGACCAGATGTAGGGAGAGTATCATCTCCTGGCTTGACCACATCTGTT
CAGAGCTTGCCTATTGGGAGTTCAACTTTGATTGGTCGGGAGGGATGGACATCAGCTTTGGCTGAGGTGCCAACTATTGTCACAGGCAGTTCTGCTGCTCCATCATCTGTTCAACAGACTGTTGCTGCCAATTCTGGGTTGGGGTCCCCAAATGCAACAACTCCACGCAAGATGGCTGAAGCTTTAACACAGGCACCAACAAGAGCCCGTGTTACTTCTCAGTCAACTGAGGTAGTCACTTTCCTCTAAAACGTTTGTAGAAAACATTTAATGCAGTTGATATAATGCCATTCTTTGTTTGTAGTTATCTGTCAAGACACAGAGGCTTGAGGAATTGGCTATTAAGCAGTCCAGGCAATTAATCCCAGTAACACCTTCAATGCCTAAAGTTTCTGTAAGTGTTTCTCCATTGATTGACACATTTTTTTTTGTTTTTTAATTAGTTCATGCAACAGAAATCTATGTTTTTCTATATTTCTTCTGTTTAGCTACTTTGTGTCTAATATTTTGTTTCGTATGTTCTGGCCAATTGTATTAGTGAAGAATTTGATTTAGTGATGTAAAAGTTTTTAATCCATGCACAGAAGCTTTCTTTACGAATTGTTTGCATGTTATACTTCTCATGATAACCTTAGGCCCACTCAGTTTTTGTAGAAAAAATAGGGGTAAAATAATAATAGATAAAATGTGGTCTTATAGCTAAAAAAGTAATAGCTAAAACGTGATGGCATGGTGTGAACGATTAAGTTGTTCAAATTTGGATATTCTGATGGCATAGTTGCATAGGAGGCAAATAGTTGTTATCTGTTCACCATTGGATTTACATACTTGAACGTTCATGGTCATGAGAACATGTTGGTTGCAAACTTGCAAGTGCGATGTTGTTCTTTCTAGTTAGTGTTTGAGTTGGAAAGTTTCAGTAATAAAGTGGAAATGAAACTAGTTCATTCTCATTGTTAAATATAATCATTTATGGTCTTATTTTGTAGCTGACTATGGAAAATCTCTAATTGTGACTGTGAGACCACTTCTTAGGTGAACATTTTTTTTAAAAAAAAACAATAACGTAAGACAATGGTTCTATAAGAATTGATGAAATATAAAGATATTGTCTCCAAGTTCGGCACAAAAATTGTAGATTCATAGCTAAAGCTGAGTGAACAATATTGAGAGTTGAGCCACATAAAAAAGTTATTACTCACCTCACATGTAAGCTACTAAAGAAGATATTTATATATTGGACCAAGTATTGTAAAGTTTTCAGAATTGGAGCTCAATAATGAAGCCACAAAAGAAACATTTTCTCAAAGCCGTAAAGATTCTTTGTAGGAGTCTTGGAATTTTTTTTAATTTCCTAAATGCAAATCTCCTGTAAAACAGATCAATGGATTTAGCCACGATCTCACTAAACAGACTCCCTTGTTCTCTGACTTCCTGTTCCACTGCAACCCATTGGCCAGTACTGTAGCTAGGGACCTAAGGTTTAGAGGATCCTACCATACCCTACAACGGACAGTTTCCCCGGTGTTCTTTTGATGTTTGATCGTATTCTTCTTTAATTTTGAAGCCAACCCTTCATTCTACTCTAGTCTGTCAATCCCATTTTGAGCAATACTAATTGAATTGTTTTGTGTAATCTGATACTCCGGTTTAAATTCTGGTTTAGAAATCCTTTTTTTCTGAGTCTTTTTTTAAGAGTTGAAGGGGGAACAATTGACTTCTAAACATCTTGGTTGCAAATAATCTTTCATGATCATTATATCAAATTACTTCATCCTTGTGGACTTGGACACAAGATTTTGATGCTTTAGTTCTTTTAGTGGAGAAGCTGGTCTGGACATTCATAGATATTTAAAAAAAAAAAAGTTATTTTAGGGGAGGGGCAATTGCTTCTCTGCCCCTAGGCTGTTTTTTTTTCTTTTTTTGTGATTAATACAAAGTCCTGTTTCTTATCAAAATAGATATGCTAATGGTGGCAAATGGAAATTATAGAATCTTTA
GAGGAAAAGGTATTTATTGGAAACTTTTAGTGGTGAATTTTTTTTTATTCCACATAAAAAATATTTTCTGGTGAAATAATGTTTTCTGCAACCTTGTACATGTTCTAGAAGACAACTGTAATGAATAACTTCATGTGTTGAGTGCAAAACTATACATGATCTTGAATTTGTTGGCTCATTTTCTTCTCAGGTTCTTAGTTCTTTTGAAAAATCCAAGTCCAAAGGAGCATCCAGAACTGCTGAAATGAATGTGCCTGGCAAGAGCAGTCAACAACAGCTCTCTCTGGTGCAGCATAATAGTCAGCCTCTTCGAGGTGGACAAGTCAAGTCCGATTCTCCAAAAACCACTCATGGGAAGTTTCTGGTTCTCAAACCAGTATGGGAGAATGGTGTGTTAAAGGATGGCTCAAATCCTATTAGTAATGTTAACAGTAGAACTGCAAATTGCCAGCCCTCTTCTGTTGCCTCTTCAGCAACACCTAACACTTCAAGGAACCAAAACAACCTGAATCCTTCATCCTCTCTGGAGCGAAAGGTATCTGCTTTAGATCTGAAATCAGGACCCACTTTGGAAAAGAGACCTCCTTCTGTTCAATCGCAAAGCAGGAATGATTTCTTCAATCTCATAAAGAAGAAAACTTTGGTGAATGGTTCCACTTGTCTACAAGATTCAGGTATCTGCACATCTCCCATCAAGGAAAAATCTGGGACAGTGAATGGGGAAGTAGTTAGTGCCGCAGTACATCCTTCTACTGTTACAGACAATGAGGTAGCTAGCAATGGTGATACCACTGAAGAGGTTCAGAGGTTTTCTGAGGTTGTGAATAAGAGTTTGAGCCCTAACAAAGCATTATGTACAGACGAGGAAGAGGCTGCATTTCTTCGTTCTCTTGGATGGGAAGAAAATTCAGGAGAGGATGAAGGACTAACAGAGGAGGAGATCAATGCTTTTTATCAGCAGGTTTGGTGCTTGCTTCCTATATTTTCATTAAAATTCTTGCATTCTGCATTACCTGTTTATCACTATTTGATTTCAATGTCAGTTTTCTTAATGAAAGAAATGCTGACGCCAATCCAGGAATAAGTCTGATCTTTCTTTTATAGATTTCCTGTTTTTTAATGCTTAATTGTGTCGAGAACTTGCTGGTCAATAATACGGATGGTCCCAGTTTGAACTTAAAATTATGTCCCTCGTACAGCCACATTCTTTGATTTGGAGGCCTGAATTCGTGATCATTTGTCATCAGGACAAGATATAAGCTCAGATTCCCTGGAAGTCATGGCATCAATTCGTGTGTCACTAGCCTAAGAAATTAAGTAGCACTGGTTTCCCAATATTGAAATATTGTAGATGGTATCTGCTTTTATTTAGTTGTTTGAAGAGACTAGTTAAGGTACTTATAATTTCGCTTATATATTGGACACTGCGATTATCAAAAGGGGAAAAGAAACCACGACTTTGTAACCTTGACCTACAAATTATGCATGAGACGTCCCGTTTCATTTTGGAGGGGACAAGGGCCTACTGTTTTGATATTTGTTCATAAGGATATTTATGATTGAATCAACAATATCTAGTAATGCAATAAAATTTGTCATTAGGCGAATTGCTACAGTAAAGATAATGTAAGAACTCAATTGTATTTATACAACATATTTCTGGTACTTGATATTTATTCAACTTTTTTGTTGCAGTACATGAACCTGAAGCCATCTTTGAAGCCGATCAGATGTAAGTTACCAGAACCTTCCAGTGCGGTCTGAATATGGTAATCCTTCCTGTGGGATCTTTGTCAGTTTAGACTCATCTGAGTTGGTGATCTCGCATGCCTAATATCAACTTGTTTTTGTGGTAAGTTGTCTAGGGGTTCTTCCCATATATTTCTTTTTTCCTTTTAGCGAGTAAAAGAAAAAGAAAAATTGACAATGTTGGGCAGGAGGCAGAGAGTGAGTGTTGCTTTGGCAAAGTGCAATAAGTCAATAGGTTAGGAAGTCTCTGC
TTGCATACAAGTTTACATTTTTATTTTTATTTTTTTTCCTTTTCTTTTTCGTTTTTCTGAAGGACAGTTAGGGAAGGGGTCGGATAGATGGTGTACTTTTGTTTGCAATTGCAGCTGAAGATGGGTACTGCAATTAGCATATTTTCCAGTATTTGGGCCCTTTTGTTACTCTTTTAAGATTGATCCTTAAAAAGGTCTGATTTGCATTTCATGAGTTTAAAG
mRNA sequence
CTTCAATTCGCTTCTTCCAATTCTCTCTTCGTTCTCGTCGTCTCTCGCATGGCGGACCAATAAGCATAAACTCAATTCACATTCCCTATCCGACGATAATTCGAATCCCCATTTCCCCCAACTTCTCCTCCGATTTCTTCTTCTATCTGCTCTTTTCTCCTACTCCCAACCTTATTGCATTCTTCTCAGACGCATTGCTTTGATACAGCCTACCCTTCAGCAGTTACCCCAACTCCTTTTGTTTTCCCTTCCTTCCAATTTCAATCGCTACTCTGAAGGGGTCTTCTGATTGAATTGTTTGCGGGGTTTCAAGTGTTGCTTCCAAATTCTGAATCTTCTTCCTCTATCGACGTAGATATGATCTTCATTCCTATTTCTACCGCCATTTACGCTGAGATGAGGGGCTTCTGACTTATTCACACTTGCTTCCAATCCGTTATTTGGGGAAGATGGATGAGGGAAAGATAGTTGCTCTTCTTCCTTCCGTTGCATTTTCTTTCGAATAGTCTTGATATCGTTGCCATTGCGTTCTGCGGCCTGGCTTCGAAACTTTGCGATTGCCACTTTTAGTGAAGGAGGTTTCAAGTAATATTGTTATTGTGAATTAGAGCTCAGTAATGGTTTCACTGTCATGCATGGATATGTTCAGTGATTATGATTTATAGGGTGGATTTTGTTGTCGGTGGAGAATAAATCTGCAATACAAGGGTCGAGTTTGCAAGTCTGTTAACGAGGATTAAATTTTTTATTTAGGTCGAGTCAAATCTTTCCTTTCCATCCTTCAAAGTAAATTGTAATATTGGCATCTACCCTGCTTGTTTCTTGTCTAACTCTTTTATTTCGTGGATGCCTTTATAAACATTTAGAGCACTTTTGTGCTCTTCGGCGGGCATTAGGAAGGACTATTACTTCTGGTGGTGTTTTGTCTTAATAATTATTCAAAGAATGTGGTAAAGGATGCAAGAAGTGATATGCTGTTATGTTTGATCATGTTTAATACTCATTATGGAAAGAAGTGAACCTACATTAGTTCCAGAGTGGTTGAGAAGTTCTGGAAGTCTTTCTGGGAGTGGAATTTCAGCTCAACAATTTGCATCGTCTTCTTCACACTCAGATATTTCTTCTCAAGCTCATTACTCGAGGAGTAGAACTTCAAAGAGCATTAGCGATATAGATAAACCACATTTTGATTTCTTGGATTGGTCATCTTCATCAACCTCGAGGAGGAGTTCTAGTAATGGCTCTGGAAAGAATGCTTACAGTAGTTTTAATAGGAATCATCGTGATAGAGACCGTGAGAAGGAGAAAGAAATGTCAAATCTTGGGGACCCCTGGGGTTATGACCTTTCTAGCCCTCTGGTAAACATATTTTCCAGTAGAGTTGAGAAGGAAACTTTGCGGCGTTCTCATTCAATGGTATCTAGGAAGCAAGGTGATTTATTTCCTCAAAGAGTTGCTGCTGACTTGAAAAGTGGAGGATATAATCATAAGGCTAACAGTAATGGCTTTCATTCAGGAAGTACCATTAACGGTATCACTGATAAGGCTGTTTTTGACAAGGATTTTCCATCACTTGGATCAGAAGAAAGGCAAGGAGGACCAGATGTAGGGAGAGTATCATCTCCTGGCTTGACCACATCTGTTCAGAGCTTGCCTATTGGGAGTTCAACTTTGATTGGTCGGGAGGGATGGACATCAGCTTTGGCTGAGGTGCCAACTATTGTCACAGGCAGTTCTGCTGCTCCATCATCTGTTCAACAGACTGTTGCTGCCAATTCTGGGTTGGGGTCCCCAAATGCAACAACTCCACGCAAGATGGCTGAAGCTTTAACACAGGCACCAACAAGAGCCCGTGTTACTTCTCAGTCAACTGAGTTATCTGTCAAGACACAGAGGCTTGAGGAATTGGCTATTAAGCAGTCCAGGCAATTAATCCCAGTAACACCTTCAATGCCTAAAGTTTCTATCAATGGATTTAGCCACGATCTCACTAAACAGAC
TCCCTTGTTCTCTGACTTCCTGTTCCACTGCAACCCATTGGCCAGTACTGTAGCTAGGGACCTAAGGTTTAGAGGATCCTACCATACCCTACAACGGACAGTTTCCCCGGTTCTTAGTTCTTTTGAAAAATCCAAGTCCAAAGGAGCATCCAGAACTGCTGAAATGAATGTGCCTGGCAAGAGCAGTCAACAACAGCTCTCTCTGGTGCAGCATAATAGTCAGCCTCTTCGAGGTGGACAAGTCAAGTCCGATTCTCCAAAAACCACTCATGGGAAGTTTCTGGTTCTCAAACCAGTATGGGAGAATGGTGTGTTAAAGGATGGCTCAAATCCTATTAGTAATGTTAACAGTAGAACTGCAAATTGCCAGCCCTCTTCTGTTGCCTCTTCAGCAACACCTAACACTTCAAGGAACCAAAACAACCTGAATCCTTCATCCTCTCTGGAGCGAAAGGTATCTGCTTTAGATCTGAAATCAGGACCCACTTTGGAAAAGAGACCTCCTTCTGTTCAATCGCAAAGCAGGAATGATTTCTTCAATCTCATAAAGAAGAAAACTTTGGTGAATGGTTCCACTTGTCTACAAGATTCAGGTATCTGCACATCTCCCATCAAGGAAAAATCTGGGACAGTGAATGGGGAAGTAGTTAGTGCCGCAGTACATCCTTCTACTGTTACAGACAATGAGGTAGCTAGCAATGGTGATACCACTGAAGAGGTTCAGAGGTTTTCTGAGGTTGTGAATAAGAGTTTGAGCCCTAACAAAGCATTATGTACAGACGAGGAAGAGGCTGCATTTCTTCGTTCTCTTGGATGGGAAGAAAATTCAGGAGAGGATGAAGGACTAACAGAGGAGGAGATCAATGCTTTTTATCAGCAGTACATGAACCTGAAGCCATCTTTGAAGCCGATCAGATGTAAGTTACCAGAACCTTCCAGTGCGGTCTGAATATGGTAATCCTTCCTGTGGGATCTTTGTCAGTTTAGACTCATCTGAGTTGGTGATCTCGCATGCCTAATATCAACTTGTTTTTGTGGTAAGTTGTCTAGGGGTTCTTCCCATATATTTCTTTTTTCCTTTTAGCGAGTAAAAGAAAAAGAAAAATTGACAATGTTGGGCAGGAGGCAGAGAGTGAGTGTTGCTTTGGCAAAGTGCAATAAGTCAATAGGTTAGGAAGTCTCTGCTTGCATACAAGTTTACATTTTTATTTTTATTTTTTTTCCTTTTCTTTTTCGTTTTTCTGAAGGACAGTTAGGGAAGGGGTCGGATAGATGGTGTACTTTTGTTTGCAATTGCAGCTGAAGATGGGTACTGCAATTAGCATATTTTCCAGTATTTGGGCCCTTTTGTTACTCTTTTAAGATTGATCCTTAAAAAGGTCTGATTTGCATTTCATGAGTTTAAAG
Coding sequence (CDS)
ATGGAAAGAAGTGAACCTACATTAGTTCCAGAGTGGTTGAGAAGTTCTGGAAGTCTTTCTGGGAGTGGAATTTCAGCTCAACAATTTGCATCGTCTTCTTCACACTCAGATATTTCTTCTCAAGCTCATTACTCGAGGAGTAGAACTTCAAAGAGCATTAGCGATATAGATAAACCACATTTTGATTTCTTGGATTGGTCATCTTCATCAACCTCGAGGAGGAGTTCTAGTAATGGCTCTGGAAAGAATGCTTACAGTAGTTTTAATAGGAATCATCGTGATAGAGACCGTGAGAAGGAGAAAGAAATGTCAAATCTTGGGGACCCCTGGGGTTATGACCTTTCTAGCCCTCTGGTAAACATATTTTCCAGTAGAGTTGAGAAGGAAACTTTGCGGCGTTCTCATTCAATGGTATCTAGGAAGCAAGGTGATTTATTTCCTCAAAGAGTTGCTGCTGACTTGAAAAGTGGAGGATATAATCATAAGGCTAACAGTAATGGCTTTCATTCAGGAAGTACCATTAACGGTATCACTGATAAGGCTGTTTTTGACAAGGATTTTCCATCACTTGGATCAGAAGAAAGGCAAGGAGGACCAGATGTAGGGAGAGTATCATCTCCTGGCTTGACCACATCTGTTCAGAGCTTGCCTATTGGGAGTTCAACTTTGATTGGTCGGGAGGGATGGACATCAGCTTTGGCTGAGGTGCCAACTATTGTCACAGGCAGTTCTGCTGCTCCATCATCTGTTCAACAGACTGTTGCTGCCAATTCTGGGTTGGGGTCCCCAAATGCAACAACTCCACGCAAGATGGCTGAAGCTTTAACACAGGCACCAACAAGAGCCCGTGTTACTTCTCAGTCAACTGAGTTATCTGTCAAGACACAGAGGCTTGAGGAATTGGCTATTAAGCAGTCCAGGCAATTAATCCCAGTAACACCTTCAATGCCTAAAGTTTCTATCAATGGATTTAGCCACGATCTCACTAAACAGACTCCCTTGTTCTCTGACTTCCTGTTCCACTGCAACCCATTGGCCAGTACTGTAGCTAGGGACCTAAGGTTTAGAGGATCCTACCATACCCTACAACGGACAGTTTCCCCGGTTCTTAGTTCTTTTGAAAAATCCAAGTCCAAAGGAGCATCCAGAACTGCTGAAATGAATGTGCCTGGCAAGAGCAGTCAACAACAGCTCTCTCTGGTGCAGCATAATAGTCAGCCTCTTCGAGGTGGACAAGTCAAGTCCGATTCTCCAAAAACCACTCATGGGAAGTTTCTGGTTCTCAAACCAGTATGGGAGAATGGTGTGTTAAAGGATGGCTCAAATCCTATTAGTAATGTTAACAGTAGAACTGCAAATTGCCAGCCCTCTTCTGTTGCCTCTTCAGCAACACCTAACACTTCAAGGAACCAAAACAACCTGAATCCTTCATCCTCTCTGGAGCGAAAGGTATCTGCTTTAGATCTGAAATCAGGACCCACTTTGGAAAAGAGACCTCCTTCTGTTCAATCGCAAAGCAGGAATGATTTCTTCAATCTCATAAAGAAGAAAACTTTGGTGAATGGTTCCACTTGTCTACAAGATTCAGGTATCTGCACATCTCCCATCAAGGAAAAATCTGGGACAGTGAATGGGGAAGTAGTTAGTGCCGCAGTACATCCTTCTACTGTTACAGACAATGAGGTAGCTAGCAATGGTGATACCACTGAAGAGGTTCAGAGGTTTTCTGAGGTTGTGAATAAGAGTTTGAGCCCTAACAAAGCATTATGTACAGACGAGGAAGAGGCTGCATTTCTTCGTTCTCTTGGATGGGAAGAAAATTCAGGAGAGGATGAAGGACTAACAGAGGAGGAGATCAATGCTTTTTATCAGCAGTACATGAACCTGAAGCCATCTTTGAAGCCGATCAGATGTAAGTTACCAGAACCTTCCAGTGCGGTCTGA
Protein sequence
MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPHFDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVNIFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDKAVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIVTGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEELAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYHTLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKTTHGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSLERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKSGTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLRSLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV
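The protein sequence above is expected to be the direct translation of the coding sequence. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The helper name `translate` and the short input fragments are illustrative assumptions, not part of the original record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G,
# the third position varying fastest (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Hypothetical helper for illustration; it assumes the input contains
    only A, C, G and T and starts in frame.
    """
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGAAAGAAGTGAACCTACA"))  # MERSEPT
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence would confirm that the two records agree.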
Homology
BLAST of Clc11G16740 vs. NCBI nr
Match:
XP_038899422.1 (uncharacterized protein LOC120086719 [Benincasa hispida] >XP_038899423.1 uncharacterized protein LOC120086719 [Benincasa hispida] >XP_038899424.1 uncharacterized protein LOC120086719 [Benincasa hispida] >XP_038899426.1 uncharacterized protein LOC120086719 [Benincasa hispida])
HSP 1 Score: 1052.0 bits (2719), Expect = 2.1e-303
Identity = 572/647 (88.41%), Positives = 579/647 (89.49%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH
Sbjct: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
FDFLDWSSSS+SRRSSSNGSGKNAYSSFNRNHRDRDR+KEKEMSNLGDPWGYD SSPLVN
Sbjct: 61 FDFLDWSSSSSSRRSSSNGSGKNAYSSFNRNHRDRDRDKEKEMSNLGDPWGYDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
IFSSRVEKETLRRSHSMVSRKQGDLFPQR A DLKS GYNHK NSNGFHSGSTINGITDK
Sbjct: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRGAVDLKSEGYNHKVNSNGFHSGSTINGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
AVFDKDFPSLGSEERQG PDVGRVSSPGLTTSVQSLP+GSST IGREGWTSALAEVPTI+
Sbjct: 181 AVFDKDFPSLGSEERQGAPDVGRVSSPGLTTSVQSLPMGSSTFIGREGWTSALAEVPTIL 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGS AAPSSVQ TVAA SGLGSPNATTPRKMAEALTQAPTRARV SQSTELSVKTQRLEE
Sbjct: 241 TGSLAAPSSVQPTVAATSGLGSPNATTPRKMAEALTQAPTRARVASQSTELSVKTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPVTPSMPKVS
Sbjct: 301 LAIKQSRQLIPVTPSMPKVS---------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
VL+SFEKSKSKGASR AEMNVPGK QQQLSLVQHNSQPLRGGQVKSDSPKT
Sbjct: 361 --------VLNSFEKSKSKGASRPAEMNVPGKGGQQQLSLVQHNSQPLRGGQVKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNT RNQNNLNPSSSL
Sbjct: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTLRNQNNLNPSSSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+ALDLKSG TLEKRPPS QSQSRNDFFNLIKKKTLVNGSTCLQDSGICTS IKEKS
Sbjct: 481 ERKVAALDLKSGSTLEKRPPSAQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSSIKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G VNGEVVSAAVHPSTVTD+EVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR
Sbjct: 541 GIVNGEVVSAAVHPSTVTDDEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 599
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEENSGEDEGLTEEEINAFYQQYM LKPSLKPIRCKLPEPSSAV
Sbjct: 601 SLGWEENSGEDEGLTEEEINAFYQQYMKLKPSLKPIRCKLPEPSSAV 599
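The summary lines of each hit (bit score, raw score, E-value, and the identity and positive fractions) can be pulled out of the report text and the percentages recomputed from the fractions. The sketch below is one way to do this in Python with regular expressions. The function name `parse_hsp` and the dictionary keys are assumptions made for illustration, and the patterns are written only against the line layout shown in this report.

```python
import re

# Patterns written against the two summary lines of a hit as shown above.
HSP_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([0-9.eE+-]+)"
)
# "Posi?tives" also accepts the misspelling "Postives" found in some reports.
FRACTION_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp(score_line: str, stats_line: str) -> dict:
    """Extract scores and recompute percentages (hypothetical helper)."""
    match = HSP_RE.search(score_line)
    if match is None:
        raise ValueError("unrecognised score line: " + score_line)
    result = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    for label, num, den in FRACTION_RE.findall(stats_line):
        key = "identity_pct" if label == "Identity" else "positives_pct"
        result[key] = round(100 * int(num) / int(den), 2)
    return result


hit = parse_hsp(
    "HSP 1 Score: 1052.0 bits (2719), Expect = 2.1e-303",
    "Identity = 572/647 (88.41%), Positives = 579/647 (89.49%), Query Frame = 0",
)
print(hit["identity_pct"], hit["positives_pct"])  # 88.41 89.49
```

Recomputing the percentages from the fractions, rather than reading the printed values, gives a quick consistency check on each hit.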
BLAST of Clc11G16740 vs. NCBI nr
Match:
XP_011653819.1 (uncharacterized protein LOC101213356 isoform X1 [Cucumis sativus] >XP_031740296.1 uncharacterized protein LOC101213356 isoform X1 [Cucumis sativus] >KGN54604.1 hypothetical protein Csa_012759 [Cucumis sativus])
HSP 1 Score: 1025.8 bits (2651), Expect = 1.6e-295
Identity = 559/647 (86.40%), Positives = 574/647 (88.72%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPTLVPEWLRSSGSLSGSGI AQQFASSSSHSDISSQ HYSRSRTSKSISDIDKPH
Sbjct: 1 MERSEPTLVPEWLRSSGSLSGSGI-AQQFASSSSHSDISSQGHYSRSRTSKSISDIDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
FDFLDWSSSS++RRSSSNGSGKNAYS+FNRNHRDRDREKEK+MSN GD WGYD SSPLVN
Sbjct: 61 FDFLDWSSSSSTRRSSSNGSGKNAYSNFNRNHRDRDREKEKDMSNHGDSWGYDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
+FSSR EKETLRRSHSMVSRKQGDLFPQRVA DLKSGGYNHKANSNGFH GSTINGITDK
Sbjct: 121 VFSSRAEKETLRRSHSMVSRKQGDLFPQRVAVDLKSGGYNHKANSNGFHLGSTINGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
AVFDKDFPSLGSEERQGGPDVGRVSSPGLTT VQSLPIGSSTLIGREGWTSALAEVPT V
Sbjct: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTCVQSLPIGSSTLIGREGWTSALAEVPTTV 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGS AAPSS+QQT ANSGLGSPNATTPRKMAEALTQAPTR RVTSQSTELSVKTQRLEE
Sbjct: 241 TGSPAAPSSIQQT--ANSGLGSPNATTPRKMAEALTQAPTRGRVTSQSTELSVKTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPVTPSMPKVS
Sbjct: 301 LAIKQSRQLIPVTPSMPKVS---------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
VLS+FEKSKSKGASRTAEMNVPGK QQQLS++QHNSQPLRGGQVKSDSPKT
Sbjct: 361 --------VLSTFEKSKSKGASRTAEMNVPGKGGQQQLSMMQHNSQPLRGGQVKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENGVLKDGSNPI+NVNSRTAN QPSSVASSAT NTSRNQNNL PSSSL
Sbjct: 421 THGKFLVLKPVWENGVLKDGSNPINNVNSRTANSQPSSVASSATSNTSRNQNNLTPSSSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+ALDLKSG TLE+RPPS QSQSR+DFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS
Sbjct: 481 ERKVAALDLKSGSTLERRPPSAQSQSRSDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G NGEVVSAAVHPS VTD+EVASNGDT+EEVQRFSEVVNKSLSPNKALCTDEEEAAFLR
Sbjct: 541 GIANGEVVSAAVHPSAVTDDEVASNGDTSEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 596
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPE SSAV
Sbjct: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPESSSAV 596
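In the alignments above, each Query row is paired with a Sbjct row of the same length, with `-` marking a gap. A minimal Python sketch of how identities, mismatches and gaps could be tallied from one such pair follows. The function name `compare_aligned` is hypothetical. Conservative substitutions (the `+` marks in the middle row) are not modelled here, because they depend on the scoring matrix used by BLAST.

```python
def compare_aligned(query: str, sbjct: str) -> dict:
    """Count identical, mismatched and gapped columns of an aligned pair.

    Hypothetical helper for illustration; both rows must already be
    aligned, so they must have the same length.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    counts = {"identities": 0, "mismatches": 0, "gaps": 0}
    for q, s in zip(query, sbjct):
        if q == "-" or s == "-":
            counts["gaps"] += 1
        elif q == s:
            counts["identities"] += 1
        else:
            counts["mismatches"] += 1
    return counts


# First row pair (residues 1-60) of the Cucumis sativus isoform X1 hit.
query_row = "MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH"
sbjct_row = "MERSEPTLVPEWLRSSGSLSGSGI-AQQFASSSSHSDISSQGHYSRSRTSKSISDIDKPH"
print(compare_aligned(query_row, sbjct_row))
```

Summing the counts over all row pairs of a hit should reproduce the numerator of the reported identity fraction, with the alignment length as the denominator.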
BLAST of Clc11G16740 vs. NCBI nr
Match:
XP_008444119.1 (PREDICTED: uncharacterized protein LOC103487558 isoform X1 [Cucumis melo] >XP_016899887.1 PREDICTED: uncharacterized protein LOC103487558 isoform X1 [Cucumis melo] >KAA0064265.1 mediator of RNA polymerase II transcription subunit 1 [Cucumis melo var. makuwa] >TYK18635.1 mediator of RNA polymerase II transcription subunit 1 [Cucumis melo var. makuwa])
HSP 1 Score: 1016.1 bits (2626), Expect = 1.3e-292
Identity = 556/647 (85.94%), Positives = 573/647 (88.56%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPTLVPEWLRSSGSLSGSGI AQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH
Sbjct: 1 MERSEPTLVPEWLRSSGSLSGSGI-AQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
FDFLDWSSSS++RRSSSNGSGKNAYS+FNRNHRDRDREKEK+MSN GDPWGYD SSPLVN
Sbjct: 61 FDFLDWSSSSSTRRSSSNGSGKNAYSNFNRNHRDRDREKEKDMSNHGDPWGYDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
+FSSR EKETLRRSHSMVSRKQGDLFPQRVA +LKSGGYNHKANSNGFHSGSTINGITDK
Sbjct: 121 VFSSRAEKETLRRSHSMVSRKQGDLFPQRVAVNLKSGGYNHKANSNGFHSGSTINGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
AVFDKDFPSLG+EERQGGPDVGRVSSPGLTT V SLPIGSSTLI REGWTSALAEVPT V
Sbjct: 181 AVFDKDFPSLGAEERQGGPDVGRVSSPGLTTCVHSLPIGSSTLISREGWTSALAEVPTTV 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGSS APSS+QQT ANSGLGSPNATT RKMAEALTQAPTR RVTSQSTELSVKTQRLEE
Sbjct: 241 TGSSGAPSSIQQT--ANSGLGSPNATTSRKMAEALTQAPTRGRVTSQSTELSVKTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPV PSMPKVS
Sbjct: 301 LAIKQSRQLIPVMPSMPKVS---------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
VL++FEKSKSKGASRTAEMNVPGK QQQLS++QHNSQPLRGGQVKSDSPKT
Sbjct: 361 --------VLNTFEKSKSKGASRTAEMNVPGKGGQQQLSMMQHNSQPLRGGQVKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENG+LKDGSNPISNVNSRTAN QPSSVASSAT NTSRNQNNL+P SSL
Sbjct: 421 THGKFLVLKPVWENGMLKDGSNPISNVNSRTANSQPSSVASSATSNTSRNQNNLHP-SSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+ALDLKSG TLEKRP S QSQSR+DFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS
Sbjct: 481 ERKVAALDLKSGSTLEKRPHSAQSQSRSDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G NGEVVSAAVHPS VTD+EVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR
Sbjct: 541 GIANGEVVSAAVHPSAVTDDEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 595
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV
Sbjct: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 595
BLAST of Clc11G16740 vs. NCBI nr
Match:
XP_031740297.1 (uncharacterized protein LOC101213356 isoform X2 [Cucumis sativus])
HSP 1 Score: 1011.9 bits (2615), Expect = 2.4e-291
Identity = 554/647 (85.63%), Positives = 569/647 (87.94%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPTLVPEWLRSSGSLSGSGI AQQFASSSSHS HYSRSRTSKSISDIDKPH
Sbjct: 1 MERSEPTLVPEWLRSSGSLSGSGI-AQQFASSSSHS-----GHYSRSRTSKSISDIDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
FDFLDWSSSS++RRSSSNGSGKNAYS+FNRNHRDRDREKEK+MSN GD WGYD SSPLVN
Sbjct: 61 FDFLDWSSSSSTRRSSSNGSGKNAYSNFNRNHRDRDREKEKDMSNHGDSWGYDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
+FSSR EKETLRRSHSMVSRKQGDLFPQRVA DLKSGGYNHKANSNGFH GSTINGITDK
Sbjct: 121 VFSSRAEKETLRRSHSMVSRKQGDLFPQRVAVDLKSGGYNHKANSNGFHLGSTINGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
AVFDKDFPSLGSEERQGGPDVGRVSSPGLTT VQSLPIGSSTLIGREGWTSALAEVPT V
Sbjct: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTCVQSLPIGSSTLIGREGWTSALAEVPTTV 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGS AAPSS+QQT ANSGLGSPNATTPRKMAEALTQAPTR RVTSQSTELSVKTQRLEE
Sbjct: 241 TGSPAAPSSIQQT--ANSGLGSPNATTPRKMAEALTQAPTRGRVTSQSTELSVKTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPVTPSMPKVS
Sbjct: 301 LAIKQSRQLIPVTPSMPKVS---------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
VLS+FEKSKSKGASRTAEMNVPGK QQQLS++QHNSQPLRGGQVKSDSPKT
Sbjct: 361 --------VLSTFEKSKSKGASRTAEMNVPGKGGQQQLSMMQHNSQPLRGGQVKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENGVLKDGSNPI+NVNSRTAN QPSSVASSAT NTSRNQNNL PSSSL
Sbjct: 421 THGKFLVLKPVWENGVLKDGSNPINNVNSRTANSQPSSVASSATSNTSRNQNNLTPSSSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+ALDLKSG TLE+RPPS QSQSR+DFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS
Sbjct: 481 ERKVAALDLKSGSTLERRPPSAQSQSRSDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G NGEVVSAAVHPS VTD+EVASNGDT+EEVQRFSEVVNKSLSPNKALCTDEEEAAFLR
Sbjct: 541 GIANGEVVSAAVHPSAVTDDEVASNGDTSEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 591
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPE SSAV
Sbjct: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPESSSAV 591
BLAST of Clc11G16740 vs. NCBI nr
Match:
XP_011653820.1 (uncharacterized protein LOC101213356 isoform X3 [Cucumis sativus])
HSP 1 Score: 957.2 bits (2473), Expect = 7.2e-275
Identity = 533/647 (82.38%), Positives = 548/647 (84.70%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPTLVPEWLRSSGSLSGSGI AQQFASSSSHSDISSQ HYSRSRTSKSISDIDKPH
Sbjct: 1 MERSEPTLVPEWLRSSGSLSGSGI-AQQFASSSSHSDISSQGHYSRSRTSKSISDIDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
FDFLDWSSSS++RRSSSNGSGKNAYS+FNRNHRDRDREKEK+MSN GD WGYD SSPLVN
Sbjct: 61 FDFLDWSSSSSTRRSSSNGSGKNAYSNFNRNHRDRDREKEKDMSNHGDSWGYDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
+FSSR EKETLRRSHSMVSRKQ GSTINGITDK
Sbjct: 121 VFSSRAEKETLRRSHSMVSRKQ----------------------------GSTINGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
AVFDKDFPSLGSEERQGGPDVGRVSSPGLTT VQSLPIGSSTLIGREGWTSALAEVPT V
Sbjct: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTCVQSLPIGSSTLIGREGWTSALAEVPTTV 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGS AAPSS+QQT ANSGLGSPNATTPRKMAEALTQAPTR RVTSQSTELSVKTQRLEE
Sbjct: 241 TGSPAAPSSIQQT--ANSGLGSPNATTPRKMAEALTQAPTRGRVTSQSTELSVKTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPVTPSMPKVS
Sbjct: 301 LAIKQSRQLIPVTPSMPKVS---------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
VLS+FEKSKSKGASRTAEMNVPGK QQQLS++QHNSQPLRGGQVKSDSPKT
Sbjct: 361 --------VLSTFEKSKSKGASRTAEMNVPGKGGQQQLSMMQHNSQPLRGGQVKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENGVLKDGSNPI+NVNSRTAN QPSSVASSAT NTSRNQNNL PSSSL
Sbjct: 421 THGKFLVLKPVWENGVLKDGSNPINNVNSRTANSQPSSVASSATSNTSRNQNNLTPSSSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+ALDLKSG TLE+RPPS QSQSR+DFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS
Sbjct: 481 ERKVAALDLKSGSTLERRPPSAQSQSRSDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G NGEVVSAAVHPS VTD+EVASNGDT+EEVQRFSEVVNKSLSPNKALCTDEEEAAFLR
Sbjct: 541 GIANGEVVSAAVHPSAVTDDEVASNGDTSEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 568
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPE SSAV
Sbjct: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPESSSAV 568
BLAST of Clc11G16740 vs. ExPASy TrEMBL
Match:
A0A0A0L0N1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377180 PE=4 SV=1)
HSP 1 Score: 1025.8 bits (2651), Expect = 7.9e-296
Identity = 559/647 (86.40%), Positives = 574/647 (88.72%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPTLVPEWLRSSGSLSGSGI AQQFASSSSHSDISSQ HYSRSRTSKSISDIDKPH
Sbjct: 1 MERSEPTLVPEWLRSSGSLSGSGI-AQQFASSSSHSDISSQGHYSRSRTSKSISDIDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
FDFLDWSSSS++RRSSSNGSGKNAYS+FNRNHRDRDREKEK+MSN GD WGYD SSPLVN
Sbjct: 61 FDFLDWSSSSSTRRSSSNGSGKNAYSNFNRNHRDRDREKEKDMSNHGDSWGYDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
+FSSR EKETLRRSHSMVSRKQGDLFPQRVA DLKSGGYNHKANSNGFH GSTINGITDK
Sbjct: 121 VFSSRAEKETLRRSHSMVSRKQGDLFPQRVAVDLKSGGYNHKANSNGFHLGSTINGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
AVFDKDFPSLGSEERQGGPDVGRVSSPGLTT VQSLPIGSSTLIGREGWTSALAEVPT V
Sbjct: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTCVQSLPIGSSTLIGREGWTSALAEVPTTV 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGS AAPSS+QQT ANSGLGSPNATTPRKMAEALTQAPTR RVTSQSTELSVKTQRLEE
Sbjct: 241 TGSPAAPSSIQQT--ANSGLGSPNATTPRKMAEALTQAPTRGRVTSQSTELSVKTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPVTPSMPKVS
Sbjct: 301 LAIKQSRQLIPVTPSMPKVS---------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
VLS+FEKSKSKGASRTAEMNVPGK QQQLS++QHNSQPLRGGQVKSDSPKT
Sbjct: 361 --------VLSTFEKSKSKGASRTAEMNVPGKGGQQQLSMMQHNSQPLRGGQVKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENGVLKDGSNPI+NVNSRTAN QPSSVASSAT NTSRNQNNL PSSSL
Sbjct: 421 THGKFLVLKPVWENGVLKDGSNPINNVNSRTANSQPSSVASSATSNTSRNQNNLTPSSSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+ALDLKSG TLE+RPPS QSQSR+DFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS
Sbjct: 481 ERKVAALDLKSGSTLERRPPSAQSQSRSDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G NGEVVSAAVHPS VTD+EVASNGDT+EEVQRFSEVVNKSLSPNKALCTDEEEAAFLR
Sbjct: 541 GIANGEVVSAAVHPSAVTDDEVASNGDTSEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 596
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPE SSAV
Sbjct: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPESSSAV 596
BLAST of Clc11G16740 vs. ExPASy TrEMBL
Match:
A0A1S3B9P2 (uncharacterized protein LOC103487558 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103487558 PE=4 SV=1)
HSP 1 Score: 1016.1 bits (2626), Expect = 6.3e-293
Identity = 556/647 (85.94%), Positives = 573/647 (88.56%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPTLVPEWLRSSGSLSGSGI AQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH
Sbjct: 1 MERSEPTLVPEWLRSSGSLSGSGI-AQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
FDFLDWSSSS++RRSSSNGSGKNAYS+FNRNHRDRDREKEK+MSN GDPWGYD SSPLVN
Sbjct: 61 FDFLDWSSSSSTRRSSSNGSGKNAYSNFNRNHRDRDREKEKDMSNHGDPWGYDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
+FSSR EKETLRRSHSMVSRKQGDLFPQRVA +LKSGGYNHKANSNGFHSGSTINGITDK
Sbjct: 121 VFSSRAEKETLRRSHSMVSRKQGDLFPQRVAVNLKSGGYNHKANSNGFHSGSTINGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
AVFDKDFPSLG+EERQGGPDVGRVSSPGLTT V SLPIGSSTLI REGWTSALAEVPT V
Sbjct: 181 AVFDKDFPSLGAEERQGGPDVGRVSSPGLTTCVHSLPIGSSTLISREGWTSALAEVPTTV 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGSS APSS+QQT ANSGLGSPNATT RKMAEALTQAPTR RVTSQSTELSVKTQRLEE
Sbjct: 241 TGSSGAPSSIQQT--ANSGLGSPNATTSRKMAEALTQAPTRGRVTSQSTELSVKTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPV PSMPKVS
Sbjct: 301 LAIKQSRQLIPVMPSMPKVS---------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
VL++FEKSKSKGASRTAEMNVPGK QQQLS++QHNSQPLRGGQVKSDSPKT
Sbjct: 361 --------VLNTFEKSKSKGASRTAEMNVPGKGGQQQLSMMQHNSQPLRGGQVKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENG+LKDGSNPISNVNSRTAN QPSSVASSAT NTSRNQNNL+P SSL
Sbjct: 421 THGKFLVLKPVWENGMLKDGSNPISNVNSRTANSQPSSVASSATSNTSRNQNNLHP-SSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+ALDLKSG TLEKRP S QSQSR+DFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS
Sbjct: 481 ERKVAALDLKSGSTLEKRPHSAQSQSRSDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G NGEVVSAAVHPS VTD+EVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR
Sbjct: 541 GIANGEVVSAAVHPSAVTDDEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 595
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV
Sbjct: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 595
BLAST of Clc11G16740 vs. ExPASy TrEMBL
Match:
A0A5D3D506 (Mediator of RNA polymerase II transcription subunit 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G001750 PE=4 SV=1)
HSP 1 Score: 1016.1 bits (2626), Expect = 6.3e-293
Identity = 556/647 (85.94%), Positives = 573/647 (88.56%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPTLVPEWLRSSGSLSGSGI AQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH
Sbjct: 1 MERSEPTLVPEWLRSSGSLSGSGI-AQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
FDFLDWSSSS++RRSSSNGSGKNAYS+FNRNHRDRDREKEK+MSN GDPWGYD SSPLVN
Sbjct: 61 FDFLDWSSSSSTRRSSSNGSGKNAYSNFNRNHRDRDREKEKDMSNHGDPWGYDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
+FSSR EKETLRRSHSMVSRKQGDLFPQRVA +LKSGGYNHKANSNGFHSGSTINGITDK
Sbjct: 121 VFSSRAEKETLRRSHSMVSRKQGDLFPQRVAVNLKSGGYNHKANSNGFHSGSTINGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
AVFDKDFPSLG+EERQGGPDVGRVSSPGLTT V SLPIGSSTLI REGWTSALAEVPT V
Sbjct: 181 AVFDKDFPSLGAEERQGGPDVGRVSSPGLTTCVHSLPIGSSTLISREGWTSALAEVPTTV 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGSS APSS+QQT ANSGLGSPNATT RKMAEALTQAPTR RVTSQSTELSVKTQRLEE
Sbjct: 241 TGSSGAPSSIQQT--ANSGLGSPNATTSRKMAEALTQAPTRGRVTSQSTELSVKTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPV PSMPKVS
Sbjct: 301 LAIKQSRQLIPVMPSMPKVS---------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
VL++FEKSKSKGASRTAEMNVPGK QQQLS++QHNSQPLRGGQVKSDSPKT
Sbjct: 361 --------VLNTFEKSKSKGASRTAEMNVPGKGGQQQLSMMQHNSQPLRGGQVKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENG+LKDGSNPISNVNSRTAN QPSSVASSAT NTSRNQNNL+P SSL
Sbjct: 421 THGKFLVLKPVWENGMLKDGSNPISNVNSRTANSQPSSVASSATSNTSRNQNNLHP-SSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+ALDLKSG TLEKRP S QSQSR+DFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS
Sbjct: 481 ERKVAALDLKSGSTLEKRPHSAQSQSRSDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G NGEVVSAAVHPS VTD+EVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR
Sbjct: 541 GIANGEVVSAAVHPSAVTDDEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 595
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV
Sbjct: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 595
BLAST of Clc11G16740 vs. ExPASy TrEMBL
Match:
A0A6J1IJ74 (uncharacterized protein LOC111476775 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111476775 PE=4 SV=1)
HSP 1 Score: 953.0 bits (2462), Expect = 6.5e-274
Identity = 523/647 (80.83%), Positives = 552/647 (85.32%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
MERSEPT VPEWLR SGSLSGSG SAQQ ASSSSH+D +SQ+ SRSRTSKSISD+DKPH
Sbjct: 1 MERSEPTFVPEWLRISGSLSGSGNSAQQSASSSSHTDGTSQSQCSRSRTSKSISDMDKPH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLVN 120
F+FLD SSSS+SRRSSSNGSGKNAYSSFNRNH DRDREKEKE+SNLGDPWG+D SSPLVN
Sbjct: 61 FEFLDRSSSSSSRRSSSNGSGKNAYSSFNRNHHDRDREKEKELSNLGDPWGHDFSSPLVN 120
Query: 121 IFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITDK 180
F+ RVEKETLRRSHSMVSRK GDL+PQR A DLKSG YNHKANSNGFHSGSTI GITDK
Sbjct: 121 TFTGRVEKETLRRSHSMVSRKPGDLYPQRHAGDLKSGSYNHKANSNGFHSGSTITGITDK 180
Query: 181 AVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTIV 240
A F +DFPSLGSEERQG PDVGRVSSPGLTTSVQS PIG+STLIGREGWTSALAEVPT+V
Sbjct: 181 AGFYEDFPSLGSEERQGWPDVGRVSSPGLTTSVQSFPIGNSTLIGREGWTSALAEVPTVV 240
Query: 241 TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLEE 300
TGS APS VQQ+VAANSGLGSPNATTPRKMAEALTQAPTR+RVT QSTELSV TQRLEE
Sbjct: 241 TGSMTAPSFVQQSVAANSGLGSPNATTPRKMAEALTQAPTRSRVTHQSTELSVTTQRLEE 300
Query: 301 LAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSYH 360
LAIKQSRQLIPVTPSMPKVS++
Sbjct: 301 LAIKQSRQLIPVTPSMPKVSVH-------------------------------------- 360
Query: 361 TLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPKT 420
SFEKSKSKGASRTAE+NVP K QQQL L QHNSQPLRGGQ+KSDSPKT
Sbjct: 361 -----------SFEKSKSKGASRTAEVNVPAKGGQQQLPLGQHNSQPLRGGQIKSDSPKT 420
Query: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSSL 480
THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSAT TSRNQNNLNPSSSL
Sbjct: 421 THGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATTITSRNQNNLNPSSSL 480
Query: 481 ERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEKS 540
ERKV+A DLKSG TLEKRPPS Q QSRNDFFNLIKKKT+VNGST LQDSGI TSP+KEKS
Sbjct: 481 ERKVAASDLKSGSTLEKRPPSTQLQSRNDFFNLIKKKTVVNGSTSLQDSGIFTSPVKEKS 540
Query: 541 GTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFLR 600
G V+GE+ +AA+HPSTVTD+EVASNGDT EEVQR S+VVNKSLS N ALCTDEEE AFLR
Sbjct: 541 GLVSGEIGNAAMHPSTVTDDEVASNGDTAEEVQRSSDVVNKSLSTNLALCTDEEEVAFLR 598
Query: 601 SLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
SLGWEE+SGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV
Sbjct: 601 SLGWEEDSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 598
BLAST of Clc11G16740 vs. ExPASy TrEMBL
Match:
A0A6J1ILU5 (uncharacterized protein LOC111476775 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476775 PE=4 SV=1)
HSP 1 Score: 948.3 bits (2450), Expect = 1.6e-272
Identity = 523/648 (80.71%), Positives = 552/648 (85.19%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSH-SDISSQAHYSRSRTSKSISDIDKP 60
MERSEPT VPEWLR SGSLSGSG SAQQ ASSSSH +D +SQ+ SRSRTSKSISD+DKP
Sbjct: 1 MERSEPTFVPEWLRISGSLSGSGNSAQQSASSSSHTTDGTSQSQCSRSRTSKSISDMDKP 60
Query: 61 HFDFLDWSSSSTSRRSSSNGSGKNAYSSFNRNHRDRDREKEKEMSNLGDPWGYDLSSPLV 120
HF+FLD SSSS+SRRSSSNGSGKNAYSSFNRNH DRDREKEKE+SNLGDPWG+D SSPLV
Sbjct: 61 HFEFLDRSSSSSSRRSSSNGSGKNAYSSFNRNHHDRDREKEKELSNLGDPWGHDFSSPLV 120
Query: 121 NIFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGITD 180
N F+ RVEKETLRRSHSMVSRK GDL+PQR A DLKSG YNHKANSNGFHSGSTI GITD
Sbjct: 121 NTFTGRVEKETLRRSHSMVSRKPGDLYPQRHAGDLKSGSYNHKANSNGFHSGSTITGITD 180
Query: 181 KAVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPTI 240
KA F +DFPSLGSEERQG PDVGRVSSPGLTTSVQS PIG+STLIGREGWTSALAEVPT+
Sbjct: 181 KAGFYEDFPSLGSEERQGWPDVGRVSSPGLTTSVQSFPIGNSTLIGREGWTSALAEVPTV 240
Query: 241 VTGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRLE 300
VTGS APS VQQ+VAANSGLGSPNATTPRKMAEALTQAPTR+RVT QSTELSV TQRLE
Sbjct: 241 VTGSMTAPSFVQQSVAANSGLGSPNATTPRKMAEALTQAPTRSRVTHQSTELSVTTQRLE 300
Query: 301 ELAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGSY 360
ELAIKQSRQLIPVTPSMPKVS++
Sbjct: 301 ELAIKQSRQLIPVTPSMPKVSVH------------------------------------- 360
Query: 361 HTLQRTVSPVLSSFEKSKSKGASRTAEMNVPGKSSQQQLSLVQHNSQPLRGGQVKSDSPK 420
SFEKSKSKGASRTAE+NVP K QQQL L QHNSQPLRGGQ+KSDSPK
Sbjct: 361 ------------SFEKSKSKGASRTAEVNVPAKGGQQQLPLGQHNSQPLRGGQIKSDSPK 420
Query: 421 TTHGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPSSS 480
TTHGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSAT TSRNQNNLNPSSS
Sbjct: 421 TTHGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATTITSRNQNNLNPSSS 480
Query: 481 LERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIKEK 540
LERKV+A DLKSG TLEKRPPS Q QSRNDFFNLIKKKT+VNGST LQDSGI TSP+KEK
Sbjct: 481 LERKVAASDLKSGSTLEKRPPSTQLQSRNDFFNLIKKKTVVNGSTSLQDSGIFTSPVKEK 540
Query: 541 SGTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAAFL 600
SG V+GE+ +AA+HPSTVTD+EVASNGDT EEVQR S+VVNKSLS N ALCTDEEE AFL
Sbjct: 541 SGLVSGEIGNAAMHPSTVTDDEVASNGDTAEEVQRSSDVVNKSLSTNLALCTDEEEVAFL 599
Query: 601 RSLGWEENSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 648
RSLGWEE+SGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV
Sbjct: 601 RSLGWEEDSGEDEGLTEEEINAFYQQYMNLKPSLKPIRCKLPEPSSAV 599
BLAST of Clc11G16740 vs. TAIR 10
Match:
AT1G36990.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G08510.1); Has 5029 Blast hits to 1779 proteins in 339 species: Archae - 2; Bacteria - 1372; Metazoa - 990; Fungi - 933; Plants - 111; Viruses - 28; Other Eukaryotes - 1593 (source: NCBI BLink). )
HSP 1 Score: 353.2 bits (905), Expect = 4.4e-97
Identity = 262/640 (40.94%), Positives = 352/640 (55.00%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
M++ E +L PEWLRSSG SG G S SSSSHSD +S + SR+R S+S SD+D H
Sbjct: 1 MDKGEHSLAPEWLRSSGHASGGGSSNHLLVSSSSHSDSASLQYNSRNRNSRSKSDVDSIH 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSS--FNRNHRDRDREKEKEMSNLGDPWGYDLSSPL 120
FLD SSS+ SRR SSNGS K+AYSS FNR+ RD+DR ++K+ + DPW D S PL
Sbjct: 61 SPFLDRSSSTNSRRGSSNGSAKHAYSSFNFNRSQRDKDRSRDKDRVSYVDPWDLDTSIPL 120
Query: 121 VNIFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGIT 180
I + R + + LRRSHSMV+RKQG+ + + L +GG ++ N NG SG +I
Sbjct: 121 RTILTGR-DPDPLRRSHSMVTRKQGEHLSRGLTVGLNNGGSSNSYNGNGLLSGPSIGNSF 180
Query: 181 DKAVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPT 240
+ FDKDFPSLG+EE+Q G DV RVSSPG+++ VQ+LP+G+S LIG EGWTSALAEVP
Sbjct: 181 QRTGFDKDFPSLGAEEKQNGQDVVRVSSPGISSVVQNLPVGNSALIGGEGWTSALAEVPN 240
Query: 241 IV----TGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVK 300
++ TGS +P + + +G N MAEAL QAP R Q SVK
Sbjct: 241 VIEKACTGSLTSPKANAVSAGTLTGPSGLN------MAEALVQAPARTHTPPQG---SVK 300
Query: 301 TQRLEELAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLR 360
TQRLE+LAIKQSRQLIPV PS PK G S
Sbjct: 301 TQRLEDLAIKQSRQLIPVVPSAPK----GLS----------------------------- 360
Query: 361 FRGSYHTLQRTVSPVLSSFEKSKSKGASRTAEMNV-PGKSSQQQLSLVQHNSQPLRGGQV 420
L+S +KSK+K RT E + P +++ QQ +++ + Q GQ+
Sbjct: 361 ---------------LNSSDKSKTKQVVRTGETCLAPSRNALQQPAVLLGSFQSNPSGQI 420
Query: 421 KSDSPKTTHGKFLVLKPVWENGV--LKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQ 480
K + K LVLKP ENGV +K+ +P +N N+R A+ Q S S R+
Sbjct: 421 KPEK------KLLVLKPARENGVSAVKESGSPSANTNTRAASSQLMSNTQSTQSAPVRST 480
Query: 481 NNLNPSSSLERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGI 540
N S + SA + SG T+EK+P + Q+QSR+ F++ +K+K + S I
Sbjct: 481 N----SPKELKGASAFSMISGQTIEKKPSAAQAQSRSAFYSALKQKQTASTS-------I 540
Query: 541 CTSPIKEKSGTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCT 600
T P+ + + V + + +S + EV +V + +
Sbjct: 541 TTDPVSSSTSASSSVEVKLNSSKDLIASDPSSSQATSGVEVTDSVQVASHTSGFEATDTP 564
Query: 601 DEEEAAFLRSLGWEENSGEDEGLTEEEINAFYQQYMNLKP 632
DEEEA FLRSLGW EN+GE E LTEEEI++F +QY L+P
Sbjct: 601 DEEEAQFLRSLGWVENNGE-EYLTEEEIDSFLEQYKELRP 564
BLAST of Clc11G16740 vs. TAIR 10
Match:
AT4G08510.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36990.1); Has 888 Blast hits to 321 proteins in 121 species: Archae - 0; Bacteria - 120; Metazoa - 86; Fungi - 24; Plants - 79; Viruses - 0; Other Eukaryotes - 579 (source: NCBI BLink). )
HSP 1 Score: 283.9 bits (725), Expect = 3.3e-76
Identity = 233/636 (36.64%), Positives = 329/636 (51.73%), Query Frame = 0
Query: 1 MERSEPTLVPEWLRSSGSLSGSGISAQQFASSSSHSDISSQAHYSRSRTSKSISDIDKPH 60
ME+ EP+LVPEWLRSSG SG G S + +S S++R ++S SD D
Sbjct: 1 MEKREPSLVPEWLRSSGHGSGVG----------SSNSLSDSLRNSKNRNARSRSDADSVG 60
Query: 61 FDFLDWSSSSTSRRSSSNGSGKNAYSS--FNRNHRDRDREKEKEMSNLGDPWGYDLSSPL 120
FLD SSS+ +RR SSNGS K+AYSS FNR++RD+DR +EK+ + DPW D S P
Sbjct: 61 SPFLDRSSSTNTRRGSSNGSTKHAYSSFNFNRSNRDKDRSREKDRMSYMDPWDNDSSMPF 120
Query: 121 VNIFSSRVEKETLRRSHSMVSRKQGDLFPQRVAADLKSGGYNHKANSNGFHSGSTINGIT 180
R E E LRRSHSM +RKQG+ Q K+GG + N +G G++ +
Sbjct: 121 GTFLIGRGE-EPLRRSHSMTTRKQGNHLAQGFTVGYKNGGNINTFNGHGILPGTSPVKSS 180
Query: 181 DKAVFDKDFPSLGSEERQGGPDVGRVSSPGLTTSVQSLPIGSSTLIGREGWTSALAEVPT 240
+ F+KDFP L EER GGPDV R+SSPG + + QSL + + LI EGWTSALAEVP
Sbjct: 181 KRMGFNKDFPLLRGEERNGGPDVVRISSPGRSPTAQSLSVANPALIIGEGWTSALAEVPN 240
Query: 241 IVTGSSAAPSSVQQTVAANSGLGSPNATTPRKMAEALTQAPTRARVTSQSTELSVKTQRL 300
++ S A S V ++ L P R MAEAL QAP R Q+ Q L
Sbjct: 241 VIEKSGGAES--HANVGNSATLSGPAC---RNMAEALVQAPGRTGTPPQA-------QTL 300
Query: 301 EELAIKQSRQLIPVTPSMPKVSINGFSHDLTKQTPLFSDFLFHCNPLASTVARDLRFRGS 360
E+ AI+QSRQLIPV PS PK GS
Sbjct: 301 EDRAIRQSRQLIPVVPSAPK--------------------------------------GS 360
Query: 361 YHTLQRTVSPVLSSFEKSKSKGASRTAEMNV-PGKSSQQQLSLVQHNSQPLRGGQVKSDS 420
H +S +KSK+K R+ E + +++QQQ S++ N Q G Q+K D+
Sbjct: 361 VH----------NSSDKSKTKPMFRSGETGLASSRNTQQQSSVMLGNMQSNPGSQIKPDT 420
Query: 421 PKTTHGKFLVLKPVWENGVLKDGSNPISNVNSRTANCQPSSVASSATPNTSRNQNNLNPS 480
K K ++LKP ENGV+ GS P NSR A QP++ S+ + R+ N
Sbjct: 421 TK----KLVILKPARENGVVAGGSPP----NSRVAASQPTTAPSTQFTASVRSTN----- 480
Query: 481 SSLERKVSALDLKSGPTLEKRPPSVQSQSRNDFFNLIKKKTLVNGSTCLQDSGICTSPIK 540
+ + +++++ +G EK+ Q+QSR+ F++ +K+KT N ST + C
Sbjct: 481 GPRDLRGASVNMLAGKAAEKKLSLAQTQSRHAFYSALKQKTCTNISTDPSKTSSCILSSV 540
Query: 541 EKSGTVNGEVVSAAVHPSTVTDNEVASNGDTTEEVQRFSEVVNKSLSPNKALCTDEEEAA 600
E+ + E+V+ S + + A + E V++ S V + A+ D +EAA
Sbjct: 541 EEQANSSKELVA-----SDPSSPQAAERDEIMESVEKVSNVAERISRFESAVRPDPKEAA 544
Query: 601 FLRSLGWEENSGEDEGLTEEEINAFYQQYMNLKPSL 634
FL+SLGW+EN ++ T EE+ + +++ KPSL
Sbjct: 601 FLKSLGWDENDSDEYTHTMEEMREWCKKF---KPSL 544
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038899422.1 | 2.1e-303 | 88.41 | uncharacterized protein LOC120086719 [Benincasa hispida] >XP_038899423.1 unchara...
XP_011653819.1 | 1.6e-295 | 86.40 | uncharacterized protein LOC101213356 isoform X1 [Cucumis sativus] >XP_031740296....
XP_008444119.1 | 1.3e-292 | 85.94 | PREDICTED: uncharacterized protein LOC103487558 isoform X1 [Cucumis melo] >XP_01...
XP_031740297.1 | 2.4e-291 | 85.63 | uncharacterized protein LOC101213356 isoform X2 [Cucumis sativus]
XP_011653820.1 | 7.2e-275 | 82.38 | uncharacterized protein LOC101213356 isoform X3 [Cucumis sativus]
Match Name | E-value | Identity | Description
Match Name | E-value | Identity | Description
A0A0A0L0N1 | 7.9e-296 | 86.40 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377180 PE=4 SV=1
A0A1S3B9P2 | 6.3e-293 | 85.94 | uncharacterized protein LOC103487558 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3D506 | 6.3e-293 | 85.94 | Mediator of RNA polymerase II transcription subunit 1 OS=Cucumis melo var. makuw...
A0A6J1IJ74 | 6.5e-274 | 80.83 | uncharacterized protein LOC111476775 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1ILU5 | 1.6e-272 | 80.71 | uncharacterized protein LOC111476775 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity | Description
AT1G36990.1 | 4.4e-97 | 40.94 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXP...
AT4G08510.1 | 3.3e-76 | 36.64 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
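The Identity values in the tables above are the percentages from the per-hit statistics lines, i.e. identical residues divided by alignment length. As a minimal sketch (not part of the database's own tooling), the Python below extracts those counts from a statistics line and recomputes the percentage. The names `parse_hsp_stats` and `recomputed_pct` are hypothetical, and the regular expression assumes the line layout shown on this page.

```python
import re

# Pattern for lines such as:
#   Identity = 556/647 (85.94%), Positives = 573/647 (88.56%), Query Frame = 0
# "Posi?tives" also tolerates the misspelt label "Postives" found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from one statistics line.

    Hypothetical helper; returns None when the line does not match.
    """
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 556/647 (85.94%), Positives = 573/647 (88.56%), Query Frame = 0"
)
identity_check = recomputed_pct(stats["identities"], stats["alignment_length"])
```

For the A0A5D3D506 hit, 556 identities over 647 aligned positions recomputes to 85.94, which agrees with the value in the summary table.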