ClCG03G000095 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG03G000095
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Integrator complex subunit 9-like protein isoform X1
Location: CG_Chr03: 145682 .. 152875 (+)
RNA-Seq Expression: ClCG03G000095
Synteny: ClCG03G000095
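The Location field gives the chromosome, start, end, and strand of the gene. The sketch below parses that string and computes the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state; the names `Location`, `parse_location`, and `span_length` are illustrative only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome or scaffold name, e.g. "CG_Chr03"
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'CG_Chr03: 145682 .. 152875 (+)'."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the feature in bp, assuming inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CG_Chr03: 145682 .. 152875 (+)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the gene spans 7194 bp on the plus strand of CG_Chr03.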
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAATTTGTAAGTTGCTTCTTAATATCTGCTATATTAGTTCACTGCTTGCATACATTCGAGTTCATGGAAAGTAATCTTTTTCTATAAAAATAATAATAATTTTTTCTTTTTCCGTTTCAGACTTGTTTAAGCAAAGGTGGATGTTTCTATTTCCCACCATGTCATATGCTCAATATTTGTGGGTTTAGAATCCAATTTGACTGTCCTGTGGACTTTTCAGCTCTCCCTATCTTCTCCCCTGTTCCTTTTGATTTTGATGTTCTTTCAGATAAAGAACTATCAAGTCACCCGGGCCACGATTCTCTCAATTTGGAAAATGTGTCTGAGGAGAAAACTGAAAAGCCACTTGATGTGGGTTCTTTGATAAAAGCGGAGCCTTGCTACAAAATCATTAAGAACTTGTGTCTCTGGAACCCATCTTTCACTAATATTGTTTTGATTTCTAGTCCAATGGGCATGTTAGGACTACCCTTTTTGACTCGAGAGAAGGGGTTCTCTGCAAAGGTAGATGATACTATTTTCATTGCATTGAATCTTTTGTGGTCATAATAAGTTTCAGTGCTGTGAAAACTTGATTTCACATACTTTTCACTGGCTTGTCATAGATATATGCGACAGAAGCAACTACAAGACTCGGTAAAATTATGATGGATGACCTTGTTGCAATGCATATGGAATTCAAACAGTTTTATGGATCTGAAGATGATGCTATCTTGCAGTGGATGAGGCCAGAAGAGCTAAAGCTGCTTCATCGTGCGCTAAGAGAAGTGGCGTTTGGGCAGGATGGAGCAGATCTTGGGGGTTGGATGCCCATGTATAGGTAACTTAGTTATTTAGTATTTCTTTGAATACTTTACATAGGCCTTTTAGAGCCTAGCTGGAAAGACAAAATAGAAACTAAAGTAATTAGATGGAGATCTGGGTGGTTGGGTGCCTGTTTATTCTGATTGACACAGTTTTATTTTATTATTTTTCTTCATTTTATTATTTTCTTAATAACTGGGAATCATTTTCATATTGTACCATTGTGGATCTTGTCTATGAACGATGGAAATTAAGGAATTTTCTTTGATTTATTATTTGTTTCGCTATACTTTTTCTTTGATGGTGTTTGTGAAACATTTCTCTATCTTTGTCGTAAGAATCACTCTGTCATCTTGCTATTACAATTGTGGTGTCATGACTTGCACTCTGATCGTACACTGCATTTGTTTATTCGGTTGCATATAATTACCGTGACTTTATCTTGGTCTTCCTTTGGTGGGAATCCTTGAAGCCAAAGCCTTTTGGGACCTTGTTATTGATAAAGGAAACTAACGCTGGCTCAACTTTAAAGCACTCCTCTCCAATCTTTCTACTACCGTTTCAACTATCCTACTTTAGGGGCCCATTTTCATCACCAAGTTGGTTGACGAAGCTCCTATGTATGCTTTGCTCGAAGGGACTAGTTTTGGTTCTCATTTGGGGGTAAAGTTGCTCCCTTGAGGTGTCTTGATTGCTAAAGGTGGTCTTGTGCGACGATTGAGGAGTTTCTCCTCCATCCGCCTTTCAGAGAGAAAAGTCGTTTCTTGTGGCTTGCTGGAGTGTGTGCGGTGGTTTGGGACATTTGGGGAAGAGGAATGGTAGAGTGTTTAGAGGTAGGGACAGGGACCCTTGTGAGGTTTGATTTTTTGGTGAGATTTCATGTGTCCCTTTGGGCTTCTATTTCGAAGCTTTTTTTGTAACTATTCTTTAGGAAACATTTTTCTTAGTTGGAACCCCTTCGTTGGGGTGTTTTGGTGGACTGGTTTTTTGTTCGCCCTTGTATTCTTTCATTTTTCTCAATGAAAGTTGTTCTTATAAAAAAAAAAAGAAAGAAAGAAAGAAAAAAAAAGAAAAGGATGGTCTAGTTTTGGGAGCATGCATAAAAGGAACTTCACCTTGTTTTCAAAATGGCTATGGAGATTCTAGAGTCATGGACTATGCTGTAGGCTGGCGGAGGTATTCATCATTCATGG
AGAAGACCTCCTTGGGTGGAATCCAAAGTGAAGAAACCTATCGTTTTCCCAAAGTTCTTGGAATTCTATTTCCAGCCTTAAAGACTTGAGGAAGAGAAATACTATTGGGACTGGGAATGGGGTTAGGTTTTGGCTTGAACATTGGGTGTCTGATAGTCTTCTTGCAGAATATATCCAAAAGCAAATTTTCCATGGTTCATTTTGTTGAAATCTGGATGATTGGGAGTTGTTGTTAAGGGGAAATCTGGATGATTGGGTGCCGGTGGAATGGTTATGCCTTTAGCCAATATATGAAGGTGTGGCCTTCTCTCTCTTTAAGTTTTTAAAAAACAAATTTCATTGATGTATAAAATTTACTAAACAAGGATTATTAGCCCAAACCAAAGGAATTACAAAAGACTTCTCCAATTGGTCAAAAGGGAAGAAATACCATTATAAGAGCTATAGAAAGATGTACATTTGCACAATTCATAGCAAGATAGATTTGGGAGTAGTACTTGGATTATATTTTTTGGCACTACAATCTTGCGAGTAGTATTTGGGAATCTTTCCTTAAACATTTGGCATGATGGATGCTCGTCATAATCATGTTAGCACTACGATCGATGAGTTTCTCCTCAATCTACTTTTTGGAGAAAGGGCCGTTTTTTATGGCTTGTAGGGGTGTGTGCAATTTTATGGGTCTTGTAGAGGGAGTGAAATAGAAGGGTGTTTAGGGGCTTGGAGAGGGATCCTTTGGAGATTTGGTGCCTTGTCCGTTTTCATGTATCCTTGTAGGTTTTGATTTCGAAGACCTTTTGTAACTATTCTTTAGGCACTATTGTGCATAGCTAGAGTCGCTTTCTTTGAAGTGTGCCTTCTTTTTTCATGCCTGTGTATTATTTCATTCTTTCTCAATGAAAGTTGTTTATTTCATTAAAAAAAGAAAAAGGTCATAGCAAGAAAGATAACGCGTTCAAAAAGAATCCATCAAAAGTGCTTCCTTTTCCATTTAAGAGACAGTTGTTCTGTTCCATCCAAGTGCTCCATGAAAAGGCCATCAGTGAAGCGGTGGCCTGTGAGGGTGTAAGTAAGGAGGTTATTAATCTCATTAGGTAGGGCAGTCGACCATCCAAAGGAATGAGTGATCTACTTCCAAAATCTGTTGGCAAAATGACAACTGATAAACAAGTGGCTTGTTGGCCTTTTCTCCTTTTGAAGACAAGAGAATATAGCCTCTCAATCCTTCAAGCTTATTTCCTTGTGAGATAGTTTCCAATAGTTTTTCTAGGACTTTCTATATGCTGAAGGTCTGTTTACTATGGTTTGGAAAGGTAAATGACCAAAGTAAAATTGAAGTCTTCCCTTGGATTTTGTTCCATGGAGGCCTCCACTGATGATAAAATTGAAGATAATCAATCAAATTGCAATTTCTTTCTGGAGGCAATATGGAAAAATCATCGGACTTAGAAACATAAAATCCCAAAATATGTTTAGTTCATCTCTCTTTACTTGATTTAAGTTTACGTTCTCTCTCTTTTGCTAACTATCAGGTCTGGGTTCTCTCTTATTAATCGGGTTGTTGTGGCTCCTCTTGTTGCTTCTGCTGTGGGAATATTTGTGGGAATCGTTTCAGAATTCTATTAACACCACAGCCTGTTATGGAGTGAAGATTCCTTTCCTTGGATACCGTTTTCTTTTGCGTTTCTAACTTTGATTTTGGAGTCTTGTGAAACTTGTGTTTTTCTACTCTCTCTCAACCGGGGTGTTGGTCCTATGTGTGCTGCTTTTCAATATATTCTTTTGATAGGAGCTTTTAAGTTAATTACGAAAGATACTAAAGTAAGGAAGTCGCAACTCAAAATCAATACATTTGTTCATAAATGTCCTTGGATGTAATGAAGGATCTTTTTCTCAGTGCAGCTGACGTTAAGGATTGCATGCAGAAGGTTGAAACTCTTAGATACGGGGAGGAAGCATGCTATAATGGTGCACTAGTTATAAAGGCATTCAGCTCTGGT
CTTGAGATTGGCGCTTGTAACTGGACTATTAATTGCCCAAAGAGAGACATTGCATATATTTCAAGTTCTATCTTTTTTTCCTCCAATGCAATGGATTTTGATTACCTTGCTCTTCAGAAGGAGACAATTATTTATTCTGATTTCTCATCTCTGGAACTTATGAATGCCATAGAGAACGATACAAGAGTACCACTTATAGACAACAACTTATTGCCGCTCGGGTATTTTATGCTTCTGCTTTGAAATCTATCTTGCACTAAAAGTCTAGTATTTTTCTCACTCTTTTGGATAATTCATCAAATGTTTTTCCTGAGGGTATGTTTTTGTGTTATTTAGTAGTAATGAGGAAGCTTTGGCTAATTTATTGAGTGATCCTGCTGAGACCGTGGAGGAATCAGAAAAACTTTCTTTTATCTGTTCTTGTGCTATCCAATCTGTTGAATCTGGTGGTTCAGTCCTTATTCCTATGAATCGACTTGGTGTGACCCTGCAACTTCTAGAGCAGATATCAGCTTCACTAGATTATTCAAATCTGAAGGTTAGTAAACTCTTTTCCTTTTTTCTTTTTTATATTTAAACAGCACTCTTCAAAGTTCAAATGTAAGCTATAATCCTCTGAAATGGAAATCCATTCATCTAACCACCCTGGTTTGTTTGTATTTTAAAGAAAAACAACTAGTTCCACATATATTGTTATTTTTTAAGAAAGCCATCAAGAGCTGATATCCTTAGTTAGCCGATTGAGCAAACTTTGAATAATTGACATGAATGAAGGTAAGAGCTTAGAGGGTAGAGAGGGTATATATCATTAGAGCTGTAGAGCGTGAGCTTAGTTTTTATATTTGTTGGTTCCTTTCTTCATACCAAGTATGCAATTAAATTGTGTAGGTTCCTATATATTTTATTTCTTCTGTAGCTGAGGAGTTGTTGGCATTTGCCAATGTTATACCAGAGTGGTTATGCAAGCAAAGACAACAAAAGGTTTGAATTTGTTTGTTTAGTTGTTTCAATGTACTTGGAGCGTTTAGTCAAATTAGTGTGGTATAATGTTGCTCTTACATAATGGTTGGTGTTGCAGTTATTTTCTGGAGAGCCGGTGTTTGCATTTGTCGAGCTCCTTAAAGAGAAAAAGCTTCACGTCTTTCCTGCAGTTCATTCACCCAAATTATTGTATGGTTTAGCCTAAGCTTTCTTCTCACCTTCTTATGCAAACACAACAAACAATCTATAAGTTTCGTGCTGTGACATGTACTTCCAATTTTACAATTGTGGTAGTTCAATTTTCATTTACTTTTTTAAGAAACGGAGTAATTAAGATGAATACTTAGGAATGTAGGACTATCTAGTATGTGCCTTTGAAGTCTTTTTCTATCCTACCAGTTATCAAAATTAGAGAGTGAAGGAGGTTGGATTGTGAACTGGTTTACTATGTGGAGGGAATAATTACTTCCAAATGTTGTACAAACCTTTTCCTGTTTATTATAAATAGTTGTGAACAGCTTAGAGGCATAAACTTTTATGAGTAACTTTTGTTTTGGACCATGATAAACTTTGAGAATTGGATTCTGTGCAGAATGAACTGGCAGGAACCATGCATTGTATTTTGTCCTCATTGGAGCTTACGACTTGGTCCAGTGGTCCACTTGCTCCGACGTTGGTGTGGGGATCCTAGCTCTCTACTTGTTCTAGAGGTGCTTATGATTCTACGCACTTACTATTGTTTTTTACTTCCAAGAGTACATATTTTCAGTGGCTTTGATGTTAATACTCCACATATAATATGCAGAAGGGACTTAATGTTGAGCTTGCTCTCTTACCGTTTAAGCCAATGAATATGAAGGTCCTTCAATGTTCATTCCAATCTGGTATAAAGTATGGTTCATTTTTCTCCTCGTTCTCTCTAGTCCAATGTTCTTGTGCACATACAGAATGAGTTGCTTTATTTGCTTCAATATGTTTGGTCTTCCACTTTGACCTTTCTTCCTCCCCCCGCACCTCCT
GAGGAAAAAAAATATGATGCCATCTTCTATGATCAATATTTCTGCCTGAAGAGAAGATTAAAGTTCACTTGTTTGTAAAATAAACTAAAGAACTTGGAGCTCTCAAAAAATTGTTTGTAGGATCCCTGTAGAGTTGAGAGCTTCTAATAGGACACTTGGCATGAAGTGAGACCAAAACTAATATCTTCTATTGGTTTGTCTTCAAGGTTTTGAACTTGTGAATCACTCTTGAGTTCTACTGGTTAGTTTTCTTATTTCTTGCGCTTGTATACCCTATTTTGAAGGCTGGAGAAGGTACAACCGTTGCTGAAAGTCTTGCAGCCAAATTTTGCCGTGGTAGTGTATTTATTTACTCATTATCTCCTATTATTCTTGAGTTTATATATTTTTTGGACCTACATTTGCGTACCCGTGTTTACCTGATGTAGCTTCCTGAGAATTTGAGCCGGCTGATCAATTCAAATACAGAATCATTCACAGTCTTTTCGTACTCTGAAGGCGAAACCTTATGTGTACCAAACTTGAAAGACAGTTTAGAATTAGAGATCGCTTCAGACTCGGCTATGAGTTTCTGTTGGCGAAAGTTGCATCAAGGAAATATAGACATCACAAGATTGAAAGGGGAGCTCTCGTTAAATTGTGGGAAATTCAAGTTGTTCCCTGAAAATGCGCAAGTAGCCACGGATCAGAGGCCACTAATACACTGGGGTCAGCCAGATTTGGAAAAGCTTCTGACTGTGTTATCAAAGATGGGCATTGAGGGTTCTCTGCAGCAGGAAGTATCTGGTGCCGAGTCAAGCAACGTTCGTGTCATACACATACACGATCCTACTACAGGTGTGATAGAAATCCAGGAGTCAAGGACTATAATTAGTGTTGTTGATAAGACATTATCTGCTCGAATTTTTTATGCTCTAAATAGCGTCTTGGATGGAGTTTAGTAAGGACTCCCCTACTTTTAGCAAATTTACCTTGCACTGGACAGGTATGACATGAAGAGATGAAGTTTGAACTGCAATGGCACGGCAAAGTTTCTTGTTCGTTTTTTCTTGGTATGATGTAAAAGGAATAGTGTGCTATTTGGATATGAATTTAAAAGGGGAGTAAATGATTATAAAAATGTTAAGATGGCTTAGGATTGTATATGGACATGATTGATGAAAGTGAAGATATCGTATTTGTGATGGAATTTTT

mRNA sequence

ATGGAATTTACTTGTTTAAGCAAAGGTGGATGTTTCTATTTCCCACCATGTCATATGCTCAATATTTGTGGGTTTAGAATCCAATTTGACTGTCCTGTGGACTTTTCAGCTCTCCCTATCTTCTCCCCTGTTCCTTTTGATTTTGATGTTCTTTCAGATAAAGAACTATCAAGTCACCCGGGCCACGATTCTCTCAATTTGGAAAATGTGTCTGAGGAGAAAACTGAAAAGCCACTTGATGTGGGTTCTTTGATAAAAGCGGAGCCTTGCTACAAAATCATTAAGAACTTGTGTCTCTGGAACCCATCTTTCACTAATATTGTTTTGATTTCTAGTCCAATGGGCATGTTAGGACTACCCTTTTTGACTCGAGAGAAGGGGTTCTCTGCAAAGATATATGCGACAGAAGCAACTACAAGACTCGGTAAAATTATGATGGATGACCTTGTTGCAATGCATATGGAATTCAAACAGTTTTATGGATCTGAAGATGATGCTATCTTGCAGTGGATGAGGCCAGAAGAGCTAAAGCTGCTTCATCGTGCGCTAAGAGAAGTGGCGTTTGGGCAGGATGGAGCAGATCTTGGGGGTTGGATGCCCATGTATAGTGCAGCTGACGTTAAGGATTGCATGCAGAAGGTTGAAACTCTTAGATACGGGGAGGAAGCATGCTATAATGGTGCACTAGTTATAAAGGCATTCAGCTCTGGTCTTGAGATTGGCGCTTGTAACTGGACTATTAATTGCCCAAAGAGAGACATTGCATATATTTCAAGTTCTATCTTTTTTTCCTCCAATGCAATGGATTTTGATTACCTTGCTCTTCAGAAGGAGACAATTATTTATTCTGATTTCTCATCTCTGGAACTTATGAATGCCATAGAGAACGATACAAGAGTACCACTTATAGACAACAACTTATTGCCGCTCGGTAGTAATGAGGAAGCTTTGGCTAATTTATTGAGTGATCCTGCTGAGACCGTGGAGGAATCAGAAAAACTTTCTTTTATCTGTTCTTGTGCTATCCAATCTGTTGAATCTGGTGGTTCAGTCCTTATTCCTATGAATCGACTTGGTGTGACCCTGCAACTTCTAGAGCAGATATCAGCTTCACTAGATTATTCAAATCTGAAGGTTCCTATATATTTTATTTCTTCTGTAGCTGAGGAGTTGTTGGCATTTGCCAATGTTATACCAGAGTGGTTATGCAAGCAAAGACAACAAAAGTTATTTTCTGGAGAGCCGGTGTTTGCATTTGTCGAGCTCCTTAAAGAGAAAAAGCTTCACGTCTTTCCTGCAGTTCATTCACCCAAATTATTAATGAACTGGCAGGAACCATGCATTGTATTTTGTCCTCATTGGAGCTTACGACTTGGTCCAGTGGTCCACTTGCTCCGACGTTGGTGTGGGGATCCTAGCTCTCTACTTGTTCTAGAGAAGGGACTTAATGTTGAGCTTGCTCTCTTACCGTTTAAGCCAATGAATATGAAGGTCCTTCAATGTTCATTCCAATCTGGTATAAAGCTGGAGAAGGTACAACCGTTGCTGAAAGTCTTGCAGCCAAATTTTGCCGTGCTTCCTGAGAATTTGAGCCGGCTGATCAATTCAAATACAGAATCATTCACAGTCTTTTCGTACTCTGAAGGCGAAACCTTATGTGTACCAAACTTGAAAGACAGTTTAGAATTAGAGATCGCTTCAGACTCGGCTATGAGTTTCTGTTGGCGAAAGTTGCATCAAGGAAATATAGACATCACAAGATTGAAAGGGGAGCTCTCGTTAAATTGTGGGAAATTCAAGTTGTTCCCTGAAAATGCGCAAGTAGCCACGGATCAGAGGCCACTAATACACTGGGGTCAGCCAGATTTGGAAAAGCTTCTGACTGTGTTATCAAAGATGGGCATTGAGGGTTCTCTGCAGCAGGAAGTATCTGGTGCCGAGTCAAGCAACGTTCGTGTCATACACATACACGATCCTACTACAGGTGTGATAGAAATCCA
GGAGTCAAGGACTATAATTAGTGTTGTTGATAAGACATTATCTGCTCGAATTTTTTATGCTCTAAATAGCGTCTTGGATGGAGTTTAGTAAGGACTCCCCTACTTTTAGCAAATTTACCTTGCACTGGACAGGTATGACATGAAGAGATGAAGTTTGAACTGCAATGGCACGGCAAAGTTTCTTGTTCGTTTTTTCTTGGTATGATGTAAAAGGAATAGTGTGCTATTTGGATATGAATTTAAAAGGGGAGTAAATGATTATAAAAATGTTAAGATGGCTTAGGATTGTATATGGACATGATTGATGAAAGTGAAGATATCGTATTTGTGATGGAATTTTT

Coding sequence (CDS)

ATGGAATTTACTTGTTTAAGCAAAGGTGGATGTTTCTATTTCCCACCATGTCATATGCTCAATATTTGTGGGTTTAGAATCCAATTTGACTGTCCTGTGGACTTTTCAGCTCTCCCTATCTTCTCCCCTGTTCCTTTTGATTTTGATGTTCTTTCAGATAAAGAACTATCAAGTCACCCGGGCCACGATTCTCTCAATTTGGAAAATGTGTCTGAGGAGAAAACTGAAAAGCCACTTGATGTGGGTTCTTTGATAAAAGCGGAGCCTTGCTACAAAATCATTAAGAACTTGTGTCTCTGGAACCCATCTTTCACTAATATTGTTTTGATTTCTAGTCCAATGGGCATGTTAGGACTACCCTTTTTGACTCGAGAGAAGGGGTTCTCTGCAAAGATATATGCGACAGAAGCAACTACAAGACTCGGTAAAATTATGATGGATGACCTTGTTGCAATGCATATGGAATTCAAACAGTTTTATGGATCTGAAGATGATGCTATCTTGCAGTGGATGAGGCCAGAAGAGCTAAAGCTGCTTCATCGTGCGCTAAGAGAAGTGGCGTTTGGGCAGGATGGAGCAGATCTTGGGGGTTGGATGCCCATGTATAGTGCAGCTGACGTTAAGGATTGCATGCAGAAGGTTGAAACTCTTAGATACGGGGAGGAAGCATGCTATAATGGTGCACTAGTTATAAAGGCATTCAGCTCTGGTCTTGAGATTGGCGCTTGTAACTGGACTATTAATTGCCCAAAGAGAGACATTGCATATATTTCAAGTTCTATCTTTTTTTCCTCCAATGCAATGGATTTTGATTACCTTGCTCTTCAGAAGGAGACAATTATTTATTCTGATTTCTCATCTCTGGAACTTATGAATGCCATAGAGAACGATACAAGAGTACCACTTATAGACAACAACTTATTGCCGCTCGGTAGTAATGAGGAAGCTTTGGCTAATTTATTGAGTGATCCTGCTGAGACCGTGGAGGAATCAGAAAAACTTTCTTTTATCTGTTCTTGTGCTATCCAATCTGTTGAATCTGGTGGTTCAGTCCTTATTCCTATGAATCGACTTGGTGTGACCCTGCAACTTCTAGAGCAGATATCAGCTTCACTAGATTATTCAAATCTGAAGGTTCCTATATATTTTATTTCTTCTGTAGCTGAGGAGTTGTTGGCATTTGCCAATGTTATACCAGAGTGGTTATGCAAGCAAAGACAACAAAAGTTATTTTCTGGAGAGCCGGTGTTTGCATTTGTCGAGCTCCTTAAAGAGAAAAAGCTTCACGTCTTTCCTGCAGTTCATTCACCCAAATTATTAATGAACTGGCAGGAACCATGCATTGTATTTTGTCCTCATTGGAGCTTACGACTTGGTCCAGTGGTCCACTTGCTCCGACGTTGGTGTGGGGATCCTAGCTCTCTACTTGTTCTAGAGAAGGGACTTAATGTTGAGCTTGCTCTCTTACCGTTTAAGCCAATGAATATGAAGGTCCTTCAATGTTCATTCCAATCTGGTATAAAGCTGGAGAAGGTACAACCGTTGCTGAAAGTCTTGCAGCCAAATTTTGCCGTGCTTCCTGAGAATTTGAGCCGGCTGATCAATTCAAATACAGAATCATTCACAGTCTTTTCGTACTCTGAAGGCGAAACCTTATGTGTACCAAACTTGAAAGACAGTTTAGAATTAGAGATCGCTTCAGACTCGGCTATGAGTTTCTGTTGGCGAAAGTTGCATCAAGGAAATATAGACATCACAAGATTGAAAGGGGAGCTCTCGTTAAATTGTGGGAAATTCAAGTTGTTCCCTGAAAATGCGCAAGTAGCCACGGATCAGAGGCCACTAATACACTGGGGTCAGCCAGATTTGGAAAAGCTTCTGACTGTGTTATCAAAGATGGGCATTGAGGGTTCTCTGCAGCAGGAAGTATCTGGTGCCGAGTCAAGCAACGTTCGTGTCATACACATACACGATCCTACTACAGGTGTGATAGAAATCCA
GGAGTCAAGGACTATAATTAGTGTTGTTGATAAGACATTATCTGCTCGAATTTTTTATGCTCTAAATAGCGTCTTGGATGGAGTTTAG

Protein sequence

MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHPGHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLPFLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLHRALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEIGACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRVPLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGVTLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFVELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEKGLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTESFTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFKLFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPTTGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV
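The relationship between the CDS and the protein sequence can be checked by translating codons. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not name explicitly; `translate` is an illustrative helper, not part of any database tool. It is applied here only to the first ten and last four codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGAATTTACTTGTTTAAGCAAAGGTGGA"))
print(translate("GATGGAGTTTAG"))
```

The two fragments give `MEFTCLSKGG` and `DGV`, matching the start and end of the protein sequence listed above; the final `TAG` is the stop codon.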
Homology
BLAST of ClCG03G000095 vs. NCBI nr
Match: XP_038890023.1 (integrator complex subunit 9 isoform X1 [Benincasa hispida])

HSP 1 Score: 1249.6 bits (3232), Expect = 0.0e+00
Identity = 624/695 (89.78%), Positives = 652/695 (93.81%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLSKGGCFYFPPCHMLN+ GFRIQ DCP+DFSALPIFSPVPFDFDVLS+KE+SS+P
Sbjct: 1   MEFTCLSKGGCFYFPPCHMLNVFGFRIQIDCPMDFSALPIFSPVPFDFDVLSNKEISSYP 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           GH SLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNI+LISSPMGMLGLP
Sbjct: 61  GHGSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIILISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREKGF AKIYATEAT RLGKIMMDDLVAMHMEFKQFYGSEDD I QWMR EE KLLH
Sbjct: 121 FLTREKGFCAKIYATEATARLGKIMMDDLVAMHMEFKQFYGSEDDGISQWMRQEEPKLLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
           RALREVAFGQDGADLG WMPMYSAAD+KDC+QKVETLRYGEEACYNGALVIKAFSSGLEI
Sbjct: 181 RALREVAFGQDGADLGVWMPMYSAADIKDCLQKVETLRYGEEACYNGALVIKAFSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           GACNWTINCPKRDIAYISSSIF SSNAMDFDYLALQ+ETIIYSD SSLEL N +EN+TRV
Sbjct: 241 GACNWTINCPKRDIAYISSSIFSSSNAMDFDYLALQEETIIYSDCSSLELTNDVENNTRV 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
           PLID NLL L SNEE LANLL DPAET++E EKLSFICSCAIQSVESGGSVLIP+NR G+
Sbjct: 301 PLID-NLLAL-SNEEPLANLLCDPAETMDELEKLSFICSCAIQSVESGGSVLIPINRFGL 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
           TLQLLEQISASLDYSNLKVPIY ISSVAEELLAF NVIPEWLCKQRQQKLFSGEP+FAF 
Sbjct: 361 TLQLLEQISASLDYSNLKVPIYLISSVAEELLAFVNVIPEWLCKQRQQKLFSGEPMFAFD 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           ELLKEKKL VFPAVHSPK L+NWQEPCIVFCPHWSLRLGPVVHLL+RWCGDPSSLLVLEK
Sbjct: 421 ELLKEKKLQVFPAVHSPKSLLNWQEPCIVFCPHWSLRLGPVVHLLQRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL++ELALLPF+PM MKVLQCSFQSGIKLEKV+PLLKVLQP  AVLPENLSRLIN+NTES
Sbjct: 481 GLDIELALLPFRPMTMKVLQCSFQSGIKLEKVRPLLKVLQPKVAVLPENLSRLINTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVFSYSEGETL VPNLKDSLELEI SD A SFCWRKLHQGNI+I RLKGELSLNCGKFK
Sbjct: 541 FTVFSYSEGETLRVPNLKDSLELEITSDLATSFCWRKLHQGNINIARLKGELSLNCGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           LFPEN QV T+QRPLIHWG+PDLEKLLT+LSKMGIE SLQ E+S AESSNVRVI IHDPT
Sbjct: 601 LFPENTQVDTEQRPLIHWGRPDLEKLLTLLSKMGIEDSLQPEISDAESSNVRVIRIHDPT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
            GVIEIQESRTIISV DKTLSARIF ALNSVLDGV
Sbjct: 661 RGVIEIQESRTIISVADKTLSARIFDALNSVLDGV 693
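The Identity and Positives figures in an HSP header are counts over the alignment length, expressed as percentages. The sketch below recomputes the percentages reported for this hit and counts identical positions in the first 60-column segment of the alignment. The function names are illustrative, and treating a gap character `-` as a non-identity is an assumption about how such counts are normally made.

```python
from typing import Tuple


def count_identities(query: str, sbjct: str) -> Tuple[int, int]:
    """Return (identical columns, aligned columns) for two aligned strings."""
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    identical = sum(
        1 for q, s in zip(query, sbjct) if q == s and q != "-"
    )
    return identical, len(query)


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100 * count / total, 2)


# Header values for the Benincasa hispida hit (XP_038890023.1).
print(percent(624, 695))  # identity
print(percent(652, 695))  # positives

# First alignment segment (query residues 1-60) of the same hit.
query = "MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP"
sbjct = "MEFTCLSKGGCFYFPPCHMLNVFGFRIQIDCPMDFSALPIFSPVPFDFDVLSNKEISSYP"
print(count_identities(query, sbjct))
```

The header percentages come out as 89.78 and 93.81, and the first segment has 53 identical residues out of 60 columns.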

BLAST of ClCG03G000095 vs. NCBI nr
Match: XP_011655058.1 (integrator complex subunit 9 homolog isoform X1 [Cucumis sativus] >KAE8648065.1 hypothetical protein Csa_005798 [Cucumis sativus])

HSP 1 Score: 1225.7 bits (3170), Expect = 0.0e+00
Identity = 609/695 (87.63%), Positives = 641/695 (92.23%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLSKGGCFY PPCHMLN+CGFRIQFDCPVDFSAL IFSPVP D DVLSDKE SSH 
Sbjct: 1   MEFTCLSKGGCFYMPPCHMLNVCGFRIQFDCPVDFSALSIFSPVPSDLDVLSDKEPSSHL 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           GH SL+L+NVS+E TEKPLDVG LIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP
Sbjct: 61  GHGSLDLDNVSDE-TEKPLDVGYLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREKGFSAKIY TEAT RLGKIMMDDL+AMHMEFKQFYGSEDDAI QWMR E+L LLH
Sbjct: 121 FLTREKGFSAKIYVTEATARLGKIMMDDLIAMHMEFKQFYGSEDDAISQWMRQEDLSLLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
             LREVAFGQD AD GGWMPMYSAADVKDCMQKVETLRYGEE CYNG LVIKAFSSGLEI
Sbjct: 181 HKLREVAFGQDRADFGGWMPMYSAADVKDCMQKVETLRYGEETCYNGTLVIKAFSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           G+CNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSL  MN +ENDTRV
Sbjct: 241 GSCNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLAFMNDVENDTRV 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
            LIDN LLPL S EE LANLLS PAETVEESEKL FICSCAIQSVESGGSVLIP+NRLGV
Sbjct: 301 SLIDNTLLPLSSKEETLANLLSYPAETVEESEKLYFICSCAIQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
            LQLLEQISASLDYS+LKVPIYFISSVAEELL FAN IPEWLC+QRQ KLFSGEP+F FV
Sbjct: 361 NLQLLEQISASLDYSDLKVPIYFISSVAEELLTFANAIPEWLCRQRQHKLFSGEPMFTFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           ELLKE KLHV PA+HSPKLL+NWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 ELLKENKLHVVPAIHSPKLLINWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+VEL+LLPFKPM+MKVLQCSFQSGIK EKV+PLLKVLQP   VLPENLSRLIN+NTES
Sbjct: 481 GLDVELSLLPFKPMSMKVLQCSFQSGIKQEKVRPLLKVLQPKIVVLPENLSRLINTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVF+YSEG++L VPNLKDS ELEIASDSAMSFCWRKLHQGNI+ITRLKGELSLNCGKFK
Sbjct: 541 FTVFTYSEGKSLHVPNLKDSSELEIASDSAMSFCWRKLHQGNINITRLKGELSLNCGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           LF EN QVA  QRPL+HWGQP+LEKLLTVLSKMGIEGS+QQE+S AE ++V VIHIH  T
Sbjct: 601 LFSENTQVAMYQRPLVHWGQPNLEKLLTVLSKMGIEGSVQQEMSDAEPNDVHVIHIHGLT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
            GVIEIQESRTIISVVDKTLSA+IF AL+SV+DGV
Sbjct: 661 KGVIEIQESRTIISVVDKTLSAQIFNALDSVMDGV 694

BLAST of ClCG03G000095 vs. NCBI nr
Match: XP_004146463.1 (integrator complex subunit 9 homolog isoform X2 [Cucumis sativus])

HSP 1 Score: 1221.1 bits (3158), Expect = 0.0e+00
Identity = 609/695 (87.63%), Positives = 641/695 (92.23%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLSKGGCFY PPCHMLN+CGFRIQFDCPVDFSAL IFSPVP D DVLSDKE SSH 
Sbjct: 1   MEFTCLSKGGCFYMPPCHMLNVCGFRIQFDCPVDFSALSIFSPVPSDLDVLSDKEPSSHL 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           GH SL+L+NVS+E TEKPLDVG LIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP
Sbjct: 61  GHGSLDLDNVSDE-TEKPLDVGYLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREKGFSAKIY TEAT RLGKIMMDDL+AMHMEFKQFYGSEDDAI QWMR E+L LLH
Sbjct: 121 FLTREKGFSAKIYVTEATARLGKIMMDDLIAMHMEFKQFYGSEDDAISQWMRQEDLSLLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
             LREVAFGQD AD GGWMPMYSAADVKDCMQKVETLRYGEE CYNG LVIKAFSSGLEI
Sbjct: 181 HKLREVAFGQDRADFGGWMPMYSAADVKDCMQKVETLRYGEETCYNGTLVIKAFSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           G+CNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSL  MN +ENDTRV
Sbjct: 241 GSCNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLAFMNDVENDTRV 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
            LIDN LLPL S EE LANLLS PAETVEESEKL FICSCAIQSVESGGSVLIP+NRLGV
Sbjct: 301 SLIDNTLLPL-SKEETLANLLSYPAETVEESEKLYFICSCAIQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
            LQLLEQISASLDYS+LKVPIYFISSVAEELL FAN IPEWLC+QRQ KLFSGEP+F FV
Sbjct: 361 NLQLLEQISASLDYSDLKVPIYFISSVAEELLTFANAIPEWLCRQRQHKLFSGEPMFTFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           ELLKE KLHV PA+HSPKLL+NWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 ELLKENKLHVVPAIHSPKLLINWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+VEL+LLPFKPM+MKVLQCSFQSGIK EKV+PLLKVLQP   VLPENLSRLIN+NTES
Sbjct: 481 GLDVELSLLPFKPMSMKVLQCSFQSGIKQEKVRPLLKVLQPKIVVLPENLSRLINTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVF+YSEG++L VPNLKDS ELEIASDSAMSFCWRKLHQGNI+ITRLKGELSLNCGKFK
Sbjct: 541 FTVFTYSEGKSLHVPNLKDSSELEIASDSAMSFCWRKLHQGNINITRLKGELSLNCGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           LF EN QVA  QRPL+HWGQP+LEKLLTVLSKMGIEGS+QQE+S AE ++V VIHIH  T
Sbjct: 601 LFSENTQVAMYQRPLVHWGQPNLEKLLTVLSKMGIEGSVQQEMSDAEPNDVHVIHIHGLT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
            GVIEIQESRTIISVVDKTLSA+IF AL+SV+DGV
Sbjct: 661 KGVIEIQESRTIISVVDKTLSAQIFNALDSVMDGV 693

BLAST of ClCG03G000095 vs. NCBI nr
Match: XP_008452382.1 (PREDICTED: integrator complex subunit 9 homolog isoform X1 [Cucumis melo] >XP_016901329.1 PREDICTED: integrator complex subunit 9 homolog isoform X1 [Cucumis melo] >KAA0058380.1 integrator complex subunit 9-like protein isoform X1 [Cucumis melo var. makuwa] >TYK26881.1 integrator complex subunit 9-like protein isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 1206.0 bits (3119), Expect = 0.0e+00
Identity = 607/695 (87.34%), Positives = 641/695 (92.23%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLSKGGCFY PPCHMLN+CGFRIQFDCPVDFSALPIFSPVP D DVLSDKELSSH 
Sbjct: 1   MEFTCLSKGGCFYLPPCHMLNVCGFRIQFDCPVDFSALPIFSPVPSDLDVLSDKELSSHL 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           GH SL+L+NVSEE TEKPLDVGSLIKAEPCYKIIKN  LWNPSFTNIVLISSPMGMLGLP
Sbjct: 61  GHGSLDLDNVSEE-TEKPLDVGSLIKAEPCYKIIKN--LWNPSFTNIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREKGFSAKIYATEAT RLGKIMMDDL+AMHME KQFYGSEDDAI QWM  E+LKLLH
Sbjct: 121 FLTREKGFSAKIYATEATARLGKIMMDDLIAMHMEVKQFYGSEDDAISQWMGQEDLKLLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
             LREV FGQ+ ADL GWMP+YSA DVKDCMQKVETLRYGEE CYNG LVIKAFSSGLEI
Sbjct: 181 HKLREVTFGQNRADLSGWMPLYSADDVKDCMQKVETLRYGEETCYNGTLVIKAFSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           G+CNWTINCPKRDIAYISSSIFFSSN+M+FDYLALQ ETIIYSDFSSLE MN +ENDTRV
Sbjct: 241 GSCNWTINCPKRDIAYISSSIFFSSNSMEFDYLALQMETIIYSDFSSLEFMNDVENDTRV 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
           PLIDNNL PLG  EE LANLLS+ AETVEESEKL FICSCAIQSVESGGSVLIP+NRLGV
Sbjct: 301 PLIDNNLQPLG-KEETLANLLSNAAETVEESEKLYFICSCAIQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
            LQLLEQISASLDYSNLKVPIYFISSVAEELL F N IPEWLC+QRQQKLFSGEP+F FV
Sbjct: 361 NLQLLEQISASLDYSNLKVPIYFISSVAEELLTFVNAIPEWLCRQRQQKLFSGEPMFTFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           ELLKE KLHV PA+HSPKLL+NWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 ELLKENKLHVVPAIHSPKLLINWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+VELALLPFKPM+MKVLQCSFQSGIKLEKV+PLLKVLQP   VLPENLSRLI++NTES
Sbjct: 481 GLDVELALLPFKPMSMKVLQCSFQSGIKLEKVRPLLKVLQPKVVVLPENLSRLIDTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVFSYSEG++L VPNLKDS ELEIASDSAMSFCWRKLHQGNI+ITRLKGELSLN GKFK
Sbjct: 541 FTVFSYSEGKSLRVPNLKDSSELEIASDSAMSFCWRKLHQGNINITRLKGELSLNYGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           L  EN +VA  QRPLIHWGQP+LE LLTVLSKMGIEGS+QQE+S A S+NVRVIHIH  T
Sbjct: 601 LLSENTEVAMYQRPLIHWGQPNLENLLTVLSKMGIEGSVQQEMSDA-SNNVRVIHIHGLT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
           TG+IEIQESRTIISVVD+TLSA+IF AL+SVLDGV
Sbjct: 661 TGLIEIQESRTIISVVDRTLSAQIFNALDSVLDGV 690

BLAST of ClCG03G000095 vs. NCBI nr
Match: XP_022938639.1 (integrator complex subunit 9 homolog isoform X1 [Cucurbita moschata])

HSP 1 Score: 1188.7 bits (3074), Expect = 0.0e+00
Identity = 597/695 (85.90%), Positives = 635/695 (91.37%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLS+GGCFYFPPCHM  +CGFRIQFDCP+DFSALPIFSPVP DF V+SD+ELS+HP
Sbjct: 1   MEFTCLSRGGCFYFPPCHMFEVCGFRIQFDCPMDFSALPIFSPVPLDFYVISDEELSTHP 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           G+ S NLENVSEEK EKPLDVGSLIKAEP YKII NL LWNPSFT+IVLISSPMGMLGLP
Sbjct: 61  GNGSFNLENVSEEKIEKPLDVGSLIKAEPWYKIINNLRLWNPSFTDIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREK FSAKIYATEAT RLGK+MMDDL+AMHMEFKQFYGSEDDA  QWM+ EEL+LLH
Sbjct: 121 FLTREKDFSAKIYATEATARLGKMMMDDLIAMHMEFKQFYGSEDDATPQWMKQEELELLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
            AL+EVAFGQD ADLGGWMPMY AADVKDCM+KVET+RYGEEACYNGALVIKA SSGLEI
Sbjct: 181 HALKEVAFGQDEADLGGWMPMYCAADVKDCMKKVETVRYGEEACYNGALVIKALSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           GACNWTIN PKR+IAYISSSIF SSNAM+FDYLALQ+ETIIYSDFSS+E MN I NDT  
Sbjct: 241 GACNWTINGPKRNIAYISSSIFSSSNAMNFDYLALQEETIIYSDFSSVESMNDILNDTSG 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
           PL D NL  L SNEE LANLLSDPAE+V ESEKLSFICSCA+QSVESGGSVLIP+NRLGV
Sbjct: 301 PLTD-NLTALSSNEETLANLLSDPAESVGESEKLSFICSCAVQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
           TLQLLEQISASLDYSNLKVPIY ISSVAEELLAFANVIPEWL KQRQQKLFSGEP+FAFV
Sbjct: 361 TLQLLEQISASLDYSNLKVPIYLISSVAEELLAFANVIPEWLSKQRQQKLFSGEPMFAFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           +LLKEK+LHVFPAVHSP LL+NWQEPC+VFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 DLLKEKRLHVFPAVHSPNLLINWQEPCVVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+ ELALLPF+PM+MKVLQC+F SGIKL+KV+PLLKVLQP   +LPENLSRLIN+NTES
Sbjct: 481 GLDAELALLPFRPMSMKVLQCAFLSGIKLDKVRPLLKVLQPKVVMLPENLSRLINTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVFSYSEGETL VPNLKDSLELEIA D AMSFCWRKL QGNIDI RLKGELSLNCGKFK
Sbjct: 541 FTVFSYSEGETLRVPNLKDSLELEIAPDLAMSFCWRKLQQGNIDIARLKGELSLNCGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           L  ENA VATDQRPLIHWGQPDL+KLL VLSKMGIEGSLQQ  S AESSNV VI IHDPT
Sbjct: 601 LLAENAHVATDQRPLIHWGQPDLKKLLNVLSKMGIEGSLQQ--SDAESSNVGVIRIHDPT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
             VIEIQESRTIISV DK LSARIF A++SVLDGV
Sbjct: 661 EAVIEIQESRTIISVADKKLSARIFDAVDSVLDGV 692

BLAST of ClCG03G000095 vs. ExPASy Swiss-Prot
Match: A7SBF0 (Integrator complex subunit 9 homolog OS=Nematostella vectensis OX=45351 GN=ints9 PE=3 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 1.9e-58
Identity = 195/705 (27.66%), Positives = 314/705 (44.54%), Query Frame = 0

Query: 16  PCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDK--ELSSHPGHDSLNLENVSEE 75
           PC +L      I  DC +D S +  F+P+     V ++K  +L S    +   +E  + +
Sbjct: 13  PCLVLQFKQTNIMLDCGLDMSTVNQFTPLSL---VNNEKFSQLKSWSSRELQEIEGFTAQ 72

Query: 76  KTEKPLDVGSLIKAEPCYKIIKNLC-----LWNPSFTNIVLISSPMGMLGLPFLTREKGF 135
              K       I AEP       +C     L + S  +++LIS+   ML LPF+T   GF
Sbjct: 73  NNLKEAGGRLFIDAEP------EVCPPETGLIDFSMVDVILISNYHHMLALPFITEYSGF 132

Query: 136 SAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLHRALREVAF 195
           + KIYATE T ++G+ +M +LV       +           W     ++ L   L E+  
Sbjct: 133 NGKIYATEPTIQIGRDLMLELVTFAERVPKRRNGN-----MWKNDNVIRCLPAPLNEL-- 192

Query: 196 GQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEIGACNWTIN 255
               A++  W  +YS  DVK C+ K++ + Y E+    G L + A SSG  +G+ NW + 
Sbjct: 193 ----ANVKSWRVLYSKHDVKACISKIQAVSYSEKLDLCGILQLSAHSSGFCLGSSNWMLE 252

Query: 256 CPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRVPLIDNNLL 315
                I+Y+S S  F+++ +  +   L+   ++            I   T  P IDN   
Sbjct: 253 SEYEKISYLSPSSSFTTHPLPLNQTVLKNSDVL-----------IITGVTEAP-IDNPDA 312

Query: 316 PLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGVTLQLLEQI 375
            LG                          C+    ++ +GG+VL+P    GV   L E +
Sbjct: 313 MLGE------------------------FCTHLASTLRAGGNVLVPCYPSGVLYDLFECL 372

Query: 376 SASLDYSNL-KVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFVELLKEKK 435
              LD + L  VPIYFIS VA+  LA++N+  EWLC+ +Q K++  EP F   ELLKE +
Sbjct: 373 YTYLDNAKLGMVPIYFISPVADSSLAYSNIYGEWLCQSKQTKVYLPEPPFPHAELLKEAR 432

Query: 436 LHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVL-EKGLNVEL 495
           L VF  +H+     +++ PC+VF  H SLR G  VH +  W    ++ ++  E       
Sbjct: 433 LKVFSNLHN-GFSSSFKTPCVVFTGHPSLRYGDAVHFMEIWGKSGNNTVIFTEPDFPYLE 492

Query: 496 ALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSR--LINSNTESFTV- 555
           AL P++P+ MK   C     +   +   LLK LQP   V+PE+ SR  +I+ +    T+ 
Sbjct: 493 ALAPYQPLAMKTCYCPIDPRLNFAQANKLLKELQPRHLVMPESYSRPPVIHPHRTDLTIE 552

Query: 556 ------FSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLH-QGNIDITRLKGELSLNC 615
                  +++  +   +P  +   ++ IA++  +S C    H +  + +  L G L    
Sbjct: 553 DPGCSLTTFNHLDVAALPISRSFEKVVIANE--LSSCLHPQHVRPGVAVATLTGTLVTKD 612

Query: 616 GKFKLFP-----------ENAQVATDQRPLIH--WGQPDLEKLLTVLSKMGIEGSLQQEV 675
            K+ L P           E    +T++  L    WG   L+  +  L K GI   +  E 
Sbjct: 613 NKYTLQPLEFLVEPKAGSEGGDSSTNKGQLSRHLWGTVQLDDFVRSLKKRGIT-DVNVES 653

Query: 676 SGAESSNVRVIHIHDPTTGVIEIQESRTIISVVDKTLSARIFYAL 689
           SG E      IH+ +    ++  + S  II+  ++ L  RI  AL
Sbjct: 673 SGGE----HTIHLPNDDAMILLDRGSTHIITHGNEELRIRIRDAL 653

BLAST of ClCG03G000095 vs. ExPASy Swiss-Prot
Match: Q9NV88 (Integrator complex subunit 9 OS=Homo sapiens OX=9606 GN=INTS9 PE=1 SV=2)

HSP 1 Score: 217.6 bits (553), Expect = 4.4e-55
Identity = 175/676 (25.89%), Positives = 303/676 (44.82%), Query Frame = 0

Query: 16  PCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHPGHDSLNLENVSEEKT 75
           PC++L      I  DC +D ++   F P+P    ++    LS+ PG  SL   N   +K 
Sbjct: 13  PCNVLKFKSTTIMLDCGLDMTSTLNFLPLP----LVQSPRLSNLPGW-SLKDGNAFLDKE 72

Query: 76  EKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLPFLTREKGFSAKIYAT 135
            K       + + P +  +    L + S  +++LIS+   M+ LP++T   GF+  +YAT
Sbjct: 73  LKECSGHVFVDSVPEF-CLPETELIDLSTVDVILISNYHCMMALPYITEHTGFTGTVYAT 132

Query: 136 EATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLHRALREVAFGQDGADL 195
           E T ++G+++M++LV     F +       A L W   +  +LL   L+      D  ++
Sbjct: 133 EPTVQIGRLLMEELV----NFIERVPKAQSASL-WKNKDIQRLLPSPLK------DAVEV 192

Query: 196 GGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEIGACNWTINCPKRDIA 255
             W   Y+  +V   + K++ + Y ++    GA+ +   SSG  +G+ NW I      ++
Sbjct: 193 STWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVS 252

Query: 256 YISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRVPLIDNNLLPLGSNEE 315
           Y+S S   +++    D  +L+   ++            +   T++P              
Sbjct: 253 YVSGSSLLTTHPQPMDQASLKNSDVL-----------VLTGLTQIP-------------- 312

Query: 316 ALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGVTLQLLEQISASLDYS 375
                      T      +   CS    +V +GG+VL+P    GV   LLE +   +D +
Sbjct: 313 -----------TANPDGMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSA 372

Query: 376 NL-KVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFVELLKEKKLHVFPAV 435
            L  VP+YFIS VA   L F+ +  EWLC  +Q K++  EP F   EL++  KL  +P++
Sbjct: 373 GLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSI 432

Query: 436 HSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRW-CGDPSSLLVLEKGLNVELALLPFKP 495
           H      ++++PC+VF  H SLR G VVH +  W     ++++  E   +   AL P++P
Sbjct: 433 HG-DFSNDFRQPCVVFTGHPSLRFGDVVHFMELWGKSSLNTVIFTEPDFSYLEALAPYQP 492

Query: 496 MNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTESFTV--------FS 555
           + MK + C   + +   +V  LLK +QP   V PE  ++   + +    +         S
Sbjct: 493 LAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPPPAQSHRMDLMIDCQPPAMS 552

Query: 556 YSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFKLF--P 615
           Y   E L +P  +   ++EI  + A S    ++  G I +  +   L     K  L   P
Sbjct: 553 YRRAEVLALPFKRRYEKIEIMPELADSLVPMEIKPG-ISLATVSAVLHTKDNKHLLQPPP 612

Query: 616 ENAQ-VATDQRPLIHWGQPDLEKLLTVLSKMGIEGSL--QQEVSGAESSNVRVIHIHDPT 675
             AQ  +  +R  +    PD + L  +LS     GS+  +Q V   E      I + D  
Sbjct: 613 RPAQPTSGKKRKRVSDDVPDCKVLKPLLS-----GSIPVEQFVQTLEKHGFSDIKVEDTA 628

BLAST of ClCG03G000095 vs. ExPASy Swiss-Prot
Match: Q4R5Z4 (Integrator complex subunit 9 OS=Macaca fascicularis OX=9541 GN=INTS9 PE=2 SV=1)

HSP 1 Score: 215.3 bits (547), Expect = 2.2e-54
Identity = 175/676 (25.89%), Positives = 299/676 (44.23%), Query Frame = 0

Query: 16  PCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHPGHDSLNLENVSEEKT 75
           PC++L      I  DC +D ++   F P+P    ++    LSS PG  SL   N   +KT
Sbjct: 13  PCNVLKFKSTTIMLDCGLDMTSTLNFLPLP----LVQSPRLSSLPGW-SLKDGNAFLDKT 72

Query: 76  EKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLPFLTREKGFSAKIYAT 135
           E                      L + S  +++LIS+   M+ LP++T   GF+  +YAT
Sbjct: 73  E----------------------LIDLSTVDVILISNYHCMMALPYITEHTGFTGTVYAT 132

Query: 136 EATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLHRALREVAFGQDGADL 195
           E T ++G+++M++LV     F +       A L W   +  +LL   L+      D  ++
Sbjct: 133 EPTVQIGRLLMEELV----NFIERVPKAQSASL-WKNKDIQRLLPSPLK------DAVEV 192

Query: 196 GGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEIGACNWTINCPKRDIA 255
             W   Y+  +V   + K++ + + ++    GA+ +   SSG  +G+ NW I      ++
Sbjct: 193 STWRRCYTMQEVNSALSKIQLVGFSQKIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVS 252

Query: 256 YISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRVPLIDNNLLPLGSNEE 315
           Y+S S   +++    D  +L+   ++            +   T++P              
Sbjct: 253 YVSGSSLLTTHPQPMDQASLKNSDVL-----------VLTGLTQIP-------------- 312

Query: 316 ALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGVTLQLLEQISASLDYS 375
                      T      +   CS    +V +GG+VL+P    GV   LLE +   +D +
Sbjct: 313 -----------TANPDGMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSA 372

Query: 376 NL-KVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFVELLKEKKLHVFPAV 435
            L  VP+YFIS VA   L F+ +  EWLC  +Q K++  EP F   EL++  KL  +P++
Sbjct: 373 GLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSI 432

Query: 436 HSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRW-CGDPSSLLVLEKGLNVELALLPFKP 495
           H      ++++PC+VF  H SLR G VVH +  W     ++++  E   +   AL P++P
Sbjct: 433 HG-DFSNDFRQPCVVFTGHPSLRFGDVVHFMELWGKSSLNTVIFTEPDFSYLEALAPYQP 492

Query: 496 MNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTESFTV--------FS 555
           + MK + C   + +   +V  LLK +QP   V PE  ++   + +    +         S
Sbjct: 493 LAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPPPAQSHRMDLMIDCQPPAMS 552

Query: 556 YSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFKLF--P 615
           Y   E L +P  +   ++EI  + A S    ++  G I +  +   L     K  L   P
Sbjct: 553 YRRAEVLALPFKRRYEKIEIMPELADSLVPMEIKPG-ISLATVSAVLHTKDNKHLLQPPP 607

Query: 616 ENAQ-VATDQRPLIHWGQPDLEKLLTVLSKMGIEGSL--QQEVSGAESSNVRVIHIHDPT 675
             AQ  +  +R  +    PD + L  +LS     GS+  +Q V   E      I + D  
Sbjct: 613 RPAQPTSGKKRKRVSDDVPDCKVLKPLLS-----GSIPVEQFVQTLEKHGFSDIKVEDTA 607

BLAST of ClCG03G000095 vs. ExPASy Swiss-Prot
Match: Q2KJA6 (Integrator complex subunit 9 OS=Bos taurus OX=9913 GN=INTS9 PE=2 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 3.7e-54
Identity = 172/676 (25.44%), Positives = 297/676 (43.93%), Query Frame = 0

Query: 16  PCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHPGHDSLNLENVSEEKT 75
           PC++L      I  DC +D ++   F P+P    ++    LS+ PG  SL   N   +K 
Sbjct: 13  PCNVLKFKSTTIMLDCGLDMTSTLNFLPLP----LVQSPRLSNLPGW-SLKDGNAFLDKE 72

Query: 76  EKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLPFLTREKGFSAKIYAT 135
            K       + + P +  +    L + S  +++LIS+   M+ LP++T   GF+  +YAT
Sbjct: 73  LKECSGHVFVDSVPEF-CLPETELIDLSTVDVILISNYHCMMALPYITEHTGFTGTVYAT 132

Query: 136 EATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLHRALREVAFGQDGADL 195
           E T ++G+++M++LV     F +       A L W   +  +LL   L+      D  ++
Sbjct: 133 EPTVQIGRLLMEELV----NFIERVPKAQSASL-WKNKDIQRLLPSPLK------DAVEV 192

Query: 196 GGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEIGACNWTINCPKRDIA 255
             W   Y+  +V   + K++ + Y ++    GA+ +   SSG  +G+ NW I      ++
Sbjct: 193 STWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVS 252

Query: 256 YISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRVPLIDNNLLPLGSNEE 315
           Y+S S   +++    D  +L+   ++            +   T++P              
Sbjct: 253 YVSGSSLLTTHPQPMDQASLKNSDVL-----------ILTGLTQIP-------------- 312

Query: 316 ALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGVTLQLLEQISASLDYS 375
                      T      +   CS    +V +GG+VL+P    GV   LLE +   +D +
Sbjct: 313 -----------TANPDSMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSA 372

Query: 376 NL-KVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFVELLKEKKLHVFPAV 435
            L  +P YFIS VA   L F+ +  EWLC  +Q K++  EP F   EL++  KL  +P++
Sbjct: 373 GLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSI 432

Query: 436 HSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRW-CGDPSSLLVLEKGLNVELALLPFKP 495
           H      ++++PC+VF  H SLR G VVH +  W     ++++  E   +   AL P++P
Sbjct: 433 HG-DFSNDFRQPCVVFTGHPSLRFGDVVHFMELWGKSSLNTVIFTEPDFSYLEALAPYQP 492

Query: 496 MNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTESFTV--------FS 555
           + MK + C   + +   +V  LLK +QP   V PE  ++   + +    +         S
Sbjct: 493 LAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPTPAQSHRMDLMVDCQPPAMS 552

Query: 556 YSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFKLFP-- 615
           Y   E L +P  +   ++EI  + A S    ++  G I +  +   L     K  L P  
Sbjct: 553 YRRAEVLALPFKRRYEKIEIMPELADSLVPMEIKPG-ISLATVSAVLHTKDNKHVLQPPP 612

Query: 616 -ENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSL--QQEVSGAESSNVRVIHIHDPT 675
                    +R       PD + L  +LS     GS+   Q V   E      I + D  
Sbjct: 613 RPTQPTGGKKRKRASDDIPDCKVLKPLLS-----GSIPVDQFVQTLEKHGFSDIKVEDTA 628

BLAST of ClCG03G000095 vs. ExPASy Swiss-Prot
Match: Q8K114 (Integrator complex subunit 9 OS=Mus musculus OX=10090 GN=Ints9 PE=1 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 8.2e-54
Identity = 170/676 (25.15%), Positives = 300/676 (44.38%), Query Frame = 0

Query: 16  PCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHPGHDSLNLENVSEEKT 75
           PC++L      I  DC +D ++   F P+P    ++    LS+ PG  SL   N   +K 
Sbjct: 13  PCNVLKFKSTTIMLDCGLDMTSTLNFLPLP----LVQSPRLSNLPGW-SLKDGNAFLDKE 72

Query: 76  EKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLPFLTREKGFSAKIYAT 135
            K       + + P +  +    L + S  +++LIS+   M+ LP++T   GF+  +YAT
Sbjct: 73  LKECSGHVFVDSVPEF-CLPETELIDLSTVDVILISNYHCMMALPYITEHTGFTGTVYAT 132

Query: 136 EATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLHRALREVAFGQDGADL 195
           E T ++G+++M++LV     F +       A L W   +  +LL   L+      D  ++
Sbjct: 133 EPTMQIGRLLMEELV----NFIERVPKAQSASL-WKNKDIQRLLPSPLK------DAVEV 192

Query: 196 GGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEIGACNWTINCPKRDIA 255
             W   Y+  +V   + K++ + Y ++    GA+ +   SSG  +G+ NW I      ++
Sbjct: 193 STWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVS 252

Query: 256 YISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRVPLIDNNLLPLGSNEE 315
           Y+S S   +++    D  +L+   ++            +   T++P              
Sbjct: 253 YVSGSSLLTTHPQPMDQASLKNSDVL-----------ILTGLTQIP-------------- 312

Query: 316 ALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGVTLQLLEQISASLDYS 375
                      T      +   CS    +V +GG+VL+P    GV   LLE +   +D +
Sbjct: 313 -----------TANPDGMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSA 372

Query: 376 NL-KVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFVELLKEKKLHVFPAV 435
            L  +P YFIS VA   L F+ +  EWLC  +Q K++  EP F   EL++  KL  + ++
Sbjct: 373 GLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYRSI 432

Query: 436 HSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRW-CGDPSSLLVLEKGLNVELALLPFKP 495
           H      ++++PC++F  H SLR G VVH +  W     ++++  E   +   AL P++P
Sbjct: 433 HG-DFSNDFRQPCVLFTGHPSLRFGDVVHFMELWGKSSLNTIIFTEPDFSYLEALAPYQP 492

Query: 496 MNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTESFTV--------FS 555
           + MK + C   + +   +V  LLK +QP   V PE  ++   +      +         S
Sbjct: 493 LAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPPPAQAHRMDLMIDCQPPAMS 552

Query: 556 YSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFKLFP-- 615
           Y   E L +P  +   ++EI  + A S    ++  G I +  +   L     K  L P  
Sbjct: 553 YRRAEVLALPFKRRYEKIEIMPELADSLVPMEIKPG-ISLATVSAVLHTKDNKHVLQPPP 612

Query: 616 -ENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSL--QQEVSGAESSNVRVIHIHDPT 675
                 ++ +R  ++   PD + L  +LS     GS+  +Q V   E      I + D  
Sbjct: 613 KPTQPTSSKKRKRVNEDIPDCKVLKPLLS-----GSIPVEQFVQTLEKHGFSDIKVEDTA 628

BLAST of ClCG03G000095 vs. ExPASy TrEMBL
Match: A0A1S4DZC1 (integrator complex subunit 9 homolog isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493437 PE=3 SV=1)

HSP 1 Score: 1206.0 bits (3119), Expect = 0.0e+00
Identity = 607/695 (87.34%), Positives = 641/695 (92.23%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLSKGGCFY PPCHMLN+CGFRIQFDCPVDFSALPIFSPVP D DVLSDKELSSH 
Sbjct: 1   MEFTCLSKGGCFYLPPCHMLNVCGFRIQFDCPVDFSALPIFSPVPSDLDVLSDKELSSHL 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           GH SL+L+NVSEE TEKPLDVGSLIKAEPCYKIIKN  LWNPSFTNIVLISSPMGMLGLP
Sbjct: 61  GHGSLDLDNVSEE-TEKPLDVGSLIKAEPCYKIIKN--LWNPSFTNIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREKGFSAKIYATEAT RLGKIMMDDL+AMHME KQFYGSEDDAI QWM  E+LKLLH
Sbjct: 121 FLTREKGFSAKIYATEATARLGKIMMDDLIAMHMEVKQFYGSEDDAISQWMGQEDLKLLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
             LREV FGQ+ ADL GWMP+YSA DVKDCMQKVETLRYGEE CYNG LVIKAFSSGLEI
Sbjct: 181 HKLREVTFGQNRADLSGWMPLYSADDVKDCMQKVETLRYGEETCYNGTLVIKAFSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           G+CNWTINCPKRDIAYISSSIFFSSN+M+FDYLALQ ETIIYSDFSSLE MN +ENDTRV
Sbjct: 241 GSCNWTINCPKRDIAYISSSIFFSSNSMEFDYLALQMETIIYSDFSSLEFMNDVENDTRV 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
           PLIDNNL PLG  EE LANLLS+ AETVEESEKL FICSCAIQSVESGGSVLIP+NRLGV
Sbjct: 301 PLIDNNLQPLG-KEETLANLLSNAAETVEESEKLYFICSCAIQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
            LQLLEQISASLDYSNLKVPIYFISSVAEELL F N IPEWLC+QRQQKLFSGEP+F FV
Sbjct: 361 NLQLLEQISASLDYSNLKVPIYFISSVAEELLTFVNAIPEWLCRQRQQKLFSGEPMFTFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           ELLKE KLHV PA+HSPKLL+NWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 ELLKENKLHVVPAIHSPKLLINWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+VELALLPFKPM+MKVLQCSFQSGIKLEKV+PLLKVLQP   VLPENLSRLI++NTES
Sbjct: 481 GLDVELALLPFKPMSMKVLQCSFQSGIKLEKVRPLLKVLQPKVVVLPENLSRLIDTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVFSYSEG++L VPNLKDS ELEIASDSAMSFCWRKLHQGNI+ITRLKGELSLN GKFK
Sbjct: 541 FTVFSYSEGKSLRVPNLKDSSELEIASDSAMSFCWRKLHQGNINITRLKGELSLNYGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           L  EN +VA  QRPLIHWGQP+LE LLTVLSKMGIEGS+QQE+S A S+NVRVIHIH  T
Sbjct: 601 LLSENTEVAMYQRPLIHWGQPNLENLLTVLSKMGIEGSVQQEMSDA-SNNVRVIHIHGLT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
           TG+IEIQESRTIISVVD+TLSA+IF AL+SVLDGV
Sbjct: 661 TGLIEIQESRTIISVVDRTLSAQIFNALDSVLDGV 690

BLAST of ClCG03G000095 vs. ExPASy TrEMBL
Match: A0A5D3DT52 (Integrator complex subunit 9-like protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2133G00230 PE=3 SV=1)

HSP 1 Score: 1206.0 bits (3119), Expect = 0.0e+00
Identity = 607/695 (87.34%), Positives = 641/695 (92.23%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLSKGGCFY PPCHMLN+CGFRIQFDCPVDFSALPIFSPVP D DVLSDKELSSH 
Sbjct: 1   MEFTCLSKGGCFYLPPCHMLNVCGFRIQFDCPVDFSALPIFSPVPSDLDVLSDKELSSHL 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           GH SL+L+NVSEE TEKPLDVGSLIKAEPCYKIIKN  LWNPSFTNIVLISSPMGMLGLP
Sbjct: 61  GHGSLDLDNVSEE-TEKPLDVGSLIKAEPCYKIIKN--LWNPSFTNIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREKGFSAKIYATEAT RLGKIMMDDL+AMHME KQFYGSEDDAI QWM  E+LKLLH
Sbjct: 121 FLTREKGFSAKIYATEATARLGKIMMDDLIAMHMEVKQFYGSEDDAISQWMGQEDLKLLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
             LREV FGQ+ ADL GWMP+YSA DVKDCMQKVETLRYGEE CYNG LVIKAFSSGLEI
Sbjct: 181 HKLREVTFGQNRADLSGWMPLYSADDVKDCMQKVETLRYGEETCYNGTLVIKAFSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           G+CNWTINCPKRDIAYISSSIFFSSN+M+FDYLALQ ETIIYSDFSSLE MN +ENDTRV
Sbjct: 241 GSCNWTINCPKRDIAYISSSIFFSSNSMEFDYLALQMETIIYSDFSSLEFMNDVENDTRV 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
           PLIDNNL PLG  EE LANLLS+ AETVEESEKL FICSCAIQSVESGGSVLIP+NRLGV
Sbjct: 301 PLIDNNLQPLG-KEETLANLLSNAAETVEESEKLYFICSCAIQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
            LQLLEQISASLDYSNLKVPIYFISSVAEELL F N IPEWLC+QRQQKLFSGEP+F FV
Sbjct: 361 NLQLLEQISASLDYSNLKVPIYFISSVAEELLTFVNAIPEWLCRQRQQKLFSGEPMFTFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           ELLKE KLHV PA+HSPKLL+NWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 ELLKENKLHVVPAIHSPKLLINWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+VELALLPFKPM+MKVLQCSFQSGIKLEKV+PLLKVLQP   VLPENLSRLI++NTES
Sbjct: 481 GLDVELALLPFKPMSMKVLQCSFQSGIKLEKVRPLLKVLQPKVVVLPENLSRLIDTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVFSYSEG++L VPNLKDS ELEIASDSAMSFCWRKLHQGNI+ITRLKGELSLN GKFK
Sbjct: 541 FTVFSYSEGKSLRVPNLKDSSELEIASDSAMSFCWRKLHQGNINITRLKGELSLNYGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           L  EN +VA  QRPLIHWGQP+LE LLTVLSKMGIEGS+QQE+S A S+NVRVIHIH  T
Sbjct: 601 LLSENTEVAMYQRPLIHWGQPNLENLLTVLSKMGIEGSVQQEMSDA-SNNVRVIHIHGLT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
           TG+IEIQESRTIISVVD+TLSA+IF AL+SVLDGV
Sbjct: 661 TGLIEIQESRTIISVVDRTLSAQIFNALDSVLDGV 690

BLAST of ClCG03G000095 vs. ExPASy TrEMBL
Match: A0A6J1FEN8 (integrator complex subunit 9 homolog isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444811 PE=3 SV=1)

HSP 1 Score: 1188.7 bits (3074), Expect = 0.0e+00
Identity = 597/695 (85.90%), Positives = 635/695 (91.37%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLS+GGCFYFPPCHM  +CGFRIQFDCP+DFSALPIFSPVP DF V+SD+ELS+HP
Sbjct: 1   MEFTCLSRGGCFYFPPCHMFEVCGFRIQFDCPMDFSALPIFSPVPLDFYVISDEELSTHP 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           G+ S NLENVSEEK EKPLDVGSLIKAEP YKII NL LWNPSFT+IVLISSPMGMLGLP
Sbjct: 61  GNGSFNLENVSEEKIEKPLDVGSLIKAEPWYKIINNLRLWNPSFTDIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREK FSAKIYATEAT RLGK+MMDDL+AMHMEFKQFYGSEDDA  QWM+ EEL+LLH
Sbjct: 121 FLTREKDFSAKIYATEATARLGKMMMDDLIAMHMEFKQFYGSEDDATPQWMKQEELELLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
            AL+EVAFGQD ADLGGWMPMY AADVKDCM+KVET+RYGEEACYNGALVIKA SSGLEI
Sbjct: 181 HALKEVAFGQDEADLGGWMPMYCAADVKDCMKKVETVRYGEEACYNGALVIKALSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           GACNWTIN PKR+IAYISSSIF SSNAM+FDYLALQ+ETIIYSDFSS+E MN I NDT  
Sbjct: 241 GACNWTINGPKRNIAYISSSIFSSSNAMNFDYLALQEETIIYSDFSSVESMNDILNDTSG 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
           PL D NL  L SNEE LANLLSDPAE+V ESEKLSFICSCA+QSVESGGSVLIP+NRLGV
Sbjct: 301 PLTD-NLTALSSNEETLANLLSDPAESVGESEKLSFICSCAVQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
           TLQLLEQISASLDYSNLKVPIY ISSVAEELLAFANVIPEWL KQRQQKLFSGEP+FAFV
Sbjct: 361 TLQLLEQISASLDYSNLKVPIYLISSVAEELLAFANVIPEWLSKQRQQKLFSGEPMFAFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           +LLKEK+LHVFPAVHSP LL+NWQEPC+VFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 DLLKEKRLHVFPAVHSPNLLINWQEPCVVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+ ELALLPF+PM+MKVLQC+F SGIKL+KV+PLLKVLQP   +LPENLSRLIN+NTES
Sbjct: 481 GLDAELALLPFRPMSMKVLQCAFLSGIKLDKVRPLLKVLQPKVVMLPENLSRLINTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVFSYSEGETL VPNLKDSLELEIA D AMSFCWRKL QGNIDI RLKGELSLNCGKFK
Sbjct: 541 FTVFSYSEGETLRVPNLKDSLELEIAPDLAMSFCWRKLQQGNIDIARLKGELSLNCGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           L  ENA VATDQRPLIHWGQPDL+KLL VLSKMGIEGSLQQ  S AESSNV VI IHDPT
Sbjct: 601 LLAENAHVATDQRPLIHWGQPDLKKLLNVLSKMGIEGSLQQ--SDAESSNVGVIRIHDPT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
             VIEIQESRTIISV DK LSARIF A++SVLDGV
Sbjct: 661 EAVIEIQESRTIISVADKKLSARIFDAVDSVLDGV 692

BLAST of ClCG03G000095 vs. ExPASy TrEMBL
Match: A0A6J1FKC1 (integrator complex subunit 9 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444811 PE=3 SV=1)

HSP 1 Score: 1185.6 bits (3066), Expect = 0.0e+00
Identity = 596/695 (85.76%), Positives = 634/695 (91.22%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLS+GGCFYFPPCHM  +CGFRIQFDCP+DFSALPIFSPVP DF V+SD+ELS+HP
Sbjct: 1   MEFTCLSRGGCFYFPPCHMFEVCGFRIQFDCPMDFSALPIFSPVPLDFYVISDEELSTHP 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           G+ S NLENVSEEK EKPLDVGSLIKAEP YKII NL LWNPSFT+IVLISSPMGMLGLP
Sbjct: 61  GNGSFNLENVSEEKIEKPLDVGSLIKAEPWYKIINNLRLWNPSFTDIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTREK FSAKIYATEAT RLGK+MMDDL+AMHMEFKQFYGSEDDA  QWM+ EEL+LLH
Sbjct: 121 FLTREKDFSAKIYATEATARLGKMMMDDLIAMHMEFKQFYGSEDDATPQWMKQEELELLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
            AL+EVAFGQD ADLGGWMPMY AADVKDCM+KVET+RYGEEACYNGALVIKA SSGLEI
Sbjct: 181 HALKEVAFGQDEADLGGWMPMYCAADVKDCMKKVETVRYGEEACYNGALVIKALSSGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           GACNWTIN PKR+IAYISSSIF SSNAM+FDYLALQ+ETIIYSDFSS+E MN I NDT  
Sbjct: 241 GACNWTINGPKRNIAYISSSIFSSSNAMNFDYLALQEETIIYSDFSSVESMNDILNDTSG 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
           PL DN  L   SNEE LANLLSDPAE+V ESEKLSFICSCA+QSVESGGSVLIP+NRLGV
Sbjct: 301 PLTDN--LTALSNEETLANLLSDPAESVGESEKLSFICSCAVQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
           TLQLLEQISASLDYSNLKVPIY ISSVAEELLAFANVIPEWL KQRQQKLFSGEP+FAFV
Sbjct: 361 TLQLLEQISASLDYSNLKVPIYLISSVAEELLAFANVIPEWLSKQRQQKLFSGEPMFAFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           +LLKEK+LHVFPAVHSP LL+NWQEPC+VFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 DLLKEKRLHVFPAVHSPNLLINWQEPCVVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+ ELALLPF+PM+MKVLQC+F SGIKL+KV+PLLKVLQP   +LPENLSRLIN+NTES
Sbjct: 481 GLDAELALLPFRPMSMKVLQCAFLSGIKLDKVRPLLKVLQPKVVMLPENLSRLINTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
           FTVFSYSEGETL VPNLKDSLELEIA D AMSFCWRKL QGNIDI RLKGELSLNCGKFK
Sbjct: 541 FTVFSYSEGETLRVPNLKDSLELEIAPDLAMSFCWRKLQQGNIDIARLKGELSLNCGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           L  ENA VATDQRPLIHWGQPDL+KLL VLSKMGIEGSLQQ  S AESSNV VI IHDPT
Sbjct: 601 LLAENAHVATDQRPLIHWGQPDLKKLLNVLSKMGIEGSLQQ--SDAESSNVGVIRIHDPT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
             VIEIQESRTIISV DK LSARIF A++SVLDGV
Sbjct: 661 EAVIEIQESRTIISVADKKLSARIFDAVDSVLDGV 691

BLAST of ClCG03G000095 vs. ExPASy TrEMBL
Match: A0A6J1JXZ8 (integrator complex subunit 9 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488899 PE=3 SV=1)

HSP 1 Score: 1184.1 bits (3062), Expect = 0.0e+00
Identity = 596/695 (85.76%), Positives = 635/695 (91.37%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           MEFTCLS+GG FYFPPCHML +CGFRIQFDCP+DFSALPIFSPVP DFDV+SD+ELS+HP
Sbjct: 1   MEFTCLSRGGFFYFPPCHMLEVCGFRIQFDCPMDFSALPIFSPVPLDFDVISDEELSTHP 60

Query: 61  GHDSLNLENVSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLGLP 120
           G+ S NLENVSEEK EKPLDVGSLIKAEP YKIIKNL LWN SFT+IVLISSPMGMLGLP
Sbjct: 61  GNGSFNLENVSEEKIEKPLDVGSLIKAEPWYKIIKNLRLWNLSFTDIVLISSPMGMLGLP 120

Query: 121 FLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKLLH 180
           FLTR+KGFSAKIYATEAT RLGK+MMDDLVAMHMEFKQFYGSEDDA  QWMR EEL+LLH
Sbjct: 121 FLTRQKGFSAKIYATEATARLGKMMMDDLVAMHMEFKQFYGSEDDATPQWMRQEELELLH 180

Query: 181 RALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGLEI 240
            AL+EVAFGQD ADLGGWMPMYSAADVKDCM+KVET+RYGEEACYNGALVIKA S GLEI
Sbjct: 181 HALKEVAFGQDEADLGGWMPMYSAADVKDCMKKVETVRYGEEACYNGALVIKALSCGLEI 240

Query: 241 GACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQKETIIYSDFSSLELMNAIENDTRV 300
           GACNWTIN PKRDIAYISSSIF SSNAM+FDYLALQ ETIIYSDFSS+E MN I NDT  
Sbjct: 241 GACNWTINGPKRDIAYISSSIFSSSNAMNFDYLALQGETIIYSDFSSVESMNDILNDTSG 300

Query: 301 PLIDNNLLPLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMNRLGV 360
            L + NL+ L  NEE LANLLSDPAE+V ESEKLSFICSCAIQSVESGGSVLIP+NRLGV
Sbjct: 301 SLTE-NLMALSRNEETLANLLSDPAESVGESEKLSFICSCAIQSVESGGSVLIPINRLGV 360

Query: 361 TLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPVFAFV 420
           TLQLLEQISASLD SNLKVPIY ISSVAEELLA ANVIPEWLCKQRQ+KLFSGEP+FAFV
Sbjct: 361 TLQLLEQISASLDCSNLKVPIYLISSVAEELLALANVIPEWLCKQRQEKLFSGEPMFAFV 420

Query: 421 ELLKEKKLHVFPAVHSPKLLMNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480
           +LLKEKKLH FPAVHSPKLL+NWQEPC+VFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK
Sbjct: 421 DLLKEKKLHAFPAVHSPKLLINWQEPCVVFCPHWSLRLGPVVHLLRRWCGDPSSLLVLEK 480

Query: 481 GLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSRLINSNTES 540
           GL+ ELALLPF+PM+MKVLQC+F SGIKL+KV+PLLKVLQP   +LPENLSRLIN+NTES
Sbjct: 481 GLDAELALLPFRPMSMKVLQCAFLSGIKLDKVRPLLKVLQPKVVMLPENLSRLINTNTES 540

Query: 541 FTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQGNIDITRLKGELSLNCGKFK 600
            TVFSYSEGETL VPNLKDSLELEIA D AMSFCWRKL QGNIDI RLKGELSLNCGKFK
Sbjct: 541 CTVFSYSEGETLRVPNLKDSLELEIAPDLAMSFCWRKLQQGNIDIARLKGELSLNCGKFK 600

Query: 601 LFPENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAESSNVRVIHIHDPT 660
           LFPENA VATDQRPLIHWGQPDL+KLL VLSKMGIEGSLQQ  S AESSNV VI IHDPT
Sbjct: 601 LFPENAHVATDQRPLIHWGQPDLKKLLNVLSKMGIEGSLQQ--SDAESSNVGVIRIHDPT 660

Query: 661 TGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
             VIEIQ+SRTIISV DKTL ARIF A++S+L+GV
Sbjct: 661 EAVIEIQDSRTIISVADKTLCARIFDAVDSILNGV 692

BLAST of ClCG03G000095 vs. TAIR 10
Match: AT3G07530.1 (CONTAINS InterPro DOMAIN/s: Beta-Casp domain (InterPro:IPR022712); BEST Arabidopsis thaliana protein match is: cleavage and polyadenylation specificity factor 73 kDa subunit-II (TAIR:AT2G01730.1); Has 624 Blast hits to 615 proteins in 160 species: Archae - 54; Bacteria - 6; Metazoa - 333; Fungi - 44; Plants - 93; Viruses - 0; Other Eukaryotes - 94 (source: NCBI BLink). )

HSP 1 Score: 670.6 bits (1729), Expect = 1.3e-192
Identity = 341/707 (48.23%), Positives = 480/707 (67.89%), Query Frame = 0

Query: 1   MEFTCLSKGGCFYFPPCHMLNICGFRIQFDCPVDFSALPIFSPVPFDFDVLSDKELSSHP 60
           ME TCLSKG  F++PPCHMLN+CGFRI  DCP+D SA+ IFSPVP         E S + 
Sbjct: 1   MELTCLSKGDGFHYPPCHMLNLCGFRILIDCPLDLSAIKIFSPVPSGV----GSEASEYL 60

Query: 61  GHDSLNLEN--VSEEKTEKPLDVGSLIKAEPCYKIIKNLCLWNPSFTNIVLISSPMGMLG 120
             +SL+ +N    ++K E+ L    L+  EP YK +K L LW  SF +IVLIS+PMG+LG
Sbjct: 61  SDESLDAQNPIQKKQKLERQLTCADLVCEEPWYKTVKALHLWEASFIDIVLISNPMGLLG 120

Query: 121 LPFLTREKGFSAKIYATEATTRLGKIMMDDLVAMHMEFKQFYGSEDDAILQWMRPEELKL 180
           LPFLT+  GF AKIY TE T ++G++MM+D+V+MH EF+ F+G ++ +   W++  + + 
Sbjct: 121 LPFLTQNPGFFAKIYMTEVTAKIGQLMMEDIVSMHKEFRCFHGPDNSSFPGWIKNLDSEQ 180

Query: 181 LHRALREVAFGQDGADLGGWMPMYSAADVKDCMQKVETLRYGEEACYNGALVIKAFSSGL 240
           +   L++V FG+ G DLG WM +YS  D++ CM+KV+ +++ EE CYNG L+IKA SSGL
Sbjct: 181 VPALLKKVVFGESGDDLGSWMRLYSLDDIESCMKKVQGVKFAEEVCYNGTLIIKALSSGL 240

Query: 241 EIGACNWTINCPKRDIAYISSSIFFSSNAMDFDYLALQK-ETIIYSDFSSLELMNAIEND 300
           +IGACNW IN P   ++Y+S SIF S +A  FD+  L++ + +IYSDFSSL+     E+ 
Sbjct: 241 DIGACNWLINGPNGSLSYVSDSIFVSHHARSFDFHGLKETDVLIYSDFSSLQSAEVTEDG 300

Query: 301 TRVPLIDNNLL-PLGSNEEALANLLSDPAETVEESEKLSFICSCAIQSVESGGSVLIPMN 360
              P  DNN +  +  N+++L N      +++EE EKL+F+CSCA +S ++GGS LI + 
Sbjct: 301 CISPDSDNNYISTISDNKDSLLN----TEDSLEEMEKLAFVCSCAAESADAGGSTLITIT 360

Query: 361 RLGVTLQLLEQISASLDYSNLKVPIYFISSVAEELLAFANVIPEWLCKQRQQKLFSGEPV 420
           R+G+ LQLLE +S SL+ S+LKVPI+ ISSVAEELLA+ N IPEWLC+QRQ+KL SGEP 
Sbjct: 361 RIGIVLQLLELLSNSLESSSLKVPIFVISSVAEELLAYTNTIPEWLCEQRQEKLISGEPS 420

Query: 421 FAFVELLKEKKLHVFPAVHSPKLL----MNWQEPCIVFCPHWSLRLGPVVHLLRRWCGDP 480
           F  ++ +K KK+H+FPA+HSP L+     +WQEPCIVF  HWSLRLGP V LL+RW GDP
Sbjct: 421 FGHLKFIKNKKIHLFPAIHSPNLIYANRTSWQEPCIVFASHWSLRLGPSVQLLQRWRGDP 480

Query: 481 SSLLVLEKGLNVELALLPFKPMNMKVLQCSFQSGIKLEKVQPLLKVLQPNFAVLPENLSR 540
            SLLVLE G++  L LLPF+P+ MK+LQCSF SGI+L+K+  L+ VLQP   ++P+ +++
Sbjct: 481 KSLLVLEDGISSGLGLLPFRPIAMKILQCSFLSGIRLQKLPTLVSVLQPKIFLVPDAVNQ 540

Query: 541 LIN-SNTESFTVFSYSEGETLCVPNLKDSLELEIASDSAMSFCWRKLHQ-GNIDITRLKG 600
            I+ +  ++ ++ +Y E +TL VP + D+  +EI +D A    WRKL Q  +  I RLKG
Sbjct: 541 RISLAAIKTISILNYFENKTLHVPRIVDNPSVEITTDLASKLSWRKLRQRESFGIARLKG 600

Query: 601 ELSLNCGKFKLFP--ENAQVATDQRPLIHWGQPDLEKLLTVLSKMGIEGSLQQEVSGAES 660
            L +  GK +L    E  + +   RPL HWG    E LL  L KMGI+GS++Q      S
Sbjct: 601 GLLMEDGKHRLVSGLEQEESSGKARPLRHWGSVAPELLLDALLKMGIKGSIEQSTGDNGS 660

Query: 661 SNVRVIHIHDPTTGVIEIQESRTIISVVDKTLSARIFYALNSVLDGV 696
            +  +IHI +P +G+IE  E  T I   D+ + +++F A++ VLDG+
Sbjct: 661 EDKSIIHIENPNSGLIEFSEMGTAIITGDENVVSQVFQAIDGVLDGI 699

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038890023.1  0.0e+00   89.78     integrator complex subunit 9 isoform X1 [Benincasa hispida]
XP_011655058.1  0.0e+00   87.63     integrator complex subunit 9 homolog isoform X1 [Cucumis sativus] >KAE8648065.1 ...
XP_004146463.1  0.0e+00   87.63     integrator complex subunit 9 homolog isoform X2 [Cucumis sativus]
XP_008452382.1  0.0e+00   87.34     PREDICTED: integrator complex subunit 9 homolog isoform X1 [Cucumis melo] >XP_01...
XP_022938639.1  0.0e+00   85.90     integrator complex subunit 9 homolog isoform X1 [Cucurbita moschata]

Match Name      E-value   Identity  Description
A7SBF0          1.9e-58   27.66     Integrator complex subunit 9 homolog OS=Nematostella vectensis OX=45351 GN=ints9...
Q9NV88          4.4e-55   25.89     Integrator complex subunit 9 OS=Homo sapiens OX=9606 GN=INTS9 PE=1 SV=2
Q4R5Z4          2.2e-54   25.89     Integrator complex subunit 9 OS=Macaca fascicularis OX=9541 GN=INTS9 PE=2 SV=1
Q2KJA6          3.7e-54   25.44     Integrator complex subunit 9 OS=Bos taurus OX=9913 GN=INTS9 PE=2 SV=1
Q8K114          8.2e-54   25.15     Integrator complex subunit 9 OS=Mus musculus OX=10090 GN=Ints9 PE=1 SV=1

Match Name      E-value   Identity  Description
A0A1S4DZC1      0.0e+00   87.34     integrator complex subunit 9 homolog isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5D3DT52      0.0e+00   87.34     Integrator complex subunit 9-like protein isoform X1 OS=Cucumis melo var. makuwa...
A0A6J1FEN8      0.0e+00   85.90     integrator complex subunit 9 homolog isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1FKC1      0.0e+00   85.76     integrator complex subunit 9 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC1114...
A0A6J1JXZ8      0.0e+00   85.76     integrator complex subunit 9 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488...

Match Name      E-value   Identity  Description
AT3G07530.1     1.3e-192  48.23     CONTAINS InterPro DOMAIN/s: Beta-Casp domain (InterPro:IPR022712); BEST Arabidop...

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                                       Source       Source Term    Source Description                Alignment
IPR022712  Beta-Casp domain                                      SMART        SM01027        Beta_Casp_2                       coord: 361..482; e-value: 1.9E-8; score: 44.1
IPR022712  Beta-Casp domain                                      PFAM         PF10996        Beta-Casp                         coord: 363..478; e-value: 4.3E-7; score: 30.2
IPR036866  Ribonuclease Z/Hydroxyacylglutathione hydrolase-like  GENE3D       3.60.15.10     -                                 coord: 13..287; e-value: 9.1E-28; score: 98.8
IPR036866  Ribonuclease Z/Hydroxyacylglutathione hydrolase-like  SUPERFAMILY  56281          Metallo-hydrolase/oxidoreductase  coord: 1..529
None       No IPR available                                      GENE3D       3.40.50.10890  -                                 coord: 331..482; e-value: 5.8E-34; score: 119.0
IPR001279  Metallo-beta-lactamase                                PFAM         PF16661        Lactamase_B_6                     coord: 105..257; e-value: 2.4E-8; score: 33.6
IPR027074  Integrator complex subunit 9                          PANTHER      PTHR46094      INTEGRATOR COMPLEX SUBUNIT 9      coord: 3..692
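
The alignment coordinates in the InterPro rows can be compared with one another, for instance to see how long each hit is or whether one hit lies inside another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive positions on the protein (an assumption of this example, not something the record states); the helper names `parse_coord`, `span_length`, and `contains` are hypothetical.

```python
def parse_coord(text):
    """Parse a string such as 'coord: 361..482' into (start, end) integers."""
    span = text.split(":", 1)[1].strip()
    start, end = span.split("..")
    return int(start), int(end)


def span_length(span):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


if __name__ == "__main__":
    smart_beta_casp = parse_coord("coord: 361..482")  # SMART SM01027
    pfam_beta_casp = parse_coord("coord: 363..478")   # PFAM PF10996
    print(span_length(smart_beta_casp), span_length(pfam_beta_casp))
    print(contains(smart_beta_casp, pfam_beta_casp))
```

Under that assumption, the Pfam Beta-Casp hit (363..478) falls entirely within the SMART Beta-Casp hit (361..482), so the two rows describe the same region of the protein.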

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG03G000095.1  ClCG03G000095.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016180      snRNA processing
cellular_component  GO:0032039      integrator complex