CmoCh18G010640.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh18G010640.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: PAX3- and PAX7-binding protein 1
Location: Cmo_Chr18: 11275065 .. 11281185 (+)
Sequence length: 3361
RNA-Seq Expression: CmoCh18G010640.1
Synteny: CmoCh18G010640.1
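
The location and sequence length above can be cross-checked with a few lines of Python. This is only a sketch: it assumes the coordinates are 1-based and inclusive (the usual GFF3 convention, which this page does not state) and that the listed sequence length refers to the spliced mRNA. `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cmo_Chr18: 11275065 .. 11281185 (+)")

genomic_span = end - start + 1          # inclusive coordinates (assumption)
mrna_length = 3361                      # "Sequence length" from the overview
non_mrna = genomic_span - mrna_length   # bases in the span but not in the mRNA

print(chrom, strand, genomic_span, non_mrna)
```

Under those assumptions the feature spans 6121 bp on the plus strand of Cmo_Chr18, of which 2760 bp are absent from the 3361 nt transcript, which would correspond to the total intron length.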
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAAAAAACCCCGCTAGAAATGAAATGCACGAAACAAGAAACCGACTAAAATCCAGGTCCAAAAACCCTAGCAAAGATCCACGGCTTCTGAATCTGAATCCCGACCTTCGCAATTGCCAATTCATTCTGCTCGTTTCGTTTCATCGCACAATTCCTTCGTAATTGTGTCTAGCAATCACCATGTCCGGCAGTCGAGCTAGAAACTTTCGCCGCCGGGCCGACGACAATGACGACGACGACGAACCCAATGGCGCCGCCGCCCCCTCAACCGGTGTTTCAAACGCCTCATCAAAAGCCGCTTCCACCTCCTCTACCGTCGCCAATAAACCCAAGAAGGCTAATCCTCAGGTACCCAAGCTTCTCAGCTTCGCCAGTGACGAAGAGAACGATGCCCCACTTCGCACTTCTTCGAAACCAGCCAATTCCAAGAAGCCTTCTTCTGCTCGACTTGCTAAGCCTTCCTCCACCCACAAGATCACTGCCTTGAAGGATCGCATCGCTCACTCTTCTTCAACGTCGGCTTCTGTTCCCTCCAATGTGCAACCTCAAGCTGGAACTTATACCAAGGAGGCCCTTCGCGAGTTGCAGAAGAATACTCGCACGCTTGCGAGCTCGAGATCCTCATCCGAGTCTAAGCCTTCCGCCGAACCTGTTATTGTTCTGAAGGGTCTTCTCAAGCCTGTCGAACAAATTTCGGACAGTGCTAAAGAAGGTAAAGAGTCGAGCTCCGAAGATGAAGAAGGGGGTAGTAACGAAAAGAGCGCTGGTTCGTTTCGGAGGAGTAAAGAAGACGCCTTAGCTCGAATGGCTTCAATGGGGATTGGCAGAGGAAAGGATTCAACTGGGTCAATTCCCGATCAAGCAACCATTAATGCAATTCGCGCGAAAAGGGAACGTATGCGACAGGCTGGGGTTGCAGCTCCGGATTATATATCCTTAGATGCAGGAAGCAACCGCACGGCGCCGGGGGAGTTGAGCGACGAAGAAACAGAGTTTCCGGGGAGAATCGCCATGATTGGAGGGAAGTCAGCAAGTTCGAAGAAGGGCGTGTTTGAGGAATTTGATGAGCAAGCTATTGATGGGGTAAGAACGAATATTATTGAGCACAGCGATGAGGACGAGGAGGAGAAAATATGGGAGGCAGAGCAGTTTAGGAAGGGACTGGGTAAGAGAATGGATGATGGTTCTACTAGGGTGGAGAGCTCAAGTGTCCCTCTCATTCCGAGTGTTCCGCAGCAGAACTTAATTTACCCCACTACAGCTGGGTATAATTCGGTGCCTAGCATATCTACAGCTACAAGTATTGGAGGTTCTGTTGGTGTTTCACAGGGTTTGGATGGGCTATCAATATCTCAGCAAGCTGAGATTGCTAAAAAAGCTATGCGAGATAATATGGGGAGGCTTAAGGTATGCATATTTTCCCCTCATTTCAGTCTTGTGCTATGGAGGTATAAACGTGAAACTTACAGGAGATTCTAGGAATATGTTAAATGATATTTGAGTTAATAGGGCTTTCAGTATCTTGGTTTATTATATACATGGCATGTGTTACTTGGAATGTAGTTTACTTTGAATAGCCTATCTCCAAAATAATGTGTTATCAATCATGAAGTGCATATTGACTAACGTTGCTTGTGGCTATGGATCTCTGTGTATGCATGACCTCCATTCTCTTTAATACTCAGGAATCTTATCGAAGAACTGCAGCATCTGTGTTGAAGACGGACGAAAATTTGTCTGCATCTCTTTTGAATATCACAGCCCTTGAAAAGTCTCTTTCTGCTGCCGGCGAGAAGTTTATATTTATGCAAAAGCTTCGTGACTTTGTTTCTGTTATCTGTGACTTTTTGCAGGTACAGATATCCTATCTTACGTCCTACATGTTTTATGGAAATAGAAATTTGATTGTCATGCATGTTTGACTAGAACTAGAGAACCACATTACATTTGGTTGTTTATTACATGTTGGCTAAAAGGTGTTAGAAACATGTAGCAACTTTA
TGATGATTGGTTTATTTCGAACTTAATATATGACAAGCTAGATAAAGCCTTATTCTCTTTTAGTTTCTAATTTTCACAAATAACTTGATGCTTCCTATGGATATATAAAGTAGTCCTTCTATGGCTACAGTTAACCGCAGCTTTCGCTCTTTTCTGTTTCTGAAGGTGAAATTAGAACAATTATTGTATGTCATGGACATGTTAGATGGACAAGTCTCAACAGAATTGGCTATTTCACCTTTGTCTGTTGTATCTTTTTATGAGCTATATATATGGCCAAGCAATAAAAAGCTTGTTATGTTACGAATATACACAATGTTACTGTACTTGTAAGATATATGAGTTTCTGAATAGTCTGCCACAGAAAAGAGAGAACAAAAATTATAGGACAGAGTAGAAATTAAATAGCAGTCATGCTTTGAAGATGCCATGTATATATGGTTTTCGAAGTGTTATTACCTTTTGATACTTCCTCTCCCATAGATTTTTCATTTGGATATCTAGCTTACAGTTTGGGCTTTTGTTAGAGACAGTAGGGCTTGGTTGGAGGAGGTGCTTTTGAATTCTTGCTTGAAGGAGAAGGGCAGAGTATAGTGGCAGTCTAGCTTCTTTGCAATGTTGTGCGACATTTTGATTGAGAGGAATAATACTATTTTTAGAGAGGTCGAAAGATATGATGTGAAAGTTTTGGAGGTGGCTAGGTTTAACGCCTTCTTTGGGACGTCAGCCGCACGGCTTTATACAATTATGAGTGTAGTGTCGTTCTTTTGGATTCAAATCCTTTTTTTATAATTAGGGCCAAACTCCTTTTTTGTTCGGGCTTATTTTTCAAATGCTTTTGTCTATACTAACATTTTTCTTCTTGGTTTCTTCCGGAAAGGGGGCTATTGTGTTGCATCATACGTGTTAAGGGATAGAGTTAAAGCAAATACTCAATGTAGCATTTTTTTTCCTGTTCGCATTATTTTGAATGCACTCGGCTATTCTCATGAGACAAATGCTTGACTTTACTACATTTGGTATCAAGGTGACTTGTAACATATTAAATCCTAGGTAGCTGGCCATCATGATTGAACCCATTCCCTTTGGGGCTTTTATTAATTATACATGGCTCATTTTGACCATTGGGCCAACCTAGGGTGGTTTAGTGTATTTGGTAGTTAGCTACGGAGACTTGATCCATTACTTTATTGAAGGCATCTAAGTTAACAACTTTGACTTGAATGCATGAATCAAACATGATAGAAAATGTAGCCGCACCTCAAAAGTCCAAAGTTAGTCGTTTGGTAGTTTATTGAAGGATTGAACCTTGCTGTAGTGAAAAATATGTCATGTTTGATTATTCTTGTTTTGTTGGTCAAGTTTCAAAGCGTCTTTTGCATCTCTTGGACTTTCACTTAGACTGGGATGCTTTACTGAAAGAGTTACGAGGGCATGAAATTTTTTTTTGATCGTTAAGAAAAAGACATTCTTCGTGCCTGCAAAACAGTTTAATTACTAACGCATTTAAGGGGTTTCAATAGTACAAGAGCATTTGGGATGTTTGTAATTTAGATGGTTAGCATGTGCCTTTCCTATTATTAAGACTTGCCTTCCTCTTCACTTTTCCCCCTTGCTGGATATGCTTGTAAACTAGAACTATAGGCCTCTCAACTTTGCGTCACTGAGATGTTAGGTCCTATATTAGGTTGGCTTTTTAATTCAAGCTGTTTTATCTCCTTTCTTTAATTAACTATTTTGTGTGTTTCTGTGCTTTCAGCATAAAGCTCCATTCATAGAGGAGCTTGAGGAGCAGATGCAAAAACTTCACGAAGAACGGGCTTCTACAGTAGTGGAAAGAAGAGTAGCTGATAATGATGATGAAATGGTGGAGATAGAAGCAGCTGTAAAAGCAGCAATGTCAATCTTGAATAAGAAGGGGAGCAGCAATGAAATGATTGCTGCAGCCACAAGTGCTGCCCAGGCAGCAATTGCCTCTGCAAAGGAACAGGCAAATTTAC
CAACAAAGGTAGATGAATTTGGTAGGGATTTAAATTTGCAAAAACGTATGGATATGAAAAGAAGGGCTGAGGCTCGAAAGCGCCGGAGAGCAAAGTACGATTCCAAGAGACTTGCATCCACAGAAGTTGATGGCCATCAAAAAGTAGAAGGAGAGTCTAGCACTGATGAGAGTGATAGTGAGGCTGCAGCTTACCAGTCAAACCATGATTTATTACTTCAGACTGCTGATCAAATTTTTAGTGATGCAGCTGAGGAATTCTCCCAACTTTCTGTGGTGAAACAGAGGTTTGAACAATGGAAGAGAGATTATTCAGCAACGTACCGTGACGCATATATGTCATTAAGCACTGCTGCTATCTTCTCTCCTTATGTGAGATTGGAACTCTTGAAGTGGGATCCCCTGCATGAAAATGCAGATTTTTTTGACATGAACTGGTATGATATTATTAAGTATTACAGTGCTAATATACATAGCATTTATACTCAAGTCAATATGAAAAATAATCATGAATGGTAGAGGAATTTTCTCCGATCAGCTGGTTGTGGTACTTTCTTAGTGAAGTTAAAGTCATACATAATTTAAATTGCTCGTTTACCCATCCTTATTTTGAATTTTCAGGCACTCTTTGCTGTTCAATTATGGTATGCCGGAAGATGGTAGTGATTTTGCTCCAAATGATGCCGATGCTAACCTTGTCCCAGAACTAGTTGAGAAGGTTGCACTTCCAATATTGCACCATGAAGTTGCTCATTGTTGGGACATGCTTAGCACACGTGAAACCCGAAATGCAGCTTTTGCTACTAGCTTGATTACTAACTATGTTCCAACATCAAGTGAAGCTCTTACGGAATTATTGGTTGTTATTCGTACTCGTTTATCAAGCGCTGTTGAAGATCTTACGGTATGAATTTGGTTTTGCACTTTTCAATAATATGGTTATCTTTATATTATATTATCATGGAAACCAACATTTCTGCGTGAATTGAATGCATATATCCTCAACTTCACTGTACTGTGAGCCATTATGCGTTCATGTTTAATTCTCAATGGATTATTTCATCTAATCTGCTGATGTTGGATCATGTTTACAAAATATAGGTTCCTACTTGGAGTGCACTGGTGATGAAAGCTGTTCCAAATGCTGCTCGAATTGCAGCATATCGGTTTGGCATATCCGTTCGTTTGATGAGAAACATATGTTTGTGGAAAGAAATTATTGCATTGCCCATTTTAGAAAAGCTTGCCCTTGAAGAGCTCTTGTATGGGAAAGTTCTACCTCATGTTAGAAGCATCACAGCGAACATACATGATGCAGTCACAAGAACTGAGAGAATCATTGCTTCTCTATCAGGAGTGTGGACAGGCCCCGGCGTCACCGGTGATCGCAGGTTTGGATATTCATACGTTATTCACTCTTTTGTTTGGACCTGAAATTTGGCTGGCTAAATAATATCAGTGAAGGGAATACGCACGTGAACGATGCTTATAAGCGTTTAAGTTGCATTCCTTTTGTTTATGAATTTATCTCTCTGTTTCATACAAAGATGATACCCATAAGCTTCTATACATTTTTGTTTTGCAGTCACAAGTTGCAACCATTGGTAGACTATGTTATGCTACTGGGAAGAACATTGGAGAAAAAACATATTTCAGGCATAGCTGAGAGCGAGACGAGCGGACTAGCTCGGCGATTAAAGAAGATGCTAGTTGAGCTGAATGAATATGACAATGCAAGAGACATTGCTAAGACCTTCCATCTCAGGGAGGCACTATGAGCTCGAACGAGCGTCTGGTGTGATCACAAGATAGGACAATGTACCTTGTGTATATGCTTTTATGGAACATTGATGAAGTTTATTGTCTCAAAGATTATCCTGATTCTTACTGAATTGACATCATTGATCTAGAGAATGCGGCATTAGAGTCTGAAAAGAGCTAACTGCAGAGCTGTGAACCCATTAGCATTTGATGAGTTTATTTGGTACGATCGTTATGGCC
ATTTTTGATAGCTCCCTCCATTTGGTCTTCGTCATTGATACTCGACCTTCATTCTTAGAGCCTGCTAAACACATTAGTTGGGTCAGCCCATTTCTTTAAACTTTCAATTTTTTTCCCATCG

mRNA sequence

GAAAAAACCCCGCTAGAAATGAAATGCACGAAACAAGAAACCGACTAAAATCCAGGTCCAAAAACCCTAGCAAAGATCCACGGCTTCTGAATCTGAATCCCGACCTTCGCAATTGCCAATTCATTCTGCTCGTTTCGTTTCATCGCACAATTCCTTCGTAATTGTGTCTAGCAATCACCATGTCCGGCAGTCGAGCTAGAAACTTTCGCCGCCGGGCCGACGACAATGACGACGACGACGAACCCAATGGCGCCGCCGCCCCCTCAACCGGTGTTTCAAACGCCTCATCAAAAGCCGCTTCCACCTCCTCTACCGTCGCCAATAAACCCAAGAAGGCTAATCCTCAGGTACCCAAGCTTCTCAGCTTCGCCAGTGACGAAGAGAACGATGCCCCACTTCGCACTTCTTCGAAACCAGCCAATTCCAAGAAGCCTTCTTCTGCTCGACTTGCTAAGCCTTCCTCCACCCACAAGATCACTGCCTTGAAGGATCGCATCGCTCACTCTTCTTCAACGTCGGCTTCTGTTCCCTCCAATGTGCAACCTCAAGCTGGAACTTATACCAAGGAGGCCCTTCGCGAGTTGCAGAAGAATACTCGCACGCTTGCGAGCTCGAGATCCTCATCCGAGTCTAAGCCTTCCGCCGAACCTGTTATTGTTCTGAAGGGTCTTCTCAAGCCTGTCGAACAAATTTCGGACAGTGCTAAAGAAGGTAAAGAGTCGAGCTCCGAAGATGAAGAAGGGGGTAGTAACGAAAAGAGCGCTGGTTCGTTTCGGAGGAGTAAAGAAGACGCCTTAGCTCGAATGGCTTCAATGGGGATTGGCAGAGGAAAGGATTCAACTGGGTCAATTCCCGATCAAGCAACCATTAATGCAATTCGCGCGAAAAGGGAACGTATGCGACAGGCTGGGGTTGCAGCTCCGGATTATATATCCTTAGATGCAGGAAGCAACCGCACGGCGCCGGGGGAGTTGAGCGACGAAGAAACAGAGTTTCCGGGGAGAATCGCCATGATTGGAGGGAAGTCAGCAAGTTCGAAGAAGGGCGTGTTTGAGGAATTTGATGAGCAAGCTATTGATGGGGTAAGAACGAATATTATTGAGCACAGCGATGAGGACGAGGAGGAGAAAATATGGGAGGCAGAGCAGTTTAGGAAGGGACTGGGTAAGAGAATGGATGATGGTTCTACTAGGGTGGAGAGCTCAAGTGTCCCTCTCATTCCGAGTGTTCCGCAGCAGAACTTAATTTACCCCACTACAGCTGGGTATAATTCGGTGCCTAGCATATCTACAGCTACAAGTATTGGAGGTTCTGTTGGTGTTTCACAGGGTTTGGATGGGCTATCAATATCTCAGCAAGCTGAGATTGCTAAAAAAGCTATGCGAGATAATATGGGGAGGCTTAAGGAATCTTATCGAAGAACTGCAGCATCTGTGTTGAAGACGGACGAAAATTTGTCTGCATCTCTTTTGAATATCACAGCCCTTGAAAAGTCTCTTTCTGCTGCCGGCGAGAAGTTTATATTTATGCAAAAGCTTCGTGACTTTGTTTCTGTTATCTGTGACTTTTTGCAGCATAAAGCTCCATTCATAGAGGAGCTTGAGGAGCAGATGCAAAAACTTCACGAAGAACGGGCTTCTACAGTAGTGGAAAGAAGAGTAGCTGATAATGATGATGAAATGGTGGAGATAGAAGCAGCTGTAAAAGCAGCAATGTCAATCTTGAATAAGAAGGGGAGCAGCAATGAAATGATTGCTGCAGCCACAAGTGCTGCCCAGGCAGCAATTGCCTCTGCAAAGGAACAGGCAAATTTACCAACAAAGGTAGATGAATTTGGTAGGGATTTAAATTTGCAAAAACGTATGGATATGAAAAGAAGGGCTGAGGCTCGAAAGCGCCGGAGAGCAAAGTACGATTCCAAGAGACTTGCATCCACAGAAGTTGATGGCCATCAAAAAGTAGAAGGAGAGTCTAGCACTGATGAGAGTGATAGTGAGGCT
GCAGCTTACCAGTCAAACCATGATTTATTACTTCAGACTGCTGATCAAATTTTTAGTGATGCAGCTGAGGAATTCTCCCAACTTTCTGTGGTGAAACAGAGGTTTGAACAATGGAAGAGAGATTATTCAGCAACGTACCGTGACGCATATATGTCATTAAGCACTGCTGCTATCTTCTCTCCTTATGTGAGATTGGAACTCTTGAAGTGGGATCCCCTGCATGAAAATGCAGATTTTTTTGACATGAACTGGCACTCTTTGCTGTTCAATTATGGTATGCCGGAAGATGGTAGTGATTTTGCTCCAAATGATGCCGATGCTAACCTTGTCCCAGAACTAGTTGAGAAGGTTGCACTTCCAATATTGCACCATGAAGTTGCTCATTGTTGGGACATGCTTAGCACACGTGAAACCCGAAATGCAGCTTTTGCTACTAGCTTGATTACTAACTATGTTCCAACATCAAGTGAAGCTCTTACGGAATTATTGGTTGTTATTCGTACTCGTTTATCAAGCGCTGTTGAAGATCTTACGGTTCCTACTTGGAGTGCACTGGTGATGAAAGCTGTTCCAAATGCTGCTCGAATTGCAGCATATCGGTTTGGCATATCCGTTCGTTTGATGAGAAACATATGTTTGTGGAAAGAAATTATTGCATTGCCCATTTTAGAAAAGCTTGCCCTTGAAGAGCTCTTGTATGGGAAAGTTCTACCTCATGTTAGAAGCATCACAGCGAACATACATGATGCAGTCACAAGAACTGAGAGAATCATTGCTTCTCTATCAGGAGTGTGGACAGGCCCCGGCGTCACCGGTGATCGCAGTCACAAGTTGCAACCATTGGTAGACTATGTTATGCTACTGGGAAGAACATTGGAGAAAAAACATATTTCAGGCATAGCTGAGAGCGAGACGAGCGGACTAGCTCGGCGATTAAAGAAGATGCTAGTTGAGCTGAATGAATATGACAATGCAAGAGACATTGCTAAGACCTTCCATCTCAGGGAGGCACTATGAGCTCGAACGAGCGTCTGGTGTGATCACAAGATAGGACAATGTACCTTGTGTATATGCTTTTATGGAACATTGATGAAGTTTATTGTCTCAAAGATTATCCTGATTCTTACTGAATTGACATCATTGATCTAGAGAATGCGGCATTAGAGTCTGAAAAGAGCTAACTGCAGAGCTGTGAACCCATTAGCATTTGATGAGTTTATTTGGTACGATCGTTATGGCCATTTTTGATAGCTCCCTCCATTTGGTCTTCGTCATTGATACTCGACCTTCATTCTTAGAGCCTGCTAAACACATTAGTTGGGTCAGCCCATTTCTTTAAACTTTCAATTTTTTTCCCATCG

Coding sequence (CDS)

ATGTCCGGCAGTCGAGCTAGAAACTTTCGCCGCCGGGCCGACGACAATGACGACGACGACGAACCCAATGGCGCCGCCGCCCCCTCAACCGGTGTTTCAAACGCCTCATCAAAAGCCGCTTCCACCTCCTCTACCGTCGCCAATAAACCCAAGAAGGCTAATCCTCAGGTACCCAAGCTTCTCAGCTTCGCCAGTGACGAAGAGAACGATGCCCCACTTCGCACTTCTTCGAAACCAGCCAATTCCAAGAAGCCTTCTTCTGCTCGACTTGCTAAGCCTTCCTCCACCCACAAGATCACTGCCTTGAAGGATCGCATCGCTCACTCTTCTTCAACGTCGGCTTCTGTTCCCTCCAATGTGCAACCTCAAGCTGGAACTTATACCAAGGAGGCCCTTCGCGAGTTGCAGAAGAATACTCGCACGCTTGCGAGCTCGAGATCCTCATCCGAGTCTAAGCCTTCCGCCGAACCTGTTATTGTTCTGAAGGGTCTTCTCAAGCCTGTCGAACAAATTTCGGACAGTGCTAAAGAAGGTAAAGAGTCGAGCTCCGAAGATGAAGAAGGGGGTAGTAACGAAAAGAGCGCTGGTTCGTTTCGGAGGAGTAAAGAAGACGCCTTAGCTCGAATGGCTTCAATGGGGATTGGCAGAGGAAAGGATTCAACTGGGTCAATTCCCGATCAAGCAACCATTAATGCAATTCGCGCGAAAAGGGAACGTATGCGACAGGCTGGGGTTGCAGCTCCGGATTATATATCCTTAGATGCAGGAAGCAACCGCACGGCGCCGGGGGAGTTGAGCGACGAAGAAACAGAGTTTCCGGGGAGAATCGCCATGATTGGAGGGAAGTCAGCAAGTTCGAAGAAGGGCGTGTTTGAGGAATTTGATGAGCAAGCTATTGATGGGGTAAGAACGAATATTATTGAGCACAGCGATGAGGACGAGGAGGAGAAAATATGGGAGGCAGAGCAGTTTAGGAAGGGACTGGGTAAGAGAATGGATGATGGTTCTACTAGGGTGGAGAGCTCAAGTGTCCCTCTCATTCCGAGTGTTCCGCAGCAGAACTTAATTTACCCCACTACAGCTGGGTATAATTCGGTGCCTAGCATATCTACAGCTACAAGTATTGGAGGTTCTGTTGGTGTTTCACAGGGTTTGGATGGGCTATCAATATCTCAGCAAGCTGAGATTGCTAAAAAAGCTATGCGAGATAATATGGGGAGGCTTAAGGAATCTTATCGAAGAACTGCAGCATCTGTGTTGAAGACGGACGAAAATTTGTCTGCATCTCTTTTGAATATCACAGCCCTTGAAAAGTCTCTTTCTGCTGCCGGCGAGAAGTTTATATTTATGCAAAAGCTTCGTGACTTTGTTTCTGTTATCTGTGACTTTTTGCAGCATAAAGCTCCATTCATAGAGGAGCTTGAGGAGCAGATGCAAAAACTTCACGAAGAACGGGCTTCTACAGTAGTGGAAAGAAGAGTAGCTGATAATGATGATGAAATGGTGGAGATAGAAGCAGCTGTAAAAGCAGCAATGTCAATCTTGAATAAGAAGGGGAGCAGCAATGAAATGATTGCTGCAGCCACAAGTGCTGCCCAGGCAGCAATTGCCTCTGCAAAGGAACAGGCAAATTTACCAACAAAGGTAGATGAATTTGGTAGGGATTTAAATTTGCAAAAACGTATGGATATGAAAAGAAGGGCTGAGGCTCGAAAGCGCCGGAGAGCAAAGTACGATTCCAAGAGACTTGCATCCACAGAAGTTGATGGCCATCAAAAAGTAGAAGGAGAGTCTAGCACTGATGAGAGTGATAGTGAGGCTGCAGCTTACCAGTCAAACCATGATTTATTACTTCAGACTGCTGATCAAATTTTTAGTGATGCAGCTGAGGAATTCTCCCAACTTTCTGTGGTGAAACAGAGGTTTGAACAATGGAAGAGAGATTATTCAGCAACGTACCGTGACGCATATATGTCATTAAGCACTGCTGCTATCTTCTC
TCCTTATGTGAGATTGGAACTCTTGAAGTGGGATCCCCTGCATGAAAATGCAGATTTTTTTGACATGAACTGGCACTCTTTGCTGTTCAATTATGGTATGCCGGAAGATGGTAGTGATTTTGCTCCAAATGATGCCGATGCTAACCTTGTCCCAGAACTAGTTGAGAAGGTTGCACTTCCAATATTGCACCATGAAGTTGCTCATTGTTGGGACATGCTTAGCACACGTGAAACCCGAAATGCAGCTTTTGCTACTAGCTTGATTACTAACTATGTTCCAACATCAAGTGAAGCTCTTACGGAATTATTGGTTGTTATTCGTACTCGTTTATCAAGCGCTGTTGAAGATCTTACGGTTCCTACTTGGAGTGCACTGGTGATGAAAGCTGTTCCAAATGCTGCTCGAATTGCAGCATATCGGTTTGGCATATCCGTTCGTTTGATGAGAAACATATGTTTGTGGAAAGAAATTATTGCATTGCCCATTTTAGAAAAGCTTGCCCTTGAAGAGCTCTTGTATGGGAAAGTTCTACCTCATGTTAGAAGCATCACAGCGAACATACATGATGCAGTCACAAGAACTGAGAGAATCATTGCTTCTCTATCAGGAGTGTGGACAGGCCCCGGCGTCACCGGTGATCGCAGTCACAAGTTGCAACCATTGGTAGACTATGTTATGCTACTGGGAAGAACATTGGAGAAAAAACATATTTCAGGCATAGCTGAGAGCGAGACGAGCGGACTAGCTCGGCGATTAAAGAAGATGCTAGTTGAGCTGAATGAATATGACAATGCAAGAGACATTGCTAAGACCTTCCATCTCAGGGAGGCACTATGA

Protein sequence

MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKLLSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNVQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESSTDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL
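
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS with the standard genetic code. The sketch below builds the codon table by hand rather than relying on a library; `translate` is an illustrative helper, and only the first 36 and last 12 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGTCCGGCAGTCGAGCTAGAAACTTTCGCCGCCGG"  # first 36 nt of the CDS
cds_end = "GAGGCACTATGA"                            # last 12 nt of the CDS

print(translate(cds_start))  # MSGSRARNFRRR
print(translate(cds_end))    # EAL
```

Both fragments agree with the protein sequence above, which begins MSGSRARNFRRR and ends in EAL, with TGA as the stop codon.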
Homology
BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match: Q9FNN3 (Transcriptional repressor ILP1 OS=Arabidopsis thaliana OX=3702 GN=ILP1 PE=1 SV=1)

HSP 1 Score: 897.1 bits (2317), Expect = 1.7e-259
Identity = 540/964 (56.02%), Positives = 691/964 (71.68%), Query Frame = 0

Query: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
           M  +R +NFRRR DD  D+ +   A   S   S  SS    T S  A+ PKK      KL
Sbjct: 1   MGSNRPKNFRRRGDDGGDEIDGKVATPSSKPTSTLSSSKPKTLS--ASAPKK------KL 60

Query: 61  LSFASD--EENDAPLRTSSKPANSKK--PSSARLAKPSSTHKITALKDRIAHSSSTSASV 120
           LSFA D  EE D   R + KP N +    SS+RL    S+H+ ++ K+R   S       
Sbjct: 61  LSFADDEEEEEDGAPRVTIKPKNGRDRVKSSSRLGVSGSSHRHSSTKERRPAS------- 120

Query: 121 PSNVQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLK-PVEQISDSA 180
            SNV PQAG+Y+KEAL ELQKNTRTL  SRSS+    +AEP +VLKGL+K P +    S 
Sbjct: 121 -SNVLPQAGSYSKEALLELQKNTRTLPYSRSSA----NAEPKVVLKGLIKPPQDHEQQSL 180

Query: 181 KEGKESSSE---DEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINA 240
           K+  +  S+   DEEG   +          EDA A                  DQA I  
Sbjct: 181 KDVVKQVSDLDFDEEGEEEQ---------HEDAFA------------------DQAAI-- 240

Query: 241 IRAKRERMRQAGVA-APDYISLDAG-SNRTAPGELSDEETEFPGRIAMIGGK-SASSKKG 300
           IRAK+ERMRQ+  A APDYISLD G  N +A   +SDE+ +F G    +G +     KKG
Sbjct: 241 IRAKKERMRQSRSAPAPDYISLDGGIVNHSAVEGVSDEDADFQG--IFVGPRPQKDDKKG 300

Query: 301 VFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSS---VPL 360
           VF+  DE       T    + DEDEE+K+WE EQF+KG+GKRMD+GS R  +S+   VPL
Sbjct: 301 VFDFGDENPTAKETTTSSIYEDEDEEDKLWEEEQFKKGIGKRMDEGSHRTVTSNGIGVPL 360

Query: 361 ---IPSVPQQN-LIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMR 420
                ++PQQ   +Y   AG   +P++S A +IG +  V    D L +SQQAE+AKKA++
Sbjct: 361 HSKQQTLPQQQPQMYAYHAG-TPMPNVSVAPTIGPATSV----DTLPMSQQAELAKKALK 420

Query: 421 DNMGRLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICD 480
           DN+ +LKES+ +T +S+ KTDENL+ASL++ITALE SLSAAG+K++FMQKLRDF+SVICD
Sbjct: 421 DNVKKLKESHAKTLSSLTKTDENLTASLMSITALESSLSAAGDKYVFMQKLRDFISVICD 480

Query: 481 FLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSN 540
           F+Q+K   IEE+E+QM++L+E+ A +++ERR+ADN+DEM+E+ AAVKAAM++LNK GSS+
Sbjct: 481 FMQNKGSLIEEIEDQMKELNEKHALSILERRIADNNDEMIELGAAVKAAMTVLNKHGSSS 540

Query: 541 EMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKR 600
            +IAAAT AA AA  S ++Q N P K+DEFGRD NLQKR ++++RA AR++RRA++++KR
Sbjct: 541 SVIAAATGAALAASTSIRQQMNQPVKLDEFGRDENLQKRREVEQRAAARQKRRARFENKR 600

Query: 601 LASTEVDGHQ-KVEGESSTDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQR 660
            ++ EVDG   K+EGESSTDESD+E +AY+   D LLQ AD++FSDA+EE+SQLS VK R
Sbjct: 601 ASAMEVDGPSLKIEGESSTDESDTETSAYKETRDSLLQCADKVFSDASEEYSQLSKVKAR 660

Query: 661 FEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMP 720
           FE+WKRDYS+TYRDAYMSL+  +IFSPYVRLELLKWDPLH++ DFFDM WH LLF+YG P
Sbjct: 661 FERWKRDYSSTYRDAYMSLTVPSIFSPYVRLELLKWDPLHQDVDFFDMKWHGLLFDYGKP 720

Query: 721 EDGSDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPT 780
           EDG DFAP+D DANLVPELVEKVA+PILHH++  CWD+LSTRETRNA  ATSL+TNYV  
Sbjct: 721 EDGDDFAPDDTDANLVPELVEKVAIPILHHQIVRCWDILSTRETRNAVAATSLVTNYVSA 780

Query: 781 SSEALTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLW 840
           SSEAL EL   IR RL  A+  ++VPTW  LV+KAVPN  ++AAYRFG SVRLMRNIC+W
Sbjct: 781 SSEALAELFAAIRARLVEAIAAISVPTWDPLVLKAVPNTPQVAAYRFGTSVRLMRNICMW 840

Query: 841 KEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDR 900
           K+I+ALP+LE LAL +LL+GKVLPHVRSI +NIHDAVTRTERI+ASLSGVWTGP VT   
Sbjct: 841 KDILALPVLENLALSDLLFGKVLPHVRSIASNIHDAVTRTERIVASLSGVWTGPSVTRTH 900

Query: 901 SHKLQPLVDYVMLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL 946
           S  LQPLVD  + L R LEK+  SG+ ++ET+GLARRLK++LVEL+E+D+AR+I +TF+L
Sbjct: 901 SRPLQPLVDCTLTLRRILEKRLGSGLDDAETTGLARRLKRILVELHEHDHAREIVRTFNL 908
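
The percentages in the HSP summary are the match counts divided by the alignment length, which includes gap columns. A small sketch that reproduces the figures reported for the first two hits; `percent` is a hypothetical helper name.

```python
def percent(count, alignment_length):
    """Share of alignment columns, as a percentage rounded to two decimals."""
    return round(100 * count / alignment_length, 2)

# Counts taken from the HSP summary lines on this page.
ilp1_identity = percent(540, 964)    # identical residues, Q9FNN3
ilp1_similar = percent(691, 964)     # identical or positive-scoring residues
paxbp1_identity = percent(240, 978)  # identical residues, Q9Y5B6

print(ilp1_identity, ilp1_similar, paxbp1_identity)  # 56.02 71.68 24.54
```

Because the denominator is the alignment length rather than the 945-residue query, the same query can yield different denominators (964, 978, and so on) against different subjects.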

BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match: Q9Y5B6 (PAX3- and PAX7-binding protein 1 OS=Homo sapiens OX=9606 GN=PAXBP1 PE=1 SV=2)

HSP 1 Score: 165.2 bits (417), Expect = 3.5e-39
Identity = 240/978 (24.54%), Positives = 412/978 (42.13%), Query Frame = 0

Query: 4   SRARNFRRRADDND-----DDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVP 63
           +R  N R+R D  +     D+++      P  G    +       +          P  P
Sbjct: 5   ARRVNVRKRNDSEEEERERDEEQEPPPLLPPPGTGEEAGPGGGDRAPGGESLLGPGPSPP 64

Query: 64  KLLSFASDEENDAPLRTSSKPANSKKP-SSARLAKPSSTHKITALKDRIAHSSSTSASVP 123
             L+     E        ++P N  KP    R  K      + + +D    +        
Sbjct: 65  SALTPGLGAEAGGGFPGGAEPGNGLKPRKRPRENKEVPRASLLSFQDEEEENEEV----- 124

Query: 124 SNVQPQAGTYTKEALRELQKNTR-TLASSRSSSESKPSAEPVIVL--KGLLKPVEQ---- 183
              + +  +Y+K+ ++ L+K  +  L  S+  +E   SAE    L   G +K   Q    
Sbjct: 125 --FKVKKSSYSKKIVKLLKKEYKEDLEKSKIKTELNSSAESEQPLDKTGHVKDTNQEDGV 184

Query: 184 -ISDSAKEGKESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQAT 243
            IS+  ++  +  SE EE     K+ G+F  +       ++S+ + R     G IPD A 
Sbjct: 185 IISEHGEDEMDMESEKEE--EKPKTGGAFSNA-------LSSLNVLR----PGEIPDAAF 244

Query: 244 INAIRAKRERMRQAGVAAPDYISLDAGS-NRTAPGELSDEETEFPGRIAMIGGKSASSKK 303
           I+A R KR+  R+ G   P       G   R    + SD+E +   R  +   K  S ++
Sbjct: 245 IHAARKKRQMARELGDFTPHDNEPGKGRLVREDENDASDDEDDDEKRRIVFSVKEKSQRQ 304

Query: 304 GVFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIP 363
            + EE     I+G   + +   ++DEE   WE EQ RKG+         +V++S  P   
Sbjct: 305 KIAEEI---GIEGSDDDALVTGEQDEELSRWEQEQIRKGI------NIPQVQASQ-PAEV 364

Query: 364 SVPQQNLIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGL---------SISQQAEIAKK 423
           ++  QN  Y T    +S     + T+ G S   SQ  D                 ++ KK
Sbjct: 365 NMYYQN-TYQTMPYGSSYGIPYSYTAYGSSDAKSQKTDNTVPFKTPSNEMTPVTIDLVKK 424

Query: 424 AMRDNMGRLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSV 483
            ++D +  +KE ++       K  ++   S   I  LE S    GE++ F+Q++R +V  
Sbjct: 425 QLKDRLDSMKELHKTNRQQHEKHLQSRVDSTRAIERLEGSSGGIGERYKFLQEMRGYVQD 484

Query: 484 ICDFLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKG 543
           + +    K P I ELE  + +L+++RAS +V+RR  D  DE  E                
Sbjct: 485 LLECFSEKVPLINELESAIHQLYKQRASRLVQRRQDDIKDESSE---------------- 544

Query: 544 SSNEMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYD 603
                            +S   +A +   +D FGRD  L +    KRR   R+ RR +  
Sbjct: 545 ----------------FSSHSNKALMAPNLDSFGRDRALYQE-HAKRRIAEREARRTRRR 604

Query: 604 SKRLASTEVDGHQKVEGESSTDESDS-EAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVV 663
             R  + ++  H  +EG SS DE  S +   +    D + + + ++F D  E F  +  +
Sbjct: 605 QAREQTGKMADH--LEGLSSDDEETSTDITNFNLEKDRISKESGKVFEDVLESFYSIDCI 664

Query: 664 KQRFEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENA-DFFDMNWHSLLFN 723
           K +FE W+  Y  +Y+DAY+ L    +F+P +RL+LL W PL     DF +M W   L  
Sbjct: 665 KSQFEAWRSKYYTSYKDAYIGLCLPKLFNPLIRLQLLTWTPLEAKCRDFENMLWFESLLF 724

Query: 724 YGMPEDGSDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITN 783
           YG  E   +   +D D  L+P +VEKV LP L     + WD  ST +T      T  + N
Sbjct: 725 YGCEEREQE--KDDVDVALLPTIVEKVILPKLTVIAENMWDPFSTTQTSRMVGITLKLIN 784

Query: 784 YVPTSSEA--------LTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYR-F 843
             P+   A        L  LL+ +R  L    +D+ +P +   V++   +   +   R F
Sbjct: 785 GYPSVVNAENKNTQVYLKALLLRMRRTLD---DDVFMPLYPKNVLENKNSGPYLFFQRQF 844

Query: 844 GISVRLMRNICLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASL 903
             SV+L+ N   W  I +   L++L+++ LL   +L   ++      D++ + + +I   
Sbjct: 845 WSSVKLLGNFLQWYGIFSNKTLQELSIDGLLNRYILMAFQNSEYG-DDSIKKAQNVINCF 904

Query: 904 SGVWTGPGVTGDRS-HKLQPLVDYVMLLGRTLEKKHI--SGIAESETSGLARRLKKMLVE 944
              W    + G+R+  +L+    Y++ L  T+ +  I  S + +       +++ K+L  
Sbjct: 905 PKQWF-MNLKGERTISQLENFCRYLVHLADTIYRNSIGCSDVEKRNARENIKQIVKLLAS 909
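
Each "Match:" line carries a UniProt accession followed by a description in UniProt FASTA-header style, where OS is the organism name, OX the NCBI taxonomy identifier, GN the gene name, PE the protein-existence level and SV the sequence version. A minimal parsing sketch, assuming every hit follows the layout seen on this page; `parse_match` is a hypothetical helper.

```python
import re

def parse_match(line):
    """Parse 'Match: ACCESSION (name OS=... OX=... GN=... PE=... SV=...)'."""
    m = re.fullmatch(r"Match:\s*(\S+)\s*\((.*)\)\s*", line)
    if m is None:
        raise ValueError(f"unrecognised match line: {line!r}")
    accession, description = m.groups()
    # Splitting on the two-letter keys keeps each key next to its value.
    parts = re.split(r"\s+(OS|OX|GN|PE|SV)=", description)
    record = {"accession": accession, "name": parts[0]}
    record.update(zip(parts[1::2], parts[2::2]))
    return record

hit = parse_match(
    "Match: Q9Y5B6 (PAX3- and PAX7-binding protein 1 "
    "OS=Homo sapiens OX=9606 GN=PAXBP1 PE=1 SV=2)"
)
print(hit["accession"], hit["OS"], hit["GN"])  # Q9Y5B6 Homo sapiens PAXBP1
```

All values are kept as strings; convert OX, PE and SV to integers if numeric comparison is needed.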

BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match: P58501 (PAX3- and PAX7-binding protein 1 OS=Mus musculus OX=10090 GN=Paxbp1 PE=1 SV=3)

HSP 1 Score: 162.5 bits (410), Expect = 2.3e-38
Identity = 235/951 (24.71%), Positives = 399/951 (41.96%), Query Frame = 0

Query: 22  PNGAAAPSTGVSNASSKAASTSSTVANKPKK---ANPQVPK--LLSFASDEENDAPLRTS 81
           P  A  P  G       +         KP+K    N +VP+  LLSF  +EE +  +   
Sbjct: 65  PPSAHHPGLGAEAGGGISGGAEPGNGLKPRKRPRENKEVPRASLLSFQDEEEENEEVFKV 124

Query: 82  SKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNVQPQAGTYTKEALRELQ 141
            K + SKK    +L K    +K    K +I    +T+A                      
Sbjct: 125 KKSSYSKK--IVKLLK--KEYKEDLEKSKIKTELNTAAD--------------------- 184

Query: 142 KNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKESSSEDEEGGSNEKSAG 201
            + + L  +  + ++ P    V            IS+  ++  +  SE EE     K+ G
Sbjct: 185 -SDQPLDKTCHAKDTNPEDGVV------------ISEHGEDEMDMESEKEE--EKPKAGG 244

Query: 202 SFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERMRQAGVAAPDYISLDAG 261
           +F  +       ++S+ + R     G IPD A I+A R KR+  R+ G   P       G
Sbjct: 245 AFSNA-------LSSLNVLR----PGEIPDAAFIHAARKKRQLARELGDFTPHDSEPGKG 304

Query: 262 S-NRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAIDGVRTNIIEHSDEDEE 321
              R    + SD+E +   R  +   K  S ++ + EE     I+G   + +   ++DEE
Sbjct: 305 RLVREDENDASDDEDDDEKRRIVFSVKEKSQRQKIAEEI---GIEGSDDDALVTGEQDEE 364

Query: 322 EKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTTAGYNSVPSISTATSI 381
              WE EQ RKG+       S   + S V +      Q + Y  + G   +P   + T+ 
Sbjct: 365 LSRWEQEQIRKGINIPQVQAS---QPSEVNVYYQNTYQTMPYGASYG---IP--YSYTAY 424

Query: 382 GGSVGVSQGLDGL---------SISQQAEIAKKAMRDNMGRLKESYRRTAASVLKTDENL 441
           G S   SQ  D                 ++ K+ ++D +  +KE ++       K  ++ 
Sbjct: 425 GSSDAKSQKTDNTVPFKTPSNEMAPVTIDLVKRQLKDRLDSMKELHKTNQQQHEKHLQSR 484

Query: 442 SASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERA 501
             S   I  LE S    GE++ F+Q++R +V  + +    K P I ELE  + +L+++RA
Sbjct: 485 VDSTRAIERLEGSSGGIGERYKFLQEMRGYVQDLLECFSEKVPLINELESAIHQLYKQRA 544

Query: 502 STVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAKEQANLP 561
           S +V+RR  D  DE  E                                 +S   +A + 
Sbjct: 545 SRLVQRRQDDIKDESSE--------------------------------FSSHSNKALMA 604

Query: 562 TKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESSTDESDS- 621
             +D FGRD  L +    KRR   R+ RR +    R  + ++  H  +EG SS DE  S 
Sbjct: 605 PNLDSFGRDRALYQE-HAKRRIAEREARRTRRRQAREQTGQMADH--LEGLSSDDEETST 664

Query: 622 EAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSLSTAAI 681
           +   +    D +L+ + ++F D  E F  +  +K +FE W+  Y  +Y+DAY+ L    +
Sbjct: 665 DITNFNLEKDRILKESSKVFEDVLESFYSIDCIKAQFEAWRSKYYMSYKDAYIGLCLPKL 724

Query: 682 FSPYVRLELLKWDPLHENA-DFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKV 741
           F+P +RL+LL W PL     DF  M W   L  YG  +   +   ++AD  L+P +VEKV
Sbjct: 725 FNPLIRLQLLTWTPLEAKCRDFETMLWFESLLFYGCEDREQE--KDEADVALLPTIVEKV 784

Query: 742 ALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEA--------LTELLVVIRTR 801
            LP L       WD  ST +T      T  + N  P+   A        L  LL+ +R  
Sbjct: 785 ILPKLTVIAETMWDPFSTTQTSRMVGITMKLINGYPSVVNADNKNTQVYLKALLLRMRRT 844

Query: 802 LSSAVEDLTVPTWSALVMKAVPNAARIAAYR-FGISVRLMRNICLWKEIIALPILEKLAL 861
           L    +D+ +P +   V++   +   +   R F  SV+L+ N   W  I +   L++L++
Sbjct: 845 LD---DDVFMPLYPKNVLENKNSGPYLFFQRQFWSSVKLLGNFLQWYGIFSNKTLQELSI 904

Query: 862 EELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRS-HKLQPLVDYVML 921
           + LL   +L   ++      D++ + + +I      W    + G+R+  +L+    Y++ 
Sbjct: 905 DGLLNRYILMAFQNSEYG-DDSIRKAQNVINCFPKQWF-VNLKGERTISQLENFCRYLVH 911

Query: 922 LGRTLEKKHI--SGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLRE 944
           L  T+ +  I  S + +       +++ K+L  +   D+A  +A   +++E
Sbjct: 965 LADTIYRNSIGCSDVEKRNARENIKQIVKLLASVRALDHAISVASDHNVKE 911

BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match: P16383 (Intron Large complex component GCFC2 OS=Homo sapiens OX=9606 GN=GCFC2 PE=1 SV=2)

HSP 1 Score: 106.3 bits (264), Expect = 1.9e-21
Identity = 182/784 (23.21%), Positives = 317/784 (40.43%), Query Frame = 0

Query: 175 AKEGKESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGK-DSTGSIPDQATINAI 234
           A EG ES + D      +K   S     +  L+  +S  +G  +  ST  IPD A I A 
Sbjct: 84  ADEGSESRTLDVSTDEEDKIHHSSESKDDQGLSSDSSSSLGEKELSSTVKIPDAAFIQAA 143

Query: 235 RAKRERMRQAGVAAPDYISLD-------AGSNRTAPGELSDEETEFPGRIAMIGGKSASS 294
           R KRE  R    A  DYISLD       +G  R +  +   E  +   RI          
Sbjct: 144 RRKRELAR----AQDDYISLDVQHTSSISGMKRESEDDPESEPDDHEKRIPF-----TLR 203

Query: 295 KKGVFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPL 354
            + + +   E++I        E S EDE++  WE +Q RK                    
Sbjct: 204 PQTLRQRMAEESISR-NEETSEESQEDEKQDTWEQQQMRKA------------------- 263

Query: 355 IPSVPQQNLIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMG 414
           +  + ++++      G + V    T+ S                    EI KK +   + 
Sbjct: 264 VKIIEERDIDLSCGNGSSKVKKFDTSISFP--------------PVNLEIIKKQLNTRLT 323

Query: 415 RLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQH 474
            L+E++R       K  +++ +S   I  LE S + A     F + ++ +V  + D L  
Sbjct: 324 LLQETHRSHLREYEKYVQDVKSSKSTIQNLESSSNQA-LNCKFYKSMKIYVENLIDCLNE 383

Query: 475 KAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIA 534
           K   I+E+E  M  L  ++A T ++RR     DE+       K   + L +    +E   
Sbjct: 384 KIINIQEIESSMHALLLKQAMTFMKRR----QDEL-------KHESTYLQQLSRKDE--- 443

Query: 535 AATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLAST 594
             TS +       K Q  L                       E  + RR K    R+ S 
Sbjct: 444 --TSTSGNFSVDEKTQWIL-----------------------EEIESRRTKRRQARVLSG 503

Query: 595 EVDGHQKVEGESSTDESDS-EAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQW 654
             + HQ  EG SS DE  S E   +Q +   +LQ   ++F +  ++F  +  +  +F+QW
Sbjct: 504 NCN-HQ--EGTSSDDELPSAEMIDFQKSQGDILQKQKKVFEEVQDDFCNIQNILLKFQQW 563

Query: 655 KRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLH-ENADFFDMNWHSLLFNYGMPEDG 714
           +  +  +Y +A++SL    + +P +R++L+ W+PL  E+    +M W   +  +      
Sbjct: 564 REKFPDSYYEAFISLCIPKLLNPLIRVQLIDWNPLKLESTGLKEMPWFKSVEEFMDSSVE 623

Query: 715 SDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTS-- 774
                + +D  ++  ++ K  +P L   V   WD LST +T +      +I     T   
Sbjct: 624 DSKKESSSDKKVLSAIINKTIIPRLTDFVEFLWDPLSTSQTTSLITHCRVILEEHSTCEN 683

Query: 775 --SEALTELLVVIRTRLSSAVE-DLTVPTW--SALVMKAVPNAARIAAYRFGISVRLMRN 834
             S++  +LL  I +R+  AVE D+ +P +  SA+  K  P+ ++    +F   ++L RN
Sbjct: 684 EVSKSRQDLLKSIVSRMKKAVEDDVFIPLYPKSAVENKTSPH-SKFQERQFWSGLKLFRN 743

Query: 835 ICLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGV 894
           I LW  ++    L++L L +LL   ++  + + T    D V +  ++ A L   W     
Sbjct: 744 ILLWNGLLTDDTLQELGLGKLLNRYLIIALLNATPG-PDVVKKCNQVAACLPEKWFENSA 771

Query: 895 TGDRSHKLQPLVDYVMLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAK 942
                 +L+  + ++      L+  H   ++ SE       +  +LV++   + A     
Sbjct: 804 MRTSIPQLENFIQFL------LQSAH--KLSRSEFRDEVEEIILILVKIKALNQAESFIG 771

BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match: Q8BKT3 (Intron Large complex component GCFC2 OS=Mus musculus OX=10090 GN=Gcfc2 PE=1 SV=2)

HSP 1 Score: 94.4 bits (233), Expect = 7.6e-18
Identity = 200/806 (24.81%), Positives = 330/806 (40.94%), Query Frame = 0

Query: 92  KPSSTHKITALKDRIAHSSSTSASVPSNVQP-QAGTYTKEA-----LRELQKNTRTLASS 151
           +P  T +   ++   + S S  A   S  +P  AG  T+ A      R  +   R  ASS
Sbjct: 4   RPQRTFRRRQVESSDSDSDSDGAKEQSAEEPASAGGRTEGAERPRGARSARGRGRVWASS 63

Query: 152 RSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKESSSEDEEGGSNEKSAGSFRRSKEDA 211
           R S  + P  +                 +     E S+++EEG      +   R    D+
Sbjct: 64  RRSPGAAPRGD---------------GGAECRTAELSTDEEEGTHTLTGSKGDRSPSSDS 123

Query: 212 LARMASMGIGRGKDSTGSIPDQATINAIRAKRERMRQAGVAAPDYISLDAG-SNRTAPGE 271
              +      R       IPD A I A R KRE  R  G    DYISLD   S  T+  +
Sbjct: 124 SCSLEE----RDVSPIVEIPDAAFIQAARRKRELARTPG----DYISLDVNHSCSTSDCK 183

Query: 272 LSDEE------TEFPGRIAMIGGKSASSKKGVFEEFDEQAIDGVRTNIIEHSDEDEEEKI 331
            S+EE       +   RI +   K  + ++ + EE   ++ +       E S EDE + I
Sbjct: 184 RSNEEDPESDPDDHEKRI-LFTPKPQTLRQRMAEETSIRSEES-----SEESQEDENQDI 243

Query: 332 WEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTTAGYNSVPSISTATSIGGS 391
           WE +Q RK +                     +P         AG N+  S S+ +     
Sbjct: 244 WEQQQMRKAV--------------------RIP---------AGQNTDLSHSSKSQTLKK 303

Query: 392 VGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVLKTDENLSASLLNITALEK 451
              S     +++    EI KK + + +  L+ES+R       K ++++ +S   I  LE 
Sbjct: 304 FDTSISFPPVNL----EIIKKQLNNRLTLLQESHRSHQREYEKYEQDIKSSKTAIQNLE- 363

Query: 452 SLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASTVVERRVADND 511
           S S   + + F + ++ +V  I D L  K   I ELE  M  L  +R+  +++RR     
Sbjct: 364 SASDHAQNYRFYRGMKSYVENIIDCLNEKIVSIVELESSMYTLLLKRSEALLKRR----Q 423

Query: 512 DEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNL 571
           DE+       K   S L +    +E     TSA  +     K+Q  L             
Sbjct: 424 DEL-------KCESSYLQQLSRKDE-----TSANGSLAVDEKDQRIL------------- 483

Query: 572 QKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESSTDE-SDSEAAAYQSNHDLL 631
                     EAR+ +R +    R  S   D HQ  EG SS DE S +E   +      +
Sbjct: 484 -------EEIEARRMQRRQ---ARELSGSCD-HQ--EGMSSDDELSPAEMTNFHKCQGDI 543

Query: 632 LQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKW 691
           LQ   ++F D  ++F  +  +  +F+QW+  +  +Y +A++      + SP +R++LL W
Sbjct: 544 LQDCKKVFEDVHDDFCNVQNILLKFQQWREKFPDSYYEAFVGFCLPKLLSPLIRVQLLDW 603

Query: 692 DPLHENADFFD-MNWHSLLFNYGMPEDGSDFAPND-ADANLVPELVEKVALPILHHEVAH 751
           +PL  ++   D M W + +  + M     D    D +D  ++  ++ K  +P L   V  
Sbjct: 604 NPLKMDSIGLDKMPWFTAITEF-MESSMDDIGKEDGSDKKILAAVINKTVVPRLTDFVET 663

Query: 752 CWDMLSTRETRN------AAFATSLITNYVPTSSEALTELLVVIRTRLSSAVE-DLTVPT 811
            WD LST +TR+       AF      N V  + +   +LL  I  R+  ++E D+ +P 
Sbjct: 664 IWDPLSTSQTRSLTVHCRVAFEQFASENEVSKNKQ---DLLKSIVARMKKSIEDDIFIPL 698

Query: 812 W--SALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLYGKVLPH 871
           +  S+   K  P+ ++    +F  +++L RNI LW  ++    L+ L L +LL   ++  
Sbjct: 724 YPKSSEEGKMSPH-SKFQERQFWGALKLFRNILLWNGLLPDDTLQDLGLGKLLNRYLIIS 698

Query: 872 VRSITANIHDAVTRTERIIASLSGVW 873
           + +      D V +  +I A L   W
Sbjct: 784 LTNAVPG-PDVVKKCSQIAACLPERW 698

BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match: A0A6J1G0Q4 (transcriptional repressor ILP1 OS=Cucurbita moschata OX=3662 GN=LOC111449643 PE=3 SV=1)

HSP 1 Score: 1749.6 bits (4530), Expect = 0.0e+00
Identity = 945/945 (100.00%), Positives = 945/945 (100.00%), Query Frame = 0

Query: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
           MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL
Sbjct: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60

Query: 61  LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV 120
           LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV
Sbjct: 61  LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV 120

Query: 121 QPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE 180
           QPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE
Sbjct: 121 QPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE 180

Query: 181 SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM 240
           SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM
Sbjct: 181 SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM 240

Query: 241 RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID 300
           RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID
Sbjct: 241 RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID 300

Query: 301 GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTT 360
           GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTT
Sbjct: 301 GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTT 360

Query: 361 AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL 420
           AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL
Sbjct: 361 AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL 420

Query: 421 KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 480
           KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK
Sbjct: 421 KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 480

Query: 481 LHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK 540
           LHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK
Sbjct: 481 LHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK 540

Query: 541 EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST 600
           EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST
Sbjct: 541 EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST 600

Query: 601 DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL 660
           DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL
Sbjct: 601 DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL 660

Query: 661 STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL 720
           STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL
Sbjct: 661 STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL 720

Query: 721 VEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSA 780
           VEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSA
Sbjct: 721 VEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSA 780

Query: 781 VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY 840
           VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY
Sbjct: 781 VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY 840

Query: 841 GKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE 900
           GKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE
Sbjct: 841 GKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE 900

Query: 901 KKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
           KKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL
Sbjct: 901 KKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 945

BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match: A0A6J1HXZ2 (transcriptional repressor ILP1 OS=Cucurbita maxima OX=3661 GN=LOC111467670 PE=3 SV=1)

HSP 1 Score: 1732.2 bits (4485), Expect = 0.0e+00
Identity = 935/945 (98.94%), Positives = 940/945 (99.47%), Query Frame = 0

Query: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
           MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSK ASTSSTVANKPKKANPQVPKL
Sbjct: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKPASTSSTVANKPKKANPQVPKL 60

Query: 61  LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV 120
           LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV
Sbjct: 61  LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV 120

Query: 121 QPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE 180
           QPQAG YT+EALRELQKNTRTLASSR+SSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE
Sbjct: 121 QPQAGIYTEEALRELQKNTRTLASSRASSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE 180

Query: 181 SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM 240
           SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM
Sbjct: 181 SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM 240

Query: 241 RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID 300
           RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID
Sbjct: 241 RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID 300

Query: 301 GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTT 360
           GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSV QQNLIYPTT
Sbjct: 301 GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVLQQNLIYPTT 360

Query: 361 AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL 420
           AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL
Sbjct: 361 AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL 420

Query: 421 KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 480
           KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK
Sbjct: 421 KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 480

Query: 481 LHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK 540
           LHEERASTVVERRVADNDDEMVEI+AAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK
Sbjct: 481 LHEERASTVVERRVADNDDEMVEIDAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK 540

Query: 541 EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST 600
           EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST
Sbjct: 541 EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST 600

Query: 601 DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL 660
           DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL
Sbjct: 601 DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL 660

Query: 661 STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL 720
           STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL
Sbjct: 661 STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL 720

Query: 721 VEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSA 780
           VEKVALPILHHE+AHCWDMLSTRETRNAAFATSLITNYVPTSSEAL ELLVVIRTRLSSA
Sbjct: 721 VEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPTSSEALMELLVVIRTRLSSA 780

Query: 781 VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY 840
           VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY
Sbjct: 781 VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY 840

Query: 841 GKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE 900
           GKVLPHVRSITANIHDAVTRTERIIASL GVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE
Sbjct: 841 GKVLPHVRSITANIHDAVTRTERIIASLLGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE 900

Query: 901 KKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
           KKHISG+AESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL
Sbjct: 901 KKHISGVAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 945

BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match: A0A5D3CCM3 (PAX3-and PAX7-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00700 PE=3 SV=1)

HSP 1 Score: 1592.0 bits (4121), Expect = 0.0e+00
Identity = 860/947 (90.81%), Positives = 900/947 (95.04%), Query Frame = 0

Query: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
           MSGSRARNFRRRADDNDDDDEPNG+ APS   SNASSK +STSS VA KPKKANPQ PKL
Sbjct: 1   MSGSRARNFRRRADDNDDDDEPNGSPAPSISASNASSKPSSTSSVVATKPKKANPQGPKL 60

Query: 61  LSFASDEENDAPLR-TSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSN 120
           LSFASDEENDAPLR +SSK ++SKKPSSARLAKPSSTHKITALKDRIAHSSS SASVPSN
Sbjct: 61  LSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRIAHSSSISASVPSN 120

Query: 121 VQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGK 180
           VQPQAG YTKEALRELQKNTRTLASSR SSESKPSAEPVIVLKGLLKP EQ+ +SA+E K
Sbjct: 121 VQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPAEQVPESAREDK 180

Query: 181 ESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTG-SIPDQATINAIRAKRE 240
           ESSSEDEE GSN KSA S RRSKED LARMASMGIGRGKDS+G SIPDQATINAIRAKRE
Sbjct: 181 ESSSEDEEAGSNAKSAASLRRSKEDTLARMASMGIGRGKDSSGSSIPDQATINAIRAKRE 240

Query: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQA 300
           RMRQAGVAAPDYISLDAGSNRTAPGELSDEE EFPGRIAMIGGK  SSKKGVFEE DEQ 
Sbjct: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQG 300

Query: 301 IDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYP 360
           IDGVRTNIIEHSDEDEEEKIWE EQFRKGLGKRMDDGSTRVES+SVP++ SV QQNLIYP
Sbjct: 301 IDGVRTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVQSVQQQNLIYP 360

Query: 361 TTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAAS 420
           TT GY+SVPS STATSIGGSV VSQGLDGLSISQQAEIAKKAM+++MGRLKESYRRTA+S
Sbjct: 361 TTIGYSSVPSKSTATSIGGSVSVSQGLDGLSISQQAEIAKKAMQESMGRLKESYRRTASS 420

Query: 421 VLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
           VLKTDENLSASLL IT LEK+LSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM
Sbjct: 421 VLKTDENLSASLLKITDLEKALSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480

Query: 481 QKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIAS 540
           QKLHEERASTVVERRVADNDDEMVEIE AVKAA SILNKKGSS+EM+ AATSAAQAAIAS
Sbjct: 481 QKLHEERASTVVERRVADNDDEMVEIETAVKAATSILNKKGSSHEMLVAATSAAQAAIAS 540

Query: 541 AKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGES 600
           ++EQANLPTK+DEFGRDLNLQKRMDMKRRAEARKRRR++YDSKRLAS EVDGHQKVEGES
Sbjct: 541 SREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDGHQKVEGES 600

Query: 601 STDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYM 660
           STDESDS++AAYQSN DLLLQTA+QIFSDAAEEFSQLSVVKQRFE+WKRDYSATYRDAYM
Sbjct: 601 STDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEEWKRDYSATYRDAYM 660

Query: 661 SLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
           SLS  AIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP
Sbjct: 661 SLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720

Query: 721 ELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLS 780
           ELVEKVALPILHHE+AHCWDMLSTRETRNAAFATSLITNYVP SSEALTELLVVIRTRLS
Sbjct: 721 ELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLS 780

Query: 781 SAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEEL 840
            A+EDLTVPTW++LV KAVPNAARIAAYRFG+SVRL+RNICLWKEIIALPILEKLALEEL
Sbjct: 781 GAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLLRNICLWKEIIALPILEKLALEEL 840

Query: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRT 900
           LYGKVLPHVRSITANIHDAVTRTERIIASL+GVWTG G+ GDRSHKLQPLVDYV+LLGRT
Sbjct: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRT 900

Query: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
           LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL+EAL
Sbjct: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 947

BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match: A0A1S3BG51 (PAX3- and PAX7-binding protein 1 OS=Cucumis melo OX=3656 GN=LOC103489249 PE=3 SV=1)

HSP 1 Score: 1592.0 bits (4121), Expect = 0.0e+00
Identity = 860/947 (90.81%), Positives = 900/947 (95.04%), Query Frame = 0

Query: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
           MSGSRARNFRRRADDNDDDDEPNG+ APS   SNASSK +STSS VA KPKKANPQ PKL
Sbjct: 1   MSGSRARNFRRRADDNDDDDEPNGSPAPSISASNASSKPSSTSSVVATKPKKANPQGPKL 60

Query: 61  LSFASDEENDAPLR-TSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSN 120
           LSFASDEENDAPLR +SSK ++SKKPSSARLAKPSSTHKITALKDRIAHSSS SASVPSN
Sbjct: 61  LSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRIAHSSSISASVPSN 120

Query: 121 VQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGK 180
           VQPQAG YTKEALRELQKNTRTLASSR SSESKPSAEPVIVLKGLLKP EQ+ +SA+E K
Sbjct: 121 VQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPAEQVPESAREDK 180

Query: 181 ESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTG-SIPDQATINAIRAKRE 240
           ESSSEDEE GSN KSA S RRSKED LARMASMGIGRGKDS+G SIPDQATINAIRAKRE
Sbjct: 181 ESSSEDEEAGSNAKSAASLRRSKEDTLARMASMGIGRGKDSSGSSIPDQATINAIRAKRE 240

Query: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQA 300
           RMRQAGVAAPDYISLDAGSNRTAPGELSDEE EFPGRIAMIGGK  SSKKGVFEE DEQ 
Sbjct: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQG 300

Query: 301 IDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYP 360
           IDGVRTNIIEHSDEDEEEKIWE EQFRKGLGKRMDDGSTRVES+SVP++ SV QQNLIYP
Sbjct: 301 IDGVRTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVQSVQQQNLIYP 360

Query: 361 TTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAAS 420
           TT GY+SVPS STATSIGGSV VSQGLDGLSISQQAEIAKKAM+++MGRLKESYRRTA+S
Sbjct: 361 TTIGYSSVPSKSTATSIGGSVSVSQGLDGLSISQQAEIAKKAMQESMGRLKESYRRTASS 420

Query: 421 VLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
           VLKTDENLSASLL IT LEK+LSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM
Sbjct: 421 VLKTDENLSASLLKITDLEKALSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480

Query: 481 QKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIAS 540
           QKLHEERASTVVERRVADNDDEMVEIE AVKAA SILNKKGSS+EM+ AATSAAQAAIAS
Sbjct: 481 QKLHEERASTVVERRVADNDDEMVEIETAVKAATSILNKKGSSHEMLVAATSAAQAAIAS 540

Query: 541 AKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGES 600
           ++EQANLPTK+DEFGRDLNLQKRMDMKRRAEARKRRR++YDSKRLAS EVDGHQKVEGES
Sbjct: 541 SREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDGHQKVEGES 600

Query: 601 STDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYM 660
           STDESDS++AAYQSN DLLLQTA+QIFSDAAEEFSQLSVVKQRFE+WKRDYSATYRDAYM
Sbjct: 601 STDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEEWKRDYSATYRDAYM 660

Query: 661 SLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
           SLS  AIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP
Sbjct: 661 SLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720

Query: 721 ELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLS 780
           ELVEKVALPILHHE+AHCWDMLSTRETRNAAFATSLITNYVP SSEALTELLVVIRTRLS
Sbjct: 721 ELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLS 780

Query: 781 SAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEEL 840
            A+EDLTVPTW++LV KAVPNAARIAAYRFG+SVRL+RNICLWKEIIALPILEKLALEEL
Sbjct: 781 GAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLLRNICLWKEIIALPILEKLALEEL 840

Query: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRT 900
           LYGKVLPHVRSITANIHDAVTRTERIIASL+GVWTG G+ GDRSHKLQPLVDYV+LLGRT
Sbjct: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRT 900

Query: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
           LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL+EAL
Sbjct: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 947

BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match: A0A0A0KWD3 (GCFC domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G608340 PE=3 SV=1)

HSP 1 Score: 1582.8 bits (4097), Expect = 0.0e+00
Identity = 854/947 (90.18%), Positives = 895/947 (94.51%), Query Frame = 0

Query: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
           MSGSRARNFRRRADDNDDDDEP G+ APS   SNASSK +STSS VA KPKKANPQ  KL
Sbjct: 1   MSGSRARNFRRRADDNDDDDEPKGSTAPSISASNASSKPSSTSSVVATKPKKANPQGLKL 60

Query: 61  LSFASDEENDAPLR-TSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSN 120
           LSFASDEENDAPLR +SSK ++SKKPSSARLAKPSSTHKITALKDRIAHSSS SASVPSN
Sbjct: 61  LSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRIAHSSSISASVPSN 120

Query: 121 VQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGK 180
           VQPQAG YTKEALRELQKNTRTLASSR SSESKPSAEPVIVLKGLLKP EQ+ DSA+E K
Sbjct: 121 VQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPAEQVPDSAREAK 180

Query: 181 ESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTG-SIPDQATINAIRAKRE 240
           ESSSED+E GSN KSA S RRSKED LARMASMGIGRGKDS+G SIPDQATINAIRAKRE
Sbjct: 181 ESSSEDDEAGSNAKSAASLRRSKEDTLARMASMGIGRGKDSSGSSIPDQATINAIRAKRE 240

Query: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQA 300
           RMRQAGVAAPDYISLDAGSNRTAPGELSDEE EFPGRIAMIGGK  SSKKGVFEE DEQ 
Sbjct: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQG 300

Query: 301 IDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYP 360
           IDG RTNIIEHSDEDEEEKIWE EQFRKGLGKRMDDGSTRVES+SVP++PSV  QNLIYP
Sbjct: 301 IDGARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVPSVQPQNLIYP 360

Query: 361 TTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAAS 420
           TT GY+SVPS+STATSIGGSV +SQGLDGLSISQQAEIAK AM+++MGRLKESYRRTA S
Sbjct: 361 TTIGYSSVPSMSTATSIGGSVSISQGLDGLSISQQAEIAKTAMQESMGRLKESYRRTAMS 420

Query: 421 VLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
           VLKTDENLSASLL IT LEK+LSAAG+KF+FMQKLRDFVSVICDFLQHKAPFIEELEEQM
Sbjct: 421 VLKTDENLSASLLKITDLEKALSAAGDKFMFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480

Query: 481 QKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIAS 540
           QKLHEERASTVVERRVADNDDEMVEIE AVKAA+SILNKKGSSNEM+ AATSAAQAAIA 
Sbjct: 481 QKLHEERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNEMVTAATSAAQAAIAL 540

Query: 541 AKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGES 600
           ++EQANLPTK+DEFGRDLNLQKRMDMKRRAEARKRRR++YDSKRLAS EVDGHQKVEGES
Sbjct: 541 SREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDGHQKVEGES 600

Query: 601 STDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYM 660
           STDESDS++AAYQSN DLLLQTA+QIFSDAAEEFSQLSVVKQRFE WKRDYSATYRDAYM
Sbjct: 601 STDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEAWKRDYSATYRDAYM 660

Query: 661 SLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
           SLS  AIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP
Sbjct: 661 SLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720

Query: 721 ELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLS 780
           ELVEKVALPILHHE+AHCWDMLSTRETRNAAFATSLITNYVP SSEALTELLVVIRTRLS
Sbjct: 721 ELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLS 780

Query: 781 SAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEEL 840
            A+EDLTVPTW++LV KAVPNAARIAAYRFG+SVRLMRNICLWKEIIALPILEKLALEEL
Sbjct: 781 GAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNICLWKEIIALPILEKLALEEL 840

Query: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRT 900
           LYGKVLPHVRSITANIHDAVTRTERIIASL+GVWTG G+ GDRSHKLQPLVDYV+LLGRT
Sbjct: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRT 900

Query: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
           LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL+EAL
Sbjct: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 947

BLAST of CmoCh18G010640.1 vs. TAIR 10
Match: AT5G08550.1 (GC-rich sequence DNA-binding factor-like protein )

HSP 1 Score: 897.1 bits (2317), Expect = 1.2e-260
Identity = 540/964 (56.02%), Positives = 691/964 (71.68%), Query Frame = 0

Query: 1   MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
           M  +R +NFRRR DD  D+ +   A   S   S  SS    T S  A+ PKK      KL
Sbjct: 1   MGSNRPKNFRRRGDDGGDEIDGKVATPSSKPTSTLSSSKPKTLS--ASAPKK------KL 60

Query: 61  LSFASD--EENDAPLRTSSKPANSKK--PSSARLAKPSSTHKITALKDRIAHSSSTSASV 120
           LSFA D  EE D   R + KP N +    SS+RL    S+H+ ++ K+R   S       
Sbjct: 61  LSFADDEEEEEDGAPRVTIKPKNGRDRVKSSSRLGVSGSSHRHSSTKERRPAS------- 120

Query: 121 PSNVQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLK-PVEQISDSA 180
            SNV PQAG+Y+KEAL ELQKNTRTL  SRSS+    +AEP +VLKGL+K P +    S 
Sbjct: 121 -SNVLPQAGSYSKEALLELQKNTRTLPYSRSSA----NAEPKVVLKGLIKPPQDHEQQSL 180

Query: 181 KEGKESSSE---DEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINA 240
           K+  +  S+   DEEG   +          EDA A                  DQA I  
Sbjct: 181 KDVVKQVSDLDFDEEGEEEQ---------HEDAFA------------------DQAAI-- 240

Query: 241 IRAKRERMRQAGVA-APDYISLDAG-SNRTAPGELSDEETEFPGRIAMIGGK-SASSKKG 300
           IRAK+ERMRQ+  A APDYISLD G  N +A   +SDE+ +F G    +G +     KKG
Sbjct: 241 IRAKKERMRQSRSAPAPDYISLDGGIVNHSAVEGVSDEDADFQG--IFVGPRPQKDDKKG 300

Query: 301 VFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSS---VPL 360
           VF+  DE       T    + DEDEE+K+WE EQF+KG+GKRMD+GS R  +S+   VPL
Sbjct: 301 VFDFGDENPTAKETTTSSIYEDEDEEDKLWEEEQFKKGIGKRMDEGSHRTVTSNGIGVPL 360

Query: 361 ---IPSVPQQN-LIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMR 420
                ++PQQ   +Y   AG   +P++S A +IG +  V    D L +SQQAE+AKKA++
Sbjct: 361 HSKQQTLPQQQPQMYAYHAG-TPMPNVSVAPTIGPATSV----DTLPMSQQAELAKKALK 420

Query: 421 DNMGRLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICD 480
           DN+ +LKES+ +T +S+ KTDENL+ASL++ITALE SLSAAG+K++FMQKLRDF+SVICD
Sbjct: 421 DNVKKLKESHAKTLSSLTKTDENLTASLMSITALESSLSAAGDKYVFMQKLRDFISVICD 480

Query: 481 FLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSN 540
           F+Q+K   IEE+E+QM++L+E+ A +++ERR+ADN+DEM+E+ AAVKAAM++LNK GSS+
Sbjct: 481 FMQNKGSLIEEIEDQMKELNEKHALSILERRIADNNDEMIELGAAVKAAMTVLNKHGSSS 540

Query: 541 EMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKR 600
            +IAAAT AA AA  S ++Q N P K+DEFGRD NLQKR ++++RA AR++RRA++++KR
Sbjct: 541 SVIAAATGAALAASTSIRQQMNQPVKLDEFGRDENLQKRREVEQRAAARQKRRARFENKR 600

Query: 601 LASTEVDGHQ-KVEGESSTDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQR 660
            ++ EVDG   K+EGESSTDESD+E +AY+   D LLQ AD++FSDA+EE+SQLS VK R
Sbjct: 601 ASAMEVDGPSLKIEGESSTDESDTETSAYKETRDSLLQCADKVFSDASEEYSQLSKVKAR 660

Query: 661 FEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMP 720
           FE+WKRDYS+TYRDAYMSL+  +IFSPYVRLELLKWDPLH++ DFFDM WH LLF+YG P
Sbjct: 661 FERWKRDYSSTYRDAYMSLTVPSIFSPYVRLELLKWDPLHQDVDFFDMKWHGLLFDYGKP 720

Query: 721 EDGSDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPT 780
           EDG DFAP+D DANLVPELVEKVA+PILHH++  CWD+LSTRETRNA  ATSL+TNYV  
Sbjct: 721 EDGDDFAPDDTDANLVPELVEKVAIPILHHQIVRCWDILSTRETRNAVAATSLVTNYVSA 780

Query: 781 SSEALTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLW 840
           SSEAL EL   IR RL  A+  ++VPTW  LV+KAVPN  ++AAYRFG SVRLMRNIC+W
Sbjct: 781 SSEALAELFAAIRARLVEAIAAISVPTWDPLVLKAVPNTPQVAAYRFGTSVRLMRNICMW 840

Query: 841 KEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDR 900
           K+I+ALP+LE LAL +LL+GKVLPHVRSI +NIHDAVTRTERI+ASLSGVWTGP VT   
Sbjct: 841 KDILALPVLENLALSDLLFGKVLPHVRSIASNIHDAVTRTERIVASLSGVWTGPSVTRTH 900

Query: 901 SHKLQPLVDYVMLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL 946
           S  LQPLVD  + L R LEK+  SG+ ++ET+GLARRLK++LVEL+E+D+AR+I +TF+L
Sbjct: 901 SRPLQPLVDCTLTLRRILEKRLGSGLDDAETTGLARRLKRILVELHEHDHAREIVRTFNL 908

BLAST of CmoCh18G010640.1 vs. TAIR 10
Match: AT5G09210.1 (GC-rich sequence DNA-binding factor-like protein )

HSP 1 Score: 352.1 bits (902), Expect = 1.4e-96
Identity = 189/320 (59.06%), Positives = 234/320 (73.12%), Query Frame = 0

Query: 587 EVDGHQK-VEGESST-DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQ 646
           +VDG+   VEG+SST DESD E +AY+   D LLQ AD+IFSDA+  +S+LS VK  F++
Sbjct: 253 KVDGYSLIVEGDSSTDDESDCETSAYEEARDSLLQRADKIFSDASVVYSELSRVKSIFKR 312

Query: 647 WKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDG 706
             R  S  +R AY SL+  +++SPY+RLELL+WDPLH++ DF DMNWH LLF+  +    
Sbjct: 313 GARHPSPAFRAAYTSLTVPSMYSPYLRLELLRWDPLHQDVDFSDMNWHGLLFHSRIVCGS 372

Query: 707 SDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSE 766
           +    N    N V ELV+ VA+PILHH +  CWD+LSTRETRN   ATSL+  YV  SSE
Sbjct: 373 TPVCTN---PNFVSELVKYVAVPILHHRIVRCWDILSTRETRNVVAATSLVARYVFPSSE 432

Query: 767 ALTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEI 826
           AL EL + I  RL  A+  ++VPTW   V K VPNA ++AAYRFG SVRLMRNIC+WK++
Sbjct: 433 ALAELSLAIHARLVEAIIAISVPTWDPQVSKDVPNAPQVAAYRFGTSVRLMRNICMWKDV 492

Query: 827 IALPILEKLALEELLYGKVLPHVRSIT--ANIHDAVTRTERIIASLSGVWTGPGVTGDRS 886
           + LP+LEKLAL +LL+GKVLPHVRSI   +NIHDAVT+TERI+ASLSGVWTGP VT   S
Sbjct: 493 MELPVLEKLALSDLLFGKVLPHVRSIASESNIHDAVTKTERIVASLSGVWTGPSVTRTHS 552

Query: 887 HKLQPLVDYVMLLGRTLEKK 903
           H LQPLVD  + LGR LEKK
Sbjct: 553 HLLQPLVDCTLTLGRILEKK 569
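
Each HSP above carries a summary line giving identical and positive residue counts over the aligned length. As a minimal sketch, assuming that line layout stays as shown, the percentages can be recomputed from the raw counts. The helper name `parse_hsp_summary` is hypothetical and is not part of any BLAST tool.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# "Posi?tives" also accepts the "Postives" spelling that some exports use.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, aligned, positives, aligned_pos = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned_pos, 2),
    }


if __name__ == "__main__":
    print(parse_hsp_summary(
        "Identity = 540/964 (56.02%), Positives = 691/964 (71.68%), Query Frame = 0"
    ))
```

Recomputing from the counts rather than trusting the printed percentage gives a quick consistency check when these records are copied between files.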

The following BLAST results are available for this feature:
Match Name   E-value    Identity   Description
Q9FNN3       1.7e-259   56.02      Transcriptional repressor ILP1 OS=Arabidopsis thaliana OX=3702 GN=ILP1 PE=1 SV=1
Q9Y5B6       3.5e-39    24.54      PAX3- and PAX7-binding protein 1 OS=Homo sapiens OX=9606 GN=PAXBP1 PE=1 SV=2
P58501       2.3e-38    24.71      PAX3- and PAX7-binding protein 1 OS=Mus musculus OX=10090 GN=Paxbp1 PE=1 SV=3
P16383       1.9e-21    23.21      Intron Large complex component GCFC2 OS=Homo sapiens OX=9606 GN=GCFC2 PE=1 SV=2
Q8BKT3       7.6e-18    24.81      Intron Large complex component GCFC2 OS=Mus musculus OX=10090 GN=Gcfc2 PE=1 SV=2

Match Name   E-value    Identity   Description
A0A6J1G0Q4   0.0e+00    100.00     transcriptional repressor ILP1 OS=Cucurbita moschata OX=3662 GN=LOC111449643 PE=...
A0A6J1HXZ2   0.0e+00    98.94      transcriptional repressor ILP1 OS=Cucurbita maxima OX=3661 GN=LOC111467670 PE=3 ...
A0A5D3CCM3   0.0e+00    90.81      PAX3-and PAX7-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3BG51   0.0e+00    90.81      PAX3- and PAX7-binding protein 1 OS=Cucumis melo OX=3656 GN=LOC103489249 PE=3 SV...
A0A0A0KWD3   0.0e+00    90.18      GCFC domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G608340 PE=3 S...

Match Name    E-value    Identity   Description
AT5G08550.1   1.2e-260   56.02      GC-rich sequence DNA-binding factor-like protein
AT5G09210.1   1.4e-96    59.06      GC-rich sequence DNA-binding factor-like protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                            Source       Source Term    Source Description                   Alignment
None       No IPR available                           COILS        Coil           Coil                                 coord: 471..491
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 27..51
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 580..600
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 1..235
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 137..154
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 105..126
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 74..94
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 561..609
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                  coord: 172..206
None       No IPR available                           PANTHER      PTHR12214:SF0  LD29489P                             coord: 1..932
IPR022783  GCF, C-terminal                            PFAM         PF07842        GCFC                                 coord: 636..839; e-value: 1.1E-28; score: 100.7
IPR012890  Intron Large complex component GCFC2-like  PANTHER      PTHR12214      GC-RICH SEQUENCE DNA-BINDING FACTOR  coord: 1..932
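
The alignment column above gives each predicted feature as a `coord: start..end` range on the protein. As a rough sketch, assuming these coordinates are 1-based and inclusive (an assumption, since the page does not state the convention), the span length of a feature can be derived as follows. The helper name `coord_span` is hypothetical.

```python
import re

# Matches the "coord: start..end" fragment used in the alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_span(text):
    """Return (start, end, length) for the first coord range in text.

    Length assumes 1-based inclusive coordinates, which is an assumption
    made for this sketch and is not stated by the source record.
    """
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coord range found in: {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return start, end, end - start + 1


if __name__ == "__main__":
    # The PFAM GCFC domain hit listed above.
    print(coord_span("coord: 636..839"))
```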

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
CmoCh18G010640   CmoCh18G010640   gene


The following exon feature(s) are a part of this mRNA:

Feature Name                 Unique Name                  Type
CmoCh18G010640.1:exon:2849   CmoCh18G010640.1:exon:2849   exon
CmoCh18G010640.1:exon:2850   CmoCh18G010640.1:exon:2850   exon
CmoCh18G010640.1:exon:2851   CmoCh18G010640.1:exon:2851   exon
CmoCh18G010640.1:exon:2852   CmoCh18G010640.1:exon:2852   exon
CmoCh18G010640.1:exon:2853   CmoCh18G010640.1:exon:2853   exon
CmoCh18G010640.1:exon:2854   CmoCh18G010640.1:exon:2854   exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                      Unique Name                       Type
CmoCh18G010640.1:five_prime_utr   CmoCh18G010640.1:five_prime_utr   five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name           Unique Name              Type
CmoCh18G010640.1:cds   CmoCh18G010640.1:cds     CDS
CmoCh18G010640.1:cds   CmoCh18G010640.1:cds_2   CDS
CmoCh18G010640.1:cds   CmoCh18G010640.1:cds_3   CDS
CmoCh18G010640.1:cds   CmoCh18G010640.1:cds_4   CDS
CmoCh18G010640.1:cds   CmoCh18G010640.1:cds_5   CDS
CmoCh18G010640.1:cds   CmoCh18G010640.1:cds_6   CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                       Unique Name                        Type
CmoCh18G010640.1:three_prime_utr   CmoCh18G010640.1:three_prime_utr   three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
CmoCh18G010640.1   CmoCh18G010640.1-protein   polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0000398       mRNA splicing, via spliceosome
cellular_component   GO:0005634       nucleus
molecular_function   GO:0003677       DNA binding