Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
GAAAAAACCCCGCTAGAAATGAAATGCACGAAACAAGAAACCGACTAAAATCCAGGTCCAAAAACCCTAGCAAAGATCCACGGCTTCTGAATCTGAATCCCGACCTTCGCAATTGCCAATTCATTCTGCTCGTTTCGTTTCATCGCACAATTCCTTCGTAATTGTGTCTAGCAATCACCATGTCCGGCAGTCGAGCTAGAAACTTTCGCCGCCGGGCCGACGACAATGACGACGACGACGAACCCAATGGCGCCGCCGCCCCCTCAACCGGTGTTTCAAACGCCTCATCAAAAGCCGCTTCCACCTCCTCTACCGTCGCCAATAAACCCAAGAAGGCTAATCCTCAGGTACCCAAGCTTCTCAGCTTCGCCAGTGACGAAGAGAACGATGCCCCACTTCGCACTTCTTCGAAACCAGCCAATTCCAAGAAGCCTTCTTCTGCTCGACTTGCTAAGCCTTCCTCCACCCACAAGATCACTGCCTTGAAGGATCGCATCGCTCACTCTTCTTCAACGTCGGCTTCTGTTCCCTCCAATGTGCAACCTCAAGCTGGAACTTATACCAAGGAGGCCCTTCGCGAGTTGCAGAAGAATACTCGCACGCTTGCGAGCTCGAGATCCTCATCCGAGTCTAAGCCTTCCGCCGAACCTGTTATTGTTCTGAAGGGTCTTCTCAAGCCTGTCGAACAAATTTCGGACAGTGCTAAAGAAGGTAAAGAGTCGAGCTCCGAAGATGAAGAAGGGGGTAGTAACGAAAAGAGCGCTGGTTCGTTTCGGAGGAGTAAAGAAGACGCCTTAGCTCGAATGGCTTCAATGGGGATTGGCAGAGGAAAGGATTCAACTGGGTCAATTCCCGATCAAGCAACCATTAATGCAATTCGCGCGAAAAGGGAACGTATGCGACAGGCTGGGGTTGCAGCTCCGGATTATATATCCTTAGATGCAGGAAGCAACCGCACGGCGCCGGGGGAGTTGAGCGACGAAGAAACAGAGTTTCCGGGGAGAATCGCCATGATTGGAGGGAAGTCAGCAAGTTCGAAGAAGGGCGTGTTTGAGGAATTTGATGAGCAAGCTATTGATGGGGTAAGAACGAATATTATTGAGCACAGCGATGAGGACGAGGAGGAGAAAATATGGGAGGCAGAGCAGTTTAGGAAGGGACTGGGTAAGAGAATGGATGATGGTTCTACTAGGGTGGAGAGCTCAAGTGTCCCTCTCATTCCGAGTGTTCCGCAGCAGAACTTAATTTACCCCACTACAGCTGGGTATAATTCGGTGCCTAGCATATCTACAGCTACAAGTATTGGAGGTTCTGTTGGTGTTTCACAGGGTTTGGATGGGCTATCAATATCTCAGCAAGCTGAGATTGCTAAAAAAGCTATGCGAGATAATATGGGGAGGCTTAAGGTATGCATATTTTCCCCTCATTTCAGTCTTGTGCTATGGAGGTATAAACGTGAAACTTACAGGAGATTCTAGGAATATGTTAAATGATATTTGAGTTAATAGGGCTTTCAGTATCTTGGTTTATTATATACATGGCATGTGTTACTTGGAATGTAGTTTACTTTGAATAGCCTATCTCCAAAATAATGTGTTATCAATCATGAAGTGCATATTGACTAACGTTGCTTGTGGCTATGGATCTCTGTGTATGCATGACCTCCATTCTCTTTAATACTCAGGAATCTTATCGAAGAACTGCAGCATCTGTGTTGAAGACGGACGAAAATTTGTCTGCATCTCTTTTGAATATCACAGCCCTTGAAAAGTCTCTTTCTGCTGCCGGCGAGAAGTTTATATTTATGCAAAAGCTTCGTGACTTTGTTTCTGTTATCTGTGACTTTTTGCAGGTACAGATATCCTATCTTACGTCCTACATGTTTTATGGAAATAGAAATTTGATTGTCATGCATGTTTGACTAGAACTAGAGAACCACATTACATTTGGTTGTTTATTACATGTTGGCTAAAAGGTGTTAGAAACATGTAGCAACTTTA
TGATGATTGGTTTATTTCGAACTTAATATATGACAAGCTAGATAAAGCCTTATTCTCTTTTAGTTTCTAATTTTCACAAATAACTTGATGCTTCCTATGGATATATAAAGTAGTCCTTCTATGGCTACAGTTAACCGCAGCTTTCGCTCTTTTCTGTTTCTGAAGGTGAAATTAGAACAATTATTGTATGTCATGGACATGTTAGATGGACAAGTCTCAACAGAATTGGCTATTTCACCTTTGTCTGTTGTATCTTTTTATGAGCTATATATATGGCCAAGCAATAAAAAGCTTGTTATGTTACGAATATACACAATGTTACTGTACTTGTAAGATATATGAGTTTCTGAATAGTCTGCCACAGAAAAGAGAGAACAAAAATTATAGGACAGAGTAGAAATTAAATAGCAGTCATGCTTTGAAGATGCCATGTATATATGGTTTTCGAAGTGTTATTACCTTTTGATACTTCCTCTCCCATAGATTTTTCATTTGGATATCTAGCTTACAGTTTGGGCTTTTGTTAGAGACAGTAGGGCTTGGTTGGAGGAGGTGCTTTTGAATTCTTGCTTGAAGGAGAAGGGCAGAGTATAGTGGCAGTCTAGCTTCTTTGCAATGTTGTGCGACATTTTGATTGAGAGGAATAATACTATTTTTAGAGAGGTCGAAAGATATGATGTGAAAGTTTTGGAGGTGGCTAGGTTTAACGCCTTCTTTGGGACGTCAGCCGCACGGCTTTATACAATTATGAGTGTAGTGTCGTTCTTTTGGATTCAAATCCTTTTTTTATAATTAGGGCCAAACTCCTTTTTTGTTCGGGCTTATTTTTCAAATGCTTTTGTCTATACTAACATTTTTCTTCTTGGTTTCTTCCGGAAAGGGGGCTATTGTGTTGCATCATACGTGTTAAGGGATAGAGTTAAAGCAAATACTCAATGTAGCATTTTTTTTCCTGTTCGCATTATTTTGAATGCACTCGGCTATTCTCATGAGACAAATGCTTGACTTTACTACATTTGGTATCAAGGTGACTTGTAACATATTAAATCCTAGGTAGCTGGCCATCATGATTGAACCCATTCCCTTTGGGGCTTTTATTAATTATACATGGCTCATTTTGACCATTGGGCCAACCTAGGGTGGTTTAGTGTATTTGGTAGTTAGCTACGGAGACTTGATCCATTACTTTATTGAAGGCATCTAAGTTAACAACTTTGACTTGAATGCATGAATCAAACATGATAGAAAATGTAGCCGCACCTCAAAAGTCCAAAGTTAGTCGTTTGGTAGTTTATTGAAGGATTGAACCTTGCTGTAGTGAAAAATATGTCATGTTTGATTATTCTTGTTTTGTTGGTCAAGTTTCAAAGCGTCTTTTGCATCTCTTGGACTTTCACTTAGACTGGGATGCTTTACTGAAAGAGTTACGAGGGCATGAAATTTTTTTTTGATCGTTAAGAAAAAGACATTCTTCGTGCCTGCAAAACAGTTTAATTACTAACGCATTTAAGGGGTTTCAATAGTACAAGAGCATTTGGGATGTTTGTAATTTAGATGGTTAGCATGTGCCTTTCCTATTATTAAGACTTGCCTTCCTCTTCACTTTTCCCCCTTGCTGGATATGCTTGTAAACTAGAACTATAGGCCTCTCAACTTTGCGTCACTGAGATGTTAGGTCCTATATTAGGTTGGCTTTTTAATTCAAGCTGTTTTATCTCCTTTCTTTAATTAACTATTTTGTGTGTTTCTGTGCTTTCAGCATAAAGCTCCATTCATAGAGGAGCTTGAGGAGCAGATGCAAAAACTTCACGAAGAACGGGCTTCTACAGTAGTGGAAAGAAGAGTAGCTGATAATGATGATGAAATGGTGGAGATAGAAGCAGCTGTAAAAGCAGCAATGTCAATCTTGAATAAGAAGGGGAGCAGCAATGAAATGATTGCTGCAGCCACAAGTGCTGCCCAGGCAGCAATTGCCTCTGCAAAGGAACAGGCAAATTTAC
CAACAAAGGTAGATGAATTTGGTAGGGATTTAAATTTGCAAAAACGTATGGATATGAAAAGAAGGGCTGAGGCTCGAAAGCGCCGGAGAGCAAAGTACGATTCCAAGAGACTTGCATCCACAGAAGTTGATGGCCATCAAAAAGTAGAAGGAGAGTCTAGCACTGATGAGAGTGATAGTGAGGCTGCAGCTTACCAGTCAAACCATGATTTATTACTTCAGACTGCTGATCAAATTTTTAGTGATGCAGCTGAGGAATTCTCCCAACTTTCTGTGGTGAAACAGAGGTTTGAACAATGGAAGAGAGATTATTCAGCAACGTACCGTGACGCATATATGTCATTAAGCACTGCTGCTATCTTCTCTCCTTATGTGAGATTGGAACTCTTGAAGTGGGATCCCCTGCATGAAAATGCAGATTTTTTTGACATGAACTGGTATGATATTATTAAGTATTACAGTGCTAATATACATAGCATTTATACTCAAGTCAATATGAAAAATAATCATGAATGGTAGAGGAATTTTCTCCGATCAGCTGGTTGTGGTACTTTCTTAGTGAAGTTAAAGTCATACATAATTTAAATTGCTCGTTTACCCATCCTTATTTTGAATTTTCAGGCACTCTTTGCTGTTCAATTATGGTATGCCGGAAGATGGTAGTGATTTTGCTCCAAATGATGCCGATGCTAACCTTGTCCCAGAACTAGTTGAGAAGGTTGCACTTCCAATATTGCACCATGAAGTTGCTCATTGTTGGGACATGCTTAGCACACGTGAAACCCGAAATGCAGCTTTTGCTACTAGCTTGATTACTAACTATGTTCCAACATCAAGTGAAGCTCTTACGGAATTATTGGTTGTTATTCGTACTCGTTTATCAAGCGCTGTTGAAGATCTTACGGTATGAATTTGGTTTTGCACTTTTCAATAATATGGTTATCTTTATATTATATTATCATGGAAACCAACATTTCTGCGTGAATTGAATGCATATATCCTCAACTTCACTGTACTGTGAGCCATTATGCGTTCATGTTTAATTCTCAATGGATTATTTCATCTAATCTGCTGATGTTGGATCATGTTTACAAAATATAGGTTCCTACTTGGAGTGCACTGGTGATGAAAGCTGTTCCAAATGCTGCTCGAATTGCAGCATATCGGTTTGGCATATCCGTTCGTTTGATGAGAAACATATGTTTGTGGAAAGAAATTATTGCATTGCCCATTTTAGAAAAGCTTGCCCTTGAAGAGCTCTTGTATGGGAAAGTTCTACCTCATGTTAGAAGCATCACAGCGAACATACATGATGCAGTCACAAGAACTGAGAGAATCATTGCTTCTCTATCAGGAGTGTGGACAGGCCCCGGCGTCACCGGTGATCGCAGGTTTGGATATTCATACGTTATTCACTCTTTTGTTTGGACCTGAAATTTGGCTGGCTAAATAATATCAGTGAAGGGAATACGCACGTGAACGATGCTTATAAGCGTTTAAGTTGCATTCCTTTTGTTTATGAATTTATCTCTCTGTTTCATACAAAGATGATACCCATAAGCTTCTATACATTTTTGTTTTGCAGTCACAAGTTGCAACCATTGGTAGACTATGTTATGCTACTGGGAAGAACATTGGAGAAAAAACATATTTCAGGCATAGCTGAGAGCGAGACGAGCGGACTAGCTCGGCGATTAAAGAAGATGCTAGTTGAGCTGAATGAATATGACAATGCAAGAGACATTGCTAAGACCTTCCATCTCAGGGAGGCACTATGAGCTCGAACGAGCGTCTGGTGTGATCACAAGATAGGACAATGTACCTTGTGTATATGCTTTTATGGAACATTGATGAAGTTTATTGTCTCAAAGATTATCCTGATTCTTACTGAATTGACATCATTGATCTAGAGAATGCGGCATTAGAGTCTGAAAAGAGCTAACTGCAGAGCTGTGAACCCATTAGCATTTGATGAGTTTATTTGGTACGATCGTTATGGCC
ATTTTTGATAGCTCCCTCCATTTGGTCTTCGTCATTGATACTCGACCTTCATTCTTAGAGCCTGCTAAACACATTAGTTGGGTCAGCCCATTTCTTTAAACTTTCAATTTTTTTCCCATCG
mRNA sequence
GAAAAAACCCCGCTAGAAATGAAATGCACGAAACAAGAAACCGACTAAAATCCAGGTCCAAAAACCCTAGCAAAGATCCACGGCTTCTGAATCTGAATCCCGACCTTCGCAATTGCCAATTCATTCTGCTCGTTTCGTTTCATCGCACAATTCCTTCGTAATTGTGTCTAGCAATCACCATGTCCGGCAGTCGAGCTAGAAACTTTCGCCGCCGGGCCGACGACAATGACGACGACGACGAACCCAATGGCGCCGCCGCCCCCTCAACCGGTGTTTCAAACGCCTCATCAAAAGCCGCTTCCACCTCCTCTACCGTCGCCAATAAACCCAAGAAGGCTAATCCTCAGGTACCCAAGCTTCTCAGCTTCGCCAGTGACGAAGAGAACGATGCCCCACTTCGCACTTCTTCGAAACCAGCCAATTCCAAGAAGCCTTCTTCTGCTCGACTTGCTAAGCCTTCCTCCACCCACAAGATCACTGCCTTGAAGGATCGCATCGCTCACTCTTCTTCAACGTCGGCTTCTGTTCCCTCCAATGTGCAACCTCAAGCTGGAACTTATACCAAGGAGGCCCTTCGCGAGTTGCAGAAGAATACTCGCACGCTTGCGAGCTCGAGATCCTCATCCGAGTCTAAGCCTTCCGCCGAACCTGTTATTGTTCTGAAGGGTCTTCTCAAGCCTGTCGAACAAATTTCGGACAGTGCTAAAGAAGGTAAAGAGTCGAGCTCCGAAGATGAAGAAGGGGGTAGTAACGAAAAGAGCGCTGGTTCGTTTCGGAGGAGTAAAGAAGACGCCTTAGCTCGAATGGCTTCAATGGGGATTGGCAGAGGAAAGGATTCAACTGGGTCAATTCCCGATCAAGCAACCATTAATGCAATTCGCGCGAAAAGGGAACGTATGCGACAGGCTGGGGTTGCAGCTCCGGATTATATATCCTTAGATGCAGGAAGCAACCGCACGGCGCCGGGGGAGTTGAGCGACGAAGAAACAGAGTTTCCGGGGAGAATCGCCATGATTGGAGGGAAGTCAGCAAGTTCGAAGAAGGGCGTGTTTGAGGAATTTGATGAGCAAGCTATTGATGGGGTAAGAACGAATATTATTGAGCACAGCGATGAGGACGAGGAGGAGAAAATATGGGAGGCAGAGCAGTTTAGGAAGGGACTGGGTAAGAGAATGGATGATGGTTCTACTAGGGTGGAGAGCTCAAGTGTCCCTCTCATTCCGAGTGTTCCGCAGCAGAACTTAATTTACCCCACTACAGCTGGGTATAATTCGGTGCCTAGCATATCTACAGCTACAAGTATTGGAGGTTCTGTTGGTGTTTCACAGGGTTTGGATGGGCTATCAATATCTCAGCAAGCTGAGATTGCTAAAAAAGCTATGCGAGATAATATGGGGAGGCTTAAGGAATCTTATCGAAGAACTGCAGCATCTGTGTTGAAGACGGACGAAAATTTGTCTGCATCTCTTTTGAATATCACAGCCCTTGAAAAGTCTCTTTCTGCTGCCGGCGAGAAGTTTATATTTATGCAAAAGCTTCGTGACTTTGTTTCTGTTATCTGTGACTTTTTGCAGCATAAAGCTCCATTCATAGAGGAGCTTGAGGAGCAGATGCAAAAACTTCACGAAGAACGGGCTTCTACAGTAGTGGAAAGAAGAGTAGCTGATAATGATGATGAAATGGTGGAGATAGAAGCAGCTGTAAAAGCAGCAATGTCAATCTTGAATAAGAAGGGGAGCAGCAATGAAATGATTGCTGCAGCCACAAGTGCTGCCCAGGCAGCAATTGCCTCTGCAAAGGAACAGGCAAATTTACCAACAAAGGTAGATGAATTTGGTAGGGATTTAAATTTGCAAAAACGTATGGATATGAAAAGAAGGGCTGAGGCTCGAAAGCGCCGGAGAGCAAAGTACGATTCCAAGAGACTTGCATCCACAGAAGTTGATGGCCATCAAAAAGTAGAAGGAGAGTCTAGCACTGATGAGAGTGATAGTGAGGCT
GCAGCTTACCAGTCAAACCATGATTTATTACTTCAGACTGCTGATCAAATTTTTAGTGATGCAGCTGAGGAATTCTCCCAACTTTCTGTGGTGAAACAGAGGTTTGAACAATGGAAGAGAGATTATTCAGCAACGTACCGTGACGCATATATGTCATTAAGCACTGCTGCTATCTTCTCTCCTTATGTGAGATTGGAACTCTTGAAGTGGGATCCCCTGCATGAAAATGCAGATTTTTTTGACATGAACTGGCACTCTTTGCTGTTCAATTATGGTATGCCGGAAGATGGTAGTGATTTTGCTCCAAATGATGCCGATGCTAACCTTGTCCCAGAACTAGTTGAGAAGGTTGCACTTCCAATATTGCACCATGAAGTTGCTCATTGTTGGGACATGCTTAGCACACGTGAAACCCGAAATGCAGCTTTTGCTACTAGCTTGATTACTAACTATGTTCCAACATCAAGTGAAGCTCTTACGGAATTATTGGTTGTTATTCGTACTCGTTTATCAAGCGCTGTTGAAGATCTTACGGTTCCTACTTGGAGTGCACTGGTGATGAAAGCTGTTCCAAATGCTGCTCGAATTGCAGCATATCGGTTTGGCATATCCGTTCGTTTGATGAGAAACATATGTTTGTGGAAAGAAATTATTGCATTGCCCATTTTAGAAAAGCTTGCCCTTGAAGAGCTCTTGTATGGGAAAGTTCTACCTCATGTTAGAAGCATCACAGCGAACATACATGATGCAGTCACAAGAACTGAGAGAATCATTGCTTCTCTATCAGGAGTGTGGACAGGCCCCGGCGTCACCGGTGATCGCAGTCACAAGTTGCAACCATTGGTAGACTATGTTATGCTACTGGGAAGAACATTGGAGAAAAAACATATTTCAGGCATAGCTGAGAGCGAGACGAGCGGACTAGCTCGGCGATTAAAGAAGATGCTAGTTGAGCTGAATGAATATGACAATGCAAGAGACATTGCTAAGACCTTCCATCTCAGGGAGGCACTATGAGCTCGAACGAGCGTCTGGTGTGATCACAAGATAGGACAATGTACCTTGTGTATATGCTTTTATGGAACATTGATGAAGTTTATTGTCTCAAAGATTATCCTGATTCTTACTGAATTGACATCATTGATCTAGAGAATGCGGCATTAGAGTCTGAAAAGAGCTAACTGCAGAGCTGTGAACCCATTAGCATTTGATGAGTTTATTTGGTACGATCGTTATGGCCATTTTTGATAGCTCCCTCCATTTGGTCTTCGTCATTGATACTCGACCTTCATTCTTAGAGCCTGCTAAACACATTAGTTGGGTCAGCCCATTTCTTTAAACTTTCAATTTTTTTCCCATCG
Coding sequence (CDS)
ATGTCCGGCAGTCGAGCTAGAAACTTTCGCCGCCGGGCCGACGACAATGACGACGACGACGAACCCAATGGCGCCGCCGCCCCCTCAACCGGTGTTTCAAACGCCTCATCAAAAGCCGCTTCCACCTCCTCTACCGTCGCCAATAAACCCAAGAAGGCTAATCCTCAGGTACCCAAGCTTCTCAGCTTCGCCAGTGACGAAGAGAACGATGCCCCACTTCGCACTTCTTCGAAACCAGCCAATTCCAAGAAGCCTTCTTCTGCTCGACTTGCTAAGCCTTCCTCCACCCACAAGATCACTGCCTTGAAGGATCGCATCGCTCACTCTTCTTCAACGTCGGCTTCTGTTCCCTCCAATGTGCAACCTCAAGCTGGAACTTATACCAAGGAGGCCCTTCGCGAGTTGCAGAAGAATACTCGCACGCTTGCGAGCTCGAGATCCTCATCCGAGTCTAAGCCTTCCGCCGAACCTGTTATTGTTCTGAAGGGTCTTCTCAAGCCTGTCGAACAAATTTCGGACAGTGCTAAAGAAGGTAAAGAGTCGAGCTCCGAAGATGAAGAAGGGGGTAGTAACGAAAAGAGCGCTGGTTCGTTTCGGAGGAGTAAAGAAGACGCCTTAGCTCGAATGGCTTCAATGGGGATTGGCAGAGGAAAGGATTCAACTGGGTCAATTCCCGATCAAGCAACCATTAATGCAATTCGCGCGAAAAGGGAACGTATGCGACAGGCTGGGGTTGCAGCTCCGGATTATATATCCTTAGATGCAGGAAGCAACCGCACGGCGCCGGGGGAGTTGAGCGACGAAGAAACAGAGTTTCCGGGGAGAATCGCCATGATTGGAGGGAAGTCAGCAAGTTCGAAGAAGGGCGTGTTTGAGGAATTTGATGAGCAAGCTATTGATGGGGTAAGAACGAATATTATTGAGCACAGCGATGAGGACGAGGAGGAGAAAATATGGGAGGCAGAGCAGTTTAGGAAGGGACTGGGTAAGAGAATGGATGATGGTTCTACTAGGGTGGAGAGCTCAAGTGTCCCTCTCATTCCGAGTGTTCCGCAGCAGAACTTAATTTACCCCACTACAGCTGGGTATAATTCGGTGCCTAGCATATCTACAGCTACAAGTATTGGAGGTTCTGTTGGTGTTTCACAGGGTTTGGATGGGCTATCAATATCTCAGCAAGCTGAGATTGCTAAAAAAGCTATGCGAGATAATATGGGGAGGCTTAAGGAATCTTATCGAAGAACTGCAGCATCTGTGTTGAAGACGGACGAAAATTTGTCTGCATCTCTTTTGAATATCACAGCCCTTGAAAAGTCTCTTTCTGCTGCCGGCGAGAAGTTTATATTTATGCAAAAGCTTCGTGACTTTGTTTCTGTTATCTGTGACTTTTTGCAGCATAAAGCTCCATTCATAGAGGAGCTTGAGGAGCAGATGCAAAAACTTCACGAAGAACGGGCTTCTACAGTAGTGGAAAGAAGAGTAGCTGATAATGATGATGAAATGGTGGAGATAGAAGCAGCTGTAAAAGCAGCAATGTCAATCTTGAATAAGAAGGGGAGCAGCAATGAAATGATTGCTGCAGCCACAAGTGCTGCCCAGGCAGCAATTGCCTCTGCAAAGGAACAGGCAAATTTACCAACAAAGGTAGATGAATTTGGTAGGGATTTAAATTTGCAAAAACGTATGGATATGAAAAGAAGGGCTGAGGCTCGAAAGCGCCGGAGAGCAAAGTACGATTCCAAGAGACTTGCATCCACAGAAGTTGATGGCCATCAAAAAGTAGAAGGAGAGTCTAGCACTGATGAGAGTGATAGTGAGGCTGCAGCTTACCAGTCAAACCATGATTTATTACTTCAGACTGCTGATCAAATTTTTAGTGATGCAGCTGAGGAATTCTCCCAACTTTCTGTGGTGAAACAGAGGTTTGAACAATGGAAGAGAGATTATTCAGCAACGTACCGTGACGCATATATGTCATTAAGCACTGCTGCTATCTTCTC
TCCTTATGTGAGATTGGAACTCTTGAAGTGGGATCCCCTGCATGAAAATGCAGATTTTTTTGACATGAACTGGCACTCTTTGCTGTTCAATTATGGTATGCCGGAAGATGGTAGTGATTTTGCTCCAAATGATGCCGATGCTAACCTTGTCCCAGAACTAGTTGAGAAGGTTGCACTTCCAATATTGCACCATGAAGTTGCTCATTGTTGGGACATGCTTAGCACACGTGAAACCCGAAATGCAGCTTTTGCTACTAGCTTGATTACTAACTATGTTCCAACATCAAGTGAAGCTCTTACGGAATTATTGGTTGTTATTCGTACTCGTTTATCAAGCGCTGTTGAAGATCTTACGGTTCCTACTTGGAGTGCACTGGTGATGAAAGCTGTTCCAAATGCTGCTCGAATTGCAGCATATCGGTTTGGCATATCCGTTCGTTTGATGAGAAACATATGTTTGTGGAAAGAAATTATTGCATTGCCCATTTTAGAAAAGCTTGCCCTTGAAGAGCTCTTGTATGGGAAAGTTCTACCTCATGTTAGAAGCATCACAGCGAACATACATGATGCAGTCACAAGAACTGAGAGAATCATTGCTTCTCTATCAGGAGTGTGGACAGGCCCCGGCGTCACCGGTGATCGCAGTCACAAGTTGCAACCATTGGTAGACTATGTTATGCTACTGGGAAGAACATTGGAGAAAAAACATATTTCAGGCATAGCTGAGAGCGAGACGAGCGGACTAGCTCGGCGATTAAAGAAGATGCTAGTTGAGCTGAATGAATATGACAATGCAAGAGACATTGCTAAGACCTTCCATCTCAGGGAGGCACTATGA
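The CDS is a contiguous stretch of the mRNA, flanked by the five_prime_UTR and three_prime_UTR named in the legend. As a minimal sketch (the function name `split_utrs` is hypothetical and not part of this database), the two UTRs can be recovered by locating the CDS inside the mRNA. The sequences are wrapped across several lines on this page, so whitespace is removed first.

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str]:
    """Return (five_prime_utr, three_prime_utr) for a CDS located in an mRNA.

    Illustrative helper: assumes the CDS occurs as one contiguous
    substring of the mRNA and uses its first occurrence.
    """
    # Join wrapped lines and normalise case before searching.
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[end:]


if __name__ == "__main__":
    # Toy input, not the real record: 4 nt 5' UTR, 9 nt CDS, 2 nt 3' UTR.
    five, three = split_utrs("GGCAATG\nAAATGACC", "ATGAAATGA")
    print(len(five), len(three))
```

To apply it to this record, pass the full mRNA and CDS text from the sections above as the two arguments.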
Protein sequence
MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKLLSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNVQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERMRQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESSTDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL
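The protein sequence corresponds to the CDS read codon by codon up to the stop codon. The sketch below assumes the standard nuclear genetic code and uses a hypothetical helper named `translate`; it is meant as a quick consistency check, not as the pipeline that produced this page.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # join wrapped lines
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First nine codons of the CDS above.
    print(translate("ATGTCCGGCAGTCGAGCTAGAAACTTT"))  # MSGSRARNF
```

The first nine codons of the CDS give MSGSRARNF, and its last four codons (GAG GCA CTA TGA) give EAL followed by a stop, which agrees with the start and end of the protein sequence listed above.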
Homology
BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match:
Q9FNN3 (Transcriptional repressor ILP1 OS=Arabidopsis thaliana OX=3702 GN=ILP1 PE=1 SV=1)
HSP 1 Score: 897.1 bits (2317), Expect = 1.7e-259
Identity = 540/964 (56.02%), Positives = 691/964 (71.68%), Query Frame = 0
Query: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
M +R +NFRRR DD D+ + A S S SS T S A+ PKK KL
Sbjct: 1 MGSNRPKNFRRRGDDGGDEIDGKVATPSSKPTSTLSSSKPKTLS--ASAPKK------KL 60
Query: 61 LSFASD--EENDAPLRTSSKPANSKK--PSSARLAKPSSTHKITALKDRIAHSSSTSASV 120
LSFA D EE D R + KP N + SS+RL S+H+ ++ K+R S
Sbjct: 61 LSFADDEEEEEDGAPRVTIKPKNGRDRVKSSSRLGVSGSSHRHSSTKERRPAS------- 120
Query: 121 PSNVQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLK-PVEQISDSA 180
SNV PQAG+Y+KEAL ELQKNTRTL SRSS+ +AEP +VLKGL+K P + S
Sbjct: 121 -SNVLPQAGSYSKEALLELQKNTRTLPYSRSSA----NAEPKVVLKGLIKPPQDHEQQSL 180
Query: 181 KEGKESSSE---DEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINA 240
K+ + S+ DEEG + EDA A DQA I
Sbjct: 181 KDVVKQVSDLDFDEEGEEEQ---------HEDAFA------------------DQAAI-- 240
Query: 241 IRAKRERMRQAGVA-APDYISLDAG-SNRTAPGELSDEETEFPGRIAMIGGK-SASSKKG 300
IRAK+ERMRQ+ A APDYISLD G N +A +SDE+ +F G +G + KKG
Sbjct: 241 IRAKKERMRQSRSAPAPDYISLDGGIVNHSAVEGVSDEDADFQG--IFVGPRPQKDDKKG 300
Query: 301 VFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSS---VPL 360
VF+ DE T + DEDEE+K+WE EQF+KG+GKRMD+GS R +S+ VPL
Sbjct: 301 VFDFGDENPTAKETTTSSIYEDEDEEDKLWEEEQFKKGIGKRMDEGSHRTVTSNGIGVPL 360
Query: 361 ---IPSVPQQN-LIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMR 420
++PQQ +Y AG +P++S A +IG + V D L +SQQAE+AKKA++
Sbjct: 361 HSKQQTLPQQQPQMYAYHAG-TPMPNVSVAPTIGPATSV----DTLPMSQQAELAKKALK 420
Query: 421 DNMGRLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICD 480
DN+ +LKES+ +T +S+ KTDENL+ASL++ITALE SLSAAG+K++FMQKLRDF+SVICD
Sbjct: 421 DNVKKLKESHAKTLSSLTKTDENLTASLMSITALESSLSAAGDKYVFMQKLRDFISVICD 480
Query: 481 FLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSN 540
F+Q+K IEE+E+QM++L+E+ A +++ERR+ADN+DEM+E+ AAVKAAM++LNK GSS+
Sbjct: 481 FMQNKGSLIEEIEDQMKELNEKHALSILERRIADNNDEMIELGAAVKAAMTVLNKHGSSS 540
Query: 541 EMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKR 600
+IAAAT AA AA S ++Q N P K+DEFGRD NLQKR ++++RA AR++RRA++++KR
Sbjct: 541 SVIAAATGAALAASTSIRQQMNQPVKLDEFGRDENLQKRREVEQRAAARQKRRARFENKR 600
Query: 601 LASTEVDGHQ-KVEGESSTDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQR 660
++ EVDG K+EGESSTDESD+E +AY+ D LLQ AD++FSDA+EE+SQLS VK R
Sbjct: 601 ASAMEVDGPSLKIEGESSTDESDTETSAYKETRDSLLQCADKVFSDASEEYSQLSKVKAR 660
Query: 661 FEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMP 720
FE+WKRDYS+TYRDAYMSL+ +IFSPYVRLELLKWDPLH++ DFFDM WH LLF+YG P
Sbjct: 661 FERWKRDYSSTYRDAYMSLTVPSIFSPYVRLELLKWDPLHQDVDFFDMKWHGLLFDYGKP 720
Query: 721 EDGSDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPT 780
EDG DFAP+D DANLVPELVEKVA+PILHH++ CWD+LSTRETRNA ATSL+TNYV
Sbjct: 721 EDGDDFAPDDTDANLVPELVEKVAIPILHHQIVRCWDILSTRETRNAVAATSLVTNYVSA 780
Query: 781 SSEALTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLW 840
SSEAL EL IR RL A+ ++VPTW LV+KAVPN ++AAYRFG SVRLMRNIC+W
Sbjct: 781 SSEALAELFAAIRARLVEAIAAISVPTWDPLVLKAVPNTPQVAAYRFGTSVRLMRNICMW 840
Query: 841 KEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDR 900
K+I+ALP+LE LAL +LL+GKVLPHVRSI +NIHDAVTRTERI+ASLSGVWTGP VT
Sbjct: 841 KDILALPVLENLALSDLLFGKVLPHVRSIASNIHDAVTRTERIVASLSGVWTGPSVTRTH 900
Query: 901 SHKLQPLVDYVMLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL 946
S LQPLVD + L R LEK+ SG+ ++ET+GLARRLK++LVEL+E+D+AR+I +TF+L
Sbjct: 901 SRPLQPLVDCTLTLRRILEKRLGSGLDDAETTGLARRLKRILVELHEHDHAREIVRTFNL 908
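Each hit begins with an HSP summary that reports identities and positives as counts over the alignment length, together with a percentage. The sketch below, using a hypothetical `parse_hsp` helper, extracts those counts from one summary line and recomputes the percentages as a sanity check. The regular expression is an assumption based on the line layout seen on this page.

```python
import re

# Matches e.g. "Identity = 540/964 (56.02%), Positives = 691/964 (71.68%)".
# "Posi?tives" also accepts the "Postives" spelling seen in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(line: str) -> dict:
    """Parse one HSP summary line and recompute its percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP identity line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        "identity_pct_reported": float(ident_pct),
        "positive_pct_reported": float(pos_pct),
        # Recomputed from the raw counts, rounded like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    line = "Identity = 540/964 (56.02%), Positives = 691/964 (71.68%), Query Frame = 0"
    print(parse_hsp(line))
```

For the Q9FNN3 hit, 540/964 and 691/964 recompute to 56.02% and 71.68%, matching the reported values.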
BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match:
Q9Y5B6 (PAX3- and PAX7-binding protein 1 OS=Homo sapiens OX=9606 GN=PAXBP1 PE=1 SV=2)
HSP 1 Score: 165.2 bits (417), Expect = 3.5e-39
Identity = 240/978 (24.54%), Positives = 412/978 (42.13%), Query Frame = 0
Query: 4 SRARNFRRRADDND-----DDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVP 63
+R N R+R D + D+++ P G + + P P
Sbjct: 5 ARRVNVRKRNDSEEEERERDEEQEPPPLLPPPGTGEEAGPGGGDRAPGGESLLGPGPSPP 64
Query: 64 KLLSFASDEENDAPLRTSSKPANSKKP-SSARLAKPSSTHKITALKDRIAHSSSTSASVP 123
L+ E ++P N KP R K + + +D +
Sbjct: 65 SALTPGLGAEAGGGFPGGAEPGNGLKPRKRPRENKEVPRASLLSFQDEEEENEEV----- 124
Query: 124 SNVQPQAGTYTKEALRELQKNTR-TLASSRSSSESKPSAEPVIVL--KGLLKPVEQ---- 183
+ + +Y+K+ ++ L+K + L S+ +E SAE L G +K Q
Sbjct: 125 --FKVKKSSYSKKIVKLLKKEYKEDLEKSKIKTELNSSAESEQPLDKTGHVKDTNQEDGV 184
Query: 184 -ISDSAKEGKESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQAT 243
IS+ ++ + SE EE K+ G+F + ++S+ + R G IPD A
Sbjct: 185 IISEHGEDEMDMESEKEE--EKPKTGGAFSNA-------LSSLNVLR----PGEIPDAAF 244
Query: 244 INAIRAKRERMRQAGVAAPDYISLDAGS-NRTAPGELSDEETEFPGRIAMIGGKSASSKK 303
I+A R KR+ R+ G P G R + SD+E + R + K S ++
Sbjct: 245 IHAARKKRQMARELGDFTPHDNEPGKGRLVREDENDASDDEDDDEKRRIVFSVKEKSQRQ 304
Query: 304 GVFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIP 363
+ EE I+G + + ++DEE WE EQ RKG+ +V++S P
Sbjct: 305 KIAEEI---GIEGSDDDALVTGEQDEELSRWEQEQIRKGI------NIPQVQASQ-PAEV 364
Query: 364 SVPQQNLIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGL---------SISQQAEIAKK 423
++ QN Y T +S + T+ G S SQ D ++ KK
Sbjct: 365 NMYYQN-TYQTMPYGSSYGIPYSYTAYGSSDAKSQKTDNTVPFKTPSNEMTPVTIDLVKK 424
Query: 424 AMRDNMGRLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSV 483
++D + +KE ++ K ++ S I LE S GE++ F+Q++R +V
Sbjct: 425 QLKDRLDSMKELHKTNRQQHEKHLQSRVDSTRAIERLEGSSGGIGERYKFLQEMRGYVQD 484
Query: 484 ICDFLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKG 543
+ + K P I ELE + +L+++RAS +V+RR D DE E
Sbjct: 485 LLECFSEKVPLINELESAIHQLYKQRASRLVQRRQDDIKDESSE---------------- 544
Query: 544 SSNEMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYD 603
+S +A + +D FGRD L + KRR R+ RR +
Sbjct: 545 ----------------FSSHSNKALMAPNLDSFGRDRALYQE-HAKRRIAEREARRTRRR 604
Query: 604 SKRLASTEVDGHQKVEGESSTDESDS-EAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVV 663
R + ++ H +EG SS DE S + + D + + + ++F D E F + +
Sbjct: 605 QAREQTGKMADH--LEGLSSDDEETSTDITNFNLEKDRISKESGKVFEDVLESFYSIDCI 664
Query: 664 KQRFEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENA-DFFDMNWHSLLFN 723
K +FE W+ Y +Y+DAY+ L +F+P +RL+LL W PL DF +M W L
Sbjct: 665 KSQFEAWRSKYYTSYKDAYIGLCLPKLFNPLIRLQLLTWTPLEAKCRDFENMLWFESLLF 724
Query: 724 YGMPEDGSDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITN 783
YG E + +D D L+P +VEKV LP L + WD ST +T T + N
Sbjct: 725 YGCEEREQE--KDDVDVALLPTIVEKVILPKLTVIAENMWDPFSTTQTSRMVGITLKLIN 784
Query: 784 YVPTSSEA--------LTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYR-F 843
P+ A L LL+ +R L +D+ +P + V++ + + R F
Sbjct: 785 GYPSVVNAENKNTQVYLKALLLRMRRTLD---DDVFMPLYPKNVLENKNSGPYLFFQRQF 844
Query: 844 GISVRLMRNICLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASL 903
SV+L+ N W I + L++L+++ LL +L ++ D++ + + +I
Sbjct: 845 WSSVKLLGNFLQWYGIFSNKTLQELSIDGLLNRYILMAFQNSEYG-DDSIKKAQNVINCF 904
Query: 904 SGVWTGPGVTGDRS-HKLQPLVDYVMLLGRTLEKKHI--SGIAESETSGLARRLKKMLVE 944
W + G+R+ +L+ Y++ L T+ + I S + + +++ K+L
Sbjct: 905 PKQWF-MNLKGERTISQLENFCRYLVHLADTIYRNSIGCSDVEKRNARENIKQIVKLLAS 909
BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match:
P58501 (PAX3- and PAX7-binding protein 1 OS=Mus musculus OX=10090 GN=Paxbp1 PE=1 SV=3)
HSP 1 Score: 162.5 bits (410), Expect = 2.3e-38
Identity = 235/951 (24.71%), Positives = 399/951 (41.96%), Query Frame = 0
Query: 22 PNGAAAPSTGVSNASSKAASTSSTVANKPKK---ANPQVPK--LLSFASDEENDAPLRTS 81
P A P G + KP+K N +VP+ LLSF +EE + +
Sbjct: 65 PPSAHHPGLGAEAGGGISGGAEPGNGLKPRKRPRENKEVPRASLLSFQDEEEENEEVFKV 124
Query: 82 SKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNVQPQAGTYTKEALRELQ 141
K + SKK +L K +K K +I +T+A
Sbjct: 125 KKSSYSKK--IVKLLK--KEYKEDLEKSKIKTELNTAAD--------------------- 184
Query: 142 KNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKESSSEDEEGGSNEKSAG 201
+ + L + + ++ P V IS+ ++ + SE EE K+ G
Sbjct: 185 -SDQPLDKTCHAKDTNPEDGVV------------ISEHGEDEMDMESEKEE--EKPKAGG 244
Query: 202 SFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERMRQAGVAAPDYISLDAG 261
+F + ++S+ + R G IPD A I+A R KR+ R+ G P G
Sbjct: 245 AFSNA-------LSSLNVLR----PGEIPDAAFIHAARKKRQLARELGDFTPHDSEPGKG 304
Query: 262 S-NRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAIDGVRTNIIEHSDEDEE 321
R + SD+E + R + K S ++ + EE I+G + + ++DEE
Sbjct: 305 RLVREDENDASDDEDDDEKRRIVFSVKEKSQRQKIAEEI---GIEGSDDDALVTGEQDEE 364
Query: 322 EKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTTAGYNSVPSISTATSI 381
WE EQ RKG+ S + S V + Q + Y + G +P + T+
Sbjct: 365 LSRWEQEQIRKGINIPQVQAS---QPSEVNVYYQNTYQTMPYGASYG---IP--YSYTAY 424
Query: 382 GGSVGVSQGLDGL---------SISQQAEIAKKAMRDNMGRLKESYRRTAASVLKTDENL 441
G S SQ D ++ K+ ++D + +KE ++ K ++
Sbjct: 425 GSSDAKSQKTDNTVPFKTPSNEMAPVTIDLVKRQLKDRLDSMKELHKTNQQQHEKHLQSR 484
Query: 442 SASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERA 501
S I LE S GE++ F+Q++R +V + + K P I ELE + +L+++RA
Sbjct: 485 VDSTRAIERLEGSSGGIGERYKFLQEMRGYVQDLLECFSEKVPLINELESAIHQLYKQRA 544
Query: 502 STVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAKEQANLP 561
S +V+RR D DE E +S +A +
Sbjct: 545 SRLVQRRQDDIKDESSE--------------------------------FSSHSNKALMA 604
Query: 562 TKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESSTDESDS- 621
+D FGRD L + KRR R+ RR + R + ++ H +EG SS DE S
Sbjct: 605 PNLDSFGRDRALYQE-HAKRRIAEREARRTRRRQAREQTGQMADH--LEGLSSDDEETST 664
Query: 622 EAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSLSTAAI 681
+ + D +L+ + ++F D E F + +K +FE W+ Y +Y+DAY+ L +
Sbjct: 665 DITNFNLEKDRILKESSKVFEDVLESFYSIDCIKAQFEAWRSKYYMSYKDAYIGLCLPKL 724
Query: 682 FSPYVRLELLKWDPLHENA-DFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEKV 741
F+P +RL+LL W PL DF M W L YG + + ++AD L+P +VEKV
Sbjct: 725 FNPLIRLQLLTWTPLEAKCRDFETMLWFESLLFYGCEDREQE--KDEADVALLPTIVEKV 784
Query: 742 ALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEA--------LTELLVVIRTR 801
LP L WD ST +T T + N P+ A L LL+ +R
Sbjct: 785 ILPKLTVIAETMWDPFSTTQTSRMVGITMKLINGYPSVVNADNKNTQVYLKALLLRMRRT 844
Query: 802 LSSAVEDLTVPTWSALVMKAVPNAARIAAYR-FGISVRLMRNICLWKEIIALPILEKLAL 861
L +D+ +P + V++ + + R F SV+L+ N W I + L++L++
Sbjct: 845 LD---DDVFMPLYPKNVLENKNSGPYLFFQRQFWSSVKLLGNFLQWYGIFSNKTLQELSI 904
Query: 862 EELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRS-HKLQPLVDYVML 921
+ LL +L ++ D++ + + +I W + G+R+ +L+ Y++
Sbjct: 905 DGLLNRYILMAFQNSEYG-DDSIRKAQNVINCFPKQWF-VNLKGERTISQLENFCRYLVH 911
Query: 922 LGRTLEKKHI--SGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLRE 944
L T+ + I S + + +++ K+L + D+A +A +++E
Sbjct: 965 LADTIYRNSIGCSDVEKRNARENIKQIVKLLASVRALDHAISVASDHNVKE 911
BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match:
P16383 (Intron Large complex component GCFC2 OS=Homo sapiens OX=9606 GN=GCFC2 PE=1 SV=2)
HSP 1 Score: 106.3 bits (264), Expect = 1.9e-21
Identity = 182/784 (23.21%), Positives = 317/784 (40.43%), Query Frame = 0
Query: 175 AKEGKESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGK-DSTGSIPDQATINAI 234
A EG ES + D +K S + L+ +S +G + ST IPD A I A
Sbjct: 84 ADEGSESRTLDVSTDEEDKIHHSSESKDDQGLSSDSSSSLGEKELSSTVKIPDAAFIQAA 143
Query: 235 RAKRERMRQAGVAAPDYISLD-------AGSNRTAPGELSDEETEFPGRIAMIGGKSASS 294
R KRE R A DYISLD +G R + + E + RI
Sbjct: 144 RRKRELAR----AQDDYISLDVQHTSSISGMKRESEDDPESEPDDHEKRIPF-----TLR 203
Query: 295 KKGVFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPL 354
+ + + E++I E S EDE++ WE +Q RK
Sbjct: 204 PQTLRQRMAEESISR-NEETSEESQEDEKQDTWEQQQMRKA------------------- 263
Query: 355 IPSVPQQNLIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMG 414
+ + ++++ G + V T+ S EI KK + +
Sbjct: 264 VKIIEERDIDLSCGNGSSKVKKFDTSISFP--------------PVNLEIIKKQLNTRLT 323
Query: 415 RLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQH 474
L+E++R K +++ +S I LE S + A F + ++ +V + D L
Sbjct: 324 LLQETHRSHLREYEKYVQDVKSSKSTIQNLESSSNQA-LNCKFYKSMKIYVENLIDCLNE 383
Query: 475 KAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIA 534
K I+E+E M L ++A T ++RR DE+ K + L + +E
Sbjct: 384 KIINIQEIESSMHALLLKQAMTFMKRR----QDEL-------KHESTYLQQLSRKDE--- 443
Query: 535 AATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLAST 594
TS + K Q L E + RR K R+ S
Sbjct: 444 --TSTSGNFSVDEKTQWIL-----------------------EEIESRRTKRRQARVLSG 503
Query: 595 EVDGHQKVEGESSTDESDS-EAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQW 654
+ HQ EG SS DE S E +Q + +LQ ++F + ++F + + +F+QW
Sbjct: 504 NCN-HQ--EGTSSDDELPSAEMIDFQKSQGDILQKQKKVFEEVQDDFCNIQNILLKFQQW 563
Query: 655 KRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLH-ENADFFDMNWHSLLFNYGMPEDG 714
+ + +Y +A++SL + +P +R++L+ W+PL E+ +M W + +
Sbjct: 564 REKFPDSYYEAFISLCIPKLLNPLIRVQLIDWNPLKLESTGLKEMPWFKSVEEFMDSSVE 623
Query: 715 SDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTS-- 774
+ +D ++ ++ K +P L V WD LST +T + +I T
Sbjct: 624 DSKKESSSDKKVLSAIINKTIIPRLTDFVEFLWDPLSTSQTTSLITHCRVILEEHSTCEN 683
Query: 775 --SEALTELLVVIRTRLSSAVE-DLTVPTW--SALVMKAVPNAARIAAYRFGISVRLMRN 834
S++ +LL I +R+ AVE D+ +P + SA+ K P+ ++ +F ++L RN
Sbjct: 684 EVSKSRQDLLKSIVSRMKKAVEDDVFIPLYPKSAVENKTSPH-SKFQERQFWSGLKLFRN 743
Query: 835 ICLWKEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGV 894
I LW ++ L++L L +LL ++ + + T D V + ++ A L W
Sbjct: 744 ILLWNGLLTDDTLQELGLGKLLNRYLIIALLNATPG-PDVVKKCNQVAACLPEKWFENSA 771
Query: 895 TGDRSHKLQPLVDYVMLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAK 942
+L+ + ++ L+ H ++ SE + +LV++ + A
Sbjct: 804 MRTSIPQLENFIQFL------LQSAH--KLSRSEFRDEVEEIILILVKIKALNQAESFIG 771
BLAST of CmoCh18G010640.1 vs. ExPASy Swiss-Prot
Match:
Q8BKT3 (Intron Large complex component GCFC2 OS=Mus musculus OX=10090 GN=Gcfc2 PE=1 SV=2)
HSP 1 Score: 94.4 bits (233), Expect = 7.6e-18
Identity = 200/806 (24.81%), Positives = 330/806 (40.94%), Query Frame = 0
Query: 92 KPSSTHKITALKDRIAHSSSTSASVPSNVQP-QAGTYTKEA-----LRELQKNTRTLASS 151
+P T + ++ + S S A S +P AG T+ A R + R ASS
Sbjct: 4 RPQRTFRRRQVESSDSDSDSDGAKEQSAEEPASAGGRTEGAERPRGARSARGRGRVWASS 63
Query: 152 RSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKESSSEDEEGGSNEKSAGSFRRSKEDA 211
R S + P + + E S+++EEG + R D+
Sbjct: 64 RRSPGAAPRGD---------------GGAECRTAELSTDEEEGTHTLTGSKGDRSPSSDS 123
Query: 212 LARMASMGIGRGKDSTGSIPDQATINAIRAKRERMRQAGVAAPDYISLDAG-SNRTAPGE 271
+ R IPD A I A R KRE R G DYISLD S T+ +
Sbjct: 124 SCSLEE----RDVSPIVEIPDAAFIQAARRKRELARTPG----DYISLDVNHSCSTSDCK 183
Query: 272 LSDEE------TEFPGRIAMIGGKSASSKKGVFEEFDEQAIDGVRTNIIEHSDEDEEEKI 331
S+EE + RI + K + ++ + EE ++ + E S EDE + I
Sbjct: 184 RSNEEDPESDPDDHEKRI-LFTPKPQTLRQRMAEETSIRSEES-----SEESQEDENQDI 243
Query: 332 WEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTTAGYNSVPSISTATSIGGS 391
WE +Q RK + +P AG N+ S S+ +
Sbjct: 244 WEQQQMRKAV--------------------RIP---------AGQNTDLSHSSKSQTLKK 303
Query: 392 VGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVLKTDENLSASLLNITALEK 451
S +++ EI KK + + + L+ES+R K ++++ +S I LE
Sbjct: 304 FDTSISFPPVNL----EIIKKQLNNRLTLLQESHRSHQREYEKYEQDIKSSKTAIQNLE- 363
Query: 452 SLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHEERASTVVERRVADND 511
S S + + F + ++ +V I D L K I ELE M L +R+ +++RR
Sbjct: 364 SASDHAQNYRFYRGMKSYVENIIDCLNEKIVSIVELESSMYTLLLKRSEALLKRR----Q 423
Query: 512 DEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNL 571
DE+ K S L + +E TSA + K+Q L
Sbjct: 424 DEL-------KCESSYLQQLSRKDE-----TSANGSLAVDEKDQRIL------------- 483
Query: 572 QKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESSTDE-SDSEAAAYQSNHDLL 631
EAR+ +R + R S D HQ EG SS DE S +E + +
Sbjct: 484 -------EEIEARRMQRRQ---ARELSGSCD-HQ--EGMSSDDELSPAEMTNFHKCQGDI 543
Query: 632 LQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKW 691
LQ ++F D ++F + + +F+QW+ + +Y +A++ + SP +R++LL W
Sbjct: 544 LQDCKKVFEDVHDDFCNVQNILLKFQQWREKFPDSYYEAFVGFCLPKLLSPLIRVQLLDW 603
Query: 692 DPLHENADFFD-MNWHSLLFNYGMPEDGSDFAPND-ADANLVPELVEKVALPILHHEVAH 751
+PL ++ D M W + + + M D D +D ++ ++ K +P L V
Sbjct: 604 NPLKMDSIGLDKMPWFTAITEF-MESSMDDIGKEDGSDKKILAAVINKTVVPRLTDFVET 663
Query: 752 CWDMLSTRETRN------AAFATSLITNYVPTSSEALTELLVVIRTRLSSAVE-DLTVPT 811
WD LST +TR+ AF N V + + +LL I R+ ++E D+ +P
Sbjct: 664 IWDPLSTSQTRSLTVHCRVAFEQFASENEVSKNKQ---DLLKSIVARMKKSIEDDIFIPL 698
Query: 812 W--SALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLYGKVLPH 871
+ S+ K P+ ++ +F +++L RNI LW ++ L+ L L +LL ++
Sbjct: 724 YPKSSEEGKMSPH-SKFQERQFWGALKLFRNILLWNGLLPDDTLQDLGLGKLLNRYLIIS 698
Query: 872 VRSITANIHDAVTRTERIIASLSGVW 873
+ + D V + +I A L W
Sbjct: 784 LTNAVPG-PDVVKKCSQIAACLPERW 698
BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match:
A0A6J1G0Q4 (transcriptional repressor ILP1 OS=Cucurbita moschata OX=3662 GN=LOC111449643 PE=3 SV=1)
HSP 1 Score: 1749.6 bits (4530), Expect = 0.0e+00
Identity = 945/945 (100.00%), Positives = 945/945 (100.00%), Query Frame = 0
Query: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL
Sbjct: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
Query: 61 LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV 120
LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV
Sbjct: 61 LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV 120
Query: 121 QPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE 180
QPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE
Sbjct: 121 QPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE 180
Query: 181 SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM 240
SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM
Sbjct: 181 SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM 240
Query: 241 RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID 300
RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID
Sbjct: 241 RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID 300
Query: 301 GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTT 360
GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTT
Sbjct: 301 GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTT 360
Query: 361 AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL 420
AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL
Sbjct: 361 AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL 420
Query: 421 KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 480
KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK
Sbjct: 421 KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 480
Query: 481 LHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK 540
LHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK
Sbjct: 481 LHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK 540
Query: 541 EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST 600
EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST
Sbjct: 541 EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST 600
Query: 601 DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL 660
DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL
Sbjct: 601 DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL 660
Query: 661 STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL 720
STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL
Sbjct: 661 STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL 720
Query: 721 VEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSA 780
VEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSA
Sbjct: 721 VEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSA 780
Query: 781 VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY 840
VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY
Sbjct: 781 VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY 840
Query: 841 GKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE 900
GKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE
Sbjct: 841 GKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE 900
Query: 901 KKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
KKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL
Sbjct: 901 KKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 945
BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match:
A0A6J1HXZ2 (transcriptional repressor ILP1 OS=Cucurbita maxima OX=3661 GN=LOC111467670 PE=3 SV=1)
HSP 1 Score: 1732.2 bits (4485), Expect = 0.0e+00
Identity = 935/945 (98.94%), Positives = 940/945 (99.47%), Query Frame = 0
Query: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSK ASTSSTVANKPKKANPQVPKL
Sbjct: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKPASTSSTVANKPKKANPQVPKL 60
Query: 61 LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV 120
LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV
Sbjct: 61 LSFASDEENDAPLRTSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSNV 120
Query: 121 QPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE 180
QPQAG YT+EALRELQKNTRTLASSR+SSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE
Sbjct: 121 QPQAGIYTEEALRELQKNTRTLASSRASSESKPSAEPVIVLKGLLKPVEQISDSAKEGKE 180
Query: 181 SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM 240
SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM
Sbjct: 181 SSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINAIRAKRERM 240
Query: 241 RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID 300
RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID
Sbjct: 241 RQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQAID 300
Query: 301 GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYPTT 360
GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSV QQNLIYPTT
Sbjct: 301 GVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVLQQNLIYPTT 360
Query: 361 AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL 420
AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL
Sbjct: 361 AGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAASVL 420
Query: 421 KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 480
KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK
Sbjct: 421 KTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQK 480
Query: 481 LHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK 540
LHEERASTVVERRVADNDDEMVEI+AAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK
Sbjct: 481 LHEERASTVVERRVADNDDEMVEIDAAVKAAMSILNKKGSSNEMIAAATSAAQAAIASAK 540
Query: 541 EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST 600
EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST
Sbjct: 541 EQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGESST 600
Query: 601 DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL 660
DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL
Sbjct: 601 DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYMSL 660
Query: 661 STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL 720
STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL
Sbjct: 661 STAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPEL 720
Query: 721 VEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLSSA 780
VEKVALPILHHE+AHCWDMLSTRETRNAAFATSLITNYVPTSSEAL ELLVVIRTRLSSA
Sbjct: 721 VEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPTSSEALMELLVVIRTRLSSA 780
Query: 781 VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY 840
VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY
Sbjct: 781 VEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEELLY 840
Query: 841 GKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE 900
GKVLPHVRSITANIHDAVTRTERIIASL GVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE
Sbjct: 841 GKVLPHVRSITANIHDAVTRTERIIASLLGVWTGPGVTGDRSHKLQPLVDYVMLLGRTLE 900
Query: 901 KKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
KKHISG+AESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL
Sbjct: 901 KKHISGVAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 945
BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match:
A0A5D3CCM3 (PAX3-and PAX7-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00700 PE=3 SV=1)
HSP 1 Score: 1592.0 bits (4121), Expect = 0.0e+00
Identity = 860/947 (90.81%), Positives = 900/947 (95.04%), Query Frame = 0
Query: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
MSGSRARNFRRRADDNDDDDEPNG+ APS SNASSK +STSS VA KPKKANPQ PKL
Sbjct: 1 MSGSRARNFRRRADDNDDDDEPNGSPAPSISASNASSKPSSTSSVVATKPKKANPQGPKL 60
Query: 61 LSFASDEENDAPLR-TSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSN 120
LSFASDEENDAPLR +SSK ++SKKPSSARLAKPSSTHKITALKDRIAHSSS SASVPSN
Sbjct: 61 LSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRIAHSSSISASVPSN 120
Query: 121 VQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGK 180
VQPQAG YTKEALRELQKNTRTLASSR SSESKPSAEPVIVLKGLLKP EQ+ +SA+E K
Sbjct: 121 VQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPAEQVPESAREDK 180
Query: 181 ESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTG-SIPDQATINAIRAKRE 240
ESSSEDEE GSN KSA S RRSKED LARMASMGIGRGKDS+G SIPDQATINAIRAKRE
Sbjct: 181 ESSSEDEEAGSNAKSAASLRRSKEDTLARMASMGIGRGKDSSGSSIPDQATINAIRAKRE 240
Query: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQA 300
RMRQAGVAAPDYISLDAGSNRTAPGELSDEE EFPGRIAMIGGK SSKKGVFEE DEQ
Sbjct: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQG 300
Query: 301 IDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYP 360
IDGVRTNIIEHSDEDEEEKIWE EQFRKGLGKRMDDGSTRVES+SVP++ SV QQNLIYP
Sbjct: 301 IDGVRTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVQSVQQQNLIYP 360
Query: 361 TTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAAS 420
TT GY+SVPS STATSIGGSV VSQGLDGLSISQQAEIAKKAM+++MGRLKESYRRTA+S
Sbjct: 361 TTIGYSSVPSKSTATSIGGSVSVSQGLDGLSISQQAEIAKKAMQESMGRLKESYRRTASS 420
Query: 421 VLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
VLKTDENLSASLL IT LEK+LSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM
Sbjct: 421 VLKTDENLSASLLKITDLEKALSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
Query: 481 QKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIAS 540
QKLHEERASTVVERRVADNDDEMVEIE AVKAA SILNKKGSS+EM+ AATSAAQAAIAS
Sbjct: 481 QKLHEERASTVVERRVADNDDEMVEIETAVKAATSILNKKGSSHEMLVAATSAAQAAIAS 540
Query: 541 AKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGES 600
++EQANLPTK+DEFGRDLNLQKRMDMKRRAEARKRRR++YDSKRLAS EVDGHQKVEGES
Sbjct: 541 SREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDGHQKVEGES 600
Query: 601 STDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYM 660
STDESDS++AAYQSN DLLLQTA+QIFSDAAEEFSQLSVVKQRFE+WKRDYSATYRDAYM
Sbjct: 601 STDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEEWKRDYSATYRDAYM 660
Query: 661 SLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
SLS AIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP
Sbjct: 661 SLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
Query: 721 ELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLS 780
ELVEKVALPILHHE+AHCWDMLSTRETRNAAFATSLITNYVP SSEALTELLVVIRTRLS
Sbjct: 721 ELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLS 780
Query: 781 SAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEEL 840
A+EDLTVPTW++LV KAVPNAARIAAYRFG+SVRL+RNICLWKEIIALPILEKLALEEL
Sbjct: 781 GAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLLRNICLWKEIIALPILEKLALEEL 840
Query: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRT 900
LYGKVLPHVRSITANIHDAVTRTERIIASL+GVWTG G+ GDRSHKLQPLVDYV+LLGRT
Sbjct: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRT 900
Query: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL+EAL
Sbjct: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 947
BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match:
A0A1S3BG51 (PAX3- and PAX7-binding protein 1 OS=Cucumis melo OX=3656 GN=LOC103489249 PE=3 SV=1)
HSP 1 Score: 1592.0 bits (4121), Expect = 0.0e+00
Identity = 860/947 (90.81%), Positives = 900/947 (95.04%), Query Frame = 0
Query: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
MSGSRARNFRRRADDNDDDDEPNG+ APS SNASSK +STSS VA KPKKANPQ PKL
Sbjct: 1 MSGSRARNFRRRADDNDDDDEPNGSPAPSISASNASSKPSSTSSVVATKPKKANPQGPKL 60
Query: 61 LSFASDEENDAPLR-TSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSN 120
LSFASDEENDAPLR +SSK ++SKKPSSARLAKPSSTHKITALKDRIAHSSS SASVPSN
Sbjct: 61 LSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRIAHSSSISASVPSN 120
Query: 121 VQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGK 180
VQPQAG YTKEALRELQKNTRTLASSR SSESKPSAEPVIVLKGLLKP EQ+ +SA+E K
Sbjct: 121 VQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPAEQVPESAREDK 180
Query: 181 ESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTG-SIPDQATINAIRAKRE 240
ESSSEDEE GSN KSA S RRSKED LARMASMGIGRGKDS+G SIPDQATINAIRAKRE
Sbjct: 181 ESSSEDEEAGSNAKSAASLRRSKEDTLARMASMGIGRGKDSSGSSIPDQATINAIRAKRE 240
Query: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQA 300
RMRQAGVAAPDYISLDAGSNRTAPGELSDEE EFPGRIAMIGGK SSKKGVFEE DEQ
Sbjct: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQG 300
Query: 301 IDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYP 360
IDGVRTNIIEHSDEDEEEKIWE EQFRKGLGKRMDDGSTRVES+SVP++ SV QQNLIYP
Sbjct: 301 IDGVRTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVQSVQQQNLIYP 360
Query: 361 TTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAAS 420
TT GY+SVPS STATSIGGSV VSQGLDGLSISQQAEIAKKAM+++MGRLKESYRRTA+S
Sbjct: 361 TTIGYSSVPSKSTATSIGGSVSVSQGLDGLSISQQAEIAKKAMQESMGRLKESYRRTASS 420
Query: 421 VLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
VLKTDENLSASLL IT LEK+LSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM
Sbjct: 421 VLKTDENLSASLLKITDLEKALSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
Query: 481 QKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIAS 540
QKLHEERASTVVERRVADNDDEMVEIE AVKAA SILNKKGSS+EM+ AATSAAQAAIAS
Sbjct: 481 QKLHEERASTVVERRVADNDDEMVEIETAVKAATSILNKKGSSHEMLVAATSAAQAAIAS 540
Query: 541 AKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGES 600
++EQANLPTK+DEFGRDLNLQKRMDMKRRAEARKRRR++YDSKRLAS EVDGHQKVEGES
Sbjct: 541 SREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDGHQKVEGES 600
Query: 601 STDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYM 660
STDESDS++AAYQSN DLLLQTA+QIFSDAAEEFSQLSVVKQRFE+WKRDYSATYRDAYM
Sbjct: 601 STDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEEWKRDYSATYRDAYM 660
Query: 661 SLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
SLS AIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP
Sbjct: 661 SLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
Query: 721 ELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLS 780
ELVEKVALPILHHE+AHCWDMLSTRETRNAAFATSLITNYVP SSEALTELLVVIRTRLS
Sbjct: 721 ELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLS 780
Query: 781 SAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEEL 840
A+EDLTVPTW++LV KAVPNAARIAAYRFG+SVRL+RNICLWKEIIALPILEKLALEEL
Sbjct: 781 GAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLLRNICLWKEIIALPILEKLALEEL 840
Query: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRT 900
LYGKVLPHVRSITANIHDAVTRTERIIASL+GVWTG G+ GDRSHKLQPLVDYV+LLGRT
Sbjct: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRT 900
Query: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL+EAL
Sbjct: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 947
BLAST of CmoCh18G010640.1 vs. ExPASy TrEMBL
Match:
A0A0A0KWD3 (GCFC domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G608340 PE=3 SV=1)
HSP 1 Score: 1582.8 bits (4097), Expect = 0.0e+00
Identity = 854/947 (90.18%), Positives = 895/947 (94.51%), Query Frame = 0
Query: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
MSGSRARNFRRRADDNDDDDEP G+ APS SNASSK +STSS VA KPKKANPQ KL
Sbjct: 1 MSGSRARNFRRRADDNDDDDEPKGSTAPSISASNASSKPSSTSSVVATKPKKANPQGLKL 60
Query: 61 LSFASDEENDAPLR-TSSKPANSKKPSSARLAKPSSTHKITALKDRIAHSSSTSASVPSN 120
LSFASDEENDAPLR +SSK ++SKKPSSARLAKPSSTHKITALKDRIAHSSS SASVPSN
Sbjct: 61 LSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRIAHSSSISASVPSN 120
Query: 121 VQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLKPVEQISDSAKEGK 180
VQPQAG YTKEALRELQKNTRTLASSR SSESKPSAEPVIVLKGLLKP EQ+ DSA+E K
Sbjct: 121 VQPQAGVYTKEALRELQKNTRTLASSRPSSESKPSAEPVIVLKGLLKPAEQVPDSAREAK 180
Query: 181 ESSSEDEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTG-SIPDQATINAIRAKRE 240
ESSSED+E GSN KSA S RRSKED LARMASMGIGRGKDS+G SIPDQATINAIRAKRE
Sbjct: 181 ESSSEDDEAGSNAKSAASLRRSKEDTLARMASMGIGRGKDSSGSSIPDQATINAIRAKRE 240
Query: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEETEFPGRIAMIGGKSASSKKGVFEEFDEQA 300
RMRQAGVAAPDYISLDAGSNRTAPGELSDEE EFPGRIAMIGGK SSKKGVFEE DEQ
Sbjct: 241 RMRQAGVAAPDYISLDAGSNRTAPGELSDEEAEFPGRIAMIGGKLESSKKGVFEEVDEQG 300
Query: 301 IDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSSVPLIPSVPQQNLIYP 360
IDG RTNIIEHSDEDEEEKIWE EQFRKGLGKRMDDGSTRVES+SVP++PSV QNLIYP
Sbjct: 301 IDGARTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVPSVQPQNLIYP 360
Query: 361 TTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMRDNMGRLKESYRRTAAS 420
TT GY+SVPS+STATSIGGSV +SQGLDGLSISQQAEIAK AM+++MGRLKESYRRTA S
Sbjct: 361 TTIGYSSVPSMSTATSIGGSVSISQGLDGLSISQQAEIAKTAMQESMGRLKESYRRTAMS 420
Query: 421 VLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
VLKTDENLSASLL IT LEK+LSAAG+KF+FMQKLRDFVSVICDFLQHKAPFIEELEEQM
Sbjct: 421 VLKTDENLSASLLKITDLEKALSAAGDKFMFMQKLRDFVSVICDFLQHKAPFIEELEEQM 480
Query: 481 QKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSNEMIAAATSAAQAAIAS 540
QKLHEERASTVVERRVADNDDEMVEIE AVKAA+SILNKKGSSNEM+ AATSAAQAAIA
Sbjct: 481 QKLHEERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNEMVTAATSAAQAAIAL 540
Query: 541 AKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKRLASTEVDGHQKVEGES 600
++EQANLPTK+DEFGRDLNLQKRMDMKRRAEARKRRR++YDSKRLAS EVDGHQKVEGES
Sbjct: 541 SREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDGHQKVEGES 600
Query: 601 STDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQWKRDYSATYRDAYM 660
STDESDS++AAYQSN DLLLQTA+QIFSDAAEEFSQLSVVKQRFE WKRDYSATYRDAYM
Sbjct: 601 STDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEAWKRDYSATYRDAYM 660
Query: 661 SLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
SLS AIFSPYVRLELLKWDPLHE+ADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP
Sbjct: 661 SLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVP 720
Query: 721 ELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSEALTELLVVIRTRLS 780
ELVEKVALPILHHE+AHCWDMLSTRETRNAAFATSLITNYVP SSEALTELLVVIRTRLS
Sbjct: 721 ELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLS 780
Query: 781 SAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEIIALPILEKLALEEL 840
A+EDLTVPTW++LV KAVPNAARIAAYRFG+SVRLMRNICLWKEIIALPILEKLALEEL
Sbjct: 781 GAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNICLWKEIIALPILEKLALEEL 840
Query: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDRSHKLQPLVDYVMLLGRT 900
LYGKVLPHVRSITANIHDAVTRTERIIASL+GVWTG G+ GDRSHKLQPLVDYV+LLGRT
Sbjct: 841 LYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRT 900
Query: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLREAL 946
LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL+EAL
Sbjct: 901 LEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 947
BLAST of CmoCh18G010640.1 vs. TAIR 10
Match:
AT5G08550.1 (GC-rich sequence DNA-binding factor-like protein)
HSP 1 Score: 897.1 bits (2317), Expect = 1.2e-260
Identity = 540/964 (56.02%), Positives = 691/964 (71.68%), Query Frame = 0
Query: 1 MSGSRARNFRRRADDNDDDDEPNGAAAPSTGVSNASSKAASTSSTVANKPKKANPQVPKL 60
M +R +NFRRR DD D+ + A S S SS T S A+ PKK KL
Sbjct: 1 MGSNRPKNFRRRGDDGGDEIDGKVATPSSKPTSTLSSSKPKTLS--ASAPKK------KL 60
Query: 61 LSFASD--EENDAPLRTSSKPANSKK--PSSARLAKPSSTHKITALKDRIAHSSSTSASV 120
LSFA D EE D R + KP N + SS+RL S+H+ ++ K+R S
Sbjct: 61 LSFADDEEEEEDGAPRVTIKPKNGRDRVKSSSRLGVSGSSHRHSSTKERRPAS------- 120
Query: 121 PSNVQPQAGTYTKEALRELQKNTRTLASSRSSSESKPSAEPVIVLKGLLK-PVEQISDSA 180
SNV PQAG+Y+KEAL ELQKNTRTL SRSS+ +AEP +VLKGL+K P + S
Sbjct: 121 -SNVLPQAGSYSKEALLELQKNTRTLPYSRSSA----NAEPKVVLKGLIKPPQDHEQQSL 180
Query: 181 KEGKESSSE---DEEGGSNEKSAGSFRRSKEDALARMASMGIGRGKDSTGSIPDQATINA 240
K+ + S+ DEEG + EDA A DQA I
Sbjct: 181 KDVVKQVSDLDFDEEGEEEQ---------HEDAFA------------------DQAAI-- 240
Query: 241 IRAKRERMRQAGVA-APDYISLDAG-SNRTAPGELSDEETEFPGRIAMIGGK-SASSKKG 300
IRAK+ERMRQ+ A APDYISLD G N +A +SDE+ +F G +G + KKG
Sbjct: 241 IRAKKERMRQSRSAPAPDYISLDGGIVNHSAVEGVSDEDADFQG--IFVGPRPQKDDKKG 300
Query: 301 VFEEFDEQAIDGVRTNIIEHSDEDEEEKIWEAEQFRKGLGKRMDDGSTRVESSS---VPL 360
VF+ DE T + DEDEE+K+WE EQF+KG+GKRMD+GS R +S+ VPL
Sbjct: 301 VFDFGDENPTAKETTTSSIYEDEDEEDKLWEEEQFKKGIGKRMDEGSHRTVTSNGIGVPL 360
Query: 361 ---IPSVPQQN-LIYPTTAGYNSVPSISTATSIGGSVGVSQGLDGLSISQQAEIAKKAMR 420
++PQQ +Y AG +P++S A +IG + V D L +SQQAE+AKKA++
Sbjct: 361 HSKQQTLPQQQPQMYAYHAG-TPMPNVSVAPTIGPATSV----DTLPMSQQAELAKKALK 420
Query: 421 DNMGRLKESYRRTAASVLKTDENLSASLLNITALEKSLSAAGEKFIFMQKLRDFVSVICD 480
DN+ +LKES+ +T +S+ KTDENL+ASL++ITALE SLSAAG+K++FMQKLRDF+SVICD
Sbjct: 421 DNVKKLKESHAKTLSSLTKTDENLTASLMSITALESSLSAAGDKYVFMQKLRDFISVICD 480
Query: 481 FLQHKAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIEAAVKAAMSILNKKGSSN 540
F+Q+K IEE+E+QM++L+E+ A +++ERR+ADN+DEM+E+ AAVKAAM++LNK GSS+
Sbjct: 481 FMQNKGSLIEEIEDQMKELNEKHALSILERRIADNNDEMIELGAAVKAAMTVLNKHGSSS 540
Query: 541 EMIAAATSAAQAAIASAKEQANLPTKVDEFGRDLNLQKRMDMKRRAEARKRRRAKYDSKR 600
+IAAAT AA AA S ++Q N P K+DEFGRD NLQKR ++++RA AR++RRA++++KR
Sbjct: 541 SVIAAATGAALAASTSIRQQMNQPVKLDEFGRDENLQKRREVEQRAAARQKRRARFENKR 600
Query: 601 LASTEVDGHQ-KVEGESSTDESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQR 660
++ EVDG K+EGESSTDESD+E +AY+ D LLQ AD++FSDA+EE+SQLS VK R
Sbjct: 601 ASAMEVDGPSLKIEGESSTDESDTETSAYKETRDSLLQCADKVFSDASEEYSQLSKVKAR 660
Query: 661 FEQWKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMP 720
FE+WKRDYS+TYRDAYMSL+ +IFSPYVRLELLKWDPLH++ DFFDM WH LLF+YG P
Sbjct: 661 FERWKRDYSSTYRDAYMSLTVPSIFSPYVRLELLKWDPLHQDVDFFDMKWHGLLFDYGKP 720
Query: 721 EDGSDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPT 780
EDG DFAP+D DANLVPELVEKVA+PILHH++ CWD+LSTRETRNA ATSL+TNYV
Sbjct: 721 EDGDDFAPDDTDANLVPELVEKVAIPILHHQIVRCWDILSTRETRNAVAATSLVTNYVSA 780
Query: 781 SSEALTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLW 840
SSEAL EL IR RL A+ ++VPTW LV+KAVPN ++AAYRFG SVRLMRNIC+W
Sbjct: 781 SSEALAELFAAIRARLVEAIAAISVPTWDPLVLKAVPNTPQVAAYRFGTSVRLMRNICMW 840
Query: 841 KEIIALPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLSGVWTGPGVTGDR 900
K+I+ALP+LE LAL +LL+GKVLPHVRSI +NIHDAVTRTERI+ASLSGVWTGP VT
Sbjct: 841 KDILALPVLENLALSDLLFGKVLPHVRSIASNIHDAVTRTERIVASLSGVWTGPSVTRTH 900
Query: 901 SHKLQPLVDYVMLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHL 946
S LQPLVD + L R LEK+ SG+ ++ET+GLARRLK++LVEL+E+D+AR+I +TF+L
Sbjct: 901 SRPLQPLVDCTLTLRRILEKRLGSGLDDAETTGLARRLKRILVELHEHDHAREIVRTFNL 908
BLAST of CmoCh18G010640.1 vs. TAIR 10
Match:
AT5G09210.1 (GC-rich sequence DNA-binding factor-like protein)
HSP 1 Score: 352.1 bits (902), Expect = 1.4e-96
Identity = 189/320 (59.06%), Positives = 234/320 (73.12%), Query Frame = 0
Query: 587 EVDGHQK-VEGESST-DESDSEAAAYQSNHDLLLQTADQIFSDAAEEFSQLSVVKQRFEQ 646
+VDG+ VEG+SST DESD E +AY+ D LLQ AD+IFSDA+ +S+LS VK F++
Sbjct: 253 KVDGYSLIVEGDSSTDDESDCETSAYEEARDSLLQRADKIFSDASVVYSELSRVKSIFKR 312
Query: 647 WKRDYSATYRDAYMSLSTAAIFSPYVRLELLKWDPLHENADFFDMNWHSLLFNYGMPEDG 706
R S +R AY SL+ +++SPY+RLELL+WDPLH++ DF DMNWH LLF+ +
Sbjct: 313 GARHPSPAFRAAYTSLTVPSMYSPYLRLELLRWDPLHQDVDFSDMNWHGLLFHSRIVCGS 372
Query: 707 SDFAPNDADANLVPELVEKVALPILHHEVAHCWDMLSTRETRNAAFATSLITNYVPTSSE 766
+ N N V ELV+ VA+PILHH + CWD+LSTRETRN ATSL+ YV SSE
Sbjct: 373 TPVCTN---PNFVSELVKYVAVPILHHRIVRCWDILSTRETRNVVAATSLVARYVFPSSE 432
Query: 767 ALTELLVVIRTRLSSAVEDLTVPTWSALVMKAVPNAARIAAYRFGISVRLMRNICLWKEI 826
AL EL + I RL A+ ++VPTW V K VPNA ++AAYRFG SVRLMRNIC+WK++
Sbjct: 433 ALAELSLAIHARLVEAIIAISVPTWDPQVSKDVPNAPQVAAYRFGTSVRLMRNICMWKDV 492
Query: 827 IALPILEKLALEELLYGKVLPHVRSIT--ANIHDAVTRTERIIASLSGVWTGPGVTGDRS 886
+ LP+LEKLAL +LL+GKVLPHVRSI +NIHDAVT+TERI+ASLSGVWTGP VT S
Sbjct: 493 MELPVLEKLALSDLLFGKVLPHVRSIASESNIHDAVTKTERIVASLSGVWTGPSVTRTHS 552
Query: 887 HKLQPLVDYVMLLGRTLEKK 903
H LQPLVD + LGR LEKK
Sbjct: 553 HLLQPLVDCTLTLGRILEKK 569
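The percentages on each "Identity = …" line above are the match counts divided by the aligned length. As a minimal sketch (the function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool), such a line could be parsed and the percentages recomputed like this:

```python
import re

# Matches the HSP statistics line as printed in this report.
# "Posi?tives" accepts both "Positives" and the misspelt "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return match counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    identities, length, positives, pos_length = (int(g) for g in m.groups())
    return {
        "identities": identities,
        "positives": positives,
        "aligned_length": length,
        # Percentages are recomputed from the counts rather than read from the text.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    example = "Identity = 935/945 (98.94%), Postives = 940/945 (99.47%), Query Frame = 0"
    print(parse_hsp_stats(example))
```

Recomputing the percentages from the counts gives a quick consistency check against the values printed in the report.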
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9FNN3 | 1.7e-259 | 56.02 | Transcriptional repressor ILP1 OS=Arabidopsis thaliana OX=3702 GN=ILP1 PE=1 SV=1
Q9Y5B6 | 3.5e-39 | 24.54 | PAX3- and PAX7-binding protein 1 OS=Homo sapiens OX=9606 GN=PAXBP1 PE=1 SV=2
P58501 | 2.3e-38 | 24.71 | PAX3- and PAX7-binding protein 1 OS=Mus musculus OX=10090 GN=Paxbp1 PE=1 SV=3
P16383 | 1.9e-21 | 23.21 | Intron Large complex component GCFC2 OS=Homo sapiens OX=9606 GN=GCFC2 PE=1 SV=2
Q8BKT3 | 7.6e-18 | 24.81 | Intron Large complex component GCFC2 OS=Mus musculus OX=10090 GN=Gcfc2 PE=1 SV=2
Match Name | E-value | Identity (%) | Description
A0A6J1G0Q4 | 0.0e+00 | 100.00 | transcriptional repressor ILP1 OS=Cucurbita moschata OX=3662 GN=LOC111449643 PE=...
A0A6J1HXZ2 | 0.0e+00 | 98.94 | transcriptional repressor ILP1 OS=Cucurbita maxima OX=3661 GN=LOC111467670 PE=3 ...
A0A5D3CCM3 | 0.0e+00 | 90.81 | PAX3-and PAX7-binding protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3BG51 | 0.0e+00 | 90.81 | PAX3- and PAX7-binding protein 1 OS=Cucumis melo OX=3656 GN=LOC103489249 PE=3 SV...
A0A0A0KWD3 | 0.0e+00 | 90.18 | GCFC domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G608340 PE=3 S...
Match Name | E-value | Identity (%) | Description
AT5G08550.1 | 1.2e-260 | 56.02 | GC-rich sequence DNA-binding factor-like protein
AT5G09210.1 | 1.4e-96 | 59.06 | GC-rich sequence DNA-binding factor-like protein