Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
AATTCATTCCAAATTGCAAGACTTCAATAACTCCCTCTGGATTTCCTCATCTGTTCATCATTTCTACAGAATTTTCTCTCCCTCAAGTGGTTCATATATCAACGACAAGTTCAAAGAACTGTGTTTAAGTGCAAAATTTGTTTAAGCACTTGAAGAATCACTCTTAAATACGTTCAAACTCAAACTAGTTCATCTTAGCCTGCAAATTTTCTGCAAATCGTGAAGAAATTTGGCGAGAAAACAACCTCGAAACAAGAACAACATCAGAAAAGAAGAACACATATTAAAAACGAAGAGATCTAGATCTTTTCGTTTCGTTGGTTTTTTTAGGGTAATGGATATTCATCCATGGATTGAGTATCTTGAAGCAAGAAAACGGTACCGCGTTCATAAATTCCAGCCCTCGTGGGTTCGACCGTACCATTGGTTCGCTCAAAATTCTTTTGGAAAAGTAAAGAAACCAGTTACTTGTTGTAATCAAGGTAAGGTACTTTTGAAATACACTACGGTCTTTTCCTTTATTTAAATTTATTTTTTGTTTATAGGGTTTTGGAATTAGCTCTCTTACTTTATCAATCAGTTCATCAAAGGGCCTTAAATTAATCCCTCAAAATTTCGCGTTTATTTGGTGTGTTTGACCTCAACTTGGAAATGCCGGAATTGGATGTATTTATCATTTTTTTTTTTTTTTTTTACAATAATTCTGAGACTGTGAGCGTTTTGAATTTGTGTTTTCCGTGAAATATAATAGAGAGAGAGAAAGAGAGAGAAATATGCTTTTTAGTTTCTAATTATTTTTGGATTGTAAAATTTTAAAGATATGAATTTAAGGGCGTAAAGGATTCTCCATCTTGGTGGACTCGAATGTTGATCTTTAAAGCTACGGACTACTTAGGGTTTTCGATTTTCCTGATTTTTTCCATTCCTTGGGCGATCTTCCTTCCTTCACTCGGTGAGTTCCTCTAATTTCATCTTCTGCTTTGCTTACTCTCTTCCTTAATTTCATGTTCTCTTCTCAATTTGAACTCTATGAAGGTTGAATTCCGAGGCCTCTTGTCGCATTCGGAAGAAAACTGGTTTTTCTGTAATCTGTTCTTCTTTTCTTTCACAACGACTAAGTTTTCTCTGGTTTTCAATTCAATTCTGAGCAGATTCTGTGTTCATATTTGTGAAGTTCTTAGAGCTAATCCAAAGCTAGCAGTTCTTCATATCTATCGAAGTTCATTTACTTTCAGTTAACTCTTTGGGAGTTTAAATTATGAGTTGATGCTGGAGGCAAGGATTTTAAAGTCTCCTGTGAAAAGCTGGAGAGAGTACATTACTAGAGAGGGTTCATTTCTCTTATATAACTCCAATGGTGTTTCATTCTCAAGTTCCTTCAAGATATACTTGTAGGCTTCTTGCAATCCCCTGTCGGAGTGTTCCTGAAGATAAATTTAAGAAGGATAATCCAGATGATGAGCAGAGATATCCCTTTCCACAATTAAATTCTTCTGGGCGTTTGGAGGTTCGAATTTTGTTGATGAGTGTTAGCTTGTATGTTCCTGCAACTACCTTGTTTTAATTATCTTTGTCACATATATTTGCAGGTCCAGGTTTTGCCCAATCCTAGTAAAGATGAGTTTTGTAGGATTATTGAATCGTATAGACCGAGCATTGTTTACTTACAAGGGGAACAACTTGAGAATGATGAAGTTGGTTCTTTAGTATGGGAAGGTGTTGATTTGTTCACCGTAGAAGCGATTAGTGGGCTCTTTAGTTCTCCGTTGCCCACCACTGTATGTTCTACATCCTTTTTGAATTTTCAGTTTCGTATTTTCTTGTATCATTTTGTAATCAGAGCCCCTGCAAAAGGAAAAAGTAAACAGAAACAAATAAGAACTTCTATATTGTATGATTACAGTGAACTAATGTTGTTTTAATTGGACACCGAGAAAACGTTAGCCAGTCTAGTTATCCTATCAGCTTACATATATTTTGGATGTTGAATTCAGTGTGGC
GTTTGAAAATTCCACAAAGCTTTCACTAAGCTAGTTAATAAATTTTATGGTTTGTTGAAACAAGCCTGCTTCTGATAACCATCGCAACCTCTTATGTCCGTTACGTTTCACTCCAATTTTTACTAACGACCTTGGTGTGGATCTTCAGGTATATTTGGAAGTAGCCAATGGAGATGAAGTAGCTGATGCACTTCATTCCAAGGTGAATTTCATTTAGTTCCATATTTACGTTCTGTGTTTACTGCCATTTCTGATGTGCTTGTATGAAGTGTGAGCTATTTTCATAAACCAAGAGAATGGAACAAGATATTTTGGTTAACTGGTTGCTTACTTGTTATCAGGGTATTCCTTATGTCATATACTGGAGAAGCACATTTTCTTGTTATGCAGCATGCCATTTCCGTAATGCATTTCTTTCGGTTCTTCAGAGGTATTCCATTAGTATGACAATGCAGATTAATCGGTGCTATGTATAGATGGAAAGTAAAGGATAATTGAAGGATAACTATTTTTCAATGTGAATTGCAGTTCATCTGCTCATACATGGGACGCATTTCAACTTGCACATGCTTCATTTAGGATGTATTGCTTGGGAAACAGCCTTGTTCTTCCTAACAGTAGTCATAATGACGTTAGTGAAGATCTAGGACCACATCTTCTAGGAGAACGTTTGAGAATAAACATTGAACCCCTTGAGAAAGAGGCAGCTGATGATGAAGAAAGTTCATCGGAAGTTCCTTCTGTTAGCATACTTGATCATGATGTTGAAATGAGATTTCTCATATGTGGTGAACCAAGGTCATTGGTAAGTAGGAATGGACACGAATGTGCAAAATTCGTACCCTTCCAGTATCCTATATGGCATGACTTGTAGTTTTGTTTTTCTCTTTCTAGGATGCTTACGTATTAGGAGCTCTGGAGGATGGTCTTAACGCTCTCTTGGACATTGAAGTAAGTAGGGAAAATGCTTTATTGCTAGAACCTCTGTTTCTAATAGACAATCTAGCTTCTTGATTTGGTTATGTCCTGTTAATAGGGGAAAGTGTTTTCTTCTTACTTTTTTTTCTAATACCTATTTTTGTGCTTTAGATTCGGGGGAGCAAACTTCACGGCAAGTTCAGGTGTGTTGGTTTTTGAAGTCACTTCCTGACTTATTTCGAGGTTTCTGGATTGCATCACCTGCCTAATGCATTTGAGGAGTTCTATTTTATTATTTTTTCTATGTCGCTGGTGTTTTTATGGATTTAGTCACTGCAAAAATTTTCTGACCTCTAAATGGAGTTGTAGACTCCACATTTTATAATGGCTGTCAATTGTATTAATTCGTCTGATGTGCACAGTGCCCCTCCTCCACCTCTCCAGGCAGGAATGCACTCTAATGGTGTTGTGACCATGCGCTGTGACATATCAACATGCAGTTTTGCCCACATCTCACTATTGGTGTCGGGTAGTGCACAAGCTTGTTTTGATGATCAGGTAATGTAGGAGCTCTCTGGTTAGGTTTTCATCTTATTTTGCAGCTGTCAGATCTTATACAACTTCTTACTTGTTTGTATTCAGCTATTTGAAAATTATATAAAGAATGAGATTATAGACAGAAGCGAACCAGTACAGACACTAATAGATGGTGATGGAAGCAAACACTCGCGTGAGCCACGGAAATCTGCTTCAGTTGCTTGTGGGGCAACAGTTTTCGAAGTTAGCATGAAGGTCCCTTCGTGGGCATCTCAGGTTGTCCGATCACTTCTCTTCATCTTATTTTATGAGATTGTATCCATTGGAATCTGATTCTATATTCTGGTTTAGCATTCGTATGTTCTTGAATCAGAATGGCTGTCTTCTATCTAGCCTGAAATGCTGTGCACTGATTTCGGTGAATTTGTTTCTCCTGGCCTGGTTCTTGATTACGGAATCTAAAGAATATTAATGCTTGCTGATATTTATATATTTCCAGTGTTGGGGAATCAGAACTAAAAATCTATGAAGCAGTATCTCGTATT
CGTCGAGTGATGATTGATGATTACTAGCTCTTATAATGAGTATGTGCTTGTCCGTGGCTGTTTTGCAGATTTTGAGGCAGTTAGCCCCTGATGTTTCTTATCGAAGTTTAGTTGTACTTGGCATTGCCGGCATTCAGGGTTTGTCTGTAGCTTCTTTTGAGAAGGACGATGCTGAACGCCTCATTTTCTTTGGTTCAAGGAAGCAAAGAGATTTATTTCTAAACAATTTAACCGATAGCACACCTCCAAGCTGGTTGAAACCACCTGCACCTAGAAAGAGATTGAGAAACGTGAAAGACATAAGCCTTGGTTCTCATGATGTTATTCAACATCTGAAGGTTTTGTCTGGTAACAGAATAGACGGCGAGAACATGGAAATAGGATCGAGGAATGGTTTCAGCACTCCCATGTTTCCACTCCCGAGAAGGAGAGGAATGAAAATTGCCGCAATGAGGCCCATTCCTCACGTTAATCGCCATAAAATGATATCTTTTCGTGGAATAGCTGAGACAGGTGGGCACAATGGAGGCCTGTTTAAAGCTAGTGTTTCTTCTAGTAATTCAGGGAAGCATGTTACTGTAGGTTCAGCTTCAGTTTTGCAACAAAAAATGTTTCCAAGTGCATCTCAATATAAACAAATTATTCCCATGAATCCACTACCTTTGAAGAAGCATGGTTGTGGCAGAAGCCCTATACAGGCTTGCTTTGAGGTAGTCGCATCTCTTTGGAACGCTCCTAGAAAATATGTGCTTAAAGAATATAATGTTCAGGAAGTCAACTTTAGAGTCGTTGCTAATCCTCAGGTGTTCGACCTGAATATTTAGATCGATTTCGACTTAATTTTTCTTGAGCTAATCTTTTTTCTTGTTCTCATGTAGGAGGAATTTTTGAAGGATTTGCTGCAGTTTCTTGCTCTTCGAGGTCATAATCGACTTATTCCTCCTGGTGGGCTTGCTGAGTTTCCAGATGCAATACTCAATGGAAAGCGTCTCGACCTTTACAACCTGTACAAGGAGGTAAGCTCCTTTCTCCATTAGAATCATGGATATTGAGTGTACTGTGTTCTGATACTGATTGGACTATTCCCGAACGGATGCTGCTGATCAGGTGGTTTCAAGAGGTGGCTTTCGTGTTGGAAATGGTATTAACTGGAAAGGACAGATCTTCTCAAAGATGCGCAATTACACCATGACCAATAGAATGACTGTATGGCTCTCATGTCTAATAACATAATTCTTATCCTGATTATCATTTGGTTTGTCGTAACTGTTTATTCTTGAATATCTAAAATGATTTATATGCATAACAAGTAATGGGAATGTTGTTCTTGATTTTCATAAGAAGATTTAGAGTAGATTTCTGTGGTGATAGACAATAAGAACTAATTCTCTGGCAGGGTGTTGGAAACACGCTTAAAAGACATTATGAAACTTATCTTCTCGAGTATGAATTGGCGCATGAAGATGTAGATGGTGAATGTTGCCTGTTGTGTCACAGGTTTGTACACTTCCATTCCACTCTCTTCTGAAATATTTCCTTCAATGCACTCTTAGGAGGCAACATTCTTGTTTGATCCCACGTCGATTGGAGAGGGGGAACGAAGCATTCCTTGTAAGTGTGTAGAAACCTCTCCCTAACCTGATATGTTTTAAAACCTTGAGAGGAAGCGCAGAAGGGAAAGCCCAAAGTGGACAATATTTGTTAGCGGTGGGCTTGGGCTTGGGCTGTTACAAATGGTATCAGAGCTGGTTACTGGGCGATGTGCCAGTGAGGATGCTATAGTTAGGTTGTAGCACTGGGCCTCCAAGGGGGTGGATTGTGAGATCCCACGTCAGTTGGAGAGGGGAACGAAGCATTCCTTATAAGGGTGTGGAATCCTCTCCCTAACGGACGTGTTTTAAAACCTTGAGGAGAAGCCTAGAAGGGAAAGCTCAAAGATGGACAATATCTGTTAGCGGTGGGCTTGGGCTGTTACAAATGGTATCAGAGTCGGTTACCGG
GTGACGTGTTAGCAAAGACGTTGGGCTCTCAAGGGGGGTAGGTTGTGAGATCTCACATCGGTTGGAGAGCGGAATGAAGCATTCCATATAAGGGTGTGGAAACCTCTTCCTAACGGACGCGTTTTAAAACCTTGAGGGGAAGTGTTTGGAAAGCCCAAAGAGGACAATATATGTTGGCGATCAGCTCTGCATGTTACAATTCTGTACTGGCTCATAAGCCATCTTTCTCTTTCCTTAATGTTAAACTAGATAAACTTCATATGTTCCCTTACACTGAAGTTTTGGATTGCTTTAACAGTAGTGCAGCAGGGGACTGGGTGAACTGTGGCATTTGTGGCGAGTGGGCTCACTTTGGTTGTGACCGAAGGCAGGGTCTCGGCGCATTTAAGGTATCTTCCCCATCCTACGACATCATCCCACTGATACACAAACTGTGTGAAAAGATTAGAACAAGGAAAAATTCCATTGCACATACAAACCTGTATCTCATAATGATGCATTGTGCATGATTTTCGCAGGATTATGCAAAAACTGATGGATTAGAATACATATGTCCACACTGTAGTGTTGCTAATTACAAGAAGAAGAAAGTTGGTAATGGGTTGTCTCCAGGCTTATCCTCAAGACCAATATGAAGATGATTTGATTTTGAGGTCAAATTTATCTCTGCAAACCTATATAAGAGAACCGAACCGTTGCGGCCGTACATATAAAGCTCAACGCCTTAACTACATGTATCACTGTGCTATTAACCTAAATCTTCAAGTATATATATATATATATATATAACGTATCCAGGAAACCAGAGAAAAGTTGAGTAGAAAACTATTGATTTTTTCTGTATTGTAGGGTGGGGTTGAGTAATACTTGGATAATGCCTTAGGAAATAGAAGGGGAATGATTACTACAGTTCTCCATCTTAGTTGAGAGCTAACTTAGATCTTATGTAGAAAGAGGATGAATTGGATGATCCACTTGAAGTTACATGTAAGCTACCGATTTTTCCCCTTTTGACGATCAACGTGTATGCTTACAAGGTGAAGGTTTTATGATACATCAAATGGATTGAATGACAGCACTTCTTATTTAGTTT
mRNA sequence
AATTCATTCCAAATTGCAAGACTTCAATAACTCCCTCTGGATTTCCTCATCTGTTCATCATTTCTACAGAATTTTCTCTCCCTCAAGTGGTTCATATATCAACGACAAGTTCAAAGAACTGTGTTTAAGTGCAAAATTTGTTTAAGCACTTGAAGAATCACTCTTAAATACGTTCAAACTCAAACTAGTTCATCTTAGCCTGCAAATTTTCTGCAAATCGTGAAGAAATTTGGCGAGAAAACAACCTCGAAACAAGAACAACATCAGAAAAGAAGAACACATATTAAAAACGAAGAGATCTAGATCTTTTCGTTTCGTTGGTTTTTTTAGGGTAATGGATATTCATCCATGGATTGAGTATCTTGAAGCAAGAAAACGGTACCGCGTTCATAAATTCCAGCCCTCGTGGGTTCGACCGTACCATTGGTTCGCTCAAAATTCTTTTGGAAAAGTAAAGAAACCAGTTACTTGTTGTAATCAAGATATGAATTTAAGGGCGTAAAGGATTCTCCATCTTGGTGGACTCGAATGTTGATCTTTAAAGCTACGGACTACTTAGGGTTTTCGATTTTCCTGATTTTTTCCATTCCTTGGGCGATCTTCCTTCCTTCACTCGATTCTGTGTTCATATTTGTGAAGTTCTTAGAGCTAATCCAAAGCTAGCAGTTCTTCATATCTATCGAAGTTCATTTACTTTCAGTTAACTCTTTGGGAGTTTAAATTATGAGTTGATGCTGGAGGCAAGGATTTTAAAGTCTCCTGTGAAAAGCTGGAGAGAGTACATTACTAGAGAGGGTTCATTTCTCTTATATAACTCCAATGGTGTTTCATTCTCAAGTTCCTTCAAGATATACTTGTAGGCTTCTTGCAATCCCCTGTCGGAGTGTTCCTGAAGATAAATTTAAGAAGGATAATCCAGATGATGAGCAGAGATATCCCTTTCCACAATTAAATTCTTCTGGGCGTTTGGAGGTCCAGGTTTTGCCCAATCCTAGTAAAGATGAGTTTTGTAGGATTATTGAATCGTATAGACCGAGCATTGTTTACTTACAAGGGGAACAACTTGAGAATGATGAAGTTGGTTCTTTAGTATGGGAAGGTGTTGATTTGTTCACCGTAGAAGCGATTAGTGGGCTCTTTAGTTCTCCGTTGCCCACCACTGTATATTTGGAAGTAGCCAATGGAGATGAAGTAGCTGATGCACTTCATTCCAAGGGTATTCCTTATGTCATATACTGGAGAAGCACATTTTCTTGTTATGCAGCATGCCATTTCCGTAATGCATTTCTTTCGGTTCTTCAGAGTTCATCTGCTCATACATGGGACGCATTTCAACTTGCACATGCTTCATTTAGGATGTATTGCTTGGGAAACAGCCTTGTTCTTCCTAACAGTAGTCATAATGACGTTAGTGAAGATCTAGGACCACATCTTCTAGGAGAACGTTTGAGAATAAACATTGAACCCCTTGAGAAAGAGGCAGCTGATGATGAAGAAAGTTCATCGGAAGTTCCTTCTGTTAGCATACTTGATCATGATGTTGAAATGAGATTTCTCATATGTGGTGAACCAAGGTCATTGGATGCTTACGTATTAGGAGCTCTGGAGGATGGTCTTAACGCTCTCTTGGACATTGAAATTCGGGGGAGCAAACTTCACGGCAAGTTCAGTGCCCCTCCTCCACCTCTCCAGGCAGGAATGCACTCTAATGGTGTTGTGACCATGCGCTGTGACATATCAACATGCAGTTTTGCCCACATCTCACTATTGGTGTCGGGTAGTGCACAAGCTTGTTTTGATGATCAGCTATTTGAAAATTATATAAAGAATGAGATTATAGACAGAAGCGAACCAGTACAGACACTAATAGATGGTGATGGAAGCAAACACTCGCGTGAGCCACGGAAATCTGCTTCAGTTGCTTGTGGGGCAACAGTTTTCGAAGTTAGCATGAAGGTCCCTTCGTGGGCATCTCAGATTTTGAGGCAGTTAGCCCCTGA
TGTTTCTTATCGAAGTTTAGTTGTACTTGGCATTGCCGGCATTCAGGGTTTGTCTGTAGCTTCTTTTGAGAAGGACGATGCTGAACGCCTCATTTTCTTTGGTTCAAGGAAGCAAAGAGATTTATTTCTAAACAATTTAACCGATAGCACACCTCCAAGCTGGTTGAAACCACCTGCACCTAGAAAGAGATTGAGAAACGTGAAAGACATAAGCCTTGGTTCTCATGATGTTATTCAACATCTGAAGGTTTTGTCTGGTAACAGAATAGACGGCGAGAACATGGAAATAGGATCGAGGAATGGTTTCAGCACTCCCATGTTTCCACTCCCGAGAAGGAGAGGAATGAAAATTGCCGCAATGAGGCCCATTCCTCACGTTAATCGCCATAAAATGATATCTTTTCGTGGAATAGCTGAGACAGGTGGGCACAATGGAGGCCTGTTTAAAGCTAGTGTTTCTTCTAGTAATTCAGGGAAGCATGTTACTGTAGGTTCAGCTTCAGTTTTGCAACAAAAAATGTTTCCAAGTGCATCTCAATATAAACAAATTATTCCCATGAATCCACTACCTTTGAAGAAGCATGGTTGTGGCAGAAGCCCTATACAGGCTTGCTTTGAGGAGGAATTTTTGAAGGATTTGCTGCAGTTTCTTGCTCTTCGAGGTCATAATCGACTTATTCCTCCTGGTGGGCTTGCTGAGTTTCCAGATGCAATACTCAATGGAAAGCGTCTCGACCTTTACAACCTGTACAAGGAGGTGGTTTCAAGAGGTGGCTTTCGTGTTGGAAATGGTATTAACTGGAAAGGACAGATCTTCTCAAAGATGCGCAATTACACCATGACCAATAGAATGACTGGTGTTGGAAACACGCTTAAAAGACATTATGAAACTTATCTTCTCGAGTATGAATTGGCGCATGAAGATGTAGATGGTGAATGTTGCCTGTTGTGTCACAGTAGTGCAGCAGGGGACTGGGTGAACTGTGGCATTTGTGGCGAGTGGGCTCACTTTGGTTGTGACCGAAGGCAGGGTCTCGGCGCATTTAAGGATTATGCAAAAACTGATGGATTAGAATACATATGTCCACACTGTAGTGTTGCTAATTACAAGAAGAAGAAAGTTGGTAATGGGTTGTCTCCAGGCTTATCCTCAAGACCAATATGAAGATGATTTGATTTTGAGGTCAAATTTATCTCTGCAAACCTATATAAGAGAACCGAACCGTTGCGGCCGTACATATAAAGCTCAACGCCTTAACTACATGTATCACTGTGCTATTAACCTAAATCTTCAAGTATATATATATATATATATATAACGTATCCAGGAAACCAGAGAAAAGTTGAGTAGAAAACTATTGATTTTTTCTGTATTGTAGGGTGGGGTTGAGTAATACTTGGATAATGCCTTAGGAAATAGAAGGGGAATGATTACTACAGTTCTCCATCTTAGTTGAGAGCTAACTTAGATCTTATGTAGAAAGAGGATGAATTGGATGATCCACTTGAAGTTACATGTAAGCTACCGATTTTTCCCCTTTTGACGATCAACGTGTATGCTTACAAGGTGAAGGTTTTATGATACATCAAATGGATTGAATGACAGCACTTCTTATTTAGTTT
Coding sequence (CDS)
ATGGTGTTTCATTCTCAAGTTCCTTCAAGATATACTTGTAGGCTTCTTGCAATCCCCTGTCGGAGTGTTCCTGAAGATAAATTTAAGAAGGATAATCCAGATGATGAGCAGAGATATCCCTTTCCACAATTAAATTCTTCTGGGCGTTTGGAGGTCCAGGTTTTGCCCAATCCTAGTAAAGATGAGTTTTGTAGGATTATTGAATCGTATAGACCGAGCATTGTTTACTTACAAGGGGAACAACTTGAGAATGATGAAGTTGGTTCTTTAGTATGGGAAGGTGTTGATTTGTTCACCGTAGAAGCGATTAGTGGGCTCTTTAGTTCTCCGTTGCCCACCACTGTATATTTGGAAGTAGCCAATGGAGATGAAGTAGCTGATGCACTTCATTCCAAGGGTATTCCTTATGTCATATACTGGAGAAGCACATTTTCTTGTTATGCAGCATGCCATTTCCGTAATGCATTTCTTTCGGTTCTTCAGAGTTCATCTGCTCATACATGGGACGCATTTCAACTTGCACATGCTTCATTTAGGATGTATTGCTTGGGAAACAGCCTTGTTCTTCCTAACAGTAGTCATAATGACGTTAGTGAAGATCTAGGACCACATCTTCTAGGAGAACGTTTGAGAATAAACATTGAACCCCTTGAGAAAGAGGCAGCTGATGATGAAGAAAGTTCATCGGAAGTTCCTTCTGTTAGCATACTTGATCATGATGTTGAAATGAGATTTCTCATATGTGGTGAACCAAGGTCATTGGATGCTTACGTATTAGGAGCTCTGGAGGATGGTCTTAACGCTCTCTTGGACATTGAAATTCGGGGGAGCAAACTTCACGGCAAGTTCAGTGCCCCTCCTCCACCTCTCCAGGCAGGAATGCACTCTAATGGTGTTGTGACCATGCGCTGTGACATATCAACATGCAGTTTTGCCCACATCTCACTATTGGTGTCGGGTAGTGCACAAGCTTGTTTTGATGATCAGCTATTTGAAAATTATATAAAGAATGAGATTATAGACAGAAGCGAACCAGTACAGACACTAATAGATGGTGATGGAAGCAAACACTCGCGTGAGCCACGGAAATCTGCTTCAGTTGCTTGTGGGGCAACAGTTTTCGAAGTTAGCATGAAGGTCCCTTCGTGGGCATCTCAGATTTTGAGGCAGTTAGCCCCTGATGTTTCTTATCGAAGTTTAGTTGTACTTGGCATTGCCGGCATTCAGGGTTTGTCTGTAGCTTCTTTTGAGAAGGACGATGCTGAACGCCTCATTTTCTTTGGTTCAAGGAAGCAAAGAGATTTATTTCTAAACAATTTAACCGATAGCACACCTCCAAGCTGGTTGAAACCACCTGCACCTAGAAAGAGATTGAGAAACGTGAAAGACATAAGCCTTGGTTCTCATGATGTTATTCAACATCTGAAGGTTTTGTCTGGTAACAGAATAGACGGCGAGAACATGGAAATAGGATCGAGGAATGGTTTCAGCACTCCCATGTTTCCACTCCCGAGAAGGAGAGGAATGAAAATTGCCGCAATGAGGCCCATTCCTCACGTTAATCGCCATAAAATGATATCTTTTCGTGGAATAGCTGAGACAGGTGGGCACAATGGAGGCCTGTTTAAAGCTAGTGTTTCTTCTAGTAATTCAGGGAAGCATGTTACTGTAGGTTCAGCTTCAGTTTTGCAACAAAAAATGTTTCCAAGTGCATCTCAATATAAACAAATTATTCCCATGAATCCACTACCTTTGAAGAAGCATGGTTGTGGCAGAAGCCCTATACAGGCTTGCTTTGAGGAGGAATTTTTGAAGGATTTGCTGCAGTTTCTTGCTCTTCGAGGTCATAATCGACTTATTCCTCCTGGTGGGCTTGCTGAGTTTCCAGATGCAATACTCAATGGAAAGCGTCTCGACCTTTACAACCTGTACAAGGAGGTGGTTTCAAGAGGTGGCTTTCGTGTTGGAAATGGTATTAACTGGAAAGGACAGATCTTCTC
AAAGATGCGCAATTACACCATGACCAATAGAATGACTGGTGTTGGAAACACGCTTAAAAGACATTATGAAACTTATCTTCTCGAGTATGAATTGGCGCATGAAGATGTAGATGGTGAATGTTGCCTGTTGTGTCACAGTAGTGCAGCAGGGGACTGGGTGAACTGTGGCATTTGTGGCGAGTGGGCTCACTTTGGTTGTGACCGAAGGCAGGGTCTCGGCGCATTTAAGGATTATGCAAAAACTGATGGATTAGAATACATATGTCCACACTGTAGTGTTGCTAATTACAAGAAGAAGAAAGTTGGTAATGGGTTGTCTCCAGGCTTATCCTCAAGACCAATATGA
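The legend lists five_prime_UTR, CDS and three_prime_UTR as parts of the transcript. As a minimal sketch, assuming the CDS occurs as one contiguous stretch of the mRNA, the two UTRs can be recovered by locating the CDS inside the mRNA string. The function name `split_utrs` and the short toy sequences are hypothetical and are not taken from this record; in practice the full mRNA and CDS strings above would be passed in.

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str, str]:
    """Split an mRNA into (5' UTR, CDS, 3' UTR) by locating the CDS.

    Assumes the CDS appears exactly as a contiguous substring of the mRNA;
    the first occurrence is used.
    """
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Toy sequences for illustration only (not from this record).
toy_mrna = "GGAACC" + "ATGGTGTTTTGA" + "TTTAGG"
toy_cds = "ATGGTGTTTTGA"

five_utr, cds, three_utr = split_utrs(toy_mrna, toy_cds)
print(len(five_utr), len(cds), len(three_utr))
```

Using the first occurrence is a simplification; a stricter check could verify that the CDS occurs only once.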
Protein sequence
MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSKDEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVANGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRMYCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHDVEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVVTMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSREPRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDDAERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSGNRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGGLFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFEEEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGINWKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRPI
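The protein sequence is the conceptual translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and translates only the first and last few codons of the CDS shown above, to illustrate how the two sequences relate. The function name `translate` is illustrative and not part of any tool referenced on this page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 nucleotides of the CDS listed above.
cds_start = "ATGGTGTTTCATTCTCAAGTTCCTTCAAGA"
cds_end = "TTATCCTCAAGACCAATATGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MVFHSQVPSR` and `LSSRPI`, which match the first ten and last six residues of the protein sequence.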
Homology
BLAST of CmoCh17G006360 vs. ExPASy Swiss-Prot
Match:
Q6NQ79 (AT-rich interactive domain-containing protein 4 OS=Arabidopsis thaliana OX=3702 GN=ARID4 PE=1 SV=1)
HSP 1 Score: 880.2 bits (2273), Expect = 1.8e-254
Identity = 441/779 (56.61%), Positives = 556/779 (71.37%), Query Frame = 0
Query: 2 VFHSQVPSRYTCRLLAIPC-RSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 61
+FH Q SR C ++A+ + + + D + +YPFP L+SSGRL+ QVL NP+
Sbjct: 1 MFHGQGFSRNRCNVVAVVSGAELCDTNNQIDGTSHQPKYPFPDLSSSGRLKFQVLNNPTP 60
Query: 62 DEFCRIIESYRPSIVYLQGEQL-ENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEV 121
+EF + S VYLQGE ++DEVG LV D T +A+ LF S LPTTVYLE+
Sbjct: 61 EEFQVAVNSSATDFVYLQGEHSGDSDEVGPLVLGYTDFSTPDALVTLFGSTLPTTVYLEL 120
Query: 122 ANGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFR 181
NG+E+A AL+SKG+ YVIYW++ FS YAACHFR++ SV+QSS + TWD F +A ASFR
Sbjct: 121 PNGEELAQALYSKGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFR 180
Query: 182 MYCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDH 241
+YC ++ VLP++S+ ++ ++GP LLGE +I++ E + ++E S +PS+ I D
Sbjct: 181 LYCTSDNAVLPSNSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDE 240
Query: 242 DVEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGV 301
DV +RFL+CG P ++D ++LG+L DGLNALL IE+RGSKLH + SAP PPLQAG + GV
Sbjct: 241 DVTVRFLLCGPPCTVDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGV 300
Query: 302 VTMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHS- 361
VTMRCD+STCS AHIS+LVSG+AQ CF DQL EN+IK+E++++ + V ++++ + +K
Sbjct: 301 VTMRCDVSTCSSAHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHSVVNSEETKRGF 360
Query: 362 REPRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEK 421
EPR+SAS+ACGA+V EVSM+VP+WA Q+LRQLAPDVSYRSLVVLG+A IQGLSVASFEK
Sbjct: 361 SEPRRSASIACGASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEK 420
Query: 422 DDAERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAP-RKR---LRNVKDISLGSHDVIQH 481
DDAERL+FF ++ D ++ S P+WL PP P RKR R K+I G
Sbjct: 421 DDAERLLFFCGQQINDTSNHDALLSKIPNWLTPPLPTRKRSEPCRESKEIENGG------ 480
Query: 482 LKVLSGNRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAET 541
P R + +AA+RPIPH RHKMI F G +E
Sbjct: 481 -----------------------------PTSRKINVAALRPIPHTRRHKMIPFSGYSEI 540
Query: 542 GGHNGGLFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSP 601
G +G K S+ KH G V +K F + Q KQII +NPLPLKKH CGR+
Sbjct: 541 GRFDGDHTKGSLPM--PPKHGASGGTPVTHRKAFSGSYQRKQIISLNPLPLKKHDCGRAH 600
Query: 602 IQACFEEEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFR 661
IQ C EEEFL+D++QFL +RGH RL+PPGGLAEFPDA+LN KRLDL+NLY+EVVSRGGF
Sbjct: 601 IQVCSEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDLFNLYREVVSRGGFH 660
Query: 662 VGNGINWKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSS 721
VGNGINWKGQ+FSKMRN+T+TNRMTGVGNTLKRHYETYLLEYE AH+DVDGECCL+C SS
Sbjct: 661 VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAHDDVDGECCLICRSS 720
Query: 722 AAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKK--KVGNG 772
AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEY+CP+CSV+NY+KK K NG
Sbjct: 721 TAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSVSNYRKKSQKTSNG 742
BLAST of CmoCh17G006360 vs. ExPASy TrEMBL
Match:
A0A6J1GPB6 (AT-rich interactive domain-containing protein 4-like OS=Cucurbita moschata OX=3662 GN=LOC111456216 PE=4 SV=1)
HSP 1 Score: 1604.0 bits (4152), Expect = 0.0e+00
Identity = 781/781 (100.00%), Positives = 781/781 (100.00%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK
Sbjct: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA
Sbjct: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD
Sbjct: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV
Sbjct: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE
Sbjct: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD
Sbjct: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG
Sbjct: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG
Sbjct: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE
Sbjct: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP
Sbjct: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
Query: 781 I 782
I
Sbjct: 781 I 781
BLAST of CmoCh17G006360 vs. ExPASy TrEMBL
Match:
A0A6J1JUU6 (AT-rich interactive domain-containing protein 4-like OS=Cucurbita maxima OX=3661 GN=LOC111488561 PE=4 SV=1)
HSP 1 Score: 1570.4 bits (4065), Expect = 0.0e+00
Identity = 763/781 (97.70%), Positives = 771/781 (98.72%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQ PSRYTCRLLAIPC SVPEDKFKKDNP+DEQRYPFPQLNSSGRLEVQVLPNPSK
Sbjct: 1 MVFHSQAPSRYTCRLLAIPCGSVPEDKFKKDNPEDEQRYPFPQLNSSGRLEVQVLPNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
DEFCRIIESYRP+IVYLQGE+LENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLE+A
Sbjct: 61 DEFCRIIESYRPNIVYLQGERLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEIA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRIN+EPLEKEAADDEESSSEVPSVSILDH
Sbjct: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINVEPLEKEAADDEESSSEVPSVSILDHY 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHS+GVV
Sbjct: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSDGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKH RE
Sbjct: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHLRE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD
Sbjct: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERL+FFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDI LGSHDVIQHLKV SG
Sbjct: 421 AERLLFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDIRLGSHDVIQHLKVSSG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
+RIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMI FRGIAETGGHNGG
Sbjct: 481 SRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMIYFRGIAETGGHNGG 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
L KASVSSSNS KHV VGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE
Sbjct: 541 LVKASVSSSNSAKHVIVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP
Sbjct: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
Query: 781 I 782
I
Sbjct: 781 I 781
BLAST of CmoCh17G006360 vs. ExPASy TrEMBL
Match:
A0A5D3BXU1 (AT-rich interactive domain-containing protein 4-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G001620 PE=4 SV=1)
HSP 1 Score: 1440.6 bits (3728), Expect = 0.0e+00
Identity = 697/781 (89.24%), Positives = 733/781 (93.85%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQVP+RYTCRLLAIP SVPEDK KKDNP+D+QRYPFPQLNSSGRLEVQVL NPSK
Sbjct: 1 MVFHSQVPARYTCRLLAIPSGSVPEDKSKKDNPEDQQRYPFPQLNSSGRLEVQVLSNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
D+FCR +ESY+P+IVYLQGEQLENDEVGSLVW GVDL TVEAISGLF+ PL TTVYL++A
Sbjct: 61 DQFCRTLESYKPNIVYLQGEQLENDEVGSLVWGGVDLSTVEAISGLFNYPLLTTVYLDIA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
GDEVADALHSKGIPYVIYWRSTF+CY ACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 KGDEVADALHSKGIPYVIYWRSTFTCYTACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGN+ VLP+SSH +VSEDLGPHLLGERL+IN+EPLEKE ADDEESSSE SVS+LD+D
Sbjct: 181 YCLGNNFVLPSSSHKEVSEDLGPHLLGERLKINVEPLEKEVADDEESSSEGISVSVLDND 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFL+CGEP SLDAYVL ALEDGLNALLDIEIRGSKLH KFSAPPPPLQAG SNGVV
Sbjct: 241 VEMRFLVCGEPGSLDAYVLEALEDGLNALLDIEIRGSKLHSKFSAPPPPLQAGTLSNGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCD+STCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDR E VQTLIDG+GSKH E
Sbjct: 301 TMRCDLSTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRGELVQTLIDGEGSKHLSE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKS S+ACGATVFEVS+KVPSWASQILRQLAPDVSYRSLV LGIA IQGLSVASFEKDD
Sbjct: 361 PRKSTSIACGATVFEVSLKVPSWASQILRQLAPDVSYRSLVGLGIASIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERL+FF SRK++DLFL+NLTDST PSWLKPPAPRKR + +KD SLGSHD+I+HLKVL G
Sbjct: 421 AERLLFFCSRKEKDLFLSNLTDSTLPSWLKPPAPRKRSKYMKDTSLGSHDIIEHLKVLPG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
+RI NMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISF GI+E GGHNGG
Sbjct: 481 SRIHSANMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFHGISEMGGHNGG 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
L KASV SSN KHVTVGSASV QQK+FPSASQYKQIIPMNPLPLKKHGCGRS IQACFE
Sbjct: 541 LLKASVPSSNPTKHVTVGSASVFQQKVFPSASQYKQIIPMNPLPLKKHGCGRSHIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGH+RLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHSRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCG CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKV NGLSPG SSRP
Sbjct: 721 NCGFCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVANGLSPGFSSRP 780
Query: 781 I 782
+
Sbjct: 781 M 781
BLAST of CmoCh17G006360 vs. ExPASy TrEMBL
Match:
A0A1S3CGG2 (AT-rich interactive domain-containing protein 4-like OS=Cucumis melo OX=3656 GN=LOC103500649 PE=4 SV=1)
HSP 1 Score: 1440.2 bits (3727), Expect = 0.0e+00
Identity = 696/781 (89.12%), Positives = 733/781 (93.85%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQVP+RYTCRLLAIP S+PEDK KKDNP+D+QRYPFPQLNSSGRLEVQVL NPSK
Sbjct: 1 MVFHSQVPARYTCRLLAIPSGSIPEDKSKKDNPEDQQRYPFPQLNSSGRLEVQVLSNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
D+FCR +ESY+P+IVYLQGEQLENDEVGSLVW GVDL TVEAISGLF+ PL TTVYL++A
Sbjct: 61 DQFCRTLESYKPNIVYLQGEQLENDEVGSLVWGGVDLSTVEAISGLFNYPLLTTVYLDIA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
GDEVADALHSKGIPYVIYWRSTF+CY ACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 KGDEVADALHSKGIPYVIYWRSTFTCYTACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGN+ VLP+SSH +VSEDLGPHLLGERL+IN+EPLEKE ADDEESSSE SVS+LD+D
Sbjct: 181 YCLGNNFVLPSSSHKEVSEDLGPHLLGERLKINVEPLEKEVADDEESSSEGISVSVLDND 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFL+CGEP SLDAYVL ALEDGLNALLDIEIRGSKLH KFSAPPPPLQAG SNGVV
Sbjct: 241 VEMRFLVCGEPGSLDAYVLEALEDGLNALLDIEIRGSKLHSKFSAPPPPLQAGTLSNGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCD+STCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDR E VQTLIDG+GSKH E
Sbjct: 301 TMRCDLSTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRGELVQTLIDGEGSKHLSE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKS S+ACGATVFEVS+KVPSWASQILRQLAPDVSYRSLV LGIA IQGLSVASFEKDD
Sbjct: 361 PRKSTSIACGATVFEVSLKVPSWASQILRQLAPDVSYRSLVGLGIASIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERL+FF SRK++DLFL+NLTDST PSWLKPPAPRKR + +KD SLGSHD+I+HLKVL G
Sbjct: 421 AERLLFFCSRKEKDLFLSNLTDSTLPSWLKPPAPRKRSKYMKDTSLGSHDIIEHLKVLPG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
+RI NMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISF GI+E GGHNGG
Sbjct: 481 SRIHSANMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFHGISEMGGHNGG 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
L KASV SSN KHVTVGSASV QQK+FPSASQYKQIIPMNPLPLKKHGCGRS IQACFE
Sbjct: 541 LLKASVPSSNPTKHVTVGSASVFQQKVFPSASQYKQIIPMNPLPLKKHGCGRSHIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGH+RLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHSRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCG CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKV NGLSPG SSRP
Sbjct: 721 NCGFCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVANGLSPGFSSRP 780
Query: 781 I 782
+
Sbjct: 781 M 781
BLAST of CmoCh17G006360 vs. ExPASy TrEMBL
Match:
A0A0A0KCC6 (ARID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G448080 PE=4 SV=1)
HSP 1 Score: 1435.2 bits (3714), Expect = 0.0e+00
Identity = 693/781 (88.73%), Positives = 730/781 (93.47%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQVP+RYTCRLLAIP SVPEDK KKDNP+D+QRYPFPQLNSSGRLEVQVL NPSK
Sbjct: 38 MVFHSQVPARYTCRLLAIPYGSVPEDKCKKDNPEDQQRYPFPQLNSSGRLEVQVLSNPSK 97
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
D+FCR +ESY+P+IVYLQGEQLENDEVGSLVW GVDL VEAISGLF+ PLPTTVYL++A
Sbjct: 98 DQFCRTLESYKPNIVYLQGEQLENDEVGSLVWRGVDLSNVEAISGLFNYPLPTTVYLDIA 157
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
GDEVADALHSKGIPYVIYWRS F+CYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 158 KGDEVADALHSKGIPYVIYWRSAFTCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 217
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGN+ VLP+SSH +VSEDLGPHLLGERL+IN+EPLEKE ADDEESSSE SV+ILD+D
Sbjct: 218 YCLGNNFVLPSSSHKEVSEDLGPHLLGERLKINVEPLEKEVADDEESSSEGISVNILDND 277
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFL+CGEP SLDAYVL ALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAG SNGVV
Sbjct: 278 VEMRFLVCGEPGSLDAYVLEALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGTLSNGVV 337
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCD+STCSFAHISLLVSGSAQACFDDQLFENYIK EIIDR E VQTL+D +GSKH E
Sbjct: 338 TMRCDLSTCSFAHISLLVSGSAQACFDDQLFENYIKTEIIDRGELVQTLLDSEGSKHLSE 397
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKS S+ACGATVFEVS+KVPSWASQI RQLAPDVSYRSLV LGIA IQGLSVASFEKDD
Sbjct: 398 PRKSTSIACGATVFEVSLKVPSWASQIFRQLAPDVSYRSLVGLGIASIQGLSVASFEKDD 457
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERL+FF SRK+ DLFL+NLTDST PSWLKPPAPRKR + +KD SLGSH++I+HLKV G
Sbjct: 458 AERLLFFCSRKENDLFLSNLTDSTLPSWLKPPAPRKRPKYIKDTSLGSHEIIEHLKVSPG 517
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
+RI G NMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISF GI+ETGGHNG
Sbjct: 518 SRIHGANMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFHGISETGGHNGS 577
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
L KASV SSN KHVTVGSASV QQK+FPSAS YKQIIPMNPLPLKKHGCGRS IQACFE
Sbjct: 578 LLKASVPSSNPTKHVTVGSASVFQQKVFPSASHYKQIIPMNPLPLKKHGCGRSHIQACFE 637
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDL+QFLALRGH+RLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 638 EEFLKDLMQFLALRGHSRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 697
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 698 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 757
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKV NGLSPG SSRP
Sbjct: 758 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVANGLSPGFSSRP 817
Query: 781 I 782
I
Sbjct: 818 I 818
BLAST of CmoCh17G006360 vs. NCBI nr
Match:
XP_022953795.1 (AT-rich interactive domain-containing protein 4-like [Cucurbita moschata] >XP_022953796.1 AT-rich interactive domain-containing protein 4-like [Cucurbita moschata] >XP_022953797.1 AT-rich interactive domain-containing protein 4-like [Cucurbita moschata] >XP_022953798.1 AT-rich interactive domain-containing protein 4-like [Cucurbita moschata])
HSP 1 Score: 1604.0 bits (4152), Expect = 0.0e+00
Identity = 781/781 (100.00%), Positives = 781/781 (100.00%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK
Sbjct: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA
Sbjct: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD
Sbjct: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV
Sbjct: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE
Sbjct: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD
Sbjct: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG
Sbjct: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG
Sbjct: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE
Sbjct: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP
Sbjct: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
Query: 781 I 782
I
Sbjct: 781 I 781
BLAST of CmoCh17G006360 vs. NCBI nr
Match:
KAG7013957.1 (AT-rich interactive domain-containing protein 4 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1589.7 bits (4115), Expect = 0.0e+00
Identity = 772/781 (98.85%), Positives = 777/781 (99.49%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQVPSRYTCRLLAIPC SVPEDKFKKDNP+DEQRYPFPQLNSSGRLEVQVLPNPSK
Sbjct: 1 MVFHSQVPSRYTCRLLAIPCGSVPEDKFKKDNPEDEQRYPFPQLNSSGRLEVQVLPNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
DEFCRIIESYRPSIVYLQGE+LENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLE+A
Sbjct: 61 DEFCRIIESYRPSIVYLQGERLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEIA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD
Sbjct: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
+EMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV
Sbjct: 241 LEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE
Sbjct: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD
Sbjct: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERL+FFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHD IQHLKV SG
Sbjct: 421 AERLLFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDAIQHLKVSSG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG
Sbjct: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
LFKASVSSSNS KHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE
Sbjct: 541 LFKASVSSSNSAKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP
Sbjct: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
Query: 781 I 782
I
Sbjct: 781 I 781
BLAST of CmoCh17G006360 vs. NCBI nr
Match:
XP_023548213.1 (AT-rich interactive domain-containing protein 4-like [Cucurbita pepo subsp. pepo] >XP_023548214.1 AT-rich interactive domain-containing protein 4-like [Cucurbita pepo subsp. pepo] >XP_023548215.1 AT-rich interactive domain-containing protein 4-like [Cucurbita pepo subsp. pepo] >XP_023548216.1 AT-rich interactive domain-containing protein 4-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1577.8 bits (4084), Expect = 0.0e+00
Identity = 766/781 (98.08%), Positives = 775/781 (99.23%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQVPSRYTCRLLAIPC SVPED+F+KDNP+DEQRYPFPQLNSSGRLEVQVLPNPSK
Sbjct: 1 MVFHSQVPSRYTCRLLAIPCGSVPEDRFQKDNPEDEQRYPFPQLNSSGRLEVQVLPNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
DEFCR IESYRP+IVYLQGE+LENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLE+A
Sbjct: 61 DEFCRTIESYRPNIVYLQGERLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEIA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGNSLVL NSSHNDVSEDLGPHLLGERLRIN+EPLEKEAADDEESSSEVPSVSILDHD
Sbjct: 181 YCLGNSLVLHNSSHNDVSEDLGPHLLGERLRINVEPLEKEAADDEESSSEVPSVSILDHD 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV
Sbjct: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE
Sbjct: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD
Sbjct: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERL+FFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKV SG
Sbjct: 421 AERLLFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVSSG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
+RIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG
Sbjct: 481 SRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
LFKASVSSSNS KHVTVGSASVLQQKMF SASQYKQIIPMNPLPLKKHGCGRSPIQACFE
Sbjct: 541 LFKASVSSSNSAKHVTVGSASVLQQKMFSSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP
Sbjct: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
Query: 781 I 782
I
Sbjct: 781 I 781
BLAST of CmoCh17G006360 vs. NCBI nr
Match:
XP_022992125.1 (AT-rich interactive domain-containing protein 4-like [Cucurbita maxima] >XP_022992126.1 AT-rich interactive domain-containing protein 4-like [Cucurbita maxima] >XP_022992127.1 AT-rich interactive domain-containing protein 4-like [Cucurbita maxima] >XP_022992128.1 AT-rich interactive domain-containing protein 4-like [Cucurbita maxima] >XP_022992129.1 AT-rich interactive domain-containing protein 4-like [Cucurbita maxima])
HSP 1 Score: 1570.4 bits (4065), Expect = 0.0e+00
Identity = 763/781 (97.70%), Positives = 771/781 (98.72%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQ PSRYTCRLLAIPC SVPEDKFKKDNP+DEQRYPFPQLNSSGRLEVQVLPNPSK
Sbjct: 1 MVFHSQAPSRYTCRLLAIPCGSVPEDKFKKDNPEDEQRYPFPQLNSSGRLEVQVLPNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
DEFCRIIESYRP+IVYLQGE+LENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLE+A
Sbjct: 61 DEFCRIIESYRPNIVYLQGERLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEIA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRIN+EPLEKEAADDEESSSEVPSVSILDH
Sbjct: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINVEPLEKEAADDEESSSEVPSVSILDHY 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHS+GVV
Sbjct: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSDGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKH RE
Sbjct: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHLRE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD
Sbjct: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERL+FFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDI LGSHDVIQHLKV SG
Sbjct: 421 AERLLFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDIRLGSHDVIQHLKVSSG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
+RIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMI FRGIAETGGHNGG
Sbjct: 481 SRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMIYFRGIAETGGHNGG 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
L KASVSSSNS KHV VGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE
Sbjct: 541 LVKASVSSSNSAKHVIVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP
Sbjct: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
Query: 781 I 782
I
Sbjct: 781 I 781
BLAST of CmoCh17G006360 vs. NCBI nr
Match:
KAG6575416.1 (AT-rich interactive domain-containing protein 4, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1537.7 bits (3980), Expect = 0.0e+00
Identity = 752/781 (96.29%), Positives = 756/781 (96.80%), Query Frame = 0
Query: 1 MVFHSQVPSRYTCRLLAIPCRSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 60
MVFHSQVPSRYTCRLLAIPC SVPEDKFKKDNP+DEQRYPFPQLNSSGRLEVQVLPNPSK
Sbjct: 1 MVFHSQVPSRYTCRLLAIPCGSVPEDKFKKDNPEDEQRYPFPQLNSSGRLEVQVLPNPSK 60
Query: 61 DEFCRIIESYRPSIVYLQGEQLENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEVA 120
DEFCRIIESYRPSIVYLQGE+LENDEVGSLVWEGVDLFTVEAISGLF+SPLPTTVYLE+A
Sbjct: 61 DEFCRIIESYRPSIVYLQGERLENDEVGSLVWEGVDLFTVEAISGLFTSPLPTTVYLEIA 120
Query: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM
Sbjct: 121 NGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFRM 180
Query: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD
Sbjct: 181 YCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDHD 240
Query: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV
Sbjct: 241 VEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGVV 300
Query: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE
Sbjct: 301 TMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHSRE 360
Query: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD
Sbjct: 361 PRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEKDD 420
Query: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVLSG 480
AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKV SG
Sbjct: 421 AERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAPRKRLRNVKDISLGSHDVIQHLKVSSG 480
Query: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAETGGHNGG 540
NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAET
Sbjct: 481 NRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAET------ 540
Query: 541 LFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
GSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE
Sbjct: 541 -----------------GSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSPIQACFE 600
Query: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN
Sbjct: 601 EEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFRVGNGIN 660
Query: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV
Sbjct: 661 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSSAAGDWV 720
Query: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 780
NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP
Sbjct: 721 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKKKVGNGLSPGLSSRP 758
Query: 781 I 782
I
Sbjct: 781 I 758
BLAST of CmoCh17G006360 vs. TAIR 10
Match:
AT3G43240.1 (ARID/BRIGHT DNA-binding domain-containing protein)
HSP 1 Score: 880.2 bits (2273), Expect = 1.2e-255
Identity = 441/779 (56.61%), Positives = 556/779 (71.37%), Query Frame = 0
Query: 2 VFHSQVPSRYTCRLLAIPC-RSVPEDKFKKDNPDDEQRYPFPQLNSSGRLEVQVLPNPSK 61
+FH Q SR C ++A+ + + + D + +YPFP L+SSGRL+ QVL NP+
Sbjct: 1 MFHGQGFSRNRCNVVAVVSGAELCDTNNQIDGTSHQPKYPFPDLSSSGRLKFQVLNNPTP 60
Query: 62 DEFCRIIESYRPSIVYLQGEQL-ENDEVGSLVWEGVDLFTVEAISGLFSSPLPTTVYLEV 121
+EF + S VYLQGE ++DEVG LV D T +A+ LF S LPTTVYLE+
Sbjct: 61 EEFQVAVNSSATDFVYLQGEHSGDSDEVGPLVLGYTDFSTPDALVTLFGSTLPTTVYLEL 120
Query: 122 ANGDEVADALHSKGIPYVIYWRSTFSCYAACHFRNAFLSVLQSSSAHTWDAFQLAHASFR 181
NG+E+A AL+SKG+ YVIYW++ FS YAACHFR++ SV+QSS + TWD F +A ASFR
Sbjct: 121 PNGEELAQALYSKGVQYVIYWKNVFSKYAACHFRHSLFSVIQSSCSDTWDVFHVAEASFR 180
Query: 182 MYCLGNSLVLPNSSHNDVSEDLGPHLLGERLRINIEPLEKEAADDEESSSEVPSVSILDH 241
+YC ++ VLP++S+ ++ ++GP LLGE +I++ E + ++E S +PS+ I D
Sbjct: 181 LYCTSDNAVLPSNSNRKMNYEMGPCLLGEPPKIDVVSPEADELEEENSLESLPSIKIYDE 240
Query: 242 DVEMRFLICGEPRSLDAYVLGALEDGLNALLDIEIRGSKLHGKFSAPPPPLQAGMHSNGV 301
DV +RFL+CG P ++D ++LG+L DGLNALL IE+RGSKLH + SAP PPLQAG + GV
Sbjct: 241 DVTVRFLLCGPPCTVDTFLLGSLMDGLNALLRIEMRGSKLHNRSSAPAPPLQAGTFTRGV 300
Query: 302 VTMRCDISTCSFAHISLLVSGSAQACFDDQLFENYIKNEIIDRSEPVQTLIDGDGSKHS- 361
VTMRCD+STCS AHIS+LVSG+AQ CF DQL EN+IK+E++++ + V ++++ + +K
Sbjct: 301 VTMRCDVSTCSSAHISMLVSGNAQTCFSDQLLENHIKHEVVEKIQLVHSVVNSEETKRGF 360
Query: 362 REPRKSASVACGATVFEVSMKVPSWASQILRQLAPDVSYRSLVVLGIAGIQGLSVASFEK 421
EPR+SAS+ACGA+V EVSM+VP+WA Q+LRQLAPDVSYRSLVVLG+A IQGLSVASFEK
Sbjct: 361 SEPRRSASIACGASVCEVSMQVPTWALQVLRQLAPDVSYRSLVVLGVASIQGLSVASFEK 420
Query: 422 DDAERLIFFGSRKQRDLFLNNLTDSTPPSWLKPPAP-RKR---LRNVKDISLGSHDVIQH 481
DDAERL+FF ++ D ++ S P+WL PP P RKR R K+I G
Sbjct: 421 DDAERLLFFCGQQINDTSNHDALLSKIPNWLTPPLPTRKRSEPCRESKEIENGG------ 480
Query: 482 LKVLSGNRIDGENMEIGSRNGFSTPMFPLPRRRGMKIAAMRPIPHVNRHKMISFRGIAET 541
P R + +AA+RPIPH RHKMI F G +E
Sbjct: 481 -----------------------------PTSRKINVAALRPIPHTRRHKMIPFSGYSEI 540
Query: 542 GGHNGGLFKASVSSSNSGKHVTVGSASVLQQKMFPSASQYKQIIPMNPLPLKKHGCGRSP 601
G +G K S+ KH G V +K F + Q KQII +NPLPLKKH CGR+
Sbjct: 541 GRFDGDHTKGSLPM--PPKHGASGGTPVTHRKAFSGSYQRKQIISLNPLPLKKHDCGRAH 600
Query: 602 IQACFEEEFLKDLLQFLALRGHNRLIPPGGLAEFPDAILNGKRLDLYNLYKEVVSRGGFR 661
IQ C EEEFL+D++QFL +RGH RL+PPGGLAEFPDA+LN KRLDL+NLY+EVVSRGGF
Sbjct: 601 IQVCSEEEFLRDVMQFLLIRGHTRLVPPGGLAEFPDAVLNSKRLDLFNLYREVVSRGGFH 660
Query: 662 VGNGINWKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHEDVDGECCLLCHSS 721
VGNGINWKGQ+FSKMRN+T+TNRMTGVGNTLKRHYETYLLEYE AH+DVDGECCL+C SS
Sbjct: 661 VGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYEYAHDDVDGECCLICRSS 720
Query: 722 AAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVANYKKK--KVGNG 772
AGDWVNCG CGEWAHFGCDRR GLGAFKDYAKTDGLEY+CP+CSV+NY+KK K NG
Sbjct: 721 TAGDWVNCGSCGEWAHFGCDRRPGLGAFKDYAKTDGLEYVCPNCSVSNYRKKSQKTSNG 742
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q6NQ79 | 1.8e-254 | 56.61 | AT-rich interactive domain-containing protein 4 OS=Arabidopsis thaliana OX=3702 ...
Match Name | E-value | Identity (%) | Description
A0A6J1GPB6 | 0.0e+00 | 100.00 | AT-rich interactive domain-containing protein 4-like OS=Cucurbita moschata OX=36...
A0A6J1JUU6 | 0.0e+00 | 97.70 | AT-rich interactive domain-containing protein 4-like OS=Cucurbita maxima OX=3661...
A0A5D3BXU1 | 0.0e+00 | 89.24 | AT-rich interactive domain-containing protein 4-like OS=Cucumis melo var. makuwa...
A0A1S3CGG2 | 0.0e+00 | 89.12 | AT-rich interactive domain-containing protein 4-like OS=Cucumis melo OX=3656 GN=...
A0A0A0KCC6 | 0.0e+00 | 88.73 | ARID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G448080 PE=4 S...
Match Name | E-value | Identity (%) | Description
XP_022953795.1 | 0.0e+00 | 100.00 | AT-rich interactive domain-containing protein 4-like [Cucurbita moschata] >XP_02...
KAG7013957.1 | 0.0e+00 | 98.85 | AT-rich interactive domain-containing protein 4 [Cucurbita argyrosperma subsp. a...
XP_023548213.1 | 0.0e+00 | 98.08 | AT-rich interactive domain-containing protein 4-like [Cucurbita pepo subsp. pepo...
XP_022992125.1 | 0.0e+00 | 97.70 | AT-rich interactive domain-containing protein 4-like [Cucurbita maxima] >XP_0229...
KAG6575416.1 | 0.0e+00 | 96.29 | AT-rich interactive domain-containing protein 4, partial [Cucurbita argyrosperma...
Match Name | E-value | Identity (%) | Description
AT3G43240.1 | 1.2e-255 | 56.61 | ARID/BRIGHT DNA-binding domain-containing protein