Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
GATCTTAGAACTTAGCAGACTTAGCTTCAAACCAAAACCCTCAGGCCCTAACCATCTTCATCCAAGAGCCCCTAATTGGTGTTCGTTTCGTTGATTCGCCCCCAACTTCAATCATTTATAATCCGATCAAGTTCCGGCCGTCGTGTATTTGACTGTCCCACGCCACTGCCATCCCTTTTGTCTTGGTGTTCCATCTTCATCACTGCTATCTTCGTCTTCATAAATGGGTTGCAGGAAGAAAGGAGTCCCAGATTCCCAAAACGAGGTAGAGCATTTTGACTTTTCATGGCAAAGGGTTTAATCTATCTTTTGCGTTAATCTTGGAGGGTTTCTGTTTGTTTCGTATGTATTTTTTAACAAGAAAGAAGAGTTCCGTGCGAATTTGCTTTGATTTTAAAGTGGGTTTGTGTTCATGGGGAGATGGCACGTTTCATCTAATGCTTCCATTTTAGATTGCAATAAAGATGTAGATCCTAATCCTAGTAATGGCTGTTGCATTGCTTCGGATTGTTTGGTAGAGAGGAGTTATGAAAATGTTGATTATGATGATTGCAAGGCGAGAATTAGATGCTATTTTGAGAAAATTCTTTGGGTTTTTTTAAAGGAAATTGGTCGTAGAGGATTTGTTAGGCCACTGCCTGCGTTAATAGGTGAAGGGGGAGCTTTGGATTTGTTTGAACTGTTCTTGGTAGTAAGAGATAAAGGAGGTTCTCAAGTGGTTTCAGAGAAGAAACTATGGTCTTCAGTGGTTGTCGAATTAGGTTTGGATCTTGGGCTTTCGGCGTCGGTGAAATTGATTTATTCCAAGTACTTAAGTGATCTAGAGAAATGGCTTATGGTGAGATGTGGAGACACAAAACTGGAAAATGGGAGCTCTGATTATTGCTACAAGAAAAGTTCTCCATTTTTATCGGAACTCGGGGCAAAGATTAACGGTATGTTGTATGGTGTGCCGAGACAAAATAGCATATATGATGAATGTTTTGGATTCAAATCTAACAAACAGAATGGGAACGTTAATGTTGCTGCTGCTGCTGCAGTGGAGAAGGAAATAAAATTCCCTGAAATAAAGAAGAAAGAGCACGATCTCCATGGGGATGTTACATCAATTCAACAAGATTGTACAGAGACGCATCCAATCCATGTTATTGAAGATGGTCAAAGTTTGGATGCTGTTAATGTTGAAGCTGAAATAGAATCTCTTGGGAAATATCGAGAATCGTTATTACGAATGCTGAAGTGGGTGAGAAAGACTGCGAAGCATCCTGAAGATCCATTAAATGGTACAATACCGGGGACATCTAGGTGGAAAGGTTACTCGAGCGACGATGCATTATGGCTTCAAGTAATCAGGGCAAAGGATGCTCTTCTAATTAGGAAGGGTGTTGACAAAATTGCTGAGAAACGTCTGTTAATACAGGTAGATTTCACCCACTTATTCTGATTGTAAGCCAGTTCTCTCTCTCTCCCTCTTCCCCTTTTTTCCTTGACCGATCGACATACTTGCTTTCTTGCTTTGGACTGCAATTATGCGCAAACTGACGTGTGCAGGTATTACGATCTTTGGCTTTCTTTTTCAGGATAGAGTCTTCTGATCATAATCAGCGCCCTTTTCTTTCATATATAGAACGGGAAAGTTACCTCCTGCTATTAGAACCTTGGGTCTCCATCATCATAAGCAAGCAGCATACGACTACTGTCTCCCTGTCTTGCGTCCTATGTGATAACATGATGGACACAGTAAGGAAAACTTAGATCTTTGTCATTGGGTTTACTAGCATAGTCTTTACTTTATCAGGGATACTTAAAAGGCCAATAGGATCAGGTTTGACCTTGTTTATGGGTTCTGACGTTTTCACCTTTTAGCTCACTATTAGGAGTTTACATCTCTCAATTTTCACGTAAGTTCGGCTGAAAACATAACAGAGCTCTTGCCACCACCAATAGAACGTGTTTATATTTATTACCTTTCTTTTGGGAGAACAAGGTCTTAGTGTTTT
CTAAATGGAAGTTGTTATTCGTTATAAAGTCGAGTGTCAAGTTGTGCAGGGTAGTCTATGCTCTTGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGGGTGGAGATTCAAGGCAGCTCATAGGGTTTCAAACTTAATAGAAGGATGATTTCTTTGAGGAAAGTTTACGAGATGGATATGAATGTATCTGGGGATTCGAGGGTCAGGAAATCAAAAGTGGATTTACATAGCTTCCCGAAGATGAGTAGGATTGAAAGGACGAGAGTTTCATCGAGAATTTTGAGGATTGAATAAGGCTTAGGAGATCGTTGGAGGTGGTGAATATTAGTGAGAGCTCTAGATGGAAGAAAAATTCCCATGTCCAATGGATTTAGGAGGATGATGCTAATAGAACCTAGGAGGAAGAGTAACCTAGTGAATGTGAATACGACCGCTTTCACTTAGGAAAAGGATATGTCTCGTATCTTCCTTCGTGCAACGTTGTTTGTAGGCATTAAGGGGGTTTTTCGTAAGGGTTTTTTTTTTTTTTTTTTTGATAAGAAACGAACTTTTCATTATCAAGGGAAGAATACATAAGGAGGGGAGATGAGATATCCCCACACACCAAAGGCTTACAGAAAAGATGCACAACTGGTAACAACGAGATAAGCTATAATTACAAAATAATTTAGAAGTGCTAGACAAAGTAGAAGCAAAAAGAGTTAAAGGATCCCGAAAAACCTCCCTGCCAGACTCATGGCTATTGAAAAGTTTCGAGTTTCTCTCAAACCAAAGCCGCCAAAGGAGGGCAAACACCCCTTTAGACCAAAAAATCTTGGCTTGCCCTTTAAAATGCATGCCACCAATGAGCTGACATCGGGCCTCTGACACTTTGCTAGGCCAACACCATTGAATACTGAAGACTTTGAGTTATACTTTGTGGCATTCACTTTCCTCATATGGTCTCCTTCTCATCATGAAACACCAGCCCCATTTGGCAAGGGGAGCCCAATTCCTCATGGCAAGGTTACAATAGAAATAGGGAGGGAAGTTATTTCCTAGTTTATAAGATTCATCATTCCATCCACTTTGCCACCTTTATTTCCCAGTCTAAGCTTTCCTTCTGCCGATAGGATATTACGATCTATGAAGTAAAAACGACACAGTTTAAGTAATTTGTCATGACGTGCAATAATTATCCTTTTGCTTTAGACTTTTCATATATCTTTTATTTTCCTTTTGAAAATTCGGGAATCGTTTTGAGCTTGTTTTGATATATTTATGACAAAGACATGTTCATACACTTGGACACAACTATTAACTTATTTGGTTTCGTACATCCTTTATTAAACCGAACCATGAAAGGCTGCTTTCACTTATGCTTATATGCTGATGACATATATAAGTTTATGAGACATGTTTATATAATTGAATATATCTGTTCTAGTGCCATTTATGTTTAGTGTAGGCAGTGCCTATGTACTTGCTCCTTAGGTTAGGAAGCATAAGTTATAGCTACGAAGAAAGAAGCGTATCTGTCAATGGTTTCTTTGTTTAAACAGAATGAAATGAACTACTGAGTTTGAGGAAAAAAAATTGGTTTAAACTTGCAGAAGAAAGTAAAGATGCATCCATCCATTTATGAGGATAATATTGATAACCATCACCTCTCTACAGAAAGGATAAGTTGCAGCAAAAGATCTAAAGCTTTGACTGAGTCCGTACTCGCCCCGTGTTCTAATTCTTGTCCAACCGTTCGAAGTAATTGTATTAGTAGTCTAACAACAGAAGTTGGAAAGGGACTCAAGAATCAAGCAGTTTTGAATGGTGATATACCATCTGAAATGGAAGACGATCATCCGAATGAAGATTCAGCTGAGGAGACGGTTCCCGTGGGTGCTGTATGTCAAGCAGATTTACCTGAATGGACTGGTAATAATTCTGATAGTGACTCTAAATGGCTAGGGACACGGTCGTGG
CCTCTTCAACACAGAAATAGTAATTCCGTACGTGATAGACGCGCCATTGGCAGAGGGAGACCGGATTCATGTGGCTGCCAATTTCCAGGTTCGGTTGAATGTTTTAGATTTCACATAGCTGAAGCAAGGATGAGATTAAAGCTCGAGCTTGGCTCGACATTCTTTGCATGGAGATTTCATCAAATGGGGGAGGAAATATCTCTGCAATGGACAGTTGAAGAGGAAAAGAGATTCAAGGAGTTGGCTATGTCAGGTTTCAACAATCATAATCGGTGCTTTTGGGACTATTCCTTGAGATGGTTTCCTATGAAATCAAGGAAAAATCTGATAAGCTATTACTTCAATGTGTTTCTTTTACGGCTGAGAAGCTATCAGAATCGCGTAACTCCGAATAGCATCGATAGCGACGATGAAGATTTTGAGTTCGGTCGTGTTAGTGGTGGATTTGGGGACAAGGCAATGGAAATTTTAGGCTCAAATTCTTTAGAATGTTCTATAAACAGACAGGTCACAGATGTGGAGTAGTCTAAAACTGGAGCAGAATCCGAAGTATCTAAAAAAGAGGGAAAGGAACGCAATTTTGACGAGAACTCGAGTTACGTTTTAGCTATACAACACCGATATCGGTACTTTATGGTGCTAGGTATTATATTTTTTGGGGAGGATCTGTATGGTATCGGCTGGTGAGAAATATGAGGATCAAAGATGGCCATTGCTACATTTGTTTGTTCTGAATTTGTGTTGTGTGGAACTCAATCTGATAGTTCTTGGTTTTGTATTTGGAGTTTGGTTTTAATCATTCCAAGGGGCTCTGATAAGAAAGCTTGAGAATAGCTCACCAGACACGTTTATATCGGTTGATTCGTGGAAAATTTTGTCATGTTTTTAAATTATTTGTCGAGGTAACATGTTTTTAAATTATTTGTCGAGGTAACATGTTTGTAAAT
mRNA sequence
GATCTTAGAACTTAGCAGACTTAGCTTCAAACCAAAACCCTCAGGCCCTAACCATCTTCATCCAAGAGCCCCTAATTGGTGTTCGTTTCGTTGATTCGCCCCCAACTTCAATCATTTATAATCCGATCAAGTTCCGGCCGTCGTGTATTTGACTGTCCCACGCCACTGCCATCCCTTTTGTCTTGGTGTTCCATCTTCATCACTGCTATCTTCGTCTTCATAAATGGGTTGCAGGAAGAAAGGAGTCCCAGATTCCCAAAACGAGGTAGAGCATTTTGACTTTTCATGGCAAAGGGTTTAATCTATCTTTTGCGTTAATCTTGGAGGGTTTCTGTTTGTTTCGTATGTATTTTTTAACAAGAAAGAAGAGTTCCGTGCGAATTTGCTTTGATTTTAAAGTGGGTTTGTGTTCATGGGGAGATGGCACGTTTCATCTAATGCTTCCATTTTAGATTGCAATAAAGATGTAGATCCTAATCCTAGTAATGGCTGTTGCATTGCTTCGGATTGTTTGGTAGAGAGGAGTTATGAAAATGTTGATTATGATGATTGCAAGGCGAGAATTAGATGCTATTTTGAGAAAATTCTTTGGGTTTTTTTAAAGGAAATTGGTCGTAGAGGATTTGTTAGGCCACTGCCTGCGTTAATAGGTGAAGGGGGAGCTTTGGATTTGTTTGAACTGTTCTTGGTAGTAAGAGATAAAGGAGGTTCTCAAGTGGTTTCAGAGAAGAAACTATGGTCTTCAGTGGTTGTCGAATTAGGTTTGGATCTTGGGCTTTCGGCGTCGGTGAAATTGATTTATTCCAAGTACTTAAGTGATCTAGAGAAATGGCTTATGGTGAGATGTGGAGACACAAAACTGGAAAATGGGAGCTCTGATTATTGCTACAAGAAAAGTTCTCCATTTTTATCGGAACTCGGGGCAAAGATTAACGGTATGTTGTATGGTGTGCCGAGACAAAATAGCATATATGATGAATGTTTTGGATTCAAATCTAACAAACAGAATGGGAACGTTAATGTTGCTGCTGCTGCTGCAGTGGAGAAGGAAATAAAATTCCCTGAAATAAAGAAGAAAGAGCACGATCTCCATGGGGATGTTACATCAATTCAACAAGATTGTACAGAGACGCATCCAATCCATGTTATTGAAGATGGTCAAAGTTTGGATGCTGTTAATGTTGAAGCTGAAATAGAATCTCTTGGGAAATATCGAGAATCGTTATTACGAATGCTGAAGTGGGTGAGAAAGACTGCGAAGCATCCTGAAGATCCATTAAATGGTACAATACCGGGGACATCTAGGTGGAAAGGTTACTCGAGCGACGATGCATTATGGCTTCAAGTAATCAGGGCAAAGGATGCTCTTCTAATTAGGAAGGGTGTTGACAAAATTGCTGAGAAACGTCTGTTAATACAGAAGAAAGTAAAGATGCATCCATCCATTTATGAGGATAATATTGATAACCATCACCTCTCTACAGAAAGGATAAGTTGCAGCAAAAGATCTAAAGCTTTGACTGAGTCCGTACTCGCCCCGTGTTCTAATTCTTGTCCAACCGTTCGAAGTAATTGTATTAGTAGTCTAACAACAGAAGTTGGAAAGGGACTCAAGAATCAAGCAGTTTTGAATGGTGATATACCATCTGAAATGGAAGACGATCATCCGAATGAAGATTCAGCTGAGGAGACGGTTCCCGTGGGTGCTGTATGTCAAGCAGATTTACCTGAATGGACTGGTAATAATTCTGATAGTGACTCTAAATGGCTAGGGACACGGTCGTGGCCTCTTCAACACAGAAATAGTAATTCCGTACGTGATAGACGCGCCATTGGCAGAGGGAGACCGGATTCATGTGGCTGCCAATTTCCAGGTTCGGTTGAATGTTTTAGATTTCACATAGCTGAAGCAAGGATGAGATTAAAGCTCGAGCTTGGCTCGACATTCTTTGCATGGAGATTTCATCAAATGGGGGAGGAAATATCTCTGCAATGGACAG
TTGAAGAGGAAAAGAGATTCAAGGAGTTGGCTATGTCAGGTTTCAACAATCATAATCGGTGCTTTTGGGACTATTCCTTGAGATGGTTTCCTATGAAATCAAGGAAAAATCTGATAAGCTATTACTTCAATGTGTTTCTTTTACGGCTGAGAAGCTATCAGAATCGCGTAACTCCGAATAGCATCGATAGCGACGATGAAGATTTTGAGTTCGGTCGTGTTAGTGGTGGATTTGGGGACAAGGCAATGGAAATTTTAGGCTCAAATTCTTTAGAATGTTCTATAAACAGACAGGTCACAGATGTGGAGTAGTCTAAAACTGGAGCAGAATCCGAAGTATCTAAAAAAGAGGGAAAGGAACGCAATTTTGACGAGAACTCGAGTTACGTTTTAGCTATACAACACCGATATCGGTACTTTATGGTGCTAGGTATTATATTTTTTGGGGAGGATCTGTATGGTATCGGCTGGTGAGAAATATGAGGATCAAAGATGGCCATTGCTACATTTGTTTGTTCTGAATTTGTGTTGTGTGGAACTCAATCTGATAGTTCTTGGTTTTGTATTTGGAGTTTGGTTTTAATCATTCCAAGGGGCTCTGATAAGAAAGCTTGAGAATAGCTCACCAGACACGTTTATATCGGTTGATTCGTGGAAAATTTTGTCATGTTTTTAAATTATTTGTCGAGGTAACATGTTTTTAAATTATTTGTCGAGGTAACATGTTTGTAAAT
Coding sequence (CDS)
ATGGGGAGATGGCACGTTTCATCTAATGCTTCCATTTTAGATTGCAATAAAGATGTAGATCCTAATCCTAGTAATGGCTGTTGCATTGCTTCGGATTGTTTGGTAGAGAGGAGTTATGAAAATGTTGATTATGATGATTGCAAGGCGAGAATTAGATGCTATTTTGAGAAAATTCTTTGGGTTTTTTTAAAGGAAATTGGTCGTAGAGGATTTGTTAGGCCACTGCCTGCGTTAATAGGTGAAGGGGGAGCTTTGGATTTGTTTGAACTGTTCTTGGTAGTAAGAGATAAAGGAGGTTCTCAAGTGGTTTCAGAGAAGAAACTATGGTCTTCAGTGGTTGTCGAATTAGGTTTGGATCTTGGGCTTTCGGCGTCGGTGAAATTGATTTATTCCAAGTACTTAAGTGATCTAGAGAAATGGCTTATGGTGAGATGTGGAGACACAAAACTGGAAAATGGGAGCTCTGATTATTGCTACAAGAAAAGTTCTCCATTTTTATCGGAACTCGGGGCAAAGATTAACGGTATGTTGTATGGTGTGCCGAGACAAAATAGCATATATGATGAATGTTTTGGATTCAAATCTAACAAACAGAATGGGAACGTTAATGTTGCTGCTGCTGCTGCAGTGGAGAAGGAAATAAAATTCCCTGAAATAAAGAAGAAAGAGCACGATCTCCATGGGGATGTTACATCAATTCAACAAGATTGTACAGAGACGCATCCAATCCATGTTATTGAAGATGGTCAAAGTTTGGATGCTGTTAATGTTGAAGCTGAAATAGAATCTCTTGGGAAATATCGAGAATCGTTATTACGAATGCTGAAGTGGGTGAGAAAGACTGCGAAGCATCCTGAAGATCCATTAAATGGTACAATACCGGGGACATCTAGGTGGAAAGGTTACTCGAGCGACGATGCATTATGGCTTCAAGTAATCAGGGCAAAGGATGCTCTTCTAATTAGGAAGGGTGTTGACAAAATTGCTGAGAAACGTCTGTTAATACAGAAGAAAGTAAAGATGCATCCATCCATTTATGAGGATAATATTGATAACCATCACCTCTCTACAGAAAGGATAAGTTGCAGCAAAAGATCTAAAGCTTTGACTGAGTCCGTACTCGCCCCGTGTTCTAATTCTTGTCCAACCGTTCGAAGTAATTGTATTAGTAGTCTAACAACAGAAGTTGGAAAGGGACTCAAGAATCAAGCAGTTTTGAATGGTGATATACCATCTGAAATGGAAGACGATCATCCGAATGAAGATTCAGCTGAGGAGACGGTTCCCGTGGGTGCTGTATGTCAAGCAGATTTACCTGAATGGACTGGTAATAATTCTGATAGTGACTCTAAATGGCTAGGGACACGGTCGTGGCCTCTTCAACACAGAAATAGTAATTCCGTACGTGATAGACGCGCCATTGGCAGAGGGAGACCGGATTCATGTGGCTGCCAATTTCCAGGTTCGGTTGAATGTTTTAGATTTCACATAGCTGAAGCAAGGATGAGATTAAAGCTCGAGCTTGGCTCGACATTCTTTGCATGGAGATTTCATCAAATGGGGGAGGAAATATCTCTGCAATGGACAGTTGAAGAGGAAAAGAGATTCAAGGAGTTGGCTATGTCAGGTTTCAACAATCATAATCGGTGCTTTTGGGACTATTCCTTGAGATGGTTTCCTATGAAATCAAGGAAAAATCTGATAAGCTATTACTTCAATGTGTTTCTTTTACGGCTGAGAAGCTATCAGAATCGCGTAACTCCGAATAGCATCGATAGCGACGATGAAGATTTTGAGTTCGGTCGTGTTAGTGGTGGATTTGGGGACAAGGCAATGGAAATTTTAGGCTCAAATTCTTTAGAATGTTCTATAAACAGACAGGTCACAGATGTGGAGTAG
Protein sequence
MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILWVFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDLGLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGVPRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTETHPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDDHPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFEFGRVSGGFGDKAMEILGSNSLECSINRQVTDVE
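The protein sequence above should be the frame-0 translation of the coding sequence under the standard genetic code. The following is a minimal sketch of that check, assuming NCBI translation table 1; the function name `translate` and the two short test strings (the first 30 and last 12 nucleotides of the CDS) are chosen here for illustration and are not part of the original page.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing 1-2 bases that do not form a full codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGGGAGATGGCACGTTTCATCTAATGCT"  # first 30 nt of the CDS
cds_end = "GATGTGGAGTAG"                      # last 12 nt of the CDS, including the stop codon
print(translate(cds_start))  # MGRWHVSSNA
print(translate(cds_end))    # DVE
```

Both outputs agree with the start (MGRWHVSSNA) and end (DVE) of the protein sequence listed above. Passing the full CDS string to the same function would be the way to check the complete 632-residue translation.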
Homology
BLAST of CmoCh16G006560 vs. ExPASy Swiss-Prot
Match:
Q9LDD4 (AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=ARID2 PE=1 SV=1)
HSP 1 Score: 335.9 bits (860), Expect = 1.0e-90
Identity = 232/609 (38.10%), Positives = 319/609 (52.38%), Query Frame = 0
Query: 38 SYENVD---YDDCKARIRCYFEKILWVFLKEIGRRGFVRPLPALIGEGGALDLFELFLVV 97
SY +V+ D+C+ R+R F++ L VFL+E G ++PLPA+IG+G +DLF+LF++V
Sbjct: 10 SYVDVEIKYVDECEERLRRLFDQALLVFLEE---EGSIKPLPAVIGDGKNVDLFKLFVLV 69
Query: 98 RDKGGSQVVSEKKLWSSVVVELGLDLGLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGS 157
R++ G VS K+LW V +LG D L S+ LIY KYL+ +EKW + +N
Sbjct: 70 REREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAVEESRIVNWDNKD 129
Query: 158 SDY--CYKKSSPFLSELGAKINGMLYGVPRQNSIYDECFGFKSNKQNGNVNVAAAAAVEK 217
S+ CY S L ELG NG S+ D K K+N V E
Sbjct: 130 SEKKGCY---SGMLHELG---NGF-------KSLLD---NGKCQKRNRAVAFGCNHMEES 189
Query: 218 EIKFPEIKKKEHDLHGDVTSIQQDCTETHPIHVIEDGQSLDAVNVEAEIESLGKYRESLL 277
+F +K+ + D + VI + + AV SL K R+ L
Sbjct: 190 CSEFDRSRKRFRESDDDDKGVGLSSV------VIREETVVCAVEEGLSDFSLEK-RDDLP 249
Query: 278 RMLKWVRKTAKHPEDPLNGTIPGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKR 337
MLKW+ A P DP G IP +S+WK Y+ + WLQV RAK++LL+++ ++ +
Sbjct: 250 GMLKWLALVATSPHDPAIGVIPHSSKWKQYNGNKC-WLQVARAKNSLLVQRDNAELRYRY 309
Query: 338 LLIQKKVKM-HPSIYEDNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISS 397
+ + HPS+YED+ S R+ S R L++ CS+SC C S
Sbjct: 310 HPFRGHQNIHHPSMYEDD----RKSIGRLRYSIRPPNLSKH----CSSSC------CNGS 369
Query: 398 LTTEVGKGLKNQAVLNGDIPSEMEDDHPNEDSAEE---------TVPVGAVCQADLPEWT 457
+ K + I SE A + + VG QA + EWT
Sbjct: 370 SLVSLSKSRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPRRCIKVGHQHQAQVDEWT 429
Query: 458 GNNSDSDSKWLGTRSWPLQHRNS-NSVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEAR 517
+ DSDSKWLGTR WP ++ + + +G+GRPDSC C+ G VEC R HIAE R
Sbjct: 430 ESGVDSDSKWLGTRIWPPENSEALDQTLGNDLVGKGRPDSCSCELSGFVECTRLHIAEKR 489
Query: 518 MRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPM 577
M LK ELG FF WRF+QMGEE+ L+WT EEEKRFK++ ++ + FW + + FP
Sbjct: 490 MELKRELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIA----DPQSFWTNAAKNFPK 549
Query: 578 KSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFEFGRVSGGFGDKAMEILGSNSLE 631
K R+ L+SYYFNVFL+ R YQNRVTP SIDSDDE FG V G FG A+ GS+ +
Sbjct: 550 KKREELVSYYFNVFLINRRRYQNRVTPKSIDSDDEG-AFGSVGGSFGRDAVTSSGSDVMI 572
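The percentages in each HSP header are the match counts divided by the alignment length. As a rough sketch, the counts can be pulled out of a header line and the percentages recomputed; the regular expression and the function name `hsp_percentages` are assumptions made for this example, and the pattern accepts both the spelling "Positives" and the "Postives" variant that some report exports produce.

```python
import re

# Matches the count pairs in a header such as
# "Identity = 232/609 (38.10%), Positives = 319/609 (52.38%), Query Frame = 0".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(header: str) -> tuple:
    """Return (percent identity, percent positives), each rounded to 2 decimals."""
    match = HSP_STATS.search(header)
    if match is None:
        raise ValueError("no Identity/Positives counts found")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


line = "Identity = 232/609 (38.10%), Positives = 319/609 (52.38%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Q9LDD4 hit this reproduces the reported 38.10% identity and 52.38% positives over an alignment length of 609 columns.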
BLAST of CmoCh16G006560 vs. ExPASy Swiss-Prot
Match:
Q84JT7 (AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=ARID1 PE=2 SV=1)
HSP 1 Score: 253.4 bits (646), Expect = 6.5e-66
Identity = 194/551 (35.21%), Positives = 257/551 (46.64%), Query Frame = 0
Query: 55 FEKILWVFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVV 114
F +L FL E PLPA+ GEG +DLF LFL V KGG VSE W VV
Sbjct: 49 FRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQ 108
Query: 115 ELGLDLGLSASVKLIYSKYLSDLEKWL-MVRCGDTKLE----NGSSDYCYKKSSPFLSEL 174
E GL+ SAS KLIY KYL +WL V GDT + +G SD + + FLSE+
Sbjct: 109 ESGLESYDSASAKLIYVKYLDAFGRWLNRVVAGDTDVSSVELSGISDALVARLNGFLSEV 168
Query: 175 GAKINGMLYGVPRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIK-FPEIKKKEHDLHG 234
K ++ + F K+ ++ +V +A +K F K E L
Sbjct: 169 KKKYELRKGRPAKELGAELKWFISKTKRRYDKHHVGKESASNDAVKEFQGSKLAERRL-- 228
Query: 235 DVTSIQQDCTETHPIHVIEDGQSLDAVNVEAEIESLGK-YRESLLRMLKWVRKTAKHPED 294
I ++E +V E S GK RE L LKW+ AK P D
Sbjct: 229 ------------EQIMILE--------SVTQECSSPGKRKRECPLETLKWLSDVAKDPCD 288
Query: 295 PLNGTIPGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYE 354
P G +P S W Y S++ W Q++ + + R D EK QK KMHP +Y+
Sbjct: 289 PSLGIVPDRSEWVSYGSEEP-WKQLLLFRAS---RTNNDSACEKTW--QKVQKMHPCLYD 348
Query: 355 DNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLN 414
D+ + ER+S + T G G
Sbjct: 349 DSAGASYNLRERLSYEDYKRGKT--------------------------GNG-------- 408
Query: 415 GDIPSEMEDDHPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPL--QHRNS 474
DI S E+D P VG+ QA +PEWTG +SDSKWLGTR WPL + +
Sbjct: 409 SDIGSSDEEDRP-------CALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKA 468
Query: 475 NSVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEIS 534
N + +R IG+GR D CGC PGS+EC +FHI R +LKLELG F+ W F MGE
Sbjct: 469 NLLIERDRIGKGRQDPCGCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTL 528
Query: 535 LQWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNR 594
WT E K+ K L MS + + F + P KSR ++SY++NV LL+ R+ Q+R
Sbjct: 529 QYWTDLELKKIKSL-MSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSR 529
Query: 595 VTPNSIDSDDE 597
+TP+ IDSD +
Sbjct: 589 ITPHDIDSDTD 529
BLAST of CmoCh16G006560 vs. ExPASy TrEMBL
Match:
A0A6J1ETI2 (AT-rich interactive domain-containing protein 2-like isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111437591 PE=4 SV=1)
HSP 1 Score: 1298.1 bits (3358), Expect = 0.0e+00
Identity = 632/632 (100.00%), Positives = 632/632 (100.00%), Query Frame = 0
Query: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60
MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW
Sbjct: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60
Query: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL
Sbjct: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
Query: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
Query: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET 240
PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET 240
Query: 241 HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK 300
HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK
Sbjct: 241 HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK 300
Query: 301 GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNHHLSTERI 360
GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNHHLSTERI
Sbjct: 301 GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNHHLSTERI 360
Query: 361 SCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDDHPN 420
SCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDDHPN
Sbjct: 361 SCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDDHPN 420
Query: 421 EDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRGRPD 480
EDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRGRPD
Sbjct: 421 EDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRGRPD 480
Query: 481 SCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFKELA 540
SCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFKELA
Sbjct: 481 SCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFKELA 540
Query: 541 MSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFEF 600
MSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFEF
Sbjct: 541 MSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFEF 600
Query: 601 GRVSGGFGDKAMEILGSNSLECSINRQVTDVE 633
GRVSGGFGDKAMEILGSNSLECSINRQVTDVE
Sbjct: 601 GRVSGGFGDKAMEILGSNSLECSINRQVTDVE 632
BLAST of CmoCh16G006560 vs. ExPASy TrEMBL
Match:
A0A6J1EZB1 (AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111437591 PE=4 SV=1)
HSP 1 Score: 1269.6 bits (3284), Expect = 0.0e+00
Identity = 632/695 (90.94%), Positives = 632/695 (90.94%), Query Frame = 0
Query: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60
MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW
Sbjct: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60
Query: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL
Sbjct: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
Query: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
Query: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET 240
PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET 240
Query: 241 HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK 300
HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK
Sbjct: 241 HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK 300
Query: 301 GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQ------------------------ 360
GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQ
Sbjct: 301 GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQVLRSLAFFFRIESSDHNQRPFLSY 360
Query: 361 ---------------------------------------KKVKMHPSIYEDNIDNHHLST 420
KKVKMHPSIYEDNIDNHHLST
Sbjct: 361 IERESYLLLLEPWVSIIISKQHTTTVSLSCVLCDNMMDTKKVKMHPSIYEDNIDNHHLST 420
Query: 421 ERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDD 480
ERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDD
Sbjct: 421 ERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDD 480
Query: 481 HPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRG 540
HPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRG
Sbjct: 481 HPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRG 540
Query: 541 RPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFK 600
RPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFK
Sbjct: 541 RPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFK 600
Query: 601 ELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDED 633
ELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDED
Sbjct: 601 ELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDED 660
BLAST of CmoCh16G006560 vs. ExPASy TrEMBL
Match:
A0A6J1J644 (AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482924 PE=4 SV=1)
HSP 1 Score: 1242.6 bits (3214), Expect = 0.0e+00
Identity = 611/633 (96.52%), Positives = 614/633 (97.00%), Query Frame = 0
Query: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60
MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVE +Y NVDYDDCKARIRCYFEKILW
Sbjct: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVEGTYANVDYDDCKARIRCYFEKILW 60
Query: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL
Sbjct: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
Query: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
Query: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET 240
PRQNSIYDECFGFKSNKQNGNVNV AAAAVEKEIKF EIKKKEHDLHGDVT IQQDCTET
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNV-AAAAVEKEIKFSEIKKKEHDLHGDVTPIQQDCTET 240
Query: 241 HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK 300
HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTI G SRWK
Sbjct: 241 HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTILGASRWK 300
Query: 301 GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNHHLSTERI 360
GYSSDDALWLQVI AKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNH LSTERI
Sbjct: 301 GYSSDDALWLQVISAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNHRLSTERI 360
Query: 361 SCSKRSKALTESVLAPCSNSCPTVRSNCI-SSLTTEVGKGLKNQAVLNGDIPSEMEDDHP 420
SCSKR KA TESV A CSNSCPTVRSNCI SSLTTEVGKGLKNQAVLNGDIPSEMEDDHP
Sbjct: 361 SCSKRFKASTESVFATCSNSCPTVRSNCISSSLTTEVGKGLKNQAVLNGDIPSEMEDDHP 420
Query: 421 NEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRGRP 480
NEDSAEETVPVGA+CQADLPEWTGNNSDSDSKWLGTR WPLQHRNSNSVRDRRAIGRGRP
Sbjct: 421 NEDSAEETVPVGALCQADLPEWTGNNSDSDSKWLGTRLWPLQHRNSNSVRDRRAIGRGRP 480
Query: 481 DSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFKEL 540
DSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWT EEEKRFKEL
Sbjct: 481 DSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTAEEEKRFKEL 540
Query: 541 AMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFE 600
AMS FNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFE
Sbjct: 541 AMSSFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFE 600
Query: 601 FGRVSGGFGDKAMEILGSNSLECSINRQVTDVE 633
FG VSGGFGDKAME+LGS SLECSINRQVTDVE
Sbjct: 601 FGCVSGGFGDKAMEVLGSKSLECSINRQVTDVE 632
BLAST of CmoCh16G006560 vs. ExPASy TrEMBL
Match:
A0A6J1ETJ6 (AT-rich interactive domain-containing protein 2-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111437591 PE=4 SV=1)
HSP 1 Score: 1240.7 bits (3209), Expect = 0.0e+00
Identity = 621/695 (89.35%), Positives = 621/695 (89.35%), Query Frame = 0
Query: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60
MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW
Sbjct: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60
Query: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL
Sbjct: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
Query: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
Query: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET 240
PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET 240
Query: 241 HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK 300
HPIHVI EAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK
Sbjct: 241 HPIHVI-----------EAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK 300
Query: 301 GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQ------------------------ 360
GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQ
Sbjct: 301 GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQVLRSLAFFFRIESSDHNQRPFLSY 360
Query: 361 ---------------------------------------KKVKMHPSIYEDNIDNHHLST 420
KKVKMHPSIYEDNIDNHHLST
Sbjct: 361 IERESYLLLLEPWVSIIISKQHTTTVSLSCVLCDNMMDTKKVKMHPSIYEDNIDNHHLST 420
Query: 421 ERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDD 480
ERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDD
Sbjct: 421 ERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLNGDIPSEMEDD 480
Query: 481 HPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRG 540
HPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRG
Sbjct: 481 HPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRG 540
Query: 541 RPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFK 600
RPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFK
Sbjct: 541 RPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFK 600
Query: 601 ELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDED 633
ELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDED
Sbjct: 601 ELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDED 660
BLAST of CmoCh16G006560 vs. ExPASy TrEMBL
Match:
A0A6J1J301 (AT-rich interactive domain-containing protein 2-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482924 PE=4 SV=1)
HSP 1 Score: 1213.7 bits (3139), Expect = 0.0e+00
Identity = 600/633 (94.79%), Positives = 603/633 (95.26%), Query Frame = 0
Query: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60
MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVE +Y NVDYDDCKARIRCYFEKILW
Sbjct: 1 MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVEGTYANVDYDDCKARIRCYFEKILW 60
Query: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL
Sbjct: 61 VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120
Query: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180
Query: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCTET 240
PRQNSIYDECFGFKSNKQNGNVNV AAAAVEKEIKF EIKKKEHDLHGDVT IQQDCTET
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNV-AAAAVEKEIKFSEIKKKEHDLHGDVTPIQQDCTET 240
Query: 241 HPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIPGTSRWK 300
HPIHVI EAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTI G SRWK
Sbjct: 241 HPIHVI-----------EAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTILGASRWK 300
Query: 301 GYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNHHLSTERI 360
GYSSDDALWLQVI AKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNH LSTERI
Sbjct: 301 GYSSDDALWLQVISAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYEDNIDNHRLSTERI 360
Query: 361 SCSKRSKALTESVLAPCSNSCPTVRSNCI-SSLTTEVGKGLKNQAVLNGDIPSEMEDDHP 420
SCSKR KA TESV A CSNSCPTVRSNCI SSLTTEVGKGLKNQAVLNGDIPSEMEDDHP
Sbjct: 361 SCSKRFKASTESVFATCSNSCPTVRSNCISSSLTTEVGKGLKNQAVLNGDIPSEMEDDHP 420
Query: 421 NEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSNSVRDRRAIGRGRP 480
NEDSAEETVPVGA+CQADLPEWTGNNSDSDSKWLGTR WPLQHRNSNSVRDRRAIGRGRP
Sbjct: 421 NEDSAEETVPVGALCQADLPEWTGNNSDSDSKWLGTRLWPLQHRNSNSVRDRRAIGRGRP 480
Query: 481 DSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFKEL 540
DSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWT EEEKRFKEL
Sbjct: 481 DSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISLQWTAEEEKRFKEL 540
Query: 541 AMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFE 600
AMS FNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFE
Sbjct: 541 AMSSFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFE 600
Query: 601 FGRVSGGFGDKAMEILGSNSLECSINRQVTDVE 633
FG VSGGFGDKAME+LGS SLECSINRQVTDVE
Sbjct: 601 FGCVSGGFGDKAMEVLGSKSLECSINRQVTDVE 621
BLAST of CmoCh16G006560 vs. TAIR 10
Match:
AT4G11400.1 (ARID/BRIGHT DNA-binding domain;ELM2 domain protein )
HSP 1 Score: 335.9 bits (860), Expect = 7.1e-92
Identity = 232/609 (38.10%), Positives = 319/609 (52.38%), Query Frame = 0
Query: 38 SYENVD---YDDCKARIRCYFEKILWVFLKEIGRRGFVRPLPALIGEGGALDLFELFLVV 97
SY +V+ D+C+ R+R F++ L VFL+E G ++PLPA+IG+G +DLF+LF++V
Sbjct: 10 SYVDVEIKYVDECEERLRRLFDQALLVFLEE---EGSIKPLPAVIGDGKNVDLFKLFVLV 69
Query: 98 RDKGGSQVVSEKKLWSSVVVELGLDLGLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGS 157
R++ G VS K+LW V +LG D L S+ LIY KYL+ +EKW + +N
Sbjct: 70 REREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAVEESRIVNWDNKD 129
Query: 158 SDY--CYKKSSPFLSELGAKINGMLYGVPRQNSIYDECFGFKSNKQNGNVNVAAAAAVEK 217
S+ CY S L ELG NG S+ D K K+N V E
Sbjct: 130 SEKKGCY---SGMLHELG---NGF-------KSLLD---NGKCQKRNRAVAFGCNHMEES 189
Query: 218 EIKFPEIKKKEHDLHGDVTSIQQDCTETHPIHVIEDGQSLDAVNVEAEIESLGKYRESLL 277
+F +K+ + D + VI + + AV SL K R+ L
Sbjct: 190 CSEFDRSRKRFRESDDDDKGVGLSSV------VIREETVVCAVEEGLSDFSLEK-RDDLP 249
Query: 278 RMLKWVRKTAKHPEDPLNGTIPGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKR 337
MLKW+ A P DP G IP +S+WK Y+ + WLQV RAK++LL+++ ++ +
Sbjct: 250 GMLKWLALVATSPHDPAIGVIPHSSKWKQYNGNKC-WLQVARAKNSLLVQRDNAELRYRY 309
Query: 338 LLIQKKVKM-HPSIYEDNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISS 397
+ + HPS+YED+ S R+ S R L++ CS+SC C S
Sbjct: 310 HPFRGHQNIHHPSMYEDD----RKSIGRLRYSIRPPNLSKH----CSSSC------CNGS 369
Query: 398 LTTEVGKGLKNQAVLNGDIPSEMEDDHPNEDSAEE---------TVPVGAVCQADLPEWT 457
+ K + I SE A + + VG QA + EWT
Sbjct: 370 SLVSLSKSRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPRRCIKVGHQHQAQVDEWT 429
Query: 458 GNNSDSDSKWLGTRSWPLQHRNS-NSVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEAR 517
+ DSDSKWLGTR WP ++ + + +G+GRPDSC C+ G VEC R HIAE R
Sbjct: 430 ESGVDSDSKWLGTRIWPPENSEALDQTLGNDLVGKGRPDSCSCELSGFVECTRLHIAEKR 489
Query: 518 MRLKLELGSTFFAWRFHQMGEEISLQWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPM 577
M LK ELG FF WRF+QMGEE+ L+WT EEEKRFK++ ++ + FW + + FP
Sbjct: 490 MELKRELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIA----DPQSFWTNAAKNFPK 549
Query: 578 KSRKNLISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFEFGRVSGGFGDKAMEILGSNSLE 631
K R+ L+SYYFNVFL+ R YQNRVTP SIDSDDE FG V G FG A+ GS+ +
Sbjct: 550 KKREELVSYYFNVFLINRRRYQNRVTPKSIDSDDEG-AFGSVGGSFGRDAVTSSGSDVMI 572
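The alignment above is identical to the Swiss-Prot hit Q9LDD4 and has the same bit score (335.9), yet its E-value differs (7.1e-92 here against 1.0e-90 for Swiss-Prot). Under the usual Karlin-Altschul relation, E = m·n·2^(-S) for bit score S and search space m·n, an E-value at a fixed bit score scales with the size of the database searched. The sketch below assumes that relation and uses only the E-values reported on this page; the function name `search_space_ratio` is illustrative, and the result is an approximation because BLAST works with effective rather than raw lengths.

```python
def search_space_ratio(e_value_a: float, e_value_b: float) -> float:
    """Ratio of the search spaces implied by two E-values that share one bit score."""
    return e_value_a / e_value_b


# (Swiss-Prot E-value, TAIR 10 E-value) for the two alignments reported in both searches.
pairs = {
    "335.9 bits": (1.0e-90, 7.1e-92),
    "253.4 bits": (6.5e-66, 4.6e-67),
}
for score, (swissprot, tair) in pairs.items():
    print(score, round(search_space_ratio(swissprot, tair), 1))
```

Both pairs give a ratio of about 14, which is consistent with the Swiss-Prot search space being roughly 14 times larger than the TAIR 10 one in these searches.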
BLAST of CmoCh16G006560 vs. TAIR 10
Match:
AT2G46040.1 (ARID/BRIGHT DNA-binding domain;ELM2 domain protein )
HSP 1 Score: 253.4 bits (646), Expect = 4.6e-67
Identity = 194/551 (35.21%), Positives = 257/551 (46.64%), Query Frame = 0
Query: 55 FEKILWVFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVV 114
F +L FL E PLPA+ GEG +DLF LFL V KGG VSE W VV
Sbjct: 49 FRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQ 108
Query: 115 ELGLDLGLSASVKLIYSKYLSDLEKWL-MVRCGDTKLE----NGSSDYCYKKSSPFLSEL 174
E GL+ SAS KLIY KYL +WL V GDT + +G SD + + FLSE+
Sbjct: 109 ESGLESYDSASAKLIYVKYLDAFGRWLNRVVAGDTDVSSVELSGISDALVARLNGFLSEV 168
Query: 175 GAKINGMLYGVPRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIK-FPEIKKKEHDLHG 234
K ++ + F K+ ++ +V +A +K F K E L
Sbjct: 169 KKKYELRKGRPAKELGAELKWFISKTKRRYDKHHVGKESASNDAVKEFQGSKLAERRL-- 228
Query: 235 DVTSIQQDCTETHPIHVIEDGQSLDAVNVEAEIESLGK-YRESLLRMLKWVRKTAKHPED 294
I ++E +V E S GK RE L LKW+ AK P D
Sbjct: 229 ------------EQIMILE--------SVTQECSSPGKRKRECPLETLKWLSDVAKDPCD 288
Query: 295 PLNGTIPGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQKKVKMHPSIYE 354
P G +P S W Y S++ W Q++ + + R D EK QK KMHP +Y+
Sbjct: 289 PSLGIVPDRSEWVSYGSEEP-WKQLLLFRAS---RTNNDSACEKTW--QKVQKMHPCLYD 348
Query: 355 DNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGLKNQAVLN 414
D+ + ER+S + T G G
Sbjct: 349 DSAGASYNLRERLSYEDYKRGKT--------------------------GNG-------- 408
Query: 415 GDIPSEMEDDHPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPL--QHRNS 474
DI S E+D P VG+ QA +PEWTG +SDSKWLGTR WPL + +
Sbjct: 409 SDIGSSDEEDRP-------CALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKA 468
Query: 475 NSVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEIS 534
N + +R IG+GR D CGC PGS+EC +FHI R +LKLELG F+ W F MGE
Sbjct: 469 NLLIERDRIGKGRQDPCGCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTL 528
Query: 535 LQWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNR 594
WT E K+ K L MS + + F + P KSR ++SY++NV LL+ R+ Q+R
Sbjct: 529 QYWTDLELKKIKSL-MSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSR 529
Query: 595 VTPNSIDSDDE 597
+TP+ IDSD +
Sbjct: 589 ITPHDIDSDTD 529
BLAST of CmoCh16G006560 vs. TAIR 10
Match:
AT5G04110.1 (DNA GYRASE B3 )
HSP 1 Score: 128.6 bits (322), Expect = 1.7e-29
Identity = 72/206 (34.95%), Positives = 112/206 (54.37%), Query Frame = 0
Query: 407 NGDIPSEMEDD--HPNEDSAEETVPVGAVCQADLPEWT---------GNNSDSDS-KWLG 466
N D+ ++ D + +P+G QA++P W G+ DS++ +WLG
Sbjct: 338 NKDVSNKTSKDVITHGSNKTRPAIPIGPRFQAEIPVWIAPTKKGKFYGSPGDSNTLRWLG 397
Query: 467 TRSWPLQHRNSNSVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFA 526
T WP + +V ++ +G GR DSC C P S C + H EA+ L+ E+ F
Sbjct: 398 TGVWP-TYSLKKTVHSKK-VGEGRSDSCSCASPRSTNCIKRHKKEAQELLEKEINRAFST 457
Query: 527 WRFHQMGEEISLQ-WTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFN 586
W F QMGEEI L+ WT +EE+RF+ L + + FW+++ FP KS+K+L+SYY+N
Sbjct: 458 WEFDQMGEEIVLKSWTAKEERRFEALVKKNPLSSSDGFWEFASNAFPQKSKKDLLSYYYN 517
Query: 587 VFLLRLRSYQNRVTPNSIDSDDEDFE 600
VFL++ N+IDSDD+ ++
Sbjct: 518 VFLIKRMRLLKSSAANNIDSDDDHYD 541
BLAST of CmoCh16G006560 vs. TAIR 10
Match:
AT2G03470.1 (ELM2 domain-containing protein )
HSP 1 Score: 97.4 bits (241), Expect = 4.3e-20
Identity = 81/273 (29.67%), Postives = 120/273 (43.96%), Query Frame = 0
Query: 346 YEDNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVG-------K 405
+++ I HH S E K++ L E ++ C N T +N + G
Sbjct: 31 FDEAIPYHHASME-----KKTNVLVEDLIGLCENPTWTNDANHVDKGFETTGLCQEDSQS 90
Query: 406 GLKNQAVLNGDIPSEMEDDHPNED--------SAEETVPVGAVCQADLPEWTGNN--SDS 465
G+ Q+ L+ P ED + V VG+ QAD+PE+ S
Sbjct: 91 GVTTQSDLSHQSSGSDFTWKPVEDVYTCLMNQPPRKQVLVGSNHQADIPEFVKEEILDQS 150
Query: 466 DSKWLGTRSWPLQHRNSNSVRDRRAIGRGR-PDSCGCQFPGSVECFRFHIAEARMRLKLE 525
+++ L + + D G G+ C C GS+ C R HI EAR L
Sbjct: 151 EARTKEDLEGKLMRKCVIPMSDSDLCGTGQGRKECLCLDKGSIRCVRRHIIEARESLVET 210
Query: 526 LG-STFFAWRFHQMGEEISLQWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKN 585
+G F +MGEE++ WT EEE F ++ S + R FW FP ++ K
Sbjct: 211 IGYERFMELGLCEMGEEVASLWTEEEEDLFHKVVYSNPFSAGRDFWKQLKGTFPSRTMKE 270
Query: 586 LISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFE 600
L+SYYFNVF+LR R QNR +DSDD++++
Sbjct: 271 LVSYYFNVFILRRRGIQNRFKALDVDSDDDEWQ 298
BLAST of CmoCh16G006560 vs. TAIR 10
Match:
AT2G03470.2 (ELM2 domain-containing protein )
HSP 1 Score: 93.2 bits (230), Expect = 8.0e-19
Identity = 81/273 (29.67%), Positives = 120/273 (43.96%), Query Frame = 0
Query: 346 YEDNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVG-------K 405
+++ I HH S E K++ L E ++ C N T +N + G
Sbjct: 31 FDEAIPYHHASME-----KKTNVL-EDLIGLCENPTWTNDANHVDKGFETTGLCQEDSQS 90
Query: 406 GLKNQAVLNGDIPSEMEDDHPNED--------SAEETVPVGAVCQADLPEWTGNN--SDS 465
G+ Q+ L+ P ED + V VG+ QAD+PE+ S
Sbjct: 91 GVTTQSDLSHQSSGSDFTWKPVEDVYTCLMNQPPRKQVLVGSNHQADIPEFVKEEILDQS 150
Query: 466 DSKWLGTRSWPLQHRNSNSVRDRRAIGRGR-PDSCGCQFPGSVECFRFHIAEARMRLKLE 525
+++ L + + D G G+ C C GS+ C R HI EAR L
Sbjct: 151 EARTKEDLEGKLMRKCVIPMSDSDLCGTGQGRKECLCLDKGSIRCVRRHIIEARESLVET 210
Query: 526 LG-STFFAWRFHQMGEEISLQWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKN 585
+G F +MGEE++ WT EEE F ++ S + R FW FP ++ K
Sbjct: 211 IGYERFMELGLCEMGEEVASLWTEEEEDLFHKVVYSNPFSAGRDFWKQLKGTFPSRTMKE 270
Query: 586 LISYYFNVFLLRLRSYQNRVTPNSIDSDDEDFE 600
L+SYYFNVF+LR R QNR +DSDD++++
Sbjct: 271 LVSYYFNVFILRRRGIQNRFKALDVDSDDDEWQ 297
The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9LDD4 | 1.0e-90 | 38.10 | AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 ...
Q84JT7 | 6.5e-66 | 35.21 | AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 ...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1ETI2 | 0.0e+00 | 100.00 | AT-rich interactive domain-containing protein 2-like isoform X3 OS=Cucurbita mos...
A0A6J1EZB1 | 0.0e+00 | 90.94 | AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita mos...
A0A6J1J644 | 0.0e+00 | 96.52 | AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita max...
A0A6J1ETJ6 | 0.0e+00 | 89.35 | AT-rich interactive domain-containing protein 2-like isoform X2 OS=Cucurbita mos...
A0A6J1J301 | 0.0e+00 | 94.79 | AT-rich interactive domain-containing protein 2-like isoform X2 OS=Cucurbita max...