CmaCh01G015860.1 (mRNA) Cucurbita maxima (Rimu)

Name: CmaCh01G015860.1
Type: mRNA
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Myb/SANT-like DNA-binding domain protein
Location: Cma_Chr01 : 10943665 .. 10946054 (-)
Sequence length: 894
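As a quick consistency check on the coordinates above, the following sketch parses the location string and compares the genomic span with the reported sequence length. It assumes the coordinates are 1-based and inclusive (common for GFF-style annotation, but not stated on this page) and that the reported length of 894 refers to the spliced transcript. `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'Cma_Chr01 : 10943665 .. 10946054 (-)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cma_Chr01 : 10943665 .. 10946054 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
genomic_span = end - start + 1

# Assumption: 894 is the length of the spliced transcript.
transcript_length = 894
non_exonic = genomic_span - transcript_length

print(chrom, strand, genomic_span, non_exonic)
```

Under those assumptions, the difference between the genomic span and the transcript length would correspond to intronic sequence.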
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGGAGGAGGACGGCAAACGTGGAACGGAGGTTTCAGGTTCTCGTCGGACGCGGTCTCGTATAGCACCGGATTGGACAGCGGCGAATTGCCTCGTTCTTGTTAATGTGATTGCAGCTGTGGAGGCGGATTGTTGGAAAGCTTTGTCTAGCTTTCAGAAATGGAAAATTGTTGCAGAGAACTGCACGTCTTTGGATGTGGCTCGGAACTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGCTGATTGAACATGATGTTATCAAGCAATGGGAGTTAGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAAAGTGGAAGGAGAAAGGAATTGGGACTTCCTGATAACTTTGACGAGGAGCTGTTCAAAGCAATCGATAATGTTGTATTGATGAGGGCGAATCAGTCGGATACGGAGCCTGATAGCGATCCTGAGGCTGCGGTTGAGAAGGTAGATGAAGCTGCAGAGCCTGGTATGGTTCTTTGTTTCCTAGAGTGCTTATTGATTACATTTCTATGGTATCAAGTTTTCATGTGAATTTCTACACTTAGTTTCTTTGAATACACTTGATACTATCATTTTTGCTTCACTTAATTACTTAAAAACCTTTGTTTGCCGGCCTTGGGCGAGCAAGGTATTCCCTGAGATTAGCATAATGGGGCATGAACATAATTGCAAGGACCAGCTCTGATAGTTTGCTTATAGGAAAGTCATTTTTTGCTTGTCGATGAAATTTTCTATAACGGTCCAAGCCTACCGTTAGTAGATATTGTCCTCTTTGGGCTTTCTCTTTTGGACTTCCCCCCTTCAAGGTTTTTAAAACGTGTCTGCTAGGGAGAGGTTTCCATACCCTTTTAAAGAATGCTTTGTTCTCCTCCACAACCTATGTGGGATCTCATAATCTACTCCCCTTCGGGGCTCAGTGATCGTTCTCCTCCCCTACAGACGTGGGATCTCATAATCTACGTTCTCCTCCCCAACCGATGTGGGATCTCACAACTTACGTTCTCCTCCCCAACAGATGTGGGATCTCACAATCTACATTCTCCTCCCCAACCGATGTGGGATCTCACAACCTACGTTCTCCTCCCCAACAGATGTGGGATCTCACAATCTACGTTCTCTTCCCCAGCCGATGTGGGATCTCACAATTTACTCCCCTTCGGGGCTCAGCGTTCTCACTGACACTCGTTCCCTTCTCCAATGATGTGGGACCTCCTAATCCACCCCCTCGAGGCTAGCATCCTTGCTGACATACCGCCTCATGCCACTCCCCTTTGGGGCTCAGCCTCGTTGCTGACACATCACCCCGGTGTCTGGCTCTGATACCATTTGTAACAGCCTAAGCCCACTGCTGTCCTCTTTGGTTTTCCCTTTCGGGCTTCCCCTCAAGGTTTTTAAAATGCATATAGGAAAGGTTTTCACACCCGTATAAAGGATGTTTCGTTCTCGTCTCTAATCGACGTGGGATCTCACATTTTCAATACATTTATTGGTTTCTTCTTGTTTTTAATGCTGTAAAATTAATAGAAACAAGTGATTGAAATTCTTGTATTTCTTGTTTGGTATTAAATCCAGAGTAACATGATATCATATCTACAAATAACTCTATATCATCAACCAGAATGTCGAATGTTCGAATCTCCATCTCCACAAATTGTTGAACTTTAAATCTTGAGTAGCCATTTCAGTATCTAGATCTTTTTGGTTCTTTAATTCATTCGGGGAATTTTGAACTTTAGAGGTCTGCTAACATACATCATAACAAGGCCTGATGAGACAGGATTGCCTATTAGGGTTCTTTTGACATCCTAATTTTTTTCATGGAGATGAAATACAGCTCTAACTGCGTTTTCCTACGGCTTAATTGTGAAATGTGGTTAAAGAGTGAGATGGATCTTATAATTCCTGTCTTCAAGTCCCTTCAATAATATTGGTTTGTTCTTCTTTCTGGCATTGTGATGTTAGGCCCTAAAAGGCAAAGACGTTGTTCAATG
TCCACAAGAAACCAAGCTCTTGAGAAATCTGTAAAATGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACCTCTGGTGAGCTCTCCGGAAGTAGAGCGTCGTGGATGCTACATCAAAAGCCACGGAGAAAAGGCAACCGATAACACCGAACCCGAAGAGCAAACGATGACGAAGAAATTGCTTGAAACTGCAGAAAAAGTTCAAGCAATTGTGTCTGAGAATGCAAAATATGCAATGTCCGGCGAAAAAAACGACAACGACAACTACAACAACCGAACTAATTCCATTAGGCGTCAAGGGAGCAAGCTTATCAAATGTCTTGAAGATTTTCTCAACACCATTAACGATCTCCATAGCCTGCCCGAAGATCAAGAGTAA

mRNA sequence

ATGAAGGAGGAGGACGGCAAACGTGGAACGGAGGTTTCAGGTTCTCGTCGGACGCGGTCTCGTATAGCACCGGATTGGACAGCGGCGAATTGCCTCGTTCTTGTTAATGTGATTGCAGCTGTGGAGGCGGATTGTTGGAAAGCTTTGTCTAGCTTTCAGAAATGGAAAATTGTTGCAGAGAACTGCACGTCTTTGGATGTGGCTCGGAACTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGCTGATTGAACATGATGTTATCAAGCAATGGGAGTTAGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAAAGTGGAAGGAGAAAGGAATTGGGACTTCCTGATAACTTTGACGAGGAGCTGTTCAAAGCAATCGATAATGTTGTATTGATGAGGGCGAATCAGTCGGATACGGAGCCTGATAGCGATCCTGAGGCTGCGGTTGAGAAGGTAGATGAAGCTGCAGAGCCTGGCCCTAAAAGGCAAAGACGTTGTTCAATGTCCACAAGAAACCAAGCTCTTGAGAAATCTGTAAAATGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACCTCTGGTGAGCTCTCCGGAAGTAGAGCGTCGTGGATGCTACATCAAAAGCCACGGAGAAAAGGCAACCGATAACACCGAACCCGAAGAGCAAACGATGACGAAGAAATTGCTTGAAACTGCAGAAAAAGTTCAAGCAATTGTGTCTGAGAATGCAAAATATGCAATGTCCGGCGAAAAAAACGACAACGACAACTACAACAACCGAACTAATTCCATTAGGCGTCAAGGGAGCAAGCTTATCAAATGTCTTGAAGATTTTCTCAACACCATTAACGATCTCCATAGCCTGCCCGAAGATCAAGAGTAA

Coding sequence (CDS)

ATGAAGGAGGAGGACGGCAAACGTGGAACGGAGGTTTCAGGTTCTCGTCGGACGCGGTCTCGTATAGCACCGGATTGGACAGCGGCGAATTGCCTCGTTCTTGTTAATGTGATTGCAGCTGTGGAGGCGGATTGTTGGAAAGCTTTGTCTAGCTTTCAGAAATGGAAAATTGTTGCAGAGAACTGCACGTCTTTGGATGTGGCTCGGAACTCGAATCAGTGCAGGAGGAAGTGGGACTGTTTGCTGATTGAACATGATGTTATCAAGCAATGGGAGTTAGAGATGCCGGATGATGATTCGTATTGGTGTTTGGAAAGTGGAAGGAGAAAGGAATTGGGACTTCCTGATAACTTTGACGAGGAGCTGTTCAAAGCAATCGATAATGTTGTATTGATGAGGGCGAATCAGTCGGATACGGAGCCTGATAGCGATCCTGAGGCTGCGGTTGAGAAGGTAGATGAAGCTGCAGAGCCTGGCCCTAAAAGGCAAAGACGTTGTTCAATGTCCACAAGAAACCAAGCTCTTGAGAAATCTGTAAAATGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAACCTCTGGTGAGCTCTCCGGAAGTAGAGCGTCGTGGATGCTACATCAAAAGCCACGGAGAAAAGGCAACCGATAACACCGAACCCGAAGAGCAAACGATGACGAAGAAATTGCTTGAAACTGCAGAAAAAGTTCAAGCAATTGTGTCTGAGAATGCAAAATATGCAATGTCCGGCGAAAAAAACGACAACGACAACTACAACAACCGAACTAATTCCATTAGGCGTCAAGGGAGCAAGCTTATCAAATGTCTTGAAGATTTTCTCAACACCATTAACGATCTCCATAGCCTGCCCGAAGATCAAGAGTAA

Protein sequence

MKEEDGKRGTEVSGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAENCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDEELFKAIDNVVLMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQALEKSVKCEEEEEEEEEEEEEEPLVSSPEVERRGCYIKSHGEKATDNTEPEEQTMTKKLLETAEKVQAIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLNTINDLHSLPEDQE
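The relationship between the coding sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. The two short inputs are the first 30 and last 18 nucleotides of the CDS shown above; `translate` is a hypothetical helper, not the method used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGGAGGAGGACGGCAAACGTGGAACG"  # first 30 nt of the CDS
cds_end = "CCCGAAGATCAAGAGTAA"                # last 18 nt of the CDS, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the beginning (`MKEEDGKRGT`) and the end (`PEDQE`) of the protein sequence listed above.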
BLAST of CmaCh01G015860.1 vs. TrEMBL
Match: A0A0A0LDW0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G882960 PE=4 SV=1)

HSP 1 Score: 403.3 bits (1035), Expect = 2.6e-109
Identity = 226/313 (72.20%), Positives = 245/313 (78.27%), Query Frame = 1

Query: 2   KEEDGKRGTEVSGSRRTRSRIA--PDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVA 61
           KE  G RG+ VSGSRRTRS+IA  P WTAA+CLVLVNVIAAVEADC KALSS+QKWKIVA
Sbjct: 3   KENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA 62

Query: 62  ENCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFD 121
           ENCTSLDV R SNQCRRKWDCLLIEHDVIKQWEL+MPDDDSYWCL SGRRKELGLP+NFD
Sbjct: 63  ENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWCLASGRRKELGLPENFD 122

Query: 122 EELFKAIDNVVLMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQALEKSV 181
           EELFKAIDNV  MRANQSDTEPDSDPEAA+   DE AEPGPKRQRR SMS  NQ LEKS+
Sbjct: 123 EELFKAIDNVASMRANQSDTEPDSDPEAAIGNADEIAEPGPKRQRRRSMSKSNQVLEKSL 182

Query: 182 KCE------------EEEEEEE---EEEEEEPLVSSPEVERRGCYIKSHGEKATDNTEPE 241
           +CE            E E+  E   EE EE+PL+SSPE+E R CYIKS+  K TDN EP+
Sbjct: 183 ECERNLGLEISLECKEVEDRGERGGEEVEEKPLLSSPELEPRECYIKSNESKVTDNIEPK 242

Query: 242 EQTMTKKLLETAEKVQAIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLN 298
           EQ M K LLE AEKVQAIVSENA+Y  S EK   D    +TN +R QGSKLI+CL D LN
Sbjct: 243 EQMMAKFLLENAEKVQAIVSENAEYTTSDEKCAKD----QTNLVRHQGSKLIRCLGDILN 302
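
The percentages on each HSP summary line follow directly from the residue counts. The sketch below parses one such line and recomputes them; the regular expression and the `percent` helper are illustrative assumptions, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches the counts and percentages on an HSP summary line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def percent(count, total):
    """Return count/total as a percentage rounded to two decimals."""
    return round(100 * count / total, 2)

line = "Identity = 226/313 (72.20%), Postives = 245/313 (78.27%), Query Frame = 1"
match = HSP_STATS.search(line)
ident_n, ident_len, ident_pct, pos_n, pos_len, pos_pct = match.groups()

# Recompute the percentages from the raw counts.
identity = percent(int(ident_n), int(ident_len))
positives = percent(int(pos_n), int(pos_len))

print(identity, float(ident_pct))
print(positives, float(pos_pct))
```

Identity counts exact residue matches over the alignment length, while positives also include conservative substitutions (the `+` positions in the midline), which is why the second figure is never lower than the first.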

BLAST of CmaCh01G015860.1 vs. TrEMBL
Match: M5Y8E8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025574mg PE=4 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 1.4e-51
Identity = 136/320 (42.50%), Positives = 190/320 (59.38%), Query Frame = 1

Query: 13  SGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAENCTSLDVARNSN 72
           S  R TRS++APDW + + L+LVN IAAVEADC KALSSFQKWKI+++NC++L V R  +
Sbjct: 18  SSRRSTRSQVAPDWNSTDELLLVNEIAAVEADCLKALSSFQKWKIISQNCSALGVPRTLD 77

Query: 73  QCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDEELFKAIDNVVLM 132
           Q RRKWD L +++  IKQWE       SYW LE GRRK+ GLP+NFD ELF+AIDN+V +
Sbjct: 78  QYRRKWDALFLQYKSIKQWESASRGGASYWVLEIGRRKQKGLPENFDNELFRAIDNLVRV 137

Query: 133 RANQSDTEPDSDPEAAVEKV----DEAAEPGPKRQRRCSMSTRNQALEKSV--------- 192
           R NQSDT+PDSDPEA ++      D  AEP  KR+RR S   ++ ++E S+         
Sbjct: 138 RGNQSDTDPDSDPEAEIDAEADVPDVVAEPESKRRRRRSTHQKSCSIENSLEDVRWKSLK 197

Query: 193 --KCEEEEEEEEEEE-------EEEPLVSSPEV----------ERRGCYIKSHGEKATDN 252
             + EE+ EE   EE       EE+P+ S  EV           ++ C  K    +  + 
Sbjct: 198 KPRVEEKPEETHAEEKPQETHAEEKPVGSCLEVIPQKSLAEQKSQKSCAKKHKNSQIKEK 257

Query: 253 T---EPEEQTMTKKLLETAEKVQAIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIK 298
               E +EQ    +L E  E +QAIV+ENA +  + +     +   +T+ +RRQG ++I 
Sbjct: 258 AISIEEQEQIAVMQLHENVELIQAIVNENADHEAAADVKSTGD--PQTDLVRRQGDQVIA 317

BLAST of CmaCh01G015860.1 vs. TrEMBL
Match: V4P3R9_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10025787mg PE=4 SV=1)

HSP 1 Score: 198.4 bits (503), Expect = 1.3e-47
Identity = 130/310 (41.94%), Positives = 192/310 (61.94%), Query Frame = 1

Query: 13  SGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAENCTSLDVARNSN 72
           SGSRRTRS++APDWT  +CL+LVN IAAVEADC  ALSSFQKW I++ENC +LDV R  N
Sbjct: 6   SGSRRTRSQVAPDWTVKDCLILVNEIAAVEADCSNALSSFQKWTIISENCNALDVHRTLN 65

Query: 73  QCRRKWDCLLIEHDVIKQWELE-MPDDDSYWCLESGRRKELGLPDNFDEELFKAIDNVVL 132
           QCRRKWD L+ +++ IK+WE +      SYW L + +RK+L LP N D ELF+AI+ VV+
Sbjct: 66  QCRRKWDSLVSDYNQIKKWESQGRGGGHSYWSLSTEKRKKLNLPGNIDNELFEAINAVVM 125

Query: 133 MRANQSDTEPDSDPEA--AVEKVDEAAE---PGPKRQRRCSMST---------------- 192
           ++ +++ TEPDSDPEA    + +D +AE    G KR R+ ++                  
Sbjct: 126 LQEDKAGTEPDSDPEAQEGYDVLDVSAELAFVGSKRSRQRTLLVMKENPPHKTKTDAEPR 185

Query: 193 RNQALEKSVKCEEEEEEEEEE-EEEEPL--VSSPEVERRGCYIKSHGEKATDNTEPEEQT 252
           RN+ L+K+ +   +   +++  EE++P+  +S+ E E     I+   E+ T N E E + 
Sbjct: 186 RNRVLDKTKEQRAKATNQKKPMEEKKPVEEISTGEGEEDTMSIE---EEETMNIEKEVEA 245

Query: 253 MTKKLLETAEKVQAIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLNTIN 298
           M  KL E A+ + AIV  N   A   E  D+ + +++   +R+QG +LI CL + +NT+N
Sbjct: 246 MEAKLGEKADLIHAIVGRN--LAKGSETGDDISISDKMKFVRQQGEELIVCLSEIVNTLN 305

BLAST of CmaCh01G015860.1 vs. TrEMBL
Match: O49577_ARATH (Putative uncharacterized protein AT4g31270 OS=Arabidopsis thaliana GN=At4g31270 PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 6.9e-46
Identity = 122/294 (41.50%), Positives = 176/294 (59.86%), Query Frame = 1

Query: 13  SGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAENCTSLDVARNSN 72
           SGSRRTRS++AP+W   +CLVLVN IAAVEADC  ALSSFQKW ++ ENC +LDV+RN N
Sbjct: 6   SGSRRTRSQVAPEWAVKDCLVLVNEIAAVEADCSNALSSFQKWTMITENCNALDVSRNLN 65

Query: 73  QCRRKWDCLLIEHDVIKQWELEMPDDD-SYWCLESGRRKELGLPDNFDEELFKAIDNVVL 132
           QCRRKWD L+ +++ IK+WE +      SYW L S +RK L LP + D ELF+AI+ VV+
Sbjct: 66  QCRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPGDIDIELFEAINAVVM 125

Query: 133 MRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQALEKSVKCEEEEEEEEEE 192
           ++  ++ TE DSDPEA  + VD +AE G KR R+     R   ++++ K E      +  
Sbjct: 126 IQDEKAGTESDSDPEAQ-DVVDLSAELGSKRSRQ-----RTMVMKETKKEEPRTSRVQVN 185

Query: 193 EEEEPLVSSPEVERRGCYIK--------SHGEKATDNTEPEEQTMTKKLLETAEKVQAIV 252
             E+P+ +    + +    K           E  T N E + + M  KL    + + AIV
Sbjct: 186 TREKPITTKATHQNKTMGEKKPVEDMSTDEEEDETMNIEEDVEVMEAKLSYKIDLIHAIV 245

Query: 253 SENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLNTINDLHSLPEDQE 298
             N   A   E  D  + +++  S+R+QG +LI CL + ++T+N LH +P++ E
Sbjct: 246 GRN--LAKDNETKDGVSMDDKLKSVRQQGDELIGCLSEIVSTLNRLHEVPQEIE 291

BLAST of CmaCh01G015860.1 vs. TrEMBL
Match: D7MB82_ARALL (Predicted protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_656923 PE=4 SV=1)

HSP 1 Score: 192.2 bits (487), Expect = 9.0e-46
Identity = 121/294 (41.16%), Positives = 174/294 (59.18%), Query Frame = 1

Query: 13  SGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAENCTSLDVARNSN 72
           SGSRRTRS++AP+W   +CL+LVN IAAVEADC  ALSSFQKW ++ ENC +LDV RN N
Sbjct: 6   SGSRRTRSQVAPEWAVKDCLILVNEIAAVEADCSNALSSFQKWTMILENCNALDVRRNLN 65

Query: 73  QCRRKWDCLLIEHDVIKQWELEMPDDD-SYWCLESGRRKELGLPDNFDEELFKAIDNVVL 132
           QCRRKWD L+ +++ IKQWE +      SYW L S +RK L LP N D ELF+AI  VV+
Sbjct: 66  QCRRKWDSLMSDYNQIKQWESQYRGTGRSYWSLSSDKRKLLNLPGNIDIELFEAISAVVM 125

Query: 133 MRANQSDTEPDSDPEA--AVEKVDEAAEPGPKRQRRCSMSTRNQALEKSVKCEEEEEEEE 192
           ++  ++ TE DSDPEA   V+   E A  G KR R+ ++  +    +K+ K E +    +
Sbjct: 126 IQDEKAGTESDSDPEAQDVVDITAELAFVGSKRSRQRTIVMKENPPQKTKKEEPQISRVQ 185

Query: 193 EEEEEEPLVSSPEVERRGCYIKSHGEKATDNTEPEEQTMT---------KKLLETAEKVQ 252
               E+P+ +    +++    K   E+ + + E EE+TM           KL    + + 
Sbjct: 186 VNTREKPITAKATHQKKTMEEKRPMEEISTDEEEEEETMNIEEEVEVMEAKLSYKIDLIH 245

Query: 253 AIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLNTINDLHSLPE 295
           AIV  N   A   E  D  N +++   +R+QG +LI CL + ++T+N L  +P+
Sbjct: 246 AIVGRN--LAKDNETRDGINTDDKLKFVRQQGDELIGCLSEIVSTLNRLREVPQ 297

BLAST of CmaCh01G015860.1 vs. TAIR10
Match: AT4G31270.1 (AT4G31270.1 sequence-specific DNA binding transcription factors)

HSP 1 Score: 188.7 bits (478), Expect = 5.1e-48
Identity = 121/296 (40.88%), Positives = 174/296 (58.78%), Query Frame = 1

Query: 13  SGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAENCTSLDVARNSN 72
           SGSRRTRS++AP+W   +CLVLVN IAAVEADC  ALSSFQKW ++ ENC +LDV+RN N
Sbjct: 6   SGSRRTRSQVAPEWAVKDCLVLVNEIAAVEADCSNALSSFQKWTMITENCNALDVSRNLN 65

Query: 73  QCRRKWDCLLIEHDVIKQWELEMPDDD-SYWCLESGRRKELGLPDNFDEELFKAIDNVVL 132
           QCRRKWD L+ +++ IK+WE +      SYW L S +RK L LP + D ELF+AI+ VV+
Sbjct: 66  QCRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPGDIDIELFEAINAVVM 125

Query: 133 MRANQSDTEPDSDPEA--AVEKVDEAAEPGPKRQRRCSMSTRNQALEKSVKCEEEEEEEE 192
           ++  ++ TE DSDPEA   V+   E A  G KR R+     R   ++++ K E      +
Sbjct: 126 IQDEKAGTESDSDPEAQDVVDLSAELAFVGSKRSRQ-----RTMVMKETKKEEPRTSRVQ 185

Query: 193 EEEEEEPLVSSPEVERRGCYIK--------SHGEKATDNTEPEEQTMTKKLLETAEKVQA 252
               E+P+ +    + +    K           E  T N E + + M  KL    + + A
Sbjct: 186 VNTREKPITTKATHQNKTMGEKKPVEDMSTDEEEDETMNIEEDVEVMEAKLSYKIDLIHA 245

Query: 253 IVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLNTINDLHSLPEDQE 298
           IV  N   A   E  D  + +++  S+R+QG +LI CL + ++T+N LH +P++ E
Sbjct: 246 IVGRN--LAKDNETKDGVSMDDKLKSVRQQGDELIGCLSEIVSTLNRLHEVPQEIE 294

BLAST of CmaCh01G015860.1 vs. TAIR10
Match: AT2G35640.1 (AT2G35640.1 Homeodomain-like superfamily protein)

HSP 1 Score: 52.8 bits (125), Expect = 4.3e-07
Identity = 38/143 (26.57%), Positives = 60/143 (41.96%), Query Frame = 1

Query: 25  DWTAANCLVLVNVIAAVEADCWKALSSFQK------------WKIVAENCTSLDVARNSN 84
           +WT +  LVL+    A + D  + +   +K            WK + E C      RN N
Sbjct: 21  NWTVSETLVLIE---AKKMDDQRRVRRSEKQPEGRNKPAELRWKWIEEYCWRRGCYRNQN 80

Query: 85  QCRRKWDCLLIEHDVIKQWELEMPD-------DDSYWCLESGRRKELGLPDNFDEELFKA 144
           QC  KWD L+ ++  I+++E    +         SYW ++   RKE  LP N   +++  
Sbjct: 81  QCNDKWDNLMRDYKKIREYERSRVESSFNTVTSSSYWKMDKTERKEKNLPSNMLPQIYDV 140

Query: 145 IDNVVLMRANQSDTEPDSDPEAA 149
           +  +V        T P S   AA
Sbjct: 141 LSELV-----DRKTLPSSSSAAA 155

BLAST of CmaCh01G015860.1 vs. TAIR10
Match: AT1G31310.1 (AT1G31310.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 50.8 bits (120), Expect = 1.6e-06
Identity = 33/120 (27.50%), Positives = 51/120 (42.50%), Query Frame = 1

Query: 54  KWKIVAENCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDD-------------- 113
           +WK + + C      R+ NQC  KWD L+ ++  ++++E    +                
Sbjct: 63  RWKWIEDYCWRKGCMRSQNQCNDKWDNLMRDYKKVREYERRRVESSITAGESSSSSAPAG 122

Query: 114 ---SYWCLESGRRKELGLPDNFDEELFKAIDNVVLMRANQSDTEPDSDPEAAVEKVDEAA 157
              SYW +E   RKE  LP N   + ++A+  VV     +S T P S    AV     AA
Sbjct: 123 ETASYWKMEKSERKERSLPSNMLPQTYQALFEVV-----ESKTLPSSTAVTAVTAAVAAA 177

BLAST of CmaCh01G015860.1 vs. NCBI nr
Match: gi|449437322|ref|XP_004136441.1| (PREDICTED: uncharacterized protein LOC101210084 [Cucumis sativus])

HSP 1 Score: 403.3 bits (1035), Expect = 3.7e-109
Identity = 226/313 (72.20%), Positives = 245/313 (78.27%), Query Frame = 1

Query: 2   KEEDGKRGTEVSGSRRTRSRIA--PDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVA 61
           KE  G RG+ VSGSRRTRS+IA  P WTAA+CLVLVNVIAAVEADC KALSS+QKWKIVA
Sbjct: 3   KENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA 62

Query: 62  ENCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFD 121
           ENCTSLDV R SNQCRRKWDCLLIEHDVIKQWEL+MPDDDSYWCL SGRRKELGLP+NFD
Sbjct: 63  ENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWCLASGRRKELGLPENFD 122

Query: 122 EELFKAIDNVVLMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQALEKSV 181
           EELFKAIDNV  MRANQSDTEPDSDPEAA+   DE AEPGPKRQRR SMS  NQ LEKS+
Sbjct: 123 EELFKAIDNVASMRANQSDTEPDSDPEAAIGNADEIAEPGPKRQRRRSMSKSNQVLEKSL 182

Query: 182 KCE------------EEEEEEE---EEEEEEPLVSSPEVERRGCYIKSHGEKATDNTEPE 241
           +CE            E E+  E   EE EE+PL+SSPE+E R CYIKS+  K TDN EP+
Sbjct: 183 ECERNLGLEISLECKEVEDRGERGGEEVEEKPLLSSPELEPRECYIKSNESKVTDNIEPK 242

Query: 242 EQTMTKKLLETAEKVQAIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLN 298
           EQ M K LLE AEKVQAIVSENA+Y  S EK   D    +TN +R QGSKLI+CL D LN
Sbjct: 243 EQMMAKFLLENAEKVQAIVSENAEYTTSDEKCAKD----QTNLVRHQGSKLIRCLGDILN 302

BLAST of CmaCh01G015860.1 vs. NCBI nr
Match: gi|659132591|ref|XP_008466281.1| (PREDICTED: uncharacterized protein LOC103503736 isoform X1 [Cucumis melo])

HSP 1 Score: 388.7 bits (997), Expect = 9.4e-105
Identity = 217/311 (69.77%), Positives = 241/311 (77.49%), Query Frame = 1

Query: 2   KEEDGKRGTEVSGSRRTRSRIA--PDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVA 61
           KE  G RG+ VSGSRRTRS+IA  P WTAA+CLVLVNVIAAVEADC KALSS+QKWKIVA
Sbjct: 3   KENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA 62

Query: 62  ENCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFD 121
           ENCTSLDV R SNQCRRKWDCLLIEHDVIKQWEL+MPDDDSYW L SGRRKELGLP+NFD
Sbjct: 63  ENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWRLASGRRKELGLPENFD 122

Query: 122 EELFKAIDNVVLMRANQSDTEPDSDPEAAVEKVDEAAEPGPKRQRRCSMSTRNQALEKSV 181
           EELFKAIDNV  MRANQSDTEPDSDPEAA+E  +E AEPGPKRQRR SMS  NQALE S 
Sbjct: 123 EELFKAIDNVASMRANQSDTEPDSDPEAAIENANEIAEPGPKRQRRRSMSKSNQALENSP 182

Query: 182 KCEEEE---------------EEEEEEEEEEPLVSSPEVERRGCYIKSHGEKATDNTEPE 241
           +CE  +               E E EE +E+PL+SSPE+E +  YIKS+  K  D+ EP+
Sbjct: 183 ECERNQALEISLECKEVEDGGEGEGEEVKEKPLLSSPELESQEYYIKSNESKVADDVEPK 242

Query: 242 EQTMTKKLLETAEKVQAIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIKCLEDFLN 296
           EQ M K LLE AEKVQAIVSENA+Y  S EK + D    +TN +R QGSKLI+CL D LN
Sbjct: 243 EQMMAKFLLENAEKVQAIVSENAEYTTSDEKCNKD----QTNLVRHQGSKLIRCLGDILN 302

BLAST of CmaCh01G015860.1 vs. NCBI nr
Match: gi|659132597|ref|XP_008466284.1| (PREDICTED: uncharacterized protein LOC103503736 isoform X2 [Cucumis melo])

HSP 1 Score: 268.9 bits (686), Expect = 1.1e-68
Identity = 134/159 (84.28%), Positives = 142/159 (89.31%), Query Frame = 1

Query: 2   KEEDGKRGTEVSGSRRTRSRIA--PDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVA 61
           KE  G RG+ VSGSRRTRS+IA  P WTAA+CLVLVNVIAAVEADC KALSS+QKWKIVA
Sbjct: 3   KENAGNRGSGVSGSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVA 62

Query: 62  ENCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFD 121
           ENCTSLDV R SNQCRRKWDCLLIEHDVIKQWEL+MPDDDSYW L SGRRKELGLP+NFD
Sbjct: 63  ENCTSLDVVRTSNQCRRKWDCLLIEHDVIKQWELKMPDDDSYWRLASGRRKELGLPENFD 122

Query: 122 EELFKAIDNVVLMRANQSDTEPDSDPEAAVEKVDEAAEP 159
           EELFKAIDNV  MRANQSDTEPDSDPEAA+E  +E AEP
Sbjct: 123 EELFKAIDNVASMRANQSDTEPDSDPEAAIENANEIAEP 161

BLAST of CmaCh01G015860.1 vs. NCBI nr
Match: gi|596295947|ref|XP_007227091.1| (hypothetical protein PRUPE_ppa025574mg [Prunus persica])

HSP 1 Score: 211.5 bits (537), Expect = 2.1e-51
Identity = 136/320 (42.50%), Positives = 190/320 (59.38%), Query Frame = 1

Query: 13  SGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAENCTSLDVARNSN 72
           S  R TRS++APDW + + L+LVN IAAVEADC KALSSFQKWKI+++NC++L V R  +
Sbjct: 18  SSRRSTRSQVAPDWNSTDELLLVNEIAAVEADCLKALSSFQKWKIISQNCSALGVPRTLD 77

Query: 73  QCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDEELFKAIDNVVLM 132
           Q RRKWD L +++  IKQWE       SYW LE GRRK+ GLP+NFD ELF+AIDN+V +
Sbjct: 78  QYRRKWDALFLQYKSIKQWESASRGGASYWVLEIGRRKQKGLPENFDNELFRAIDNLVRV 137

Query: 133 RANQSDTEPDSDPEAAVEKV----DEAAEPGPKRQRRCSMSTRNQALEKSV--------- 192
           R NQSDT+PDSDPEA ++      D  AEP  KR+RR S   ++ ++E S+         
Sbjct: 138 RGNQSDTDPDSDPEAEIDAEADVPDVVAEPESKRRRRRSTHQKSCSIENSLEDVRWKSLK 197

Query: 193 --KCEEEEEEEEEEE-------EEEPLVSSPEV----------ERRGCYIKSHGEKATDN 252
             + EE+ EE   EE       EE+P+ S  EV           ++ C  K    +  + 
Sbjct: 198 KPRVEEKPEETHAEEKPQETHAEEKPVGSCLEVIPQKSLAEQKSQKSCAKKHKNSQIKEK 257

Query: 253 T---EPEEQTMTKKLLETAEKVQAIVSENAKYAMSGEKNDNDNYNNRTNSIRRQGSKLIK 298
               E +EQ    +L E  E +QAIV+ENA +  + +     +   +T+ +RRQG ++I 
Sbjct: 258 AISIEEQEQIAVMQLHENVELIQAIVNENADHEAAADVKSTGD--PQTDLVRRQGDQVIA 317

BLAST of CmaCh01G015860.1 vs. NCBI nr
Match: gi|645226294|ref|XP_008219973.1| (PREDICTED: uncharacterized protein LOC103320122 [Prunus mume])

HSP 1 Score: 210.7 bits (535), Expect = 3.5e-51
Identity = 140/332 (42.17%), Positives = 193/332 (58.13%), Query Frame = 1

Query: 1   MKEEDGKRGTEVSGSRRTRSRIAPDWTAANCLVLVNVIAAVEADCWKALSSFQKWKIVAE 60
           MK +  + G+  S  R TRS++APDW   + L+LVN IAAVEADC KALSSFQKWKI+++
Sbjct: 1   MKVQGSQSGS--SSRRSTRSQVAPDWNPTDELLLVNEIAAVEADCLKALSSFQKWKIISQ 60

Query: 61  NCTSLDVARNSNQCRRKWDCLLIEHDVIKQWELEMPDDDSYWCLESGRRKELGLPDNFDE 120
           NC++L V R  +Q RRKWD L +E+  IKQWE       SYW LE GRRK+ GLP+NFD 
Sbjct: 61  NCSALGVPRTLDQYRRKWDALFLEYKSIKQWESASRGGASYWVLEIGRRKQKGLPENFDN 120

Query: 121 ELFKAIDNVVLMRANQSDTEPDSDPEAAVEKV----DEAAEPGPKRQRRCSMSTRNQALE 180
           ELF+AIDN+V +R NQSDT+PDSDPEA ++      D  AEP  KR+RR S   ++  ++
Sbjct: 121 ELFRAIDNLVRVRGNQSDTDPDSDPEAEIDAEADVPDVVAEPESKRRRRRSTHQKSCPIK 180

Query: 181 KSVK-CEEEEEEE-----------------EEEEEEEPLVSSPEV----------ERRGC 240
            S + C+ E  EE                 E   EE+P+ S  EV           ++ C
Sbjct: 181 NSFRRCKVERPEETHVEEKPEETHAEVKPQETHAEEKPVGSCLEVIPQKSLAEEKSQKSC 240

Query: 241 YIKSHGEKATDNT---EPEEQTMTKKLLETAEKVQAIVSENAKYAMSGEKNDNDNYNNRT 298
             K    +  +     E +EQ    +L E  E +QAIV+ENA +  +  K+  D    +T
Sbjct: 241 AKKHKNSQIKEKAISIEEQEQIAVMQLHENVELIQAIVNENADHEAADVKSTGDP---QT 300

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
A0A0A0LDW0_CUCSA  2.6e-109  72.20         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G882960 PE=4 SV=1
M5Y8E8_PRUPE      1.4e-51   42.50         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025574mg PE=4 SV=1
V4P3R9_EUTSA      1.3e-47   41.94         Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10025787mg PE=4 SV=1
O49577_ARATH      6.9e-46   41.50         Putative uncharacterized protein AT4g31270 OS=Arabidopsis thaliana GN=At4g31270 ...
D7MB82_ARALL      9.0e-46   41.16         Predicted protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_656923 PE=4 ...

Match Name   E-value  Identity (%)  Description
AT4G31270.1  5.1e-48  40.88         sequence-specific DNA binding transcription factors
AT2G35640.1  4.3e-07  26.57         Homeodomain-like superfamily protein
AT1G31310.1  1.6e-06  27.50         hydroxyproline-rich glycoprotein family protein

Match Name                        E-value   Identity (%)  Description
gi|449437322|ref|XP_004136441.1|  3.7e-109  72.20         PREDICTED: uncharacterized protein LOC101210084 [Cucumis sativus]
gi|659132591|ref|XP_008466281.1|  9.4e-105  69.77         PREDICTED: uncharacterized protein LOC103503736 isoform X1 [Cucumis melo]
gi|659132597|ref|XP_008466284.1|  1.1e-68   84.28         PREDICTED: uncharacterized protein LOC103503736 isoform X2 [Cucumis melo]
gi|596295947|ref|XP_007227091.1|  2.1e-51   42.50         hypothetical protein PRUPE_ppa025574mg [Prunus persica]
gi|645226294|ref|XP_008219973.1|  3.5e-51   42.17         PREDICTED: uncharacterized protein LOC103320122 [Prunus mume]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR017877  Myb-like_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CmaCh01G015860  CmaCh01G015860  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CmaCh01G015860.1  CmaCh01G015860.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CmaCh01G015860.1.CDS.2  CmaCh01G015860.1.CDS.2  CDS
CmaCh01G015860.1.CDS.1  CmaCh01G015860.1.CDS.1  CDS


The following exon feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CmaCh01G015860.1.exon.2  CmaCh01G015860.1.exon.2  exon
CmaCh01G015860.1.exon.1  CmaCh01G015860.1.exon.1  exon


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description   Source   Source Term    Source Description  Alignment
IPR017877  Myb-like domain   PROFILE  PS50090        MYB_LIKE            coord: 26..82, score: 6
None       No IPR available  unknown  Coil           Coil                coord: 171..195, scor
None       No IPR available  PANTHER  PTHR33492      FAMILY NOT NAMED    coord: 7..295, score: 2.8
None       No IPR available  PANTHER  PTHR33492:SF4  SUBFAMILY NOT NAMED coord: 7..295, score: 2.8
None       No IPR available  PFAM     PF13837        Myb_DNA-bind_4      coord: 25..96, score: 2.
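
Several of the annotations above cover overlapping stretches of the protein. The sketch below compares the reported coordinate ranges, assuming they are 1-based inclusive residue positions (not stated on this page); `overlap` is a hypothetical helper written for this illustration.

```python
def overlap(a, b):
    """Return the inclusive overlap of two (start, end) ranges, or None if disjoint."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return (start, end) if start <= end else None

# Coordinates copied from the InterPro annotation table.
myb_like_profile = (26, 82)   # PS50090, MYB_LIKE
myb_dna_bind_4 = (25, 96)     # PF13837, Myb_DNA-bind_4
coil = (171, 195)             # Coil

shared = overlap(myb_like_profile, myb_dna_bind_4)
shared_length = shared[1] - shared[0] + 1

print(shared, shared_length)
print(overlap(myb_like_profile, coil))
```

Under that assumption, the PROFILE Myb-like hit lies entirely within the Pfam Myb_DNA-bind_4 hit, while the predicted coiled-coil region is separate from both.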