CmaCh05G004550 (gene) Cucurbita maxima (Rimu)

NameCmaCh05G004550
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionPlant/F4N2-9 protein, putative
LocationCma_Chr05 : 2098206 .. 2099567 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCCCACATTCATCTATGCCTCCTAAAGCACAATTGCCTTTCATTTTCATGTGGGTCTTCCTCTTAATTCATTTCCATTTCTCACACTCTTCAATATCTCCCAAGATCCCTCTTATGTCCTTCAATTCTTCCATATATATAGCTTTTTACACCAAGTTTTGCACAACAAGCTCGCCCTTACGAAGAACAAAGATGTCGCTCTCGAGCTCGGAGATGATCAATAAGAAGATGTTCCACCATAGGAGAGCTTCAAAGGAGCTCGAGGTATTCGAAGCTGCTCGATATTTCTCCGATTACAATGAAACTTCGGCGACAATGAACGGTTTCGGCGCAAAGTTTACTCCGAAAGTGAAGAAGAAAGACAAAGGATGGATCAAAGGGAGAATCAGCTTAGACATGCAGGTTTTCGATCTTAATATGAAGTTCGAGTCCCATTATTGAAGTTCGGGCGTAAATTTAACCATGTTTTTTTTTCCGACATAGGTGAAGAACATCCTTAATCTCCCCGAACATTTTCCCCAACATTCTTACAAAGTTGAAAAGCATGTTACAAAGGAGAAGAAGTACAAACAACCAAGCTCACCTGGTGGGAGACTAGCAAGCTTTCTAAACTCTCTCTTTAGCCATTCAAGTTCGAAGAAGAAGAAGTCAAAACTTTTTGCGCAATCCGTGGAAGACATAGCGGAAGACGAAAGCCGAACGTCGAAGAGAAGGATTAGCATCAGCCATTTTCGATCCTCGAACACAGCTGCAACGGATGCTAAGTTCATCTATTCTTCGTCTCCGGGCAATAATTCAGGTAAACCCCGAATCAACTTAACCTAAATATATATATATTCTAATCATTTATTCATATTCTTAACGAATAGGTTTCAGAACACCGCCTCCGCACGTTCAAACCCCGACAAAGAGCTACAAGGAACTATTGTCGTTTTCGAAGTTCACCCGACACGTAAAATCAGCGGAGACGTTGGAGAAACGGGCTTCGAAACAAAAGAAGAAGAATTTGAGTAATTGTTCTTCAGAAAAAGATAGGGTTTGGGTTGAAAAACCATTGGGAGATGAAGAAATGAAGAAAAAATTGAAGAAATTTGAGCATGACATTAATATTATTGATGATATTGATGATGATGATGGTGGAGAAACAGATTCCAGCTCAGATTTGTTTGAATTACAGATCTATGATTTAGATTATTACTCAAATGGGTTGCCTGTTTATGAATCTACTGATATTGATAGCATCAAGAGAAGAAATTCAATCTCTAATGGTGTTTGTTGAGATGTAAGAAGAACAAACTTGGTCTTTAATGGTGTTTTGTAATTAAAAAAATTTCAACTTTTCGATTGATCTTTTGATTTTCTG

mRNA sequence

CCCCACATTCATCTATGCCTCCTAAAGCACAATTGCCTTTCATTTTCATGTGGGTCTTCCTCTTAATTCATTTCCATTTCTCACACTCTTCAATATCTCCCAAGATCCCTCTTATGTCCTTCAATTCTTCCATATATATAGCTTTTTACACCAAGTTTTGCACAACAAGCTCGCCCTTACGAAGAACAAAGATGTCGCTCTCGAGCTCGGAGATGATCAATAAGAAGATGTTCCACCATAGGAGAGCTTCAAAGGAGCTCGAGGTATTCGAAGCTGCTCGATATTTCTCCGATTACAATGAAACTTCGGCGACAATGAACGGTTTCGGCGCAAAGTTTACTCCGAAAGTGAAGAAGAAAGACAAAGGATGGATCAAAGGGAGAATCAGCTTAGACATGCAGGTGAAGAACATCCTTAATCTCCCCGAACATTTTCCCCAACATTCTTACAAAGTTGAAAAGCATGTTACAAAGGAGAAGAAGTACAAACAACCAAGCTCACCTGGTGGGAGACTAGCAAGCTTTCTAAACTCTCTCTTTAGCCATTCAAGTTCGAAGAAGAAGAAGTCAAAACTTTTTGCGCAATCCGTGGAAGACATAGCGGAAGACGAAAGCCGAACGTCGAAGAGAAGGATTAGCATCAGCCATTTTCGATCCTCGAACACAGCTGCAACGGATGCTAAGTTCATCTATTCTTCGTCTCCGGGCAATAATTCAGGTTTCAGAACACCGCCTCCGCACGTTCAAACCCCGACAAAGAGCTACAAGGAACTATTGTCGTTTTCGAAGTTCACCCGACACGTAAAATCAGCGGAGACGTTGGAGAAACGGGCTTCGAAACAAAAGAAGAAGAATTTGAGTAATTGTTCTTCAGAAAAAGATAGGGTTTGGGTTGAAAAACCATTGGGAGATGAAGAAATGAAGAAAAAATTGAAGAAATTTGAGCATGACATTAATATTATTGATGATATTGATGATGATGATGGTGGAGAAACAGATTCCAGCTCAGATTTGTTTGAATTACAGATCTATGATTTAGATTATTACTCAAATGGGTTGCCTGTTTATGAATCTACTGATATTGATAGCATCAAGAGAAGAAATTCAATCTCTAATGGTGTTTGTTGAGATGTAAGAAGAACAAACTTGGTCTTTAATGGTGTTTTGTAATTAAAAAAATTTCAACTTTTCGATTGATCTTTTGATTTTCTG

Coding sequence (CDS)

ATGCCTCCTAAAGCACAATTGCCTTTCATTTTCATGTGGGTCTTCCTCTTAATTCATTTCCATTTCTCACACTCTTCAATATCTCCCAAGATCCCTCTTATGTCCTTCAATTCTTCCATATATATAGCTTTTTACACCAAGTTTTGCACAACAAGCTCGCCCTTACGAAGAACAAAGATGTCGCTCTCGAGCTCGGAGATGATCAATAAGAAGATGTTCCACCATAGGAGAGCTTCAAAGGAGCTCGAGGTATTCGAAGCTGCTCGATATTTCTCCGATTACAATGAAACTTCGGCGACAATGAACGGTTTCGGCGCAAAGTTTACTCCGAAAGTGAAGAAGAAAGACAAAGGATGGATCAAAGGGAGAATCAGCTTAGACATGCAGGTGAAGAACATCCTTAATCTCCCCGAACATTTTCCCCAACATTCTTACAAAGTTGAAAAGCATGTTACAAAGGAGAAGAAGTACAAACAACCAAGCTCACCTGGTGGGAGACTAGCAAGCTTTCTAAACTCTCTCTTTAGCCATTCAAGTTCGAAGAAGAAGAAGTCAAAACTTTTTGCGCAATCCGTGGAAGACATAGCGGAAGACGAAAGCCGAACGTCGAAGAGAAGGATTAGCATCAGCCATTTTCGATCCTCGAACACAGCTGCAACGGATGCTAAGTTCATCTATTCTTCGTCTCCGGGCAATAATTCAGGTTTCAGAACACCGCCTCCGCACGTTCAAACCCCGACAAAGAGCTACAAGGAACTATTGTCGTTTTCGAAGTTCACCCGACACGTAAAATCAGCGGAGACGTTGGAGAAACGGGCTTCGAAACAAAAGAAGAAGAATTTGAGTAATTGTTCTTCAGAAAAAGATAGGGTTTGGGTTGAAAAACCATTGGGAGATGAAGAAATGAAGAAAAAATTGAAGAAATTTGAGCATGACATTAATATTATTGATGATATTGATGATGATGATGGTGGAGAAACAGATTCCAGCTCAGATTTGTTTGAATTACAGATCTATGATTTAGATTATTACTCAAATGGGTTGCCTGTTTATGAATCTACTGATATTGATAGCATCAAGAGAAGAAATTCAATCTCTAATGGTGTTTGTTGA

Protein sequence

MPPKAQLPFIFMWVFLLIHFHFSHSSISPKIPLMSFNSSIYIAFYTKFCTTSSPLRRTKMSLSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKDKGWIKGRISLDMQVKNILNLPEHFPQHSYKVEKHVTKEKKYKQPSSPGGRLASFLNSLFSHSSSKKKKSKLFAQSVEDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSPGNNSGFRTPPPHVQTPTKSYKELLSFSKFTRHVKSAETLEKRASKQKKKNLSNCSSEKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDSSSDLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSISNGVC
BLAST of CmaCh05G004550 vs. Swiss-Prot
Match: BIG1E_ARATH (Protein BIG GRAIN 1-like E OS=Arabidopsis thaliana GN=At1g69160 PE=2 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 2.9e-34
Identity = 120/329 (36.47%), Postives = 181/329 (55.02%), Query Frame = 1

Query: 61  SLSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKDK--- 120
           S  S ++  +    H+R S+EL+VFEAA YF  YNE S+  +G   K+     +++    
Sbjct: 8   SAESDKLSRRISLTHKRNSEELDVFEAAVYFG-YNEASSGDHGHTQKYGYNAAREENPRR 67

Query: 121 -GWIKG--RISLDMQVK---NILNLPE-HFPQHSYKVEKHVTKEKKYKQPSSPGGRLASF 180
            G + G  RISLD+ ++    + +L + H  +H     K      ++KQPSSPGG++ASF
Sbjct: 68  WGILGGGRRISLDLPIRCSEQVYHLQQDHHEKHEVTTIKERLGNVRHKQPSSPGGKIASF 127

Query: 181 LNSLFSHSSSKKKKSKLFAQS-------VEDIAEDESRTSKRRISISHFRSSN------- 240
           LNSLF  + SKK KSK  +++        E+I        +RR SISHF SS+       
Sbjct: 128 LNSLFHQAGSKKNKSKSKSKTKPTDPEVEEEIPGGGWMRRRRRSSISHFFSSSRSTSTTT 187

Query: 241 --TAATDAKFIYSSSPGNNSGFRTPPPHVQTPTKSYKELLSFSKFTRHVKSAETLEKRAS 300
             TA++ +K + SSS   +SGFRTPPP++ TPTK+YK+ L+++  T+ V   ET   +  
Sbjct: 188 TTTASSSSKSLISSS---SSGFRTPPPYLNTPTKNYKQFLNYTSATKQVGEEETKTNKEY 247

Query: 301 K--QKKKNLSNCSSEKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDSSSD 360
               +K  +    SE  R+W +    D++ + K +             +DDG E+DSSSD
Sbjct: 248 SWLDEKLKVMESLSENQRIWSDDEDIDDDRRIKRE------------GEDDGMESDSSSD 307

Query: 361 LFELQIYDLDYYSNGLPVYESTDIDSIKR 362
           LFELQ Y+L     GLPVYE+T++ +I +
Sbjct: 308 LFELQNYELS--RGGLPVYETTNVANINK 318

BLAST of CmaCh05G004550 vs. TrEMBL
Match: A0A0A0KD85_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G366580 PE=4 SV=1)

HSP 1 Score: 474.9 bits (1221), Expect = 8.7e-131
Identity = 266/338 (78.70%), Postives = 284/338 (84.02%), Query Frame = 1

Query: 60  MSLSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKDKGW 119
           MSLSSSE+INKKMFHHRRASKEL+VFEAARYFSDYNETSA+ + FGAKFTPK+KKKDKGW
Sbjct: 1   MSLSSSEIINKKMFHHRRASKELDVFEAARYFSDYNETSASTSSFGAKFTPKMKKKDKGW 60

Query: 120 IKGRISLDMQVKNILNLPEHFPQH-SYKVEKHVTKEKKYKQPSSPGGRLASFLNSLFSHS 179
           IKGRISLDMQVKNILNLP+HFPQH SY VEK VTKEKKYKQPSSPGGRLASFLNS+FSHS
Sbjct: 61  IKGRISLDMQVKNILNLPQHFPQHDSYSVEKQVTKEKKYKQPSSPGGRLASFLNSIFSHS 120

Query: 180 SSKKKKSKLFAQSV-EDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSP-GNNSGF 239
           SSKKKKSK FAQS+ ED+ +DESRTSKRRISISHFR+SN  ATDAKFIYSSSP  NNSGF
Sbjct: 121 SSKKKKSKHFAQSMDEDMEDDESRTSKRRISISHFRTSNATATDAKFIYSSSPRNNNSGF 180

Query: 240 RTPPPHVQTPTKSYKELLSFSKFTRHVKSAETLEK---------------RASKQKKKNL 299
           RTPPPHVQTPTKSYKELLSFSKF R VKSAE LEK                  K KK NL
Sbjct: 181 RTPPPHVQTPTKSYKELLSFSKFNRLVKSAEALEKPSMDDKRIRKDKGVVEKQKMKKNNL 240

Query: 300 S--NCSSEKDRVWVEKPLGDEEMKKKLKKFEHDIN--------IIDDIDDDDGGETDSSS 359
           S  NC SEKDRVWVEK L   EMKKKL+KF+H+IN          +D ++DDGGETDSSS
Sbjct: 241 SNNNCCSEKDRVWVEKNLVGGEMKKKLRKFDHEINKNNTVGGHNNNDDEEDDGGETDSSS 300

Query: 360 DLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSISNGV 370
           DLFELQIYDLDYYSNGLPVYESTDIDSIKRRNS+SN V
Sbjct: 301 DLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSVSNAV 338

BLAST of CmaCh05G004550 vs. TrEMBL
Match: F6HF00_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00120 PE=4 SV=1)

HSP 1 Score: 231.9 bits (590), Expect = 1.3e-57
Identity = 164/338 (48.52%), Postives = 212/338 (62.72%), Query Frame = 1

Query: 61  SLSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKDKGWI 120
           +LS ++ I KK FH R  S EL+VFEAARYFS  NE     NG        ++++ +GW 
Sbjct: 5   ALSEADKIYKKSFHRRNDSGELDVFEAARYFSGGNEIIG-YNGAAFPQRMMMREERQGWR 64

Query: 121 KGRISLDMQVKNILNLPEHFPQHSYKVEKHVTKEKKYKQPSSPGGRLASFLNSLFSHSSS 180
            GRISLDM +++  +LP    Q S+ VEK + ++ KYKQPSSPGGRLASFLNSLF+ ++S
Sbjct: 65  GGRISLDMPMRS--SLPT---QSSHAVEKQMKEKIKYKQPSSPGGRLASFLNSLFNQTNS 124

Query: 181 KKKKSKLFAQSVEDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSPGNNSGFRTPP 240
           KKKKSK  AQS++D  E      KRR SISHFRSS+TA  D+K +YSSS   +SGFRTPP
Sbjct: 125 KKKKSKSTAQSIKDEEESPGGRRKRRSSISHFRSSSTA--DSKSVYSSS---SSGFRTPP 184

Query: 241 PHVQTPTKSYKELLSFSKFTR----------HVKSA----ETLEKRASK----------- 300
           P+  TPTK+YK+L S+S   +          +VK+     E L+++  K           
Sbjct: 185 PYANTPTKTYKDLRSYSDHRQVVSLPNYNNGNVKATGLRNEALDEKRIKELVWLDEKFKF 244

Query: 301 -----QKKKNLSNCSSEKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDSS 360
                +K KN SN  SEKDR+WV++   +E+  +KL          D+I  D G E+DSS
Sbjct: 245 SSGFSEKHKNFSNGLSEKDRIWVDEYPSEEKEFRKL----------DEI--DAGAESDSS 304

Query: 361 SDLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSISNG 369
           SDLFELQ YDL  YS+GLPVYE+T +DSIKR   ISNG
Sbjct: 305 SDLFELQNYDLGCYSSGLPVYETTHMDSIKRGAPISNG 319

BLAST of CmaCh05G004550 vs. TrEMBL
Match: B9REB4_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1619950 PE=4 SV=1)

HSP 1 Score: 231.1 bits (588), Expect = 2.2e-57
Identity = 162/339 (47.79%), Postives = 205/339 (60.47%), Query Frame = 1

Query: 62  LSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKD--KGW 121
           L  +  + KK  H R  S EL+VFEAARYFS YNE +    G    +T K+ + D    W
Sbjct: 6   LPDTSKLYKKSLHRRNDSDELDVFEAARYFSGYNEAAGYNGG---TYTQKILRDDYRHPW 65

Query: 122 IKGRISLDMQVKNILNLPEHFPQHSYKVEKHVTKEKKYKQPSSPGGRLASFLNSLFSHSS 181
             GR+SLD+ ++N   LP+    H + VEK + KEKKYKQPSSPGGRLASFLNSLF+ +S
Sbjct: 66  RGGRMSLDVPMRN--PLPQQTHSHHHTVEKQILKEKKYKQPSSPGGRLASFLNSLFNQTS 125

Query: 182 SKKKKSKLFAQSVEDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSPGNNSGFRTP 241
           SKKKKSK   QS +D  E      KRR SISHFRS++TA  D K +YSSS   +SGFRTP
Sbjct: 126 SKKKKSKSATQSTKDDDESPGGRRKRRSSISHFRSTSTA--DTKSLYSSS---SSGFRTP 185

Query: 242 PPHVQTPTKSYKEL---------LSFSKFTRHVKSA----ETLEKR-------------- 301
           PP+  TPTKSYK+L         +S S    +VKS     E L+++              
Sbjct: 186 PPYANTPTKSYKDLRSYSDHKQVISLSMQNGNVKSTGLQNEVLDEKKKTDLSWLDEKFKI 245

Query: 302 --ASKQKKKNLSNCSS-EKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDS 361
             A  +K KNL N    EKDR+WV++   +E   K  +KF+         + DDG ++DS
Sbjct: 246 SDALSEKTKNLGNHRYLEKDRIWVDQYPSEE---KGFRKFD---------EVDDGADSDS 305

Query: 362 SSDLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSISNG 369
           SSDLFELQ YDL  YS+GLPVYE+T++DSIK+   ISNG
Sbjct: 306 SSDLFELQNYDLGIYSSGLPVYETTNMDSIKKGAPISNG 322

BLAST of CmaCh05G004550 vs. TrEMBL
Match: B9HIG7_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0008s09770g PE=4 SV=1)

HSP 1 Score: 228.0 bits (580), Expect = 1.9e-56
Identity = 165/334 (49.40%), Postives = 197/334 (58.98%), Query Frame = 1

Query: 70  KKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKDK--GWIKGRISLD 129
           KK  H R  S EL+VFEAARYFS YNE  A  NG  A +T KV ++D    W  GR+SLD
Sbjct: 15  KKSLHRRNDSDELDVFEAARYFSGYNEAGAGYNG--AVYTQKVMREDHKHSWRGGRVSLD 74

Query: 130 MQVKNILNLPEHFPQHSYKVEKHVTKEKKYKQPSSPGGRLASFLNSLFSHSSSKKKKSKL 189
           + ++N   LP H  QHS+ VEK + KEKKYKQPSSPGGRLASFLNSLF+ +SSKKKKSK 
Sbjct: 75  VPMRN--PLPHHLHQHSHTVEKQILKEKKYKQPSSPGGRLASFLNSLFNQTSSKKKKSKS 134

Query: 190 FAQSVEDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSPGNNSGFRTPPPHVQTPT 249
             QS++D  E      KRR SISHFRSS T  TD K +YSSS   +SGF TPPP+  TPT
Sbjct: 135 TTQSMKDDDESPGGRRKRRSSISHFRSSGT--TDTKSLYSSS---SSGFMTPPPYTHTPT 194

Query: 250 KSYKE---------LLSFSKFTRHVKSAETLEKRASKQKKKNLSNCSS------------ 309
           K YKE         ++S  K    VKS     +    +K  +LS                
Sbjct: 195 KGYKEFRSCSDHRQIVSLPKQNGIVKSIAFRNEILDDKKNTDLSWLEEKYKFNDGFSDQK 254

Query: 310 ----------EKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDSSSDLFEL 369
                     EKDR WV++   +E   K+ +KF+         + DDG E+DSSSDLFEL
Sbjct: 255 VPRNRGNQHLEKDRTWVDQYPSEE---KECRKFD---------EVDDGTESDSSSDLFEL 314

BLAST of CmaCh05G004550 vs. TrEMBL
Match: V4VF59_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10032087mg PE=4 SV=1)

HSP 1 Score: 225.7 bits (574), Expect = 9.2e-56
Identity = 157/346 (45.38%), Postives = 202/346 (58.38%), Query Frame = 1

Query: 60  MSLSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKD--K 119
           ++LS ++ I KK FHHR  S EL+VFEAARYFS YNE +A      A F+ K+ ++D  +
Sbjct: 3   LTLSDTDKIYKKSFHHRNDSGELDVFEAARYFSGYNEAAA------AAFSQKILREDHRQ 62

Query: 120 GWIKGRISLDMQVKN--ILNLPEHFPQHSYKVEKHVT-------KEKKYKQPSSPGGRLA 179
            W  GRISLD+ +++  +  LP+H   ++   +K          KEKKYKQPSSPGGRLA
Sbjct: 63  AWRGGRISLDVPLRDSVLQQLPQHHANNAISYQKQSLNTIEKQMKEKKYKQPSSPGGRLA 122

Query: 180 SFLNSLFSHSSSKKKKSKLFAQSVEDIAEDESRTSKRRISISHFRSSNTAA-TDAKFIYS 239
           SFLNSLFS S SKKKKSK   QS++D  E      KRR SISHFRSS+T   + +K +YS
Sbjct: 123 SFLNSLFSQSGSKKKKSKSTTQSLKDEEESPGGRRKRRSSISHFRSSSTTTDSSSKSLYS 182

Query: 240 SSPGNNSGFRTPPPHVQTPTKSYKELLSFSKFTRHVKSAETLEKRASKQ----------- 299
           SS   +SGFRTPP +  TPTKSYK+  S+S     V+S    +     Q           
Sbjct: 183 SS---SSGFRTPPAYAYTPTKSYKDFRSYSDHKHQVESLSLSKHNIGGQVVLKPTTALLQ 242

Query: 300 --------------KKKNLSNCSSEKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDD 359
                         K  N +N    K R WV++   +E   K+ +KF+         + D
Sbjct: 243 NNEVTTILDHHDNFKFNNNTNSEKHKTRNWVDQYSSEE---KEFRKFD---------EVD 302

Query: 360 DGGETDSSSDLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSISNG 369
           DG ++DSSSDLFELQ YDL  YS+GLPVYE+T +DSIKR   ISNG
Sbjct: 303 DGADSDSSSDLFELQNYDLGIYSSGLPVYETTHMDSIKRGAPISNG 327

BLAST of CmaCh05G004550 vs. TAIR10
Match: AT1G69160.1 (AT1G69160.1 unknown protein)

HSP 1 Score: 147.5 bits (371), Expect = 1.6e-35
Identity = 120/329 (36.47%), Postives = 181/329 (55.02%), Query Frame = 1

Query: 61  SLSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKDK--- 120
           S  S ++  +    H+R S+EL+VFEAA YF  YNE S+  +G   K+     +++    
Sbjct: 8   SAESDKLSRRISLTHKRNSEELDVFEAAVYFG-YNEASSGDHGHTQKYGYNAAREENPRR 67

Query: 121 -GWIKG--RISLDMQVK---NILNLPE-HFPQHSYKVEKHVTKEKKYKQPSSPGGRLASF 180
            G + G  RISLD+ ++    + +L + H  +H     K      ++KQPSSPGG++ASF
Sbjct: 68  WGILGGGRRISLDLPIRCSEQVYHLQQDHHEKHEVTTIKERLGNVRHKQPSSPGGKIASF 127

Query: 181 LNSLFSHSSSKKKKSKLFAQS-------VEDIAEDESRTSKRRISISHFRSSN------- 240
           LNSLF  + SKK KSK  +++        E+I        +RR SISHF SS+       
Sbjct: 128 LNSLFHQAGSKKNKSKSKSKTKPTDPEVEEEIPGGGWMRRRRRSSISHFFSSSRSTSTTT 187

Query: 241 --TAATDAKFIYSSSPGNNSGFRTPPPHVQTPTKSYKELLSFSKFTRHVKSAETLEKRAS 300
             TA++ +K + SSS   +SGFRTPPP++ TPTK+YK+ L+++  T+ V   ET   +  
Sbjct: 188 TTTASSSSKSLISSS---SSGFRTPPPYLNTPTKNYKQFLNYTSATKQVGEEETKTNKEY 247

Query: 301 K--QKKKNLSNCSSEKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDSSSD 360
               +K  +    SE  R+W +    D++ + K +             +DDG E+DSSSD
Sbjct: 248 SWLDEKLKVMESLSENQRIWSDDEDIDDDRRIKRE------------GEDDGMESDSSSD 307

Query: 361 LFELQIYDLDYYSNGLPVYESTDIDSIKR 362
           LFELQ Y+L     GLPVYE+T++ +I +
Sbjct: 308 LFELQNYELS--RGGLPVYETTNVANINK 318

BLAST of CmaCh05G004550 vs. TAIR10
Match: AT1G54200.1 (AT1G54200.1 unknown protein)

HSP 1 Score: 51.2 bits (121), Expect = 1.6e-06
Identity = 95/348 (27.30%), Postives = 145/348 (41.67%), Query Frame = 1

Query: 35  SFNSSIYIAFYTKF---CTTSSPLRRTKMSLSSSEMINKKMFHHRRASKELEVFEAARYF 94
           SF+S++    Y       T SS +R+TK        ++         SK L+  E   +F
Sbjct: 27  SFSSTLLDQIYRSIDDSSTNSSSMRKTKHQNREDTRVSANRRDDFNRSKNLKTIEPV-FF 86

Query: 95  SDYNETSATMNGFGAKFTPKVKKKDKGW--IKGRISLDMQVKNILNLPE-HFPQHSYKVE 154
              + +S+  +GF +  +    ++ K    I     +   V+     P+ H P  S K E
Sbjct: 87  KHSSSSSSDSSGFSSSESDYFYRRSKSSPAISHPKPIRTTVERFERSPQNHRPNSSNKQE 146

Query: 155 ------------KHVTKEKKYKQPSSPGGRLASFLNSLFSHSSSKKKKSKLFAQSVEDIA 214
                       K  +  KK KQP SPGGRLA+FLNS+F+ + + KK +K+        A
Sbjct: 147 HGSFLKTKSKALKIYSDLKKVKQPISPGGRLATFLNSIFTGAGNTKKLNKINTTVTSTTA 206

Query: 215 EDESRTSKRRI-SISHFRSSNTAATDAKFIYSSSPGNNSGFRTPPPHVQTPTKSYKELLS 274
              + +S     S S F  S  + T      SSS  +    R  P +V     S      
Sbjct: 207 AAAAASSTTTCSSASSFSRSCLSKTP-----SSSEKSKRSVRFCPVNVIFDEDS------ 266

Query: 275 FSKFTRHVKSAETLEKRASKQKKKNLSNCSSEKDRVWVE--KPLGDEEMKKKLKKFEHDI 334
            SK+           +R  +  +  L N   E++R  +E  K L     KK  +  E  +
Sbjct: 267 -SKYNNKNNKVYGNNEREYESIRHTLENRVMEENRRVIEAAKELLRSYQKKNKEVIEVSV 326

Query: 335 NIIDDIDDDDGGETDSSSDLFE---LQIYDLDYYSNGLPVYESTDIDS 359
              D+ DDDD   + +SSDLFE   L    +D Y   LPVYE+T +++
Sbjct: 327 E-DDEEDDDDDALSCTSSDLFELDNLSAIGIDRYREELPVYETTRLNT 360

BLAST of CmaCh05G004550 vs. TAIR10
Match: AT5G12050.1 (AT5G12050.1 unknown protein)

HSP 1 Score: 48.9 bits (115), Expect = 7.8e-06
Identity = 87/329 (26.44%), Postives = 135/329 (41.03%), Query Frame = 1

Query: 36  FNSSIYIAFYTKFCTTSSPLRRTKMSLSSSEMINKKMFHHRRASKELEVFEAARYFSDYN 95
           F+SS     + K  T+S PL          +  +K +FH  RA+         R + DY+
Sbjct: 75  FSSSDTELTHGKKTTSSRPLCFGPSKTKPRKTEDKTLFHQNRAT---------RVYDDYD 134

Query: 96  ETSATMNGFGAKFTPKVKKKDKGWIKGRISLDMQVKNILNLPEHFPQHSYKVEKHVTKEK 155
             S           PK  + D+ W   R                    + +  K    +K
Sbjct: 135 YASD---------VPKFNRHDENWENTR--------------------NRRSVKSSGNQK 194

Query: 156 KYKQPSSPGGRLASFLNSLFSHSSSKKKKSKLFAQSVEDIAEDESRTSKRRISISHFRSS 215
           K K P+SPGGR+ +FLNSLFS++S +    K + +     + D+S    R+ S  +  S+
Sbjct: 195 KPKTPASPGGRIVNFLNSLFSNNSKQSNAVKSYPRKT---SYDDS-AYVRKTSNDYHSST 254

Query: 216 NTAATDAKFIYSSSPGNNSGFRTPPPHVQTPTK-SYKELLSFSKFTRHVKSAETLEKRAS 275
            T ++ + F  S     N G+      ++   + S   ++    FT   +   +    A 
Sbjct: 255 TTCSSASSFSRSCM---NKGYEKSSGRIKRSVRFSPVNVIVPESFTSKEEDYFS-NGNAR 314

Query: 276 KQKKKNLSNCSSEKDRVWVEKPLGD-----EEMKKKLKKFEHDINIIDDIDDDDGGETDS 335
           K  KKN+ +           + L D     E    K   FE D    D+ DDDD   +DS
Sbjct: 315 KSVKKNVEDGGRRSVEEIAREFLRDYHKNHENSLVKTNGFE-DYEDDDEDDDDDDVASDS 356

Query: 336 SSDLFELQI----YDLDYYSNGLPVYEST 355
           SSDLFEL +    +  + Y + LPVYE+T
Sbjct: 375 SSDLFELDLVGNHHHHNVYGDELPVYETT 356

BLAST of CmaCh05G004550 vs. NCBI nr
Match: gi|449453025|ref|XP_004144259.1| (PREDICTED: uncharacterized protein LOC101221232 [Cucumis sativus])

HSP 1 Score: 474.9 bits (1221), Expect = 1.2e-130
Identity = 266/338 (78.70%), Postives = 284/338 (84.02%), Query Frame = 1

Query: 60  MSLSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKDKGW 119
           MSLSSSE+INKKMFHHRRASKEL+VFEAARYFSDYNETSA+ + FGAKFTPK+KKKDKGW
Sbjct: 1   MSLSSSEIINKKMFHHRRASKELDVFEAARYFSDYNETSASTSSFGAKFTPKMKKKDKGW 60

Query: 120 IKGRISLDMQVKNILNLPEHFPQH-SYKVEKHVTKEKKYKQPSSPGGRLASFLNSLFSHS 179
           IKGRISLDMQVKNILNLP+HFPQH SY VEK VTKEKKYKQPSSPGGRLASFLNS+FSHS
Sbjct: 61  IKGRISLDMQVKNILNLPQHFPQHDSYSVEKQVTKEKKYKQPSSPGGRLASFLNSIFSHS 120

Query: 180 SSKKKKSKLFAQSV-EDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSP-GNNSGF 239
           SSKKKKSK FAQS+ ED+ +DESRTSKRRISISHFR+SN  ATDAKFIYSSSP  NNSGF
Sbjct: 121 SSKKKKSKHFAQSMDEDMEDDESRTSKRRISISHFRTSNATATDAKFIYSSSPRNNNSGF 180

Query: 240 RTPPPHVQTPTKSYKELLSFSKFTRHVKSAETLEK---------------RASKQKKKNL 299
           RTPPPHVQTPTKSYKELLSFSKF R VKSAE LEK                  K KK NL
Sbjct: 181 RTPPPHVQTPTKSYKELLSFSKFNRLVKSAEALEKPSMDDKRIRKDKGVVEKQKMKKNNL 240

Query: 300 S--NCSSEKDRVWVEKPLGDEEMKKKLKKFEHDIN--------IIDDIDDDDGGETDSSS 359
           S  NC SEKDRVWVEK L   EMKKKL+KF+H+IN          +D ++DDGGETDSSS
Sbjct: 241 SNNNCCSEKDRVWVEKNLVGGEMKKKLRKFDHEINKNNTVGGHNNNDDEEDDGGETDSSS 300

Query: 360 DLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSISNGV 370
           DLFELQIYDLDYYSNGLPVYESTDIDSIKRRNS+SN V
Sbjct: 301 DLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSVSNAV 338

BLAST of CmaCh05G004550 vs. NCBI nr
Match: gi|659089306|ref|XP_008445437.1| (PREDICTED: uncharacterized protein LOC103488458 [Cucumis melo])

HSP 1 Score: 249.2 bits (635), Expect = 1.1e-62
Identity = 144/199 (72.36%), Postives = 156/199 (78.39%), Query Frame = 1

Query: 193 EDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSP-GNNSGFRTPPPHVQTPTKSYK 252
           ED+ +DESRTSKRRISISHFR+SN  ATDAKFIYSSSP  NNSGFRTPPPHVQTPTKSYK
Sbjct: 3   EDMEDDESRTSKRRISISHFRTSNATATDAKFIYSSSPKNNNSGFRTPPPHVQTPTKSYK 62

Query: 253 ELLSFSKFTRHVKSAETLE------KRASKQK------KKNLSNCSSEKDRVWVEKPLGD 312
           ELLSFSKF R VKSAE LE      KR  K K      K   +NC SEKDRVWVEK L  
Sbjct: 63  ELLSFSKFNRLVKSAEALEKPTMDDKRIRKDKGVVEKQKMKKNNCCSEKDRVWVEKNLVG 122

Query: 313 EEMKKKLKKFEHDINIIDDI---------DDDDGGETDSSSDLFELQIYDLDYYSNGLPV 370
           EEMKKKL+KF+H+IN  ++          ++DDGGETDSSSDLFELQIYDLDYYSNGLPV
Sbjct: 123 EEMKKKLRKFDHEINKNNNSVGDHNNHNDEEDDGGETDSSSDLFELQIYDLDYYSNGLPV 182

BLAST of CmaCh05G004550 vs. NCBI nr
Match: gi|225423416|ref|XP_002263547.1| (PREDICTED: uncharacterized protein LOC100266436 [Vitis vinifera])

HSP 1 Score: 231.9 bits (590), Expect = 1.8e-57
Identity = 164/338 (48.52%), Postives = 212/338 (62.72%), Query Frame = 1

Query: 61  SLSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKDKGWI 120
           +LS ++ I KK FH R  S EL+VFEAARYFS  NE     NG        ++++ +GW 
Sbjct: 5   ALSEADKIYKKSFHRRNDSGELDVFEAARYFSGGNEIIG-YNGAAFPQRMMMREERQGWR 64

Query: 121 KGRISLDMQVKNILNLPEHFPQHSYKVEKHVTKEKKYKQPSSPGGRLASFLNSLFSHSSS 180
            GRISLDM +++  +LP    Q S+ VEK + ++ KYKQPSSPGGRLASFLNSLF+ ++S
Sbjct: 65  GGRISLDMPMRS--SLPT---QSSHAVEKQMKEKIKYKQPSSPGGRLASFLNSLFNQTNS 124

Query: 181 KKKKSKLFAQSVEDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSPGNNSGFRTPP 240
           KKKKSK  AQS++D  E      KRR SISHFRSS+TA  D+K +YSSS   +SGFRTPP
Sbjct: 125 KKKKSKSTAQSIKDEEESPGGRRKRRSSISHFRSSSTA--DSKSVYSSS---SSGFRTPP 184

Query: 241 PHVQTPTKSYKELLSFSKFTR----------HVKSA----ETLEKRASK----------- 300
           P+  TPTK+YK+L S+S   +          +VK+     E L+++  K           
Sbjct: 185 PYANTPTKTYKDLRSYSDHRQVVSLPNYNNGNVKATGLRNEALDEKRIKELVWLDEKFKF 244

Query: 301 -----QKKKNLSNCSSEKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDSS 360
                +K KN SN  SEKDR+WV++   +E+  +KL          D+I  D G E+DSS
Sbjct: 245 SSGFSEKHKNFSNGLSEKDRIWVDEYPSEEKEFRKL----------DEI--DAGAESDSS 304

Query: 361 SDLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSISNG 369
           SDLFELQ YDL  YS+GLPVYE+T +DSIKR   ISNG
Sbjct: 305 SDLFELQNYDLGCYSSGLPVYETTHMDSIKRGAPISNG 319

BLAST of CmaCh05G004550 vs. NCBI nr
Match: gi|255541978|ref|XP_002512053.1| (PREDICTED: protein BIG GRAIN 1-like E [Ricinus communis])

HSP 1 Score: 231.1 bits (588), Expect = 3.1e-57
Identity = 162/339 (47.79%), Postives = 205/339 (60.47%), Query Frame = 1

Query: 62  LSSSEMINKKMFHHRRASKELEVFEAARYFSDYNETSATMNGFGAKFTPKVKKKD--KGW 121
           L  +  + KK  H R  S EL+VFEAARYFS YNE +    G    +T K+ + D    W
Sbjct: 6   LPDTSKLYKKSLHRRNDSDELDVFEAARYFSGYNEAAGYNGG---TYTQKILRDDYRHPW 65

Query: 122 IKGRISLDMQVKNILNLPEHFPQHSYKVEKHVTKEKKYKQPSSPGGRLASFLNSLFSHSS 181
             GR+SLD+ ++N   LP+    H + VEK + KEKKYKQPSSPGGRLASFLNSLF+ +S
Sbjct: 66  RGGRMSLDVPMRN--PLPQQTHSHHHTVEKQILKEKKYKQPSSPGGRLASFLNSLFNQTS 125

Query: 182 SKKKKSKLFAQSVEDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSPGNNSGFRTP 241
           SKKKKSK   QS +D  E      KRR SISHFRS++TA  D K +YSSS   +SGFRTP
Sbjct: 126 SKKKKSKSATQSTKDDDESPGGRRKRRSSISHFRSTSTA--DTKSLYSSS---SSGFRTP 185

Query: 242 PPHVQTPTKSYKEL---------LSFSKFTRHVKSA----ETLEKR-------------- 301
           PP+  TPTKSYK+L         +S S    +VKS     E L+++              
Sbjct: 186 PPYANTPTKSYKDLRSYSDHKQVISLSMQNGNVKSTGLQNEVLDEKKKTDLSWLDEKFKI 245

Query: 302 --ASKQKKKNLSNCSS-EKDRVWVEKPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDS 361
             A  +K KNL N    EKDR+WV++   +E   K  +KF+         + DDG ++DS
Sbjct: 246 SDALSEKTKNLGNHRYLEKDRIWVDQYPSEE---KGFRKFD---------EVDDGADSDS 305

Query: 362 SSDLFELQIYDLDYYSNGLPVYESTDIDSIKRRNSISNG 369
           SSDLFELQ YDL  YS+GLPVYE+T++DSIK+   ISNG
Sbjct: 306 SSDLFELQNYDLGIYSSGLPVYETTNMDSIKKGAPISNG 322

BLAST of CmaCh05G004550 vs. NCBI nr
Match: gi|694331538|ref|XP_009356443.1| (PREDICTED: uncharacterized protein LOC103947275 [Pyrus x bretschneideri])

HSP 1 Score: 229.6 bits (584), Expect = 9.1e-57
Identity = 159/323 (49.23%), Postives = 211/323 (65.33%), Query Frame = 1

Query: 65  SEMINKKMFHHRRASKELEVFEAARYFSDYNET-SATMNGFGAKFTPKVKKKDKG-WIKG 124
           ++ I+KK+FHHR  S EL+VFEAARYFS YNET S+  N   + F+ ++ K+D+  W  G
Sbjct: 15  ADKIHKKLFHHRNDSGELDVFEAARYFSGYNETPSSHKNNKTSAFSQRMMKEDRSSWRGG 74

Query: 125 RISLDMQVKNILNLPEHFPQHSYKV--EKHVT-KEKKYKQPSSPGGRLASFLNSLFSHSS 184
           RISLDM ++++L+ P+  P H   V  EK    K+KKYKQPSSPGGRLASFLNSLF+ S+
Sbjct: 75  RISLDMPIRHMLHHPQQNPNHHNHVAMEKQSNIKDKKYKQPSSPGGRLASFLNSLFNQSA 134

Query: 185 SKKKKSKLFA-QSVEDIAEDESRTSKRRISISHFRSSNTAATDAKFIYSSSPGNNSGFRT 244
           SKKKKSK  A QS++D  E      KRR SISHFRSS+T  TDAK +YSSS   +SGFRT
Sbjct: 135 SKKKKSKSSATQSMKDEEESPGGRRKRRSSISHFRSSST--TDAKSVYSSS---SSGFRT 194

Query: 245 PPP--HVQ-TPTKSYKELLSFS-----KFTRHVKSAETLEKRASKQKKKNLSNCSSEKDR 304
           PPP  H Q   + SYK+L S+S     ++ +  +  +T+    SK   K     SS K+ 
Sbjct: 195 PPPYSHAQMVASNSYKDLRSYSDNHKQQYQQQQQQQQTVS--LSKYNTKFDEKRSSNKEL 254

Query: 305 VWVE----------KPLGDEEMKKKLKKFEHDINIIDDIDDDDGGETDSSSDLFELQIYD 364
            W++          K   D++ K  LK+      ++DD  +D+G E+DSSSDLFELQ YD
Sbjct: 255 TWLDHEKFKLSEKYKISSDQDHKGLLKRLS---EVVDDEQEDEGAESDSSSDLFELQNYD 314

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
BIG1E_ARATH2.9e-3436.47Protein BIG GRAIN 1-like E OS=Arabidopsis thaliana GN=At1g69160 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KD85_CUCSA8.7e-13178.70Uncharacterized protein OS=Cucumis sativus GN=Csa_6G366580 PE=4 SV=1[more]
F6HF00_VITVI1.3e-5748.52Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00120 PE=4 SV=... [more]
B9REB4_RICCO2.2e-5747.79Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1619950 PE=4 SV=1[more]
B9HIG7_POPTR1.9e-5649.40Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0008s09770g PE=4 SV=1[more]
V4VF59_9ROSI9.2e-5645.38Uncharacterized protein OS=Citrus clementina GN=CICLE_v10032087mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G69160.11.6e-3536.47 unknown protein[more]
AT1G54200.11.6e-0627.30 unknown protein[more]
AT5G12050.17.8e-0626.44 unknown protein[more]
Match NameE-valueIdentityDescription
gi|449453025|ref|XP_004144259.1|1.2e-13078.70PREDICTED: uncharacterized protein LOC101221232 [Cucumis sativus][more]
gi|659089306|ref|XP_008445437.1|1.1e-6272.36PREDICTED: uncharacterized protein LOC103488458 [Cucumis melo][more]
gi|225423416|ref|XP_002263547.1|1.8e-5748.52PREDICTED: uncharacterized protein LOC100266436 [Vitis vinifera][more]
gi|255541978|ref|XP_002512053.1|3.1e-5747.79PREDICTED: protein BIG GRAIN 1-like E [Ricinus communis][more]
gi|694331538|ref|XP_009356443.1|9.1e-5749.23PREDICTED: uncharacterized protein LOC103947275 [Pyrus x bretschneideri][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh05G004550.1CmaCh05G004550.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR33541FAMILY NOT NAMEDcoord: 62..369
score: 1.4
NoneNo IPR availablePANTHERPTHR33541:SF3SUBFAMILY NOT NAMEDcoord: 62..369
score: 1.4