Csa1G002030.1 (mRNA) Cucumber (Chinese Long) v2

Name: Csa1G002030.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: F-box family protein; contains IPR001810 (F-box domain, cyclin-like), IPR002893 (Zinc finger, MYND-type), IPR011990 (Tetratricopeptide-like helical)
Location: Chr1:303161..304602 (+)
Sequence length: 1065
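The location and length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this record) and that the listed sequence length refers to the spliced coding sequence; the function name `span_length` is illustrative only.

```python
# Values copied from the record header.
START, END = 303161, 304602   # Chr1 coordinates, assumed 1-based inclusive
LISTED_LENGTH = 1065          # "Sequence length" field, assumed to be the CDS length


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


genomic_span = span_length(START, END)
codons, remainder = divmod(LISTED_LENGTH, 3)

print(genomic_span)       # 1442 nt covered on the chromosome
print(codons, remainder)  # 355 0: 354 residues plus one stop codon
```

Under these assumptions the genomic span (1442 nt) is longer than the listed length (1065 nt) because it includes introns and untranslated regions, and 355 codons agree with the 354-residue protein plus a stop codon.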
The following sequences are available for this feature:

Gene sequence (with intron)

ACAAAAAAACAAAGAGAGAAAAGATTTCTAAAAAAGAGGAAGCAGAACGGTTACCATTTCCGATTTCCCACACCAAGAGAACGCCGCCGTACCACACTCCTAAATCTCTACTCTCAACGCATGTAATCCCCAAACTCTTCAACAATGATTTCCTCCAAGAGGCCAAGAACATGCCGTAATTTCACCGTAGATTCTGATCTATTCGATGTCTTGCCCGACGATCTCCTTATACATCTCCTCTCTCGCCTTGCCGCCTCCGCCTCCTCCCCTTCTGACCTCCTCAACCTTCTCTTGACGTAATTACATATATGATATATCCCTGTTTTCGTTTTCTTTTTCTTTTTTCCTCTGTTTTTTTTTTTTTTTTTTTTTGGTTTTGTAAATTTCTCTTCTTTTTTGGGATGATTTTGCAGATGTAAGAGGTTGAATCGCTTGGTTCTTAATCCTATGGTTTTGTGTAAAGCCGGACCGAAGGCGTTCGCAGTAAGAATGCGAAACTGGTCTGATTCCGCTCACCGATTTCTGAAACGCTGCGTCGATGCCGGTAATTCGGAAGCGAGTTACACGCTTGGAATGGTTAGTTTTTTTTGAAAATTTTGAAGAAATTTTTACTCTGTATTGGGTTTGAGCGAGGATTGATGATATCTGTATTTTTTTTCCTTTGTAGATCCGATTTTATTGTTTGAGGAATCGAGGGAGTGGAGCTTCCTTAATGGCGAAAGCAGCGATTAAATCTCACGCTCCGGCGCTATACTCACTCGCCGTCATTCAATTCAACGGCAGCGGTGGGTCTAAAAGCGACAAGGATCTTCAGGCAGGCGTAGCGCTCTGTGCTCGAGCCGCTTTTCTAGGTCATGTGGACGCACTTCGCGAACTCGGCCACTGCCTTCAAGATGGTTACGGTGTGAGACAGAACTCCGACGAAGGCCGCCGCCTCCTCGTCCAAGCCAACGCCCGTGAGCTTGCTACGGTTTTACTCTCTTCCTCGAGTACATGGCAACAACAGCGACACAACCAGTCCGGCAACCTCCCTGATTTGACAGCGACTCGTTGTTCGTTGCTGAGTGATTTCGGTTGTAACGTTCCAGCACCAGAGCCACATCCGGTGAACCTGTTTTTGAGAGAGTGGTTCGAGTCGGAAGGTGAAGTGGCAGCGCGTGTGGGTCTGAGACTTTGTTCGCACAGTGGATGCGGTCGGGGTGAGACTCGGCCGCACGAGTTCCGACGGTGCTCAGTTTGTGGGACAGTAAATTACTGTTCAAGAGGATGCCAAGCGCAGGATTGGAAAGTCCGGCATAAAGAAGAGTGTACGACGGTGCAGCGGTGGCGGGATGAGGATGCTAACAATGCCGGCGAGATGTTTGGTATTGTTGAAGAAGAAGTTGAGGACGCCAACAATATCGTGGGTATTGTATAATCGTAACGGCGCTGTAATTTTGTCT

mRNA sequence

ATGATTTCCTCCAAGAGGCCAAGAACATGCCGTAATTTCACCGTAGATTCTGATCTATTCGATGTCTTGCCCGACGATCTCCTTATACATCTCCTCTCTCGCCTTGCCGCCTCCGCCTCCTCCCCTTCTGACCTCCTCAACCTTCTCTTGACATGTAAGAGGTTGAATCGCTTGGTTCTTAATCCTATGGTTTTGTGTAAAGCCGGACCGAAGGCGTTCGCAGTAAGAATGCGAAACTGGTCTGATTCCGCTCACCGATTTCTGAAACGCTGCGTCGATGCCGGTAATTCGGAAGCGAGTTACACGCTTGGAATGATCCGATTTTATTGTTTGAGGAATCGAGGGAGTGGAGCTTCCTTAATGGCGAAAGCAGCGATTAAATCTCACGCTCCGGCGCTATACTCACTCGCCGTCATTCAATTCAACGGCAGCGGTGGGTCTAAAAGCGACAAGGATCTTCAGGCAGGCGTAGCGCTCTGTGCTCGAGCCGCTTTTCTAGGTCATGTGGACGCACTTCGCGAACTCGGCCACTGCCTTCAAGATGGTTACGGTGTGAGACAGAACTCCGACGAAGGCCGCCGCCTCCTCGTCCAAGCCAACGCCCGTGAGCTTGCTACGGTTTTACTCTCTTCCTCGAGTACATGGCAACAACAGCGACACAACCAGTCCGGCAACCTCCCTGATTTGACAGCGACTCGTTGTTCGTTGCTGAGTGATTTCGGTTGTAACGTTCCAGCACCAGAGCCACATCCGGTGAACCTGTTTTTGAGAGAGTGGTTCGAGTCGGAAGGTGAAGTGGCAGCGCGTGTGGGTCTGAGACTTTGTTCGCACAGTGGATGCGGTCGGGGTGAGACTCGGCCGCACGAGTTCCGACGGTGCTCAGTTTGTGGGACAGTAAATTACTGTTCAAGAGGATGCCAAGCGCAGGATTGGAAAGTCCGGCATAAAGAAGAGTGTACGACGGTGCAGCGGTGGCGGGATGAGGATGCTAACAATGCCGGCGAGATGTTTGGTATTGTTGAAGAAGAAGTTGAGGACGCCAACAATATCGTGGGTATTGTATAA

Coding sequence (CDS)

ATGATTTCCTCCAAGAGGCCAAGAACATGCCGTAATTTCACCGTAGATTCTGATCTATTCGATGTCTTGCCCGACGATCTCCTTATACATCTCCTCTCTCGCCTTGCCGCCTCCGCCTCCTCCCCTTCTGACCTCCTCAACCTTCTCTTGACATGTAAGAGGTTGAATCGCTTGGTTCTTAATCCTATGGTTTTGTGTAAAGCCGGACCGAAGGCGTTCGCAGTAAGAATGCGAAACTGGTCTGATTCCGCTCACCGATTTCTGAAACGCTGCGTCGATGCCGGTAATTCGGAAGCGAGTTACACGCTTGGAATGATCCGATTTTATTGTTTGAGGAATCGAGGGAGTGGAGCTTCCTTAATGGCGAAAGCAGCGATTAAATCTCACGCTCCGGCGCTATACTCACTCGCCGTCATTCAATTCAACGGCAGCGGTGGGTCTAAAAGCGACAAGGATCTTCAGGCAGGCGTAGCGCTCTGTGCTCGAGCCGCTTTTCTAGGTCATGTGGACGCACTTCGCGAACTCGGCCACTGCCTTCAAGATGGTTACGGTGTGAGACAGAACTCCGACGAAGGCCGCCGCCTCCTCGTCCAAGCCAACGCCCGTGAGCTTGCTACGGTTTTACTCTCTTCCTCGAGTACATGGCAACAACAGCGACACAACCAGTCCGGCAACCTCCCTGATTTGACAGCGACTCGTTGTTCGTTGCTGAGTGATTTCGGTTGTAACGTTCCAGCACCAGAGCCACATCCGGTGAACCTGTTTTTGAGAGAGTGGTTCGAGTCGGAAGGTGAAGTGGCAGCGCGTGTGGGTCTGAGACTTTGTTCGCACAGTGGATGCGGTCGGGGTGAGACTCGGCCGCACGAGTTCCGACGGTGCTCAGTTTGTGGGACAGTAAATTACTGTTCAAGAGGATGCCAAGCGCAGGATTGGAAAGTCCGGCATAAAGAAGAGTGTACGACGGTGCAGCGGTGGCGGGATGAGGATGCTAACAATGCCGGCGAGATGTTTGGTATTGTTGAAGAAGAAGTTGAGGACGCCAACAATATCGTGGGTATTGTATAA

Protein sequence

MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASLMAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQDGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDFGCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVNYCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEEVEDANNIVGIV*
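The relationship between the coding sequence and the protein sequence can be checked by translating with the standard genetic code. This is a minimal sketch using only the Python standard library; `translate` and the two short excerpts (the first 42 and last 12 nucleotides of the CDS above) are chosen for illustration, and the standard nuclear code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGATTTCCTCCAAGAGGCCAAGAACATGCCGTAATTTCACC"  # first 14 codons of the CDS
cds_end = "GGTATTGTATAA"                                   # last 4 codons of the CDS

print(translate(cds_start))  # MISSKRPRTCRNFT
print(translate(cds_end))    # GIV*
```

Both excerpts reproduce the corresponding ends of the protein sequence, including the terminal `*` for the TAA stop codon.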
BLAST of Csa1G002030.1 vs. Swiss-Prot
Match: FB76_ARATH (F-box protein At1g67340 OS=Arabidopsis thaliana GN=At1g67340 PE=1 SV=1)

HSP 1 Score: 446.4 bits (1147), Expect = 2.9e-124
Identity = 217/326 (66.56%), Positives = 247/326 (75.77%), Query Frame = 1

Query: 14  TVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNPMVLCKAGPKAF 73
           T  +DL D +PDDL+I +L +L +++  P+D +N+LLTCKRL  L +NP+VL +  PKA 
Sbjct: 38  TTGADLLDSIPDDLVISILCKLGSTSRCPADFINVLLTCKRLKGLAMNPIVLSRLSPKAI 97

Query: 74  AVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASLMAKAAIKSHAPAL 133
           AV+  NWS+ +HRFLKRCVDAG+ EA YTLGMIRFYCL+NRG+GASLMAKAAI SHAPAL
Sbjct: 98  AVKAHNWSEYSHRFLKRCVDAGSLEACYTLGMIRFYCLQNRGNGASLMAKAAISSHAPAL 157

Query: 134 YSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQDGYGVRQNSDEGR 193
           YSLAVIQFNGSGGSK+DKDL+AGVALCARAAFLGHVDALRELGHCLQDGYGV QN  EGR
Sbjct: 158 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQDGYGVPQNVSEGR 217

Query: 194 RLLVQANARELATVL---LSSSSTWQQQRHNQSGNLPDLTATRCSLLSDFGCNVPAPEPH 253
           R LVQANARELA VL   + + STW          +P+     C LLSDFGCNVPAPE H
Sbjct: 218 RFLVQANARELAAVLSSGIQARSTWLSLSQPPPPVVPNHGQQTCPLLSDFGCNVPAPETH 277

Query: 254 PVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVNYCSRGCQAQD 313
           P N FL +WF   G      GLRLCSH+GCGR ETR HEFRRCSVCG VNYCSR CQA D
Sbjct: 278 PANRFLADWFAVRGGDCPGDGLRLCSHAGCGRPETRKHEFRRCSVCGVVNYCSRACQALD 337

Query: 314 WKVRHKEECTTVQRWRDEDANNAGEM 337
           WK+RHK +C  VQRW +E     G +
Sbjct: 338 WKLRHKMDCAPVQRWLEEGDGGEGNV 363
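
The percentages on each HSP summary line are simply the matched positions divided by the alignment length. The sketch below recomputes them from the fractions for the FB76_ARATH hit; `hsp_percentages` is a hypothetical helper, and its pattern also accepts the misspelling "Postives" that appears in some renderings of this report.

```python
import re

# Matches "Identity = 217/326" and "Positives = 247/326" (or "Postives").
FRACTION = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Recompute identity/positives percentages from an HSP summary line."""
    result = {}
    for label, matched, total in FRACTION.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(total), 2)
    return result


line = "Identity = 217/326 (66.56%), Positives = 247/326 (75.77%), Query Frame = 1"
print(hsp_percentages(line))  # {'identity': 66.56, 'positives': 75.77}
```

The denominator is the alignment length including gap columns, so it can differ from the number of query residues covered by the HSP.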

BLAST of Csa1G002030.1 vs. Swiss-Prot
Match: FB342_ARATH (F-box protein At5g50450 OS=Arabidopsis thaliana GN=At5g50450 PE=2 SV=1)

HSP 1 Score: 393.7 bits (1010), Expect = 2.2e-108
Identity = 204/314 (64.97%), Positives = 237/314 (75.48%), Query Frame = 1

Query: 12  NFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNPMVLCKAGPK 71
           N TV++  F+ L DDL+I +L +LA SASSPSD L +L TCKRLNRL L+P+VL KAG +
Sbjct: 15  NNTVNNH-FEDLHDDLIISILRKLATSASSPSDFLTVLSTCKRLNRLGLHPLVLSKAGTQ 74

Query: 72  AFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASLMAKAAIKSHAP 131
             AV    WSDS+H+FLK CV+AGN +ASY+LGMIRFYCL+N  SGASLMAKAAIKSHAP
Sbjct: 75  TLAVTAEKWSDSSHKFLKLCVNAGNIDASYSLGMIRFYCLQNPVSGASLMAKAAIKSHAP 134

Query: 132 ALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQDGYGVRQNSDE 191
           ALYSL+VIQFNGSGGSK+DK+L+AGVALCAR+A+LGHVDALRELGHCLQDGYGV ++  E
Sbjct: 135 ALYSLSVIQFNGSGGSKTDKNLRAGVALCARSAYLGHVDALRELGHCLQDGYGVPRDVSE 194

Query: 192 GRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDFGCNVPAPEPHP 251
           GRRLL+QANARELA  L S  S    +  +++  L DL+             VP  E HP
Sbjct: 195 GRRLLIQANARELACSLRSYLSL---KSGDENETLTDLSV------------VPVQEIHP 254

Query: 252 VNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVNYCSRGCQAQDW 311
           VN FL+EWF S G V    GLR+CSH GCGR ETR HEFRRCSVCG VNYCSRGCQA DW
Sbjct: 255 VNRFLKEWF-SSGRVDLAEGLRMCSHGGCGRPETRAHEFRRCSVCGKVNYCSRGCQALDW 311

Query: 312 KVRHKEECTTVQRW 326
           + +HK ECT +  W
Sbjct: 315 RAKHKVECTPLDLW 311

BLAST of Csa1G002030.1 vs. Swiss-Prot
Match: FB79_ARATH (Putative F-box protein At1g67623 OS=Arabidopsis thaliana GN=At1g67623 PE=3 SV=1)

HSP 1 Score: 53.9 bits (128), Expect = 4.1e-06
Identity = 38/128 (29.69%), Positives = 59/128 (46.09%), Query Frame = 1

Query: 21  DVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNPMVLCKAGPKAFAVRMRNW 80
           D LP+DLL+ + S     ASS S + NL L  K   R+     V  +   K   +    W
Sbjct: 25  DSLPEDLLVEISS--CTGASSLSAVRNLRLVSKSFRRICDEKYVFYRLSLKE--IEFLPW 84

Query: 81  SDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASLMAKAAIKSHAPALYSLAVIQ 140
            +++ +F++RC ++ N EA +  G I ++  + +  G   +A+AA K    A Y   VI 
Sbjct: 85  HENSAKFIERCTESRNPEALFQKGFINYFRDKLQDRGLEYLAEAAEKGIKEAKYVYGVIL 144

Query: 141 FNGSGGSK 149
               G +K
Sbjct: 145 ICLGGKTK 148

BLAST of Csa1G002030.1 vs. TrEMBL
Match: A0A0A0LNX1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G002030 PE=4 SV=1)

HSP 1 Score: 723.4 bits (1866), Expect = 1.3e-205
Identity = 354/354 (100.00%), Positives = 354/354 (100.00%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL
Sbjct: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL
Sbjct: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDF 240
           DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDF
Sbjct: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDF 240

Query: 241 GCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVN 300
           GCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVN
Sbjct: 241 GCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVN 300

Query: 301 YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEEVEDANNIVGIV 355
           YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEEVEDANNIVGIV
Sbjct: 301 YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEEVEDANNIVGIV 354

BLAST of Csa1G002030.1 vs. TrEMBL
Match: A0A0D2RJ28_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G206200 PE=4 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 4.0e-133
Identity = 242/353 (68.56%), Positives = 278/353 (78.75%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           M+  K+ R  R FT  SDLFD LP+DL+I +LS+LA+SASSPSD +N+L+TCKRLNRL L
Sbjct: 1   MLQRKKQRISRKFTDKSDLFDGLPEDLVISILSKLASSASSPSDFINILVTCKRLNRLAL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           +P+VL KAG KA AV+ RNW DSAH FLK C++AGN EA YTLGMI+FYCL+NRGSGASL
Sbjct: 61  HPLVLSKAGSKALAVKARNWCDSAHHFLKHCINAGNVEACYTLGMIQFYCLQNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           +AKAAIKSHAPALYSL VIQFNGSGGSK+DKDL+AGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 LAKAAIKSHAPALYSLGVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLT---ATRCSLL 240
           DGYGVRQ+  EGRRLL++ANARELA+ L +      QQ+H +  N    +    + C LL
Sbjct: 181 DGYGVRQHIAEGRRLLIRANARELASSLNALVKRKPQQQHQRRLNYQHYSFKAGSGCPLL 240

Query: 241 SDFGCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCG 300
           SDFGCNVPAPE HPVN+F +EWFES G  A   GLRLCSH GCGR ETR HEFRRCSVCG
Sbjct: 241 SDFGCNVPAPEVHPVNVFSKEWFES-GRGALGQGLRLCSHKGCGRPETRAHEFRRCSVCG 300

Query: 301 TVNYCSRGCQAQDWKVRHKEECTTVQRWRDEDAN-------NAGEMFGIVEEE 344
           TVNYCSRGCQA DWK+RHK EC  ++RW +E  N         GEM  +VE E
Sbjct: 301 TVNYCSRGCQALDWKLRHKAECGPIERWLEEGGNGDDGGHGGVGEMEEVVEAE 352

BLAST of Csa1G002030.1 vs. TrEMBL
Match: A0A0B0PKV7_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_02830 PE=4 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 4.0e-133
Identity = 243/353 (68.84%), Positives = 277/353 (78.47%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           M+  K+ R  R FT  SDLFD LP+DL+I +LS+LA SASSPSD +N+L+TCKRLNRL L
Sbjct: 1   MLQRKKQRISRKFTDKSDLFDGLPEDLVISILSKLAYSASSPSDFINILVTCKRLNRLAL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           +P+VL KAG KA AV+  NW DSAH FLK C++AGN EA YTLGMI+FYCL+NRGSGASL
Sbjct: 61  HPLVLSKAGSKALAVKAGNWCDSAHHFLKHCINAGNVEACYTLGMIQFYCLQNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           +AKAAIKSHAPALYSL VIQFNGSGGSK+DKDL+AGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 LAKAAIKSHAPALYSLGVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLT---ATRCSLL 240
           DGYGVRQ+  EGRRLL+QANARELA+ L +      QQ+H +  N    +    + C LL
Sbjct: 181 DGYGVRQHIAEGRRLLIQANARELASSLNALVKRKAQQQHQRRLNYQHYSFKAGSGCPLL 240

Query: 241 SDFGCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCG 300
           SDFGCNVPAPE HPVN+FL+EWFES G  A   GLRLCSH GCGR ETR HEFRRCSVCG
Sbjct: 241 SDFGCNVPAPEVHPVNVFLKEWFES-GRGALGQGLRLCSHKGCGRPETRAHEFRRCSVCG 300

Query: 301 TVNYCSRGCQAQDWKVRHKEECTTVQRWRDEDAN-------NAGEMFGIVEEE 344
           TVNYCSRGCQA DWK+RHK EC  ++RW +E  N         GEM  +VE E
Sbjct: 301 TVNYCSRGCQALDWKLRHKAECGPIERWLEEGGNGNDGGHGGVGEMEEVVEAE 352

BLAST of Csa1G002030.1 vs. TrEMBL
Match: A0A061FZI2_THECC (HCP-like superfamily protein with MYND-type zinc finger OS=Theobroma cacao GN=TCM_014641 PE=4 SV=1)

HSP 1 Score: 479.9 bits (1234), Expect = 2.6e-132
Identity = 241/355 (67.89%), Positives = 283/355 (79.72%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           M+  K+ RT R  T  SDLFD +PDDL++ +L +L++S S P+D +N+LLTCKRLNRL L
Sbjct: 1   MLQKKKQRTTRKITDKSDLFDGVPDDLVVSILCKLSSSVSCPTDFVNILLTCKRLNRLGL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           +P+VL KAG KA AV+ +NWSDSAHRFLK CV AGN EA YTLGMIRFYCL+NRGSGASL
Sbjct: 61  HPLVLSKAGSKALAVKAKNWSDSAHRFLKHCVSAGNIEACYTLGMIRFYCLQNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           MAKAA+KSHAPALYSLAVIQFNGSGGSK++KDL+AGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 MAKAAMKSHAPALYSLAVIQFNGSGGSKNNKDLRAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPD---LTATRCSLL 240
           DGYGVRQN  EGRRLL+QANARELA+ L +      +Q+H +  N      +T + C LL
Sbjct: 181 DGYGVRQNITEGRRLLIQANARELASSLNTVVKRQLKQQHQRRLNYQHYAYMTGSGCPLL 240

Query: 241 SDFGCNVPAPEPHPVNLFLREWFESE-GEVAARVGLRLCSHSGCGRGETRPHEFRRCSVC 300
           SDFGCNVP PE HPV++FL+EWFES  GE+    GLRLCSH GCGR ETR HEFRRCSVC
Sbjct: 241 SDFGCNVPVPEGHPVHVFLKEWFESGLGELGQ--GLRLCSHKGCGRPETRAHEFRRCSVC 300

Query: 301 GTVNYCSRGCQAQDWKVRHKEECTTVQRWRDEDAN-NAGEMFGIVEEEVEDANNI 351
           GTVNYCSRGCQA DWK+RHK EC  ++RW++E  N N G+      EEV +A ++
Sbjct: 301 GTVNYCSRGCQALDWKLRHKVECGPMERWQEEGGNGNGGDGGAGGMEEVGEAEDL 353

BLAST of Csa1G002030.1 vs. TrEMBL
Match: B9RAG1_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1506210 PE=4 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 1.3e-131
Identity = 241/350 (68.86%), Positives = 272/350 (77.71%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           M   KR +T R     SDLFD LPDD+++ +LS+L++SAS PSD +N+L TCKRLNRL L
Sbjct: 1   MYQRKRQKTSRRTPEKSDLFDELPDDIVVCILSKLSSSASCPSDFINILFTCKRLNRLAL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
            P+VL KAGP+ FAV+ +NWSDSAHRFLK C++AGN+EASYTLGMIRFYCL+NRG GASL
Sbjct: 61  QPVVLSKAGPQTFAVKAKNWSDSAHRFLKLCINAGNTEASYTLGMIRFYCLQNRGVGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           MAKAAIKSHAPALYSLAV+QFNGSGGSK DKDL+AGV+LCARAA LGH+DALRELGHCLQ
Sbjct: 121 MAKAAIKSHAPALYSLAVMQFNGSGGSKIDKDLRAGVSLCARAAVLGHIDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQ--QQRHNQSGNLPDLTATRCSLLS 240
           DGYGV QN  EGRRLLVQANARELA+  L S  TWQ   Q H Q  +   + +  C LLS
Sbjct: 181 DGYGVAQNIAEGRRLLVQANARELAS-SLRSMLTWQPHNQHHRQYASCEVMESAGCPLLS 240

Query: 241 DFGCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGT 300
           DFGCNVPA E HP N FL EWFES G      GLRLCSHSGCGR ETRPHEFRRCSVCGT
Sbjct: 241 DFGCNVPAREVHPANRFLSEWFES-GRGLLGPGLRLCSHSGCGRPETRPHEFRRCSVCGT 300

Query: 301 VNYCSRGCQAQDWKVRHKEECTTVQRW----RDEDANNAGEMFGIVEEEV 345
           VNYCSRGCQA DWK+RHK EC  +++W     D   N  G M  I E E+
Sbjct: 301 VNYCSRGCQALDWKLRHKMECVPLEQWLVVEDDGHDNEIGAMVEIEEREI 348

BLAST of Csa1G002030.1 vs. TAIR10
Match: AT1G67340.1 (AT1G67340.1 HCP-like superfamily protein with MYND-type zinc finger)

HSP 1 Score: 446.4 bits (1147), Expect = 1.6e-125
Identity = 217/326 (66.56%), Positives = 247/326 (75.77%), Query Frame = 1

Query: 14  TVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNPMVLCKAGPKAF 73
           T  +DL D +PDDL+I +L +L +++  P+D +N+LLTCKRL  L +NP+VL +  PKA 
Sbjct: 38  TTGADLLDSIPDDLVISILCKLGSTSRCPADFINVLLTCKRLKGLAMNPIVLSRLSPKAI 97

Query: 74  AVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASLMAKAAIKSHAPAL 133
           AV+  NWS+ +HRFLKRCVDAG+ EA YTLGMIRFYCL+NRG+GASLMAKAAI SHAPAL
Sbjct: 98  AVKAHNWSEYSHRFLKRCVDAGSLEACYTLGMIRFYCLQNRGNGASLMAKAAISSHAPAL 157

Query: 134 YSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQDGYGVRQNSDEGR 193
           YSLAVIQFNGSGGSK+DKDL+AGVALCARAAFLGHVDALRELGHCLQDGYGV QN  EGR
Sbjct: 158 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQDGYGVPQNVSEGR 217

Query: 194 RLLVQANARELATVL---LSSSSTWQQQRHNQSGNLPDLTATRCSLLSDFGCNVPAPEPH 253
           R LVQANARELA VL   + + STW          +P+     C LLSDFGCNVPAPE H
Sbjct: 218 RFLVQANARELAAVLSSGIQARSTWLSLSQPPPPVVPNHGQQTCPLLSDFGCNVPAPETH 277

Query: 254 PVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVNYCSRGCQAQD 313
           P N FL +WF   G      GLRLCSH+GCGR ETR HEFRRCSVCG VNYCSR CQA D
Sbjct: 278 PANRFLADWFAVRGGDCPGDGLRLCSHAGCGRPETRKHEFRRCSVCGVVNYCSRACQALD 337

Query: 314 WKVRHKEECTTVQRWRDEDANNAGEM 337
           WK+RHK +C  VQRW +E     G +
Sbjct: 338 WKLRHKMDCAPVQRWLEEGDGGEGNV 363

BLAST of Csa1G002030.1 vs. TAIR10
Match: AT5G50450.1 (AT5G50450.1 HCP-like superfamily protein with MYND-type zinc finger)

HSP 1 Score: 393.7 bits (1010), Expect = 1.2e-109
Identity = 204/314 (64.97%), Positives = 237/314 (75.48%), Query Frame = 1

Query: 12  NFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNPMVLCKAGPK 71
           N TV++  F+ L DDL+I +L +LA SASSPSD L +L TCKRLNRL L+P+VL KAG +
Sbjct: 15  NNTVNNH-FEDLHDDLIISILRKLATSASSPSDFLTVLSTCKRLNRLGLHPLVLSKAGTQ 74

Query: 72  AFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASLMAKAAIKSHAP 131
             AV    WSDS+H+FLK CV+AGN +ASY+LGMIRFYCL+N  SGASLMAKAAIKSHAP
Sbjct: 75  TLAVTAEKWSDSSHKFLKLCVNAGNIDASYSLGMIRFYCLQNPVSGASLMAKAAIKSHAP 134

Query: 132 ALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQDGYGVRQNSDE 191
           ALYSL+VIQFNGSGGSK+DK+L+AGVALCAR+A+LGHVDALRELGHCLQDGYGV ++  E
Sbjct: 135 ALYSLSVIQFNGSGGSKTDKNLRAGVALCARSAYLGHVDALRELGHCLQDGYGVPRDVSE 194

Query: 192 GRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDFGCNVPAPEPHP 251
           GRRLL+QANARELA  L S  S    +  +++  L DL+             VP  E HP
Sbjct: 195 GRRLLIQANARELACSLRSYLSL---KSGDENETLTDLSV------------VPVQEIHP 254

Query: 252 VNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVNYCSRGCQAQDW 311
           VN FL+EWF S G V    GLR+CSH GCGR ETR HEFRRCSVCG VNYCSRGCQA DW
Sbjct: 255 VNRFLKEWF-SSGRVDLAEGLRMCSHGGCGRPETRAHEFRRCSVCGKVNYCSRGCQALDW 311

Query: 312 KVRHKEECTTVQRW 326
           + +HK ECT +  W
Sbjct: 315 RAKHKVECTPLDLW 311

BLAST of Csa1G002030.1 vs. TAIR10
Match: AT1G67623.1 (AT1G67623.1 F-box family protein)

HSP 1 Score: 53.9 bits (128), Expect = 2.3e-07
Identity = 38/128 (29.69%), Positives = 59/128 (46.09%), Query Frame = 1

Query: 21  DVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNPMVLCKAGPKAFAVRMRNW 80
           D LP+DLL+ + S     ASS S + NL L  K   R+     V  +   K   +    W
Sbjct: 25  DSLPEDLLVEISS--CTGASSLSAVRNLRLVSKSFRRICDEKYVFYRLSLKE--IEFLPW 84

Query: 81  SDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASLMAKAAIKSHAPALYSLAVIQ 140
            +++ +F++RC ++ N EA +  G I ++  + +  G   +A+AA K    A Y   VI 
Sbjct: 85  HENSAKFIERCTESRNPEALFQKGFINYFRDKLQDRGLEYLAEAAEKGIKEAKYVYGVIL 144

Query: 141 FNGSGGSK 149
               G +K
Sbjct: 145 ICLGGKTK 148

BLAST of Csa1G002030.1 vs. NCBI nr
Match: gi|449440552|ref|XP_004138048.1| (PREDICTED: F-box protein At1g67340-like [Cucumis sativus])

HSP 1 Score: 723.4 bits (1866), Expect = 1.9e-205
Identity = 354/354 (100.00%), Positives = 354/354 (100.00%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL
Sbjct: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL
Sbjct: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDF 240
           DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDF
Sbjct: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDF 240

Query: 241 GCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVN 300
           GCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVN
Sbjct: 241 GCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVN 300

Query: 301 YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEEVEDANNIVGIV 355
           YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEEVEDANNIVGIV
Sbjct: 301 YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEEVEDANNIVGIV 354

BLAST of Csa1G002030.1 vs. NCBI nr
Match: gi|659128906|ref|XP_008464430.1| (PREDICTED: F-box protein At1g67340-like [Cucumis melo])

HSP 1 Score: 682.9 bits (1761), Expect = 2.9e-193
Identity = 338/354 (95.48%), Positives = 342/354 (96.61%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           MIS KRPRT RNFT DSDLFDVLPDDLLIHLL  LAASASSPSDLLNLLLTCKRLNRLVL
Sbjct: 1   MISVKRPRTRRNFTADSDLFDVLPDDLLIHLLCHLAASASSPSDLLNLLLTCKRLNRLVL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           +P+VL KAGPKAFAVRM+NW DS+HRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL
Sbjct: 61  HPLVLSKAGPKAFAVRMQNWCDSSHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLTATRCSLLSDF 240
           DGYGVRQNSDEGRRLLVQANARELATVLLSSSS WQQQRHNQSGNLPDLTATRCSLLSDF
Sbjct: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSIWQQQRHNQSGNLPDLTATRCSLLSDF 240

Query: 241 GCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCGTVN 300
           GCNVPAPEPHPVNLFLREWFESEGEVAAR GLRLCSHSGCGR ETRPHEFRRCSVCGTVN
Sbjct: 241 GCNVPAPEPHPVNLFLREWFESEGEVAARGGLRLCSHSGCGRAETRPHEFRRCSVCGTVN 300

Query: 301 YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEEVEDANNIVGIV 355
           YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMF IV EEVEDANNIVGIV
Sbjct: 301 YCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFDIV-EEVEDANNIVGIV 353

BLAST of Csa1G002030.1 vs. NCBI nr
Match: gi|823212501|ref|XP_012439000.1| (PREDICTED: F-box protein At5g50450-like [Gossypium raimondii])

HSP 1 Score: 482.6 bits (1241), Expect = 5.7e-133
Identity = 242/353 (68.56%), Positives = 278/353 (78.75%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           M+  K+ R  R FT  SDLFD LP+DL+I +LS+LA+SASSPSD +N+L+TCKRLNRL L
Sbjct: 1   MLQRKKQRISRKFTDKSDLFDGLPEDLVISILSKLASSASSPSDFINILVTCKRLNRLAL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           +P+VL KAG KA AV+ RNW DSAH FLK C++AGN EA YTLGMI+FYCL+NRGSGASL
Sbjct: 61  HPLVLSKAGSKALAVKARNWCDSAHHFLKHCINAGNVEACYTLGMIQFYCLQNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           +AKAAIKSHAPALYSL VIQFNGSGGSK+DKDL+AGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 LAKAAIKSHAPALYSLGVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLT---ATRCSLL 240
           DGYGVRQ+  EGRRLL++ANARELA+ L +      QQ+H +  N    +    + C LL
Sbjct: 181 DGYGVRQHIAEGRRLLIRANARELASSLNALVKRKPQQQHQRRLNYQHYSFKAGSGCPLL 240

Query: 241 SDFGCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCG 300
           SDFGCNVPAPE HPVN+F +EWFES G  A   GLRLCSH GCGR ETR HEFRRCSVCG
Sbjct: 241 SDFGCNVPAPEVHPVNVFSKEWFES-GRGALGQGLRLCSHKGCGRPETRAHEFRRCSVCG 300

Query: 301 TVNYCSRGCQAQDWKVRHKEECTTVQRWRDEDAN-------NAGEMFGIVEEE 344
           TVNYCSRGCQA DWK+RHK EC  ++RW +E  N         GEM  +VE E
Sbjct: 301 TVNYCSRGCQALDWKLRHKAECGPIERWLEEGGNGDDGGHGGVGEMEEVVEAE 352

BLAST of Csa1G002030.1 vs. NCBI nr
Match: gi|728846084|gb|KHG25527.1| (hypothetical protein F383_02830 [Gossypium arboreum])

HSP 1 Score: 482.6 bits (1241), Expect = 5.7e-133
Identity = 243/353 (68.84%), Positives = 277/353 (78.47%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           M+  K+ R  R FT  SDLFD LP+DL+I +LS+LA SASSPSD +N+L+TCKRLNRL L
Sbjct: 1   MLQRKKQRISRKFTDKSDLFDGLPEDLVISILSKLAYSASSPSDFINILVTCKRLNRLAL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           +P+VL KAG KA AV+  NW DSAH FLK C++AGN EA YTLGMI+FYCL+NRGSGASL
Sbjct: 61  HPLVLSKAGSKALAVKAGNWCDSAHHFLKHCINAGNVEACYTLGMIQFYCLQNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           +AKAAIKSHAPALYSL VIQFNGSGGSK+DKDL+AGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 LAKAAIKSHAPALYSLGVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPDLT---ATRCSLL 240
           DGYGVRQ+  EGRRLL+QANARELA+ L +      QQ+H +  N    +    + C LL
Sbjct: 181 DGYGVRQHIAEGRRLLIQANARELASSLNALVKRKAQQQHQRRLNYQHYSFKAGSGCPLL 240

Query: 241 SDFGCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRGETRPHEFRRCSVCG 300
           SDFGCNVPAPE HPVN+FL+EWFES G  A   GLRLCSH GCGR ETR HEFRRCSVCG
Sbjct: 241 SDFGCNVPAPEVHPVNVFLKEWFES-GRGALGQGLRLCSHKGCGRPETRAHEFRRCSVCG 300

Query: 301 TVNYCSRGCQAQDWKVRHKEECTTVQRWRDEDAN-------NAGEMFGIVEEE 344
           TVNYCSRGCQA DWK+RHK EC  ++RW +E  N         GEM  +VE E
Sbjct: 301 TVNYCSRGCQALDWKLRHKAECGPIERWLEEGGNGNDGGHGGVGEMEEVVEAE 352

BLAST of Csa1G002030.1 vs. NCBI nr
Match: gi|590670176|ref|XP_007037982.1| (HCP-like superfamily protein with MYND-type zinc finger [Theobroma cacao])

HSP 1 Score: 479.9 bits (1234), Expect = 3.7e-132
Identity = 241/355 (67.89%), Positives = 283/355 (79.72%), Query Frame = 1

Query: 1   MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL 60
           M+  K+ RT R  T  SDLFD +PDDL++ +L +L++S S P+D +N+LLTCKRLNRL L
Sbjct: 1   MLQKKKQRTTRKITDKSDLFDGVPDDLVVSILCKLSSSVSCPTDFVNILLTCKRLNRLGL 60

Query: 61  NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL 120
           +P+VL KAG KA AV+ +NWSDSAHRFLK CV AGN EA YTLGMIRFYCL+NRGSGASL
Sbjct: 61  HPLVLSKAGSKALAVKAKNWSDSAHRFLKHCVSAGNIEACYTLGMIRFYCLQNRGSGASL 120

Query: 121 MAKAAIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQ 180
           MAKAA+KSHAPALYSLAVIQFNGSGGSK++KDL+AGVALCARAAFLGHVDALRELGHCLQ
Sbjct: 121 MAKAAMKSHAPALYSLAVIQFNGSGGSKNNKDLRAGVALCARAAFLGHVDALRELGHCLQ 180

Query: 181 DGYGVRQNSDEGRRLLVQANARELATVLLSSSSTWQQQRHNQSGNLPD---LTATRCSLL 240
           DGYGVRQN  EGRRLL+QANARELA+ L +      +Q+H +  N      +T + C LL
Sbjct: 181 DGYGVRQNITEGRRLLIQANARELASSLNTVVKRQLKQQHQRRLNYQHYAYMTGSGCPLL 240

Query: 241 SDFGCNVPAPEPHPVNLFLREWFESE-GEVAARVGLRLCSHSGCGRGETRPHEFRRCSVC 300
           SDFGCNVP PE HPV++FL+EWFES  GE+    GLRLCSH GCGR ETR HEFRRCSVC
Sbjct: 241 SDFGCNVPVPEGHPVHVFLKEWFESGLGELGQ--GLRLCSHKGCGRPETRAHEFRRCSVC 300

Query: 301 GTVNYCSRGCQAQDWKVRHKEECTTVQRWRDEDAN-NAGEMFGIVEEEVEDANNI 351
           GTVNYCSRGCQA DWK+RHK EC  ++RW++E  N N G+      EEV +A ++
Sbjct: 301 GTVNYCSRGCQALDWKLRHKVECGPMERWQEEGGNGNGGDGGAGGMEEVGEAEDL 353

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value   Identity (%)  Description
FB76_ARATH   2.9e-124  66.56         F-box protein At1g67340 OS=Arabidopsis thaliana GN=At1g67340 PE=1 SV=1
FB342_ARATH  2.2e-108  64.97         F-box protein At5g50450 OS=Arabidopsis thaliana GN=At5g50450 PE=2 SV=1
FB79_ARATH   4.1e-06   29.69         Putative F-box protein At1g67623 OS=Arabidopsis thaliana GN=At1g67623 PE=3 SV=1

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LNX1_CUCSA  1.3e-205  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_1G002030 PE=4 SV=1
A0A0D2RJ28_GOSRA  4.0e-133  68.56         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G206200 PE=4 SV=1
A0A0B0PKV7_GOSAR  4.0e-133  68.84         Uncharacterized protein OS=Gossypium arboreum GN=F383_02830 PE=4 SV=1
A0A061FZI2_THECC  2.6e-132  67.89         HCP-like superfamily protein with MYND-type zinc finger OS=Theobroma cacao GN=TC...
B9RAG1_RICCO      1.3e-131  68.86         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1506210 PE=4 SV=1

TAIR10
Match Name   E-value   Identity (%)  Description
AT1G67340.1  1.6e-125  66.56         HCP-like superfamily protein with MYND-type zinc finger
AT5G50450.1  1.2e-109  64.97         HCP-like superfamily protein with MYND-type zinc finger
AT1G67623.1  2.3e-07   29.69         F-box family protein

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449440552|ref|XP_004138048.1|  1.9e-205  100.00        PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
gi|659128906|ref|XP_008464430.1|  2.9e-193  95.48         PREDICTED: F-box protein At1g67340-like [Cucumis melo]
gi|823212501|ref|XP_012439000.1|  5.7e-133  68.56         PREDICTED: F-box protein At5g50450-like [Gossypium raimondii]
gi|728846084|gb|KHG25527.1|       5.7e-133  68.84         hypothetical protein F383_02830 [Gossypium arboreum]
gi|590670176|ref|XP_007037982.1|  3.7e-132  67.89         HCP-like superfamily protein with MYND-type zinc finger [Theobroma cacao]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR001810  F-box_dom
IPR002893  Znf_MYND
IPR011990  TPR-like_helical_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding

GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Csa1G002030   Csa1G002030  gene

The following polypeptide feature(s) derive from this mRNA:

Feature Name   Unique Name            Type
Csa1G002030.1  Csa1G002030.1-protein  polypeptide

The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Csa1G002030.1.utr5p1  Csa1G002030.1.utr5p1  five_prime_UTR

The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name         Type
Csa1G002030.1.cds1  Csa1G002030.1.cds1  CDS
Csa1G002030.1.cds2  Csa1G002030.1.cds2  CDS
Csa1G002030.1.cds3  Csa1G002030.1.cds3  CDS

The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Csa1G002030.1.utr3p1  Csa1G002030.1.utr3p1  three_prime_UTR

Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                        Source   Source Term         Source Description                              Alignment (coord)  Score
IPR001810  F-box domain                           PFAM     PF12937             F-box-like                                      22..62             1.
IPR001810  F-box domain                           unknown  SSF81383            F-box domain                                    14..74             1.3
IPR002893  Zinc finger, MYND-type                 PFAM     PF01753             zf-MYND                                         280..319           1.
IPR002893  Zinc finger, MYND-type                 PROFILE  PS50865             ZF_MYND_2                                       277..319           11
IPR011990  Tetratricopeptide-like helical domain  GENE3D   G3DSA:1.25.40.10    -                                               82..207            1.3
None       No IPR available                       GENE3D   G3DSA:1.20.1280.50  -                                               16..63             4.
None       No IPR available                       PANTHER  PTHR12298           PCDC2 PROGRAMMED CELL DEATH PROTEIN 2 -RELATED  11..337            3.3E
None       No IPR available                       PANTHER  PTHR12298:SF7       SUBFAMILY NOT NAMED                             11..337            3.3E
None       No IPR available                       unknown  SSF144232           HIT/MYND zinc finger-like                       273..324           4.53
None       No IPR available                       unknown  SSF81901            HCP-like                                        74..199            3.66
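
The coordinates in the InterPro table can be mapped back onto the protein sequence to pull out the residues of a predicted domain. The sketch below assumes the coordinates are 1-based, inclusive protein positions (an assumption, though it matches how such annotation tables are normally reported); `region` is an illustrative helper, and only the first 120 residues of the protein are included to keep the example short.

```python
# First 120 residues of the Csa1G002030.1 protein sequence (prefix only).
PROTEIN_PREFIX = (
    "MISSKRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVL"
    "NPMVLCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASL"
)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return seq[start - 1:end]


# PF12937 (F-box-like) is reported at coord 22..62.
f_box = region(PROTEIN_PREFIX, 22, 62)
print(f_box)       # VLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNP
print(len(f_box))  # 41
```

The same helper applies to the other rows (for example the MYND zinc finger at 280..319) once the full 354-residue sequence is supplied.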