CmaCh16G004950 (gene) Cucurbita maxima (Rimu)

NameCmaCh16G004950
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionVQ motif-containing family protein
LocationCma_Chr16 : 2533970 .. 2534581 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCCGGCCCTGGCGATTGGCTTCAATTTTACCATCAAAATCTTCCTACCACGGCGGCGCCACCGCCTTCCGATCAATCCACCTCCGAGATGATCTTCGTCGATCGAGTTTCCGACGCCATCGCCTCCGCAAACACCGGCGCTTCCACTGGCTTGAATCCGGAACGTCGTGTAGCAAAGCCGGTTCGCCGCCGGTCCAGGACTTCTCGGCGGACTCCGACGACCTTGCTCAATACAGACACAACTAATTTCAGAGCGATGGTTCAACAATTCACCGGCGGTCCAACTCCGCCTTTCGCCTCATCGATTTCCCCCAATTTTTCGTTAGGGTTCGGCGCGATTCCTCAATCGAATTCCGGCTTGATTTCTCCGCCGTCTGGTTATCCGCTGCAGTTTTACCATCACAACCCCCAACCGTTCGTGATTCCAGCATCCGCACACGGCGGCGAATTTCTTCAGAGGCTATCCGCCGCGAAGCCAGGAAATGGCGGAGTCGCCGCAGACGGATTTCTGATGGAAAGTGCAGTTTCTTCTCAAATTCCTCCCGCCGGAGCTTCTGCAGATTCCTCCAACAAAAGCAACGGCGGTAGTGGCGGCGGATTTCTGTTCTGA

mRNA sequence

ATGTCCGGCCCTGGCGATTGGCTTCAATTTTACCATCAAAATCTTCCTACCACGGCGGCGCCACCGCCTTCCGATCAATCCACCTCCGAGATGATCTTCGTCGATCGAGTTTCCGACGCCATCGCCTCCGCAAACACCGGCGCTTCCACTGGCTTGAATCCGGAACGTCGTGTAGCAAAGCCGGTTCGCCGCCGGTCCAGGACTTCTCGGCGGACTCCGACGACCTTGCTCAATACAGACACAACTAATTTCAGAGCGATGGTTCAACAATTCACCGGCGGTCCAACTCCGCCTTTCGCCTCATCGATTTCCCCCAATTTTTCGTTAGGGTTCGGCGCGATTCCTCAATCGAATTCCGGCTTGATTTCTCCGCCGTCTGGTTATCCGCTGCAGTTTTACCATCACAACCCCCAACCGTTCGTGATTCCAGCATCCGCACACGGCGGCGAATTTCTTCAGAGGCTATCCGCCGCGAAGCCAGGAAATGGCGGAGTCGCCGCAGACGGATTTCTGATGGAAAGTGCAGTTTCTTCTCAAATTCCTCCCGCCGGAGCTTCTGCAGATTCCTCCAACAAAAGCAACGGCGGTAGTGGCGGCGGATTTCTGTTCTGA

Coding sequence (CDS)

ATGTCCGGCCCTGGCGATTGGCTTCAATTTTACCATCAAAATCTTCCTACCACGGCGGCGCCACCGCCTTCCGATCAATCCACCTCCGAGATGATCTTCGTCGATCGAGTTTCCGACGCCATCGCCTCCGCAAACACCGGCGCTTCCACTGGCTTGAATCCGGAACGTCGTGTAGCAAAGCCGGTTCGCCGCCGGTCCAGGACTTCTCGGCGGACTCCGACGACCTTGCTCAATACAGACACAACTAATTTCAGAGCGATGGTTCAACAATTCACCGGCGGTCCAACTCCGCCTTTCGCCTCATCGATTTCCCCCAATTTTTCGTTAGGGTTCGGCGCGATTCCTCAATCGAATTCCGGCTTGATTTCTCCGCCGTCTGGTTATCCGCTGCAGTTTTACCATCACAACCCCCAACCGTTCGTGATTCCAGCATCCGCACACGGCGGCGAATTTCTTCAGAGGCTATCCGCCGCGAAGCCAGGAAATGGCGGAGTCGCCGCAGACGGATTTCTGATGGAAAGTGCAGTTTCTTCTCAAATTCCTCCCGCCGGAGCTTCTGCAGATTCCTCCAACAAAAGCAACGGCGGTAGTGGCGGCGGATTTCTGTTCTGA

Protein sequence

MSGPGDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAIASANTGASTGLNPERRVAKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSISPNFSLGFGAIPQSNSGLISPPSGYPLQFYHHNPQPFVIPASAHGGEFLQRLSAAKPGNGGVAADGFLMESAVSSQIPPAGASADSSNKSNGGSGGGFLF
BLAST of CmaCh16G004950 vs. Swiss-Prot
Match: VQ22_ARATH (VQ motif-containing protein 22 OS=Arabidopsis thaliana GN=VQ22 PE=2 SV=1)

HSP 1 Score: 82.0 bits (201), Expect = 8.1e-15
Identity = 61/140 (43.57%), Postives = 78/140 (55.71%), Query Frame = 1

Query: 1   MSGPGDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAIASANTGASTGLNPER-RVA 60
           M+ P +W QFY+ N   T     +  ST+       V+   A   T   + L+PE  RV 
Sbjct: 1   MANPNEWSQFYNNN--QTFFTTSTTASTA-------VTTTTAGDTTSIDSRLSPETGRVT 60

Query: 61  KPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTP-PFAS-SISPNFSLGFGAIPQS 120
           KP RRRSR SRRTPTTLLNTDT+NFRAMVQQ+TGGP+   F S + +  FSL   + P +
Sbjct: 61  KPTRRRSRASRRTPTTLLNTDTSNFRAMVQQYTGGPSAMAFGSGNTTSAFSLTSSSDPSA 120

Query: 121 NSGLISPPSGYPLQFYHHNP 138
            S   +P   +   F  H P
Sbjct: 121 GSSQQAP---WQYNFQPHAP 128

BLAST of CmaCh16G004950 vs. TrEMBL
Match: A0A0A0KV74_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G056730 PE=4 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 7.8e-65
Identity = 141/197 (71.57%), Postives = 149/197 (75.63%), Query Frame = 1

Query: 5   GDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAI------ASANTGASTGLNPERRV 64
           GDWLQFYHQNL +TAAPPPSD STSEM FVDRVSDA       AS NT  STGLNPE RV
Sbjct: 3   GDWLQFYHQNLSSTAAPPPSDHSTSEMFFVDRVSDATGVITTTASVNTLGSTGLNPEGRV 62

Query: 65  AKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSISPNFSLGFGAIPQS- 124
            KPVRRRSR SRRTPTTLLNTDTTNFRAMVQQFTGGPTPPF SSISPNFSLGFG I QS 
Sbjct: 63  GKPVRRRSRASRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFTSSISPNFSLGFGGIHQSN 122

Query: 125 -----NSGLISPPSGY----PLQFYHHNPQPFVIPASAHGGEFLQRLSAAKPGNGGVAAD 184
                N+ +  PPSGY    P Q Y+HNPQ F+ P  AHGG+FLQRLSA +P NG VA D
Sbjct: 123 FPTSQNATISPPPSGYLLQQPPQLYNHNPQQFMFPTVAHGGDFLQRLSAPRPANGAVAGD 182

Query: 185 GFLMESAVSSQIPPAGA 186
           GFL+ESA    IPP  A
Sbjct: 183 GFLIESA----IPPTRA 195

BLAST of CmaCh16G004950 vs. TrEMBL
Match: A0A061DLC3_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_002034 PE=4 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 3.5e-33
Identity = 106/228 (46.49%), Postives = 130/228 (57.02%), Query Frame = 1

Query: 1   MSGPGDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDA---------------IASAN 60
           MSGP DW QFY QNL    AP      +SE  F D+ SDA               +AS  
Sbjct: 5   MSGPTDWAQFYQQNLSVQEAPNRGRVVSSESAFGDQGSDATVVTTTITSSSAPSPLASGP 64

Query: 61  TGASTG-LNPERRVAKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPF-ASSI 120
            G+S G L+PE RV+KP+RRRSR SRRTPTTLLNTDTTNFRAMVQQFTGGP+ PF   S 
Sbjct: 65  AGSSAGHLSPEGRVSKPLRRRSRASRRTPTTLLNTDTTNFRAMVQQFTGGPSAPFPGHSG 124

Query: 121 SPNFSLGFGA-IPQSNSG-LISPPSGYPLQF-----------YHHNPQPFVIPASAH--- 180
            PNF  GFG   P  N G L+ PP G+ LQ+           +    QP++   S++   
Sbjct: 125 GPNFGFGFGGRQPHVNPGSLMIPPGGFHLQYQQQQQPQQQHQFQQQNQPYMFSLSSNNPG 184

Query: 181 -GGEFLQRLSAAKPGNGGVAADGFLMESAVSSQIPPAGASADSSNKSN 195
            G  FLQRL    P      +DGF++E A SSQ+PP+   + + N+SN
Sbjct: 185 AGDLFLQRL-GGNPRPNMEGSDGFVVEGA-SSQVPPSRTPSSNENRSN 230

BLAST of CmaCh16G004950 vs. TrEMBL
Match: A0A0D2TGR0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G173300 PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 1.1e-26
Identity = 101/235 (42.98%), Postives = 117/235 (49.79%), Query Frame = 1

Query: 1   MSGPGDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSD-----------------AIAS 60
           MS P DW QFY   L     P      TSE +F D+ SD                 ++ S
Sbjct: 1   MSTPTDWPQFYDHALSNQEIPNRVRILTSESVFGDQGSDTAVLTTPTVTSSSAPLSSLGS 60

Query: 61  ANTGASTG--LNPERRVAKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFAS 120
              G S+G  L+PE RV KPVRRRSR SRRTPTTLLNTDTTNFRAMVQQFTGGP+ PFA 
Sbjct: 61  GLGGGSSGGHLSPEGRVGKPVRRRSRASRRTPTTLLNTDTTNFRAMVQQFTGGPSAPFAG 120

Query: 121 SI----SPNFSLGFGAIPQ-----SNSGLISPPSGYPLQF----------YHHNPQPFVI 180
                  PNF  GFG   Q      N+ L+ PP+G+ LQ+           HH  QP + 
Sbjct: 121 GAPHHGGPNFGYGFGTRHQPHNSNPNNPLMLPPTGFHLQYQQQQQHQNQLIHHQNQPLMF 180

Query: 181 P------ASAHGGEFLQRLSAAKPGNGGVAADGFLMESAVSSQIPPAGASADSSN 192
                  + A G  F QRL       GGV   G    S  SSQ+P +  S  SSN
Sbjct: 181 SLNSNDNSPAPGELFFQRLGV----GGGVNMQG----SDASSQVPASRTSTSSSN 227

BLAST of CmaCh16G004950 vs. TrEMBL
Match: W9REZ9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_009324 PE=4 SV=1)

HSP 1 Score: 124.4 bits (311), Expect = 1.6e-25
Identity = 99/239 (41.42%), Postives = 125/239 (52.30%), Query Frame = 1

Query: 1   MSGPGDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAIA--SANTGASTG-----LN 60
           MS P +W+QFY+QN    +  P +  +T   I  DRVSDA A  +A TG+++      L+
Sbjct: 5   MSNPTEWMQFYNQNF---SGQPQASTTTQPTILPDRVSDATAVTAAVTGSASPTVPNRLS 64

Query: 61  PERRVAKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSIS----PNFSL 120
           PE RV+KPVRRR+R SRRTPTTLLNTDTTNFRAMVQQFTGGP+ PFAS  S      F+ 
Sbjct: 65  PEGRVSKPVRRRTRASRRTPTTLLNTDTTNFRAMVQQFTGGPSAPFASVGSHPGASAFAF 124

Query: 121 GFGAIPQS--NSGLISPPSG-----------YPLQFYHH--NPQPFVIPASAHGG----E 180
           G GA   S  N   +  P+            +    YHH    Q ++      GG     
Sbjct: 125 GLGARQGSFANPSAVMVPAAQAGNYHHTLHLHQQNLYHHQRQNQQYMFGGGGSGGAGNDS 184

Query: 181 FLQRLSAAKP--GNGGV----AADGFLMESAVSSQIPPAGASADSSNKSNGGSGGGFLF 204
           F QRLS+ +P   N GV      +GFL      +  P A  +  SSN  N      FL+
Sbjct: 185 FFQRLSSGRPVVSNMGVGNINTTEGFLNTVGSQAVAPAAATARPSSNSPNEDRSNSFLY 240

BLAST of CmaCh16G004950 vs. TrEMBL
Match: B9HCD7_POPTR (VQ motif-containing family protein OS=Populus trichocarpa GN=POPTR_0006s00810g PE=4 SV=2)

HSP 1 Score: 122.5 bits (306), Expect = 6.0e-25
Identity = 76/157 (48.41%), Postives = 93/157 (59.24%), Query Frame = 1

Query: 4   PGDWLQFYHQNLPTTAAPP----------PSDQSTSEMIFVDRVSDAIASANTGASTG-- 63
           P DW QFY QNL     PP           S   T+  I    V + + S+N+ +S G  
Sbjct: 9   PTDWSQFYQQNLSNQVLPPIRPMFRDRVADSTTVTTTTINTSGVPNPMGSSNSSSSAGRH 68

Query: 64  LNPERRVAKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSI---SPNFS 123
           L+PE RVAKP+R+RSR SRRTPTTLLNTDTTNFRAMVQQFTGGP+ PFAS     + NF 
Sbjct: 69  LSPEGRVAKPIRKRSRASRRTPTTLLNTDTTNFRAMVQQFTGGPSAPFASGSQINATNFG 128

Query: 124 LGFGAIPQSN-----SGLISPPSGYPLQFYHHNPQPF 141
              GA  Q++     S ++ PP+GY LQ+     Q F
Sbjct: 129 FALGAYRQAHHVNQPSPVMMPPAGYNLQYQQQQQQQF 165

BLAST of CmaCh16G004950 vs. TAIR10
Match: AT4G15120.1 (AT4G15120.1 VQ motif-containing protein)

HSP 1 Score: 82.8 bits (203), Expect = 2.7e-16
Identity = 78/202 (38.61%), Postives = 94/202 (46.53%), Query Frame = 1

Query: 6   DWLQFYH-QNLPTTAAPPPSDQSTSEMIFVDRVSDAIASANTGASTGLNPE-RRVAKPVR 65
           DW QFY+ Q   TTA    +  +T+                T A + L+P+ RRVAKP R
Sbjct: 7   DWSQFYNNQTFFTTATSTVTTTTTTA---------------TSADSPLSPDNRRVAKPTR 66

Query: 66  RRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTP-PFASSISPNFSLGFGAIPQSNSGLI 125
           RRSR SRRTPTTL NTDT NFRAMVQQFTGGP+   F SS S  FSL         +G+ 
Sbjct: 67  RRSRASRRTPTTLFNTDTANFRAMVQQFTGGPSAVAFGSSPSSGFSL---TSSDPTAGVS 126

Query: 126 SPPSGY---PLQFYHHNPQPFVIPASAHGGEFLQRLSAAKPGNGGVAADGFLMESAVSSQ 185
           S P  Y     Q  H+       P        +  LS        VA DGF+        
Sbjct: 127 SSPWQYANLQNQMAHNELMQQQRPYMFSSSNNVSTLSYP----NAVATDGFV-------- 172

Query: 186 IPPAGASADSSNKSNGGSGGGF 202
                  A+ S +  GG GGG+
Sbjct: 187 ------GAEESREGGGGGGGGY 172

BLAST of CmaCh16G004950 vs. TAIR10
Match: AT3G22160.1 (AT3G22160.1 VQ motif-containing protein)

HSP 1 Score: 82.0 bits (201), Expect = 4.6e-16
Identity = 61/140 (43.57%), Postives = 78/140 (55.71%), Query Frame = 1

Query: 1   MSGPGDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAIASANTGASTGLNPER-RVA 60
           M+ P +W QFY+ N   T     +  ST+       V+   A   T   + L+PE  RV 
Sbjct: 1   MANPNEWSQFYNNN--QTFFTTSTTASTA-------VTTTTAGDTTSIDSRLSPETGRVT 60

Query: 61  KPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTP-PFAS-SISPNFSLGFGAIPQS 120
           KP RRRSR SRRTPTTLLNTDT+NFRAMVQQ+TGGP+   F S + +  FSL   + P +
Sbjct: 61  KPTRRRSRASRRTPTTLLNTDTSNFRAMVQQYTGGPSAMAFGSGNTTSAFSLTSSSDPSA 120

Query: 121 NSGLISPPSGYPLQFYHHNP 138
            S   +P   +   F  H P
Sbjct: 121 GSSQQAP---WQYNFQPHAP 128

BLAST of CmaCh16G004950 vs. TAIR10
Match: AT4G39720.1 (AT4G39720.1 VQ motif-containing protein)

HSP 1 Score: 61.6 bits (148), Expect = 6.4e-10
Identity = 40/102 (39.22%), Postives = 60/102 (58.82%), Query Frame = 1

Query: 8   LQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAIASAN---TGASTGLNPERRVA--KPV 67
           L F H N    +  PP+  + +  + V++  D I+  +   + A++ L P   +   K  
Sbjct: 57  LHFDHNN--NNSLIPPNYFNNNTFLPVNQQPDPISQLDLRTSSATSSLPPTNNIGVIKKT 116

Query: 68  RRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSIS 105
           ++RSR SRR PTT+L TDT+NFRAMVQ+FTG P PP  ++ S
Sbjct: 117 KKRSRASRRAPTTVLTTDTSNFRAMVQEFTGIPAPPLFNNNS 156

BLAST of CmaCh16G004950 vs. TAIR10
Match: AT5G65170.1 (AT5G65170.1 VQ motif-containing protein)

HSP 1 Score: 57.0 bits (136), Expect = 1.6e-08
Identity = 55/153 (35.95%), Postives = 70/153 (45.75%), Query Frame = 1

Query: 16  PTTAAPPPSDQSTSEMIFVDR-------------VSDAIASANTGASTGLNPERRVAKPV 75
           PT  + P     +SE +F                 +  I S  T    GL   R   K  
Sbjct: 84  PTDGSRPVPPPISSEQVFFTNPLQQNLRTVPNTNTTSPICSVPTDKKNGLATTRNPKK-- 143

Query: 76  RRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSISPNFSLG----FGAIPQSN 135
             RSR SRR PTT+L TDT+NFRAMVQ+FTG P+ PF + +S +F       FG+   S+
Sbjct: 144 --RSRVSRRAPTTVLTTDTSNFRAMVQEFTGNPSTPF-TGLSSSFPRSRFDLFGSSSSSS 203

Query: 136 S--------GLISPPS---GY--PLQFYHHNPQ 139
           S         LISP +    Y  P   YHH+ Q
Sbjct: 204 SRPMKPFPHKLISPSTLNHHYLPPSSEYHHHHQ 231

BLAST of CmaCh16G004950 vs. TAIR10
Match: AT1G35830.1 (AT1G35830.1 VQ motif-containing protein)

HSP 1 Score: 56.6 bits (135), Expect = 2.0e-08
Identity = 38/95 (40.00%), Postives = 50/95 (52.63%), Query Frame = 1

Query: 63  RRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFA---SSISPNFSL---------- 122
           R+R+R SRR PTT+L TDT+NFRAMVQ+FTG P  PF+   SS +  F +          
Sbjct: 100 RKRTRASRRAPTTVLTTDTSNFRAMVQEFTGVPASPFSHPFSSTTRRFDIFRSPSDPLTY 159

Query: 123 -GFGAIPQSNSGLISPPSGYPLQFYHHNPQPFVIP 144
             F  IPQ     ++P +   L  +HH       P
Sbjct: 160 NPFRPIPQKP---LNPSTSSLLNLHHHTTTSMTFP 191

BLAST of CmaCh16G004950 vs. NCBI nr
Match: gi|659101968|ref|XP_008451884.1| (PREDICTED: uncharacterized protein LOC103493042 [Cucumis melo])

HSP 1 Score: 300.8 bits (769), Expect = 1.8e-78
Identity = 160/204 (78.43%), Postives = 169/204 (82.84%), Query Frame = 1

Query: 5   GDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAIASANTGASTGLNPERRVAKPVRR 64
           GDWLQFYHQNL +TA PPPS  STSEMIF DRVSDA ASANT  STGLNPE RV KPVRR
Sbjct: 3   GDWLQFYHQNLSSTAPPPPSHHSTSEMIFADRVSDATASANTLGSTGLNPEGRVGKPVRR 62

Query: 65  RSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSISPNFSLGFGAIPQS------N 124
           RSR SRRTPTTLLNTDTTNFRAMVQQFTGGPTPPF SSISPNFSLGFG I QS      N
Sbjct: 63  RSRASRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFTSSISPNFSLGFGGIRQSNFSTSQN 122

Query: 125 SGLISPPSGY----PLQFYHHNPQPFVIPASAHGGEFLQRLSAAKPGNGGVAADGFLMES 184
           + +  PPSGY    P Q Y+HNPQPFV P  AHGG+FLQRLSA +P NGGVA DGFL+ES
Sbjct: 123 AAISPPPSGYLLQQPPQLYNHNPQPFVFPTVAHGGDFLQRLSAPRPANGGVAGDGFLIES 182

Query: 185 AVSSQIPPAGASADSSNKSNGGSG 199
           AVSSQIPPAGASADSSN+SNGG+G
Sbjct: 183 AVSSQIPPAGASADSSNESNGGNG 206

BLAST of CmaCh16G004950 vs. NCBI nr
Match: gi|778691256|ref|XP_011653250.1| (PREDICTED: VQ motif-containing protein 22-like [Cucumis sativus])

HSP 1 Score: 290.0 bits (741), Expect = 3.1e-75
Identity = 158/210 (75.24%), Postives = 168/210 (80.00%), Query Frame = 1

Query: 5   GDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAI------ASANTGASTGLNPERRV 64
           GDWLQFYHQNL +TAAPPPSD STSEM FVDRVSDA       AS NT  STGLNPE RV
Sbjct: 3   GDWLQFYHQNLSSTAAPPPSDHSTSEMFFVDRVSDATGVITTTASVNTLGSTGLNPEGRV 62

Query: 65  AKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSISPNFSLGFGAIPQS- 124
            KPVRRRSR SRRTPTTLLNTDTTNFRAMVQQFTGGPTPPF SSISPNFSLGFG I QS 
Sbjct: 63  GKPVRRRSRASRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFTSSISPNFSLGFGGIHQSN 122

Query: 125 -----NSGLISPPSGY----PLQFYHHNPQPFVIPASAHGGEFLQRLSAAKPGNGGVAAD 184
                N+ +  PPSGY    P Q Y+HNPQ F+ P  AHGG+FLQRLSA +P NG VA D
Sbjct: 123 FPTSQNATISPPPSGYLLQQPPQLYNHNPQQFMFPTVAHGGDFLQRLSAPRPANGAVAGD 182

Query: 185 GFLMESAVSSQIPPAGASADSSNKSNGGSG 199
           GFL+ESAVSSQIPPAGASADSSN+SNGG+G
Sbjct: 183 GFLIESAVSSQIPPAGASADSSNESNGGNG 212

BLAST of CmaCh16G004950 vs. NCBI nr
Match: gi|700198327|gb|KGN53485.1| (hypothetical protein Csa_4G056730 [Cucumis sativus])

HSP 1 Score: 255.0 bits (650), Expect = 1.1e-64
Identity = 141/197 (71.57%), Postives = 149/197 (75.63%), Query Frame = 1

Query: 5   GDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDAI------ASANTGASTGLNPERRV 64
           GDWLQFYHQNL +TAAPPPSD STSEM FVDRVSDA       AS NT  STGLNPE RV
Sbjct: 3   GDWLQFYHQNLSSTAAPPPSDHSTSEMFFVDRVSDATGVITTTASVNTLGSTGLNPEGRV 62

Query: 65  AKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFASSISPNFSLGFGAIPQS- 124
            KPVRRRSR SRRTPTTLLNTDTTNFRAMVQQFTGGPTPPF SSISPNFSLGFG I QS 
Sbjct: 63  GKPVRRRSRASRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFTSSISPNFSLGFGGIHQSN 122

Query: 125 -----NSGLISPPSGY----PLQFYHHNPQPFVIPASAHGGEFLQRLSAAKPGNGGVAAD 184
                N+ +  PPSGY    P Q Y+HNPQ F+ P  AHGG+FLQRLSA +P NG VA D
Sbjct: 123 FPTSQNATISPPPSGYLLQQPPQLYNHNPQQFMFPTVAHGGDFLQRLSAPRPANGAVAGD 182

Query: 185 GFLMESAVSSQIPPAGA 186
           GFL+ESA    IPP  A
Sbjct: 183 GFLIESA----IPPTRA 195

BLAST of CmaCh16G004950 vs. NCBI nr
Match: gi|590711222|ref|XP_007049045.1| (Uncharacterized protein TCM_002034 [Theobroma cacao])

HSP 1 Score: 149.8 bits (377), Expect = 5.0e-33
Identity = 106/228 (46.49%), Postives = 130/228 (57.02%), Query Frame = 1

Query: 1   MSGPGDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSDA---------------IASAN 60
           MSGP DW QFY QNL    AP      +SE  F D+ SDA               +AS  
Sbjct: 5   MSGPTDWAQFYQQNLSVQEAPNRGRVVSSESAFGDQGSDATVVTTTITSSSAPSPLASGP 64

Query: 61  TGASTG-LNPERRVAKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPF-ASSI 120
            G+S G L+PE RV+KP+RRRSR SRRTPTTLLNTDTTNFRAMVQQFTGGP+ PF   S 
Sbjct: 65  AGSSAGHLSPEGRVSKPLRRRSRASRRTPTTLLNTDTTNFRAMVQQFTGGPSAPFPGHSG 124

Query: 121 SPNFSLGFGA-IPQSNSG-LISPPSGYPLQF-----------YHHNPQPFVIPASAH--- 180
            PNF  GFG   P  N G L+ PP G+ LQ+           +    QP++   S++   
Sbjct: 125 GPNFGFGFGGRQPHVNPGSLMIPPGGFHLQYQQQQQPQQQHQFQQQNQPYMFSLSSNNPG 184

Query: 181 -GGEFLQRLSAAKPGNGGVAADGFLMESAVSSQIPPAGASADSSNKSN 195
            G  FLQRL    P      +DGF++E A SSQ+PP+   + + N+SN
Sbjct: 185 AGDLFLQRL-GGNPRPNMEGSDGFVVEGA-SSQVPPSRTPSSNENRSN 230

BLAST of CmaCh16G004950 vs. NCBI nr
Match: gi|763775772|gb|KJB42895.1| (hypothetical protein B456_007G173300 [Gossypium raimondii])

HSP 1 Score: 128.3 bits (321), Expect = 1.6e-26
Identity = 101/235 (42.98%), Postives = 117/235 (49.79%), Query Frame = 1

Query: 1   MSGPGDWLQFYHQNLPTTAAPPPSDQSTSEMIFVDRVSD-----------------AIAS 60
           MS P DW QFY   L     P      TSE +F D+ SD                 ++ S
Sbjct: 1   MSTPTDWPQFYDHALSNQEIPNRVRILTSESVFGDQGSDTAVLTTPTVTSSSAPLSSLGS 60

Query: 61  ANTGASTG--LNPERRVAKPVRRRSRTSRRTPTTLLNTDTTNFRAMVQQFTGGPTPPFAS 120
              G S+G  L+PE RV KPVRRRSR SRRTPTTLLNTDTTNFRAMVQQFTGGP+ PFA 
Sbjct: 61  GLGGGSSGGHLSPEGRVGKPVRRRSRASRRTPTTLLNTDTTNFRAMVQQFTGGPSAPFAG 120

Query: 121 SI----SPNFSLGFGAIPQ-----SNSGLISPPSGYPLQF----------YHHNPQPFVI 180
                  PNF  GFG   Q      N+ L+ PP+G+ LQ+           HH  QP + 
Sbjct: 121 GAPHHGGPNFGYGFGTRHQPHNSNPNNPLMLPPTGFHLQYQQQQQHQNQLIHHQNQPLMF 180

Query: 181 P------ASAHGGEFLQRLSAAKPGNGGVAADGFLMESAVSSQIPPAGASADSSN 192
                  + A G  F QRL       GGV   G    S  SSQ+P +  S  SSN
Sbjct: 181 SLNSNDNSPAPGELFFQRLGV----GGGVNMQG----SDASSQVPASRTSTSSSN 227

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
VQ22_ARATH8.1e-1543.57VQ motif-containing protein 22 OS=Arabidopsis thaliana GN=VQ22 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KV74_CUCSA7.8e-6571.57Uncharacterized protein OS=Cucumis sativus GN=Csa_4G056730 PE=4 SV=1[more]
A0A061DLC3_THECC3.5e-3346.49Uncharacterized protein OS=Theobroma cacao GN=TCM_002034 PE=4 SV=1[more]
A0A0D2TGR0_GOSRA1.1e-2642.98Uncharacterized protein OS=Gossypium raimondii GN=B456_007G173300 PE=4 SV=1[more]
W9REZ9_9ROSA1.6e-2541.42Uncharacterized protein OS=Morus notabilis GN=L484_009324 PE=4 SV=1[more]
B9HCD7_POPTR6.0e-2548.41VQ motif-containing family protein OS=Populus trichocarpa GN=POPTR_0006s00810g P... [more]
Match NameE-valueIdentityDescription
AT4G15120.12.7e-1638.61 VQ motif-containing protein[more]
AT3G22160.14.6e-1643.57 VQ motif-containing protein[more]
AT4G39720.16.4e-1039.22 VQ motif-containing protein[more]
AT5G65170.11.6e-0835.95 VQ motif-containing protein[more]
AT1G35830.12.0e-0840.00 VQ motif-containing protein[more]
Match NameE-valueIdentityDescription
gi|659101968|ref|XP_008451884.1|1.8e-7878.43PREDICTED: uncharacterized protein LOC103493042 [Cucumis melo][more]
gi|778691256|ref|XP_011653250.1|3.1e-7575.24PREDICTED: VQ motif-containing protein 22-like [Cucumis sativus][more]
gi|700198327|gb|KGN53485.1|1.1e-6471.57hypothetical protein Csa_4G056730 [Cucumis sativus][more]
gi|590711222|ref|XP_007049045.1|5.0e-3346.49Uncharacterized protein TCM_002034 [Theobroma cacao][more]
gi|763775772|gb|KJB42895.1|1.6e-2642.98hypothetical protein B456_007G173300 [Gossypium raimondii][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR008889VQ
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh16G004950.1CmaCh16G004950.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008889VQPFAMPF05678VQcoord: 71..98
score: 4.6
NoneNo IPR availablePANTHERPTHR33179FAMILY NOT NAMEDcoord: 1..139
score: 8.7
NoneNo IPR availablePANTHERPTHR33179:SF7GENOMIC DNA, CHROMOSOME 3, P1 CLONE:MKA23coord: 1..139
score: 8.7