CsaV3_UNG050390 (gene) Cucumber (Chinese Long) v3

NameCsaV3_UNG050390
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionDUF3531 domain-containing protein
Locationscaffold72 : 37994 .. 42368 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAAAGTGAGGAAGAAGAGTAGAAAGGTATGGACGAAACCCTAGAGAAGACGCGATGGTGGTGGCATCGCGTTTATTCTCTCTATCAATTTCTTCGTTCCTTTCTTCCCCCAATTTGAATTGCCGCTCCCCATCTCCTTTTATTATTCTTTTCAATCAAAGTTATTTTTTTAAAAAACAAAAGTTTTCTTTTCAATAAGCGACATTATTTCTCGTTTATTTCCATTTTCTACCTCTCAATTTTCCATATTCTCATTTGTTGTCAACTGAATTTAATTTTATTCTAAAAAAAATTTAAAATTTAATTTCTACATTTAAATATTTTTTTATGAGATTATATTGTGTTAAATGCTATATAATATACTCTAATTGTATATTATCACGTCACTATTTTATAATAAATATATGTAAATATATGTAGTACTCTCAATATATAGTAGTTTATTATGATTTATTGTAATTACTATATATTTTATAATTATTTTTATTCAATTTACTATATTAAAAAACAATATAAATAGTTTACCTTTCTAATATTTTGTTTTTCTTTTATATATTTTTTATAATTTATTTTATATAAAAATGTTGATTAATTATATATGTTATATTTCATGACATATAATTAAAAAAAGAGTGTGAATTCAAAATTTAGTAAAGTTTATGTTAAATAAATATTGGAGCATGTTTTAACAAATAACTTTACCTTTAATTTAATTTTCAAAGCATAATAGAGATGAACAAACTCCTTTCTTGATTCCTTGTTTAGATAAGGGAAAATATGAAAAAAATTTAAGATCTAAGAACAATTTTGAAATCATGTTAAAGTCGCAACTGCGTGAGTAACTTTATCAGTGATGAGAGTGATAAATAGATACAATGTTAATATAGAGGAGATTAGCAAAAGACTATTTGTTTTTTAATGCTACATTTTCTAGGTGCTTGAATCAATTTGAAATTCTTTAACTTTAGCTCTTTGAGCAACTTGATTTGTTATATATGTCAAAATGAGCATACCTCAATAGCATAATAATCAATTTTTTAAGCATTTCCAAAATAAAAGAATCGAGTAACATATATATATATAACTTTCAGATGATATTATGAATCTACAAATCTTTGTTAGAATGCCAACACATTTTAGGAACTCATCTCATTCTAACTAATCTATAACTTTCTTTTTCTACTTACTACTATTTAAAGTAAACTAATGCATTGATTACAAGTCAATGAAATAATTGATGGGTCTAATCTTAATAATGTGATTACCTTCCTATGACTTATCTAATTAAAAAATACTATTAATGAAAAATTCAAATTGTATAGTATATAAGTTTATCGAACTAATATTGTGAAATACATAAGATTAAACTAAAGACAACCTTATGCTCCAAATTTAGATAAAAGTATTATACAAATTCTCAATTAGTATGGTTAAGTAATGCCAAACACGATAGACATGTACAATTGTCTGTTTTCATTAACAAAATTAGTTTGGGATTAACTTGTATTATACAATTTTTTTATGATGAAAAATAGTAAAAGTAGTAGTAATTCAAATTGTCACATCTCACACATTTATATACTCGAAGAAGAAAGAGGCAAACCTTCTCTATGCATATGACTTAGTCTTGCATTTTAATTTTAAACCTTCGGTGTTGAATTTCATTTCATACTTCTTAGTTATGATATTGCAAAGGATTTTAATGCAATCTCAAAATGTTGAAAAATGCATTAAAATATCGAGTTAGATTGATCAAAACATGATTTCAGATGGAAAAGAGACGATGATAGAACGAAGTCGAAAAAGAGAGACGAAACTACATTTTGGTTGTGGATGTATTTATTCAACATTGCTTCCCCAATATGTTTGGGTCATCCATAGTCTACAATTATCTTCTTGCCTTCTATCTTTTCCGGTATACAAACCCTGTGGAAAAGATATTTGAACACTAAGCTTGTTAGAGCAAGAGCTAAAATCAGCACCACCGCTTTGCTGGTTCATGTCCAATTGTTTCGAACTCTCATTTCTTTCCAGATTTGAAGAAGGTTTTATCTTTTAATTTTCATCGCTCAAGGTTTTCAAGCTCCACAGACAAAGAAATGGTTGTCTTGCAATTCCACCATTCCATTTTCTTGCCAATTCGTCTTCCCATCACAACAATTGGCGCTAAAATTTCCCTCCCAACCAACCATTTATTGCCCCCATACACCCTCTCTAAACCCCTTCACCGGAATTTTCTATCCAGAGCTACCCGCGACCAAAATCGGAAATTTACAACATCCCCTCCACTGAAATCGAGGTGGTGGATGAAGAAGGTGAAGATTATGATGATGGGGACGACAGATTTTGGAGTGAGAGTGGATTTAGAGGAAGAGAAGGAGAAAAGGACTATGATCGAGACCCTGAATTCGCTGAAATTATAGGAACTAGTCTTGATGATCCAGATAAAGCTCGGTCCAAAGTAAAACTTTCTTATCTCTTCCTCGTCAATCGCTTTTTCATGTTTGTTTTTTCTATTATCCCATTGTGGGTTTATCAACGAAGTACTAAGTGGACATTTTATACTTCAACTTCTTGCTACTGTTTAATGGCAATAATCTCTTTTTGGGTCATTTGTTTGTTTTACGAGTGCAAATATGGTGGCTTAAGTTGAAATGTCTAGATAAACATGTATTGTTGCTTATTAGAAGCGTTCTTGTTTTTGGCTTTTGGTGTTTAACTTGTAAGAGGTACATACTGCAGATGGAAGAGAGGCTGAGGAAGAAAAGAAACAAGATACTTCAGCCAAAGACGGGATCAGCTGTTCCAGTAAAAGTTACATTCAACAAGTAGGATTAGTTTTACCACTTGAATAGATGGATTGTCAATCTTTTCATTTTATTGAATTAACCATGCTCTTGCTTTTCCACCTAACATTTATTTAGGATTTCAATCATTTTTTAAACAAAAATGTTTCAATATGTTCCATTTTACTATGAGAATAAATCCTTAAACTTCTGGTCCTGAAGTATCTAGCTTGGAAAAAACGACTTGGGATGTGTTGCTTAAATCATTGATCCGGTACTAGATGTAGCTTTTCTCCAGGAAAAAAGGCCAAATATTTACTTTGATAGTAAAAGGTCAAGATATTGACAGTCAAGTAATTAATGTGACTTGTAATCTATATGGATGAATGCATTTGATTTGACTATGTCTGAGGCAGTAGAACAGAGAACCCCAATGTTGGACTATCTATACAGCCACAATTTCATAAATGCATATTGTTTCTTCTTGGTTTCCACTAGAGTAGGGAGCTAAATTCTAATTTAGGAAGAATTTCCGTAGCCAACAAGCATATAACTTGTGTGTAGCGATTGCACTACGAATAGTTTTGGTATAGTTCTGATAACTGAGAATTTTATTGCTTCTTATATGCTGAAGCCTGAGTATATGAACCGTTAGAATACAAGCATATTTCCTAGTTATATTCCTATGTACTTAGTATCCATTCTCTCCTTAATCTCTAAAACATATTCAGTTGAGAATAAAAAAACTTATTACTTTTTTGCGTGTCTTTTCTTTCTTCTATATAATTTATGTGAATTATCTGTTGAAGATGTATCTTCTGGCTTAATTATGCATTGAATTCTCCCCATTCCGTTACATATATTCTCTCTTTAGTTTGGAATTGGACATATGAGTTCTCAATGTTTCTTCCAGATTCGATTTCTCAAACTCGTATATATGGTTTGAATTCTACAACACTCCACTGGCAAAAGATATCACCTTAATTTGTGATGTAAGTTTTTGTTTCTGTTATCTTAGCTTCTGCCTAAAAAACTTCAATGAATTAGATCTATTATCTTTGTTCACAGACCATAAGGTCATGGCACATCATTGGACGTCTTGGTGGATGCAATTCCATGAATATGCAGGTAAACTTCAATTTCACTCACGTTGTTATAGGATTGCATGATTTTGTTTGGAAAATTAGCCTTCAGCAAATTTTATGAATCATATTTGCTGTCGATGATCATTTAAGTTAATCTTTACAGCTGTCTCAATCTCCTTTGGATAAACGGCCAAGTTATGATGCTATTCAGGGAGCTAATGTTAATCCAACCACATTTTATAACATTGGGGATTTTGAGGTTCAAGACAACTTGGCTCGCATATGGTAATATCTCTCAATCTAATTTTCTGGATGTTGCTGTCATTTTCACAACATTTACGGAGGTGTTCGTGATACCAGGAAAGAGCTTAAGATGTCAGCGTTCACTTTTTTTTCTCATTTATCTATTTTACTTTATGCACAGGGTTGATATTGGGACCAGCGAACCTTTGCTACTAGATGTTTTGATAAATGCATTAATCCAGATAAGCTCTGAGTAA

mRNA sequence

ATGGGGAAAGTGAGGAAGAAGAGTAGAAAGAGCTACCCGCGACCAAAATCGGAAATTTACAACATCCCCTCCACTGAAATCGAGGTGGTGGATGAAGAAGGTGAAGATTATGATGATGGGGACGACAGATTTTGGAGTGAGAGTGGATTTAGAGGAAGAGAAGGAGAAAAGGACTATGATCGAGACCCTGAATTCGCTGAAATTATAGGAACTAGTCTTGATGATCCAGATAAAGCTCGGTCCAAAATGGAAGAGAGGCTGAGGAAGAAAAGAAACAAGATACTTCAGCCAAAGACGGGATCAGCTGTTCCAGTAAAAGTTACATTCAACAAATTCGATTTCTCAAACTCGTATATATGGTTTGAATTCTACAACACTCCACTGGCAAAAGATATCACCTTAATTTGTGATACCATAAGGTCATGGCACATCATTGGACGTCTTGGTGGATGCAATTCCATGAATATGCAGCTGTCTCAATCTCCTTTGGATAAACGGCCAAGTTATGATGCTATTCAGGGAGCTAATGTTAATCCAACCACATTTTATAACATTGGGGATTTTGAGGTTCAAGACAACTTGGCTCGCATATGGGTTGATATTGGGACCAGCGAACCTTTGCTACTAGATGTTTTGATAAATGCATTAATCCAGATAAGCTCTGAGTAA

Coding sequence (CDS)

ATGGGGAAAGTGAGGAAGAAGAGTAGAAAGAGCTACCCGCGACCAAAATCGGAAATTTACAACATCCCCTCCACTGAAATCGAGGTGGTGGATGAAGAAGGTGAAGATTATGATGATGGGGACGACAGATTTTGGAGTGAGAGTGGATTTAGAGGAAGAGAAGGAGAAAAGGACTATGATCGAGACCCTGAATTCGCTGAAATTATAGGAACTAGTCTTGATGATCCAGATAAAGCTCGGTCCAAAATGGAAGAGAGGCTGAGGAAGAAAAGAAACAAGATACTTCAGCCAAAGACGGGATCAGCTGTTCCAGTAAAAGTTACATTCAACAAATTCGATTTCTCAAACTCGTATATATGGTTTGAATTCTACAACACTCCACTGGCAAAAGATATCACCTTAATTTGTGATACCATAAGGTCATGGCACATCATTGGACGTCTTGGTGGATGCAATTCCATGAATATGCAGCTGTCTCAATCTCCTTTGGATAAACGGCCAAGTTATGATGCTATTCAGGGAGCTAATGTTAATCCAACCACATTTTATAACATTGGGGATTTTGAGGTTCAAGACAACTTGGCTCGCATATGGGTTGATATTGGGACCAGCGAACCTTTGCTACTAGATGTTTTGATAAATGCATTAATCCAGATAAGCTCTGAGTAA

Protein sequence

MGKVRKKSRKSYPRPKSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDDPDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLARIWVDIGTSEPLLLDVLINALIQISSE
BLAST of CsaV3_UNG050390 vs. NCBI nr
Match: XP_004135692.1 (PREDICTED: uncharacterized protein LOC101205319 [Cucumis sativus] >XP_011659965.1 PREDICTED: uncharacterized protein LOC101205319 [Cucumis sativus] >KGN66207.1 hypothetical protein Csa_1G580220 [Cucumis sativus])

HSP 1 Score: 419.5 bits (1077), Expect = 7.3e-114
Identity = 206/207 (99.52%), Postives = 207/207 (100.00%), Query Frame = 0

Query: 16  KSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDD 75
           KSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDD
Sbjct: 55  KSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDD 114

Query: 76  PDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI 135
           PDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI
Sbjct: 115 PDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI 174

Query: 136 CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLA 195
           CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLA
Sbjct: 175 CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLA 234

Query: 196 RIWVDIGTSEPLLLDVLINALIQISSE 223
           RIWVDIGTSEPLLLDVLINALIQISS+
Sbjct: 235 RIWVDIGTSEPLLLDVLINALIQISSD 261

BLAST of CsaV3_UNG050390 vs. NCBI nr
Match: XP_008450822.1 (PREDICTED: uncharacterized protein LOC103492292 [Cucumis melo] >XP_016901029.1 PREDICTED: uncharacterized protein LOC103492292 [Cucumis melo])

HSP 1 Score: 407.5 bits (1046), Expect = 2.9e-110
Identity = 201/207 (97.10%), Postives = 203/207 (98.07%), Query Frame = 0

Query: 16  KSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDD 75
           KSEIYNIPSTEIEVVDEE EDYDDGDDRFWSESGFRGRE EKDYDRDPEFAEIIGTSLDD
Sbjct: 55  KSEIYNIPSTEIEVVDEEREDYDDGDDRFWSESGFRGREEEKDYDRDPEFAEIIGTSLDD 114

Query: 76  PDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI 135
           P+KARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI
Sbjct: 115 PEKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI 174

Query: 136 CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLA 195
           CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANV PTTFYNIGDFEVQDNLA
Sbjct: 175 CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVTPTTFYNIGDFEVQDNLA 234

Query: 196 RIWVDIGTSEPLLLDVLINALIQISSE 223
           RIWVDIGTSEPLLLDVLINAL QISS+
Sbjct: 235 RIWVDIGTSEPLLLDVLINALTQISSD 261

BLAST of CsaV3_UNG050390 vs. NCBI nr
Match: XP_022960849.1 (uncharacterized protein LOC111461533 [Cucurbita moschata])

HSP 1 Score: 383.6 bits (984), Expect = 4.4e-103
Identity = 193/220 (87.73%), Postives = 204/220 (92.73%), Query Frame = 0

Query: 3   KVRKKSRKSYPRPKSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRD 62
           ++R+   +S    KSE  NIPS+EIEVVDEEGEDY D DD F+S SGFRGREGEKDYDRD
Sbjct: 51  ELRQYVTRSSRDQKSETQNIPSSEIEVVDEEGEDY-DADDGFFSASGFRGREGEKDYDRD 110

Query: 63  PEFAEIIGTSLDDPDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFE 122
           PEFAEIIGTS+DDP+K RSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFE
Sbjct: 111 PEFAEIIGTSVDDPEKFRSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFE 170

Query: 123 FYNTPLAKDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTF 182
           FYNTPLAKDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANV PTTF
Sbjct: 171 FYNTPLAKDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVTPTTF 230

Query: 183 YNIGDFEVQDNLARIWVDIGTSEPLLLDVLINALIQISSE 223
           YNIGDFEVQDNLARIWVDIGTSEPL+LD+LINAL QISS+
Sbjct: 231 YNIGDFEVQDNLARIWVDIGTSEPLILDILINALTQISSD 269

BLAST of CsaV3_UNG050390 vs. NCBI nr
Match: XP_023516267.1 (uncharacterized protein LOC111780172 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 381.3 bits (978), Expect = 2.2e-102
Identity = 192/220 (87.27%), Postives = 204/220 (92.73%), Query Frame = 0

Query: 3   KVRKKSRKSYPRPKSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRD 62
           ++R+   +S    KSE  NIPS+EIEVV+EEGEDY D DD F+S SGFRGREGEKDYDRD
Sbjct: 51  ELRQCVTRSSRDQKSETQNIPSSEIEVVNEEGEDY-DADDGFFSASGFRGREGEKDYDRD 110

Query: 63  PEFAEIIGTSLDDPDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFE 122
           PEFAEIIGTS+DDP+K RSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFE
Sbjct: 111 PEFAEIIGTSVDDPEKFRSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFE 170

Query: 123 FYNTPLAKDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTF 182
           FYNTPLAKDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANV PTTF
Sbjct: 171 FYNTPLAKDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVTPTTF 230

Query: 183 YNIGDFEVQDNLARIWVDIGTSEPLLLDVLINALIQISSE 223
           YNIGDFEVQDNLARIWVDIGTSEPL+LD+LINAL QISS+
Sbjct: 231 YNIGDFEVQDNLARIWVDIGTSEPLILDILINALTQISSD 269

BLAST of CsaV3_UNG050390 vs. NCBI nr
Match: XP_022987807.1 (uncharacterized protein LOC111485243 [Cucurbita maxima])

HSP 1 Score: 378.6 bits (971), Expect = 1.4e-101
Identity = 190/213 (89.20%), Postives = 198/213 (92.96%), Query Frame = 0

Query: 10  KSYPRPKSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEII 69
           KS    KSE  NIPS+EIEVVDEE  DY D DD F+S SGFRGREGEKDYDRDPEFAEII
Sbjct: 55  KSSRDQKSETQNIPSSEIEVVDEEDGDY-DADDGFFSASGFRGREGEKDYDRDPEFAEII 114

Query: 70  GTSLDDPDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLA 129
           GTS+DDP+K RSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLA
Sbjct: 115 GTSVDDPEKFRSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLA 174

Query: 130 KDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFE 189
           KDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYD+IQGANV PTTFYNIGDFE
Sbjct: 175 KDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDSIQGANVTPTTFYNIGDFE 234

Query: 190 VQDNLARIWVDIGTSEPLLLDVLINALIQISSE 223
           VQDNLARIWVDIGTSEPL+LD+LINAL QISS+
Sbjct: 235 VQDNLARIWVDIGTSEPLILDILINALTQISSD 266

BLAST of CsaV3_UNG050390 vs. TAIR10
Match: AT4G29400.1 (Protein of unknown function (DUF3531))

HSP 1 Score: 283.1 bits (723), Expect = 1.5e-76
Identity = 144/204 (70.59%), Postives = 168/204 (82.35%), Query Frame = 0

Query: 26  EIEVVDEEGEDYDDGD-----DRFWSESG--FRGREGEKDYDRDPEFAEIIGTSLDDPDK 85
           E+E   EE E+ DDGD     D F ++    +R ++ + DYD+DPEFA+I+G  LD+PDK
Sbjct: 66  EMEEGVEEFEEVDDGDDDEVEDEFSAKKRGVYRAKKEKIDYDKDPEFADILGDCLDNPDK 125

Query: 86  ARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLICDT 145
           A+ KMEERLRKKRNKIL  KTGSA  + VTFNKF++SNSY+W EFYNTPL KDI LI DT
Sbjct: 126 AQKKMEERLRKKRNKILHTKTGSATSMPVTFNKFEYSNSYMWLEFYNTPLDKDIALISDT 185

Query: 146 IRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLARIW 205
           IRSWHI+GRLGG NSMNMQLSQ+PLDKRP+YDAI GANV PTTFYNIGD EVQDN+ARIW
Sbjct: 186 IRSWHILGRLGGYNSMNMQLSQAPLDKRPNYDAILGANVEPTTFYNIGDLEVQDNVARIW 245

Query: 206 VDIGTSEPLLLDVLINALIQISSE 223
           +DIGTSEPL+LDVLINAL QISS+
Sbjct: 246 LDIGTSEPLILDVLINALTQISSD 269

BLAST of CsaV3_UNG050390 vs. TAIR10
Match: AT5G08400.1 (Protein of unknown function (DUF3531))

HSP 1 Score: 155.2 bits (391), Expect = 4.7e-38
Identity = 73/161 (45.34%), Postives = 107/161 (66.46%), Query Frame = 0

Query: 62  DPEFAEIIGTSLDDPDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWF 121
           D E   ++G S+ +P+  R K+EER+RKK     + KTGS +  KV F  F+  +S+IWF
Sbjct: 140 DEELRAVLGDSIGNPELMRKKVEERVRKKGKDFQKQKTGSVLSFKVNFRDFNPVDSFIWF 199

Query: 122 EFYNTPLAKDITLICDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTT 181
           E Y TP  +D+ LI   I++W+++GRLG  N+ N+QL+ + L+  P YDA +G  V P++
Sbjct: 200 ELYGTPSDRDVDLIGSVIQAWYVMGRLGAFNTSNLQLANTSLEYDPLYDAEKGFKVMPSS 259

Query: 182 FYNIGDFEVQDNLARIWVDIGTSEPLLLDVLINALIQISSE 223
           F++I D E QDN  R+WVD+GTS+   LDVL+N L  +SSE
Sbjct: 260 FHDISDVEFQDNWGRVWVDLGTSDIFALDVLLNCLTVMSSE 300

BLAST of CsaV3_UNG050390 vs. TrEMBL
Match: tr|A0A0A0LZB5|A0A0A0LZB5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G580220 PE=4 SV=1)

HSP 1 Score: 419.5 bits (1077), Expect = 4.8e-114
Identity = 206/207 (99.52%), Postives = 207/207 (100.00%), Query Frame = 0

Query: 16  KSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDD 75
           KSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDD
Sbjct: 55  KSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDD 114

Query: 76  PDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI 135
           PDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI
Sbjct: 115 PDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI 174

Query: 136 CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLA 195
           CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLA
Sbjct: 175 CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLA 234

Query: 196 RIWVDIGTSEPLLLDVLINALIQISSE 223
           RIWVDIGTSEPLLLDVLINALIQISS+
Sbjct: 235 RIWVDIGTSEPLLLDVLINALIQISSD 261

BLAST of CsaV3_UNG050390 vs. TrEMBL
Match: tr|A0A1S3BQT1|A0A1S3BQT1_CUCME (uncharacterized protein LOC103492292 OS=Cucumis melo OX=3656 GN=LOC103492292 PE=4 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 1.9e-110
Identity = 201/207 (97.10%), Postives = 203/207 (98.07%), Query Frame = 0

Query: 16  KSEIYNIPSTEIEVVDEEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDD 75
           KSEIYNIPSTEIEVVDEE EDYDDGDDRFWSESGFRGRE EKDYDRDPEFAEIIGTSLDD
Sbjct: 55  KSEIYNIPSTEIEVVDEEREDYDDGDDRFWSESGFRGREEEKDYDRDPEFAEIIGTSLDD 114

Query: 76  PDKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI 135
           P+KARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI
Sbjct: 115 PEKARSKMEERLRKKRNKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLI 174

Query: 136 CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLA 195
           CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANV PTTFYNIGDFEVQDNLA
Sbjct: 175 CDTIRSWHIIGRLGGCNSMNMQLSQSPLDKRPSYDAIQGANVTPTTFYNIGDFEVQDNLA 234

Query: 196 RIWVDIGTSEPLLLDVLINALIQISSE 223
           RIWVDIGTSEPLLLDVLINAL QISS+
Sbjct: 235 RIWVDIGTSEPLLLDVLINALTQISSD 261

BLAST of CsaV3_UNG050390 vs. TrEMBL
Match: tr|A0A0B0NCN4|A0A0B0NCN4_GOSAR (Dna-directed rna polymerase i subunit rpa2 OS=Gossypium arboreum OX=29729 GN=F383_09136 PE=4 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 8.6e-87
Identity = 158/191 (82.72%), Postives = 176/191 (92.15%), Query Frame = 0

Query: 32  EEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDDPDKARSKMEERLRKKR 91
           E  ++YDD ++ + + SGFRGRE EK+YD+DPEFAEI+G+ LDDP+KARSKME+RLRKKR
Sbjct: 69  EMDDEYDDDNEGYEARSGFRGREEEKNYDKDPEFAEILGSCLDDPEKARSKMEDRLRKKR 128

Query: 92  NKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLICDTIRSWHIIGRLGGC 151
           NKIL  KTGS  P+KVTFNKFDFSNSYIWFEFYNTPL KDI+LICDTIRSWHIIGRLGGC
Sbjct: 129 NKILHTKTGSGTPMKVTFNKFDFSNSYIWFEFYNTPLEKDISLICDTIRSWHIIGRLGGC 188

Query: 152 NSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLARIWVDIGTSEPLLLDV 211
           NSMNMQLSQSPL+KRPSYDAIQGANVNPTTFYNIGD EVQDNLARIWVDIGT+EPL+LDV
Sbjct: 189 NSMNMQLSQSPLEKRPSYDAIQGANVNPTTFYNIGDLEVQDNLARIWVDIGTTEPLILDV 248

Query: 212 LINALIQISSE 223
           LINAL QISS+
Sbjct: 249 LINALTQISSD 259

BLAST of CsaV3_UNG050390 vs. TrEMBL
Match: tr|A0A1U8MU88|A0A1U8MU88_GOSHI (uncharacterized protein LOC107940196 OS=Gossypium hirsutum OX=3635 GN=LOC107940196 PE=4 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 8.6e-87
Identity = 158/191 (82.72%), Postives = 176/191 (92.15%), Query Frame = 0

Query: 32  EEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDDPDKARSKMEERLRKKR 91
           E  ++YDD ++ + + SGFRGRE EK+YD+DPEFAEI+G+ LDDP+KARSKME+RLRKKR
Sbjct: 69  EMDDEYDDDNEGYEARSGFRGREEEKNYDKDPEFAEILGSCLDDPEKARSKMEDRLRKKR 128

Query: 92  NKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLICDTIRSWHIIGRLGGC 151
           NKIL  KTGS  P+KVTFNKFDFSNSYIWFEFYNTPL KDI+LICDTIRSWHIIGRLGGC
Sbjct: 129 NKILHKKTGSGTPMKVTFNKFDFSNSYIWFEFYNTPLEKDISLICDTIRSWHIIGRLGGC 188

Query: 152 NSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLARIWVDIGTSEPLLLDV 211
           NSMNMQLSQSPL+KRPSYDAIQGANVNPTTFYNIGD EVQDNLARIWVDIGT+EPL+LDV
Sbjct: 189 NSMNMQLSQSPLEKRPSYDAIQGANVNPTTFYNIGDLEVQDNLARIWVDIGTTEPLILDV 248

Query: 212 LINALIQISSE 223
           LINAL QISS+
Sbjct: 249 LINALTQISSD 259

BLAST of CsaV3_UNG050390 vs. TrEMBL
Match: tr|A0A0D2S7D3|A0A0D2S7D3_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_004G283500 PE=4 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 1.9e-86
Identity = 158/191 (82.72%), Postives = 175/191 (91.62%), Query Frame = 0

Query: 32  EEGEDYDDGDDRFWSESGFRGREGEKDYDRDPEFAEIIGTSLDDPDKARSKMEERLRKKR 91
           E  +DY D ++ + + SGFRGRE EK+YD+DPEFAEI+G+ LDDP+KARSKME+RLRKKR
Sbjct: 69  EMDDDYGDDNEGYEASSGFRGREEEKNYDKDPEFAEILGSCLDDPEKARSKMEDRLRKKR 128

Query: 92  NKILQPKTGSAVPVKVTFNKFDFSNSYIWFEFYNTPLAKDITLICDTIRSWHIIGRLGGC 151
           NKIL  KTGS  P+KVTFNKFDFSNSYIWFEFYNTPL KDI+LICDTIRSWHIIGRLGGC
Sbjct: 129 NKILHTKTGSGTPMKVTFNKFDFSNSYIWFEFYNTPLEKDISLICDTIRSWHIIGRLGGC 188

Query: 152 NSMNMQLSQSPLDKRPSYDAIQGANVNPTTFYNIGDFEVQDNLARIWVDIGTSEPLLLDV 211
           NSMNMQLSQSPL+KRPSYDAIQGANVNPTTFYNIGD EVQDNLARIWVDIGT+EPL+LDV
Sbjct: 189 NSMNMQLSQSPLEKRPSYDAIQGANVNPTTFYNIGDLEVQDNLARIWVDIGTTEPLILDV 248

Query: 212 LINALIQISSE 223
           LINAL QISS+
Sbjct: 249 LINALTQISSD 259

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004135692.17.3e-11499.52PREDICTED: uncharacterized protein LOC101205319 [Cucumis sativus] >XP_011659965.... [more]
XP_008450822.12.9e-11097.10PREDICTED: uncharacterized protein LOC103492292 [Cucumis melo] >XP_016901029.1 P... [more]
XP_022960849.14.4e-10387.73uncharacterized protein LOC111461533 [Cucurbita moschata][more]
XP_023516267.12.2e-10287.27uncharacterized protein LOC111780172 [Cucurbita pepo subsp. pepo][more]
XP_022987807.11.4e-10189.20uncharacterized protein LOC111485243 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT4G29400.11.5e-7670.59Protein of unknown function (DUF3531)[more]
AT5G08400.14.7e-3845.34Protein of unknown function (DUF3531)[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
tr|A0A0A0LZB5|A0A0A0LZB5_CUCSA4.8e-11499.52Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G580220 PE=4 SV=1[more]
tr|A0A1S3BQT1|A0A1S3BQT1_CUCME1.9e-11097.10uncharacterized protein LOC103492292 OS=Cucumis melo OX=3656 GN=LOC103492292 PE=... [more]
tr|A0A0B0NCN4|A0A0B0NCN4_GOSAR8.6e-8782.72Dna-directed rna polymerase i subunit rpa2 OS=Gossypium arboreum OX=29729 GN=F38... [more]
tr|A0A1U8MU88|A0A1U8MU88_GOSHI8.6e-8782.72uncharacterized protein LOC107940196 OS=Gossypium hirsutum OX=3635 GN=LOC1079401... [more]
tr|A0A0D2S7D3|A0A0D2S7D3_GOSRA1.9e-8682.72Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_004G283500 PE=4 ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR021920DUF3531
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0032774 RNA biosynthetic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005730 nucleolus
molecular_function GO:0003674 molecular_function
molecular_function GO:0003899 DNA-directed RNA polymerase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_UNG050390.1CsaV3_UNG050390.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021920Protein of unknown function DUF3531PFAMPF12049DUF3531coord: 106..222
e-value: 3.5E-44
score: 150.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 32..61
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 43..61
NoneNo IPR availablePANTHERPTHR33102:SF2SUBFAMILY NOT NAMEDcoord: 28..222
NoneNo IPR availablePANTHERPTHR33102FAMILY NOT NAMEDcoord: 28..222