Csa3G187300 (gene) Cucumber (Chinese Long) v2

Name: Csa3G187300
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Nicastrin, putative; contains IPR008710 (Nicastrin)
Location: Chr3 : 13214879 .. 13217944 (-)
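
The location string can be split into its parts to get the span of the gene on the chromosome. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of the database, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Parse a 'Chr : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr3 : 13214879 .. 13217944 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr3 - 3066
```

Under that assumption the gene spans 3066 bp on the minus strand of Chr3.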
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
GAACGATGAAAGAAAGAAAAAAAAGCCCCGCCAAGTTCCCCAATTTTTCCCCTTGTCCGAAGGTCAATTTTCAGTTTAATTTCCCAACCCGATCTGGGTTCTCTAGTCCATTTCACCTTCACTAACGAGGAGCTGAGGTCCCCAAGAACGATCCGACGGACTCCGATTTGATAGAAAGAACACCCAAATCATCGTCCATGTCTTCCCAGTTTCTCTACCTCCTTCTGTTTCTTACTTCTCTTTGTCTCTCTTCATCAGGTATGTTAAAAAGAAATAGCTATTGAATTTTTGTTTCAATGTCTTCTTATTGCTCGTTGGATATGAATGCCATTCAGTGTTGACGTCTTGTGTATAAATTACTTCAAATTTCGTACTCTGTATAGTTTATTGGCAGCTAAAGAGTACATTGTTTGCAGCAATTAGTTGCCGTTTTGATACTTTATATATACATTTCACCACCGAATGTTTTGTTGTACGTGTTTTGTTTGCTTGAACGCCGGGGTCTCTACTTCCCTTCAAGTTCTCATTCCATATACATAATATGTTTTTAACCAGTGGAGTTACTATTCCATTCCCATCAAGCTTATTCTATGGTTCATACCCTTCTGAAGCCTCTCATGTGGCATAAGGTTGGAAGAACATTTTTAGATGTTGCTGTTCCCATTGTATGGTTGATTACTGTTGCTACATCGTTTGGCTAAAATTTCAGTAATATCTCTCAAAAAGAAAAAAAGAAAAGAAAAGGAAAGAAAAGAAAAGAAAAGAAAATTTATGTAATGTCATTACATGCAATTACTTTTTATTTTATTTTTTAATCCAATTACATAATCTATTTACTAAACATCTCATACATGTGCAAGTTTGAGAAGGCTTTGGCTGTGATAGACCTAAATGAGAGATTAGAAGTGGTTTCTTTTTTGAAAGAACCGATCCCACTAGCCTTTTGCTCTAATGTTCACTACTAGTAGTCCTTTGGGATCAGTTAGGTGTGTGAAACTTAGTTGGACTGAGCACATGAATACTAGTTAATTATTTTTACCTCCTTCCCGTGGTTATGTTATTAGGCAGAGTTGGGGTTAAATTGAAAGAACATAGAGCACATGGATCGTGGATAATTATGGGTTTACGCAAAATTATCTTTTGTTCTATTATAATTGTTCTGCTTTTTGAACGGGCATTTTTATTTTGGCTACAAAGCATTCAGCATCTATCCTGACATGTTATGATTTTTGATAATGCATTCAATAGATGAACACTCAATGGAGTCGGTTCCCGATCTCCAAAATTCGATGTACCTAGCAGTTGATGCTTATCCGTGCATTCGGTTACTTAATCTTTCTGGAGAAATTGGCTGTTCAAGTAGGTTCTTTACACTAGATGTAAAAAATCAATTATTTTTCGAGTGCCAACATAATCCTCAGTGGAATATGGTGGCATCATCCATGTCTTTTTACTTTGTACCTCAGATCCTGGACGAGAAAAGGTTGTAGTTCCGATGATTAACTTCAAGGATGCTGATGAGATATTGCAACCATCTGCAGTTTTAGTTTCAATGGATGCAATCTCTAGTTTCTTTACTAGGTGGGTCTTACATTTTTTATATGTACTCCCTTGTTCATACATATTCTTGTTCCCTTATAAGGATTATTTTCCGAGCCTCGGGTCTAAAACACTTTCCAAACCTCCAAATGGAAAATTAAAGATAAGGGAAGATAACCATTTTTTAAAAATAAATATTAATAAAAAGAAGAAACAAGTTTTCTTTATTAATGAAGAATAGAGTACAAGAATGAAAAACAGGCTAAAAGGAGTAGACTGAGAAAATTAAAAAATAAAGCAAATAAAGGGTAGAACTACAAGAAAACACCTCAATTATAACAAATATATTGGGAAGAGTAACCCTCAAAAGTGTTTGAGAGGGAGGATCAAGAAGCTTTTATCTTGACAAACTCAAAAGGATCTGACTGTTAATAAAATTTGTCCTCAAAGATTCATTGATT
TCGTTCCAGCCAAATGTCTAAAAGAATGGCTTTTAAAAAATTGATCTAAATAAGCTTAGATGCGATTAGAACAAGTAGAAAACAAAGAATTTGAAATATATTGTCAGAAACATCTTCACCAGAAACATATCTTAAGAAAATAGAGAAGGAAGAAAACAAGCAAATTTGAGAAAAAGCACTTACGGAAGAAGAAGATATGAACTTTATCTTCCAAAACTGCAAAACTCAAAGTGCAAAAAGAGGAAATAAAACCAGGAATATTCTTTTGGAATTGGAAGAAAGGGTTCGGAGTTGAATAATATCACGTTTCCTTTCCAAAAAATATCTATGAAAATCAAATTCTCCTTTGGATTTCTTGTACTAAAATATATCACTACTTGTGATGACATTCACATAAAAGATCCTCAATTTTGCTTTTCCTTGCATATTTTGTGAACCGTTTCTCCAATGGGATGATTATATATATATAAAATTTAAAAAATATATAAAAGCAGCAACTTTCATTGAATTTTTTTTTTAAAATACAAGGGCATACAAAAAAGCAAGTCCACCAAAAGGCTCCAACTAAGTGAATAAATGCTCTCCCTCCACTTTCTTCTTTGCTTATTATAATTGGACGAAAACTATCTACTCAGATAGAATTCATATATCTATTTCAGACTACAGGACGATTCTCATTTTGCAAATAATGTTGGTGGTGTTTTAATCGAACCAGGAACTGGAATACAAAATAGAACTGAAGGTAAAGTTCATTTACTTATTCAAATTCTTTTCCCTTCATTTTGTTTTCTTTCCTCTTCTCTCTCTCTCTGTTAATTGTGTGGTAACTGGTAATTCATGCATGCACATCAACTACTGACATGGTCATTTATTTTTCCTTGCAGGATTTTCTCCGGCCCAAAAGTTTCCACAAGCTAAATTTGCTCCGTATGAAAAAAGTGACTATGAATGGAACCCAAGTGTATGTTTGAATAATCATACAAAATTCTCAGATTTTTTTAGGAATGGTCAACTTTTAGAAGCTAAACTGCAAAACTCTTATATTGTTCTCATTTGTAAAACTTAG

mRNA sequence

ATGTCTTCCCAGTTTCTCTACCTCCTTCTGTTTCTTACTTCTCTTTGTCTCTCTTCATCAGATGAACACTCAATGGAGTCGGTTCCCGATCTCCAAAATTCGATGTACCTAGCAGTTGATGCTTATCCGTGCATTCGGTTACTTAATCTTTCTGGAGAAATTGGCTGTTCAAATCCTGGACGAGAAAAGGTTGTAGTTCCGATGATTAACTTCAAGGATGCTGATGAGATATTGCAACCATCTGCAGTTTTAGTTTCAATGGATGCAATCTCTAGTTTCTTTACTAGACTACAGGACGATTCTCATTTTGCAAATAATGTTGGTGGTGTTTTAATCGAACCAGGAACTGGAATACAAAATAGAACTGAAGGATTTTCTCCGGCCCAAAAGTTTCCACAAGCTAAATTTGCTCCGTATGAAAAAAGTGACTATGAATGGAACCCAAGTGTATGTTTGAATAATCATACAAAATTCTCAGATTTTTTTAGGAATGGTCAACTTTTAGAAGCTAAACTGCAAAACTCTTATATTGTTCTCATTTGTAAAACTTAG

Coding sequence (CDS)

ATGTCTTCCCAGTTTCTCTACCTCCTTCTGTTTCTTACTTCTCTTTGTCTCTCTTCATCAGATGAACACTCAATGGAGTCGGTTCCCGATCTCCAAAATTCGATGTACCTAGCAGTTGATGCTTATCCGTGCATTCGGTTACTTAATCTTTCTGGAGAAATTGGCTGTTCAAATCCTGGACGAGAAAAGGTTGTAGTTCCGATGATTAACTTCAAGGATGCTGATGAGATATTGCAACCATCTGCAGTTTTAGTTTCAATGGATGCAATCTCTAGTTTCTTTACTAGACTACAGGACGATTCTCATTTTGCAAATAATGTTGGTGGTGTTTTAATCGAACCAGGAACTGGAATACAAAATAGAACTGAAGGATTTTCTCCGGCCCAAAAGTTTCCACAAGCTAAATTTGCTCCGTATGAAAAAAGTGACTATGAATGGAACCCAAGTGTATGTTTGAATAATCATACAAAATTCTCAGATTTTTTTAGGAATGGTCAACTTTTAGAAGCTAAACTGCAAAACTCTTATATTGTTCTCATTTGTAAAACTTAG

Protein sequence

MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPGREKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQNRTEGFSPAQKFPQAKFAPYEKSDYEWNPSVCLNNHTKFSDFFRNGQLLEAKLQNSYIVLICKT*
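
The protein sequence above corresponds to the CDS read codon by codon, with `*` marking the stop codon. A minimal sketch of that translation follows, assuming the standard genetic code (the record does not name the table used); `translate` is a hypothetical helper, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption;
# the record does not say which translation table was applied).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGTCTTCCCAGTTTCTCTACCTCCTTCTG"  # first 30 nt of the CDS
cds_end = "ATTTGTAAAACTTAG"                   # last 15 nt of the CDS

print(translate(cds_start))  # MSSQFLYLLL
print(translate(cds_end))    # ICKT*
```

Both fragments agree with the start (`MSSQFLYLLL`) and end (`ICKT*`) of the listed protein sequence.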
BLAST of Csa3G187300 vs. Swiss-Prot
Match: NICA_ARATH (Nicastrin OS=Arabidopsis thaliana GN=At3g52640/At3g52650 PE=2 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 2.7e-41
Identity = 80/148 (54.05%), Positives = 106/148 (71.62%), Query Frame = 1

Query: 1   MSSQFLYLLLFLTSLCLSSSDE-HSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNP 60
           +S  F  +LL +  L LS +DE  S+ESVPDLQ  MY+AVD +PC+RLLNLSGEIGCSNP
Sbjct: 9   LSIAFTLVLLSILPLHLSLADEITSIESVPDLQKLMYVAVDGFPCVRLLNLSGEIGCSNP 68

Query: 61  GREKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQ 120
           G  KVV P+I  KD  +++QP  +LV+ D +  FFTR+  D  FA+ +GGVL+E G+  Q
Sbjct: 69  GINKVVAPIIKLKDVKDLVQPHTILVTADEMEDFFTRVSTDLSFASKIGGVLVESGSNFQ 128

Query: 121 NRTEGFSPAQKFPQAKFAPYEKSDYEWN 148
            + +GFSP ++FPQA+F+PYE  +Y+WN
Sbjct: 129 QKLKGFSPDKRFPQAQFSPYENVEYKWN 156
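
The percentages in the HSP header are ratios over the alignment length, and the identity count can be recovered by comparing the aligned query and subject residues position by position. The sketch below illustrates this on the last alignment segment of this HSP (query 121..148); `percent` and `count_identities` are hypothetical helpers, not BLAST functions.

```python
def percent(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100.0 * part / whole, 2)

def count_identities(query, subject):
    """Count aligned positions where both sequences have the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned segments must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

# Header values reported for this HSP: 80 identities and 106 positives
# over an alignment length of 148.
print(f"{percent(80, 148):.2f}")   # 54.05
print(f"{percent(106, 148):.2f}")  # 71.62

# Last segment of the alignment (query 121..148 vs. subject 129..156).
query_seg = "NRTEGFSPAQKFPQAKFAPYEKSDYEWN"
sbjct_seg = "QKLKGFSPDKRFPQAQFSPYENVEYKWN"
print(count_identities(query_seg, sbjct_seg), len(query_seg))  # 15 28
```

The 15 identical positions match the 15 letters shown on the midline of that segment; the `+` symbols on the midline mark conservative substitutions, which count toward the positive-match total but not toward identity.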

BLAST of Csa3G187300 vs. Swiss-Prot
Match: NICA_DICDI (Nicastrin OS=Dictyostelium discoideum GN=ncstn PE=3 SV=2)

HSP 1 Score: 57.8 bits (138), Expect = 1.5e-07
Identity = 41/152 (26.97%), Positives = 81/152 (53.29%), Query Frame = 1

Query: 7   YLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNP-GREKVV 66
           ++++F+  + + S+D  S +S   +++ MY ++++YPC R++ L+G+IGCS+  G +  +
Sbjct: 7   FIIVFI--IIVLSTDVISSQS--SIEDKMYTSLNSYPCTRIMTLNGQIGCSSSHGGDSGI 66

Query: 67  VPMINFKDADEIL-------QPSAVLVSMDAISSFFTR-LQDDSHFANNVGGVLIEPGTG 126
           + +I   D+DE         Q   ++V  D  S++F + L  + +    + G L+    G
Sbjct: 67  LYLI---DSDESYHNYFSYNQQKDIIVVFD--SNYFNKTLVLEMYSKKKMNGALVLTDIG 126

Query: 127 IQNRTEGFSPAQKFPQAKFAPYEKSDYEWNPS 150
              +T  +SP  ++P  +F  Y  S+  WNP+
Sbjct: 127 ---KTYPYSPEDQYPIKQFGLYPDSNLNWNPN 146

BLAST of Csa3G187300 vs. TrEMBL
Match: A0A0A0LBP2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G187300 PE=4 SV=1)

HSP 1 Score: 371.7 bits (953), Expect = 5.2e-100
Identity = 183/183 (100.00%), Positives = 183/183 (100.00%), Query Frame = 1

Query: 1   MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG 60
           MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG
Sbjct: 1   MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG 60

Query: 61  REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 120
           REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN
Sbjct: 61  REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 120

Query: 121 RTEGFSPAQKFPQAKFAPYEKSDYEWNPSVCLNNHTKFSDFFRNGQLLEAKLQNSYIVLI 180
           RTEGFSPAQKFPQAKFAPYEKSDYEWNPSVCLNNHTKFSDFFRNGQLLEAKLQNSYIVLI
Sbjct: 121 RTEGFSPAQKFPQAKFAPYEKSDYEWNPSVCLNNHTKFSDFFRNGQLLEAKLQNSYIVLI 180

Query: 181 CKT 184
           CKT
Sbjct: 181 CKT 183

BLAST of Csa3G187300 vs. TrEMBL
Match: A0A061FP76_THECC (Zn-dependent exopeptidases superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_043862 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 2.3e-47
Identity = 93/142 (65.49%), Positives = 114/142 (80.28%), Query Frame = 1

Query: 8   LLLFLTSLCLSSSDE-HSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPGREKVVV 67
           LLLF     LS SD+ +SMESVPDLQ SMY+ VD YPC+RL+NLSGEIGCSNPGR+KVV 
Sbjct: 9   LLLFSFQSRLSLSDQTNSMESVPDLQKSMYMVVDGYPCVRLVNLSGEIGCSNPGRDKVVA 68

Query: 68  PMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQNRTEGFS 127
           P++ +KD  E+ QPSA+L+SMD +  FF+R+ +DS FA NVGGVL+E G  IQN+ +GFS
Sbjct: 69  PIVKYKDTKELGQPSAILLSMDDVQGFFSRVSNDSSFARNVGGVLVESGIEIQNKLKGFS 128

Query: 128 PAQKFPQAKFAPYEKSDYEWNP 149
           PAQKFPQA+FAPY  + YEWNP
Sbjct: 129 PAQKFPQAEFAPYHNTSYEWNP 150

BLAST of Csa3G187300 vs. TrEMBL
Match: A0A061FPX7_THECC (Zn-dependent exopeptidases superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_043862 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 2.3e-47
Identity = 93/142 (65.49%), Positives = 114/142 (80.28%), Query Frame = 1

Query: 8   LLLFLTSLCLSSSDE-HSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPGREKVVV 67
           LLLF     LS SD+ +SMESVPDLQ SMY+ VD YPC+RL+NLSGEIGCSNPGR+KVV 
Sbjct: 9   LLLFSFQSRLSLSDQTNSMESVPDLQKSMYMVVDGYPCVRLVNLSGEIGCSNPGRDKVVA 68

Query: 68  PMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQNRTEGFS 127
           P++ +KD  E+ QPSA+L+SMD +  FF+R+ +DS FA NVGGVL+E G  IQN+ +GFS
Sbjct: 69  PIVKYKDTKELGQPSAILLSMDDVQGFFSRVSNDSSFARNVGGVLVESGIEIQNKLKGFS 128

Query: 128 PAQKFPQAKFAPYEKSDYEWNP 149
           PAQKFPQA+FAPY  + YEWNP
Sbjct: 129 PAQKFPQAEFAPYHNTSYEWNP 150

BLAST of Csa3G187300 vs. TrEMBL
Match: D7TSK0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0006g00380 PE=4 SV=1)

HSP 1 Score: 189.5 bits (480), Expect = 3.6e-45
Identity = 92/153 (60.13%), Positives = 115/153 (75.16%), Query Frame = 1

Query: 1   MSSQFLYLLL--FLTSL---CLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIG 60
           M S  +YLLL  F+  L    L+     S+ESVPDL+ SMY+ VD YPC+RLLNLSGEIG
Sbjct: 1   MESNLIYLLLLFFVAQLHPPLLAEEAMRSLESVPDLEKSMYMVVDGYPCVRLLNLSGEIG 60

Query: 61  CSNPGREKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPG 120
           CSNPGREKVV P++ FK+ + + Q SA+LVS+D I SFFTRL  DS+FA NVGGVL+E  
Sbjct: 61  CSNPGREKVVAPIVRFKNVNVLAQSSAILVSLDEIQSFFTRLSHDSNFARNVGGVLVESV 120

Query: 121 TGIQNRTEGFSPAQKFPQAKFAPYEKSDYEWNP 149
           +  QN+ +GFSP +KFPQA+FAPY+  +YEWNP
Sbjct: 121 SASQNKLKGFSPVEKFPQAEFAPYQSINYEWNP 153

BLAST of Csa3G187300 vs. TrEMBL
Match: W9SA84_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004947 PE=4 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 5.2e-44
Identity = 91/150 (60.67%), Positives = 112/150 (74.67%), Query Frame = 1

Query: 1   MSSQFLYL-LLFLTSLCLSSSDEHS-MESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSN 60
           M+  FLY  LLF   L  S + + + MESVPDLQNSMY  VD YPC+RL+N SGEIGCSN
Sbjct: 1   MAFNFLYFALLFTFHLQFSLAGQTNLMESVPDLQNSMYKVVDGYPCVRLVNSSGEIGCSN 60

Query: 61  PGREKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGI 120
           PGREKVV P+I FKDA+ + + S +L+S D + SFF R+ DDS FA NVGGVL+E G   
Sbjct: 61  PGREKVVAPIIRFKDANTLSRSSTILLSFDEVESFFARIADDSTFARNVGGVLVESGAEF 120

Query: 121 QNRTEGFSPAQKFPQAKFAPYEKSDYEWNP 149
           Q++  GFSPA KFPQA+FAPY+ ++YEWNP
Sbjct: 121 QDKLRGFSPAHKFPQAEFAPYKSTNYEWNP 150

BLAST of Csa3G187300 vs. TAIR10
Match: AT3G52640.2 (AT3G52640.2 Zn-dependent exopeptidases superfamily protein)

HSP 1 Score: 169.9 bits (429), Expect = 1.5e-42
Identity = 80/148 (54.05%), Positives = 106/148 (71.62%), Query Frame = 1

Query: 1   MSSQFLYLLLFLTSLCLSSSDE-HSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNP 60
           +S  F  +LL +  L LS +DE  S+ESVPDLQ  MY+AVD +PC+RLLNLSGEIGCSNP
Sbjct: 9   LSIAFTLVLLSILPLHLSLADEITSIESVPDLQKLMYVAVDGFPCVRLLNLSGEIGCSNP 68

Query: 61  GREKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQ 120
           G  KVV P+I  KD  +++QP  +LV+ D +  FFTR+  D  FA+ +GGVL+E G+  Q
Sbjct: 69  GINKVVAPIIKLKDVKDLVQPHTILVTADEMEDFFTRVSTDLSFASKIGGVLVESGSNFQ 128

Query: 121 NRTEGFSPAQKFPQAKFAPYEKSDYEWN 148
            + +GFSP ++FPQA+F+PYE  +Y+WN
Sbjct: 129 QKLKGFSPDKRFPQAQFSPYENVEYKWN 156

BLAST of Csa3G187300 vs. NCBI nr
Match: gi|700202323|gb|KGN57456.1| (hypothetical protein Csa_3G187300 [Cucumis sativus])

HSP 1 Score: 371.7 bits (953), Expect = 7.4e-100
Identity = 183/183 (100.00%), Positives = 183/183 (100.00%), Query Frame = 1

Query: 1   MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG 60
           MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG
Sbjct: 1   MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG 60

Query: 61  REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 120
           REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN
Sbjct: 61  REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 120

Query: 121 RTEGFSPAQKFPQAKFAPYEKSDYEWNPSVCLNNHTKFSDFFRNGQLLEAKLQNSYIVLI 180
           RTEGFSPAQKFPQAKFAPYEKSDYEWNPSVCLNNHTKFSDFFRNGQLLEAKLQNSYIVLI
Sbjct: 121 RTEGFSPAQKFPQAKFAPYEKSDYEWNPSVCLNNHTKFSDFFRNGQLLEAKLQNSYIVLI 180

Query: 181 CKT 184
           CKT
Sbjct: 181 CKT 183

BLAST of Csa3G187300 vs. NCBI nr
Match: gi|449445945|ref|XP_004140732.1| (PREDICTED: nicastrin [Cucumis sativus])

HSP 1 Score: 302.4 bits (773), Expect = 5.5e-79
Identity = 149/149 (100.00%), Positives = 149/149 (100.00%), Query Frame = 1

Query: 1   MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG 60
           MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG
Sbjct: 1   MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG 60

Query: 61  REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 120
           REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN
Sbjct: 61  REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 120

Query: 121 RTEGFSPAQKFPQAKFAPYEKSDYEWNPS 150
           RTEGFSPAQKFPQAKFAPYEKSDYEWNPS
Sbjct: 121 RTEGFSPAQKFPQAKFAPYEKSDYEWNPS 149

BLAST of Csa3G187300 vs. NCBI nr
Match: gi|659114563|ref|XP_008457115.1| (PREDICTED: nicastrin isoform X1 [Cucumis melo])

HSP 1 Score: 285.0 bits (728), Expect = 9.1e-74
Identity = 140/148 (94.59%), Positives = 144/148 (97.30%), Query Frame = 1

Query: 1   MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG 60
           MSSQFLYLLLFLTSL LSSSDE  MESVPDLQNSMYLAVD YPCIRLLNLSGEIGCSNPG
Sbjct: 19  MSSQFLYLLLFLTSLRLSSSDEQKMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPG 78

Query: 61  REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 120
           REKVV+PMINFKDADEIL+PSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN
Sbjct: 79  REKVVLPMINFKDADEILEPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 138

Query: 121 RTEGFSPAQKFPQAKFAPYEKSDYEWNP 149
           RTEGFSPAQKFPQAKFAPY+K+DYEWNP
Sbjct: 139 RTEGFSPAQKFPQAKFAPYKKNDYEWNP 166

BLAST of Csa3G187300 vs. NCBI nr
Match: gi|659114565|ref|XP_008457117.1| (PREDICTED: nicastrin isoform X2 [Cucumis melo])

HSP 1 Score: 285.0 bits (728), Expect = 9.1e-74
Identity = 140/148 (94.59%), Positives = 144/148 (97.30%), Query Frame = 1

Query: 1   MSSQFLYLLLFLTSLCLSSSDEHSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPG 60
           MSSQFLYLLLFLTSL LSSSDE  MESVPDLQNSMYLAVD YPCIRLLNLSGEIGCSNPG
Sbjct: 19  MSSQFLYLLLFLTSLRLSSSDEQKMESVPDLQNSMYLAVDGYPCIRLLNLSGEIGCSNPG 78

Query: 61  REKVVVPMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 120
           REKVV+PMINFKDADEIL+PSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN
Sbjct: 79  REKVVLPMINFKDADEILEPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQN 138

Query: 121 RTEGFSPAQKFPQAKFAPYEKSDYEWNP 149
           RTEGFSPAQKFPQAKFAPY+K+DYEWNP
Sbjct: 139 RTEGFSPAQKFPQAKFAPYKKNDYEWNP 166

BLAST of Csa3G187300 vs. NCBI nr
Match: gi|590566687|ref|XP_007010306.1| (Zn-dependent exopeptidases superfamily protein isoform 2 [Theobroma cacao])

HSP 1 Score: 196.8 bits (499), Expect = 3.3e-47
Identity = 93/142 (65.49%), Positives = 114/142 (80.28%), Query Frame = 1

Query: 8   LLLFLTSLCLSSSDE-HSMESVPDLQNSMYLAVDAYPCIRLLNLSGEIGCSNPGREKVVV 67
           LLLF     LS SD+ +SMESVPDLQ SMY+ VD YPC+RL+NLSGEIGCSNPGR+KVV 
Sbjct: 9   LLLFSFQSRLSLSDQTNSMESVPDLQKSMYMVVDGYPCVRLVNLSGEIGCSNPGRDKVVA 68

Query: 68  PMINFKDADEILQPSAVLVSMDAISSFFTRLQDDSHFANNVGGVLIEPGTGIQNRTEGFS 127
           P++ +KD  E+ QPSA+L+SMD +  FF+R+ +DS FA NVGGVL+E G  IQN+ +GFS
Sbjct: 69  PIVKYKDTKELGQPSAILLSMDDVQGFFSRVSNDSSFARNVGGVLVESGIEIQNKLKGFS 128

Query: 128 PAQKFPQAKFAPYEKSDYEWNP 149
           PAQKFPQA+FAPY  + YEWNP
Sbjct: 129 PAQKFPQAEFAPYHNTSYEWNP 150

The following BLAST results are available for this feature:

Database: Swiss-Prot
Match Name    E-value    Identity (%)    Description
NICA_ARATH    2.7e-41    54.05           Nicastrin OS=Arabidopsis thaliana GN=At3g52640/At3g52650 PE=2 SV=1
NICA_DICDI    1.5e-07    26.97           Nicastrin OS=Dictyostelium discoideum GN=ncstn PE=3 SV=2

Database: TrEMBL
Match Name          E-value     Identity (%)    Description
A0A0A0LBP2_CUCSA    5.2e-100    100.00          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G187300 PE=4 SV=1
A0A061FP76_THECC    2.3e-47     65.49           Zn-dependent exopeptidases superfamily protein isoform 1 OS=Theobroma cacao GN=T...
A0A061FPX7_THECC    2.3e-47     65.49           Zn-dependent exopeptidases superfamily protein isoform 2 OS=Theobroma cacao GN=T...
D7TSK0_VITVI        3.6e-45     60.13           Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0006g00380 PE=4 SV=...
W9SA84_9ROSA        5.2e-44     60.67           Uncharacterized protein OS=Morus notabilis GN=L484_004947 PE=4 SV=1

Database: TAIR10
Match Name     E-value    Identity (%)    Description
AT3G52640.2    1.5e-42    54.05           Zn-dependent exopeptidases superfamily protein

Database: NCBI nr
Match Name                          E-value     Identity (%)    Description
gi|700202323|gb|KGN57456.1|         7.4e-100    100.00          hypothetical protein Csa_3G187300 [Cucumis sativus]
gi|449445945|ref|XP_004140732.1|    5.5e-79     100.00          PREDICTED: nicastrin [Cucumis sativus]
gi|659114563|ref|XP_008457115.1|    9.1e-74     94.59           PREDICTED: nicastrin isoform X1 [Cucumis melo]
gi|659114565|ref|XP_008457117.1|    9.1e-74     94.59           PREDICTED: nicastrin isoform X2 [Cucumis melo]
gi|590566687|ref|XP_007010306.1|    3.3e-47     65.49           Zn-dependent exopeptidases superfamily protein isoform 2 [Theobroma cacao]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term         Definition
IPR008710    Nicastrin

Vocabulary: Cellular Component
Term          Definition
GO:0016021    integral component of membrane

Vocabulary: Biological Process
Term          Definition
GO:0016485    protein processing

GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0016485        protein processing
cellular_component    GO:0005783        endoplasmic reticulum
cellular_component    GO:0005798        Golgi-associated vesicle
cellular_component    GO:0016021        integral component of membrane
cellular_component    GO:0005774        vacuolar membrane
molecular_function    GO:0003674        molecular_function

This gene is associated with the following unigenes:
Unigene Name    Analysis Name                          Sequence type in Unigene
CU094906        cucumber EST collection version 3.0    transcribed_cluster
CU112104        cucumber EST collection version 3.0    transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Csa3G187300.1    Csa3G187300.1    mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name    Unique Name    Type
CU112104        CU112104       transcribed_cluster
CU094906        CU094906       transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term     IPR Description     Source     Source Term      Source Description    Alignment
IPR008710    Nicastrin           PANTHER    PTHR21092        NICASTRIN             coord: 1..148; score: 8.6
None         No IPR available    PANTHER    PTHR21092:SF0    NICASTRIN             coord: 1..148; score: 8.6

The following gene(s) are orthologous to this gene:
Gene           Orthologue      Organism                     Block
Csa3G187300    CSPI03G17720    Wild cucumber (PI 183967)    cpicuB120
Csa3G187300    Cucsa.108560    Cucumber (Gy14) v1           cgycuB132

The following gene(s) are paralogous to this gene:

None