Csa1G043220 (gene) Cucumber (Chinese Long) v2

Name: Csa1G043220
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Kunitz trypsin inhibitor; contains IPR002160 (Proteinase inhibitor I3, Kunitz legume)
Location: Chr1 : 4797141 .. 4797758 (-)
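
The location above can be turned into a feature length with a small helper. This is a minimal sketch: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

# Assumed layout of the location string: "<seqid> : <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Chr1 : 4797141 .. 4797758 (-)")
print(seqid, strand, feature_length(start, end))  # Chr1 - 618
```

Under that assumption the feature spans 618 nt, that is 206 codons, which agrees with a 205-residue protein plus a stop codon.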
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAAACTTTGCATTACTGTGTTTTCTTTTCATTGTCATTGCCTCATCTGAGGTACGCTTCTGCAGAGCGGATGCCTCCCCCGATGCGGTCCTCGACACCGACGGAAAGAAGCTTCGAGCCGGTGATCAATATTATATTCTTTCAGTTTACAGCAGAAACAGTGGTGGATTAAGCATCGGCGGTATCTACGGATACGAAAAATGTCCCATCAACATCCTCCCGGAATCATACGATTATTTACACGGTCTACCGGCGACGTTTTCCCCCATAAACCCTAAAAAGGGTGTGGTCCGAGTTTCTACAGATTTGAACATCCAATTCGAGGCGAACACGAGGTGCGGAATATCGACGGTATGGAAAGTAGGTAAGTTTGATGAATACTTGAAGCAGTATTTTGTGACGATGGGGGGAATGAAAGGGAATCCGGGGCGGGAAACAATAGAAAATTGGTTCAAAGTGGAGAAGTATGGTAAGAATTATAAGTTGGTGTATTGTCCAACAGTTTGTAAGTATTGCAAAGTAGTGTGCAAAGATGTGGGATTATTTTACAAGAATGGAAGGAGGGTTATTGCTTTGAATGATGCTCCATTCCCTGTTATGTTCAAGAAAGTTTGA

mRNA sequence

ATGAGAAACTTTGCATTACTGTGTTTTCTTTTCATTGTCATTGCCTCATCTGAGGTACGCTTCTGCAGAGCGGATGCCTCCCCCGATGCGGTCCTCGACACCGACGGAAAGAAGCTTCGAGCCGGTGATCAATATTATATTCTTTCAGTTTACAGCAGAAACAGTGGTGGATTAAGCATCGGCGGTATCTACGGATACGAAAAATGTCCCATCAACATCCTCCCGGAATCATACGATTATTTACACGGTCTACCGGCGACGTTTTCCCCCATAAACCCTAAAAAGGGTGTGGTCCGAGTTTCTACAGATTTGAACATCCAATTCGAGGCGAACACGAGGTGCGGAATATCGACGGTATGGAAAGTAGGTAAGTTTGATGAATACTTGAAGCAGTATTTTGTGACGATGGGGGGAATGAAAGGGAATCCGGGGCGGGAAACAATAGAAAATTGGTTCAAAGTGGAGAAGTATGGTAAGAATTATAAGTTGGTGTATTGTCCAACAGTTTGTAAGTATTGCAAAGTAGTGTGCAAAGATGTGGGATTATTTTACAAGAATGGAAGGAGGGTTATTGCTTTGAATGATGCTCCATTCCCTGTTATGTTCAAGAAAGTTTGA

Coding sequence (CDS)

ATGAGAAACTTTGCATTACTGTGTTTTCTTTTCATTGTCATTGCCTCATCTGAGGTACGCTTCTGCAGAGCGGATGCCTCCCCCGATGCGGTCCTCGACACCGACGGAAAGAAGCTTCGAGCCGGTGATCAATATTATATTCTTTCAGTTTACAGCAGAAACAGTGGTGGATTAAGCATCGGCGGTATCTACGGATACGAAAAATGTCCCATCAACATCCTCCCGGAATCATACGATTATTTACACGGTCTACCGGCGACGTTTTCCCCCATAAACCCTAAAAAGGGTGTGGTCCGAGTTTCTACAGATTTGAACATCCAATTCGAGGCGAACACGAGGTGCGGAATATCGACGGTATGGAAAGTAGGTAAGTTTGATGAATACTTGAAGCAGTATTTTGTGACGATGGGGGGAATGAAAGGGAATCCGGGGCGGGAAACAATAGAAAATTGGTTCAAAGTGGAGAAGTATGGTAAGAATTATAAGTTGGTGTATTGTCCAACAGTTTGTAAGTATTGCAAAGTAGTGTGCAAAGATGTGGGATTATTTTACAAGAATGGAAGGAGGGTTATTGCTTTGAATGATGCTCCATTCCCTGTTATGTTCAAGAAAGTTTGA

Protein sequence

MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSIGGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVWKVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDVGLFYKNGRRVIALNDAPFPVMFKKV*
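
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the function name `translate` is illustrative only. It is exercised on the first and last few codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing characters other than A/C/G/T are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First ten and last four codons of the CDS listed above.
print(translate("ATGAGAAACTTTGCATTACTGTGTTTTCTT"))  # MRNFALLCFL
print(translate("AAGAAAGTTTGA"))                    # KKV*
```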
BLAST of Csa1G043220 vs. Swiss-Prot
Match: MIRA_SYNDU (Miraculin OS=Synsepalum dulcificum PE=1 SV=3)

HSP 1 Score: 179.1 bits (453), Expect = 4.9e-44
Identity = 98/208 (47.12%), Positives = 128/208 (61.54%), Query Frame = 1

Query: 7   LCFLFI---VIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSIGGI 66
           L F F+   + A++      AD++P+ VLD DG+KLR G  YYI+ V   + GGL++   
Sbjct: 9   LSFFFVSALLAAAANPLLSAADSAPNPVLDIDGEKLRTGTNYYIVPVLRDHGGGLTVSAT 68

Query: 67  Y--GYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCG--ISTV 126
              G   CP  ++    +  H  P  F P NPK+ VVRVSTDLNI F A   C    STV
Sbjct: 69  TPNGTFVCPPRVVQTRKEVDHDRPLAFFPENPKEDVVRVSTDLNINFSAFMPCRWTSSTV 128

Query: 127 WKVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKN--YKLVYCPTVCKYCKVVC 186
           W++ K+DE   QYFVT+GG+KGNPG ETI +WFK+E++  +  YKLV+CPTVC  CKV C
Sbjct: 129 WRLDKYDESTGQYFVTIGGVKGNPGPETISSWFKIEEFCGSGFYKLVFCPTVCGSCKVKC 188

Query: 187 KDVGLFY-KNGRRVIALNDAPFPVMFKK 205
            DVG++  + GRR +AL+D PF   F K
Sbjct: 189 GDVGIYIDQKGRRRLALSDKPFAFEFNK 216
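
The percentages on each HSP summary line follow directly from the match counts and the alignment length. A minimal sketch of that arithmetic, assuming the summary line keeps the layout shown in this report (the regular expression and the function name are illustrative, and the pattern accepts both the "Positives" and "Postives" spellings):

```python
import re

# Captures "matches/alignment length" for the identity and positives fields.
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w*tives = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 98/208 (47.12%), Positives = 128/208 (61.54%), Query Frame = 1"
print(hsp_percentages(line))  # (47.12, 61.54)
```

Note that the denominator is the alignment length including gaps (208 columns here), not the length of the query protein.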

BLAST of Csa1G043220 vs. Swiss-Prot
Match: ASP_THECC (21 kDa seed protein OS=Theobroma cacao GN=ASP PE=2 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 5.7e-32
Identity = 80/209 (38.28%), Positives = 115/209 (55.02%), Query Frame = 1

Query: 1   MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYIL-SVYSRNSGGLS 60
           M+    +  L     S    F  A+A+   VLDTDG +L+ G QYY+L S+     GGL+
Sbjct: 1   MKTATAVVLLLFAFTSKSYFFGVANAANSPVLDTDGDELQTGVQYYVLSSISGAGGGGLA 60

Query: 61  IGGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFE--ANTRCGIS 120
           +G   G + CP  ++    D  +G P  FS  + K  VVRVSTD+NI+F    +  C  S
Sbjct: 61  LGRATG-QSCPEIVVQRRSDLDNGTPVIFSNADSKDDVVRVSTDVNIEFVPIRDRLCSTS 120

Query: 121 TVWKVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYG-KNYKLVYCPTVCKYCKVV 180
           TVW++  +D    +++VT  G+KG PG  T+ +WFK+EK G   YK  +CP+VC  C  +
Sbjct: 121 TVWRLDNYDNSAGKWWVTTDGVKGEPGPNTLCSWFKIEKAGVLGYKFRFCPSVCDSCTTL 180

Query: 181 CKDVGLFY-KNGRRVIALNDAPFPVMFKK 205
           C D+G     +G+  +AL+D  +  MFKK
Sbjct: 181 CSDIGRHSDDDGQIRLALSDNEWAWMFKK 208

BLAST of Csa1G043220 vs. Swiss-Prot
Match: LSPI_CARPA (Latex serine proteinase inhibitor OS=Carica papaya PE=1 SV=1)

HSP 1 Score: 103.2 bits (256), Expect = 3.4e-21
Identity = 62/183 (33.88%), Positives = 100/183 (54.64%), Query Frame = 1

Query: 27  SPDAVLDTDGKKLRAGDQYYILS-VYSRNSGGLSIGGIYGYEKCPINILPESYDYLHGLP 86
           +P  ++D DGK +  G  Y+++S ++    GGL++ G    +KCP++++ + +D  +G P
Sbjct: 2   APKPIVDIDGKPVLYGVDYFVVSAIWGAGGGGLTVYGPGNKKKCPLSVVQDPFD--NGEP 61

Query: 87  ATFSPI-NPKKGVVRVSTDLNIQFEANTRCGISTVWKVGKFDEYLKQYFVTMGGMKGNPG 146
             FS I N K  +V  S DLN++F     C  +T WKV +F   +  + VT+GG KG  G
Sbjct: 62  IIFSAIKNVKDNIVFESVDLNVKFNITINCNETTAWKVDRFPGVI-GWTVTLGGEKGYHG 121

Query: 147 RETIENWFKVEKYGK--NYKLVYCPTVCKYCKVVCKDVGLFYKNG--RRVIALNDAPFPV 204
            E+  + FK++K G   +YK  +CP+  +   + C +V +F+     RR+I  NDA   V
Sbjct: 122 FESTHSMFKIKKAGLPFSYKFHFCPSYPRTRLIPCNNVDIFFDKYRIRRLILTNDAKEFV 181

BLAST of Csa1G043220 vs. Swiss-Prot
Match: DRTI_DELRE (Kunitz-type serine protease inhibitor DrTI OS=Delonix regia PE=1 SV=1)

HSP 1 Score: 93.6 bits (231), Expect = 2.7e-18
Identity = 62/181 (34.25%), Positives = 92/181 (50.83%), Query Frame = 1

Query: 29  DAVLDTDGKKLRAGDQYYILS-VYSRNSGGLSIGGIYGYEKCPINILPESYDYLHGLPAT 88
           + V D +G  +  G +YYI+S +     GG+  G   G   CP++I+ E  D   GLP  
Sbjct: 4   EKVYDIEGYPVFLGSEYYIVSAIIGAGGGGVRPGRTRG-SMCPMSIIQEQSDLQMGLPVR 63

Query: 89  FSPINPKKGVVRVSTDLNIQFEANTRCGISTVWKVGKFDEYLKQYFVTMGGMKGNPGRET 148
           FS     +G +   T+L I+F     C  S+ W + K     +   V +GG + +P  E 
Sbjct: 64  FSSPEESQGKIYTDTELEIEFVEKPDCAESSKWVIVKDSGEAR---VAIGGSEDHPQGEL 123

Query: 149 IENWFKVEKYGK-NYKLVYCPTVCKYCKVVCKDVGLFYKNGRRVIAL---NDAPFPVMFK 205
           +  +FK+EK G   YKLV+CP   K     C D+G+ Y+ GRR + L   +D+PF V+F 
Sbjct: 124 VRGFFKIEKLGSLAYKLVFCP---KSSSGSCSDIGINYE-GRRSLVLKSSDDSPFRVVFV 176

BLAST of Csa1G043220 vs. Swiss-Prot
Match: IAAS_ORYSJ (Alpha-amylase/subtilisin inhibitor OS=Oryza sativa subsp. japonica GN=RASI PE=1 SV=2)

HSP 1 Score: 92.8 bits (229), Expect = 4.6e-18
Identity = 64/183 (34.97%), Positives = 87/183 (47.54%), Query Frame = 1

Query: 26  ASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSIGGIYGYEKCPINILPESYDYLHGLP 85
           A+P  V DT+G +L A   YY+L     + GGL++        CP+ +  E+ +   G P
Sbjct: 22  AAPPPVYDTEGHELSADGSYYVLPASPGHGGGLTMAP--RVLPCPLLVAQETDERRKGFP 81

Query: 86  ATFSPIN----PKKGVVRVSTDLNIQFEANTRCGISTVWKVGKFDEYLKQYFVTMGGMKG 145
             F+P      P+   +RVSTD+ I+F A T C  ST W VG  DE L      + G   
Sbjct: 82  VRFTPWGGAAAPEDRTIRVSTDVRIRFNAATICVQSTEWHVG--DEPLTGARRVVTGPLI 141

Query: 146 NPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDVGLFYKNGRRVIALNDAPFPVM 205
            P     EN F+VEKYG  YKLV        C+  C+D+G+     R  +  +  P  V+
Sbjct: 142 GPSPSGRENAFRVEKYGGGYKLV-------SCRDSCQDLGVSRDGARAWLGASQPPHVVV 193

BLAST of Csa1G043220 vs. TrEMBL
Match: A0A0A0LT81_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043220 PE=4 SV=1)

HSP 1 Score: 429.9 bits (1104), Expect = 1.8e-117
Identity = 205/205 (100.00%), Positives = 205/205 (100.00%), Query Frame = 1

Query: 1   MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI 60
           MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI
Sbjct: 1   MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI 60

Query: 61  GGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVW 120
           GGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVW
Sbjct: 61  GGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVW 120

Query: 121 KVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDV 180
           KVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDV
Sbjct: 121 KVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDV 180

Query: 181 GLFYKNGRRVIALNDAPFPVMFKKV 206
           GLFYKNGRRVIALNDAPFPVMFKKV
Sbjct: 181 GLFYKNGRRVIALNDAPFPVMFKKV 205

BLAST of Csa1G043220 vs. TrEMBL
Match: A0A0A0LTW2_CUCSA (Tumor-related protein OS=Cucumis sativus GN=Csa_1G043200 PE=4 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 1.0e-72
Identity = 129/207 (62.32%), Positives = 168/207 (81.16%), Query Frame = 1

Query: 1   MRNFALLC-FLFIVIASSE-VRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGL 60
           M+NF +L  FLFI++AS++ +RF  ADASP+AVLD DGKKLRAG  YYIL V+    GGL
Sbjct: 1   MKNFGILFYFLFILLASTQLIRFSTADASPEAVLDIDGKKLRAGVNYYILPVFRGRGGGL 60

Query: 61  SIGGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGIST 120
           ++G +   EKCP+N++ E  + ++G P TF P+NPKKGVVRVSTDLN+QFEA+T C  ST
Sbjct: 61  TLGNLQS-EKCPLNVVQEQLEVMNGFPTTFHPVNPKKGVVRVSTDLNVQFEASTICVTST 120

Query: 121 VWKVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCK 180
           VWK+ KFDE   Q+ VT+GG +GNPG ET++NWFK+EK+GK+YKLV+CPTVC +CKV+C+
Sbjct: 121 VWKLDKFDESTGQWLVTIGGSRGNPGVETVDNWFKIEKHGKDYKLVFCPTVCNFCKVMCR 180

Query: 181 DVGLFYKNGRRVIALNDAPFPVMFKKV 206
           D+G+F+KNG R +AL+D PFPVMFKKV
Sbjct: 181 DIGIFFKNGERALALSDTPFPVMFKKV 206

BLAST of Csa1G043220 vs. TrEMBL
Match: M5X024_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011496mg PE=4 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 2.5e-58
Identity = 109/200 (54.50%), Positives = 144/200 (72.00%), Query Frame = 1

Query: 7   LCFLFIVIA-SSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSIGGIYG 66
           L F F++ A S+++R   ADA+P  VLD  G KL+ G  YYIL V     GGL++     
Sbjct: 9   LVFCFLLFAFSAKLRSVAADAAPSPVLDITGNKLQTGVDYYILPVIRGRGGGLTLASTSN 68

Query: 67  YEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVWKVGKF 126
              CP++++ E  +  +GLP  FSP+N  KGVVRVSTDLNI+F A T C  STVWK+GKF
Sbjct: 69  KTSCPLDVVQEQNEVSNGLPLKFSPVNVTKGVVRVSTDLNIKFSATTICVQSTVWKLGKF 128

Query: 127 DEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDVGLFYK 186
           DE   Q+FVT GG++GNPGR+T  NWFK+EK+G +YKLV+CPTVC +CKV+C DVG+F++
Sbjct: 129 DEQTGQWFVTSGGVEGNPGRQTTSNWFKIEKFGDDYKLVFCPTVCNFCKVICGDVGIFFQ 188

Query: 187 NGRRVIALNDAPFPVMFKKV 206
           +G+R +AL+D PF  MFKKV
Sbjct: 189 DGKRRLALSDVPFRAMFKKV 208

BLAST of Csa1G043220 vs. TrEMBL
Match: A5AXN3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027379 PE=4 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 4.6e-57
Identity = 107/204 (52.45%), Positives = 151/204 (74.02%), Query Frame = 1

Query: 1   MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI 60
           M+  +LL  L ++  + +     A+++PD VLDT+GKKLR+G  YYIL V+    GGL++
Sbjct: 1   MKTTSLLFSLLLIALAVKPFSVAAESAPDPVLDTEGKKLRSGVDYYILPVFRGRGGGLTL 60

Query: 61  GGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVW 120
               G E CP++++ E  +  +GLP TF+P+NPKKGV+RVSTD NI+F A+T C  ST+W
Sbjct: 61  AST-GNESCPLDVVQEQQEVSNGLPLTFTPVNPKKGVIRVSTDHNIKFSASTICVQSTLW 120

Query: 121 KVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDV 180
           K+ ++DE   Q+FVT GG++GNPGRET++NWFK+EKY  +YKLV+CPTVC +CK VC D+
Sbjct: 121 KL-EYDESSGQWFVTTGGVEGNPGRETLDNWFKIEKYEDDYKLVFCPTVCDFCKPVCGDI 180

Query: 181 GLFYKNGRRVIALNDAPFPVMFKK 205
           G++ +NG R +AL+D PF VMFKK
Sbjct: 181 GIYIQNGYRRLALSDVPFKVMFKK 202

BLAST of Csa1G043220 vs. TrEMBL
Match: V4TSJ0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10023257mg PE=4 SV=1)

HSP 1 Score: 228.4 bits (581), Expect = 7.9e-57
Identity = 109/206 (52.91%), Positives = 144/206 (69.90%), Query Frame = 1

Query: 1   MRNFALLCFLFIVIASSEVRF-CRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLS 60
           MRN  +L  L ++ A        R +ASPD VLD  GK+LRAG +YYIL V     GGL+
Sbjct: 1   MRNTLVLPSLILLFAFIATPLPVRGNASPDPVLDIAGKQLRAGSKYYILPVTKGRGGGLT 60

Query: 61  IGGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTV 120
           + G    + CP++++ E + + +GLP TFSP+NPKKGVVR STDLNI+F+A T C  STV
Sbjct: 61  LAGRSNNKTCPVDVVQEQHSFRNGLPVTFSPVNPKKGVVRESTDLNIKFDAATSCAQSTV 120

Query: 121 WKVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKD 180
           WK+  FD  L Q+ VT GG++GNPG  T+ NWFK+EK+  NYKLVYCP+VC +C+ +C+D
Sbjct: 121 WKLDNFDAALGQWLVTTGGVEGNPGPRTMRNWFKIEKFFGNYKLVYCPSVCNFCRGLCRD 180

Query: 181 VGLFYKNGRRVIALNDAPFPVMFKKV 206
           VG+F   G R +AL+D PF V+FKKV
Sbjct: 181 VGIFINGGVRRLALSDVPFKVVFKKV 206

BLAST of Csa1G043220 vs. TAIR10
Match: AT1G17860.1 (AT1G17860.1 Kunitz family trypsin and protease inhibitor protein)

HSP 1 Score: 171.8 bits (434), Expect = 4.4e-43
Identity = 81/199 (40.70%), Positives = 128/199 (64.32%), Query Frame = 1

Query: 6   LLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSIGGIYG 65
           L  FL + +  S  R    +A+ + V D +GK L  G  YYIL V     GGL++  +  
Sbjct: 5   LYIFLLLAVFISH-RGVTTEAAVEPVKDINGKSLLTGVNYYILPVIRGRGGGLTMSNLKT 64

Query: 66  YEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVWKVGKF 125
            E CP +++ + ++   GLP  FSP + K   + VSTD+NI+F        +++W++  F
Sbjct: 65  -ETCPTSVIQDQFEVSQGLPVKFSPYD-KSRTIPVSTDVNIKFSP------TSIWELANF 124

Query: 126 DEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDVGLFYK 185
           DE  KQ+F++  G++GNPG++T++NWFK++K+ K+YK+ +CPTVC +CKV+C+DVG+F +
Sbjct: 125 DETTKQWFISTCGVEGNPGQKTVDNWFKIDKFEKDYKIRFCPTVCNFCKVICRDVGVFVQ 184

Query: 186 NGRRVIALNDAPFPVMFKK 205
           +G+R +AL+D P  VMFK+
Sbjct: 185 DGKRRLALSDVPLKVMFKR 194

BLAST of Csa1G043220 vs. TAIR10
Match: AT1G73325.1 (AT1G73325.1 Kunitz family trypsin and protease inhibitor protein)

HSP 1 Score: 90.1 bits (222), Expect = 1.7e-18
Identity = 67/211 (31.75%), Positives = 105/211 (49.76%), Query Frame = 1

Query: 1   MRNFALLCFLFIVIASSEVRFCRADASPD-AVLDTDGKKLRAGDQYYILSVYSRNSGGL- 60
           M    L      V+++       ADA+P   VLD  G  +++  QYYI+       GGL 
Sbjct: 1   MEKLTLSFITLTVLSAIFTAASAADATPSQVVLDIAGHPVQSNVQYYIIPAKIGTGGGLI 60

Query: 61  -SIGGIYGYEKC-PINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANT-RCG 120
            S   +   + C  ++I+  S  ++ GLP TFSP+N K   V++S  LN++F++    C 
Sbjct: 61  PSNRNLSTQDLCLNLDIVQSSSPFVSGLPVTFSPLNTKVKHVQLSASLNLEFDSTVWLCP 120

Query: 121 ISTVWKVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKV 180
            S VW++    + L++ FV++GG KG        +WF++++ G  YKL+YCP       V
Sbjct: 121 DSKVWRIDHSVQ-LRKSFVSIGGQKGKGN-----SWFQIQEDGDAYKLMYCPI---SSIV 180

Query: 181 VCKDVGLFYKNG--RRVIALNDAPFPVMFKK 205
            C +V L   +   RR++   D  F V F+K
Sbjct: 181 ACINVSLEIDDHGVRRLVLSTDQSFVVKFQK 202

BLAST of Csa1G043220 vs. NCBI nr
Match: gi|449439731|ref|XP_004137639.1| (PREDICTED: miraculin [Cucumis sativus])

HSP 1 Score: 429.9 bits (1104), Expect = 2.6e-117
Identity = 205/205 (100.00%), Positives = 205/205 (100.00%), Query Frame = 1

Query: 1   MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI 60
           MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI
Sbjct: 1   MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI 60

Query: 61  GGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVW 120
           GGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVW
Sbjct: 61  GGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVW 120

Query: 121 KVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDV 180
           KVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDV
Sbjct: 121 KVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDV 180

Query: 181 GLFYKNGRRVIALNDAPFPVMFKKV 206
           GLFYKNGRRVIALNDAPFPVMFKKV
Sbjct: 181 GLFYKNGRRVIALNDAPFPVMFKKV 205

BLAST of Csa1G043220 vs. NCBI nr
Match: gi|659066987|ref|XP_008437082.1| (PREDICTED: miraculin [Cucumis melo])

HSP 1 Score: 409.1 bits (1050), Expect = 4.7e-111
Identity = 192/205 (93.66%), Positives = 200/205 (97.56%), Query Frame = 1

Query: 1   MRNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI 60
           M+NFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI
Sbjct: 1   MKNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSI 60

Query: 61  GGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVW 120
           GGIYGYEKCPINILPESYDYL GLPATFSP+NPKKGVVRVSTDLNI+FEA+TRCGISTVW
Sbjct: 61  GGIYGYEKCPINILPESYDYLDGLPATFSPVNPKKGVVRVSTDLNIEFEASTRCGISTVW 120

Query: 121 KVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDV 180
           KVGKFD+YLKQYFVTMGG KGNPGRETI NWFKVEK+GKNYK VYCPTVCKYCKV+CKDV
Sbjct: 121 KVGKFDQYLKQYFVTMGGTKGNPGRETIGNWFKVEKHGKNYKFVYCPTVCKYCKVMCKDV 180

Query: 181 GLFYKNGRRVIALNDAPFPVMFKKV 206
           GLFYKNGRR+ ALNDAPFPVMFKKV
Sbjct: 181 GLFYKNGRRIFALNDAPFPVMFKKV 205

BLAST of Csa1G043220 vs. NCBI nr
Match: gi|659066985|ref|XP_008437071.1| (PREDICTED: miraculin-like [Cucumis melo])

HSP 1 Score: 296.6 bits (758), Expect = 3.4e-77
Identity = 146/210 (69.52%), Positives = 170/210 (80.95%), Query Frame = 1

Query: 1   MRNFALLCFLFI--VIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGG- 60
           M+ FALL FLFI  VIASSE+RFCRADASPDAVLDTDGKKLR  ++YYIL  +  + GG 
Sbjct: 1   MKKFALLSFLFIAIVIASSELRFCRADASPDAVLDTDGKKLRVSNKYYILPAFEGSGGGG 60

Query: 61  LSIGGIYG-YEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGI 120
           L+IG I   Y++C IN++ E Y+   G P TF PINPKKGVVRVSTDLNI+F+A TRC  
Sbjct: 61  LAIGNIRKEYDRCGINVVQERYEQSDGDPTTFLPINPKKGVVRVSTDLNIEFDATTRCRK 120

Query: 121 STVWKVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGK-NYKLVYCPTVCKYCKV 180
           STVWK+G FD YL+QYFVT+GG KGNPGRET ENWFK+EKYGK NYKLVYCP VCKYCKV
Sbjct: 121 STVWKLGTFDRYLRQYFVTIGGTKGNPGRETTENWFKIEKYGKGNYKLVYCPRVCKYCKV 180

Query: 181 VCKDVGLFYKNGRRVIALNDAPFPVMFKKV 206
           +CKD+G+F  NG R + L+D PFPV+FKKV
Sbjct: 181 MCKDIGIFENNGIRGLVLSDTPFPVIFKKV 210

BLAST of Csa1G043220 vs. NCBI nr
Match: gi|659066983|ref|XP_008437058.1| (PREDICTED: miraculin-like [Cucumis melo])

HSP 1 Score: 287.3 bits (734), Expect = 2.1e-74
Identity = 126/204 (61.76%), Positives = 168/204 (82.35%), Query Frame = 1

Query: 2   RNFALLCFLFIVIASSEVRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGLSIG 61
           +NF +  F+FI++AS+E+RF  ADASP+AVLD DGKKLRAG  YYIL V+    GGL++G
Sbjct: 3   KNFGIFYFIFILLASTELRFSTADASPEAVLDIDGKKLRAGVNYYILPVFRGRGGGLTLG 62

Query: 62  GIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGISTVWK 121
            +   E CP+N++ E ++ ++G P TF P+NPKKGVVRVSTDLN+QF+A+T C  STVWK
Sbjct: 63  NLQS-EICPVNVVQEQFELMNGFPTTFHPVNPKKGVVRVSTDLNVQFDASTICVTSTVWK 122

Query: 122 VGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCKDVG 181
           + KFDE   Q+FVT+GG +GNPG ET++NWFK+EK+GK+YKLV+CPTVC +CKV+C+D+G
Sbjct: 123 LDKFDESTGQWFVTIGGSRGNPGVETVDNWFKIEKHGKDYKLVFCPTVCNFCKVMCRDIG 182

Query: 182 LFYKNGRRVIALNDAPFPVMFKKV 206
           +F+KNG+R +AL+D PFPVMFKKV
Sbjct: 183 IFFKNGKRALALSDTPFPVMFKKV 205

BLAST of Csa1G043220 vs. NCBI nr
Match: gi|449439521|ref|XP_004137534.1| (PREDICTED: miraculin-like [Cucumis sativus])

HSP 1 Score: 281.2 bits (718), Expect = 1.5e-72
Identity = 129/207 (62.32%), Positives = 168/207 (81.16%), Query Frame = 1

Query: 1   MRNFALLC-FLFIVIASSE-VRFCRADASPDAVLDTDGKKLRAGDQYYILSVYSRNSGGL 60
           M+NF +L  FLFI++AS++ +RF  ADASP+AVLD DGKKLRAG  YYIL V+    GGL
Sbjct: 1   MKNFGILFYFLFILLASTQLIRFSTADASPEAVLDIDGKKLRAGVNYYILPVFRGRGGGL 60

Query: 61  SIGGIYGYEKCPINILPESYDYLHGLPATFSPINPKKGVVRVSTDLNIQFEANTRCGIST 120
           ++G +   EKCP+N++ E  + ++G P TF P+NPKKGVVRVSTDLN+QFEA+T C  ST
Sbjct: 61  TLGNLQS-EKCPLNVVQEQLEVMNGFPTTFHPVNPKKGVVRVSTDLNVQFEASTICVTST 120

Query: 121 VWKVGKFDEYLKQYFVTMGGMKGNPGRETIENWFKVEKYGKNYKLVYCPTVCKYCKVVCK 180
           VWK+ KFDE   Q+ VT+GG +GNPG ET++NWFK+EK+GK+YKLV+CPTVC +CKV+C+
Sbjct: 121 VWKLDKFDESTGQWLVTIGGSRGNPGVETVDNWFKIEKHGKDYKLVFCPTVCNFCKVMCR 180

Query: 181 DVGLFYKNGRRVIALNDAPFPVMFKKV 206
           D+G+F+KNG R +AL+D PFPVMFKKV
Sbjct: 181 DIGIFFKNGERALALSDTPFPVMFKKV 206

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
MIRA_SYNDU | 4.9e-44 | 47.12 | Miraculin OS=Synsepalum dulcificum PE=1 SV=3
ASP_THECC | 5.7e-32 | 38.28 | 21 kDa seed protein OS=Theobroma cacao GN=ASP PE=2 SV=1
LSPI_CARPA | 3.4e-21 | 33.88 | Latex serine proteinase inhibitor OS=Carica papaya PE=1 SV=1
DRTI_DELRE | 2.7e-18 | 34.25 | Kunitz-type serine protease inhibitor DrTI OS=Delonix regia PE=1 SV=1
IAAS_ORYSJ | 4.6e-18 | 34.97 | Alpha-amylase/subtilisin inhibitor OS=Oryza sativa subsp. japonica GN=RASI PE=1 ...

Match Name | E-value | Identity | Description
A0A0A0LT81_CUCSA | 1.8e-117 | 100.00 | Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043220 PE=4 SV=1
A0A0A0LTW2_CUCSA | 1.0e-72 | 62.32 | Tumor-related protein OS=Cucumis sativus GN=Csa_1G043200 PE=4 SV=1
M5X024_PRUPE | 2.5e-58 | 54.50 | Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa011496mg PE=4 SV=1
A5AXN3_VITVI | 4.6e-57 | 52.45 | Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027379 PE=4 SV=1
V4TSJ0_9ROSI | 7.9e-57 | 52.91 | Uncharacterized protein OS=Citrus clementina GN=CICLE_v10023257mg PE=4 SV=1

Match Name | E-value | Identity | Description
AT1G17860.1 | 4.4e-43 | 40.70 | Kunitz family trypsin and protease inhibitor protein
AT1G73325.1 | 1.7e-18 | 31.75 | Kunitz family trypsin and protease inhibitor protein

Match Name | E-value | Identity | Description
gi|449439731|ref|XP_004137639.1| | 2.6e-117 | 100.00 | PREDICTED: miraculin [Cucumis sativus]
gi|659066987|ref|XP_008437082.1| | 4.7e-111 | 93.66 | PREDICTED: miraculin [Cucumis melo]
gi|659066985|ref|XP_008437071.1| | 3.4e-77 | 69.52 | PREDICTED: miraculin-like [Cucumis melo]
gi|659066983|ref|XP_008437058.1| | 2.1e-74 | 61.76 | PREDICTED: miraculin-like [Cucumis melo]
gi|449439521|ref|XP_004137534.1| | 1.5e-72 | 62.32 | PREDICTED: miraculin-like [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR002160 | Prot_inh_Kunz-lg
IPR011065 | Kunitz_inhibitor_STI-like_sf
Vocabulary: Molecular Function
Term | Definition
GO:0004866 | endopeptidase inhibitor activity
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0010951 | negative regulation of endopeptidase activity
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0004866 | endopeptidase inhibitor activity

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Csa1G043220.1 | Csa1G043220.1 | mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002160 | Proteinase inhibitor I3, Kunitz legume | PRINTS | PR00291 | KUNITZINHBTR | coord: 30..59, score: 1.7E-11; coord: 151..170, score: 1.7E-11; coord: 69..89, score: 1.7
IPR002160 | Proteinase inhibitor I3, Kunitz legume | PFAM | PF00197 | Kunitz_legume | coord: 31..204, score: 1.1
IPR002160 | Proteinase inhibitor I3, Kunitz legume | SMART | SM00452 | kul_2 | coord: 30..205, score: 3.2
IPR002160 | Proteinase inhibitor I3, Kunitz legume | PROSITE | PS00283 | SOYBEAN_KUNITZ | coord: 31..47, score: ...
IPR011065 | Kunitz inhibitor ST1-like | unknown | SSF50386 | STI-like | coord: 27..205, score: 1.55
None | No IPR available | GENE3D | G3DSA:2.80.10.50 | | coord: 27..205, score: 1.1
None | No IPR available | PANTHER | PTHR33107 | FAMILY NOT NAMED | coord: 2..205, score: 5.2
None | No IPR available | PANTHER | PTHR33107:SF2 | SUBFAMILY NOT NAMED | coord: 2..205, score: 5.2

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None