Tan0003781 (gene) Snake gourd v1

Overview
Name: Tan0003781
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Location: LG01: 116119930 .. 116120649 (+)
RNA-Seq Expression: Tan0003781
Synteny: Tan0003781
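
The Location field packs the linkage group, start and end coordinates, and strand into a single string. A minimal Python sketch for unpacking it is given here. The function name `parse_location` and the regular expression are illustrative assumptions rather than part of the database's own tooling, and the coordinates are assumed to be 1-based and inclusive.

```python
import re


def parse_location(text):
    """Split a location such as 'LG01: 116119930 .. 116120649 (+)' into parts.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,  # inclusive span
    }


location = parse_location("LG01: 116119930 .. 116120649 (+)")
print(location["seqid"], location["strand"], location["length"])
```

Under the inclusive-coordinate assumption, the span of this feature is 720 bp on the plus strand of LG01.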
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAAAAACAATGGACAAAAAACAATCCAAACCCAAAAGCAAAATCATGAAGTTTCTTCCCAGAGCGGCCTCCGCCGTCACTTTCCACAACCCACCGATCAGTCCCGGACGGGAGAACCGCCCGGTCGCCGCCCGAGGATTTTCCGGTCCGATTAAGATCTCGATCATACCGCAAGAAGCGCGGTCCAAATCGAATAAATCGGGATTCGAGACGCCGGAGCCCACTTCGCCGAAAGTTTCGTGCATTGGCCAAATCAAGCACAAGAAGAAGTTGAAGGAATTGGCGAAGAGTACTGCGGCGGCGGTTGAAGTTTCCGAGCCGAAGAAGCGGCATCCGCCGTCGTCGATTAAGAGAATTTTGACCAGCGGGAAAGTACTCGGGAGTGCGAAATCGAACGTTGCCGCAGCTCCGGCGGGGAATCGCCGTAGTGGTGGGAAGCCGCCGCGGCCGGAGAGAGCGCCGGGGTTGAATCAGATGAAGCGATTCTCGAGTGGACGTGGCGCTTTGGCGAATTTTGATTGGACGGCGCAGATTGCGCCGGCTGACGTGGAAGGAGAGGATGCGGAGGGCGGCGGCCGGAGTCCGGTATCGGAGGGGGTGTGGATTGGAGAAGAAGTGGGCCCACTGCAACCGAGAAAGGAGGTGAATATATGGAAGAGAAGAACCGTCGTTCCGCCAACACCTCTTCAACTCAACTCCACCATGGTCAAACAAAAATGA

mRNA sequence

AGAAAAACAATGGACAAAAAACAATCCAAACCCAAAAGCAAAATCATGAAGTTTCTTCCCAGAGCGGCCTCCGCCGTCACTTTCCACAACCCACCGATCAGTCCCGGACGGGAGAACCGCCCGGTCGCCGCCCGAGGATTTTCCGGTCCGATTAAGATCTCGATCATACCGCAAGAAGCGCGGTCCAAATCGAATAAATCGGGATTCGAGACGCCGGAGCCCACTTCGCCGAAAGTTTCGTGCATTGGCCAAATCAAGCACAAGAAGAAGTTGAAGGAATTGGCGAAGAGTACTGCGGCGGCGGTTGAAGTTTCCGAGCCGAAGAAGCGGCATCCGCCGTCGTCGATTAAGAGAATTTTGACCAGCGGGAAAGTACTCGGGAGTGCGAAATCGAACGTTGCCGCAGCTCCGGCGGGGAATCGCCGTAGTGGTGGGAAGCCGCCGCGGCCGGAGAGAGCGCCGGGGTTGAATCAGATGAAGCGATTCTCGAGTGGACGTGGCGCTTTGGCGAATTTTGATTGGACGGCGCAGATTGCGCCGGCTGACGTGGAAGGAGAGGATGCGGAGGGCGGCGGCCGGAGTCCGGTATCGGAGGGGGTGTGGATTGGAGAAGAAGTGGGCCCACTGCAACCGAGAAAGGAGGTGAATATATGGAAGAGAAGAACCGTCGTTCCGCCAACACCTCTTCAACTCAACTCCACCATGGTCAAACAAAAATGA

Coding sequence (CDS)

ATGGACAAAAAACAATCCAAACCCAAAAGCAAAATCATGAAGTTTCTTCCCAGAGCGGCCTCCGCCGTCACTTTCCACAACCCACCGATCAGTCCCGGACGGGAGAACCGCCCGGTCGCCGCCCGAGGATTTTCCGGTCCGATTAAGATCTCGATCATACCGCAAGAAGCGCGGTCCAAATCGAATAAATCGGGATTCGAGACGCCGGAGCCCACTTCGCCGAAAGTTTCGTGCATTGGCCAAATCAAGCACAAGAAGAAGTTGAAGGAATTGGCGAAGAGTACTGCGGCGGCGGTTGAAGTTTCCGAGCCGAAGAAGCGGCATCCGCCGTCGTCGATTAAGAGAATTTTGACCAGCGGGAAAGTACTCGGGAGTGCGAAATCGAACGTTGCCGCAGCTCCGGCGGGGAATCGCCGTAGTGGTGGGAAGCCGCCGCGGCCGGAGAGAGCGCCGGGGTTGAATCAGATGAAGCGATTCTCGAGTGGACGTGGCGCTTTGGCGAATTTTGATTGGACGGCGCAGATTGCGCCGGCTGACGTGGAAGGAGAGGATGCGGAGGGCGGCGGCCGGAGTCCGGTATCGGAGGGGGTGTGGATTGGAGAAGAAGTGGGCCCACTGCAACCGAGAAAGGAGGTGAATATATGGAAGAGAAGAACCGTCGTTCCGCCAACACCTCTTCAACTCAACTCCACCATGGTCAAACAAAAATGA

Protein sequence

MDKKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSKSNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSGKVLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADVEGEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK
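
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS with the standard genetic code. The sketch below is a minimal, dependency-free illustration. The helper name `translate` is an assumption made for this example, and only the first 24 nucleotides of the CDS are used to keep it short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
cds_prefix = "ATGGACAAAAAACAATCCAAACCC"
print(translate(cds_prefix))  # MDKKQSKP, the first eight residues of the protein
```

Passing the full CDS string to `translate` in the same way allows the result to be compared against the complete protein sequence.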
Homology
BLAST of Tan0003781 vs. ExPASy Swiss-Prot
Match: Q9SGS5 (Uncharacterized protein At1g76070 OS=Arabidopsis thaliana OX=3702 GN=At1g76070 PE=1 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 3.8e-19
Identity = 90/262 (34.35%), Positives = 124/262 (47.33%), Query Frame = 0

Query: 7   KPKSKIMKFLPRAASAVTFHNPPISPGRE-------NRPVAAR-GFSGPIKISIIPQEAR 66
           K K+K++K LP+A S      PP SPGR+       N   A +  FSGP+ + ++P  AR
Sbjct: 13  KNKNKLLKMLPKAMS-FGHRVPPFSPGRDLHHNNHHNYTAANKMFFSGPM-VPLVPNAAR 72

Query: 67  SKSNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILT 126
            + NKS     EPTSPKVSCIGQIK  K      K   A   +     +   SS+ +   
Sbjct: 73  VRRNKSDAVWDEPTSPKVSCIGQIKLGKSKCPTGKKNKAPSSLIPKISKTSTSSLTKEDE 132

Query: 127 SGKVLGSAKSNVAAAPAGNRRSGGK--PPRPERA-------------PGLNQMKRFSSGR 186
            G+ L   KS  + +PA  R +  K  P     A             P L QMK+F+S R
Sbjct: 133 KGR-LSKIKSIFSFSPASGRNTSRKSHPTAVSAADEHPVTVVSTAAVPSLGQMKKFASSR 192

Query: 187 GALANFDWTAQI-----APAD----VEGEDAEGGGRSPVSEGVWIGEEVGP------LQP 231
            AL +FDW  ++     +PAD       +D   G      +     + + P      L+P
Sbjct: 193 DALGDFDWAVEMKHEEESPADHHRGYYSDDDTRGAYLRYDDDEDEDDIIIPFSAPLGLKP 252
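
The percentages in each HSP header are the match counts divided by the alignment length. The following Python sketch recomputes them from a header line. The function name `recompute_percentages` and the regular expression are assumptions made for illustration and are not part of BLAST itself.

```python
import re


def recompute_percentages(header_line):
    """Recompute 'count/length' percentages found in an HSP header line."""
    result = {}
    # Matches fields of the form 'Name = hits/length'; other fields are skipped.
    for name, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", header_line):
        result[name] = round(100 * int(hits) / int(length), 2)
    return result


line = "Identity = 90/262 (34.35%), Positives = 124/262 (47.33%), Query Frame = 0"
print(recompute_percentages(line))
```

For the Swiss-Prot hit above, 90 identical positions over an alignment length of 262 gives 34.35%, which agrees with the reported value.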

BLAST of Tan0003781 vs. NCBI nr
Match: KAG6591250.1 (hypothetical protein SDJN03_13596, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 360.5 bits (924), Expect = 1.1e-95
Identity = 192/236 (81.36%), Positives = 205/236 (86.86%), Query Frame = 0

Query: 1   MDKKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSK 60
           M  KQS PKSKIMKFLPRAASA+TFHNPP+SPGRENRPVAARGFSGP+KISIIP+EARSK
Sbjct: 1   MMDKQSNPKSKIMKFLPRAASAITFHNPPVSPGRENRPVAARGFSGPMKISIIPREARSK 60

Query: 61  SNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSG 120
           SN SGFET EPTSPKVSCIGQIKHKKK+KELAK TAAAVE+SEPKKRH PS+I+RIL   
Sbjct: 61  SNNSGFETSEPTSPKVSCIGQIKHKKKMKELAKITAAAVEISEPKKRHLPSAIERILIGR 120

Query: 121 KVLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADV 180
           KVLG AKSNV AA      S GKPP PERA GLNQMKRFSSGRGAL NFDWTAQIAPA+V
Sbjct: 121 KVLGRAKSNVEAA------SAGKPPLPERALGLNQMKRFSSGRGALTNFDWTAQIAPAEV 180

Query: 181 EGEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK 237
           EGE+  GGG S  S  VWIGEE+  LQPRKEVNIWKRRTVVPPTPLQL+STMV+QK
Sbjct: 181 EGEEG-GGGGSSASTAVWIGEEIVTLQPRKEVNIWKRRTVVPPTPLQLHSTMVRQK 229

BLAST of Tan0003781 vs. NCBI nr
Match: XP_022937423.1 (uncharacterized protein At1g76070 [Cucurbita moschata])

HSP 1 Score: 359.0 bits (920), Expect = 3.1e-95
Identity = 191/236 (80.93%), Positives = 205/236 (86.86%), Query Frame = 0

Query: 1   MDKKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSK 60
           M  KQS PKSKIMKFLPRAASA+TFHNPP+SPGRENRPVAARGFSGP+KISIIP+EARSK
Sbjct: 1   MMDKQSNPKSKIMKFLPRAASAITFHNPPVSPGRENRPVAARGFSGPMKISIIPREARSK 60

Query: 61  SNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSG 120
           SN SGFET EPTSPKVSCIGQIKHKKK+KELAK TAAAVE+SEPKKRH PS+I+RIL   
Sbjct: 61  SNNSGFETSEPTSPKVSCIGQIKHKKKMKELAKITAAAVEISEPKKRHLPSAIERILIGR 120

Query: 121 KVLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADV 180
           KVLG AKSNV AA      S GKPP PERA GLNQMKRFSSGRGAL NFDWTAQIAPA+V
Sbjct: 121 KVLGRAKSNVEAA------SAGKPPLPERALGLNQMKRFSSGRGALTNFDWTAQIAPAEV 180

Query: 181 EGEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK 237
           EGE+  GGG S  S  VWIGEE+  L+PRKEVNIWKRRTVVPPTPLQL+STMV+QK
Sbjct: 181 EGEEG-GGGGSRASTAVWIGEEIVTLEPRKEVNIWKRRTVVPPTPLQLHSTMVRQK 229

BLAST of Tan0003781 vs. NCBI nr
Match: KAG7024133.1 (hypothetical protein SDJN02_12946, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 358.6 bits (919), Expect = 4.1e-95
Identity = 190/236 (80.51%), Positives = 205/236 (86.86%), Query Frame = 0

Query: 1   MDKKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSK 60
           M  KQS PKSKIMKFLPRAASA+TFHNPP+SPGRENRPVAARGFSGP+KISIIP+EARSK
Sbjct: 1   MMDKQSNPKSKIMKFLPRAASAITFHNPPVSPGRENRPVAARGFSGPMKISIIPREARSK 60

Query: 61  SNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSG 120
           SN SGFET EPTSPKVSCIGQIKHKKK+KELAK TAAAVE+SEPKKRH PS+I+RIL   
Sbjct: 61  SNNSGFETSEPTSPKVSCIGQIKHKKKMKELAKITAAAVEISEPKKRHLPSAIERILIGR 120

Query: 121 KVLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADV 180
           KVLG AKSN+ A       S GKPP PERA GLNQMKRFSSGRGAL NFDWTAQIAPA+V
Sbjct: 121 KVLGRAKSNMEAV------SAGKPPLPERALGLNQMKRFSSGRGALTNFDWTAQIAPAEV 180

Query: 181 EGEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK 237
           EGE+A GGG S  S  VWIGEE+  LQPRKEVNIWKRRT+VPPTPLQL+STMV+QK
Sbjct: 181 EGEEA-GGGGSRASTAVWIGEEIVTLQPRKEVNIWKRRTIVPPTPLQLHSTMVRQK 229

BLAST of Tan0003781 vs. NCBI nr
Match: XP_022976337.1 (uncharacterized protein At1g76070 [Cucurbita maxima])

HSP 1 Score: 356.7 bits (914), Expect = 1.6e-94
Identity = 189/233 (81.12%), Positives = 203/233 (87.12%), Query Frame = 0

Query: 1   MDKKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSK 60
           M  KQS PKSKIMKFLPRAASA+TFHNPP+SPGRENRPVAARGFSGP+KISIIP+EARSK
Sbjct: 1   MMDKQSNPKSKIMKFLPRAASAITFHNPPVSPGRENRPVAARGFSGPMKISIIPREARSK 60

Query: 61  SNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSG 120
           S  S FE+PEPTSPKVSCIGQIKHKKK+K+LAK TAAAVE+SEPKKRH PS+I+RIL   
Sbjct: 61  SKNSEFESPEPTSPKVSCIGQIKHKKKMKDLAKITAAAVEISEPKKRHLPSAIERILIGR 120

Query: 121 KVLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADV 180
           KVLG AKSNV AA      S GKPP PERAPGLNQMKRFSSGRGALANFDWTAQIAPA+V
Sbjct: 121 KVLGRAKSNVEAA------SAGKPPLPERAPGLNQMKRFSSGRGALANFDWTAQIAPAEV 180

Query: 181 EGEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMV 234
           EGE+  GGG S  S  VWIGEE+  LQPRKEVNIWKRRTVVPPTPLQL+STMV
Sbjct: 181 EGEEG-GGGGSRASTAVWIGEEIVTLQPRKEVNIWKRRTVVPPTPLQLHSTMV 226

BLAST of Tan0003781 vs. NCBI nr
Match: XP_038877727.1 (uncharacterized protein At1g76070-like [Benincasa hispida])

HSP 1 Score: 353.2 bits (905), Expect = 1.7e-93
Identity = 190/236 (80.51%), Positives = 199/236 (84.32%), Query Frame = 0

Query: 1   MDKKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSK 60
           M +K SKPKS+ MKFLPRAASAV F+NPP+SPGRE RP+  RGFSGP+ ISIIP+EARSK
Sbjct: 1   MMEKHSKPKSQFMKFLPRAASAVNFNNPPVSPGRETRPLPGRGFSGPMMISIIPREARSK 60

Query: 61  SNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSG 120
           SN SGFETPEPTSPKVSCIGQIKHKKKLK+LAK  AA V   E KKRHPPS IKRILT G
Sbjct: 61  SNNSGFETPEPTSPKVSCIGQIKHKKKLKDLAKKAAATV---ESKKRHPPSGIKRILTGG 120

Query: 121 KVLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADV 180
           KVLG AKSNVAAA        GKPP PERAPGLNQMKRFSSGRGALANFDWTAQIAP DV
Sbjct: 121 KVLGRAKSNVAAA-----APSGKPPLPERAPGLNQMKRFSSGRGALANFDWTAQIAPDDV 180

Query: 181 EGEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK 237
           EGE+A GGGR      VWI EEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK
Sbjct: 181 EGEEAGGGGR----RTVWIDEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK 224

BLAST of Tan0003781 vs. ExPASy TrEMBL
Match: A0A6J1FG08 (uncharacterized protein At1g76070 OS=Cucurbita moschata OX=3662 GN=LOC111443719 PE=4 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 1.5e-95
Identity = 191/236 (80.93%), Positives = 205/236 (86.86%), Query Frame = 0

Query: 1   MDKKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSK 60
           M  KQS PKSKIMKFLPRAASA+TFHNPP+SPGRENRPVAARGFSGP+KISIIP+EARSK
Sbjct: 1   MMDKQSNPKSKIMKFLPRAASAITFHNPPVSPGRENRPVAARGFSGPMKISIIPREARSK 60

Query: 61  SNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSG 120
           SN SGFET EPTSPKVSCIGQIKHKKK+KELAK TAAAVE+SEPKKRH PS+I+RIL   
Sbjct: 61  SNNSGFETSEPTSPKVSCIGQIKHKKKMKELAKITAAAVEISEPKKRHLPSAIERILIGR 120

Query: 121 KVLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADV 180
           KVLG AKSNV AA      S GKPP PERA GLNQMKRFSSGRGAL NFDWTAQIAPA+V
Sbjct: 121 KVLGRAKSNVEAA------SAGKPPLPERALGLNQMKRFSSGRGALTNFDWTAQIAPAEV 180

Query: 181 EGEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK 237
           EGE+  GGG S  S  VWIGEE+  L+PRKEVNIWKRRTVVPPTPLQL+STMV+QK
Sbjct: 181 EGEEG-GGGGSRASTAVWIGEEIVTLEPRKEVNIWKRRTVVPPTPLQLHSTMVRQK 229

BLAST of Tan0003781 vs. ExPASy TrEMBL
Match: A0A6J1IFG4 (uncharacterized protein At1g76070 OS=Cucurbita maxima OX=3661 GN=LOC111476771 PE=4 SV=1)

HSP 1 Score: 356.7 bits (914), Expect = 7.6e-95
Identity = 189/233 (81.12%), Positives = 203/233 (87.12%), Query Frame = 0

Query: 1   MDKKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSK 60
           M  KQS PKSKIMKFLPRAASA+TFHNPP+SPGRENRPVAARGFSGP+KISIIP+EARSK
Sbjct: 1   MMDKQSNPKSKIMKFLPRAASAITFHNPPVSPGRENRPVAARGFSGPMKISIIPREARSK 60

Query: 61  SNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSG 120
           S  S FE+PEPTSPKVSCIGQIKHKKK+K+LAK TAAAVE+SEPKKRH PS+I+RIL   
Sbjct: 61  SKNSEFESPEPTSPKVSCIGQIKHKKKMKDLAKITAAAVEISEPKKRHLPSAIERILIGR 120

Query: 121 KVLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADV 180
           KVLG AKSNV AA      S GKPP PERAPGLNQMKRFSSGRGALANFDWTAQIAPA+V
Sbjct: 121 KVLGRAKSNVEAA------SAGKPPLPERAPGLNQMKRFSSGRGALANFDWTAQIAPAEV 180

Query: 181 EGEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMV 234
           EGE+  GGG S  S  VWIGEE+  LQPRKEVNIWKRRTVVPPTPLQL+STMV
Sbjct: 181 EGEEG-GGGGSRASTAVWIGEEIVTLQPRKEVNIWKRRTVVPPTPLQLHSTMV 226

BLAST of Tan0003781 vs. ExPASy TrEMBL
Match: A0A6J1FQ15 (uncharacterized protein At1g76070-like OS=Cucurbita moschata OX=3662 GN=LOC111446210 PE=4 SV=1)

HSP 1 Score: 336.3 bits (861), Expect = 1.1e-88
Identity = 173/226 (76.55%), Positives = 192/226 (84.96%), Query Frame = 0

Query: 7   KPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSKSNKSGF 66
           +PK KIMK  PRAASA+ FHNPP SPGRENR VA RGFSGP+KISIIP+EA+SKSN SG 
Sbjct: 78  EPKGKIMKLFPRAASALIFHNPPKSPGRENRQVAGRGFSGPVKISIIPKEAQSKSNSSGL 137

Query: 67  ETPEPTSPKVSCIGQIKHKKKLKELAK-STAAAVEVSEPKKRHPPSSIKRILTSGKVLGS 126
           ET EPTSPKVSCIGQIKHKKK+KE+A+ + AAA ++S+PKKRHPPS IKRILT GK+LG 
Sbjct: 138 ETLEPTSPKVSCIGQIKHKKKMKEIARNAAAAAAKISQPKKRHPPSVIKRILTGGKILGR 197

Query: 127 AKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADVEGEDA 186
           AKSN  AAP+ N R GGKPPR      LNQMKRFSSGR  LA+FDWTAQI PAD+EGE+ 
Sbjct: 198 AKSNHVAAPSRNHRGGGKPPR------LNQMKRFSSGRDTLASFDWTAQIVPADLEGEEP 257

Query: 187 EGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNST 232
           EG GRSPVSE VWIGE+VGPLQPRKEVNIWKRRT VPPTPLQLNS+
Sbjct: 258 EGDGRSPVSETVWIGEDVGPLQPRKEVNIWKRRTAVPPTPLQLNSS 297

BLAST of Tan0003781 vs. ExPASy TrEMBL
Match: A0A6J1J4Y0 (uncharacterized protein At1g76070-like OS=Cucurbita maxima OX=3661 GN=LOC111481291 PE=4 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 2.4e-88
Identity = 169/220 (76.82%), Positives = 189/220 (85.91%), Query Frame = 0

Query: 13  MKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEARSKSNKSGFETPEPT 72
           MK  PRAASA  FHNP  SPGRENR VA RGFS P+KISIIP+EA+SKSN SGF+TPEPT
Sbjct: 1   MKLFPRAASAFIFHNPLKSPGRENRQVAGRGFSSPVKISIIPKEAQSKSNSSGFQTPEPT 60

Query: 73  SPKVSCIGQIKHKKKLKELAK-STAAAVEVSEPKKRHPPSSIKRILTSGKVLGSAKSNVA 132
           SPKVSCIGQIKHKKK+KE+A+ + AAA ++S+PKKRHPPS IKRILT GK+LG AKSN  
Sbjct: 61  SPKVSCIGQIKHKKKMKEIARNAAAAAAKISQPKKRHPPSVIKRILTGGKILGRAKSNDV 120

Query: 133 AAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADVEGEDAEGGGRS 192
           AAP+ N R GGKPP P++AP LNQMKRFSSGR  LA+FDWTAQI PAD+EGE+ EG  RS
Sbjct: 121 AAPSRNHRGGGKPPLPKKAPRLNQMKRFSSGRDTLASFDWTAQIVPADLEGEETEGDWRS 180

Query: 193 PVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNST 232
           PVSE VWIGE+VGPLQPRKEVNIWKRRT VPPTPLQLNS+
Sbjct: 181 PVSEAVWIGEDVGPLQPRKEVNIWKRRTAVPPTPLQLNSS 220

BLAST of Tan0003781 vs. ExPASy TrEMBL
Match: A0A0A0LDM5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G875410 PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 3.2e-85
Identity = 179/235 (76.17%), Positives = 189/235 (80.43%), Query Frame = 0

Query: 3   KKQSKPKSKIMKFLPRAASAVTFHNPPISPGRENRPVAARGFSGPIKISIIPQEAR-SKS 62
           +K S PKSK MKFLPRAASAV FHNPP+SPGRE RP+A RGFSGP+  SIIP+EAR +KS
Sbjct: 5   EKHSNPKSKFMKFLPRAASAVNFHNPPVSPGRETRPLAGRGFSGPMNFSIIPREARITKS 64

Query: 63  NKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILTSGK 122
           N SGFETPEPTSPKVSCIGQIKHKKKLK +AK+ AAA    E K R PPS IKRI T GK
Sbjct: 65  NNSGFETPEPTSPKVSCIGQIKHKKKLKGMAKTAAAA--TVESKNRLPPSRIKRIFTGGK 124

Query: 123 VLGSAKSNVAAAPAGNRRSGGKPPRPERAPGLNQMKRFSSGRGALANFDWTAQIAPADVE 182
           VLG AKSNVA A A      GKPP PERAPGLNQMKRFSSGRGALANFDWTAQIAP D  
Sbjct: 125 VLGRAKSNVAGAAA----QSGKPPLPERAPGLNQMKRFSSGRGALANFDWTAQIAPED-- 184

Query: 183 GEDAEGGGRSPVSEGVWIGEEVGPLQPRKEVNIWKRRTVVPPTPLQLNSTMVKQK 237
             + EG GR      VWIGEEVGP QPRKEVNIWKRRTVVPPTPLQLNST+VKQK
Sbjct: 185 --EVEGEGR----RTVWIGEEVGPFQPRKEVNIWKRRTVVPPTPLQLNSTIVKQK 225

BLAST of Tan0003781 vs. TAIR 10
Match: AT1G76070.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 8 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G20310.1); Has 66 Blast hits to 66 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 64; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 96.7 bits (239), Expect = 2.7e-20
Identity = 90/262 (34.35%), Positives = 124/262 (47.33%), Query Frame = 0

Query: 7   KPKSKIMKFLPRAASAVTFHNPPISPGRE-------NRPVAAR-GFSGPIKISIIPQEAR 66
           K K+K++K LP+A S      PP SPGR+       N   A +  FSGP+ + ++P  AR
Sbjct: 13  KNKNKLLKMLPKAMS-FGHRVPPFSPGRDLHHNNHHNYTAANKMFFSGPM-VPLVPNAAR 72

Query: 67  SKSNKSGFETPEPTSPKVSCIGQIKHKKKLKELAKSTAAAVEVSEPKKRHPPSSIKRILT 126
            + NKS     EPTSPKVSCIGQIK  K      K   A   +     +   SS+ +   
Sbjct: 73  VRRNKSDAVWDEPTSPKVSCIGQIKLGKSKCPTGKKNKAPSSLIPKISKTSTSSLTKEDE 132

Query: 127 SGKVLGSAKSNVAAAPAGNRRSGGK--PPRPERA-------------PGLNQMKRFSSGR 186
            G+ L   KS  + +PA  R +  K  P     A             P L QMK+F+S R
Sbjct: 133 KGR-LSKIKSIFSFSPASGRNTSRKSHPTAVSAADEHPVTVVSTAAVPSLGQMKKFASSR 192

Query: 187 GALANFDWTAQI-----APAD----VEGEDAEGGGRSPVSEGVWIGEEVGP------LQP 231
            AL +FDW  ++     +PAD       +D   G      +     + + P      L+P
Sbjct: 193 DALGDFDWAVEMKHEEESPADHHRGYYSDDDTRGAYLRYDDDEDEDDIIIPFSAPLGLKP 252

BLAST of Tan0003781 vs. TAIR 10
Match: AT1G20310.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G76070.1); Has 46 Blast hits to 46 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 46; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 75.5 bits (184), Expect = 6.5e-14
Identity = 89/252 (35.32%), Positives = 118/252 (46.83%), Query Frame = 0

Query: 1   MDKKQSKPKS-KIMKFLPRAASAVTFHNPPISPGRE-----NRPVAARG--FSGPIKISI 60
           M+K  +  KS K  K L RA S +     P SP R+         A RG  FS P    +
Sbjct: 1   MEKASNSNKSTKFSKMLQRAMS-IGHSAAPFSPRRDFHQHRTTSTANRGIFFSSP----L 60

Query: 61  IPQEARSKSN-KSGFETPEPTSPKVSCIGQI---------KHKKKLKELAKSTAAAVEVS 120
           +P  AR + N K      EPTSPKVSCIGQ+         K  K  K L  +++ +  V 
Sbjct: 61  VPTAARVRRNTKYEAVFAEPTSPKVSCIGQVKLARPKCPEKKNKAPKNLKTASSLSSCVI 120

Query: 121 EPKKRHPPSSIKRILTSGKVLGSAKSN--VAAAPAGNRRSGGKPPRPERAPGLNQMKRFS 180
           + +     S +KRI  S +   S KSN    AA A       +      AP L  MK+F+
Sbjct: 121 KEEDNGSFSKLKRIF-SMRSYPSRKSNSTAFAAAAAREHPIAEVDAVTAAPSLGAMKKFA 180

Query: 181 SGRGALANFDWTAQIAPADVEGEDAEGGGRSPVSEGVWIGE----EVGPLQPRKEVNIWK 229
           S R AL  FDWT Q+     E ED       P S G+ + +     + P +P+ EVN+WK
Sbjct: 181 SSREALGGFDWTVQMKR---EKEDV----MIPCSVGIPLTQLEDLSLCP-KPKSEVNLWK 238

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
Q9SGS5          3.8e-19   34.35     Uncharacterized protein At1g76070 OS=Arabidopsis thaliana OX=3702 GN=At1g76070 P...

NCBI nr
Match Name      E-value   Identity  Description
KAG6591250.1    1.1e-95   81.36     hypothetical protein SDJN03_13596, partial [Cucurbita argyrosperma subsp. sorori...
XP_022937423.1  3.1e-95   80.93     uncharacterized protein At1g76070 [Cucurbita moschata]
KAG7024133.1    4.1e-95   80.51     hypothetical protein SDJN02_12946, partial [Cucurbita argyrosperma subsp. argyro...
XP_022976337.1  1.6e-94   81.12     uncharacterized protein At1g76070 [Cucurbita maxima]
XP_038877727.1  1.7e-93   80.51     uncharacterized protein At1g76070-like [Benincasa hispida]

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1FG08      1.5e-95   80.93     uncharacterized protein At1g76070 OS=Cucurbita moschata OX=3662 GN=LOC111443719 ...
A0A6J1IFG4      7.6e-95   81.12     uncharacterized protein At1g76070 OS=Cucurbita maxima OX=3661 GN=LOC111476771 PE...
A0A6J1FQ15      1.1e-88   76.55     uncharacterized protein At1g76070-like OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A6J1J4Y0      2.4e-88   76.82     uncharacterized protein At1g76070-like OS=Cucurbita maxima OX=3661 GN=LOC1114812...
A0A0A0LDM5      3.2e-85   76.17     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G875410 PE=4 SV=1

TAIR 10
Match Name      E-value   Identity  Description
AT1G76070.1     2.7e-20   34.35     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G20310.1     6.5e-14   35.32     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source       Source Term  Source Description   Alignment
None       No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 177..202
None       No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 96..161
None       No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 54..77
None       No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 55..75
IPR038796  At1g76070-like    PANTHER      PTHR34779    OS09G0542900 PROTEIN coord: 1..230

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0003781.1  Tan0003781.1  mRNA