Tan0022345 (gene) Snake gourd v1

Overview
NameTan0022345
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-directed RNA polymerase II subunit 1-like
LocationLG10: 1136279 .. 1138531 (-)
RNA-Seq ExpressionTan0022345
SyntenyTan0022345
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATCATTTTAAAATTTTTAATTTTAATTATTATAAACAAACTCTCTTTTCTTTATTATTATGGCAAATCTTCCTCGTTTTGGCCGTACATGGCAACGTTTCTCTTCTCTACCTCGCCCCGGAACTGCCTCACGGCCTGAAATAATTCAGCCGGCGACGGTGACGACGGAGCCAGAAGTTTTTCCCTCTGCTCCAACAACCACTAATATTCTTCAAACTTCCCCTATTAGAGAAAGATCATCTCCGGTGAGGAAATTTCCCTCGCCGCCTACTTCTCCAAAATACAGGACCGCCGCCACAATCACTTCGCCTCGTAAGCCGTTATCTCCAACTCCGACATACAACCGCTACGACGGCGAACGACGGACCAGTGCCCCTTCATCTCCAAAAACCTTCAAACCAACTCCCAGCTCTCCGCCACTGTCGCCGGCCAAGCATAAATATTCAACCACCACTACCGCCGCACCACCTTCTCCTCTTCTGCCACGCTCGGAGCCAAGACATGAACCTGAGCAAATGATTCGATCCAGGAGCCCACCGGAGGTTAGTTCTGACCATTTATAGTTTATAGGAGTTGAATTAGACTTATTTTGACAATATAATCACCCTTTAAGTAGCACAAATTATTCACTCAGGGCGTCACAAATTATTCAATATGGCTGTCGCATGATAACCGTTGGATGTGAATTCGATAATTTTAATACTTCCTAATATCCCTTGGATACAAATTCAGAGCACTTCTCTCTCTCTATCCAACGGTGAAAAATTACCGGCTACATTGTGGCAAGCTCACCGAATAATTTCACCTTAAGTAGGGGTATACAACTAATTGATAACCCGAGCAACGTGGATTATCGATTCTTGAGTTGGATTTTTTTATACTCTGGTTAACCCAATTCAACTTAAAATTTATATTTTTTTAATTTTATTTTAATTAAATATATACTTATAAATAAATTATTTGTCAATTTTTTTTAAAATTCTAACCAAAGTTAAAATGTTATGAATTTTGAAATTTGATATTTGAATTTTATGTGATTTTAGTCATTAATTCAAGAAAAAATATTTTAAATAATTCAAATAATTATTAAAAAATAAAAAAGTTAACCTAATGACCCAACCCGAAAATATTGGGTTAGGTTGGGTTACTCTCATGAACAGGGTTGGTTGGATTGAAGGAATAACCAACCCAAATTTTCGGATTGGTTCAAAAATCTCCCCCAACCCAACTCCAACCCGATTTATGTACACTCCTATCTTTAAGTTCAATATAATCACCCTTAAGTTTGTTGATAAATTATTAGAAGTGATGCTTGAGCAGCATTTAAGTATGTGCAACCCCATTGATGATCACATATTTTTTTCTTTTTTAAAAAAAAACATTTTTTTGTGTGGCTATCAATGGGCTACACACCACTTGGTCGTATAGCAACATCTAAGTATTTTTCTCATATGTATAAACATTTGACCATCAATGGGCTACACACCACTTGGTATGGTACCAGCAACATACAATTTTTTAAAGTGAAGTTTTGTTTTAAAATCTATATATACATATATATATATATATATGTGACGGCTTAAACTATTAATTAATTAGTTGGGGATTGAGAAATAAGATTGATATATATACTTAAAACATATTGTTAAGCAGGTGCAGCAGAAAACCATCCTTTACGAAAAGACGACGACGGCGGAGAAGCCTGCGAAAACCGACCACCGAGCATCGGAGTACAGCTCCGGCAAACCCCAGCAGAAGCAGAAGCAGCAGCAGCAGCAAAGTGATGTTATAAATATGAAAGGCGAGAATGTGGGGGCCGTCATGCACATCACTCAATCGTCTGATGGAACAGAAATTCATAAGAAAAAGCCAATCAACGAAAATGCTGAAAAAACAAACAAATCAAGCTCTAATTTGCCACCAAAATCGTTCATGAACAGCAATTTTCAAGGCGTCAACAATTCCATTCTCTACAATTCGTCTTTGAGTCATCGTGATCCTGGCCTGCACCTTTCTTACTCCAACAAGCCCACCCATGGCGATTCTCTTCATCATTGATCTTCAAGGCTTTCATATATATATGCTAAAATAATTATGAATAAAGGAACAACATCAATATTATATATGATAATATATATATATATAATTAAATTAAACCATTTTATATATTCATATATATGATTCATATATGGGATGTATTTTCATATGCTCTTACTATTTGGAAAATGAGTTATTAATAATAAAAGCTTTTAC

mRNA sequence

AATCATTTTAAAATTTTTAATTTTAATTATTATAAACAAACTCTCTTTTCTTTATTATTATGGCAAATCTTCCTCGTTTTGGCCGTACATGGCAACGTTTCTCTTCTCTACCTCGCCCCGGAACTGCCTCACGGCCTGAAATAATTCAGCCGGCGACGGTGACGACGGAGCCAGAAGTTTTTCCCTCTGCTCCAACAACCACTAATATTCTTCAAACTTCCCCTATTAGAGAAAGATCATCTCCGGTGAGGAAATTTCCCTCGCCGCCTACTTCTCCAAAATACAGGACCGCCGCCACAATCACTTCGCCTCGTAAGCCGTTATCTCCAACTCCGACATACAACCGCTACGACGGCGAACGACGGACCAGTGCCCCTTCATCTCCAAAAACCTTCAAACCAACTCCCAGCTCTCCGCCACTGTCGCCGGCCAAGCATAAATATTCAACCACCACTACCGCCGCACCACCTTCTCCTCTTCTGCCACGCTCGGAGCCAAGACATGAACCTGAGCAAATGATTCGATCCAGGAGCCCACCGGAGGTGCAGCAGAAAACCATCCTTTACGAAAAGACGACGACGGCGGAGAAGCCTGCGAAAACCGACCACCGAGCATCGGAGTACAGCTCCGGCAAACCCCAGCAGAAGCAGAAGCAGCAGCAGCAGCAAAGTGATGTTATAAATATGAAAGGCGAGAATGTGGGGGCCGTCATGCACATCACTCAATCGTCTGATGGAACAGAAATTCATAAGAAAAAGCCAATCAACGAAAATGCTGAAAAAACAAACAAATCAAGCTCTAATTTGCCACCAAAATCGTTCATGAACAGCAATTTTCAAGGCGTCAACAATTCCATTCTCTACAATTCGTCTTTGAGTCATCGTGATCCTGGCCTGCACCTTTCTTACTCCAACAAGCCCACCCATGGCGATTCTCTTCATCATTGATCTTCAAGGCTTTCATATATATATGCTAAAATAATTATGAATAAAGGAACAACATCAATATTATATATGATAATATATATATATATAATTAAATTAAACCATTTTATATATTCATATATATGATTCATATATGGGATGTATTTTCATATGCTCTTACTATTTGGAAAATGAGTTATTAATAATAAAAGCTTTTAC

Coding sequence (CDS)

ATGGCAAATCTTCCTCGTTTTGGCCGTACATGGCAACGTTTCTCTTCTCTACCTCGCCCCGGAACTGCCTCACGGCCTGAAATAATTCAGCCGGCGACGGTGACGACGGAGCCAGAAGTTTTTCCCTCTGCTCCAACAACCACTAATATTCTTCAAACTTCCCCTATTAGAGAAAGATCATCTCCGGTGAGGAAATTTCCCTCGCCGCCTACTTCTCCAAAATACAGGACCGCCGCCACAATCACTTCGCCTCGTAAGCCGTTATCTCCAACTCCGACATACAACCGCTACGACGGCGAACGACGGACCAGTGCCCCTTCATCTCCAAAAACCTTCAAACCAACTCCCAGCTCTCCGCCACTGTCGCCGGCCAAGCATAAATATTCAACCACCACTACCGCCGCACCACCTTCTCCTCTTCTGCCACGCTCGGAGCCAAGACATGAACCTGAGCAAATGATTCGATCCAGGAGCCCACCGGAGGTGCAGCAGAAAACCATCCTTTACGAAAAGACGACGACGGCGGAGAAGCCTGCGAAAACCGACCACCGAGCATCGGAGTACAGCTCCGGCAAACCCCAGCAGAAGCAGAAGCAGCAGCAGCAGCAAAGTGATGTTATAAATATGAAAGGCGAGAATGTGGGGGCCGTCATGCACATCACTCAATCGTCTGATGGAACAGAAATTCATAAGAAAAAGCCAATCAACGAAAATGCTGAAAAAACAAACAAATCAAGCTCTAATTTGCCACCAAAATCGTTCATGAACAGCAATTTTCAAGGCGTCAACAATTCCATTCTCTACAATTCGTCTTTGAGTCATCGTGATCCTGGCCTGCACCTTTCTTACTCCAACAAGCCCACCCATGGCGATTCTCTTCATCATTGA

Protein sequence

MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERSSPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTPSSPPLSPAKHKYSTTTTAAPPSPLLPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTAEKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPINENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGDSLHH
Homology
BLAST of Tan0022345 vs. NCBI nr
Match: KAG6597058.1 (hypothetical protein SDJN03_10238, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 413.7 bits (1062), Expect = 1.3e-111
Identity = 232/303 (76.57%), Postives = 257/303 (84.82%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERS 60
           MANLPRFGRTW RFSSLPRPGTA R ++  PA  T+EPEV+PSAP T N+LQTSPI+ERS
Sbjct: 1   MANLPRFGRTWNRFSSLPRPGTAPRLDVPPPA-ATSEPEVYPSAPRTANVLQTSPIKERS 60

Query: 61  ----SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
               SPVRK+PSPPTSPKYR AA  TSPRKPLSP PTYNRYDGERR+SA +SPKTFK T 
Sbjct: 61  PRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSALASPKTFKTTY 120

Query: 121 SSPPLSPAKHKYSTTTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTA 180
           +SPP SPAKHKYST+T  AP SPL LP  E RHEPE M+R RSPPEVQQK++LY+K TT+
Sbjct: 121 TSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQK-TTS 180

Query: 181 EKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPI 240
           EKPAKTDHRASEYSSGKPQQKQ   QQQSDVIN+KGENVGAVMHITQSSD TE HKKKP 
Sbjct: 181 EKPAKTDHRASEYSSGKPQQKQ-THQQQSDVINIKGENVGAVMHITQSSDATETHKKKPT 240

Query: 241 ----NENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGD 295
               +EN + +NKSSSN+P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+++ KPTH D
Sbjct: 241 VSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTHAD 300

BLAST of Tan0022345 vs. NCBI nr
Match: XP_022945677.1 (DNA-directed RNA polymerase II subunit 1-like [Cucurbita moschata])

HSP 1 Score: 412.1 bits (1058), Expect = 3.9e-111
Identity = 231/303 (76.24%), Postives = 257/303 (84.82%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERS 60
           MANLPRFGRTW RFSSLPRPGTA R ++  PA  T+EPEV+PSAP T N+LQTSPI+ERS
Sbjct: 1   MANLPRFGRTWNRFSSLPRPGTAPRLDVPPPA-ATSEPEVYPSAPRTANVLQTSPIKERS 60

Query: 61  ----SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
               SPVRK+PSPPTSPKYR AA  TSPRKPLSP PTYNRYDGERR+SA +SPKTFK T 
Sbjct: 61  PRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSALASPKTFKTTY 120

Query: 121 SSPPLSPAKHKYSTTTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTA 180
           +SPP SPAKHKYST+T  AP SPL LP  E RHEPE ++R RSPPEVQQK++LY+K TT+
Sbjct: 121 TSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPVVRPRSPPEVQQKSVLYQK-TTS 180

Query: 181 EKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPI 240
           EKPAKTDHRASEYSSGKPQQKQ   QQQSDVIN+KGENVGAVMHITQSSD TE HKKKP 
Sbjct: 181 EKPAKTDHRASEYSSGKPQQKQ-THQQQSDVINIKGENVGAVMHITQSSDATETHKKKPT 240

Query: 241 ----NENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGD 295
               +EN + +NKSSSN+P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+++ KPTH D
Sbjct: 241 VSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTHAD 300

BLAST of Tan0022345 vs. NCBI nr
Match: XP_022974814.1 (DNA-directed RNA polymerase II subunit 1-like isoform X2 [Cucurbita maxima] >XP_022974815.1 DNA-directed RNA polymerase II subunit 1-like isoform X3 [Cucurbita maxima])

HSP 1 Score: 409.8 bits (1052), Expect = 1.9e-110
Identity = 231/303 (76.24%), Postives = 255/303 (84.16%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERS 60
           MANLPRFGRTW RFSSLPRP TA R ++  PA  T+EPEVF SAP T N+LQTSPI+ERS
Sbjct: 1   MANLPRFGRTWNRFSSLPRPATAPRFDVPPPA-ATSEPEVFSSAPPTANVLQTSPIKERS 60

Query: 61  ----SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
               SPVRK+PSPPTSPKYR AA  TSPRKPLSP PTYNRYDGERR+SA +SPKTFK T 
Sbjct: 61  PRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSAQASPKTFKTTY 120

Query: 121 SSPPLSPAKHKYSTTTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTA 180
           +SPP SPAKHKYST+T  AP SPL LP  E RHEPE M+R RSPPEVQQK++LY+K TT+
Sbjct: 121 TSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQK-TTS 180

Query: 181 EKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPI 240
           EKPAKTDHRASEYSSGKPQQKQ   QQQSDVIN+KGENVGAVMHITQSSD TE HKKKP 
Sbjct: 181 EKPAKTDHRASEYSSGKPQQKQ-THQQQSDVINIKGENVGAVMHITQSSDATETHKKKPT 240

Query: 241 ----NENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGD 295
               +EN + +NKSSSN+P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+++ KPTH D
Sbjct: 241 VSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTHAD 300

BLAST of Tan0022345 vs. NCBI nr
Match: XP_023538835.1 (proline-rich extensin-like protein EPR1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 409.1 bits (1050), Expect = 3.3e-110
Identity = 230/303 (75.91%), Postives = 254/303 (83.83%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERS 60
           MANLPRFGRTW RFSSLPRPGTA R ++  PA   +EPEV+P  P T N+LQTSPI+ERS
Sbjct: 1   MANLPRFGRTWNRFSSLPRPGTAPRLDVPPPA-AASEPEVYPPPPRTANVLQTSPIKERS 60

Query: 61  ----SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
               SPVRK+PSPPTSPKYR AA  TSPRKPLSP PTYNRYDGERRTSA +SPKTFK T 
Sbjct: 61  PRIPSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRTSALASPKTFKTTY 120

Query: 121 SSPPLSPAKHKYSTTTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTA 180
           +SPP SPAKHKYST+T  AP SPL LP  E RHEPE M+R RSPPEVQQK++LY+K TT+
Sbjct: 121 TSPPRSPAKHKYSTSTLQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQK-TTS 180

Query: 181 EKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPI 240
           EKPAKTDHRASEYSSGKPQQKQ   QQQSDVIN+KGENVGAVMHITQSSD TE HKKKP 
Sbjct: 181 EKPAKTDHRASEYSSGKPQQKQ-THQQQSDVINIKGENVGAVMHITQSSDATETHKKKPT 240

Query: 241 ----NENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGD 295
               +EN + +NKSSSN+P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+++ KPTH D
Sbjct: 241 VSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTHAD 300

BLAST of Tan0022345 vs. NCBI nr
Match: XP_022974813.1 (DNA-directed RNA polymerase II subunit 1-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 406.4 bits (1043), Expect = 2.1e-109
Identity = 230/303 (75.91%), Postives = 254/303 (83.83%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERS 60
           MANLPRFGRTW RFSSLPRP TA R ++  PA  T+EPEVF SAP T N+LQTSPI+ERS
Sbjct: 1   MANLPRFGRTWNRFSSLPRPATAPRFDVPPPA-ATSEPEVFSSAPPTANVLQTSPIKERS 60

Query: 61  ----SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
               SPVRK+ SPPTSPKYR AA  TSPRKPLSP PTYNRYDGERR+SA +SPKTFK T 
Sbjct: 61  PRITSPVRKYHSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSAQASPKTFKTTY 120

Query: 121 SSPPLSPAKHKYSTTTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTA 180
           +SPP SPAKHKYST+T  AP SPL LP  E RHEPE M+R RSPPEVQQK++LY+K TT+
Sbjct: 121 TSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQK-TTS 180

Query: 181 EKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPI 240
           EKPAKTDHRASEYSSGKPQQKQ   QQQSDVIN+KGENVGAVMHITQSSD TE HKKKP 
Sbjct: 181 EKPAKTDHRASEYSSGKPQQKQ-THQQQSDVINIKGENVGAVMHITQSSDATETHKKKPT 240

Query: 241 ----NENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGD 295
               +EN + +NKSSSN+P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+++ KPTH D
Sbjct: 241 VSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTHAD 300

BLAST of Tan0022345 vs. ExPASy TrEMBL
Match: A0A6J1G1L2 (DNA-directed RNA polymerase II subunit 1-like OS=Cucurbita moschata OX=3662 GN=LOC111449836 PE=4 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 1.9e-111
Identity = 231/303 (76.24%), Postives = 257/303 (84.82%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERS 60
           MANLPRFGRTW RFSSLPRPGTA R ++  PA  T+EPEV+PSAP T N+LQTSPI+ERS
Sbjct: 1   MANLPRFGRTWNRFSSLPRPGTAPRLDVPPPA-ATSEPEVYPSAPRTANVLQTSPIKERS 60

Query: 61  ----SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
               SPVRK+PSPPTSPKYR AA  TSPRKPLSP PTYNRYDGERR+SA +SPKTFK T 
Sbjct: 61  PRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSALASPKTFKTTY 120

Query: 121 SSPPLSPAKHKYSTTTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTA 180
           +SPP SPAKHKYST+T  AP SPL LP  E RHEPE ++R RSPPEVQQK++LY+K TT+
Sbjct: 121 TSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPVVRPRSPPEVQQKSVLYQK-TTS 180

Query: 181 EKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPI 240
           EKPAKTDHRASEYSSGKPQQKQ   QQQSDVIN+KGENVGAVMHITQSSD TE HKKKP 
Sbjct: 181 EKPAKTDHRASEYSSGKPQQKQ-THQQQSDVINIKGENVGAVMHITQSSDATETHKKKPT 240

Query: 241 ----NENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGD 295
               +EN + +NKSSSN+P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+++ KPTH D
Sbjct: 241 VSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTHAD 300

BLAST of Tan0022345 vs. ExPASy TrEMBL
Match: A0A6J1IEX0 (DNA-directed RNA polymerase II subunit 1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111473598 PE=4 SV=1)

HSP 1 Score: 409.8 bits (1052), Expect = 9.4e-111
Identity = 231/303 (76.24%), Postives = 255/303 (84.16%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERS 60
           MANLPRFGRTW RFSSLPRP TA R ++  PA  T+EPEVF SAP T N+LQTSPI+ERS
Sbjct: 1   MANLPRFGRTWNRFSSLPRPATAPRFDVPPPA-ATSEPEVFSSAPPTANVLQTSPIKERS 60

Query: 61  ----SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
               SPVRK+PSPPTSPKYR AA  TSPRKPLSP PTYNRYDGERR+SA +SPKTFK T 
Sbjct: 61  PRITSPVRKYPSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSAQASPKTFKTTY 120

Query: 121 SSPPLSPAKHKYSTTTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTA 180
           +SPP SPAKHKYST+T  AP SPL LP  E RHEPE M+R RSPPEVQQK++LY+K TT+
Sbjct: 121 TSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQK-TTS 180

Query: 181 EKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPI 240
           EKPAKTDHRASEYSSGKPQQKQ   QQQSDVIN+KGENVGAVMHITQSSD TE HKKKP 
Sbjct: 181 EKPAKTDHRASEYSSGKPQQKQ-THQQQSDVINIKGENVGAVMHITQSSDATETHKKKPT 240

Query: 241 ----NENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGD 295
               +EN + +NKSSSN+P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+++ KPTH D
Sbjct: 241 VSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTHAD 300

BLAST of Tan0022345 vs. ExPASy TrEMBL
Match: A0A6J1IHF4 (DNA-directed RNA polymerase II subunit 1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111473598 PE=4 SV=1)

HSP 1 Score: 406.4 bits (1043), Expect = 1.0e-109
Identity = 230/303 (75.91%), Postives = 254/303 (83.83%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRERS 60
           MANLPRFGRTW RFSSLPRP TA R ++  PA  T+EPEVF SAP T N+LQTSPI+ERS
Sbjct: 1   MANLPRFGRTWNRFSSLPRPATAPRFDVPPPA-ATSEPEVFSSAPPTANVLQTSPIKERS 60

Query: 61  ----SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
               SPVRK+ SPPTSPKYR AA  TSPRKPLSP PTYNRYDGERR+SA +SPKTFK T 
Sbjct: 61  PRITSPVRKYHSPPTSPKYRAAAPSTSPRKPLSPPPTYNRYDGERRSSAQASPKTFKTTY 120

Query: 121 SSPPLSPAKHKYSTTTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTTA 180
           +SPP SPAKHKYST+T  AP SPL LP  E RHEPE M+R RSPPEVQQK++LY+K TT+
Sbjct: 121 TSPPRSPAKHKYSTSTVQAPLSPLALPSLEKRHEPEPMVRPRSPPEVQQKSVLYQK-TTS 180

Query: 181 EKPAKTDHRASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKKPI 240
           EKPAKTDHRASEYSSGKPQQKQ   QQQSDVIN+KGENVGAVMHITQSSD TE HKKKP 
Sbjct: 181 EKPAKTDHRASEYSSGKPQQKQ-THQQQSDVINIKGENVGAVMHITQSSDATETHKKKPT 240

Query: 241 ----NENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNKPTHGD 295
               +EN + +NKSSSN+P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+++ KPTH D
Sbjct: 241 VSYSSENEKDSNKSSSNMPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFTKKPTHAD 300

BLAST of Tan0022345 vs. ExPASy TrEMBL
Match: A0A5A7TZ60 (Microtubule-associated protein RP/EB family member 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00750 PE=4 SV=1)

HSP 1 Score: 368.6 bits (945), Expect = 2.4e-98
Identity = 217/306 (70.92%), Postives = 241/306 (78.76%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRER- 60
           M+NLPRFGRTW RFSSLPRP TA+R E     T  TEPEVFPSA  TTN LQTSP+++R 
Sbjct: 1   MSNLPRFGRTWNRFSSLPRPSTAARQETQPAFTAATEPEVFPSAVPTTNTLQTSPVKQRT 60

Query: 61  ---SSPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
              SSPV+KFPSPP+SPKY  A    SPRKPLSP  TYNR+DGERRTSA +SPKTFKPT 
Sbjct: 61  PRLSSPVKKFPSPPSSPKYSGATGTISPRKPLSPPSTYNRHDGERRTSATTSPKTFKPTY 120

Query: 121 SSPPLSPAKHKYST-TTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTT 180
            SPP SP K K+ST  T  AP SPL LPRSE R EPE+ IR RSPPE++QK ILY+ TTT
Sbjct: 121 ISPPPSPPKPKHSTAATVVAPLSPLALPRSEVRREPERSIRPRSPPEIEQKKILYQ-TTT 180

Query: 181 AEKPAKTDHR-ASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKK 240
            EKP KTD+R   +Y + KPQQKQ  QQ QSDVIN+KG+NVGAVMHITQSSDG+EI KKK
Sbjct: 181 TEKPTKTDYRQTDQYGTSKPQQKQHHQQLQSDVINIKGDNVGAVMHITQSSDGSEILKKK 240

Query: 241 P----INENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNK-PT 295
           P      EN EKTNKS+SN P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+YS+K P 
Sbjct: 241 PSVGQSKENEEKTNKSNSNFPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAYSSKNPK 300

BLAST of Tan0022345 vs. ExPASy TrEMBL
Match: A0A1S3AVJ6 (microtubule-associated protein RP/EB family member 1-like OS=Cucumis melo OX=3656 GN=LOC103483157 PE=4 SV=1)

HSP 1 Score: 368.6 bits (945), Expect = 2.4e-98
Identity = 217/306 (70.92%), Postives = 241/306 (78.76%), Query Frame = 0

Query: 1   MANLPRFGRTWQRFSSLPRPGTASRPEIIQPATVTTEPEVFPSAPTTTNILQTSPIRER- 60
           M+NLPRFGRTW RFSSLPRP TA+R E     T  TEPEVFPSA  TTN LQTSP+++R 
Sbjct: 1   MSNLPRFGRTWNRFSSLPRPSTAARQETQPAFTAATEPEVFPSAVPTTNTLQTSPVKQRT 60

Query: 61  ---SSPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYDGERRTSAPSSPKTFKPTP 120
              SSPV+KFPSPP+SPKY  A    SPRKPLSP  TYNR+DGERRTSA +SPKTFKPT 
Sbjct: 61  PRLSSPVKKFPSPPSSPKYSGATGTISPRKPLSPPSTYNRHDGERRTSATTSPKTFKPTY 120

Query: 121 SSPPLSPAKHKYST-TTTAAPPSPL-LPRSEPRHEPEQMIRSRSPPEVQQKTILYEKTTT 180
            SPP SP K K+ST  T  AP SPL LPRSE R EPE+ IR RSPPE++QK ILY+ TTT
Sbjct: 121 ISPPPSPPKPKHSTAATVVAPLSPLALPRSEVRREPERSIRPRSPPEIEQKKILYQ-TTT 180

Query: 181 AEKPAKTDHR-ASEYSSGKPQQKQKQQQQQSDVINMKGENVGAVMHITQSSDGTEIHKKK 240
            EKP KTD+R   +Y + KPQQKQ  QQ QSDVIN+KG+NVGAVMHITQSSDG+EI KKK
Sbjct: 181 TEKPTKTDYRQTDQYGTSKPQQKQHHQQLQSDVINIKGDNVGAVMHITQSSDGSEILKKK 240

Query: 241 P----INENAEKTNKSSSNLPPKSFMNSNFQGVNNSILYNSSLSHRDPGLHLSYSNK-PT 295
           P      EN EKTNKS+SN P KSFMNSNFQGVNNSILYNSSLSHRDPGLHL+YS+K P 
Sbjct: 241 PSVGQSKENEEKTNKSNSNFPGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAYSSKNPK 300

BLAST of Tan0022345 vs. TAIR 10
Match: AT2G46630.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 110095 Blast hits to 59224 proteins in 2216 species: Archae - 177; Bacteria - 15429; Metazoa - 38345; Fungi - 18843; Plants - 13341; Viruses - 3084; Other Eukaryotes - 20876 (source: NCBI BLink). )

HSP 1 Score: 64.7 bits (156), Expect = 1.4e-10
Identity = 99/332 (29.82%), Postives = 128/332 (38.55%), Query Frame = 0

Query: 42  PSAPTTTNILQTSPIRERS---SPVRKFPSPPTSPKYRTAATITSPRKPLSPTPTYNRYD 101
           P  P       TSP +ERS   SP  +  SPPT PK  T            P P  + Y 
Sbjct: 73  PLTPPRQKAPPTSPPQERSPYHSPPSRHMSPPTPPKAATP----------PPPPPRSSY- 132

Query: 102 GERRTSAPSSPKTFKPT-----PSSPPLSPAKHKYSTTTTAAPPSPLLPRSEPRHEPEQM 161
                ++P SPK  +       P+SPP SPA    STT+ +            R  P   
Sbjct: 133 -----TSPPSPKEVQEALPPRKPNSPP-SPAHSSRSTTSESVKTRSPSESENHRKAPSPR 192

Query: 162 IRS---------RSPPEVQQKTILYEKTTTAEKPAKT----------------------- 221
           + S          S  E  QK IL     TAEK ++T                       
Sbjct: 193 VLSPYSLPASLLHSERETTQKNIL-----TAEKTSQTHETNHHNQNHNHDYNQNHNYNQN 252

Query: 222 -DHRASEYSSGKPQQKQKQQQQQSD--------VINMKGENVGAVMHITQSSDGTE---- 281
             +  ++   G   +K  +Q   SD        VI + GEN GAVM I +S  G +    
Sbjct: 253 HSYNQNQNHQGNNPKKMHRQPSSSDSENIMSTRVITIAGENKGAVMEILRSPQGNKTGGS 312

Query: 282 -IHKKKPINENAEK-------------------------TNKSSSNLPPKSFMNSNFQGV 295
             H  +  +   EK                          NK +SNLP K+FMNSN Q +
Sbjct: 313 GTHSSRVSHGTGEKGRRLQSSSSSSSDEGEGKKKTTKNVPNKGNSNLPMKAFMNSNVQMI 372

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG6597058.11.3e-11176.57hypothetical protein SDJN03_10238, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022945677.13.9e-11176.24DNA-directed RNA polymerase II subunit 1-like [Cucurbita moschata][more]
XP_022974814.11.9e-11076.24DNA-directed RNA polymerase II subunit 1-like isoform X2 [Cucurbita maxima] >XP_... [more]
XP_023538835.13.3e-11075.91proline-rich extensin-like protein EPR1 [Cucurbita pepo subsp. pepo][more]
XP_022974813.12.1e-10975.91DNA-directed RNA polymerase II subunit 1-like isoform X1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1G1L21.9e-11176.24DNA-directed RNA polymerase II subunit 1-like OS=Cucurbita moschata OX=3662 GN=L... [more]
A0A6J1IEX09.4e-11176.24DNA-directed RNA polymerase II subunit 1-like isoform X2 OS=Cucurbita maxima OX=... [more]
A0A6J1IHF41.0e-10975.91DNA-directed RNA polymerase II subunit 1-like isoform X1 OS=Cucurbita maxima OX=... [more]
A0A5A7TZ602.4e-9870.92Microtubule-associated protein RP/EB family member 1-like OS=Cucumis melo var. m... [more]
A0A1S3AVJ62.4e-9870.92microtubule-associated protein RP/EB family member 1-like OS=Cucumis melo OX=365... [more]
Match NameE-valueIdentityDescription
AT2G46630.11.4e-1029.82unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 28..59
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 157..171
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 175..189
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 73..111
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..253
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 225..239
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 190..209
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 122..138
NoneNo IPR availablePANTHERPTHR33472:SF1EXTENSIN-RELATEDcoord: 192..290
NoneNo IPR availablePANTHERPTHR33472OS01G0106600 PROTEINcoord: 192..290

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0022345.1Tan0022345.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032774 RNA biosynthetic process
molecular_function GO:0003899 DNA-directed 5'-3' RNA polymerase activity