Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAGGAAAACTCCGGTGAAACGACAGCGTATTCCCAGGAGAGGACCTGGCGTTGCAGAGCTTGAAAAGATTTTAAAGGAGCAAGGAGGCCAAGACTCCGAGACCCAATTTCAAGACATTTCCCCTCCACCGCCGCCGCCGCCACCGTCGCCACCGCCCCCACCCTTTCGTTTTCAACCTCACCACCAGTCTCCGAGTACTCCGTCGTTAAACCCTCCTCCTCCGCCGTCGCCGCCTCCATTCCTGGCGCCGAGAGACTACTCTTCTTGGTCAGATCTTCCGTCGTTCCCAAGTCTTAGTCTCATTCCCCCGGCCCTCCCAACCGCCGCCGACGCCGCCGGAAAAGCCGTCTTTCCAACCACCGCAAAATCCGACACCAATATTCCTCCACATTTTTTTCCCAATTTTCAATTTTCTGCTTCCTCACTCAATCGTCAATATTACAATTCAATGGTGAGACAATATTAAATAGCAAATATACATATTTTTTTTAAAAAAATTTATGGATGTATATATATTAAATTCATGGAAAAGGTTTGCAACTTTTTCAGGTGAATTTTATTCCAGGCTCTGCCTCATCGTCTCCGTCAGCTACGTCATCGGCGGCGAGATATTTTCATCAAATAGAGCACCCTTCAAGCCAAAGATCTACCGATTTCAACCACACGTGGGCCTCGCCGGAGGAACAAGAAAAGGCAATTACTCCTCAGCTCTTTTCTTCTTCAATAACAATATTTATTTATTTTAAGTTAAAATATCATTTTCATTCTGGTTTTGAGTTCCATTTCATTTTAATCCTTCTATTATTTAATTGCTAAATATTTAAATAAAGAGATCAAAATGGTATTTTTAATATTATAAAAAAATGATATTTTAACTTTTATGGCTAAATTTCTTGTAATTATTTTTACTTTAGGAAAACATCTTTGTTATTAAATACATAATTTTGCCACCTATTTAGGATACAAAGAAAATAATGGCACAACTAATTAAAGAAAAATATTTGGATGTTGCTAAAAAAAATGTATCTTTTTTAAAGAAAAAAAATGTATGGTTGATGAGTGGGGAGCTCAACCTTTAAGTGTTGCCCCTAAGTAAAATACTCTACTTGGCTAATTAAATAATATATCAATTTTTGTGGAATTTTTTAAGAGAACTTTTAATATGATATTTTTTTCCTGACTTATATGTGCATGGAAATTTGAGTTTTGAAAGGGAATAGAAATAATAATAATTGATGATTACTAAAAAAAAATTATGTTCTGATTTTATAATGGTGTGGATAATTTTGACAGATGGTAATTAGTGCAAAGAGGGCGAGAGGCTTTTTGGAAGAAAGCCAAAGGGATTCAAATTTTGAGAGCAGAAGAGTTCCAATTTTCTCAAACATGGTCGTGAAAGAGTCATCTTCCTCCTCCTCTTCTTCTTTGGAGATGAATCGCACTCCAATCAATTTGGATTCAAACTTGAGGTTCAGCTTCCTTTCTTTTTTTCAACGTCTCTTCTTTTGTTTAAAAAAAAAAAAATTGAAATCTCAATTGAATTTCAAACAAAATAAATGTTTTTCAAAATTAAGAACCCAAATCTTTATCAAAATATTCAGAAAACTATTGTTTCTTTCGACTTAGTAATCCAAAAAACAAACTAATAAAATAAGAGAATTATTTAGGTGTTGTAGCAGTCCATGTTTTAATCACAAATATTTTTCTTAAAAAATAATAAAATTTGTGAATATCCATTGGATTGCACAACACTCAGATGCTGCCTCTTTGCTTAAAAAATAAAAAATAAAAAAACTTTGTTTACTATATATAATTTGTTTTTGTTTTCCTCTCATTTTTTAAAAACATTTTCAAACGGCCAATTAATTTCTTTTTAAATGTTAGAAATTTTGTTTTAAAAACATTAGTATTAGATAATAACAAGGAAGTAATTGAGAGATTGAATCATAAATCATAGACTAAATGCTTGAATTACAATGGTTATAAACTAAAATCAAA
ATTGTTATATTGCTGTCTAAGTTAAGTTAATAAAGCTTTGGTTTCCGGCTTTTGATTAAATTGGGATGAATAACTGTAATTAGCGAAATTAAGGAGTTGTTTGGGGTGCTAAATGGGTTATAATAACATGAGGTTTATAATAGTCTGTAGGTTATTATAATCTGCGAAACATATAATATTATTTAAAATACAGAGTAGTATAGTCTGGGTTATAATAGTCTATGTTTGGAGTGCAGAATATTTCACAAGTTGATTAATTGGTTTTGTTTATGTTTAATTTTTTTTGGTGGGCAGAAGTACAAAAAGGGATTTTGCAGGTCAGTTAATGATCTCCAAAAAAAGCAGTGGAAGCTACCAATTAGCAGAAAATGATAGCAGCAATTTAATGGCAGTAGCAGGATCTTCATCAACTCCAAATCAATTTGCAGCATTTGATTTCCAAGTAAGAAACATCCACCCATGTTCATCCATCATCAACCCTTTCTTTGTTTAATCAAACTTTCTACAAATGAGTTTATATGTTTTCTTATCTACTTTGTACCTATATTTTCAAAAACTTTCAACTTTTAAAAACTAAAAAAATAGCTTTCAAAAACTTTGATTTTGTTTTTAGAATTTAGCTAAAAGTTCAAATGTTTACTTAAGGAGATGATCTCCATTGTAGAGAAATTAGGTGAAAATAAACTTAACTTTCAAAAATTAAAAATGTAAAACAAAATGGTTATCAAACGGGGTCTAAATGATAATCATTTGGTTTTTTATTTTAAAAATTGGACTTATTTTCTCACTGTTTCTTTATAATGAGTTACATATTTCCTAAGAAAAGATTTGAATTTTTAATCAAATTTCAAAAACAAAAGTAAAAAACTTGGATTTTCAAAATATTTCTAAGAAATAGATAATAAAACAAAGGAAATTCTATGGGTGGAAGTAGCTTTTAATTAAAAGTTTAATTTTAAAAAAATAAATAATTATCAAATGGGAAGAAATTAGTTGAATTAATGAAATTTGACCTTAGAAAATTTTAATTTCATCGATTAAATCCTAGACTTTCATTAGACAACAAAAATTCAATACCCTCCTTAGAATTTTCTAAAAATCAATAAATCACCAAAAATCTGACCAAACAAAATATTCCAAATCCAGCTAGGTGGAAAAATGCTCAAACCCAAAAGGATGTCTAATCCAACACCAAAATGTCTAAAATCCAACCAAAAACAAACATCCAACATAGCTACAAAACTACTCACAAAATTAGTGTCTCCTTGCTAGTAATTAATAACCGATTTTTTTATCTTTATACTAATCGATTTTACCTTTAAATTTTATTAAATTCAATTCTAAACTTGAATGAATTAGGTGGAAGTTATTAGTCTAGTTTTCACTATTTTGAAAATTTGTGCTCAACTATTATAAGTCAATGTTATTTATGCAAGTTTTGTAGTCATTACATGTTTTTTAGTTCGGTTGGTTAAGTTTTATTATTTTGAAAATTTATTGTTAGTTATAAACATTTTTTGTGATTGAATAATTAATGCTATTTTTATAAATTTAATTTTTTTTTTGTCATATTGTATGCATAGCCAAGCTTTCGTCATTTGAAAATTTGTCATTGGCTAAAGTATTCTTGTATAGCTTGACAATATTATTGATCTTATATATATATATATATGTGTGTGTAAGTTTTTCTGGATATAGAATATATTTACACATATATATTATTAAAGTAATTACAAATAGAGCATATTTATAAAAATAACTGTTAAATATAATATTTTGTAGAAATTATTAAAGTAAATCAATATGTCTTTTTTAATTATTATTTTGTATGAGCTCCTGGATGAACATAAGTTTACCATTATCTTTTTCTTTTTTATATAGTTTAAAGTATACTATTATCCTCTATGTATAACTATAGATTAACTAAAATGACAGTGATCGCTATTTTAATAAATTACAATCATATTCTTTTCCTTTTAAAATATTTGAAATGATTTCTT
AATTTTTATTTTTTTTAATTTTCAGTCACCCTAGCTTTCTATTGTTGAGTTTGGGAGCATTTAATCATTTTATCCAGTTGATTTCATGAGTATATTTAGGATTAAATATTTGTATATGGCTTGATTTTTTTCTAATTTAGTTTTGAAGGAGTATTTAAAATTAATTTGATAAAATAATAAAATACGGTAAAAATGTTATATGAAGAGGGCAAGAGGGAAATAAATGTTTAAAGCATTTTTTAAAGTAAAAATTGATATGTGGGGAAATGATTTTGAAGGAAACTATGGAGGCTTCAGAGCATAGAGATGGAGGAGGAAGTGCCTCAGATTACAACAAAATTACATTTAACTCCTCCTCCTTGTATGAATCAAAGTCAAAGTCAAAGTCAAAGTCAAAGTCAAAGGGCATCATAGTAGCTGTCACTGAAGTCGAAGCCGAAGCCGAAGCCGAAGGAGATGGCATTGATCTTGATTTGAAGCTTTAA
mRNA sequence
ATGAGGAAAACTCCGGTGAAACGACAGCGTATTCCCAGGAGAGGACCTGGCGTTGCAGAGCTTGAAAAGATTTTAAAGGAGCAAGGAGGCCAAGACTCCGAGACCCAATTTCAAGACATTTCCCCTCCACCGCCGCCGCCGCCACCGTCGCCACCGCCCCCACCCTTTCGTTTTCAACCTCACCACCAGTCTCCGAGTACTCCGTCGTTAAACCCTCCTCCTCCGCCGTCGCCGCCTCCATTCCTGGCGCCGAGAGACTACTCTTCTTGGTCAGATCTTCCGTCGTTCCCAAGTCTTAGTCTCATTCCCCCGGCCCTCCCAACCGCCGCCGACGCCGCCGGAAAAGCCGTCTTTCCAACCACCGCAAAATCCGACACCAATATTCCTCCACATTTTTTTCCCAATTTTCAATTTTCTGCTTCCTCACTCAATCGTCAATATTACAATTCAATGATGGTAATTAGTGCAAAGAGGGCGAGAGGCTTTTTGGAAGAAAGCCAAAGGGATTCAAATTTTGAGAGCAGAAGAGTTCCAATTTTCTCAAACATGGTCGTGAAAGAGTCATCTTCCTCCTCCTCTTCTTCTTTGGAGATGAATCGCACTCCAATCAATTTGGATTCAAACTTGAGAAGTACAAAAAGGGATTTTGCAGGTCAGTTAATGATCTCCAAAAAAAGCAGTGGAAGCTACCAATTAGCAGAAAATGATAGCAGCAATTTAATGGCAGTAGCAGGATCTTCATCAACTCCAAATCAATTTGCAGCATTTGATTTCCAAGAAACTATGGAGGCTTCAGAGCATAGAGATGGAGGAGGAAGTGCCTCAGATTACAACAAAATTACATTTAACTCCTCCTCCTTGTATGAATCAAAGTCAAAGTCAAAGTCAAAGTCAAAGTCAAAGGGCATCATAGTAGCTGTCACTGAAGTCGAAGCCGAAGCCGAAGCCGAAGGAGATGGCATTGATCTTGATTTGAAGCTTTAA
Coding sequence (CDS)
ATGAGGAAAACTCCGGTGAAACGACAGCGTATTCCCAGGAGAGGACCTGGCGTTGCAGAGCTTGAAAAGATTTTAAAGGAGCAAGGAGGCCAAGACTCCGAGACCCAATTTCAAGACATTTCCCCTCCACCGCCGCCGCCGCCACCGTCGCCACCGCCCCCACCCTTTCGTTTTCAACCTCACCACCAGTCTCCGAGTACTCCGTCGTTAAACCCTCCTCCTCCGCCGTCGCCGCCTCCATTCCTGGCGCCGAGAGACTACTCTTCTTGGTCAGATCTTCCGTCGTTCCCAAGTCTTAGTCTCATTCCCCCGGCCCTCCCAACCGCCGCCGACGCCGCCGGAAAAGCCGTCTTTCCAACCACCGCAAAATCCGACACCAATATTCCTCCACATTTTTTTCCCAATTTTCAATTTTCTGCTTCCTCACTCAATCGTCAATATTACAATTCAATGATGGTAATTAGTGCAAAGAGGGCGAGAGGCTTTTTGGAAGAAAGCCAAAGGGATTCAAATTTTGAGAGCAGAAGAGTTCCAATTTTCTCAAACATGGTCGTGAAAGAGTCATCTTCCTCCTCCTCTTCTTCTTTGGAGATGAATCGCACTCCAATCAATTTGGATTCAAACTTGAGAAGTACAAAAAGGGATTTTGCAGGTCAGTTAATGATCTCCAAAAAAAGCAGTGGAAGCTACCAATTAGCAGAAAATGATAGCAGCAATTTAATGGCAGTAGCAGGATCTTCATCAACTCCAAATCAATTTGCAGCATTTGATTTCCAAGAAACTATGGAGGCTTCAGAGCATAGAGATGGAGGAGGAAGTGCCTCAGATTACAACAAAATTACATTTAACTCCTCCTCCTTGTATGAATCAAAGTCAAAGTCAAAGTCAAAGTCAAAGTCAAAGGGCATCATAGTAGCTGTCACTGAAGTCGAAGCCGAAGCCGAAGCCGAAGGAGATGGCATTGATCTTGATTTGAAGCTTTAA
Protein sequence
MRKTPVKRQRIPRRGPGVAELEKILKEQGGQDSETQFQDISPPPPPPPPSPPPPPFRFQPHHQSPSTPSLNPPPPPSPPPFLAPRDYSSWSDLPSFPSLSLIPPALPTAADAAGKAVFPTTAKSDTNIPPHFFPNFQFSASSLNRQYYNSMMVISAKRARGFLEESQRDSNFESRRVPIFSNMVVKESSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMISKKSSGSYQLAENDSSNLMAVAGSSSTPNQFAAFDFQETMEASEHRDGGGSASDYNKITFNSSSLYESKSKSKSKSKSKGIIVAVTEVEAEAEAEGDGIDLDLKL
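The protein sequence above should follow from the CDS by reading it in frame from the first base and stopping at the terminal stop codon. The sketch below illustrates that relationship on the first ten and last four codons of the CDS only. It assumes the standard genetic code (NCBI translation table 1), and the function name `translate` is illustrative and not part of this database.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Only unambiguous A/C/G/T bases are handled; any other symbol
    raises KeyError.
    """
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS listed above.
cds_start = "ATGAGGAAAACTCCGGTGAAACGACAGCGT"
# Last four codons of the CDS listed above (ends in the TAA stop codon).
cds_end = "TTGAAGCTTTAA"

print(translate(cds_start))  # MRKTPVKRQR
print(translate(cds_end))    # LKL
```

Both fragments reproduce the corresponding ends of the protein sequence (it starts with MRKTPVKRQR and ends with LKL). The same function could be applied to the full CDS string for a complete check.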
Homology
BLAST of Tan0021218 vs. NCBI nr
Match:
KAG7019594.1 (hypothetical protein SDJN02_18557, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 270.4 bits (690), Expect = 2.0e-68
Identity = 202/351 (57.55%), Positives = 230/351 (65.53%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKE-QGGQDSETQFQDISPPPPPPPPSPPPPPFRFQ 60
M+KT KR RIPRRGPGVAELEKILKE QGG QD S PP F+ +
Sbjct: 1 MKKTHPKRHRIPRRGPGVAELEKILKEQQGGHGGGNGGQD-----QAHISSLPPSFFQHR 60
Query: 61 PHHQSPS-TP-SLN---PPPPPSPPPFLAP---RDYSSWSDLPSFPSLSLIPPALPT--- 120
SPS TP SLN PPPPP PPP L P RDY+SWS+LP FP+L IPPALPT
Sbjct: 61 RRRHSPSNTPSSLNPPRPPPPPPPPPLLPPPLTRDYASWSNLPLFPTLDFIPPALPTPTP 120
Query: 121 AADAAGKAVFPTTAKSDT--NIPPHFFPNFQFSASSLNRQYYNSMMVISAKRARGFLEES 180
AA A K +FPTT KSD N+PPHFFP FQ+SASS N M++SAKR R FL+E
Sbjct: 121 AAAAVEKPLFPTTRKSDAQINLPPHFFPTFQYSASSHN------PMMVSAKRVRPFLDEG 180
Query: 181 QRDSNFESRRVPIFSNMVVKE----SSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMI 240
RD N ES R P F+NM KE SSSSSSSS++MNR+P +LDSN R TKR F G+LM
Sbjct: 181 HRDPNAES-RAPFFTNMATKESSSSSSSSSSSSMDMNRSPFDLDSNSRGTKRGFGGELMR 240
Query: 241 SKKSSGSYQLAENDSSNLMAVAGSSSTPNQF-AAFDF---QETMEASEHRDGGGSASDYN 300
K S YQLA + S+LMA+ SSS PN+ AAF+ QETMEAS++RDGGGSASDYN
Sbjct: 241 CSKRSERYQLA-GEVSHLMALGSSSSAPNEVAAAFNIHHPQETMEASQYRDGGGSASDYN 300
Query: 301 KITFN--SSSLYESKSKSKSKSKSKGIIVAVTEVEAEAEAEGDGIDLDLKL 328
K+TFN SSSLYESK K + E EAEAEAE +GIDL LKL
Sbjct: 301 KVTFNSSSSSLYESKLKGNKEIVIGSGFEVEAEAEAEAEAEAEGIDLSLKL 338
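The percentages in an HSP header are the match counts divided by the alignment length, which includes gap columns. That is why the denominator here (351) exceeds the 328-residue query. A minimal sketch of the arithmetic, using the counts reported for this first hit (the helper name `percent` is illustrative):

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count as a percentage of the alignment length, to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the HSP header of the KAG7019594.1 hit above.
identical = 202
positive = 230   # identical residues plus conservative substitutions
aligned = 351    # alignment columns, gaps included

print(percent(identical, aligned))  # 57.55
print(percent(positive, aligned))   # 65.53
```

The same calculation reproduces the identity values in the other hits, for example 204/391 gives 52.17.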
BLAST of Tan0021218 vs. NCBI nr
Match:
XP_023001726.1 (trinucleotide repeat-containing gene 18 protein-like [Cucurbita maxima] >XP_023001727.1 trinucleotide repeat-containing gene 18 protein-like [Cucurbita maxima])
HSP 1 Score: 268.5 bits (685), Expect = 7.7e-68
Identity = 204/391 (52.17%), Positives = 237/391 (60.61%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKEQ----GGQDSETQFQDISPPPPPPPPSPPPPPF 60
M+KT KR RIPRRGPGVAELEKILKEQ GG + DIS S PP F
Sbjct: 1 MKKTHPKRHRIPRRGPGVAELEKILKEQEGGNGGGNGGQDQADIS--------SLPPSFF 60
Query: 61 RFQPHHQSPSTP-SLN---PPPPPSPPPFLAP---RDYSSWSDLPSFPSLSLIPPALPT- 120
+ + H +TP SLN PPPPP PPP L P RDY+SWS+LP FP+L IPPALPT
Sbjct: 61 QHRRRHSPSNTPSSLNPPRPPPPPPPPPLLPPPLTRDYASWSNLPLFPTLDFIPPALPTP 120
Query: 121 ----AADAAGKAVFPTTAKSDT--NIPPHFFPNFQFSASSLN--------------RQYY 180
AA AA K +FPTT KSD N+PPHFFP FQ+SASS N +YY
Sbjct: 121 TPAAAAAAAEKPLFPTTRKSDAQINLPPHFFPTFQYSASSHNLMMNIIPGSATSPATRYY 180
Query: 181 NSM----------------------MVISAKRARGFLEESQRDSNFESRRVPIFSNMVVK 240
+ ++SAKR R FL+E RD N ES R P F+NM K
Sbjct: 181 RQIEHPSSQISTQFNHTWTSPEEQQKMVSAKRVRPFLDEGHRDPNAES-RAPFFTNMATK 240
Query: 241 E----SSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMISKKSSGSYQLAENDSSNLMA 300
E SSSSSSSS++MNR+P NLDSN R TKR F G+LM K S YQLA + S+LMA
Sbjct: 241 ESSSSSSSSSSSSMDMNRSPFNLDSNSRGTKRGFGGELMRCSKRSERYQLA-GEVSHLMA 300
Query: 301 VAGSSSTPNQF-AAFDF---QETMEASEHRDGGGSASDYNKITFN--SSSLYESKSKSKS 328
+ SSS PN+ AAF+ QETMEAS++RD GGSASDYNK+TFN SSSLYESKSK
Sbjct: 301 LGSSSSAPNEVAAAFNIHHPQETMEASQYRDEGGSASDYNKVTFNSSSSSLYESKSKGNK 360
BLAST of Tan0021218 vs. NCBI nr
Match:
XP_022927210.1 (nuclear envelope pore membrane protein POM 121-like [Cucurbita moschata] >XP_022927212.1 nuclear envelope pore membrane protein POM 121-like [Cucurbita moschata] >XP_022927213.1 nuclear envelope pore membrane protein POM 121-like [Cucurbita moschata])
HSP 1 Score: 256.5 bits (654), Expect = 3.0e-64
Identity = 202/391 (51.66%), Positives = 235/391 (60.10%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKEQ-----GGQDSETQFQ-DISPPPPPPPPSPPPP 60
M+KT KR RIPRRGPGVAELEKILKEQ GG + Q Q DIS S PP
Sbjct: 1 MKKTQPKRHRIPRRGPGVAELEKILKEQEGGDGGGNGNGGQDQADIS--------SLPPS 60
Query: 61 PFRFQPHHQSPSTP-SLN---PPPPPSPPPFLAP---RDYSSWSDLPSFPSLSLIPPALP 120
F+ + H +TP SLN PPPPP PPP L P RDY+SWS+LP FP+L IPPALP
Sbjct: 61 FFQHRRRHSPSNTPSSLNPPRPPPPPPPPPLLPPPLTRDYASWSNLPLFPTLDFIPPALP 120
Query: 121 T---AADAAGKAVFPTTAKSDT--NIPPHFFPNFQFSASSLN--------------RQYY 180
T AA A K +FPTT KSD N+PPHFFP FQ+SASS N +YY
Sbjct: 121 TPTPAAAAVEKPLFPTTRKSDAQINLPPHFFPTFQYSASSHNPMMNIIPGSATSPAARYY 180
Query: 181 NSM----------------------MVISAKRARGFLEESQRDSNFESRRVPIFSNMVVK 240
+ ++SAKR R FL+E RD N ES R P F+NM K
Sbjct: 181 RQIEHPSSQISTQFNNTWTSPEEQQKMVSAKRVRPFLDEGHRDPNAES-RAPFFTNMATK 240
Query: 241 E----SSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMISKKSSGSYQLAENDSSNLMA 300
E SSSSSSSS++MNR+P +LDSN R TKR F G+LM K S YQLA + S+LMA
Sbjct: 241 ESSSSSSSSSSSSMDMNRSPFDLDSNSRGTKRGFGGELMRCSKRSERYQLA-GEVSHLMA 300
Query: 301 VAGSSSTPNQF-AAFDF---QETMEASEHRDGGGSASDYNKITFN--SSSLYESKSKSKS 328
+ SSS PN+ AAF+ QET EAS++RD GGSASDYNK+TFN SSSLYESK K
Sbjct: 301 LGSSSSAPNEVAAAFNIHHPQETTEASQYRD-GGSASDYNKVTFNSSSSSLYESKLKGNK 360
BLAST of Tan0021218 vs. NCBI nr
Match:
KAG6583971.1 (hypothetical protein SDJN03_19903, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 253.4 bits (646), Expect = 2.6e-63
Identity = 201/390 (51.54%), Positives = 234/390 (60.00%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKE-QGGQDSETQFQDISPPPPPPPPSPPPPPFRFQ 60
M+KT KR RIPRRGPGVAELEKILKE QGG QD S PP F+ +
Sbjct: 1 MKKTHPKRHRIPRRGPGVAELEKILKEQQGGHGGGNGGQD-----QAHISSLPPSFFQHR 60
Query: 61 PHHQSPS-TP-SLN---PPPPPSPPPFLAP---RDYSSWSDLPSFPSLSLIPPALPT--- 120
SPS TP SLN PPPPP PPP L P RDY+SWS+LP FP+L IPPALPT
Sbjct: 61 RRRHSPSNTPSSLNPPRPPPPPPPPPLLPPPLTRDYASWSNLPLFPTLDFIPPALPTPTP 120
Query: 121 ---AADAAGKAVFPTTAKSDT--NIPPHFFPNFQFSASSLN--------------RQYYN 180
AA A K +FPTT KSD N+PPHFFP FQ+SASS N +YY
Sbjct: 121 TPAAAAAVEKPLFPTTRKSDAQINLPPHFFPTFQYSASSHNLMMNIIPGSATSPAARYYR 180
Query: 181 SM----------------------MVISAKRARGFLEESQRDSNFESRRVPIFSNMVVKE 240
+ ++SAKR R FL+E RD N ES R P F+NM KE
Sbjct: 181 QIEHPSSQISTQFNNTWTSPEEQQKMVSAKRVRPFLDEGHRDPNAES-RAPFFTNMATKE 240
Query: 241 ----SSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMISKKSSGSYQLAENDSSNLMAV 300
SSSSSSSS++MNR+P++LDSN R TKR F G+LM K S YQLA + S+LMA+
Sbjct: 241 SSSSSSSSSSSSMDMNRSPLDLDSNSRGTKRGFGGELMRCSKRSERYQLA-GEVSHLMAL 300
Query: 301 AGSSSTPNQF-AAFDF---QETMEASEHRDGGGSASDYNKITFN--SSSLYESKSKSKSK 328
SSS PN+ AAF+ QETMEAS++RDGGGSASDYN++TFN SSSLYESK K +
Sbjct: 301 GSSSSAPNEVAAAFNIHHPQETMEASQYRDGGGSASDYNRVTFNSSSSSLYESKLKGNKE 360
BLAST of Tan0021218 vs. NCBI nr
Match:
XP_038895159.1 (uncharacterized protein DDB_G0271670-like [Benincasa hispida])
HSP 1 Score: 248.4 bits (633), Expect = 8.3e-62
Identity = 195/395 (49.37%), Positives = 229/395 (57.97%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKEQGGQDSETQFQDISPPPPPPPPSPPPPPFRFQP 60
M+K KRQRIPRRGPGVAELEKILKEQ G + QDI PP P
Sbjct: 1 MKKAQAKRQRIPRRGPGVAELEKILKEQEGAAAA---QDIISPPQPT-----------NT 60
Query: 61 HHQSPSTPSLNPPPPPSPPPFLAPRDY-SSWS-DLPSFPSLSLIPPALPTAADA----AG 120
H S P PPPPP PPP + DY +SWS +LP FP+L IPP LP +A A A
Sbjct: 61 HPLSSLNPP-RPPPPPPPPPLVPTTDYNASWSNNLPLFPTLEFIPPPLPNSAAAVTASAK 120
Query: 121 KAVFPTTAKSDTNI--PPHFFPNFQFSASSLNRQYYNSMM-------------------- 180
K +FPTT SD+ + PHFFP+FQ+SASSLN + YNSM+
Sbjct: 121 KPLFPTTRISDSQLKFAPHFFPSFQYSASSLNVECYNSMVNINPGSASSCPSATSSAGRY 180
Query: 181 ---------------------------VISAKRARGFLEESQRDSNFESRRVPIFSNMVV 240
++SAKR R FLEES R++N ES+ PIF NM
Sbjct: 181 FREIEHPSSQISSDFNNIWTSPEEQEKMVSAKRVRAFLEESHREANIESKG-PIFKNMAT 240
Query: 241 KE--------SSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMISKKSSGSYQLAENDS 300
K+ SSSSSSSS+EMN +P N DSN R TKR FAGQLM S K SGSYQLAE+
Sbjct: 241 KDSSSSSSSSSSSSSSSSMEMNHSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLAED-- 300
Query: 301 SNLMAV-AGSSSTPNQFAAFDF---QETMEASEHRDGGGSASDYNKITFN-SSSLYESKS 328
SNLMA+ + SSS PN+ AAF+F QETMEAS+HRD GG SDY K+TFN SSS+YES S
Sbjct: 301 SNLMALGSSSSSAPNEIAAFNFHLPQETMEASQHRD-GGCVSDYYKVTFNSSSSMYESNS 360
BLAST of Tan0021218 vs. ExPASy TrEMBL
Match:
A0A6J1KHF7 (trinucleotide repeat-containing gene 18 protein-like OS=Cucurbita maxima OX=3661 GN=LOC111495777 PE=4 SV=1)
HSP 1 Score: 268.5 bits (685), Expect = 3.7e-68
Identity = 204/391 (52.17%), Positives = 237/391 (60.61%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKEQ----GGQDSETQFQDISPPPPPPPPSPPPPPF 60
M+KT KR RIPRRGPGVAELEKILKEQ GG + DIS S PP F
Sbjct: 1 MKKTHPKRHRIPRRGPGVAELEKILKEQEGGNGGGNGGQDQADIS--------SLPPSFF 60
Query: 61 RFQPHHQSPSTP-SLN---PPPPPSPPPFLAP---RDYSSWSDLPSFPSLSLIPPALPT- 120
+ + H +TP SLN PPPPP PPP L P RDY+SWS+LP FP+L IPPALPT
Sbjct: 61 QHRRRHSPSNTPSSLNPPRPPPPPPPPPLLPPPLTRDYASWSNLPLFPTLDFIPPALPTP 120
Query: 121 ----AADAAGKAVFPTTAKSDT--NIPPHFFPNFQFSASSLN--------------RQYY 180
AA AA K +FPTT KSD N+PPHFFP FQ+SASS N +YY
Sbjct: 121 TPAAAAAAAEKPLFPTTRKSDAQINLPPHFFPTFQYSASSHNLMMNIIPGSATSPATRYY 180
Query: 181 NSM----------------------MVISAKRARGFLEESQRDSNFESRRVPIFSNMVVK 240
+ ++SAKR R FL+E RD N ES R P F+NM K
Sbjct: 181 RQIEHPSSQISTQFNHTWTSPEEQQKMVSAKRVRPFLDEGHRDPNAES-RAPFFTNMATK 240
Query: 241 E----SSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMISKKSSGSYQLAENDSSNLMA 300
E SSSSSSSS++MNR+P NLDSN R TKR F G+LM K S YQLA + S+LMA
Sbjct: 241 ESSSSSSSSSSSSMDMNRSPFNLDSNSRGTKRGFGGELMRCSKRSERYQLA-GEVSHLMA 300
Query: 301 VAGSSSTPNQF-AAFDF---QETMEASEHRDGGGSASDYNKITFN--SSSLYESKSKSKS 328
+ SSS PN+ AAF+ QETMEAS++RD GGSASDYNK+TFN SSSLYESKSK
Sbjct: 301 LGSSSSAPNEVAAAFNIHHPQETMEASQYRDEGGSASDYNKVTFNSSSSSLYESKSKGNK 360
BLAST of Tan0021218 vs. ExPASy TrEMBL
Match:
A0A6J1EGJ1 (nuclear envelope pore membrane protein POM 121-like OS=Cucurbita moschata OX=3662 GN=LOC111434129 PE=4 SV=1)
HSP 1 Score: 256.5 bits (654), Expect = 1.5e-64
Identity = 202/391 (51.66%), Positives = 235/391 (60.10%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKEQ-----GGQDSETQFQ-DISPPPPPPPPSPPPP 60
M+KT KR RIPRRGPGVAELEKILKEQ GG + Q Q DIS S PP
Sbjct: 1 MKKTQPKRHRIPRRGPGVAELEKILKEQEGGDGGGNGNGGQDQADIS--------SLPPS 60
Query: 61 PFRFQPHHQSPSTP-SLN---PPPPPSPPPFLAP---RDYSSWSDLPSFPSLSLIPPALP 120
F+ + H +TP SLN PPPPP PPP L P RDY+SWS+LP FP+L IPPALP
Sbjct: 61 FFQHRRRHSPSNTPSSLNPPRPPPPPPPPPLLPPPLTRDYASWSNLPLFPTLDFIPPALP 120
Query: 121 T---AADAAGKAVFPTTAKSDT--NIPPHFFPNFQFSASSLN--------------RQYY 180
T AA A K +FPTT KSD N+PPHFFP FQ+SASS N +YY
Sbjct: 121 TPTPAAAAVEKPLFPTTRKSDAQINLPPHFFPTFQYSASSHNPMMNIIPGSATSPAARYY 180
Query: 181 NSM----------------------MVISAKRARGFLEESQRDSNFESRRVPIFSNMVVK 240
+ ++SAKR R FL+E RD N ES R P F+NM K
Sbjct: 181 RQIEHPSSQISTQFNNTWTSPEEQQKMVSAKRVRPFLDEGHRDPNAES-RAPFFTNMATK 240
Query: 241 E----SSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMISKKSSGSYQLAENDSSNLMA 300
E SSSSSSSS++MNR+P +LDSN R TKR F G+LM K S YQLA + S+LMA
Sbjct: 241 ESSSSSSSSSSSSMDMNRSPFDLDSNSRGTKRGFGGELMRCSKRSERYQLA-GEVSHLMA 300
Query: 301 VAGSSSTPNQF-AAFDF---QETMEASEHRDGGGSASDYNKITFN--SSSLYESKSKSKS 328
+ SSS PN+ AAF+ QET EAS++RD GGSASDYNK+TFN SSSLYESK K
Sbjct: 301 LGSSSSAPNEVAAAFNIHHPQETTEASQYRD-GGSASDYNKVTFNSSSSSLYESKLKGNK 360
BLAST of Tan0021218 vs. ExPASy TrEMBL
Match:
A0A6J1GSS8 (proline-rich receptor-like protein kinase PERK2 OS=Cucurbita moschata OX=3662 GN=LOC111457104 PE=4 SV=1)
HSP 1 Score: 201.1 bits (510), Expect = 7.3e-48
Identity = 174/382 (45.55%), Positives = 204/382 (53.40%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKEQGGQDSETQFQDISPPPPPPPPSPPPPPFRFQP 60
M+KTP KRQR PRRGPGVAELEKILKEQ QDS TQF +SPPPPPP PPPP +F
Sbjct: 1 MKKTPAKRQRTPRRGPGVAELEKILKEQQAQDSNTQFPPMSPPPPPP---PPPPSHQF-- 60
Query: 61 HHQSPSTPSLNPPPPPSPPPFLAPRDYSSWSDLPSFPSLSLIPPALPTAADAAGKAVFPT 120
SPS PP P +P PFL PR++ S ++ F +L PP P
Sbjct: 61 --LSPS-----PPLPLTPIPFLPPREFPSRPNISPFTTLRPRPP--------------PP 120
Query: 121 TAKSDTNIPPHFFPNFQFSASSLNRQYYNSMM---------------------------- 180
+ + NIPP F PNFQFS SLNRQ+YN+ M
Sbjct: 121 KSDTQINIPPQFVPNFQFSPPSLNRQFYNNPMENFTPASASSSSCPPATPPPPNCFNQIE 180
Query: 181 ----------------------VISAKRARGFLEESQRDSNFESRRVPIFSNMVVKE--- 240
++S KR R FL+E+ D N N+ K+
Sbjct: 181 QPSSQKSPDLNHPWTTPEEQEQIVSGKRTRPFLDEAHWDPN--------LFNIARKKGCS 240
Query: 241 SSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLMISKKSSGSYQLAENDSSNLMAVAGSS 300
SSSSSSSS EMNR+ + LDSNL SGSYQL E DSS+L+A+ SS
Sbjct: 241 SSSSSSSSSEMNRSAMFLDSNL-----------------SGSYQLGE-DSSSLLALR-SS 300
Query: 301 STPNQFAAFDFQETMEASEHRDGGGSASDYNKITFNSSSLYESKSKSKSKSKSKGIIVAV 328
STPNQFA F FQETME S HRD GGSAS+YNKITF++S+ SK KSKSK K +I +
Sbjct: 301 STPNQFAPFHFQETMETSMHRD-GGSASNYNKITFDTSN--GSKMKSKSKGKEVNVIGSR 326
BLAST of Tan0021218 vs. ExPASy TrEMBL
Match:
A0A0A0LSF6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G077140 PE=4 SV=1)
HSP 1 Score: 184.9 bits (468), Expect = 5.4e-43
Identity = 175/412 (42.48%), Positives = 211/412 (51.21%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKEQGGQDSETQFQDISPP----------------- 60
MRKT KRQRIPRRGPGVAELEKILKEQ + T Q S P
Sbjct: 1 MRKTQAKRQRIPRRGPGVAELEKILKEQ-ESGAATDHQHTSSPHTNSTSTAATTTTHPLS 60
Query: 61 --PPPPPPSPPPPPFRFQPHHQSPSTPSLNPPPPPSPPPFLAPRDYSSWS-DLPSFPSLS 120
PP PPP PPPPP L PSPPP PRDY +WS +LP FP+L
Sbjct: 61 LNPPRPPPPPPPPP--------------LVQIMTPSPPPL--PRDYVAWSNNLPLFPTLE 120
Query: 121 LIPPA-LPTAADAAGKAVFPTT--AKSDTNIPPHFFPNFQFSASSLN-RQYYNSMM---- 180
IPP LPT K +FPTT ++S N+ P+F P+FQ+SASS N QYYN M+
Sbjct: 121 FIPPPYLPTV--VTEKPLFPTTRMSESQLNLAPYFLPSFQYSASSFNPDQYYNPMVNVNQ 180
Query: 181 --------------------------------------------VISAKRARGFLEESQR 240
+++AKR FLEES R
Sbjct: 181 GSGSSCPSATSSSAGRHFREIEHPSSQISTDFNNIWNSPEEEEKMVNAKRVIPFLEESHR 240
Query: 241 DSNFESRRVPIFSNMVV-------KESSSSSSSSLEMNRTPINLDSNLRSTKRDFAGQLM 300
+ + I M V K+SSSSSSSS+E N +P + SN R TKR GQ
Sbjct: 241 EEANNNNNNNIIEKMRVENNIMGTKDSSSSSSSSMETNCSPFHFHSNFRGTKRGLGGQSR 300
Query: 301 ISK-KSSGSYQLAENDSSNLMAV-AGSSSTPNQFAAFDF----QETMEASEHRDGGGSAS 328
+S K SG YQL + + SNLMA+ + SSS PN+ F+ QETME +HRD GG S
Sbjct: 301 MSNTKRSGRYQLGDQE-SNLMALGSSSSSAPNEIPTFNIFHLPQETMEVPQHRDEGGCPS 360
BLAST of Tan0021218 vs. ExPASy TrEMBL
Match:
A0A1S3B6J8 (uncharacterized protein DDB_G0271670-like OS=Cucumis melo OX=3656 GN=LOC103486731 PE=4 SV=1)
HSP 1 Score: 180.6 bits (457), Expect = 1.0e-41
Identity = 173/415 (41.69%), Positives = 211/415 (50.84%), Query Frame = 0
Query: 1 MRKTPVKRQRIPRRGPGVAELEKILKEQGGQDSETQFQDISP----------------PP 60
MRKT KRQRIPRRGPGVAELEKILKEQ + T Q ISP PP
Sbjct: 1 MRKTQAKRQRIPRRGPGVAELEKILKEQESGGASTD-QHISPHTNSTSTATTHPLSLNPP 60
Query: 61 PPPPPSPPPPPFRFQPHHQSPSTPSLNPPPPPSPPPFLAPRDYSSW-SDLPSFPSLSLIP 120
PPPP PPPP + P P+PP PRDY SW ++LP FP+L IP
Sbjct: 61 LPPPPPPPPPLVQIM-------------TPSPTPP----PRDYVSWTNNLPLFPTLEFIP 120
Query: 121 PA-LPTAADAAGKAVFPTTAKSDT---NIPPHFFPNFQFSASSLN-RQYYNSMM------ 180
P LPT AA K +FPTT S+ N+ P+F P FQFSASS N QYYN M+
Sbjct: 121 PPYLPTV--AAEKPLFPTTRISEASQLNLAPYFLPTFQFSASSCNPDQYYNPMVNINQGS 180
Query: 181 ------------------------------------------VISAKRARGFLEESQRDS 240
+++AKR FLEES R+
Sbjct: 181 GSSCPSATSSSAGRYLREIEHPSSQISTDFNNIWTSPEEQEKMVNAKRVIPFLEESHREE 240
Query: 241 NFESRRVPIFSNMVVK------------ESSSSSSSSLEMNRTPINLDSNLRSTKRDFAG 300
+ I M V+ SSSSSSSS+E+N +P + SN R TKR AG
Sbjct: 241 ANNNNNNNIIEKMRVENNTMGTKDSSSSSSSSSSSSSMEINCSPFHSHSNFRGTKRGSAG 300
Query: 301 QLMISK-KSSGSYQLAENDSSNLMAV-AGSSSTPNQFAAFDF----QETMEASEHRDGGG 328
QLM + K SG YQL + + SNLMA+ + SSS PN+ F+ +ETME +HRDGG
Sbjct: 301 QLMKNNTKRSGRYQLGDQE-SNLMALGSSSSSAPNEIPTFNIFHLPKETMEVPQHRDGGC 360
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAG7019594.1 | 2.0e-68 | 57.55 | hypothetical protein SDJN02_18557, partial [Cucurbita argyrosperma subsp. argyro...
XP_023001726.1 | 7.7e-68 | 52.17 | trinucleotide repeat-containing gene 18 protein-like [Cucurbita maxima] >XP_0230...
XP_022927210.1 | 3.0e-64 | 51.66 | nuclear envelope pore membrane protein POM 121-like [Cucurbita moschata] >XP_022...
KAG6583971.1 | 2.6e-63 | 51.54 | hypothetical protein SDJN03_19903, partial [Cucurbita argyrosperma subsp. sorori...
XP_038895159.1 | 8.3e-62 | 49.37 | uncharacterized protein DDB_G0271670-like [Benincasa hispida]
Match Name | E-value | Identity | Description
A0A6J1KHF7 | 3.7e-68 | 52.17 | trinucleotide repeat-containing gene 18 protein-like OS=Cucurbita maxima OX=3661...
A0A6J1EGJ1 | 1.5e-64 | 51.66 | nuclear envelope pore membrane protein POM 121-like OS=Cucurbita moschata OX=366...
A0A6J1GSS8 | 7.3e-48 | 45.55 | proline-rich receptor-like protein kinase PERK2 OS=Cucurbita moschata OX=3662 GN...
A0A0A0LSF6 | 5.4e-43 | 42.48 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G077140 PE=4 SV=1
A0A1S3B6J8 | 1.0e-41 | 41.69 | uncharacterized protein DDB_G0271670-like OS=Cucumis melo OX=3656 GN=LOC10348673...