Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAAGCTTGAAATTGTTCTTTGTTCCTTCTCCAAACCCTTTCCCCTTCTCTGCAACTCTCTTTCTCCTACTTTTTCCTTCGCCATTAAGTCTTCATCTTCTTCCATTTCCCTCCAATTCATCGAAGAACCAGAATAATCTCTTCCTCCCTCCTTCTTCAATGGCGGAATCAGACGTTCTCCCCCCAGGGCAAACTCAGCCTACTCCAAGTAAGTTCCATACCCATATCTTGTACAAAGTTTTAACTGCAATTTTCTTTCTGGTGATTCTCCCTCTAGTCCCCTCCCAAGCCCCTGAGTTCATCAATCAAACGCTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTCTTTAGCCGGAGAACTGACGAGATAGAGGATGAAATTACTGTCTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTACTTCATGTTTCGTCTGTTTTTGATGATGAGCCTGAAACTCCATCTGCTAATGATGAATCGCTGTCTTCGTCTGATGAAAATAAGGTCCAAACATGGGGTAGTCGGTATTTTAGGAATGAGTCTGTGGTTGTTGCTGAAGAACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAACCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTTGACGATGAGTCTAAAACTGTTTCTGGTTCTAAGCCCAGAGTGAGTTCGAGAAGAGTGTTGAGCATGCCGAAGGGTAGTTCGAATGGGGAATTGAATGAGAAGGTCGTTCTTCGATCCCCGGTTCCATGGCGATCGAGATCGGAGAGGATGGAAGTGCAAGAAGAAGCTGATAATCCTCCTGTGTATTCTCCTGCTGCTCCCATGGAGGAATCTGAATCGAATTGGATTGATTCTCGGTCGTCGAGGCCTCAAACTTCAAGGTCTTCTCGAACTAGTGTCATTACTCAGAAGCTATCTCCTTCTCCTTCTCCATCTCCAAGGAAATTATCTCCTTCTCCTACTGTGTCGCCTGAATTACAGGCCAAGAGTGCTGAGGATTTGGTGAGGAAGAAGAACTTTTACCGGTCTCCTCCACCTCCACCGCCTCCTCCACCCCCGCCAACTGTTCGAAGAATTTCCTCAATGAAACCTTGGTTGAATGACAATGACAATGATGTACCTCATCAAAAGGATTTGAGGCGAAGCCTTACTAGCAAACCTAGAAGCTCAATTCGTGATGCGGGAAATGGTACCGATACAATCATTGGTGCTAATTCAAGTGTTGAAGCTCTGCCTAGAAATTATGTTGATGGTCAATCAATGGGAAGATCTGTTAGAACAATCAGACCAGGGGAAGTTGTGAATGAGCCACCAAGAAGAGGGAGAGAATTTGGTGGAAATGATCCATTGAAGGGGAAGAAGATGGAACAGAATGCCCATGTCCAAGAATTTGAAGAAAACCCCATTGAGTTTCCAGATGAAGATAAAGAAGAACTGGTCGAAAAGCTAGCCATGGAAACCGATGACGACATGGAAAGCGAAGAAGAAGACAATGTTGTGGGACAGTTTATCCGGGAAGATAACGGAGAACCTTTCAATGTGAAAAGGAGAGACAACGAAAGAAGTTCGAGTAATGAAGAAGCAAGCTCTAACAATATGGCTAATGATGGAGGACCTGATGTAGATAAGAAAGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATTAATCAAGAAATCAAGTGGACAAATTGGTAGGAACACTTCAAGGCAAACTTGAAGTTTGAAAGATAGATGTATCTTGCCTTTCAGTAAGTTCATCAAAATTTGAACTTCAAATCATTCAACTGACATTTAGAGTCCTTTTCTAACACTTTTTTTTTCTTCTCTCCTCTTTTTTACATGATTTTCTTCTTTGATCATCAAGTTCAATTCCTGGAAAGTGTTTCTTCTTCTCCAAATTCACCCTCTGGATTTATGATAGATGTTGTAATTCAGTAGGCATCTGCCATTAGTGTTTGTTGATATACAGCTGAAACAACAAGGTTTATGAGACTTAGTTGTATGTCATGTGTTTGAGAGTTAGCTTTTTAATTTGTCTATTAAATAGATGGACTAGGTTGTTTCGCTTTAATTAACTATTTCTTTCTATATATTTGGATATAGAACTTTTAACTACAAAAA
mRNA sequence
CAAAGCTTGAAATTGTTCTTTGTTCCTTCTCCAAACCCTTTCCCCTTCTCTGCAACTCTCTTTCTCCTACTTTTTCCTTCGCCATTAAGTCTTCATCTTCTTCCATTTCCCTCCAATTCATCGAAGAACCAGAATAATCTCTTCCTCCCTCCTTCTTCAATGGCGGAATCAGACGTTCTCCCCCCAGGGCAAACTCAGCCTACTCCAAGTAAGTTCCATACCCATATCTTGTACAAAGTTTTAACTGCAATTTTCTTTCTGGTGATTCTCCCTCTAGTCCCCTCCCAAGCCCCTGAGTTCATCAATCAAACGCTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTCTTTAGCCGGAGAACTGACGAGATAGAGGATGAAATTACTGTCTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTACTTCATGTTTCGTCTGTTTTTGATGATGAGCCTGAAACTCCATCTGCTAATGATGAATCGCTGTCTTCGTCTGATGAAAATAAGGTCCAAACATGGGGTAGTCGGTATTTTAGGAATGAGTCTGTGGTTGTTGCTGAAGAACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAACCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTTGACGATGAGTCTAAAACTGTTTCTGGTTCTAAGCCCAGAGTGAGTTCGAGAAGAGTGTTGAGCATGCCGAAGGGTAGTTCGAATGGGGAATTGAATGAGAAGGTCGTTCTTCGATCCCCGGTTCCATGGCGATCGAGATCGGAGAGGATGGAAGTGCAAGAAGAAGCTGATAATCCTCCTGTGTATTCTCCTGCTGCTCCCATGGAGGAATCTGAATCGAATTGGATTGATTCTCGGTCGTCGAGGCCTCAAACTTCAAGGTCTTCTCGAACTAGTGTCATTACTCAGAAGCTATCTCCTTCTCCTTCTCCATCTCCAAGGAAATTATCTCCTTCTCCTACTGTGTCGCCTGAATTACAGGCCAAGAGTGCTGAGGATTTGGTGAGGAAGAAGAACTTTTACCGGTCTCCTCCACCTCCACCGCCTCCTCCACCCCCGCCAACTGTTCGAAGAATTTCCTCAATGAAACCTTGGTTGAATGACAATGACAATGATGTACCTCATCAAAAGGATTTGAGGCGAAGCCTTACTAGCAAACCTAGAAGCTCAATTCGTGATGCGGGAAATGGTACCGATACAATCATTGGTGCTAATTCAAGTGTTGAAGCTCTGCCTAGAAATTATGTTGATGGTCAATCAATGGGAAGATCTGTTAGAACAATCAGACCAGGGGAAGTTGTGAATGAGCCACCAAGAAGAGGGAGAGAATTTGGTGGAAATGATCCATTGAAGGGGAAGAAGATGGAACAGAATGCCCATGTCCAAGAATTTGAAGAAAACCCCATTGAGTTTCCAGATGAAGATAAAGAAGAACTGGTCGAAAAGCTAGCCATGGAAACCGATGACGACATGGAAAGCGAAGAAGAAGACAATGTTGTGGGACAGTTTATCCGGGAAGATAACGGAGAACCTTTCAATGTGAAAAGGAGAGACAACGAAAGAAGTTCGAGTAATGAAGAAGCAAGCTCTAACAATATGGCTAATGATGGAGGACCTGATGTAGATAAGAAAGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATTAATCAAGAAATCAAGTGGACAAATTGGTAGGAACACTTCAAGGCAAACTTGAAGTTTGAAAGATAGATGTATCTTGCCTTTCATTCAATTCCTGGAAAGTGTTTCTTCTTCTCCAAATTCACCCTCTGGATTTATGATAGATGTTGTAATTCAGTAGGCATCTGCCATTAGTGTTTGTTGATATACAGCTGAAACAACAAGGTTTATGAGACTTAGTTGTATGTCATGTGTTTGAGAGTTAGCTTTTTAATTTGTCTATTAAATAGATGGACTAGGTTGTTTCGCTTTAATTAACTATTTCTTTCTATATATTTGGATATAGAACTTTTAACTACAAAAA
Coding sequence (CDS)
ATGGCGGAATCAGACGTTCTCCCCCCAGGGCAAACTCAGCCTACTCCAAGTAAGTTCCATACCCATATCTTGTACAAAGTTTTAACTGCAATTTTCTTTCTGGTGATTCTCCCTCTAGTCCCCTCCCAAGCCCCTGAGTTCATCAATCAAACGCTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTCTTTAGCCGGAGAACTGACGAGATAGAGGATGAAATTACTGTCTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTACTTCATGTTTCGTCTGTTTTTGATGATGAGCCTGAAACTCCATCTGCTAATGATGAATCGCTGTCTTCGTCTGATGAAAATAAGGTCCAAACATGGGGTAGTCGGTATTTTAGGAATGAGTCTGTGGTTGTTGCTGAAGAACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAACCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTTGACGATGAGTCTAAAACTGTTTCTGGTTCTAAGCCCAGAGTGAGTTCGAGAAGAGTGTTGAGCATGCCGAAGGGTAGTTCGAATGGGGAATTGAATGAGAAGGTCGTTCTTCGATCCCCGGTTCCATGGCGATCGAGATCGGAGAGGATGGAAGTGCAAGAAGAAGCTGATAATCCTCCTGTGTATTCTCCTGCTGCTCCCATGGAGGAATCTGAATCGAATTGGATTGATTCTCGGTCGTCGAGGCCTCAAACTTCAAGGTCTTCTCGAACTAGTGTCATTACTCAGAAGCTATCTCCTTCTCCTTCTCCATCTCCAAGGAAATTATCTCCTTCTCCTACTGTGTCGCCTGAATTACAGGCCAAGAGTGCTGAGGATTTGGTGAGGAAGAAGAACTTTTACCGGTCTCCTCCACCTCCACCGCCTCCTCCACCCCCGCCAACTGTTCGAAGAATTTCCTCAATGAAACCTTGGTTGAATGACAATGACAATGATGTACCTCATCAAAAGGATTTGAGGCGAAGCCTTACTAGCAAACCTAGAAGCTCAATTCGTGATGCGGGAAATGGTACCGATACAATCATTGGTGCTAATTCAAGTGTTGAAGCTCTGCCTAGAAATTATGTTGATGGTCAATCAATGGGAAGATCTGTTAGAACAATCAGACCAGGGGAAGTTGTGAATGAGCCACCAAGAAGAGGGAGAGAATTTGGTGGAAATGATCCATTGAAGGGGAAGAAGATGGAACAGAATGCCCATGTCCAAGAATTTGAAGAAAACCCCATTGAGTTTCCAGATGAAGATAAAGAAGAACTGGTCGAAAAGCTAGCCATGGAAACCGATGACGACATGGAAAGCGAAGAAGAAGACAATGTTGTGGGACAGTTTATCCGGGAAGATAACGGAGAACCTTTCAATGTGAAAAGGAGAGACAACGAAAGAAGTTCGAGTAATGAAGAAGCAAGCTCTAACAATATGGCTAATGATGGAGGACCTGATGTAGATAAGAAAGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATTAATCAAGAAATCAAGTGGACAAATTGGTAGGAACACTTCAAGGCAAACTTGA
Protein sequence
MAESDVLPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEEDNVVGQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT
Homology
BLAST of Tan0020220 vs. NCBI nr
Match:
XP_023005363.1 (uncharacterized protein DDB_G0284459-like [Cucurbita maxima])
HSP 1 Score: 852.8 bits (2202), Expect = 1.6e-243
Identity = 476/556 (85.61%), Postives = 499/556 (89.75%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV L GQTQPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1 MAESDVHAKPSNLAAGQTQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+SS DE+KVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDESKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
VD+E KT+SGSK RVSSRR LSMP SSN ELNEK+VL SPVPWRSRSE EVQEEADN
Sbjct: 181 VDEEYKTISGSKRRVSSRRSLSMPMRSSNEELNEKIVLPSPVPWRSRSEWKEVQEEADNL 240
Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL SPSPSPRK SPSPTVSPE
Sbjct: 241 PLYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300
Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP WL +++DV HQKDLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQKDLRRSL 360
Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
SKPR SIRD G+ D ++ ANSS E LPRNYVDGQSMG+SVRTIRPGEVVNEPPRRGRE
Sbjct: 361 ISKPRRSIRDTGDEIDLMMDANSSAEVLPRNYVDGQSMGKSVRTIRPGEVVNEPPRRGRE 420
Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
FGG D LKG KMEQNAH QEFEENPIE+PDEDK +LVEKLAME DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNAHAQEFEENPIEYPDEDKADLVEKLAMEAGDDMENEEEEDDVVGQ 480
Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
FIREDNGEPFNVKRRD SSS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNESSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
Query: 541 LIKKSSGQIGRNTSRQ 547
LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 550
BLAST of Tan0020220 vs. NCBI nr
Match:
XP_023540912.1 (uncharacterized protein DDB_G0284459-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 847.0 bits (2187), Expect = 8.8e-242
Identity = 475/558 (85.13%), Postives = 496/558 (88.89%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV L GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1 MAESDVHAKPSNLAAGQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
VDDE KT+SGSK RVSSRR LSMP SSN ELNEK+VL SPVPWRSRSER EVQEEADN
Sbjct: 181 VDDEYKTISGSKRRVSSRRSLSMPMRSSNEELNEKIVLPSPVPWRSRSERKEVQEEADNL 240
Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300
Query: 301 LQAKSAEDLVRKKNFYRS--PPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRR 360
LQAKSAEDLVRKKNFY S PPPPPPPPPPPTVRRISSMKP WL +++DV HQ DLRR
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQNDLRR 360
Query: 361 SLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRG 420
SLT+KPR IRD G+ D ++ ANSS E LPRNYVDGQSMG+SVRTIRPGEVVNEPPRRG
Sbjct: 361 SLTTKPRRYIRDTGDEIDLMMDANSSAEVLPRNYVDGQSMGKSVRTIRPGEVVNEPPRRG 420
Query: 421 REFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVV 480
REFGG D LKG KMEQN H QEFEENPIEFPDEDKE LVEKLAME DDME+ EEED+VV
Sbjct: 421 REFGGTDQLKG-KMEQNPHAQEFEENPIEFPDEDKENLVEKLAMEAGDDMENEEEEDDVV 480
Query: 481 GQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQR 540
GQFIREDNGEPFNVKRRD +S EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQR
Sbjct: 481 GQFIREDNGEPFNVKRRDFNETSC-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQR 540
Query: 541 IELIKKSSGQIGRNTSRQ 547
IELIKKSSGQIGRNTSRQ
Sbjct: 541 IELIKKSSGQIGRNTSRQ 552
BLAST of Tan0020220 vs. NCBI nr
Match:
KAG7030146.1 (hypothetical protein SDJN02_08493, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 845.9 bits (2184), Expect = 2.0e-241
Identity = 472/556 (84.89%), Postives = 496/556 (89.21%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV L GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1 MAESDVHAKPSNLAAGQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
VDDE KT+SGSK R+SSRR LSMP SSN E+NEKVVL SPVPWRSRSER EVQEEA+N
Sbjct: 181 VDDEYKTISGSKRRMSSRRSLSMPMRSSNEEMNEKVVLPSPVPWRSRSERKEVQEEAENL 240
Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300
Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP WL +++DV HQ DLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQNDLRRSL 360
Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
T+KPR SIRD G+ D ++ ANSS E PRNYVDGQSMG+SVRTIRPGEV+NEPPRRGRE
Sbjct: 361 TTKPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRPGEVLNEPPRRGRE 420
Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
FGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDDMENEEEEDDVVGQ 480
Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
FIREDNGEPFNVKRRD +SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNETSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
Query: 541 LIKKSSGQIGRNTSRQ 547
LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 550
BLAST of Tan0020220 vs. NCBI nr
Match:
KAG6596871.1 (hypothetical protein SDJN03_10051, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 845.9 bits (2184), Expect = 2.0e-241
Identity = 472/556 (84.89%), Postives = 496/556 (89.21%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV L GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1 MAESDVHAKPLNLAAGQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
VDDE KT+SGSK R+SSRR LSMP SSN E+NEKVVL SPVPWRSRSER EVQEEA+N
Sbjct: 181 VDDEYKTISGSKRRMSSRRSLSMPMRSSNEEMNEKVVLPSPVPWRSRSERKEVQEEAENL 240
Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300
Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP WL +++DV HQ DLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQNDLRRSL 360
Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
T+KPR SIRD G+ D ++ ANSS E PRNYVDGQSMG+SVRTIRPGEV+NEPPRRGRE
Sbjct: 361 TTKPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRPGEVLNEPPRRGRE 420
Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
FGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDDMENEEEEDDVVGQ 480
Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
FIREDNGEPFNVKRRD +SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNETSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
Query: 541 LIKKSSGQIGRNTSRQ 547
LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 550
BLAST of Tan0020220 vs. NCBI nr
Match:
XP_022949423.1 (MAP7 domain-containing protein 1-like [Cucurbita moschata])
HSP 1 Score: 836.6 bits (2160), Expect = 1.2e-238
Identity = 471/556 (84.71%), Postives = 493/556 (88.67%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV L Q QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1 MAESDVHAKTSNLAAEQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
VDDE KT+SGSK RVSSRR LSMP SSN E+NEKVVL SPVPWRSRSER EVQEEA+N
Sbjct: 181 VDDEYKTISGSKRRVSSRRSLSMPMRSSNEEMNEKVVLPSPVPWRSRSERKEVQEEAENL 240
Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300
Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
LQAKSAEDLVRKKNFY S PPPPPPPPPTVRRISSMKP WL +D+DV HQ DLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSS--PPPPPPPPPTVRRISSMKPNSWL--HDSDVSHQNDLRRSL 360
Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
T+KPR SIRD G+ D ++ ANSS E PRNYVDGQSMG+SVRTIRPGEV+NEPPRRGRE
Sbjct: 361 TTKPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRPGEVLNEPPRRGRE 420
Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
FGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDDMENEEEEDDVVGQ 480
Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
FIREDNGEPFNVKRRD +SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNETSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
Query: 541 LIKKSSGQIGRNTSRQ 547
LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 548
BLAST of Tan0020220 vs. ExPASy TrEMBL
Match:
A0A6J1L1Z4 (uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC111498378 PE=4 SV=1)
HSP 1 Score: 852.8 bits (2202), Expect = 7.8e-244
Identity = 476/556 (85.61%), Postives = 499/556 (89.75%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV L GQTQPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1 MAESDVHAKPSNLAAGQTQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+SS DE+KVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDESKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
VD+E KT+SGSK RVSSRR LSMP SSN ELNEK+VL SPVPWRSRSE EVQEEADN
Sbjct: 181 VDEEYKTISGSKRRVSSRRSLSMPMRSSNEELNEKIVLPSPVPWRSRSEWKEVQEEADNL 240
Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL SPSPSPRK SPSPTVSPE
Sbjct: 241 PLYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300
Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP WL +++DV HQKDLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQKDLRRSL 360
Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
SKPR SIRD G+ D ++ ANSS E LPRNYVDGQSMG+SVRTIRPGEVVNEPPRRGRE
Sbjct: 361 ISKPRRSIRDTGDEIDLMMDANSSAEVLPRNYVDGQSMGKSVRTIRPGEVVNEPPRRGRE 420
Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
FGG D LKG KMEQNAH QEFEENPIE+PDEDK +LVEKLAME DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNAHAQEFEENPIEYPDEDKADLVEKLAMEAGDDMENEEEEDDVVGQ 480
Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
FIREDNGEPFNVKRRD SSS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNESSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
Query: 541 LIKKSSGQIGRNTSRQ 547
LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 550
BLAST of Tan0020220 vs. ExPASy TrEMBL
Match:
A0A6J1GC23 (MAP7 domain-containing protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111452771 PE=4 SV=1)
HSP 1 Score: 836.6 bits (2160), Expect = 5.8e-239
Identity = 471/556 (84.71%), Postives = 493/556 (88.67%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV L Q QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1 MAESDVHAKTSNLAAEQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
VDDE KT+SGSK RVSSRR LSMP SSN E+NEKVVL SPVPWRSRSER EVQEEA+N
Sbjct: 181 VDDEYKTISGSKRRVSSRRSLSMPMRSSNEEMNEKVVLPSPVPWRSRSERKEVQEEAENL 240
Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300
Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
LQAKSAEDLVRKKNFY S PPPPPPPPPTVRRISSMKP WL +D+DV HQ DLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSS--PPPPPPPPPTVRRISSMKPNSWL--HDSDVSHQNDLRRSL 360
Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
T+KPR SIRD G+ D ++ ANSS E PRNYVDGQSMG+SVRTIRPGEV+NEPPRRGRE
Sbjct: 361 TTKPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRPGEVLNEPPRRGRE 420
Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
FGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDDMENEEEEDDVVGQ 480
Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
FIREDNGEPFNVKRRD +SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNETSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
Query: 541 LIKKSSGQIGRNTSRQ 547
LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 548
BLAST of Tan0020220 vs. ExPASy TrEMBL
Match:
A0A6J1L1K4 (uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC111500278 PE=4 SV=1)
HSP 1 Score: 796.2 bits (2055), Expect = 8.6e-227
Identity = 447/571 (78.28%), Postives = 486/571 (85.11%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV LPPG+ Q TPSKF++HILYK+L AIFFLVILPLVPSQAPEF+NQTLLT
Sbjct: 1 MAESDVPAKPPNLPPGKDQATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
R+WELLHLLFVGIAVSYGLFSRR DE ED I+VS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61 RTWELLHLLFVGIAVSYGLFSRRNDEKEDGISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+SSSD NKVQTW +RYFRNES+VVAEE PVVNEQRVRSEKPLLLPVRSL S+VV
Sbjct: 121 SANDESMSSSDGNKVQTWSNRYFRNESLVVAEESPVVNEQRVRSEKPLLLPVRSLNSQVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGE------------LNEKVVLRSPVPWRSRS 240
VDDES+TVSGS RVSS R+LS K SSNGE LNE VVL SPVPWRSRS
Sbjct: 181 VDDESRTVSGSTSRVSSGRLLSNSKRSSNGEFGGLSLEGIEDNLNENVVLPSPVPWRSRS 240
Query: 241 ERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLS-PSPSPSP 300
R EVQEEADNPPVYSPA PMEESESNWIDSRSSRPQTSRS + S I KLS PSPSP P
Sbjct: 241 GRTEVQEEADNPPVYSPAVPMEESESNWIDSRSSRPQTSRSFQASAI--KLSPPSPSPFP 300
Query: 301 RKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDV 360
RK SPSP VSPEL+AKS+ED VRKK+F+ SPPPPPPPPPPP VRRI+SMKP NDNDV
Sbjct: 301 RKPSPSPNVSPELKAKSSEDSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSLLNDNDV 360
Query: 361 PHQKDLRRSL-TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGE 420
PHQKDL+RS+ TSKPR SIRD G+ D ++G NSS EALPRNY D SMG+S+R IRPGE
Sbjct: 361 PHQKDLKRSVTTSKPRRSIRDTGDDIDMVMGTNSSAEALPRNYDDILSMGKSIRKIRPGE 420
Query: 421 VVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMET--DDD 480
V NEP RRGREFGGND LKGK ++QN HVQ FEENPIEFPD+DK+E VEKL MET DDD
Sbjct: 421 VANEPTRRGREFGGNDQLKGKMIDQNTHVQAFEENPIEFPDDDKKEPVEKLGMETDDDDD 480
Query: 481 MESEEED-NVVGQFIREDNGEPFNVKRRDNERSSSNEEA-SSNNMANDGGPDVDKKADEF 540
MESEEED N+VG+FIREDNGEPFNV RRDNERSSSNEEA S+N++NDGGPDVDKKADEF
Sbjct: 481 MESEEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNEEAGGSSNLSNDGGPDVDKKADEF 540
Query: 541 IAKFREQIRLQRIELIKKSSGQIGRNTSRQT 548
IAKFREQIRLQRIE IK+S+GQI RNTS+Q+
Sbjct: 541 IAKFREQIRLQRIESIKRSTGQIRRNTSKQS 569
BLAST of Tan0020220 vs. ExPASy TrEMBL
Match:
A0A6J1E6G0 (uncharacterized protein DDB_G0284459 OS=Cucurbita moschata OX=3662 GN=LOC111431041 PE=4 SV=1)
HSP 1 Score: 795.4 bits (2053), Expect = 1.5e-226
Identity = 444/574 (77.35%), Postives = 487/574 (84.84%), Query Frame = 0
Query: 1 MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
MAESDV LPP + + TPSKF++HILYK+L AIFFLVILPLVPSQAPEF+NQTLLT
Sbjct: 1 MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
R+WELLHLLFVGIAVSYGLFSRR DE EDEI+VS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61 RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
SANDES+S SD NKVQTW +RYFRNESV V+EE PVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGEL------------NEKVVLRSPVPWRSRS 240
VDDES+TVSGS RVSSRR+LS K SSNGE+ NE V L SPVPWRSRS
Sbjct: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRS 240
Query: 241 ERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI---TQKLSPSPSP 300
R EVQEEADNPP+YSPA PMEESESNWIDSRSSRPQTSRSS+ S I SPSPSP
Sbjct: 241 GRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSP 300
Query: 301 SPRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDN 360
SPRK SPSP VSPEL+AKS+E VRKK+F+ SPPPPPPPPPPP VRRI+SMKP WL N
Sbjct: 301 SPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWL--N 360
Query: 361 DNDVPHQKDLRRSL-TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTI 420
DNDVPHQKDL+RS+ TSKPRSSIR G+ D ++G NSS EALPRNY D SMG+S R I
Sbjct: 361 DNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKI 420
Query: 421 RPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETD 480
RPGEV NEPPRRGREFGG D LKGK ++QNAHVQ FEENPIEFP+++K+ELVEKL+METD
Sbjct: 421 RPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMETD 480
Query: 481 DDMESEEED-NVVGQFIREDNGEPFNVKRRDNERSSSN--EEASSNNMANDGGPDVDKKA 540
DDMES+EED N+VG+FIREDNGEPFNV RRDNERSSSN E SS+N++NDGGPDVDKKA
Sbjct: 481 DDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKA 540
Query: 541 DEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT 548
DEFIAKFREQIRLQRIE IK+S+GQI RNTS+QT
Sbjct: 541 DEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 572
BLAST of Tan0020220 vs. ExPASy TrEMBL
Match:
A0A6J1DZG8 (WW domain-binding protein 11 OS=Momordica charantia OX=3673 GN=LOC111024943 PE=4 SV=1)
HSP 1 Score: 790.4 bits (2040), Expect = 4.7e-225
Identity = 443/577 (76.78%), Postives = 483/577 (83.71%), Query Frame = 0
Query: 1 MAESDV--------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTL 60
MAE+++ L Q + PSKFH+H+LYKVLTAIFFLVILPLVPS+APEFINQTL
Sbjct: 1 MAETEIRAKSPTLALRQSQAEACPSKFHSHVLYKVLTAIFFLVILPLVPSRAPEFINQTL 60
Query: 61 LTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPE 120
LTRSWELLHLLFVGIAVSYGLFSRR +E E+E++ SKFDNVQSYVSGLLHVSSVFDDEPE
Sbjct: 61 LTRSWELLHLLFVGIAVSYGLFSRRNEEKENEVSGSKFDNVQSYVSGLLHVSSVFDDEPE 120
Query: 121 TPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSR 180
TPSANDESLSSSDE+KVQTW SRYFRNESVVVAEERP VNEQRVRSEKPLLLPVRSLKSR
Sbjct: 121 TPSANDESLSSSDESKVQTWSSRYFRNESVVVAEERPAVNEQRVRSEKPLLLPVRSLKSR 180
Query: 181 VVVD----DESKTVSGSKPRVSSRRVLSMPKGSSNGE------------LNEKVVLRSPV 240
VV D DES+ VSGSKPR SSRR+LS K S+ GE LNE VVLRSPV
Sbjct: 181 VVADDDLLDESRAVSGSKPRASSRRLLSKSKRSTEGEFGGVNLEEMEDKLNENVVLRSPV 240
Query: 241 PWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPS 300
PWRSRS RME+QEEADNPP+YSP A MEESESNWIDSRSSRPQTSRS+R + I QKLSPS
Sbjct: 241 PWRSRSGRMEMQEEADNPPMYSPVAAMEESESNWIDSRSSRPQTSRSTRANAIGQKLSPS 300
Query: 301 PSPS--PRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMK--P 360
PSPS P+K SP PTVSPELQ K AED VRKK+FYRSPPPPPPPPPPP VRRISSMK
Sbjct: 301 PSPSPTPKKPSPPPTVSPELQGKGAEDFVRKKSFYRSPPPPPPPPPPPRVRRISSMKQSS 360
Query: 361 WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSS-VEALPRNYVDGQSMGR 420
WL NDNDVPHQKDLRRS TSKPRSSIRD G+ D ++G NSS V PRNYVD QSMG+
Sbjct: 361 WL--NDNDVPHQKDLRRSFTSKPRSSIRDTGDDIDMMVGPNSSVVNEPPRNYVDSQSMGK 420
Query: 421 SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKL 480
SVRTIRPGE+VNEPPRRGRE GGN+ LKG+ QN HVQ+FEENPIEFPDE+KEELVEKL
Sbjct: 421 SVRTIRPGELVNEPPRRGRELGGNE-LKGRMDHQNVHVQDFEENPIEFPDEEKEELVEKL 480
Query: 481 AMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVD 540
METDDDME+ EEED + +FIR+ NG + R+DNERSSSNEEA S++MA DGGPDVD
Sbjct: 481 DMETDDDMETEEEEDTMATEFIRDKNGGTYTETRKDNERSSSNEEAGSSSMAGDGGPDVD 540
Query: 541 KKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT 548
KKADEFIAKFREQIRLQRIE IK+SSGQI RN+SRQT
Sbjct: 541 KKADEFIAKFREQIRLQRIESIKRSSGQIRRNSSRQT 574
BLAST of Tan0020220 vs. TAIR 10
Match:
AT4G16790.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 159.8 bits (403), Expect = 6.0e-39
Identity = 184/548 (33.58%), Postives = 246/548 (44.89%), Query Frame = 0
Query: 16 PSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFS 75
P KF++ ++K L ++P+ SQ PE NQ TR ELLHL+FVGIAVSYGLFS
Sbjct: 21 PRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQ---TRLLELLHLVFVGIAVSYGLFS 80
Query: 76 RR---------TDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETPSANDESLSSSDE 135
RR T + +N SYV +L VSSVF+ E+ S + SS D+
Sbjct: 81 RRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEVSSVFNVGHESESEPSDD-SSGDQ 140
Query: 136 NKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLK-SRVVVDDESKTVSGS 195
K QTW ++Y + + E R V EKPLLLPVRSL SR V D S SG
Sbjct: 141 RKFQTWKNKY--HMKIPEVETRFVDRVSSENREKPLLLPVRSLNYSR--VSDSSGDNSGR 200
Query: 196 KPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEE 255
+V S+R L G N + VL SP+PWRSRS +
Sbjct: 201 WEKVRSKRELLKTLGDDNSD-----VLPSPIPWRSRS------------------SSSSS 260
Query: 256 SESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSAEDLVRK 315
S S ++S S + +I PS SPRK +P P ++ E
Sbjct: 261 SSSKEVESLPSVKNLTTVESQPLIKNLTPPSSFSSPRKSNPIPNLASE------------ 320
Query: 316 KNFYRSPPPPPPPPPP-PTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGN 375
F+ SPPPPPPPPPP P SS K D+ ++ + R S K +
Sbjct: 321 --FHPSPPPPPPPPPPLPAFYNSSSRK------DHPGIYRVERRESSVHKTK-------- 380
Query: 376 GTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKME 435
G P P E PP + R +KM+
Sbjct: 381 ----FAGGEFHPPPPP-------------PPPPPVEYYKSPPTKFRLSNERRKSSEQKMK 440
Query: 436 QNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEEDNVVGQFIRE-DNGEPFNVK 495
+NA + + +PI E KE+ EK +++ N+ + + E +NGE +
Sbjct: 441 RNAPKKVWWSDPIV---ESKEQDTEK----------NDQRSNLGSKAVEESENGEQ---R 473
Query: 496 RRDNERSSSNEEASSNNMANDG------GPDVDKKADEFIAKFREQIRLQRIELIKKSSG 546
R +NE ++E + +G G DVDKKADEFIAKFREQIRLQRIE IK+S+
Sbjct: 501 RGENE---IHDEVEKKIVEEEGVSEINNGSDVDKKADEFIAKFREQIRLQRIESIKRSTN 473
BLAST of Tan0020220 vs. TAIR 10
Match:
AT3G60380.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins in 424 species: Archae - 6; Bacteria - 372; Metazoa - 2603; Fungi - 655; Plants - 291; Viruses - 28; Other Eukaryotes - 2147 (source: NCBI BLink). )
HSP 1 Score: 99.8 bits (247), Expect = 7.4e-21
Identity = 155/542 (28.60%), Postives = 235/542 (43.36%), Query Frame = 0
Query: 33 FLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDN 92
FL+ LPL PSQAP+F+ +T+LT+ WEL+HLLFVGIAV+YGLFSRR E ++ +++ D
Sbjct: 42 FLLALPLFPSQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESAVDLRMTRVDE 101
Query: 93 VQ-SYVSGLLHVSSVFDDEPETPSA------NDESLSS---------------------- 152
SYVS + VSSVFD+E + S +DES+S+
Sbjct: 102 SSLSYVSRIFQVSSVFDEEFDDNSCEFVDVRSDESVSARASVVGKSESFVVESGELEESS 161
Query: 153 --SDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKS--RVVVDDES 212
+ N+V+ W S+YF+ +S VV RP +PL LP+R L+S R +
Sbjct: 162 EFGETNEVRAWNSQYFQGKSKVVV-ARPAYGLDGHVVHQPLGLPIRRLRSSLRDNAALQD 221
Query: 213 KTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSP 272
K+ + S + S+ + E+ SPVPW++R E M + DN P S
Sbjct: 222 KSFADSCDGAVNAEAESLLADNFFDEV--LAAPASPVPWQARPEMMGI---GDNYP--SN 281
Query: 273 AAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSA 332
P+ E+ + S SSR S SS+TS +Q + + SPS +VS E +
Sbjct: 282 FQPISVDET--LKSISSRSTGSSSSQTSYASQ--------NQNRFSPSRSVSAESLNSNV 341
Query: 333 EDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSI 392
E+LV++K+ S P PP S P L ND +L T + S
Sbjct: 342 EELVKEKSRQSSSRSSSPSLPPSPSLSPSPPSPELVPNDTR-RRSPELVTDDTPRRASHS 401
Query: 393 RDAGNGT----DTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE---- 452
R +G+ D G + +E + + R + + E RRG +
Sbjct: 402 RHYSDGSLLEEDVRRGFENELEGSKVRGRKAEFFSKKERGSKSLNLAAESSRRGNKSRRS 461
Query: 453 ---------FGGND--PLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDME 512
GG D + + ++Q ++ EEN + + D L K + D +E
Sbjct: 462 YPPESISSPVGGADDSTTRRQDLQQKSNCHLLEENIRKGVEADHNNLRVKKG-RSHDSLE 521
Query: 513 SEEEDNVVGQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKF 523
ED+ + + E V + N ++S SS GG D + D K
Sbjct: 522 LTAEDSAKDEKVSESFPALDVVFQPTNAKASRRAMRSSR-----GGRDTLPEKDVVTRKL 558
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023005363.1 | 1.6e-243 | 85.61 | uncharacterized protein DDB_G0284459-like [Cucurbita maxima] | [more] |
XP_023540912.1 | 8.8e-242 | 85.13 | uncharacterized protein DDB_G0284459-like [Cucurbita pepo subsp. pepo] | [more] |
KAG7030146.1 | 2.0e-241 | 84.89 | hypothetical protein SDJN02_08493, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
KAG6596871.1 | 2.0e-241 | 84.89 | hypothetical protein SDJN03_10051, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022949423.1 | 1.2e-238 | 84.71 | MAP7 domain-containing protein 1-like [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1L1Z4 | 7.8e-244 | 85.61 | uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC1114... | [more] |
A0A6J1GC23 | 5.8e-239 | 84.71 | MAP7 domain-containing protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111452... | [more] |
A0A6J1L1K4 | 8.6e-227 | 78.28 | uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC1115... | [more] |
A0A6J1E6G0 | 1.5e-226 | 77.35 | uncharacterized protein DDB_G0284459 OS=Cucurbita moschata OX=3662 GN=LOC1114310... | [more] |
A0A6J1DZG8 | 4.7e-225 | 76.78 | WW domain-binding protein 11 OS=Momordica charantia OX=3673 GN=LOC111024943 PE=4... | [more] |
Match Name | E-value | Identity | Description | |
AT4G16790.1 | 6.0e-39 | 33.58 | hydroxyproline-rich glycoprotein family protein | [more] |
AT3G60380.1 | 7.4e-21 | 28.60 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |