Tan0020220 (gene) Snake gourd v1

Overview
Name: Tan0020220
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: MAP7 domain-containing protein 1-like
Location: LG01: 23660946 .. 23663152 (-)
RNA-Seq Expression: Tan0020220
Synteny: Tan0020220
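
The Location field gives a linkage group, a start and end coordinate, and a strand. A minimal Python sketch of reading that field and deriving the gene span follows. It assumes the usual GFF-style convention of 1-based, inclusive coordinates; the function name `parse_location` and the regular expression are illustrative and not part of any database tooling.

```python
import re

def parse_location(text):
    """Split a 'LG01: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("LG01: 23660946 .. 23663152 (-)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 2207 bp on the minus strand of LG01.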
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CAAAGCTTGAAATTGTTCTTTGTTCCTTCTCCAAACCCTTTCCCCTTCTCTGCAACTCTCTTTCTCCTACTTTTTCCTTCGCCATTAAGTCTTCATCTTCTTCCATTTCCCTCCAATTCATCGAAGAACCAGAATAATCTCTTCCTCCCTCCTTCTTCAATGGCGGAATCAGACGTTCTCCCCCCAGGGCAAACTCAGCCTACTCCAAGTAAGTTCCATACCCATATCTTGTACAAAGTTTTAACTGCAATTTTCTTTCTGGTGATTCTCCCTCTAGTCCCCTCCCAAGCCCCTGAGTTCATCAATCAAACGCTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTCTTTAGCCGGAGAACTGACGAGATAGAGGATGAAATTACTGTCTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTACTTCATGTTTCGTCTGTTTTTGATGATGAGCCTGAAACTCCATCTGCTAATGATGAATCGCTGTCTTCGTCTGATGAAAATAAGGTCCAAACATGGGGTAGTCGGTATTTTAGGAATGAGTCTGTGGTTGTTGCTGAAGAACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAACCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTTGACGATGAGTCTAAAACTGTTTCTGGTTCTAAGCCCAGAGTGAGTTCGAGAAGAGTGTTGAGCATGCCGAAGGGTAGTTCGAATGGGGAATTGAATGAGAAGGTCGTTCTTCGATCCCCGGTTCCATGGCGATCGAGATCGGAGAGGATGGAAGTGCAAGAAGAAGCTGATAATCCTCCTGTGTATTCTCCTGCTGCTCCCATGGAGGAATCTGAATCGAATTGGATTGATTCTCGGTCGTCGAGGCCTCAAACTTCAAGGTCTTCTCGAACTAGTGTCATTACTCAGAAGCTATCTCCTTCTCCTTCTCCATCTCCAAGGAAATTATCTCCTTCTCCTACTGTGTCGCCTGAATTACAGGCCAAGAGTGCTGAGGATTTGGTGAGGAAGAAGAACTTTTACCGGTCTCCTCCACCTCCACCGCCTCCTCCACCCCCGCCAACTGTTCGAAGAATTTCCTCAATGAAACCTTGGTTGAATGACAATGACAATGATGTACCTCATCAAAAGGATTTGAGGCGAAGCCTTACTAGCAAACCTAGAAGCTCAATTCGTGATGCGGGAAATGGTACCGATACAATCATTGGTGCTAATTCAAGTGTTGAAGCTCTGCCTAGAAATTATGTTGATGGTCAATCAATGGGAAGATCTGTTAGAACAATCAGACCAGGGGAAGTTGTGAATGAGCCACCAAGAAGAGGGAGAGAATTTGGTGGAAATGATCCATTGAAGGGGAAGAAGATGGAACAGAATGCCCATGTCCAAGAATTTGAAGAAAACCCCATTGAGTTTCCAGATGAAGATAAAGAAGAACTGGTCGAAAAGCTAGCCATGGAAACCGATGACGACATGGAAAGCGAAGAAGAAGACAATGTTGTGGGACAGTTTATCCGGGAAGATAACGGAGAACCTTTCAATGTGAAAAGGAGAGACAACGAAAGAAGTTCGAGTAATGAAGAAGCAAGCTCTAACAATATGGCTAATGATGGAGGACCTGATGTAGATAAGAAAGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATTAATCAAGAAATCAAGTGGACAAATTGGTAGGAACACTTCAAGGCAAACTTGAAGTTTGAAAGATAGATGTATCTTGCCTTTCAGTAAGTTCATCAAAATTTGAACTTCAAATCATTCAACTGACATTTAGAGTCCTTTTCTAACACTTTTTTTTTCTTCTCTCCTCTTTTTTACATGATTTTCTTCTTTGATCATCAAGTTCAATTCCTGGAAAGTGTTTCTTCTTCTCCAAATTCACCCTCTGGATTT
ATGATAGATGTTGTAATTCAGTAGGCATCTGCCATTAGTGTTTGTTGATATACAGCTGAAACAACAAGGTTTATGAGACTTAGTTGTATGTCATGTGTTTGAGAGTTAGCTTTTTAATTTGTCTATTAAATAGATGGACTAGGTTGTTTCGCTTTAATTAACTATTTCTTTCTATATATTTGGATATAGAACTTTTAACTACAAAAA

mRNA sequence

CAAAGCTTGAAATTGTTCTTTGTTCCTTCTCCAAACCCTTTCCCCTTCTCTGCAACTCTCTTTCTCCTACTTTTTCCTTCGCCATTAAGTCTTCATCTTCTTCCATTTCCCTCCAATTCATCGAAGAACCAGAATAATCTCTTCCTCCCTCCTTCTTCAATGGCGGAATCAGACGTTCTCCCCCCAGGGCAAACTCAGCCTACTCCAAGTAAGTTCCATACCCATATCTTGTACAAAGTTTTAACTGCAATTTTCTTTCTGGTGATTCTCCCTCTAGTCCCCTCCCAAGCCCCTGAGTTCATCAATCAAACGCTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTCTTTAGCCGGAGAACTGACGAGATAGAGGATGAAATTACTGTCTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTACTTCATGTTTCGTCTGTTTTTGATGATGAGCCTGAAACTCCATCTGCTAATGATGAATCGCTGTCTTCGTCTGATGAAAATAAGGTCCAAACATGGGGTAGTCGGTATTTTAGGAATGAGTCTGTGGTTGTTGCTGAAGAACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAACCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTTGACGATGAGTCTAAAACTGTTTCTGGTTCTAAGCCCAGAGTGAGTTCGAGAAGAGTGTTGAGCATGCCGAAGGGTAGTTCGAATGGGGAATTGAATGAGAAGGTCGTTCTTCGATCCCCGGTTCCATGGCGATCGAGATCGGAGAGGATGGAAGTGCAAGAAGAAGCTGATAATCCTCCTGTGTATTCTCCTGCTGCTCCCATGGAGGAATCTGAATCGAATTGGATTGATTCTCGGTCGTCGAGGCCTCAAACTTCAAGGTCTTCTCGAACTAGTGTCATTACTCAGAAGCTATCTCCTTCTCCTTCTCCATCTCCAAGGAAATTATCTCCTTCTCCTACTGTGTCGCCTGAATTACAGGCCAAGAGTGCTGAGGATTTGGTGAGGAAGAAGAACTTTTACCGGTCTCCTCCACCTCCACCGCCTCCTCCACCCCCGCCAACTGTTCGAAGAATTTCCTCAATGAAACCTTGGTTGAATGACAATGACAATGATGTACCTCATCAAAAGGATTTGAGGCGAAGCCTTACTAGCAAACCTAGAAGCTCAATTCGTGATGCGGGAAATGGTACCGATACAATCATTGGTGCTAATTCAAGTGTTGAAGCTCTGCCTAGAAATTATGTTGATGGTCAATCAATGGGAAGATCTGTTAGAACAATCAGACCAGGGGAAGTTGTGAATGAGCCACCAAGAAGAGGGAGAGAATTTGGTGGAAATGATCCATTGAAGGGGAAGAAGATGGAACAGAATGCCCATGTCCAAGAATTTGAAGAAAACCCCATTGAGTTTCCAGATGAAGATAAAGAAGAACTGGTCGAAAAGCTAGCCATGGAAACCGATGACGACATGGAAAGCGAAGAAGAAGACAATGTTGTGGGACAGTTTATCCGGGAAGATAACGGAGAACCTTTCAATGTGAAAAGGAGAGACAACGAAAGAAGTTCGAGTAATGAAGAAGCAAGCTCTAACAATATGGCTAATGATGGAGGACCTGATGTAGATAAGAAAGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATTAATCAAGAAATCAAGTGGACAAATTGGTAGGAACACTTCAAGGCAAACTTGAAGTTTGAAAGATAGATGTATCTTGCCTTTCATTCAATTCCTGGAAAGTGTTTCTTCTTCTCCAAATTCACCCTCTGGATTTATGATAGATGTTGTAATTCAGTAGGCATCTGCCATTAGTGTTTGTTGATATACAGCTGAAACAACAAGGTTTATGAGACTTAGTTGTATGTCATGTGTTTGAGAGTTAGCTTTTTA
ATTTGTCTATTAAATAGATGGACTAGGTTGTTTCGCTTTAATTAACTATTTCTTTCTATATATTTGGATATAGAACTTTTAACTACAAAAA

Coding sequence (CDS)

ATGGCGGAATCAGACGTTCTCCCCCCAGGGCAAACTCAGCCTACTCCAAGTAAGTTCCATACCCATATCTTGTACAAAGTTTTAACTGCAATTTTCTTTCTGGTGATTCTCCCTCTAGTCCCCTCCCAAGCCCCTGAGTTCATCAATCAAACGCTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTACGGCCTCTTTAGCCGGAGAACTGACGAGATAGAGGATGAAATTACTGTCTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGATTACTTCATGTTTCGTCTGTTTTTGATGATGAGCCTGAAACTCCATCTGCTAATGATGAATCGCTGTCTTCGTCTGATGAAAATAAGGTCCAAACATGGGGTAGTCGGTATTTTAGGAATGAGTCTGTGGTTGTTGCTGAAGAACGTCCTGTTGTTAATGAGCAGAGAGTTAGGAGTGAGAAACCTCTGCTTCTTCCTGTTCGTAGCTTGAAGTCTCGTGTTGTTGTTGACGATGAGTCTAAAACTGTTTCTGGTTCTAAGCCCAGAGTGAGTTCGAGAAGAGTGTTGAGCATGCCGAAGGGTAGTTCGAATGGGGAATTGAATGAGAAGGTCGTTCTTCGATCCCCGGTTCCATGGCGATCGAGATCGGAGAGGATGGAAGTGCAAGAAGAAGCTGATAATCCTCCTGTGTATTCTCCTGCTGCTCCCATGGAGGAATCTGAATCGAATTGGATTGATTCTCGGTCGTCGAGGCCTCAAACTTCAAGGTCTTCTCGAACTAGTGTCATTACTCAGAAGCTATCTCCTTCTCCTTCTCCATCTCCAAGGAAATTATCTCCTTCTCCTACTGTGTCGCCTGAATTACAGGCCAAGAGTGCTGAGGATTTGGTGAGGAAGAAGAACTTTTACCGGTCTCCTCCACCTCCACCGCCTCCTCCACCCCCGCCAACTGTTCGAAGAATTTCCTCAATGAAACCTTGGTTGAATGACAATGACAATGATGTACCTCATCAAAAGGATTTGAGGCGAAGCCTTACTAGCAAACCTAGAAGCTCAATTCGTGATGCGGGAAATGGTACCGATACAATCATTGGTGCTAATTCAAGTGTTGAAGCTCTGCCTAGAAATTATGTTGATGGTCAATCAATGGGAAGATCTGTTAGAACAATCAGACCAGGGGAAGTTGTGAATGAGCCACCAAGAAGAGGGAGAGAATTTGGTGGAAATGATCCATTGAAGGGGAAGAAGATGGAACAGAATGCCCATGTCCAAGAATTTGAAGAAAACCCCATTGAGTTTCCAGATGAAGATAAAGAAGAACTGGTCGAAAAGCTAGCCATGGAAACCGATGACGACATGGAAAGCGAAGAAGAAGACAATGTTGTGGGACAGTTTATCCGGGAAGATAACGGAGAACCTTTCAATGTGAAAAGGAGAGACAACGAAAGAAGTTCGAGTAATGAAGAAGCAAGCTCTAACAATATGGCTAATGATGGAGGACCTGATGTAGATAAGAAAGCTGATGAGTTCATTGCCAAATTCAGAGAGCAAATCAGGCTTCAAAGGATTGAATTAATCAAGAAATCAAGTGGACAAATTGGTAGGAACACTTCAAGGCAAACTTGA

Protein sequence

MAESDVLPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVVVDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEEDNVVGQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT
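
The protein sequence is the conceptual translation of the CDS. As a rough check, the sketch below translates the first ten codons and the last four codons of the CDS listed above. It assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS shown above.
print(translate("ATGGCGGAATCAGACGTTCTCCCCCCAGGG"))
print(translate("AGGCAAACTTGA"))
```

The two fragments translate to the start (MAESDVLPPG) and the end (RQT) of the protein sequence. Passing the full CDS string to `translate` would reproduce the whole protein in the same way.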
Homology
BLAST of Tan0020220 vs. NCBI nr
Match: XP_023005363.1 (uncharacterized protein DDB_G0284459-like [Cucurbita maxima])

HSP 1 Score: 852.8 bits (2202), Expect = 1.6e-243
Identity = 476/556 (85.61%), Positives = 499/556 (89.75%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      L  GQTQPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1   MAESDVHAKPSNLAAGQTQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+SS DE+KVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDESKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
           VD+E KT+SGSK RVSSRR LSMP  SSN ELNEK+VL SPVPWRSRSE  EVQEEADN 
Sbjct: 181 VDEEYKTISGSKRRVSSRRSLSMPMRSSNEELNEKIVLPSPVPWRSRSEWKEVQEEADNL 240

Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
           P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Sbjct: 241 PLYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300

Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
           LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP  WL  +++DV HQKDLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQKDLRRSL 360

Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
            SKPR SIRD G+  D ++ ANSS E LPRNYVDGQSMG+SVRTIRPGEVVNEPPRRGRE
Sbjct: 361 ISKPRRSIRDTGDEIDLMMDANSSAEVLPRNYVDGQSMGKSVRTIRPGEVVNEPPRRGRE 420

Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
           FGG D LKG KMEQNAH QEFEENPIE+PDEDK +LVEKLAME  DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNAHAQEFEENPIEYPDEDKADLVEKLAMEAGDDMENEEEEDDVVGQ 480

Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
           FIREDNGEPFNVKRRD   SSS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNESSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540

Query: 541 LIKKSSGQIGRNTSRQ 547
           LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 550
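
The percentages on each HSP statistics line are the match counts divided by the number of aligned columns. The sketch below parses such a line and recomputes the percentages. The regular expression and the function name `parse_hsp_stats` are assumptions made for illustration; the pattern captures whatever label precedes each `n/m (p%)` group rather than relying on a fixed spelling.

```python
import re

# Matches groups such as "Identity = 476/556 (85.61%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_columns, recomputed_percent)}."""
    stats = {}
    for label, num, den, _reported in STAT.findall(line):
        num, den = int(num), int(den)
        stats[label] = (num, den, round(100 * num / den, 2))
    return stats

line = "Identity = 476/556 (85.61%), Positives = 499/556 (89.75%), Query Frame = 0"
print(parse_hsp_stats(line))
```

For this hit, 476 of 556 aligned columns are identical (85.61%) and 499 of 556 are identical or conservative substitutions (89.75%), matching the reported values.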

BLAST of Tan0020220 vs. NCBI nr
Match: XP_023540912.1 (uncharacterized protein DDB_G0284459-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 847.0 bits (2187), Expect = 8.8e-242
Identity = 475/558 (85.13%), Positives = 496/558 (88.89%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      L  GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1   MAESDVHAKPSNLAAGQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
           VDDE KT+SGSK RVSSRR LSMP  SSN ELNEK+VL SPVPWRSRSER EVQEEADN 
Sbjct: 181 VDDEYKTISGSKRRVSSRRSLSMPMRSSNEELNEKIVLPSPVPWRSRSERKEVQEEADNL 240

Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
           P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300

Query: 301 LQAKSAEDLVRKKNFYRS--PPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRR 360
           LQAKSAEDLVRKKNFY S  PPPPPPPPPPPTVRRISSMKP  WL  +++DV HQ DLRR
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQNDLRR 360

Query: 361 SLTSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRG 420
           SLT+KPR  IRD G+  D ++ ANSS E LPRNYVDGQSMG+SVRTIRPGEVVNEPPRRG
Sbjct: 361 SLTTKPRRYIRDTGDEIDLMMDANSSAEVLPRNYVDGQSMGKSVRTIRPGEVVNEPPRRG 420

Query: 421 REFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVV 480
           REFGG D LKG KMEQN H QEFEENPIEFPDEDKE LVEKLAME  DDME+ EEED+VV
Sbjct: 421 REFGGTDQLKG-KMEQNPHAQEFEENPIEFPDEDKENLVEKLAMEAGDDMENEEEEDDVV 480

Query: 481 GQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQR 540
           GQFIREDNGEPFNVKRRD   +S  EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQR
Sbjct: 481 GQFIREDNGEPFNVKRRDFNETSC-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQR 540

Query: 541 IELIKKSSGQIGRNTSRQ 547
           IELIKKSSGQIGRNTSRQ
Sbjct: 541 IELIKKSSGQIGRNTSRQ 552

BLAST of Tan0020220 vs. NCBI nr
Match: KAG7030146.1 (hypothetical protein SDJN02_08493, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 845.9 bits (2184), Expect = 2.0e-241
Identity = 472/556 (84.89%), Positives = 496/556 (89.21%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      L  GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1   MAESDVHAKPSNLAAGQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
           VDDE KT+SGSK R+SSRR LSMP  SSN E+NEKVVL SPVPWRSRSER EVQEEA+N 
Sbjct: 181 VDDEYKTISGSKRRMSSRRSLSMPMRSSNEEMNEKVVLPSPVPWRSRSERKEVQEEAENL 240

Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
           P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300

Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
           LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP  WL  +++DV HQ DLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQNDLRRSL 360

Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
           T+KPR SIRD G+  D ++ ANSS E  PRNYVDGQSMG+SVRTIRPGEV+NEPPRRGRE
Sbjct: 361 TTKPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRPGEVLNEPPRRGRE 420

Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
           FGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME  DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDDMENEEEEDDVVGQ 480

Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
           FIREDNGEPFNVKRRD   +SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNETSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540

Query: 541 LIKKSSGQIGRNTSRQ 547
           LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 550

BLAST of Tan0020220 vs. NCBI nr
Match: KAG6596871.1 (hypothetical protein SDJN03_10051, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 845.9 bits (2184), Expect = 2.0e-241
Identity = 472/556 (84.89%), Positives = 496/556 (89.21%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      L  GQ QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1   MAESDVHAKPLNLAAGQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
           VDDE KT+SGSK R+SSRR LSMP  SSN E+NEKVVL SPVPWRSRSER EVQEEA+N 
Sbjct: 181 VDDEYKTISGSKRRMSSRRSLSMPMRSSNEEMNEKVVLPSPVPWRSRSERKEVQEEAENL 240

Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
           P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300

Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
           LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP  WL  +++DV HQ DLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQNDLRRSL 360

Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
           T+KPR SIRD G+  D ++ ANSS E  PRNYVDGQSMG+SVRTIRPGEV+NEPPRRGRE
Sbjct: 361 TTKPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRPGEVLNEPPRRGRE 420

Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
           FGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME  DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDDMENEEEEDDVVGQ 480

Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
           FIREDNGEPFNVKRRD   +SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNETSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540

Query: 541 LIKKSSGQIGRNTSRQ 547
           LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 550

BLAST of Tan0020220 vs. NCBI nr
Match: XP_022949423.1 (MAP7 domain-containing protein 1-like [Cucurbita moschata])

HSP 1 Score: 836.6 bits (2160), Expect = 1.2e-238
Identity = 471/556 (84.71%), Positives = 493/556 (88.67%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      L   Q QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1   MAESDVHAKTSNLAAEQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
           VDDE KT+SGSK RVSSRR LSMP  SSN E+NEKVVL SPVPWRSRSER EVQEEA+N 
Sbjct: 181 VDDEYKTISGSKRRVSSRRSLSMPMRSSNEEMNEKVVLPSPVPWRSRSERKEVQEEAENL 240

Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
           P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300

Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
           LQAKSAEDLVRKKNFY S  PPPPPPPPPTVRRISSMKP  WL  +D+DV HQ DLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSS--PPPPPPPPPTVRRISSMKPNSWL--HDSDVSHQNDLRRSL 360

Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
           T+KPR SIRD G+  D ++ ANSS E  PRNYVDGQSMG+SVRTIRPGEV+NEPPRRGRE
Sbjct: 361 TTKPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRPGEVLNEPPRRGRE 420

Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
           FGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME  DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDDMENEEEEDDVVGQ 480

Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
           FIREDNGEPFNVKRRD   +SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNETSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540

Query: 541 LIKKSSGQIGRNTSRQ 547
           LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 548

BLAST of Tan0020220 vs. ExPASy TrEMBL
Match: A0A6J1L1Z4 (uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC111498378 PE=4 SV=1)

HSP 1 Score: 852.8 bits (2202), Expect = 7.8e-244
Identity = 476/556 (85.61%), Positives = 499/556 (89.75%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      L  GQTQPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1   MAESDVHAKPSNLAAGQTQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+SS DE+KVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDESKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
           VD+E KT+SGSK RVSSRR LSMP  SSN ELNEK+VL SPVPWRSRSE  EVQEEADN 
Sbjct: 181 VDEEYKTISGSKRRVSSRRSLSMPMRSSNEELNEKIVLPSPVPWRSRSEWKEVQEEADNL 240

Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
           P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Sbjct: 241 PLYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300

Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
           LQAKSAEDLVRKKNFY SPPPPPPPPPPPTVRRISSMKP  WL  +++DV HQKDLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSSPPPPPPPPPPPTVRRISSMKPNSWL--HESDVSHQKDLRRSL 360

Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
            SKPR SIRD G+  D ++ ANSS E LPRNYVDGQSMG+SVRTIRPGEVVNEPPRRGRE
Sbjct: 361 ISKPRRSIRDTGDEIDLMMDANSSAEVLPRNYVDGQSMGKSVRTIRPGEVVNEPPRRGRE 420

Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
           FGG D LKG KMEQNAH QEFEENPIE+PDEDK +LVEKLAME  DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNAHAQEFEENPIEYPDEDKADLVEKLAMEAGDDMENEEEEDDVVGQ 480

Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
           FIREDNGEPFNVKRRD   SSS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNESSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540

Query: 541 LIKKSSGQIGRNTSRQ 547
           LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 550

BLAST of Tan0020220 vs. ExPASy TrEMBL
Match: A0A6J1GC23 (MAP7 domain-containing protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111452771 PE=4 SV=1)

HSP 1 Score: 836.6 bits (2160), Expect = 5.8e-239
Identity = 471/556 (84.71%), Positives = 493/556 (88.67%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      L   Q QPTPSKFH+HILYKVLTAIFFLVILPLVPSQAPEFINQTLLT
Sbjct: 1   MAESDVHAKTSNLAAEQNQPTPSKFHSHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           RSWELLHLLFVGIAVSYGLFSRRTDEIEDEI+VS+FDNVQSYVS LLHVSSVFDDEP TP
Sbjct: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEISVSRFDNVQSYVSRLLHVSSVFDDEPGTP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+SS DENKVQTW +RYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESVSSPDENKVQTWSNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNP 240
           VDDE KT+SGSK RVSSRR LSMP  SSN E+NEKVVL SPVPWRSRSER EVQEEA+N 
Sbjct: 181 VDDEYKTISGSKRRVSSRRSLSMPMRSSNEEMNEKVVLPSPVPWRSRSERKEVQEEAENL 240

Query: 241 PVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI-TQKLSPSPSPSPRKLSPSPTVSPE 300
           P+YSPAAPMEESES+WIDSRSSRP TSRSSR S I TQKL  SPSPSPRK SPSPTVSPE
Sbjct: 241 PMYSPAAPMEESESSWIDSRSSRPPTSRSSRASAISTQKL--SPSPSPRKPSPSPTVSPE 300

Query: 301 LQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDNDNDVPHQKDLRRSL 360
           LQAKSAEDLVRKKNFY S  PPPPPPPPPTVRRISSMKP  WL  +D+DV HQ DLRRSL
Sbjct: 301 LQAKSAEDLVRKKNFYSS--PPPPPPPPPTVRRISSMKPNSWL--HDSDVSHQNDLRRSL 360

Query: 361 TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE 420
           T+KPR SIRD G+  D ++ ANSS E  PRNYVDGQSMG+SVRTIRPGEV+NEPPRRGRE
Sbjct: 361 TTKPRRSIRDTGDEIDLMMDANSSAEVPPRNYVDGQSMGKSVRTIRPGEVLNEPPRRGRE 420

Query: 421 FGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMES-EEEDNVVGQ 480
           FGG D LKG KMEQN H QEFEENPIEFPDE KE LVEKLAME  DDME+ EEED+VVGQ
Sbjct: 421 FGGTDQLKG-KMEQNPHAQEFEENPIEFPDEYKENLVEKLAMEAGDDMENEEEEDDVVGQ 480

Query: 481 FIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540
           FIREDNGEPFNVKRRD   +SS EEA S+NMANDGGPDVDKKADEFIAKFREQIRLQRIE
Sbjct: 481 FIREDNGEPFNVKRRDFNETSS-EEAGSSNMANDGGPDVDKKADEFIAKFREQIRLQRIE 540

Query: 541 LIKKSSGQIGRNTSRQ 547
           LIKKSSGQIGRNTSRQ
Sbjct: 541 LIKKSSGQIGRNTSRQ 548

BLAST of Tan0020220 vs. ExPASy TrEMBL
Match: A0A6J1L1K4 (uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC111500278 PE=4 SV=1)

HSP 1 Score: 796.2 bits (2055), Expect = 8.6e-227
Identity = 447/571 (78.28%), Positives = 486/571 (85.11%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      LPPG+ Q TPSKF++HILYK+L AIFFLVILPLVPSQAPEF+NQTLLT
Sbjct: 1   MAESDVPAKPPNLPPGKDQATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           R+WELLHLLFVGIAVSYGLFSRR DE ED I+VS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDGISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+SSSD NKVQTW +RYFRNES+VVAEE PVVNEQRVRSEKPLLLPVRSL S+VV
Sbjct: 121 SANDESMSSSDGNKVQTWSNRYFRNESLVVAEESPVVNEQRVRSEKPLLLPVRSLNSQVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGE------------LNEKVVLRSPVPWRSRS 240
           VDDES+TVSGS  RVSS R+LS  K SSNGE            LNE VVL SPVPWRSRS
Sbjct: 181 VDDESRTVSGSTSRVSSGRLLSNSKRSSNGEFGGLSLEGIEDNLNENVVLPSPVPWRSRS 240

Query: 241 ERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLS-PSPSPSP 300
            R EVQEEADNPPVYSPA PMEESESNWIDSRSSRPQTSRS + S I  KLS PSPSP P
Sbjct: 241 GRTEVQEEADNPPVYSPAVPMEESESNWIDSRSSRPQTSRSFQASAI--KLSPPSPSPFP 300

Query: 301 RKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDV 360
           RK SPSP VSPEL+AKS+ED VRKK+F+ SPPPPPPPPPPP VRRI+SMKP    NDNDV
Sbjct: 301 RKPSPSPNVSPELKAKSSEDSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSLLNDNDV 360

Query: 361 PHQKDLRRSL-TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGE 420
           PHQKDL+RS+ TSKPR SIRD G+  D ++G NSS EALPRNY D  SMG+S+R IRPGE
Sbjct: 361 PHQKDLKRSVTTSKPRRSIRDTGDDIDMVMGTNSSAEALPRNYDDILSMGKSIRKIRPGE 420

Query: 421 VVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMET--DDD 480
           V NEP RRGREFGGND LKGK ++QN HVQ FEENPIEFPD+DK+E VEKL MET  DDD
Sbjct: 421 VANEPTRRGREFGGNDQLKGKMIDQNTHVQAFEENPIEFPDDDKKEPVEKLGMETDDDDD 480

Query: 481 MESEEED-NVVGQFIREDNGEPFNVKRRDNERSSSNEEA-SSNNMANDGGPDVDKKADEF 540
           MESEEED N+VG+FIREDNGEPFNV RRDNERSSSNEEA  S+N++NDGGPDVDKKADEF
Sbjct: 481 MESEEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNEEAGGSSNLSNDGGPDVDKKADEF 540

Query: 541 IAKFREQIRLQRIELIKKSSGQIGRNTSRQT 548
           IAKFREQIRLQRIE IK+S+GQI RNTS+Q+
Sbjct: 541 IAKFREQIRLQRIESIKRSTGQIRRNTSKQS 569

BLAST of Tan0020220 vs. ExPASy TrEMBL
Match: A0A6J1E6G0 (uncharacterized protein DDB_G0284459 OS=Cucurbita moschata OX=3662 GN=LOC111431041 PE=4 SV=1)

HSP 1 Score: 795.4 bits (2053), Expect = 1.5e-226
Identity = 444/574 (77.35%), Positives = 487/574 (84.84%), Query Frame = 0

Query: 1   MAESDV------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLT 60
           MAESDV      LPP + + TPSKF++HILYK+L AIFFLVILPLVPSQAPEF+NQTLLT
Sbjct: 1   MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60

Query: 61  RSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
           R+WELLHLLFVGIAVSYGLFSRR DE EDEI+VS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61  RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120

Query: 121 SANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVV 180
           SANDES+S SD NKVQTW +RYFRNESV V+EE PVVNEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180

Query: 181 VDDESKTVSGSKPRVSSRRVLSMPKGSSNGEL------------NEKVVLRSPVPWRSRS 240
           VDDES+TVSGS  RVSSRR+LS  K SSNGE+            NE V L SPVPWRSRS
Sbjct: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGVNLGGVEDNFNENVALPSPVPWRSRS 240

Query: 241 ERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVI---TQKLSPSPSP 300
            R EVQEEADNPP+YSPA PMEESESNWIDSRSSRPQTSRSS+ S I       SPSPSP
Sbjct: 241 GRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPSP 300

Query: 301 SPRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKP--WLNDN 360
           SPRK SPSP VSPEL+AKS+E  VRKK+F+ SPPPPPPPPPPP VRRI+SMKP  WL  N
Sbjct: 301 SPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWL--N 360

Query: 361 DNDVPHQKDLRRSL-TSKPRSSIRDAGNGTDTIIGANSSVEALPRNYVDGQSMGRSVRTI 420
           DNDVPHQKDL+RS+ TSKPRSSIR  G+  D ++G NSS EALPRNY D  SMG+S R I
Sbjct: 361 DNDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKI 420

Query: 421 RPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETD 480
           RPGEV NEPPRRGREFGG D LKGK ++QNAHVQ FEENPIEFP+++K+ELVEKL+METD
Sbjct: 421 RPGEVANEPPRRGREFGGYDQLKGKMIDQNAHVQAFEENPIEFPNDNKKELVEKLSMETD 480

Query: 481 DDMESEEED-NVVGQFIREDNGEPFNVKRRDNERSSSN--EEASSNNMANDGGPDVDKKA 540
           DDMES+EED N+VG+FIREDNGEPFNV RRDNERSSSN  E  SS+N++NDGGPDVDKKA
Sbjct: 481 DDMESKEEDNNMVGKFIREDNGEPFNVNRRDNERSSSNELEAGSSSNLSNDGGPDVDKKA 540

Query: 541 DEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT 548
           DEFIAKFREQIRLQRIE IK+S+GQI RNTS+QT
Sbjct: 541 DEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 572

BLAST of Tan0020220 vs. ExPASy TrEMBL
Match: A0A6J1DZG8 (WW domain-binding protein 11 OS=Momordica charantia OX=3673 GN=LOC111024943 PE=4 SV=1)

HSP 1 Score: 790.4 bits (2040), Expect = 4.7e-225
Identity = 443/577 (76.78%), Positives = 483/577 (83.71%), Query Frame = 0

Query: 1   MAESDV--------LPPGQTQPTPSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTL 60
           MAE+++        L   Q +  PSKFH+H+LYKVLTAIFFLVILPLVPS+APEFINQTL
Sbjct: 1   MAETEIRAKSPTLALRQSQAEACPSKFHSHVLYKVLTAIFFLVILPLVPSRAPEFINQTL 60

Query: 61  LTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPE 120
           LTRSWELLHLLFVGIAVSYGLFSRR +E E+E++ SKFDNVQSYVSGLLHVSSVFDDEPE
Sbjct: 61  LTRSWELLHLLFVGIAVSYGLFSRRNEEKENEVSGSKFDNVQSYVSGLLHVSSVFDDEPE 120

Query: 121 TPSANDESLSSSDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSR 180
           TPSANDESLSSSDE+KVQTW SRYFRNESVVVAEERP VNEQRVRSEKPLLLPVRSLKSR
Sbjct: 121 TPSANDESLSSSDESKVQTWSSRYFRNESVVVAEERPAVNEQRVRSEKPLLLPVRSLKSR 180

Query: 181 VVVD----DESKTVSGSKPRVSSRRVLSMPKGSSNGE------------LNEKVVLRSPV 240
           VV D    DES+ VSGSKPR SSRR+LS  K S+ GE            LNE VVLRSPV
Sbjct: 181 VVADDDLLDESRAVSGSKPRASSRRLLSKSKRSTEGEFGGVNLEEMEDKLNENVVLRSPV 240

Query: 241 PWRSRSERMEVQEEADNPPVYSPAAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPS 300
           PWRSRS RME+QEEADNPP+YSP A MEESESNWIDSRSSRPQTSRS+R + I QKLSPS
Sbjct: 241 PWRSRSGRMEMQEEADNPPMYSPVAAMEESESNWIDSRSSRPQTSRSTRANAIGQKLSPS 300

Query: 301 PSPS--PRKLSPSPTVSPELQAKSAEDLVRKKNFYRSPPPPPPPPPPPTVRRISSMK--P 360
           PSPS  P+K SP PTVSPELQ K AED VRKK+FYRSPPPPPPPPPPP VRRISSMK   
Sbjct: 301 PSPSPTPKKPSPPPTVSPELQGKGAEDFVRKKSFYRSPPPPPPPPPPPRVRRISSMKQSS 360

Query: 361 WLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGNGTDTIIGANSS-VEALPRNYVDGQSMGR 420
           WL  NDNDVPHQKDLRRS TSKPRSSIRD G+  D ++G NSS V   PRNYVD QSMG+
Sbjct: 361 WL--NDNDVPHQKDLRRSFTSKPRSSIRDTGDDIDMMVGPNSSVVNEPPRNYVDSQSMGK 420

Query: 421 SVRTIRPGEVVNEPPRRGREFGGNDPLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKL 480
           SVRTIRPGE+VNEPPRRGRE GGN+ LKG+   QN HVQ+FEENPIEFPDE+KEELVEKL
Sbjct: 421 SVRTIRPGELVNEPPRRGRELGGNE-LKGRMDHQNVHVQDFEENPIEFPDEEKEELVEKL 480

Query: 481 AMETDDDMES-EEEDNVVGQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVD 540
            METDDDME+ EEED +  +FIR+ NG  +   R+DNERSSSNEEA S++MA DGGPDVD
Sbjct: 481 DMETDDDMETEEEEDTMATEFIRDKNGGTYTETRKDNERSSSNEEAGSSSMAGDGGPDVD 540

Query: 541 KKADEFIAKFREQIRLQRIELIKKSSGQIGRNTSRQT 548
           KKADEFIAKFREQIRLQRIE IK+SSGQI RN+SRQT
Sbjct: 541 KKADEFIAKFREQIRLQRIESIKRSSGQIRRNSSRQT 574

BLAST of Tan0020220 vs. TAIR 10
Match: AT4G16790.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 159.8 bits (403), Expect = 6.0e-39
Identity = 184/548 (33.58%), Positives = 246/548 (44.89%), Query Frame = 0

Query: 16  PSKFHTHILYKVLTAIFFLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFS 75
           P KF++  ++K L       ++P+  SQ PE  NQ   TR  ELLHL+FVGIAVSYGLFS
Sbjct: 21  PRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQ---TRLLELLHLVFVGIAVSYGLFS 80

Query: 76  RR---------TDEIEDEITVSKFDNVQSYVSGLLHVSSVFDDEPETPSANDESLSSSDE 135
           RR         T   +        +N  SYV  +L VSSVF+   E+ S   +  SS D+
Sbjct: 81  RRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEVSSVFNVGHESESEPSDD-SSGDQ 140

Query: 136 NKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLK-SRVVVDDESKTVSGS 195
            K QTW ++Y  +  +   E R V        EKPLLLPVRSL  SR  V D S   SG 
Sbjct: 141 RKFQTWKNKY--HMKIPEVETRFVDRVSSENREKPLLLPVRSLNYSR--VSDSSGDNSGR 200

Query: 196 KPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSPAAPMEE 255
             +V S+R L    G  N +     VL SP+PWRSRS                  +    
Sbjct: 201 WEKVRSKRELLKTLGDDNSD-----VLPSPIPWRSRS------------------SSSSS 260

Query: 256 SESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSAEDLVRK 315
           S S  ++S  S    +      +I     PS   SPRK +P P ++ E            
Sbjct: 261 SSSKEVESLPSVKNLTTVESQPLIKNLTPPSSFSSPRKSNPIPNLASE------------ 320

Query: 316 KNFYRSPPPPPPPPPP-PTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSIRDAGN 375
             F+ SPPPPPPPPPP P     SS K      D+   ++ + R S   K +        
Sbjct: 321 --FHPSPPPPPPPPPPLPAFYNSSSRK------DHPGIYRVERRESSVHKTK-------- 380

Query: 376 GTDTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGREFGGNDPLKGKKME 435
                 G        P                 P E    PP + R          +KM+
Sbjct: 381 ----FAGGEFHPPPPP-------------PPPPPVEYYKSPPTKFRLSNERRKSSEQKMK 440

Query: 436 QNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDMESEEEDNVVGQFIRE-DNGEPFNVK 495
           +NA  + +  +PI    E KE+  EK          +++  N+  + + E +NGE    +
Sbjct: 441 RNAPKKVWWSDPIV---ESKEQDTEK----------NDQRSNLGSKAVEESENGEQ---R 473

Query: 496 RRDNERSSSNEEASSNNMANDG------GPDVDKKADEFIAKFREQIRLQRIELIKKSSG 546
           R +NE    ++E     +  +G      G DVDKKADEFIAKFREQIRLQRIE IK+S+ 
Sbjct: 501 RGENE---IHDEVEKKIVEEEGVSEINNGSDVDKKADEFIAKFREQIRLQRIESIKRSTN 473

BLAST of Tan0020220 vs. TAIR 10
Match: AT3G60380.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins in 424 species: Archae - 6; Bacteria - 372; Metazoa - 2603; Fungi - 655; Plants - 291; Viruses - 28; Other Eukaryotes - 2147 (source: NCBI BLink). )

HSP 1 Score: 99.8 bits (247), Expect = 7.4e-21
Identity = 155/542 (28.60%), Positives = 235/542 (43.36%), Query Frame = 0

Query: 33  FLVILPLVPSQAPEFINQTLLTRSWELLHLLFVGIAVSYGLFSRRTDEIEDEITVSKFDN 92
           FL+ LPL PSQAP+F+ +T+LT+ WEL+HLLFVGIAV+YGLFSRR  E   ++ +++ D 
Sbjct: 42  FLLALPLFPSQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESAVDLRMTRVDE 101

Query: 93  VQ-SYVSGLLHVSSVFDDEPETPSA------NDESLSS---------------------- 152
              SYVS +  VSSVFD+E +  S       +DES+S+                      
Sbjct: 102 SSLSYVSRIFQVSSVFDEEFDDNSCEFVDVRSDESVSARASVVGKSESFVVESGELEESS 161

Query: 153 --SDENKVQTWGSRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKS--RVVVDDES 212
              + N+V+ W S+YF+ +S VV   RP          +PL LP+R L+S  R     + 
Sbjct: 162 EFGETNEVRAWNSQYFQGKSKVVV-ARPAYGLDGHVVHQPLGLPIRRLRSSLRDNAALQD 221

Query: 213 KTVSGSKPRVSSRRVLSMPKGSSNGELNEKVVLRSPVPWRSRSERMEVQEEADNPPVYSP 272
           K+ + S     +    S+   +   E+       SPVPW++R E M +    DN P  S 
Sbjct: 222 KSFADSCDGAVNAEAESLLADNFFDEV--LAAPASPVPWQARPEMMGI---GDNYP--SN 281

Query: 273 AAPMEESESNWIDSRSSRPQTSRSSRTSVITQKLSPSPSPSPRKLSPSPTVSPELQAKSA 332
             P+   E+  + S SSR   S SS+TS  +Q        +  + SPS +VS E    + 
Sbjct: 282 FQPISVDET--LKSISSRSTGSSSSQTSYASQ--------NQNRFSPSRSVSAESLNSNV 341

Query: 333 EDLVRKKNFYRSPPPPPPPPPPPTVRRISSMKPWLNDNDNDVPHQKDLRRSLTSKPRSSI 392
           E+LV++K+   S     P  PP      S   P L  ND       +L    T +  S  
Sbjct: 342 EELVKEKSRQSSSRSSSPSLPPSPSLSPSPPSPELVPNDTR-RRSPELVTDDTPRRASHS 401

Query: 393 RDAGNGT----DTIIGANSSVEALPRNYVDGQSMGRSVRTIRPGEVVNEPPRRGRE---- 452
           R   +G+    D   G  + +E         +   +  R  +   +  E  RRG +    
Sbjct: 402 RHYSDGSLLEEDVRRGFENELEGSKVRGRKAEFFSKKERGSKSLNLAAESSRRGNKSRRS 461

Query: 453 ---------FGGND--PLKGKKMEQNAHVQEFEENPIEFPDEDKEELVEKLAMETDDDME 512
                     GG D    + + ++Q ++    EEN  +  + D   L  K    + D +E
Sbjct: 462 YPPESISSPVGGADDSTTRRQDLQQKSNCHLLEENIRKGVEADHNNLRVKKG-RSHDSLE 521

Query: 513 SEEEDNVVGQFIREDNGEPFNVKRRDNERSSSNEEASSNNMANDGGPDVDKKADEFIAKF 523
              ED+   + + E       V +  N ++S     SS      GG D   + D    K 
Sbjct: 522 LTAEDSAKDEKVSESFPALDVVFQPTNAKASRRAMRSSR-----GGRDTLPEKDVVTRKL 558

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_023005363.1  1.6e-243  85.61     uncharacterized protein DDB_G0284459-like [Cucurbita maxima]
XP_023540912.1  8.8e-242  85.13     uncharacterized protein DDB_G0284459-like [Cucurbita pepo subsp. pepo]
KAG7030146.1    2.0e-241  84.89     hypothetical protein SDJN02_08493, partial [Cucurbita argyrosperma subsp. argyro...
KAG6596871.1    2.0e-241  84.89     hypothetical protein SDJN03_10051, partial [Cucurbita argyrosperma subsp. sorori...
XP_022949423.1  1.2e-238  84.71     MAP7 domain-containing protein 1-like [Cucurbita moschata]

Match Name      E-value   Identity  Description
A0A6J1L1Z4      7.8e-244  85.61     uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC1114...
A0A6J1GC23      5.8e-239  84.71     MAP7 domain-containing protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111452...
A0A6J1L1K4      8.6e-227  78.28     uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC1115...
A0A6J1E6G0      1.5e-226  77.35     uncharacterized protein DDB_G0284459 OS=Cucurbita moschata OX=3662 GN=LOC1114310...
A0A6J1DZG8      4.7e-225  76.78     WW domain-binding protein 11 OS=Momordica charantia OX=3673 GN=LOC111024943 PE=4...

Match Name      E-value   Identity  Description
AT4G16790.1     6.0e-39   33.58     hydroxyproline-rich glycoprotein family protein
AT3G60380.1     7.4e-21   28.60     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                            Source       Source Term    Source Description                     Alignment
IPR008480  Protein of unknown function DUF761, plant  PFAM         PF05553        DUF761                                 coord: 507..536; e-value: 4.2E-12; score: 45.4
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 356..375
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 309..326
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 338..355
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 176..208
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 248..278
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 454..514
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 108..127
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 184..203
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 466..492
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 401..421
None       No IPR available                           MOBIDB_LITE  mobidb-lite    disorder_prediction                    coord: 220..421
None       No IPR available                           PANTHER      PTHR34059:SF1  EXPRESSED PROTEIN                      coord: 11..545
None       No IPR available                           PANTHER      PTHR34059      EXPRESSED PROTEIN                      coord: 11..545
None       No IPR available                           SUPERFAMILY  101447         Formin homology 2 domain (FH2 domain)  coord: 312..322
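
Several of the mobidb-lite disorder predictions listed above overlap or nest inside one another. The following is a minimal sketch of collapsing them into non-overlapping regions, assuming that all mobidb-lite rows can be treated as a single disorder track (an assumption made here for illustration, not something the record states). The helper names `parse_coord` and `merge_intervals` are hypothetical.

```python
import re


def parse_coord(text):
    """Extract (start, end) from an alignment string such as 'coord: 356..375'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError("no coordinate pair found")
    return int(match.group(1)), int(match.group(2))


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# mobidb-lite coordinates copied from the InterPro table in this record.
MOBIDB_ROWS = [
    "coord: 356..375", "coord: 309..326", "coord: 338..355",
    "coord: 176..208", "coord: 248..278", "coord: 454..514",
    "coord: 108..127", "coord: 184..203", "coord: 466..492",
    "coord: 401..421", "coord: 220..421",
]

if __name__ == "__main__":
    regions = merge_intervals(parse_coord(row) for row in MOBIDB_ROWS)
    covered = sum(end - start + 1 for start, end in regions)
    print(regions, covered)
```

Adjacent intervals are merged as well as overlapping ones because the coordinates are inclusive residue positions, so 338..355 followed by 356..375 leaves no gap.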

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0020220.1  Tan0020220.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane