Tan0021474 (gene) Snake gourd v1

Overview
NameTan0021474
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMatrin-type domain-containing protein
LocationLG02: 10130024 .. 10140679 (+)
RNA-Seq ExpressionTan0021474
SyntenyTan0021474
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTGGATGCTAGAATTGAACTGATCCACAACCCGCTTCTTTCCGAACCCGAATGTTATTCACCCCCAAAATCAACGGATCTACTCATCGAGCGGGAGATTTAGGGTTTTAAGATTTCTTCCTACAATCTTTCTGCATCAATTTCCCCAAATTTGTTCGATTTCGGAACATATTACCGCTATACGCTTGAAATTCAGTCCAAACTTTTCGTTCAATAAAGTTTCAGTGTGCACTGACGACAGATTTCGTTATTCTATACCAGTTACTCATTCTGCAAGACGCATAATCATGACTGAGGTACCCGTTTCTCCAAATGCTTTTTCTTGATCACCCTTTGGGGAAATTTCTTGTGTATATTTACTTGTTGATGAACTTGAAAGGATCTCGAGCAGCTAAGCAACTCGATTTCTTTCCGTGTTTATAAGTCACTTGGTCATCCAACCGCATGTGTCATGTAATTCTGGCACCCATCTTGCAGATGTTTGGTTTTATTGAGAATTATGTGACTTTCTATATTGCTTATTAAGTAAACTTGTCTCGTAAGTTGATCTTTTGTTTTGATACAGAGAGATGCAAATCGTTTGCATTTTTTGTTCTTACTTTATTGCTTCGTTCATTAGTTTTTGAAAGCATGTATAGTGTGTAGTTTCGGAAAGTTGAAAAATTGCTTCACGTCTTTGTTGGTTGTGATGCCAGTTCTCATTTCAATACTTGGTGCAAGTTGTAGTTTGGTTTCATGTATACAGTATACATTCATATATGCATACACATACTTACGCCGGTACATATGTGTGTTTTCAATCATGATCTTTTTTATTTTAAATTGTTGTTCGGCTTTTGATTAACATTTTGGGAATGGCATCTATGATCTTATTTTTAAAATTTTCAGTGTTTGAAAGTTCTATGGTTTCCTGACCATTTCTTGTGATGTTTGCACCTAAATGTGTTGTTCTCATGCAAAGATTACGGCTTTATTCATGCATCCATTTGTGTGGTTAAACACATACTTTAGTTAGAATGATTATCTACGACAGCAAGACTGCTAACCTGTTTTTTTAATCCCTCTCCCCAGTATTGGGTTAGCCAAGGTAACAAATGGTGCGACTTCTGCAAAATATTTATTTCAAATAATCCTTCCACAATCCGGAATCATGAGCTCGGTCAACGTCATAAGGACAATGTTGCCAAAAGGCTTGCAAACATGAGAAAAGAGAATGCTGCCAAGGACAAAGAACAAAAGGAAGCAGTACGCGCCATTGAGCAAATTGAGGCAGTAAGTGAACAAACCTTTTATACATCCCATCTTTATTGATGACTAAGTATTGTAGTATGCATCTGTATAAATTTCAAATGTTGGTGGTTTGGTTATATTCAGAAGTACTAGAGTTAGCCTTTCTGTACTAAAAACGTTATGAATGATCACGTCAACTTTATTGTTAAATTATCTGCCTGCTTCACATCAGTGCTGCAAAGGAATCAAGCTGTTTGCTTACATCCTTAATTTAGCTCAACGAAGGTTAGCTTGTGGTGGTAGTTGAATTGTTGAGTAGTGAAAGATGAATCTAGTTAATGCTCTGAACTTGACCTGTTAATTTCATGTAGCTCCCCTGACGGGGTTCTTAAAAACTAATAGTTATACATTAAATTCCGTTTCCCAGTATATAATTTGAGCATTTGAGTCTGATGTAAGTCTAGTAAGAATAGTAATGTGCTACTAGAATTAATTTATCATGATTGATTTTGCATACTAGAGTGAAGGTCTTTTCTAGATATGGTTGTCCACTTACAAAACTGCCATTCAAAGATCCAACCTTCCAAGTTGTTAGTGGCACGACGGTGTTGGCAGAGGCGGTCGACAAAAAAGTCCATTGGAGGTATAGGGAGAGAAATCGAATGAATGTGGAGAGAGATGAAGGGAAAGCAGAATGTGGAGGAGGGGAGGGAGAGAATCGGGGGAGGGGGAGTTTTTTAGGGTTAGGTAAAGCAGAAATGTGGAGGGGGGAAAGCAGAATGCGTGGGGGAGAGGGGGGAGTTTTTTAGGGTTAGGGTTTCTTTTCAATTACAAACATTTTTACCAATTTTATTTTGTGGTTTTTTAATGGTTATTTGTGGCATGTTGGGCTTTTGTCTTATTGGGCTTAAGTGGGGTTTTTATTAGGTTTTATGTACATTCAAATTTTAAACTAAATTCAAAGGCTAATTAAATTTACATTCACATTTTTTTTAATTTTAATTATTTTTATACTGGGTAAAAAAAACTAAAGTATTTTGGGCTTGATCTTTTAATATTGTTTAATAAAATAGAAAAACAATAATGTGTCCCTAACATGTTCATATCCTACATTTTTCAGAAATTGGTGTCTCCGTGTCTGGTCATGTTGCATCCGTGTCTGTGTTCGTGCTTGATCACCGCTTAACCCAAAAGCTTAAGCTGATAGGTTATGGTAAATATAATCTTATATCAAAACTTTTAACACTTCCCCACACTTGTGGGTTTAGAAATCTATAGAAGTCTCAACAAGTGGAAATTAATATTAATTGGGGAGTAAATTATATTACAGGGGGTTCAAACACGACTTTGCTTTGATACCATGTTAAATCACCACTCGATCCAAAAGTTTAAGTTAATGGGTTATGGTAAATTTAATCTTATAACAATACTTTTGACACATTTTAATGGATATATGGATACTTTTGACAATGATAAAGAAGAGACATGAAGTTGCTGAAACTTTAAATAACAGGGTTAAGTGCCAAACCAAATTCGTGAAGTTCACTTAACAGGAGACAGAATCCTGCAAGTTTGTGTGTATGGCCTTGCTATATTCATTATATATGAGCAAGACAAATAGTTTGTGCTTGATCCATGAACTTTTATAGTTTTACATTTTTGTCATATTTTTTCCCAATTTGCAGAAAGCTAAGCGTAGTTATCAAAGGGACATAGCAAATTTTCGTGAAGCTAGAGATTCTCACGCACTTCCAGTTGATGTTGAAGAAAATGGTGAAGAGAGTAAGAACTCCAGACTTGCAATTTTGTTAGTTTTCAGAAATTAGACTCTTTATCTTTCTCCAAAACTTTCAGAAAATAAATGCCTCATAATTTGATGCATATGAGCTTCTATTGACTTGCGTGGAGAATATCATTCTGTTTATGTTTGTTACAATCTGATCTACTTAAGTGGAAGGTTTATGTAATTGGTCATAATATTCCATGCACATATATTTTGCATGGTTGTTCATATAGATAGTCAATAAATTGGGCCTTCCATAGGTCAGGACACTCCTGTATCAGAGTGTAAGGATTTATATCAATTTATTGGTTAGGAAACATGTGACTTGTTGTTCTTCTAACCACTAACAATTTCAATTTGAGATTTGAAGGCCTGCTTAATTATTTCTGCTTTAGTTTTGTGCCTATGTGCTTGTCATAAGTGGGAGAGCAAAGATTTTCTTTGCTTTGGTAAGAAAGAAGAGATTTCAGTAATAATAAACAAAGAACAAAGGAGACTGGAAGACATGCATTTTATGCTCAATGTTTCTTTGATCCATCAACTTATCCATTTGAATAATTTTTAAACCTTTTGTGAGCCTGGTTCTTTTCATTGTCATTTATCAATTATCATCATCATGATCTTTGGTACTGTTCAAACTGATACAGCTCTCAATGATGACAGAATGGGAGCTTGACAGCACTTCAGGCTATTATTACAATGAAAGCAATGGTTTTTACTATGATTCAAATTCAGGCTTCTACTACTCCGATGCCATTGGTACTGAAATTGTTTTTGGATGATGAACCTTCTACTTCTATCTTTCTTTTATCTTTTAGAAACAAGCATTTGTTGCTTACTAAGTTGAATGTTTGCAACAGGCAAGTGGGTAACACAGGAAGAGGCACATGCTTCACCTCAGTTCTTTTCGAACTCCAAACATAAGAAACCAGTTTTAGAGAAGCCATCGTCGGCCTCTGCAAGTGCAGTTATAAAAGACAAAAATGTGGATAAAGGGGGAAGTGGGCCTCCGCCTGGGTTGGTTGTTTCAGCTTCTTTAAACCCCAAGCGATCTGTTAAAGGTGCTCCTTCATCAGTTGCTGTTGGTAAGAGGAAGAGGCCAGATGAGAAGCAAAAGGCCATCTCTGAAGAAGAAAAAGCTGCACTTAAAGCAAGGGAGGCAGCAAAGAAGAGGGTTGAACAGAGGGAGAAGCCACTTCTCGGCCTCTACAAATTGCCTTGAAATAGGCTGTTGCTCTCAAGTAAATTTCTCTACCATCTCAAGTTTGCAACATCCACCTTCTCCCACCTGTAAACTGATTTTTCTCTTATTGATTATATAGATTGCTTAACATGAGCTGGAAGAGTGTTACTGTATGAAACTTCCCAAAGCTCTACCTGTTATTGTTATATCTTGCTGCTTTTATTGTGGGGTTTACATGAATTCATCACCAGGGAGTATGTGGGGTTGATATAGTCTTTTGCTCCATCCATGTGTGCTTGGTTGTGTAAGGTATCCACTCCCTGTTACAAGAATAAGAAACAAGAGCAATCCAAATACAAGAGCAATATGATAATGGACAACATTCTATTGGTTATCTTCAATAGACAAAGTGGTCTGTAATTCTTGGAATAATAAGCAAATCCCTCAAGAAGGGTAAAACAACAAGATAATCCAGAACCAACAATCACCCGACTATTCGAGAGAGCTGGTCCTCTCCCAAGTCTGCACACACGCCTAGCCAAAAAGCGCCTCCTCACAGCCAAACACATTTCTATAAAACCCGGCCTTACCCTGCACAAGCACTCACTCCCACGTGGTCCCTTCCCTCCGGCAATTTTGGTCTTTACATAGTACCCTTTTGCCCCTTCTATTATAAGTATGTAAAATCGGAGGTCTAACAATACTCCCGGGTTGAAAGACACCTTGTCCTCAAGGTGGAAGTGTGGAAATTGATTCTGATAAACTCGGCTAACTCCCGAGGAAGCATCACACACGGTCGCACCTTCCCACCGCACTCACCATTCATTTACGCACTTTTATCCGAGTTATGCACCGTATGCCCAATACTTCCCTCGGGGTGGCTTCCCATTCATAGTTATCGAGTAAGTTTAGGAGGTTTGTCTGCAATGCCACATGTGTGCCACACCACTTTTTTTGAGTTGGGAGACATGGAAGACATCATGGATGGATGCCTCGGGGAAGGTTCGGTTTATATGCCACCGCCCAATCTTCTCAATGATTGTGTAAGGACCAAAAACTTAGGAGCTAGCTTTTCACAACGACGCTTGGCTATCGATAGTTGGCGATATGGCCTCAATTTCGGATATACTAGGTCTCCCACTTGGAAATTTACCTCTCCTTTTTTTATCGGCTTGTTTCTTCATTCTATCCCGGCAACTCTTAGGTGTTCTTTGAGAGCCATCAAAGCGATATCTCTCTCAATCAATTGTTGTTCTAAAGTGTTGTTTCTTTGTCTTGTTGTCCCCATAAGCTAATAAGGGTGGAGGGTGTCTGCCATAGACAATGTTGAATGGTGTAGAGTTGATGGATACATGAAATGTAGTATTGTACCAGTATTCGGCCCATGCTAACCATTTCTCCCACTTGGATGGCCGTTCACTACTGAGAAACAACGTAGGTAGTTTTCCACACAGCAGTTCACTCTCTCGATCCATCCATCCGTTTGGGGGTGGGGCGCGTGCTTTTCGCGCGATCTTGTGCCCTGAATACGAAACAACTCATTCCAGAATCGACTCACAAATATTTTATCTCGATCCGACACTATCGAACGGGGAAACCCATGCAGTCTCACAATCTCTTGAACAAATATGTCAGCCACTTCTTTGGCGGAGAAGGGATGTTTTAGCCGAATAAAATGAGCATACTTACTTAGCCGGTCGACGACCACCACCAATATGACAAAGATATCCTCGTGATTTGGGCAGCCCTTCTATGAAATCCATGGTGATATCTTCCCATATTCGCTCCGGGCGTGGGAGGGGTTGGAGGAGGCCGGCTGGAGAAATAGAGTTACTCTTATTCTTTTGGCAAATGGCACACCTCTCCACATACTTCTGGATTTCGTTTTTCATTCCAGGCCAATAGAGCTCACCCGTTAGCCGCTTGTAGGTGCGCAGGAAGCCGGAGTGTCCCCCCAACACTGAGTCGTGAAACGTGTGTAGGATGGTCGGTATCAACCTGGATTTTCGCGAAATCACCATCCTATCTTTGTACTTCAAGTGTCCCTGTTGGTTTGAAAATTTCGGATGTGATAGGGGGTCTGCCTGAATGTCTCTGATAATCTTCTGTAACTCCTCGTCTTGCTCTATCTCGGTTTGAATTTGGCTGATATCAATGATCGAAGGTACTGACAGGGGAGCTAGTGATACTTCCGGCGGCCTACGAGACAGTGCATCGGCTGCTTTATTCAAGAGTCCTGGGTGATATCTAATTTCAAAATCATACCCGAGGAGTTTGGTGAGCCACCTTTGGTACTCTGGTTGGACCTCTGTTTGCTCCAATAAATGTCTAAGTGCCCGTTGGTCTGTTAGCACAGTAAATTTTTGTCCCAGTAAGTAATGTCTCCATTTTTGGACTGCTAAAACAATCGCCATTAGCTCACGTTCGTAGGTAGATTTCATCCGAGATCGATTAGATAAGGCCTGACTGAAATAGGCCACTGGTTTGTTCTTCTGCGATAGGACTGCACCCACCCCTGTTCCAGAGGCATCGGTTTCGATCACGAACGGCAGGGAGAAATCTGGCAACACCAAGACTGGTAAGGTCACCATTGCTTGTTTGAGGAGTTCAAACGCTTTGGTTGCTTCTTCATTCCAGCTGAACGCATCCTTCTTCAACAATTGAGTCAGAGGTTGTGCTATCAGGCCATAACTTGCCACGAACCTCCTGTAGTATCCCGTCAGGCCCAAAAAGCCTCTCAGTTCTCTCACGTTCTTAGGAATGGGCCATTGGACCATGGCTCGGACCTTCTCTGGATCAGCTTCCACTCCTTTGGCGGACACCCAGTGCCCCAAATACTCTATGCGATCTGTGGCAAAAACACACTTACTTCGATTCACATACAGGTGGTTCTCCTTCAACACATTAAAAATTACTCCGAGATGCTGCAGATGAGTGTCATAATCCGGACTGTATATTAGTATGTCGTCGAAGAAAACTAGCAGAAATCGCCTTAAGAAGGGTCTGAAGATTTCATTCATCAGTGACTGAAATGTGGCTGGTGCGTTAGTTAGCCCAAATGGCATAACAAGAAACTCATAATGACCCTCATGTGTCCGGAACGCTGTCTTGGGTATGTCAGAACTGTTCATACGGATTTGATGATATCCAGATTTGAGGTCTAATTTCGAATAAACCTGAGAGCCATGTAATTCATCTAGGAGTTCATCGATCAGGCATGGGAAATTTGTTTCGGCATGCCGTCCGTTGGTTGAGTGCCCTGTAATCCACGCAAAAGCGCCATCCGCCATCTTTCTTCTTGACTAGCAGAACAGGACTTGAGAATGGACTACTACTCGGCTGTATGATCCCCGAAGCTAGCATCTCTTTGATTAGTCTTTCAATTTCTGTTTTCTGTACGTGGGCATAACGGTATGGTCTTACATTGATTGGCGATGTTCCCTCAATCAGTTCAATTCGATGATCAACTTTCCTCTGTGGTGGCAGTCCCCCATTCCACGTAAATAATTCTCAGACTCTGCTAGAGGTCGCTTACTCCCTTTGGTGTTCCGGCTATTGCCTCTTCTTCACTCTCTCGCTTCTCCTCACTCTGTTCATCGAGTCTCAGTTCTTGTAATTCAATTAGAAATCCTTGGTCTTCGCAGTGCCAGGTCTTGGCTAACATTTTTAGAGAGACCTCTGCTCTAGTCAAGGAGGGGTCGCCTTTGATAGTTACTTTGGTTCCCTTTACTTCGAAGGTCATCGATAGTTCAGTCCAGTTAACTCCCATCAACCCCATCGTTCGCAGCCATTGCATTCCTAGGATAATATCCACTCCCCCCAGGTCTAGTGGAAGGAAATCGTCCACGATCGATAATTCCGGTAGGTGTATCGCGATATCCTTGCACACCCCTTTTCCTTTGACAGCTACACCATTTCCCATTACTACTCCATAATTGGTTGTCTCCGTTTGTCTGAGTTTCAGTTCATCCACTAGTTTTTGTGATATGAAGTTGTGGGTTGCCCCACAGTCGATCAGTACTATCACCTCTTTATCCTCTAGTCTGCCTCGAATTTTCATGGTCCCCGGAGTCGAGAACCCAATCACCGATTTCAAGTCAAGTTCTACAGCGTCCCCCACTTCCACATGCCGTCTGCTCCACTAATCCTTCTTCCTGGCCATCTATCTCGACTTCAACTTGTCGTAGCTCTATTTCTCCGAATTCCCCCACTAGAAGAACTCTGAGTTCTCGAGTCTCTCTATTCTTACACTGATGACCGCGGGTGTACTTTTCATCGCATCGGAAACATAAGCCCTTTTCCTTCTTTGCCTGAAACTCGGCGTCGGATAATCGTTTGGGTGGGAGTTCCCTTCGATTTCCCGCTCCTTTATCGGGTATGGTGACAGCTCGAATATTTTCAAACGTGCGTTTAGGGTTAGTGTTCTTTTCCACAGTCCCTTGGGCCTTTCTCGTGTTTATCGATGACGGCCCATGACTCCATCCCCTGGCCGCTACCACCTTTGCTATGTCCCGTCTTCCACCGGTTGGGCACATTTTATGATTTCCTCGAGCCCAGTGAGGACATCTACTTATCACCCTCTGCCTAATTGTTGGGCTTAGCCTGCTCAAAAATGTGTTCTCCAGGACATCCTCCGTCAACCCTGCCATCAATGGAGCGGCGAAGATTTCGAACTTCTTCCGGAACTCAGCCACTGTTCCCTCTTGCTTCACGGCCAAGAATCGAGCGCAGAGACTACCCTCTTGTGAAGGGCGATATCTCTCGAACACTCTACTTTTCAGGTCTGCCCAGGATTCTATCCTCTTCCTGCTATCGGTATAGCGATACCATTCTACTACGTCGGGTTCAAAGCTGATGATCGATACGTTAATCTTCTCTAGATCATTCAGTTGGTGCATCTCGAAATAAGTTTCAGCTCTAAATAACCAGGAATCTGGGTTCTCACCTGAGAATGAAGGCATCTCGAGTTTCTTGAATTGTTTCCGTTCGCCCTTCGATTCCACTCTCCTATCCTCGTGGTTTTGGCCTACTTCTATCTCATCCCTCTTTTTCACCGTCGAACTCTCTCCTTCGACATACTTCGGAGGAGCAGGGGTGTTAATGGCCGAAATCGACTGTCCTCTTCCCGAGTTGCTTTCCACAATCATCGTCATTAGTGTCTTTTGGTTCTCTCGCATCTCGTTGGCCATCATTTCCATCTTCTTTCCCAAGGTTTGGAGCACCTCTTTCACTTCTGTCATCTCTCTCTCATTGGTGCTCATCCTCTCTTCCATCTGTTGTTGCGCCATTTTGCGTGTTCTCCCCAGGATTCTACGTGCTCTGATACCAGTTTGTAAGGTATCCACTCCCTGTTACAAGAATAAGAAACAAGAGCAATCCAAATACAAGAGCAATATGATAATGGACACTGTTCTATTGGTTATCTTCAATAGACAGTGGTCTGTAATTCTTGGAATAATAAGCAAATCCCTCAAGAAGGGTAAAACAACAAGATAATCCAGAACCAACAATCACCCAGCTATTCGAGAGAGCTGGTCCTCTCCCAAGTCTGCTACTACTGCCTAGCCAAAAACTGCCTCCTCACAGGCCAACACAGCCCTATAAAACCCAGCCTTACCCCGTACAAGCACTCACTCCCACGTGGTCCCTTCCCTCCAGCTCAGCTGGTCTTTACATAGTACCCTTTTTGCCCCTTCTATTATAAGTATGTAAAATCGGAGGTCTAACAGGTTGTTGAAAAGATGAGAAACATTTGTGCGAAGAAAATCTGTAAGCAATCAACTCGCAGGCATGAAACCGATCTTGCTTTTTCTGTGGATCTGTTTGGTTACAGTTATGACTAAGCAGAATCGTGTACTGTCGAAGGCATAGATGTGAAGGAATCTCCCTAGTATCAGATTTGACATTATCTCCCTGTTCAGAAACCAAGGCCCTTATACAAGGTGAAATTGCAGTTAAAGAGGAAAACAGAAGATACTACTTAATACATATTGTATTTGGTGAGCAATAACATTTGTCATTAATAATTCAGCCATTGCCTGGTTTTTCTTTCTATTCACAGGATAATTAGGAGCTTGAATCCTAGATGAACTAATTTTATGTCAGCTCATCTATCTTGAAAGGACTTCAACTGGTTTGTGCTAACTTTTGTTGTATAGAGTAATGTTAGTCATCAACACCTTATTTTATTTATGCATTTTCATTTCTTTTATAGTTAAACTCCTTGCCTTGAATTTTAAAGCTTTGCATTTGTGTTGTAATTTGTAAAGTGATTCTCTCATGCTTGATCTCTAATAATAATTCTCATTCAGGTAGCAAAGTAGAGCACCGTGATTTGACCTACTAATTGAGGAGATGAAGTTGTAAATTCACCTTTCTAGCTACTCATTTATCTATAATATTCCTTTTTATTTTTATTATTTTTTATTATTTTTTATTTTTGGGTGTGTCTTAAAGATGG

mRNA sequence

GTTGGATGCTAGAATTGAACTGATCCACAACCCGCTTCTTTCCGAACCCGAATGTTATTCACCCCCAAAATCAACGGATCTACTCATCGAGCGGGAGATTTAGGGTTTTAAGATTTCTTCCTACAATCTTTCTGCATCAATTTCCCCAAATTTGTTCGATTTCGGAACATATTACCGCTATACGCTTGAAATTCAGTCCAAACTTTTCGTTCAATAAAGTTTCAGTGTGCACTGACGACAGATTTCGTTATTCTATACCAGTTACTCATTCTGCAAGACGCATAATCATGACTGAGTATTGGGTTAGCCAAGGTAACAAATGGTGCGACTTCTGCAAAATATTTATTTCAAATAATCCTTCCACAATCCGGAATCATGAGCTCGGTCAACGTCATAAGGACAATGTTGCCAAAAGGCTTGCAAACATGAGAAAAGAGAATGCTGCCAAGGACAAAGAACAAAAGGAAGCAGTACGCGCCATTGAGCAAATTGAGGCAAAAGCTAAGCGTAGTTATCAAAGGGACATAGCAAATTTTCGTGAAGCTAGAGATTCTCACGCACTTCCAGTTGATGTTGAAGAAAATGGTGAAGAGAAATGGGAGCTTGACAGCACTTCAGGCTATTATTACAATGAAAGCAATGGTTTTTACTATGATTCAAATTCAGGCTTCTACTACTCCGATGCCATTGGCAAGTGGGTAACACAGGAAGAGGCACATGCTTCACCTCAGTTCTTTTCGAACTCCAAACATAAGAAACCAGTTTTAGAGAAGCCATCGTCGGCCTCTGCAAGTGCAGTTATAAAAGACAAAAATGTGGATAAAGGGGGAAGTGGGCCTCCGCCTGGGTTGGTTGTTTCAGCTTCTTTAAACCCCAAGCGATCTGTTAAAGGTGCTCCTTCATCAGTTGCTGTTGGTAAGAGGAAGAGGCCAGATGAGAAGCAAAAGGCCATCTCTGAAGAAGAAAAAGCTGCACTTAAAGCAAGGGAGGCAGCAAAGAAGAGGGTTGAACAGAGGGAGAAGCCACTTCTCGGCCTCTACAAATTGCCTTGAAATAGGCTGTTGCTCTCAAGTAGCAAAGTAGAGCACCGTGATTTGACCTACTAATTGAGGAGATGAAGTTGTAAATTCACCTTTCTAGCTACTCATTTATCTATAATATTCCTTTTTATTTTTATTATTTTTTATTATTTTTTATTTTTGGGTGTGTCTTAAAGATGG

Coding sequence (CDS)

ATGACTGAGTATTGGGTTAGCCAAGGTAACAAATGGTGCGACTTCTGCAAAATATTTATTTCAAATAATCCTTCCACAATCCGGAATCATGAGCTCGGTCAACGTCATAAGGACAATGTTGCCAAAAGGCTTGCAAACATGAGAAAAGAGAATGCTGCCAAGGACAAAGAACAAAAGGAAGCAGTACGCGCCATTGAGCAAATTGAGGCAAAAGCTAAGCGTAGTTATCAAAGGGACATAGCAAATTTTCGTGAAGCTAGAGATTCTCACGCACTTCCAGTTGATGTTGAAGAAAATGGTGAAGAGAAATGGGAGCTTGACAGCACTTCAGGCTATTATTACAATGAAAGCAATGGTTTTTACTATGATTCAAATTCAGGCTTCTACTACTCCGATGCCATTGGCAAGTGGGTAACACAGGAAGAGGCACATGCTTCACCTCAGTTCTTTTCGAACTCCAAACATAAGAAACCAGTTTTAGAGAAGCCATCGTCGGCCTCTGCAAGTGCAGTTATAAAAGACAAAAATGTGGATAAAGGGGGAAGTGGGCCTCCGCCTGGGTTGGTTGTTTCAGCTTCTTTAAACCCCAAGCGATCTGTTAAAGGTGCTCCTTCATCAGTTGCTGTTGGTAAGAGGAAGAGGCCAGATGAGAAGCAAAAGGCCATCTCTGAAGAAGAAAAAGCTGCACTTAAAGCAAGGGAGGCAGCAAAGAAGAGGGTTGAACAGAGGGAGAAGCCACTTCTCGGCCTCTACAAATTGCCTTGA

Protein sequence

MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKEAVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGFYYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKGGSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRVEQREKPLLGLYKLP
Homology
BLAST of Tan0021474 vs. ExPASy Swiss-Prot
Match: Q7XA66 (Zinc finger protein ZOP1 OS=Arabidopsis thaliana OX=3702 GN=ZOP1 PE=1 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 5.1e-70
Identity = 150/256 (58.59%), Postives = 193/256 (75.39%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWC+FCKI+I NNP++IRNH+LG+RH++ V K+L +MR+ +AAKDKE K+
Sbjct: 1   MTEYWVSQGNKWCEFCKIWIQNNPTSIRNHDLGKRHRECVDKKLTDMRERSAAKDKELKK 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
             + ++QIEAKA RSYQ+DIA  ++   ++  P    E+G   W LDS SGYYYN++NG 
Sbjct: 61  NEKLLQQIEAKATRSYQKDIATAQQVAKANGAP----EDGTSDWMLDSASGYYYNQTNGL 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           +YDS SGFYYSD+IG WVTQ+EA+A+ +  ++S  K P+++KP S+S +           
Sbjct: 121 HYDSQSGFYYSDSIGHWVTQDEAYAAVK--TSSGTKVPLVKKPVSSSGAG---------P 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVG--KRKRPDEKQKAISEEEKAALKAREAAKK 240
             G PPG +V+ASLNPKR+VKGA SSV +G  KRKR DEK K +S EEKAALKAREAA+K
Sbjct: 181 SVGKPPGRLVTASLNPKRAVKGAASSVDLGNNKRKRQDEKPKKVSAEEKAALKAREAARK 240

Query: 241 RVEQREKPLLGLYKLP 255
           RVE REKPLLGLY  P
Sbjct: 241 RVEDREKPLLGLYNRP 241

BLAST of Tan0021474 vs. ExPASy Swiss-Prot
Match: O75554 (WW domain-binding protein 4 OS=Homo sapiens OX=9606 GN=WBP4 PE=1 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 1.1e-11
Identity = 32/80 (40.00%), Postives = 57/80 (71.25%), Query Frame = 0

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
          M +YW SQ  K+CD+CK +I++N  ++  HE G+ HK+NVAKR++ +++++  K KE+++
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSVEFHERGKNHKENVAKRISEIKQKSLDKAKEEEK 60

Query: 61 AVRAIEQIEAKAKRSYQRDI 81
          A +    +EA A ++YQ D+
Sbjct: 61 ASKEFAAMEAAALKAYQEDL 80

BLAST of Tan0021474 vs. ExPASy Swiss-Prot
Match: Q61048 (WW domain-binding protein 4 OS=Mus musculus OX=10090 GN=Wbp4 PE=1 SV=4)

HSP 1 Score: 71.6 bits (174), Expect = 1.4e-11
Identity = 36/98 (36.73%), Postives = 63/98 (64.29%), Query Frame = 0

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
          M +YW SQ  K+CD+CK +I++N  ++  HE G+ HK+NVA+R++ +++++  K KE+++
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSVEFHERGKNHKENVARRISEIKQKSLDKAKEEEK 60

Query: 61 AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEE 99
          A +    +EA A ++YQ D+      R    LP D+ E
Sbjct: 61 ASKEFAAMEAAALKAYQEDL-----KRLGLPLPSDISE 93

BLAST of Tan0021474 vs. ExPASy Swiss-Prot
Match: Q5F457 (WW domain-binding protein 4 OS=Gallus gallus OX=9031 GN=WBP4 PE=2 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 2.4e-11
Identity = 33/80 (41.25%), Postives = 54/80 (67.50%), Query Frame = 0

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
          M +YW SQ  K+CD+CK +I++N  +I  HE G+ HK+NVAKR++ +RK++  K KE++ 
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSIDFHERGKNHKENVAKRISEIRKKSMEKAKEEEN 60

Query: 61 AVRAIEQIEAKAKRSYQRDI 81
            +    +E  A ++YQ D+
Sbjct: 61 MSKEFAAMEEAAMKAYQEDL 80

BLAST of Tan0021474 vs. ExPASy Swiss-Prot
Match: Q5HZF2 (WW domain-binding protein 4 OS=Rattus norvegicus OX=10116 GN=Wbp4 PE=2 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 5.4e-11
Identity = 30/80 (37.50%), Postives = 57/80 (71.25%), Query Frame = 0

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
          M +YW SQ  K+CD+CK +I++N  ++  HE G+ HK+NVA++++ +++++  K KE+++
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSVEFHERGKNHKENVARKISEIKQKSLDKAKEEEK 60

Query: 61 AVRAIEQIEAKAKRSYQRDI 81
          A +    +EA A ++YQ D+
Sbjct: 61 ASKEFAAMEAAALKAYQEDL 80

BLAST of Tan0021474 vs. NCBI nr
Match: XP_022135853.1 (zinc finger protein ZOP1 [Momordica charantia])

HSP 1 Score: 461.1 bits (1185), Expect = 6.3e-126
Identity = 238/254 (93.70%), Postives = 246/254 (96.85%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAK+KEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKEKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKAKRSYQ+DIANFR+ARDSHALPVDV+ENGEEKWE DSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKAKRSYQKDIANFRDARDSHALPVDVQENGEEKWEFDSTSGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYY DA+GKWVTQEEAHASPQFFSN KHKKP+LEKPSSASASA +KDKNVDKG
Sbjct: 121 YYDSNSGFYYCDALGKWVTQEEAHASPQFFSNFKHKKPILEKPSSASASAAMKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
            SGPPPGLVVSASLNP RSVKGAPSSVAVGKRKRPD KQK IS+EEKAALKAREAAKKRV
Sbjct: 181 ESGPPPGLVVSASLNPTRSVKGAPSSVAVGKRKRPDAKQKVISDEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           EQREKPLLGLYKLP
Sbjct: 241 EQREKPLLGLYKLP 254

BLAST of Tan0021474 vs. NCBI nr
Match: XP_038888245.1 (zinc finger protein ZOP1 [Benincasa hispida] >XP_038888246.1 zinc finger protein ZOP1 [Benincasa hispida] >XP_038888247.1 zinc finger protein ZOP1 [Benincasa hispida])

HSP 1 Score: 460.3 bits (1183), Expect = 1.1e-125
Identity = 239/254 (94.09%), Postives = 248/254 (97.64%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAK+LANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKA RSYQ+DIAN REA+DSHALPV+V+ENGEE+WELDSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKANRSYQKDIANLREAKDSHALPVNVQENGEEEWELDSTSGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDS+SGFYYSDAIGKWVTQEEAH+SPQ+F NSKHKKPVL KPSSASASA IKDKNVDKG
Sbjct: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQYFLNSKHKKPVLAKPSSASASAAIKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
            SGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV
Sbjct: 181 ESGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           EQREKPLLGLYKLP
Sbjct: 241 EQREKPLLGLYKLP 254

BLAST of Tan0021474 vs. NCBI nr
Match: XP_031744713.1 (uncharacterized protein LOC101207712 isoform X3 [Cucumis sativus] >KGN43944.1 hypothetical protein Csa_017045 [Cucumis sativus])

HSP 1 Score: 458.0 bits (1177), Expect = 5.3e-125
Identity = 235/254 (92.52%), Postives = 246/254 (96.85%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAK+LANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKA RSYQ+DIANFREARDSHALPVDV+E G+EKWELDSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKANRSYQKDIANFREARDSHALPVDVQETGDEKWELDSTSGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYYSDAIGKWVTQEEAH+SPQFF +SKHKKP+L KPSSASAS  IKDKNVDKG
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHSSPQFFLDSKHKKPILAKPSSASASTAIKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
             GPPPGLVVSASLNPKRS+KGAPSS+AVGKRKRPDEKQKAISEEEKAALKAREAAKKRV
Sbjct: 181 EGGPPPGLVVSASLNPKRSIKGAPSSIAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           E+REKPLLGLY+LP
Sbjct: 241 EKREKPLLGLYRLP 254

BLAST of Tan0021474 vs. NCBI nr
Match: XP_022953041.1 (zinc finger protein ZOP1 [Cucurbita moschata] >XP_022953042.1 zinc finger protein ZOP1 [Cucurbita moschata])

HSP 1 Score: 457.2 bits (1175), Expect = 9.1e-125
Identity = 239/254 (94.09%), Postives = 246/254 (96.85%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELG+RHKDNVAKRLANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGERHKDNVAKRLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRA+EQIEAKAKRSYQ+DIANFREARDSHALPVDV+EN EEKWELDST+GYYYNESNGF
Sbjct: 61  AVRAVEQIEAKAKRSYQKDIANFREARDSHALPVDVQEN-EEKWELDSTTGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYYSDAIGKWVTQEEAHASPQFF NSKHKKPVLE PSSASASA IKDKNVDKG
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFLNSKHKKPVLENPSSASASAAIKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
            SGPPPGLVVSAS  PKRSVKGAPSS+AVGKRKRP+EKQK ISEEEKAALKAREAAKKRV
Sbjct: 181 ESGPPPGLVVSASSKPKRSVKGAPSSIAVGKRKRPNEKQKVISEEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           EQREKPLLGLYKLP
Sbjct: 241 EQREKPLLGLYKLP 253

BLAST of Tan0021474 vs. NCBI nr
Match: XP_022972471.1 (zinc finger protein ZOP1 [Cucurbita maxima] >XP_022972472.1 zinc finger protein ZOP1 [Cucurbita maxima] >XP_023511693.1 zinc finger protein ZOP1 [Cucurbita pepo subsp. pepo] >XP_023511694.1 zinc finger protein ZOP1 [Cucurbita pepo subsp. pepo] >KAG7011666.1 Zinc finger protein ZOP1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 455.3 bits (1170), Expect = 3.5e-124
Identity = 238/254 (93.70%), Postives = 245/254 (96.46%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELG+RHKDNVAKRLANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGERHKDNVAKRLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRA+EQIEAKAKRSYQ+DIANFREARDSHALPVDV+EN EEKWELDST+GYYYNESNGF
Sbjct: 61  AVRAVEQIEAKAKRSYQKDIANFREARDSHALPVDVQEN-EEKWELDSTTGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYYSDAIGKWVTQEEAHASPQFF NSKHKKPVLE PSSASASA  KDKNVDKG
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFLNSKHKKPVLENPSSASASAATKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
            SGPPPGLVVSAS  PKRSVKGAPSS+AVGKRKRP+EKQK ISEEEKAALKAREAAKKRV
Sbjct: 181 ESGPPPGLVVSASSKPKRSVKGAPSSIAVGKRKRPNEKQKVISEEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           EQREKPLLGLYKLP
Sbjct: 241 EQREKPLLGLYKLP 253

BLAST of Tan0021474 vs. ExPASy TrEMBL
Match: A0A6J1C616 (zinc finger protein ZOP1 OS=Momordica charantia OX=3673 GN=LOC111007702 PE=4 SV=1)

HSP 1 Score: 461.1 bits (1185), Expect = 3.1e-126
Identity = 238/254 (93.70%), Postives = 246/254 (96.85%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAK+KEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKEKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKAKRSYQ+DIANFR+ARDSHALPVDV+ENGEEKWE DSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKAKRSYQKDIANFRDARDSHALPVDVQENGEEKWEFDSTSGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYY DA+GKWVTQEEAHASPQFFSN KHKKP+LEKPSSASASA +KDKNVDKG
Sbjct: 121 YYDSNSGFYYCDALGKWVTQEEAHASPQFFSNFKHKKPILEKPSSASASAAMKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
            SGPPPGLVVSASLNP RSVKGAPSSVAVGKRKRPD KQK IS+EEKAALKAREAAKKRV
Sbjct: 181 ESGPPPGLVVSASLNPTRSVKGAPSSVAVGKRKRPDAKQKVISDEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           EQREKPLLGLYKLP
Sbjct: 241 EQREKPLLGLYKLP 254

BLAST of Tan0021474 vs. ExPASy TrEMBL
Match: A0A0A0K4T4 (Matrin-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G074280 PE=4 SV=1)

HSP 1 Score: 458.0 bits (1177), Expect = 2.6e-125
Identity = 235/254 (92.52%), Postives = 246/254 (96.85%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAK+LANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKA RSYQ+DIANFREARDSHALPVDV+E G+EKWELDSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKANRSYQKDIANFREARDSHALPVDVQETGDEKWELDSTSGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYYSDAIGKWVTQEEAH+SPQFF +SKHKKP+L KPSSASAS  IKDKNVDKG
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHSSPQFFLDSKHKKPILAKPSSASASTAIKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
             GPPPGLVVSASLNPKRS+KGAPSS+AVGKRKRPDEKQKAISEEEKAALKAREAAKKRV
Sbjct: 181 EGGPPPGLVVSASLNPKRSIKGAPSSIAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           E+REKPLLGLY+LP
Sbjct: 241 EKREKPLLGLYRLP 254

BLAST of Tan0021474 vs. ExPASy TrEMBL
Match: A0A6J1GM42 (zinc finger protein ZOP1 OS=Cucurbita moschata OX=3662 GN=LOC111455563 PE=4 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 4.4e-125
Identity = 239/254 (94.09%), Postives = 246/254 (96.85%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELG+RHKDNVAKRLANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGERHKDNVAKRLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRA+EQIEAKAKRSYQ+DIANFREARDSHALPVDV+EN EEKWELDST+GYYYNESNGF
Sbjct: 61  AVRAVEQIEAKAKRSYQKDIANFREARDSHALPVDVQEN-EEKWELDSTTGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYYSDAIGKWVTQEEAHASPQFF NSKHKKPVLE PSSASASA IKDKNVDKG
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFLNSKHKKPVLENPSSASASAAIKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
            SGPPPGLVVSAS  PKRSVKGAPSS+AVGKRKRP+EKQK ISEEEKAALKAREAAKKRV
Sbjct: 181 ESGPPPGLVVSASSKPKRSVKGAPSSIAVGKRKRPNEKQKVISEEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           EQREKPLLGLYKLP
Sbjct: 241 EQREKPLLGLYKLP 253

BLAST of Tan0021474 vs. ExPASy TrEMBL
Match: A0A6J1I626 (zinc finger protein ZOP1 OS=Cucurbita maxima OX=3661 GN=LOC111471024 PE=4 SV=1)

HSP 1 Score: 455.3 bits (1170), Expect = 1.7e-124
Identity = 238/254 (93.70%), Postives = 245/254 (96.46%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELG+RHKDNVAKRLANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGERHKDNVAKRLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRA+EQIEAKAKRSYQ+DIANFREARDSHALPVDV+EN EEKWELDST+GYYYNESNGF
Sbjct: 61  AVRAVEQIEAKAKRSYQKDIANFREARDSHALPVDVQEN-EEKWELDSTTGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYYSDAIGKWVTQEEAHASPQFF NSKHKKPVLE PSSASASA  KDKNVDKG
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFLNSKHKKPVLENPSSASASAATKDKNVDKG 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
            SGPPPGLVVSAS  PKRSVKGAPSS+AVGKRKRP+EKQK ISEEEKAALKAREAAKKRV
Sbjct: 181 ESGPPPGLVVSASSKPKRSVKGAPSSIAVGKRKRPNEKQKVISEEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           EQREKPLLGLYKLP
Sbjct: 241 EQREKPLLGLYKLP 253

BLAST of Tan0021474 vs. ExPASy TrEMBL
Match: A0A1S3BZS9 (uncharacterized protein C18H10.07 OS=Cucumis melo OX=3656 GN=LOC103495224 PE=4 SV=1)

HSP 1 Score: 451.8 bits (1161), Expect = 1.9e-123
Identity = 233/254 (91.73%), Postives = 243/254 (95.67%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNV K+LANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVTKKLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKA RSYQ+DIANFREARDSHALPVDV+E G+EKWELDSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKANRSYQKDIANFREARDSHALPVDVQETGDEKWELDSTSGYYYNESNGF 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           YYDSNSGFYYSDAIGKWVTQEEAH+SPQFF +SKHKKP+L  PSSASAS  IKDKNVDK 
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHSSPQFFLDSKHKKPILGMPSSASASTAIKDKNVDKA 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240
             GPPPGLVVSASLNPKRSVKGAPSS+AVGKRKRPDEKQKAISEEEKAALKAREAAKKRV
Sbjct: 181 EGGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240

Query: 241 EQREKPLLGLYKLP 255
           E+REKPLLGLY+LP
Sbjct: 241 EKREKPLLGLYRLP 254

BLAST of Tan0021474 vs. TAIR 10
Match: AT1G49590.1 (C2H2 and C2HC zinc fingers superfamily protein )

HSP 1 Score: 265.8 bits (678), Expect = 3.6e-71
Identity = 150/256 (58.59%), Postives = 193/256 (75.39%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWC+FCKI+I NNP++IRNH+LG+RH++ V K+L +MR+ +AAKDKE K+
Sbjct: 1   MTEYWVSQGNKWCEFCKIWIQNNPTSIRNHDLGKRHRECVDKKLTDMRERSAAKDKELKK 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVDVEENGEEKWELDSTSGYYYNESNGF 120
             + ++QIEAKA RSYQ+DIA  ++   ++  P    E+G   W LDS SGYYYN++NG 
Sbjct: 61  NEKLLQQIEAKATRSYQKDIATAQQVAKANGAP----EDGTSDWMLDSASGYYYNQTNGL 120

Query: 121 YYDSNSGFYYSDAIGKWVTQEEAHASPQFFSNSKHKKPVLEKPSSASASAVIKDKNVDKG 180
           +YDS SGFYYSD+IG WVTQ+EA+A+ +  ++S  K P+++KP S+S +           
Sbjct: 121 HYDSQSGFYYSDSIGHWVTQDEAYAAVK--TSSGTKVPLVKKPVSSSGAG---------P 180

Query: 181 GSGPPPGLVVSASLNPKRSVKGAPSSVAVG--KRKRPDEKQKAISEEEKAALKAREAAKK 240
             G PPG +V+ASLNPKR+VKGA SSV +G  KRKR DEK K +S EEKAALKAREAA+K
Sbjct: 181 SVGKPPGRLVTASLNPKRAVKGAASSVDLGNNKRKRQDEKPKKVSAEEKAALKAREAARK 240

Query: 241 RVEQREKPLLGLYKLP 255
           RVE REKPLLGLY  P
Sbjct: 241 RVEDREKPLLGLYNRP 241

BLAST of Tan0021474 vs. TAIR 10
Match: AT1G49590.2 (C2H2 and C2HC zinc fingers superfamily protein )

HSP 1 Score: 120.6 bits (301), Expect = 1.9e-27
Identity = 59/104 (56.73%), Postives = 81/104 (77.88%), Query Frame = 0

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWC+FCKI+I NNP++IRNH+LG+RH++ V K+L +MR+ +AAKDKE K+
Sbjct: 1   MTEYWVSQGNKWCEFCKIWIQNNPTSIRNHDLGKRHRECVDKKLTDMRERSAAKDKELKK 60

Query: 61  AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVD-VEENGEEK 104
             + ++QIEAKA RSYQ+DIA  ++   ++  P D   EN  E+
Sbjct: 61  NEKLLQQIEAKATRSYQKDIATAQQVAKANGAPEDGTSENHHER 104

BLAST of Tan0021474 vs. TAIR 10
Match: AT1G49590.3 (C2H2 and C2HC zinc fingers superfamily protein )

HSP 1 Score: 120.2 bits (300), Expect = 2.5e-27
Identity = 56/95 (58.95%), Postives = 77/95 (81.05%), Query Frame = 0

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKRLANMRKENAAKDKEQKE 60
          MTEYWVSQGNKWC+FCKI+I NNP++IRNH+LG+RH++ V K+L +MR+ +AAKDKE K+
Sbjct: 1  MTEYWVSQGNKWCEFCKIWIQNNPTSIRNHDLGKRHRECVDKKLTDMRERSAAKDKELKK 60

Query: 61 AVRAIEQIEAKAKRSYQRDIANFREARDSHALPVD 96
            + ++QIEAKA RSYQ+DIA  ++   ++  P D
Sbjct: 61 NEKLLQQIEAKATRSYQKDIATAQQVAKANGAPED 95

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q7XA665.1e-7058.59Zinc finger protein ZOP1 OS=Arabidopsis thaliana OX=3702 GN=ZOP1 PE=1 SV=1[more]
O755541.1e-1140.00WW domain-binding protein 4 OS=Homo sapiens OX=9606 GN=WBP4 PE=1 SV=1[more]
Q610481.4e-1136.73WW domain-binding protein 4 OS=Mus musculus OX=10090 GN=Wbp4 PE=1 SV=4[more]
Q5F4572.4e-1141.25WW domain-binding protein 4 OS=Gallus gallus OX=9031 GN=WBP4 PE=2 SV=1[more]
Q5HZF25.4e-1137.50WW domain-binding protein 4 OS=Rattus norvegicus OX=10116 GN=Wbp4 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
XP_022135853.16.3e-12693.70zinc finger protein ZOP1 [Momordica charantia][more]
XP_038888245.11.1e-12594.09zinc finger protein ZOP1 [Benincasa hispida] >XP_038888246.1 zinc finger protein... [more]
XP_031744713.15.3e-12592.52uncharacterized protein LOC101207712 isoform X3 [Cucumis sativus] >KGN43944.1 hy... [more]
XP_022953041.19.1e-12594.09zinc finger protein ZOP1 [Cucurbita moschata] >XP_022953042.1 zinc finger protei... [more]
XP_022972471.13.5e-12493.70zinc finger protein ZOP1 [Cucurbita maxima] >XP_022972472.1 zinc finger protein ... [more]
Match NameE-valueIdentityDescription
A0A6J1C6163.1e-12693.70zinc finger protein ZOP1 OS=Momordica charantia OX=3673 GN=LOC111007702 PE=4 SV=... [more]
A0A0A0K4T42.6e-12592.52Matrin-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G074280... [more]
A0A6J1GM424.4e-12594.09zinc finger protein ZOP1 OS=Cucurbita moschata OX=3662 GN=LOC111455563 PE=4 SV=1[more]
A0A6J1I6261.7e-12493.70zinc finger protein ZOP1 OS=Cucurbita maxima OX=3661 GN=LOC111471024 PE=4 SV=1[more]
A0A1S3BZS91.9e-12391.73uncharacterized protein C18H10.07 OS=Cucumis melo OX=3656 GN=LOC103495224 PE=4 S... [more]
Match NameE-valueIdentityDescription
AT1G49590.13.6e-7158.59C2H2 and C2HC zinc fingers superfamily protein [more]
AT1G49590.21.9e-2756.73C2H2 and C2HC zinc fingers superfamily protein [more]
AT1G49590.32.5e-2758.95C2H2 and C2HC zinc fingers superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 37..74
NoneNo IPR availableGENE3D3.30.160.60Classic Zinc Fingercoord: 9..65
e-value: 3.4E-8
score: 35.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 149..168
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 173..254
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 210..246
IPR003604Matrin/U1-C-like, C2H2-type zinc fingerSMARTSM00451ZnF_U1_5coord: 8..43
e-value: 3.2E-6
score: 36.7
IPR013085U1-C, C2H2-type zinc fingerPFAMPF06220zf-U1coord: 11..43
e-value: 2.3E-7
score: 30.5
IPR041591OCRE domainPFAMPF17780OCREcoord: 102..151
e-value: 2.0E-14
score: 53.3
IPR040023WW domain-binding protein 4PANTHERPTHR13173WW DOMAIN BINDING PROTEIN 4coord: 1..246
IPR000690Matrin/U1-C, C2H2-type zinc fingerPROSITEPS50171ZF_MATRINcoord: 11..42
score: 9.740756
IPR036236Zinc finger C2H2 superfamilySUPERFAMILY57667beta-beta-alpha zinc fingerscoord: 12..57

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0021474.1Tan0021474.1mRNA
Tan0021474.2Tan0021474.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000398 mRNA splicing, via spliceosome
cellular_component GO:0071011 precatalytic spliceosome
cellular_component GO:0005634 nucleus
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding