Tan0003391 (gene) Snake gourd v1

Overview
Name: Tan0003391
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Histone-lysine N-methyltransferase SETD1B-like protein
Location: LG08: 69553930 .. 69557668 (+)
RNA-Seq Expression: Tan0003391
Synteny: Tan0003391
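
The location value in the overview can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split a value such as 'LG08: 69553930 .. 69557668 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    # seqid, start, end, strand
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("LG08: 69553930 .. 69557668 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 3739 bases on the plus strand of LG08.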
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTCCTTCTCTCCACATTCCACTCTCTCCCCTCACACAGTTCTTTTGCTCTGCTTTATTTATTTCTTAATCTTCTGTCAACTCATCTTTCTCGACCTTTTTTTCCCACTGAAAGTTGAAATTTCTTCAACTTGGCTTTACTGGAATCAAAATTCTCGAATTCCCCCATTTTAGCTTTCGTCTTCTCTGAATATTCACTCCCCTTTTGCAGTTGCAGAGAACACATACACACTCCCCATTGATGGCTCAAAAGCACTTACACGAGCTTCTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATCGCCGATAGACGCTCCCTTCTCAAACGCCCTTCCCCCAAATCCCATTTTCTTCACCTCAACAAACGAAAACCCATTTCCCATTCCTCTGATTTTCCCCGAAAATTTTGCAAGACCGCCTGTTTTTTCTCCTTCACTCATTCCCCTGATCTCAGAAACCCTTCGCCGCTCTTTGAATTTCACTCTCCGGTCAAGAGCCCTTCCCGGAACCCCAATCCCATTTTCCTCCATGTTCCGGCTAGAACGGCGGGGCTTCTCTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGCCCGCCAGATCCAAATCCCTACCGAAATCGAATGGTTTAGGGCTTTTCGGTTCTTTTCTTAAGCGCTTTACTCATCGCGGCCGTTCTCGGAAGCGAGAGATCAACGGCGACTGCCGGAGAAATGACCCCCGCGGCAGCCCGCCACTGCCGCCGAAAATGGCGATTAACAAGAATGAGAACGACTCTGTTTCTCCGCAGAGTAATGTAACGAGCTTTGATTTCTGCGAGAGTAATTTTTGCGATAGCCCTTTTCGGTTCGTGCTTCAATCGAGCCCCTCCGCCGGTCACCGGACGCCGGAGTTCTCTTCTCCGGCAACTTCTCCGGCTCGAAACGACCATCAGGTTTCTCTCTTCTTGAAATACTCTTTTTGTTAATGTTTTCTTTGCATCCTTATGAAATGGGATTCATCGGAAAAAACTAATTGTCGGCAGTGTTGCGTAAATACTCTGAAATTACCGCCGGAATGTCGCAGTTTCAGGCGTCAAAACACGTCAACCCCACCAATAAAAACTGGGAATCATGACGAACAAACTGATAAATCTCTCATTTTCTTTTCCACACTCTTCCTTTTTCCCGGAAAATTATGACCCTCCAAGAAAAAGACAAATCCTCTTTTGTATTTTCCTTTGTATTCTAATTCCCATAGGTACCGACATCCTTCTTCTTATAATTAAACATTTTTCCTTTTGGTTGTTAGCATCATTATAATAATTAGAAGAAGAGAAAAAGAACCAATAAAGTTAAGATGGAAATCAACAACTTCACCAAATCCAACTAAATTAATGATCCTTTTTCACACGCATTTTGTTTTGTTAGTTTGTGAACTTTTCTTGCCTTGTTTTTTTTTTTTTGGACAAATTTACAGGTCAATGATGTAGAGAGCTTGAAGAAATTGCCAGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCTGTGTTGGATCCTCCATTTGAGGATGATAACGAAGGACATTATGAGGATGGTGAGGATGAGGACGATTACGATTTGGAACGCAGCTACGCCATTGTACAAAGTATGTAAGCTTGATTCAACTCAATTGCTAATCACATTAAGGAAACATATATTAGTTTTGTTTTATGCACTGATTCGAGTTTGTTGTGCGGTTCTTATCGGTGACATTTGCATATAGGTCTGTCTGAGTTGACATTTGTTACTGATCCAAATGTCTGTTACACTCCATAGAAATGGTTAATAGAGCTTCTCATAACTCATAAGCTTGTTGCTGTACTAGATATGAACACTTGTATCTTTGGTGGGTTCCATTGGGGAGATTTCATTTATCAGATGGGTGAGGGTGAAACCAAAACAAACTTTTGAATTTAGGTAAAACTGTACCAGAATAAAGGAAAAAAGGGGGAGAAAATGTTTT
GGTTGTCAGATAAATGTTAGACGATGACATTGCGAGAAAATCTCAGCACAACGGTAACGTTGAACACTTGTTTTCGGCCCGGATGGCCTCTTCTGTTGGAGAAATGTTTTGGAGGTCTGGCCCTCCTTCTTTTACTTGCTTTTCAAAACTTCAAATCCTGACCCACGTTTTGCTGTAAGAGATGTGTCATAATTATGGGAGTCCTCTTCAATTTATTAGGCTGTTCAATCAGTTTATTCATCGTTGTCGAATGGCATCGTCATTTTCTGTACTTCAGAACAAGTCCTCAGTCCCCCCATCCCCCTACCCGCCCATCTGATTTTTTTGAACTGTTTGAGCATTTAATATAGGATTTTGAAGCATTTCAGTGACTAATCTGTGAAAGCATAACAATGAGTTGTTCTAGTAATTAGGCACATGCACTTCATCTGTAAAAGCTGTCTTTGTAGCATTTTCTATTAATTATCTTGTAACTTTTAGTTAGGAAATAGCTAAACTCTCTAGTTCCTTCTCTTCATTAGGCACATTATTTTATTTGAGCTAGAGATTCCAAATTCTACTCTTTACATTGTCGTTTCTTTGTTCTCTTTTTGAATTTACCGGTTCGCTCTTACTGAAAATGAAACATTTAGTCGAACTTCAAAGTTTGAAGATCGAAAATATGTTTGTCTTCTTCTGCAATTAAAAGGAGTTACATAACAATCACAACCACAAAAAGTTTTTCTACCTTGTTAGTACAGTTCAGTAGCTTGAGCTTGGTACAAATTTGCTATTACCTTCATGGCCACGGCCCTACTTGTTTGTTGTGCAATAAAAAAACAGCATAAATGTCGAGTTTTCAACCAGGATTTCAAAATTTATTTTTACCTTCTTCCAGGAGTTTAACTTCCATTGTACTTGGATTCTTGGTAAAAGTAAAACTTGATGTTGAAGTCAAGAGGGCATAGACTCATGATGACTGTAAATTTAAGCCTGTTCCTGATCTGTTCATGTTAATATTTGTCGTGCTTTTCAGAGGCGAAGCATCAGCTACTGAAAAAACTTCGGAGATTCGAGAGACTAGCAGAACTAGATCCAGTAGAACTCGAGACGTTTCTACTAAAAGATGAGGAAGACGAACTCGACGATGACGATGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACACAAGCCATAACTTTGATCCATCTAATAACGAAAAACACATCAAACAACACAACGTAGAGGCGAATGGCAGTTCAAGCTTCCAAATTCCTCACCGACCCGCAAAAGATACGAAGAGACTCGTCTGCAATCTCATAGCCGGGGAAAAGAGAGATCCGGTTGTGATCGACGAGAGAGAAGAGATGAGAAAGAGAGTCTACGTGAGATCAGATTTGTGGAAACGGGTGGACTCGAGGGCCATCGACGTGATGGCGGGGCAAGATTTGAAAGCAGAGCTTGATGGGTGGATCAGAAATGGGGAGCAAAGAGGAGAAATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTAGTGGAGGAAATGCAAACTGAGCTACATTGCTTAACTCATTAACTGATGGAAATTATTCCACTCCACAGAAATAATATTTAAATTTCAACAATAATCTCTAGATTTTAAATTACTTTTAGGAATATAATCTGACTTTAAGAGCCATAGGTTAAGTTTAACACTACCTCTTAAGTAGAAAGATTAGCATGAAAGGGACATCACTGTGATTTGTAAATTTA

mRNA sequence

GTCCTTCTCTCCACATTCCACTCTCTCCCCTCACACAGTTCTTTTGCTCTGCTTTATTTATTTCTTAATCTTCTGTCAACTCATCTTTCTCGACCTTTTTTTCCCACTGAAAGTTGAAATTTCTTCAACTTGGCTTTACTGGAATCAAAATTCTCGAATTCCCCCATTTTAGCTTTCGTCTTCTCTGAATATTCACTCCCCTTTTGCAGTTGCAGAGAACACATACACACTCCCCATTGATGGCTCAAAAGCACTTACACGAGCTTCTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATCGCCGATAGACGCTCCCTTCTCAAACGCCCTTCCCCCAAATCCCATTTTCTTCACCTCAACAAACGAAAACCCATTTCCCATTCCTCTGATTTTCCCCGAAAATTTTGCAAGACCGCCTGTTTTTTCTCCTTCACTCATTCCCCTGATCTCAGAAACCCTTCGCCGCTCTTTGAATTTCACTCTCCGGTCAAGAGCCCTTCCCGGAACCCCAATCCCATTTTCCTCCATGTTCCGGCTAGAACGGCGGGGCTTCTCTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGCCCGCCAGATCCAAATCCCTACCGAAATCGAATGGTTTAGGGCTTTTCGGTTCTTTTCTTAAGCGCTTTACTCATCGCGGCCGTTCTCGGAAGCGAGAGATCAACGGCGACTGCCGGAGAAATGACCCCCGCGGCAGCCCGCCACTGCCGCCGAAAATGGCGATTAACAAGAATGAGAACGACTCTGTTTCTCCGCAGAGTAATGTAACGAGCTTTGATTTCTGCGAGAGTAATTTTTGCGATAGCCCTTTTCGGTTCGTGCTTCAATCGAGCCCCTCCGCCGGTCACCGGACGCCGGAGTTCTCTTCTCCGGCAACTTCTCCGGCTCGAAACGACCATCAGGTCAATGATGTAGAGAGCTTGAAGAAATTGCCAGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCTGTGTTGGATCCTCCATTTGAGGATGATAACGAAGGACATTATGAGGATGGTGAGGATGAGGACGATTACGATTTGGAACGCAGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTACTGAAAAAACTTCGGAGATTCGAGAGACTAGCAGAACTAGATCCAGTAGAACTCGAGACGTTTCTACTAAAAGATGAGGAAGACGAACTCGACGATGACGATGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACACAAGCCATAACTTTGATCCATCTAATAACGAAAAACACATCAAACAACACAACGTAGAGGCGAATGGCAGTTCAAGCTTCCAAATTCCTCACCGACCCGCAAAAGATACGAAGAGACTCGTCTGCAATCTCATAGCCGGGGAAAAGAGAGATCCGGTTGTGATCGACGAGAGAGAAGAGATGAGAAAGAGAGTCTACGTGAGATCAGATTTGTGGAAACGGGTGGACTCGAGGGCCATCGACGTGATGGCGGGGCAAGATTTGAAAGCAGAGCTTGATGGGTGGATCAGAAATGGGGAGCAAAGAGGAGAAATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTAGTGGAGGAAATGCAAACTGAGCTACATTGCTTAACTCATTAACTGATGGAAATTATTCCACTCCACAGAAATAATATTTAAATTTCAACAATAATCTCTAGATTTTAAATTACTTTTAGGAATATAATCTGACTTTAAGAGCCATAGGTTAAGTTTAACACTACCTCTTAAGTAGAAAGATTAGCATGAAAGGGACATCACTGTGATTTGTAAATTTA

Coding sequence (CDS)

ATGGCTCAAAAGCACTTACACGAGCTTCTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATCGCCGATAGACGCTCCCTTCTCAAACGCCCTTCCCCCAAATCCCATTTTCTTCACCTCAACAAACGAAAACCCATTTCCCATTCCTCTGATTTTCCCCGAAAATTTTGCAAGACCGCCTGTTTTTTCTCCTTCACTCATTCCCCTGATCTCAGAAACCCTTCGCCGCTCTTTGAATTTCACTCTCCGGTCAAGAGCCCTTCCCGGAACCCCAATCCCATTTTCCTCCATGTTCCGGCTAGAACGGCGGGGCTTCTCTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGCCCGCCAGATCCAAATCCCTACCGAAATCGAATGGTTTAGGGCTTTTCGGTTCTTTTCTTAAGCGCTTTACTCATCGCGGCCGTTCTCGGAAGCGAGAGATCAACGGCGACTGCCGGAGAAATGACCCCCGCGGCAGCCCGCCACTGCCGCCGAAAATGGCGATTAACAAGAATGAGAACGACTCTGTTTCTCCGCAGAGTAATGTAACGAGCTTTGATTTCTGCGAGAGTAATTTTTGCGATAGCCCTTTTCGGTTCGTGCTTCAATCGAGCCCCTCCGCCGGTCACCGGACGCCGGAGTTCTCTTCTCCGGCAACTTCTCCGGCTCGAAACGACCATCAGGTCAATGATGTAGAGAGCTTGAAGAAATTGCCAGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCTGTGTTGGATCCTCCATTTGAGGATGATAACGAAGGACATTATGAGGATGGTGAGGATGAGGACGATTACGATTTGGAACGCAGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTACTGAAAAAACTTCGGAGATTCGAGAGACTAGCAGAACTAGATCCAGTAGAACTCGAGACGTTTCTACTAAAAGATGAGGAAGACGAACTCGACGATGACGATGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACACAAGCCATAACTTTGATCCATCTAATAACGAAAAACACATCAAACAACACAACGTAGAGGCGAATGGCAGTTCAAGCTTCCAAATTCCTCACCGACCCGCAAAAGATACGAAGAGACTCGTCTGCAATCTCATAGCCGGGGAAAAGAGAGATCCGGTTGTGATCGACGAGAGAGAAGAGATGAGAAAGAGAGTCTACGTGAGATCAGATTTGTGGAAACGGGTGGACTCGAGGGCCATCGACGTGATGGCGGGGCAAGATTTGAAAGCAGAGCTTGATGGGTGGATCAGAAATGGGGAGCAAAGAGGAGAAATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTAGTGGAGGAAATGCAAACTGAGCTACATTGCTTAACTCATTAA

Protein sequence

MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH
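
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a sketch assuming the standard genetic code; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last seven codons of the CDS listed above.
print(translate("ATGGCTCAAAAGCACTTACACGAG"))
print(translate("CTACATTGCTTAACTCATTAA"))
```

The two fragments translate to `MAQKHLHE` and `LHCLTH`, which agree with the start and end of the protein sequence shown in this record.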
Homology
BLAST of Tan0003391 vs. NCBI nr
Match: XP_038903007.1 (uncharacterized protein LOC120089713 [Benincasa hispida])

HSP 1 Score: 717.2 bits (1850), Expect = 9.1e-203
Identity = 380/473 (80.34%), Positives = 414/473 (87.53%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
           MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPS KSHF HLN  KPISHSSDFP KFC+
Sbjct: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHF-HLNNPKPISHSSDFPAKFCR 60

Query: 61  TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
           +ACFFSF HSPDL N SPLF F SPVK+P RNPNPIFLHVPARTAGLLLEAALRIQKQST
Sbjct: 61  SACFFSFNHSPDLINSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120

Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKN 180
            ARSKSL KSNGLG+ GSFLKR THRGR+RKREI+GD R+NDPR  PPLP KMAI  N+N
Sbjct: 121 VARSKSLGKSNGLGVLGSFLKRLTHRGRARKREIDGDGRKNDPRDGPPLPAKMAIEENEN 180

Query: 181 ENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDV 240
           ENDSVS  SNVT FDFC+SN CDSPFRFVLQSSPS GH+TPE +SPA+SPAR DHQ NDV
Sbjct: 181 ENDSVSRLSNVTGFDFCDSNLCDSPFRFVLQSSPSPGHQTPELASPASSPARLDHQANDV 240

Query: 241 ESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQ 300
           E LKKLPVEDEEEEKEQSSPVSVLDPPFEDD+EGHYEDGEDEDDY+LERS+AIVQ+AKHQ
Sbjct: 241 EGLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQQAKHQ 300

Query: 301 LLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKH 360
           LLKKLRRFERLAELDPVELETFLLKDE+++ D+DDDDIDHLKEEE+Y          +K 
Sbjct: 301 LLKKLRRFERLAELDPVELETFLLKDEDEDEDEDDDDIDHLKEEEDY----------KKD 360

Query: 361 IKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWK 420
           IK+H++EAN SS FQIPHRPA+D   LVCNL+  E+RD VVI++REEM K +YVRSDLWK
Sbjct: 361 IKEHDIEANDSSRFQIPHRPARDMTTLVCNLVTEEERDLVVIEKREEMMKGMYVRSDLWK 420

Query: 421 RVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELH 472
           RVDS AI+VM GQDLK E+DGW RN EQR EIAIEIE+AIFSLLVEEMQ ELH
Sbjct: 421 RVDSNAINVMVGQDLKEEVDGWKRNKEQRREIAIEIEVAIFSLLVEEMQPELH 462

BLAST of Tan0003391 vs. NCBI nr
Match: XP_022144766.1 (uncharacterized protein LOC111014376 [Momordica charantia])

HSP 1 Score: 677.6 bits (1747), Expect = 8.0e-191
Identity = 378/481 (78.59%), Positives = 399/481 (82.95%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
           M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPKS+ LHL +RKPIS + DFP KFCK
Sbjct: 2   MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSN-LHLKRRKPISETLDFPGKFCK 61

Query: 61  TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
           +ACFFSF  SPDLR  SPLFEF SPV    RNPN IFLHVPARTAG+LLEAALRIQKQST
Sbjct: 62  SACFFSFHESPDLRK-SPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQST 121

Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI----- 180
            ARSK   K+NGLGL GSFLKR THRGR+RKREI+GD RRND  G  PLP KMAI     
Sbjct: 122 AARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENED 181

Query: 181 -NKNENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQ 240
            N NEN SVS Q+N+TSF FCESNFCDSPFRFVLQSSPS+GHRTPEFSSPA SP R DHQ
Sbjct: 182 ENVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ 241

Query: 241 VNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQK 300
            NDVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDD+EGHYEDGEDED YDLERSY IVQK
Sbjct: 242 DNDVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQK 301

Query: 301 AKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSN 360
           AKHQLLKKLRRFE+LAELDPVELE+FLLK EEDEL DDDDDIDHLK EEEY SHNF+   
Sbjct: 302 AKHQLLKKLRRFEKLAELDPVELESFLLKGEEDEL-DDDDDIDHLK-EEEYESHNFE--- 361

Query: 361 NEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRS 420
                 QH+VEANGSSSFQIPH       RLV N I GE+RD  V D REEM K VYVRS
Sbjct: 362 ------QHDVEANGSSSFQIPH-------RLVRNRITGEQRDQAVTDNREEMTKGVYVRS 421

Query: 421 DLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLT 476
           DLWKRVDS AID   GQDLK ELDGW RN +QRGE+AIEIELAIFSLLV EMQTEL CLT
Sbjct: 422 DLWKRVDSNAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLT 458

BLAST of Tan0003391 vs. NCBI nr
Match: XP_011651995.1 (uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical protein Csa_002656 [Cucumis sativus])

HSP 1 Score: 674.5 bits (1739), Expect = 6.8e-190
Identity = 368/482 (76.35%), Positives = 399/482 (82.78%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFC 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL   KPI HSSDF  KFC
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPIPHSSDFSAKFC 60

Query: 61  KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 120
           ++ CFFSF HSPDL N SP F F SPVK+P RNPNP+F HVPARTAGLLLEAALRIQKQS
Sbjct: 61  RSTCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQS 120

Query: 121 TPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NK 180
           T ARSKS  KSNGLGL GSFLKR THR R+RKREI+GD R NDPR  PPLP KMAI  N+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENE 180

Query: 181 NENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVND 240
            ENDSV   SNVT FDFCESN CDSPFRFVLQSSPS GHRTPE SSPA+SPAR DHQ ND
Sbjct: 181 TENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAND 240

Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH 300
           VESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EGH+EDGEDEDDY+LERS+AIVQKAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKH 300

Query: 301 QLLKKLRRFERLAELDPVELETFLLKDE---EDELDD-DDDDIDHLKEEEEYTSHNFDPS 360
           QLLKKLRRFERLAELDP+ELETFLL DE   EDEL D D DDIDHLKEE E         
Sbjct: 301 QLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVE--------- 360

Query: 361 NNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVR 420
             EK IKQHN E N SS FQIP+RP++DTK LVCNLI  E+R+ VVI++ EE  KRVY+R
Sbjct: 361 QYEKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMR 420

Query: 421 SDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCL 476
            DLWKRVDS AID+M G+DLK E+DGW  N E RGEIA+EIE+AIFSLLVEEMQ+ELHCL
Sbjct: 421 QDLWKRVDSNAIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCL 472

BLAST of Tan0003391 vs. NCBI nr
Match: KAA0043909.1 (histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa] >TYK25228.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 657.9 bits (1696), Expect = 6.6e-185
Identity = 363/481 (75.47%), Positives = 396/481 (82.33%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFC 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL   KPISHS DF  KFC
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPISHSPDFSAKFC 60

Query: 61  KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 120
           ++ CFFSF HSPDL N SPLF F SPVK+P R+PNP+F HVPARTAGLLLEAALRIQKQS
Sbjct: 61  RSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQS 120

Query: 121 TPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NK 180
           T ARSKS  KSNGLGL GSFLKR THR RSRKREI+GD R NDPR  PPLP KMAI  N+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE 180

Query: 181 NENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVND 240
            ENDSV   SNVT FDFCESN CDSPFRFVLQSS S GHRTPE SSP +SPAR DHQ ND
Sbjct: 181 KENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAND 240

Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH 300
           VESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EG++EDGEDEDDY+LERS+AIVQKAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKH 300

Query: 301 QLLKKLRRFERLAELDPVELETFLLKDEEDELDD--DDDDIDHLKEE-EEYTSHNFDPSN 360
           QLLKKLRRFERLAELDP+ELETFLL DE+ + D+  D DDIDHLKEE EEY         
Sbjct: 301 QLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY--------- 360

Query: 361 NEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRS 420
            EK IKQHN E N SS FQ  +RP++DTK LVCNLI  E+R+ V I++REE  KRVY+R 
Sbjct: 361 -EKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRP 420

Query: 421 DLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLT 476
           DLWKRVDS AIDVM G+DLK E+DGW RN E RGEI IEIE+AIFSLLVEEMQ+ELHCL 
Sbjct: 421 DLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLA 468

BLAST of Tan0003391 vs. NCBI nr
Match: XP_022945267.1 (uncharacterized protein LOC111449564 [Cucurbita moschata] >KAG7028088.1 hypothetical protein SDJN02_09268 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 609.8 bits (1571), Expect = 2.1e-170
Identity = 346/473 (73.15%), Positives = 369/473 (78.01%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
           MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPSPKSH LHLNKRKPISH SDFP  FCK
Sbjct: 1   MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCK 60

Query: 61  TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
            ACF SF  SPDLRNPSPLF+F SPVKSP RN N +FLHVPA TAGLLLEAALRIQKQST
Sbjct: 61  GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST 120

Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNEN 180
            AR      SNG GL GSFLKRFTHRGRSRKREI+G CRRNDPR    LPP      NE 
Sbjct: 121 AAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-----INEK 180

Query: 181 DSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVES 240
           DSVS QSNVTS DFCE     SPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDVES
Sbjct: 181 DSVSRQSNVTSSDFCE-----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVES 240

Query: 241 LKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLL 300
           LKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY++ERSYAIV+KAKHQLL
Sbjct: 241 LKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLL 300

Query: 301 KKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIK 360
           KKLRRFERLAELDPVELETFLLKDEE EL  DDDDIDHLK EEE  SHNFD SNNEK +K
Sbjct: 301 KKLRRFERLAELDPVELETFLLKDEEGEL--DDDDIDHLK-EEECESHNFDRSNNEKDMK 360

Query: 361 QHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRV 420
           QH ++ N                                        +RVY+R DLWK V
Sbjct: 361 QHGIDGN---------------------------------------VERVYMRWDLWKEV 414

Query: 421 DSRAIDVMAGQDLKAEL-DGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC 473
           +S AIDVMAG+DL+AE+ DGW RNGE RG+IAIEIE+ IF LLVEEMQTE+ C
Sbjct: 421 ESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC 414

BLAST of Tan0003391 vs. ExPASy TrEMBL
Match: A0A6J1CUE0 (uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014376 PE=4 SV=1)

HSP 1 Score: 677.6 bits (1747), Expect = 3.9e-191
Identity = 378/481 (78.59%), Positives = 399/481 (82.95%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
           M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPKS+ LHL +RKPIS + DFP KFCK
Sbjct: 2   MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSN-LHLKRRKPISETLDFPGKFCK 61

Query: 61  TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
           +ACFFSF  SPDLR  SPLFEF SPV    RNPN IFLHVPARTAG+LLEAALRIQKQST
Sbjct: 62  SACFFSFHESPDLRK-SPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQST 121

Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI----- 180
            ARSK   K+NGLGL GSFLKR THRGR+RKREI+GD RRND  G  PLP KMAI     
Sbjct: 122 AARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENED 181

Query: 181 -NKNENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQ 240
            N NEN SVS Q+N+TSF FCESNFCDSPFRFVLQSSPS+GHRTPEFSSPA SP R DHQ
Sbjct: 182 ENVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ 241

Query: 241 VNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQK 300
            NDVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDD+EGHYEDGEDED YDLERSY IVQK
Sbjct: 242 DNDVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQK 301

Query: 301 AKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSN 360
           AKHQLLKKLRRFE+LAELDPVELE+FLLK EEDEL DDDDDIDHLK EEEY SHNF+   
Sbjct: 302 AKHQLLKKLRRFEKLAELDPVELESFLLKGEEDEL-DDDDDIDHLK-EEEYESHNFE--- 361

Query: 361 NEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRS 420
                 QH+VEANGSSSFQIPH       RLV N I GE+RD  V D REEM K VYVRS
Sbjct: 362 ------QHDVEANGSSSFQIPH-------RLVRNRITGEQRDQAVTDNREEMTKGVYVRS 421

Query: 421 DLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLT 476
           DLWKRVDS AID   GQDLK ELDGW RN +QRGE+AIEIELAIFSLLV EMQTEL CLT
Sbjct: 422 DLWKRVDSNAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLT 458

BLAST of Tan0003391 vs. ExPASy TrEMBL
Match: A0A0A0LAR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 3.3e-190
Identity = 368/482 (76.35%), Positives = 399/482 (82.78%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFC 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL   KPI HSSDF  KFC
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPIPHSSDFSAKFC 60

Query: 61  KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 120
           ++ CFFSF HSPDL N SP F F SPVK+P RNPNP+F HVPARTAGLLLEAALRIQKQS
Sbjct: 61  RSTCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQS 120

Query: 121 TPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NK 180
           T ARSKS  KSNGLGL GSFLKR THR R+RKREI+GD R NDPR  PPLP KMAI  N+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENE 180

Query: 181 NENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVND 240
            ENDSV   SNVT FDFCESN CDSPFRFVLQSSPS GHRTPE SSPA+SPAR DHQ ND
Sbjct: 181 TENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAND 240

Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH 300
           VESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EGH+EDGEDEDDY+LERS+AIVQKAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKH 300

Query: 301 QLLKKLRRFERLAELDPVELETFLLKDE---EDELDD-DDDDIDHLKEEEEYTSHNFDPS 360
           QLLKKLRRFERLAELDP+ELETFLL DE   EDEL D D DDIDHLKEE E         
Sbjct: 301 QLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVE--------- 360

Query: 361 NNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVR 420
             EK IKQHN E N SS FQIP+RP++DTK LVCNLI  E+R+ VVI++ EE  KRVY+R
Sbjct: 361 QYEKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMR 420

Query: 421 SDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCL 476
            DLWKRVDS AID+M G+DLK E+DGW  N E RGEIA+EIE+AIFSLLVEEMQ+ELHCL
Sbjct: 421 QDLWKRVDSNAIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCL 472

BLAST of Tan0003391 vs. ExPASy TrEMBL
Match: A0A5D3DNQ5 (Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003580 PE=4 SV=1)

HSP 1 Score: 657.9 bits (1696), Expect = 3.2e-185
Identity = 363/481 (75.47%), Positives = 396/481 (82.33%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFC 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL   KPISHS DF  KFC
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPISHSPDFSAKFC 60

Query: 61  KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 120
           ++ CFFSF HSPDL N SPLF F SPVK+P R+PNP+F HVPARTAGLLLEAALRIQKQS
Sbjct: 61  RSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQS 120

Query: 121 TPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NK 180
           T ARSKS  KSNGLGL GSFLKR THR RSRKREI+GD R NDPR  PPLP KMAI  N+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE 180

Query: 181 NENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVND 240
            ENDSV   SNVT FDFCESN CDSPFRFVLQSS S GHRTPE SSP +SPAR DHQ ND
Sbjct: 181 KENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAND 240

Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH 300
           VESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EG++EDGEDEDDY+LERS+AIVQKAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKH 300

Query: 301 QLLKKLRRFERLAELDPVELETFLLKDEEDELDD--DDDDIDHLKEE-EEYTSHNFDPSN 360
           QLLKKLRRFERLAELDP+ELETFLL DE+ + D+  D DDIDHLKEE EEY         
Sbjct: 301 QLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY--------- 360

Query: 361 NEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRS 420
            EK IKQHN E N SS FQ  +RP++DTK LVCNLI  E+R+ V I++REE  KRVY+R 
Sbjct: 361 -EKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRP 420

Query: 421 DLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLT 476
           DLWKRVDS AIDVM G+DLK E+DGW RN E RGEI IEIE+AIFSLLVEEMQ+ELHCL 
Sbjct: 421 DLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLA 468

BLAST of Tan0003391 vs. ExPASy TrEMBL
Match: A0A6J1G0G0 (uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC111449564 PE=4 SV=1)

HSP 1 Score: 609.8 bits (1571), Expect = 1.0e-170
Identity = 346/473 (73.15%), Positives = 369/473 (78.01%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
           MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPSPKSH LHLNKRKPISH SDFP  FCK
Sbjct: 1   MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCK 60

Query: 61  TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
            ACF SF  SPDLRNPSPLF+F SPVKSP RN N +FLHVPA TAGLLLEAALRIQKQST
Sbjct: 61  GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST 120

Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNEN 180
            AR      SNG GL GSFLKRFTHRGRSRKREI+G CRRNDPR    LPP      NE 
Sbjct: 121 AAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-----INEK 180

Query: 181 DSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVES 240
           DSVS QSNVTS DFCE     SPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDVES
Sbjct: 181 DSVSRQSNVTSSDFCE-----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVES 240

Query: 241 LKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLL 300
           LKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY++ERSYAIV+KAKHQLL
Sbjct: 241 LKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLL 300

Query: 301 KKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIK 360
           KKLRRFERLAELDPVELETFLLKDEE EL  DDDDIDHLK EEE  SHNFD SNNEK +K
Sbjct: 301 KKLRRFERLAELDPVELETFLLKDEEGEL--DDDDIDHLK-EEECESHNFDRSNNEKDMK 360

Query: 361 QHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRV 420
           QH ++ N                                        +RVY+R DLWK V
Sbjct: 361 QHGIDGN---------------------------------------VERVYMRWDLWKEV 414

Query: 421 DSRAIDVMAGQDLKAEL-DGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC 473
           +S AIDVMAG+DL+AE+ DGW RNGE RG+IAIEIE+ IF LLVEEMQTE+ C
Sbjct: 421 ESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC 414

BLAST of Tan0003391 vs. ExPASy TrEMBL
Match: A0A6J1L3C1 (uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735 PE=4 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 1.3e-167
Identity = 343/475 (72.21%), Positives = 372/475 (78.32%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
           MAQKHLHELLKEDQEPFLLTNFIA+RR +LKRPSPKSH LHLNK KPISH +DFP  FCK
Sbjct: 1   MAQKHLHELLKEDQEPFLLTNFIANRR-VLKRPSPKSHLLHLNKPKPISHFADFPASFCK 60

Query: 61  TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
            ACF SF HSPDLRNPSPLF+F SPVKSP RN N +FLHVPA TA LLLEAALRIQKQST
Sbjct: 61  GACFLSFNHSPDLRNPSPLFQFQSPVKSPCRNSNAMFLHVPATTARLLLEAALRIQKQST 120

Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNE- 180
           PAR      SNG GL GSFLKRFT+RGRSRKREI+G CRRNDP  +     KMAIN+NE 
Sbjct: 121 PAR------SNGFGLLGSFLKRFTYRGRSRKREIDGGCRRNDPSTA-----KMAINENEN 180

Query: 181 -NDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDV 240
            NDSVS QSNVTS     S+FCDSPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDV
Sbjct: 181 GNDSVSRQSNVTS-----SDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARDDHQVNDV 240

Query: 241 ESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQ 300
           ESLKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY +ERSYAIVQKAKHQ
Sbjct: 241 ESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYKMERSYAIVQKAKHQ 300

Query: 301 LLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKH 360
           LLKKLRRFERLAELDPVELETFLLKDEE +LDDD    DHL EEEE  SHNFD SNNEK 
Sbjct: 301 LLKKLRRFERLAELDPVELETFLLKDEEGKLDDDG---DHL-EEEECKSHNFDRSNNEKD 360

Query: 361 IKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWK 420
           +KQH +E+N                                        +RVY+R DLWK
Sbjct: 361 MKQHGIESN---------------------------------------VERVYMRWDLWK 415

Query: 421 RVDSRAIDVMAGQDLKAELD-GWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC 473
            V+S AIDVMA +DL+AE+D GW RNGE+RG+IAIEIE+ IF LLVEEMQTE+ C
Sbjct: 421 EVESSAIDVMAEEDLRAEVDVGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDC 415

BLAST of Tan0003391 vs. TAIR 10
Match: AT5G03670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins in 104 species: Archae - 0; Bacteria - 18; Metazoa - 333; Fungi - 60; Plants - 73; Viruses - 24; Other Eukaryotes - 192 (source: NCBI BLink). )

HSP 1 Score: 244.2 bits (622), Expect = 2.1e-64
Identity = 207/547 (37.84%), Positives = 295/547 (53.93%), Query Frame = 0

Query: 2   AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHL--NKRKPISHSSDFPRKFC 61
           +Q+HL +LL+EDQEPF L ++I+DRR  +      +H  HL   KR+PIS ++  P +FC
Sbjct: 3   SQRHLKDLLEEDQEPFQLQSYISDRRCQI-----NAHVTHLQVKKRRPISQNAGLPSRFC 62

Query: 62  KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 121
           + ACFFS   SPD +  SPLFE    +KSP+R+ N IF+++PARTA +LLEAA+RIQKQS
Sbjct: 63  RNACFFSLRESPDPKK-SPLFE----LKSPNRSQNAIFVNIPARTASILLEAAVRIQKQS 122

Query: 122 TP-ARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREING---------DCRRNDPRGSPPL 181
           +  +++++    N  G+FGS LK+ T+R   +KREI+G            ++  R   P+
Sbjct: 123 SEVSKTRTRNAGNAFGIFGSVLKKLTNR---KKREISGGKEAGRVSSSSVKDMLRWESPV 182

Query: 182 PPKMAINK---NENDSVSPQS---------------------NVT--------------- 241
             K+   K   NE ++ S Q+                     +VT               
Sbjct: 183 VRKIVTRKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFETSIS 242

Query: 242 -------SFDFC----------ESNFCDSPFRFVLQSSPS-AGHRTPEFSSPATSPARND 301
                  S +F           +  FC+SPF FVLQ+ PS  G RTP FSSPA SP  + 
Sbjct: 243 TSSRSNGSDEFAMMMNGQDLSEDKRFCESPFHFVLQTMPSNGGFRTPNFSSPAASPRHDC 302

Query: 302 HQVN----DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERS 361
           H++     +VE LKKL +E+EEEEKEQSSPVSVLDPPF+DD+E  +      DD ++  S
Sbjct: 303 HEMEKESYEVEKLKKLEMEEEEEEKEQSSPVSVLDPPFQDDDEDIH-----MDDNNIPSS 362

Query: 362 YAIVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSH 421
           +  VQKAKH LL+KL RFE+LA LDP+ELE   + D+E E ++++       EEEE  S 
Sbjct: 363 FRSVQKAKHLLLQKLCRFEQLAGLDPMELEK-RMSDQETEEEEEE-------EEEEMKSL 422

Query: 422 NFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREE--- 471
                  ++ +K +  E       ++P    +  + L+ +L A E   P  ID   E   
Sbjct: 423 YHCEIITQRVLKTYFEE-----MVEVP----EGVEALISDLAAEEL--PSDIDGEAEAAI 482

BLAST of Tan0003391 vs. TAIR 10
Match: AT2G36420.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606 proteins in 440 species: Archae - 8; Bacteria - 365; Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses - 212; Other Eukaryotes - 4176 (source: NCBI BLink). )

HSP 1 Score: 202.2 bits (513), Expect = 9.2e-52
Identity = 180/488 (36.89%), Positives = 264/488 (54.10%), Query Frame = 0

Query: 3   QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKF-CKT 62
           +KHLHE L++DQEPF L ++I + RS +         + + KRK  + ++  P  F C+ 
Sbjct: 7   KKHLHEFLEDDQEPFHLNHYIGNLRSQMGCSD-----MRVKKRKSDNVATFPPGLFSCEN 66

Query: 63  ACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST- 122
           +CFF+   SPD R  SPLFE  SP K   R+   +FL +PARTA +LL+AA RIQKQ + 
Sbjct: 67  SCFFAAHKSPDPRK-SPLFELRSPGKKKIRD-GRVFLQIPARTAAILLDAAARIQKQQSE 126

Query: 123 -PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRR-NDPRGSPPLPPKMAINKN 182
               +K+  + NG G+FGS LK  T+R  ++ R  N D    +  RGS P          
Sbjct: 127 KAKTNKARTRGNGFGMFGSVLKLLTYR-ITKPRLDNADGNAVSLERGSEP---------- 186

Query: 183 ENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSP-SAGHRTPEFSSPATSPAR---NDHQ 242
              S   +  V   D C   FC+SPF FVLQ++P S+GH+TP F+S ATSPAR    D  
Sbjct: 187 -TSSSRRERIVEISDKC---FCESPFHFVLQTTPSSSGHQTPHFTSTATSPARRSTEDED 246

Query: 243 VNDVESLKKLPVED----EEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYA 302
            ++ ESL+K+  ++    EEE+KEQ SPVSVLDP  E++ +  +   E +   +L  S+ 
Sbjct: 247 SDETESLEKVRGQEEEDKEEEDKEQCSPVSVLDPLEEEEEDEDHHQHEPDPPNNLSCSFE 306

Query: 303 IVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELD-----DDDDDIDHLKEEEEY 362
           IVQ+AK +LLKKLRRFE+LA LDPVELE  + ++E++E +     ++DD+I     +EEY
Sbjct: 307 IVQRAKRRLLKKLRRFEKLAGLDPVELEGKMSEEEDEEEEEYEESEEDDNIRIYDSDEEY 366

Query: 363 TSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREE 422
              +               EA    S     R A+D KR                 ++ +
Sbjct: 367 EDVD---------------EAMARES-----RCAEDEKR-----------------KKND 426

Query: 423 MRKRVYVRSDLWKRVDSRA---IDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLL 471
            R++ +   + W RV   A   +D +  +DL+ E   W R+G +  E   ++E +IF +L
Sbjct: 427 ERQKKWRMMNAW-RVGLGAEEDVDAVVRKDLREEAGEWTRHGGEVEEAVSDLEHSIFFVL 434
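
The percentages on each HSP summary line follow directly from the match counts and the alignment length. The sketch below recomputes them from one such line; `HSP_RE` and `hsp_percentages` are hypothetical names, and the pattern is written to accept the positives label with or without its second "i".

```python
import re

# Captures identical matches, positive matches and the alignment length.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

# Summary line of the best NCBI nr hit (Benincasa hispida).
line = "Identity = 380/473 (80.34%), Positives = 414/473 (87.53%), Query Frame = 0"
print(hsp_percentages(line))
```

For the best NCBI nr hit this gives 80.34 and 87.53, matching the values printed in the report.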

The following BLAST results are available for this feature:
BLAST of Tan0003391 vs. NCBI nr
Match Name      E-value   Identity  Description
XP_038903007.1  9.1e-203  80.34     uncharacterized protein LOC120089713 [Benincasa hispida]
XP_022144766.1  8.0e-191  78.59     uncharacterized protein LOC111014376 [Momordica charantia]
XP_011651995.1  6.8e-190  76.35     uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical ...
KAA0043909.1    6.6e-185  75.47     histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. mak...
XP_022945267.1  2.1e-170  73.15     uncharacterized protein LOC111449564 [Cucurbita moschata] >KAG7028088.1 hypothet...

BLAST of Tan0003391 vs. ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A6J1CUE0      3.9e-191  78.59     uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A0A0LAR8      3.3e-190  76.35     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1
A0A5D3DNQ5      3.2e-185  75.47     Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. m...
A0A6J1G0G0      1.0e-170  73.15     uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC1114495...
A0A6J1L3C1      1.3e-167  72.21     uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735...

BLAST of Tan0003391 vs. TAIR 10
Match Name      E-value   Identity  Description
AT5G03670.1     2.1e-64   37.84     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G36420.1     9.2e-52   36.89     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description                                      Alignment
None      No IPR available  COILS        Coil           Coil                                                    coord: 322..342
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 235..250
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 211..234
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 211..282
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                                     coord: 147..186
None      No IPR available  PANTHER      PTHR33623:SF5  HISTONE-LYSINE N-METHYLTRANSFERASE SETD1B-LIKE PROTEIN  coord: 1..160, coord: 179..473
None      No IPR available  PANTHER      PTHR33623      OS04G0572500 PROTEIN                                    coord: 1..160, coord: 179..473
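
Several of the MOBIDB_LITE disorder predictions listed above overlap or abut one another. The sketch below merges them into non-redundant regions; it assumes the coordinates are 1-based inclusive residue positions, and `parse_coord` and `merge_intervals` are hypothetical helpers written for this illustration.

```python
import re

def parse_coord(text):
    """Turn a string such as 'coord: 211..234' into (211, 234)."""
    m = re.fullmatch(r"\s*coord:\s*(\d+)\.\.(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2))

def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# The four MOBIDB_LITE rows from the InterPro table above.
rows = ["coord: 235..250", "coord: 211..234", "coord: 211..282", "coord: 147..186"]
regions = merge_intervals(parse_coord(r) for r in rows)
covered = sum(end - start + 1 for start, end in regions)
print(regions, covered)
```

The four rows collapse to two regions, 147..186 and 211..282, covering 112 residues in total under the inclusive-coordinate assumption.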

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0003391.1  Tan0003391.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0032259      methylation
molecular_function  GO:0008168      methyltransferase activity