Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTCCTTCTCTCCACATTCCACTCTCTCCCCTCACACAGTTCTTTTGCTCTGCTTTATTTATTTCTTAATCTTCTGTCAACTCATCTTTCTCGACCTTTTTTTCCCACTGAAAGTTGAAATTTCTTCAACTTGGCTTTACTGGAATCAAAATTCTCGAATTCCCCCATTTTAGCTTTCGTCTTCTCTGAATATTCACTCCCCTTTTGCAGTTGCAGAGAACACATACACACTCCCCATTGATGGCTCAAAAGCACTTACACGAGCTTCTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATCGCCGATAGACGCTCCCTTCTCAAACGCCCTTCCCCCAAATCCCATTTTCTTCACCTCAACAAACGAAAACCCATTTCCCATTCCTCTGATTTTCCCCGAAAATTTTGCAAGACCGCCTGTTTTTTCTCCTTCACTCATTCCCCTGATCTCAGAAACCCTTCGCCGCTCTTTGAATTTCACTCTCCGGTCAAGAGCCCTTCCCGGAACCCCAATCCCATTTTCCTCCATGTTCCGGCTAGAACGGCGGGGCTTCTCTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGCCCGCCAGATCCAAATCCCTACCGAAATCGAATGGTTTAGGGCTTTTCGGTTCTTTTCTTAAGCGCTTTACTCATCGCGGCCGTTCTCGGAAGCGAGAGATCAACGGCGACTGCCGGAGAAATGACCCCCGCGGCAGCCCGCCACTGCCGCCGAAAATGGCGATTAACAAGAATGAGAACGACTCTGTTTCTCCGCAGAGTAATGTAACGAGCTTTGATTTCTGCGAGAGTAATTTTTGCGATAGCCCTTTTCGGTTCGTGCTTCAATCGAGCCCCTCCGCCGGTCACCGGACGCCGGAGTTCTCTTCTCCGGCAACTTCTCCGGCTCGAAACGACCATCAGGTTTCTCTCTTCTTGAAATACTCTTTTTGTTAATGTTTTCTTTGCATCCTTATGAAATGGGATTCATCGGAAAAAACTAATTGTCGGCAGTGTTGCGTAAATACTCTGAAATTACCGCCGGAATGTCGCAGTTTCAGGCGTCAAAACACGTCAACCCCACCAATAAAAACTGGGAATCATGACGAACAAACTGATAAATCTCTCATTTTCTTTTCCACACTCTTCCTTTTTCCCGGAAAATTATGACCCTCCAAGAAAAAGACAAATCCTCTTTTGTATTTTCCTTTGTATTCTAATTCCCATAGGTACCGACATCCTTCTTCTTATAATTAAACATTTTTCCTTTTGGTTGTTAGCATCATTATAATAATTAGAAGAAGAGAAAAAGAACCAATAAAGTTAAGATGGAAATCAACAACTTCACCAAATCCAACTAAATTAATGATCCTTTTTCACACGCATTTTGTTTTGTTAGTTTGTGAACTTTTCTTGCCTTGTTTTTTTTTTTTTGGACAAATTTACAGGTCAATGATGTAGAGAGCTTGAAGAAATTGCCAGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCTGTGTTGGATCCTCCATTTGAGGATGATAACGAAGGACATTATGAGGATGGTGAGGATGAGGACGATTACGATTTGGAACGCAGCTACGCCATTGTACAAAGTATGTAAGCTTGATTCAACTCAATTGCTAATCACATTAAGGAAACATATATTAGTTTTGTTTTATGCACTGATTCGAGTTTGTTGTGCGGTTCTTATCGGTGACATTTGCATATAGGTCTGTCTGAGTTGACATTTGTTACTGATCCAAATGTCTGTTACACTCCATAGAAATGGTTAATAGAGCTTCTCATAACTCATAAGCTTGTTGCTGTACTAGATATGAACACTTGTATCTTTGGTGGGTTCCATTGGGGAGATTTCATTTATCAGATGGGTGAGGGTGAAACCAAAACAAACTTTTGAATTTAGGTAAAACTGTACCAGAATAAAGGAAAAAAGGGGGAGAAAATGTTTTGGTTGTCAGATAAATGTTAGACGATGACATTGCGAGAAAATCTCAGCACAACGGTAACGTTGAACACTTGTTTTCGGCCCGGATGGCCTCTTCTGTTGGAGAAATGTTTTGGAGGTCTGGCCCTCCTTCTTTTACTTGCTTTTCAAAACTTCAAATCCTGACCCACGTTTTGCTGTAAGAGATGTGTCATAATTATGGGAGTCCTCTTCAATTTATTAGGCTGTTCAATCAGTTTATTCATCGTTGTCGAATGGCATCGTCATTTTCTGTACTTCAGAACAAGTCCTCAGTCCCCCCATCCCCCTACCCGCCCATCTGATTTTTTTGAACTGTTTGAGCATTTAATATAGGATTTTGAAGCATTTCAGTGACTAATCTGTGAAAGCATAACAATGAGTTGTTCTAGTAATTAGGCACATGCACTTCATCTGTAAAAGCTGTCTTTGTAGCATTTTCTATTAATTATCTTGTAACTTTTAGTTAGGAAATAGCTAAACTCTCTAGTTCCTTCTCTTCATTAGGCACATTATTTTATTTGAGCTAGAGATTCCAAATTCTACTCTTTACATTGTCGTTTCTTTGTTCTCTTTTTGAATTTACCGGTTCGCTCTTACTGAAAATGAAACATTTAGTCGAACTTCAAAGTTTGAAGATCGAAAATATGTTTGTCTTCTTCTGCAATTAAAAGGAGTTACATAACAATCACAACCACAAAAAGTTTTTCTACCTTGTTAGTACAGTTCAGTAGCTTGAGCTTGGTACAAATTTGCTATTACCTTCATGGCCACGGCCCTACTTGTTTGTTGTGCAATAAAAAAACAGCATAAATGTCGAGTTTTCAACCAGGATTTCAAAATTTATTTTTACCTTCTTCCAGGAGTTTAACTTCCATTGTACTTGGATTCTTGGTAAAAGTAAAACTTGATGTTGAAGTCAAGAGGGCATAGACTCATGATGACTGTAAATTTAAGCCTGTTCCTGATCTGTTCATGTTAATATTTGTCGTGCTTTTCAGAGGCGAAGCATCAGCTACTGAAAAAACTTCGGAGATTCGAGAGACTAGCAGAACTAGATCCAGTAGAACTCGAGACGTTTCTACTAAAAGATGAGGAAGACGAACTCGACGATGACGATGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACACAAGCCATAACTTTGATCCATCTAATAACGAAAAACACATCAAACAACACAACGTAGAGGCGAATGGCAGTTCAAGCTTCCAAATTCCTCACCGACCCGCAAAAGATACGAAGAGACTCGTCTGCAATCTCATAGCCGGGGAAAAGAGAGATCCGGTTGTGATCGACGAGAGAGAAGAGATGAGAAAGAGAGTCTACGTGAGATCAGATTTGTGGAAACGGGTGGACTCGAGGGCCATCGACGTGATGGCGGGGCAAGATTTGAAAGCAGAGCTTGATGGGTGGATCAGAAATGGGGAGCAAAGAGGAGAAATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTAGTGGAGGAAATGCAAACTGAGCTACATTGCTTAACTCATTAACTGATGGAAATTATTCCACTCCACAGAAATAATATTTAAATTTCAACAATAATCTCTAGATTTTAAATTACTTTTAGGAATATAATCTGACTTTAAGAGCCATAGGTTAAGTTTAACACTACCTCTTAAGTAGAAAGATTAGCATGAAAGGGACATCACTGTGATTTGTAAATTTA
mRNA sequence
GTCCTTCTCTCCACATTCCACTCTCTCCCCTCACACAGTTCTTTTGCTCTGCTTTATTTATTTCTTAATCTTCTGTCAACTCATCTTTCTCGACCTTTTTTTCCCACTGAAAGTTGAAATTTCTTCAACTTGGCTTTACTGGAATCAAAATTCTCGAATTCCCCCATTTTAGCTTTCGTCTTCTCTGAATATTCACTCCCCTTTTGCAGTTGCAGAGAACACATACACACTCCCCATTGATGGCTCAAAAGCACTTACACGAGCTTCTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATCGCCGATAGACGCTCCCTTCTCAAACGCCCTTCCCCCAAATCCCATTTTCTTCACCTCAACAAACGAAAACCCATTTCCCATTCCTCTGATTTTCCCCGAAAATTTTGCAAGACCGCCTGTTTTTTCTCCTTCACTCATTCCCCTGATCTCAGAAACCCTTCGCCGCTCTTTGAATTTCACTCTCCGGTCAAGAGCCCTTCCCGGAACCCCAATCCCATTTTCCTCCATGTTCCGGCTAGAACGGCGGGGCTTCTCTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGCCCGCCAGATCCAAATCCCTACCGAAATCGAATGGTTTAGGGCTTTTCGGTTCTTTTCTTAAGCGCTTTACTCATCGCGGCCGTTCTCGGAAGCGAGAGATCAACGGCGACTGCCGGAGAAATGACCCCCGCGGCAGCCCGCCACTGCCGCCGAAAATGGCGATTAACAAGAATGAGAACGACTCTGTTTCTCCGCAGAGTAATGTAACGAGCTTTGATTTCTGCGAGAGTAATTTTTGCGATAGCCCTTTTCGGTTCGTGCTTCAATCGAGCCCCTCCGCCGGTCACCGGACGCCGGAGTTCTCTTCTCCGGCAACTTCTCCGGCTCGAAACGACCATCAGGTCAATGATGTAGAGAGCTTGAAGAAATTGCCAGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCTGTGTTGGATCCTCCATTTGAGGATGATAACGAAGGACATTATGAGGATGGTGAGGATGAGGACGATTACGATTTGGAACGCAGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTACTGAAAAAACTTCGGAGATTCGAGAGACTAGCAGAACTAGATCCAGTAGAACTCGAGACGTTTCTACTAAAAGATGAGGAAGACGAACTCGACGATGACGATGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACACAAGCCATAACTTTGATCCATCTAATAACGAAAAACACATCAAACAACACAACGTAGAGGCGAATGGCAGTTCAAGCTTCCAAATTCCTCACCGACCCGCAAAAGATACGAAGAGACTCGTCTGCAATCTCATAGCCGGGGAAAAGAGAGATCCGGTTGTGATCGACGAGAGAGAAGAGATGAGAAAGAGAGTCTACGTGAGATCAGATTTGTGGAAACGGGTGGACTCGAGGGCCATCGACGTGATGGCGGGGCAAGATTTGAAAGCAGAGCTTGATGGGTGGATCAGAAATGGGGAGCAAAGAGGAGAAATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTAGTGGAGGAAATGCAAACTGAGCTACATTGCTTAACTCATTAACTGATGGAAATTATTCCACTCCACAGAAATAATATTTAAATTTCAACAATAATCTCTAGATTTTAAATTACTTTTAGGAATATAATCTGACTTTAAGAGCCATAGGTTAAGTTTAACACTACCTCTTAAGTAGAAAGATTAGCATGAAAGGGACATCACTGTGATTTGTAAATTTA
Coding sequence (CDS)
ATGGCTCAAAAGCACTTACACGAGCTTCTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAATTTCATCGCCGATAGACGCTCCCTTCTCAAACGCCCTTCCCCCAAATCCCATTTTCTTCACCTCAACAAACGAAAACCCATTTCCCATTCCTCTGATTTTCCCCGAAAATTTTGCAAGACCGCCTGTTTTTTCTCCTTCACTCATTCCCCTGATCTCAGAAACCCTTCGCCGCTCTTTGAATTTCACTCTCCGGTCAAGAGCCCTTCCCGGAACCCCAATCCCATTTTCCTCCATGTTCCGGCTAGAACGGCGGGGCTTCTCTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGCCCGCCAGATCCAAATCCCTACCGAAATCGAATGGTTTAGGGCTTTTCGGTTCTTTTCTTAAGCGCTTTACTCATCGCGGCCGTTCTCGGAAGCGAGAGATCAACGGCGACTGCCGGAGAAATGACCCCCGCGGCAGCCCGCCACTGCCGCCGAAAATGGCGATTAACAAGAATGAGAACGACTCTGTTTCTCCGCAGAGTAATGTAACGAGCTTTGATTTCTGCGAGAGTAATTTTTGCGATAGCCCTTTTCGGTTCGTGCTTCAATCGAGCCCCTCCGCCGGTCACCGGACGCCGGAGTTCTCTTCTCCGGCAACTTCTCCGGCTCGAAACGACCATCAGGTCAATGATGTAGAGAGCTTGAAGAAATTGCCAGTTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCTGTGTTGGATCCTCCATTTGAGGATGATAACGAAGGACATTATGAGGATGGTGAGGATGAGGACGATTACGATTTGGAACGCAGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTACTGAAAAAACTTCGGAGATTCGAGAGACTAGCAGAACTAGATCCAGTAGAACTCGAGACGTTTCTACTAAAAGATGAGGAAGACGAACTCGACGATGACGATGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACACAAGCCATAACTTTGATCCATCTAATAACGAAAAACACATCAAACAACACAACGTAGAGGCGAATGGCAGTTCAAGCTTCCAAATTCCTCACCGACCCGCAAAAGATACGAAGAGACTCGTCTGCAATCTCATAGCCGGGGAAAAGAGAGATCCGGTTGTGATCGACGAGAGAGAAGAGATGAGAAAGAGAGTCTACGTGAGATCAGATTTGTGGAAACGGGTGGACTCGAGGGCCATCGACGTGATGGCGGGGCAAGATTTGAAAGCAGAGCTTGATGGGTGGATCAGAAATGGGGAGCAAAGAGGAGAAATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTAGTGGAGGAAATGCAAACTGAGCTACATTGCTTAACTCATTAA
Protein sequence
MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCKTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQSTPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLTH
Homology
BLAST of Tan0003391 vs. NCBI nr
Match:
XP_038903007.1 (uncharacterized protein LOC120089713 [Benincasa hispida])
HSP 1 Score: 717.2 bits (1850), Expect = 9.1e-203
Identity = 380/473 (80.34%), Postives = 414/473 (87.53%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPS KSHF HLN KPISHSSDFP KFC+
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHF-HLNNPKPISHSSDFPAKFCR 60
Query: 61 TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
+ACFFSF HSPDL N SPLF F SPVK+P RNPNPIFLHVPARTAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFNHSPDLINSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NKN 180
ARSKSL KSNGLG+ GSFLKR THRGR+RKREI+GD R+NDPR PPLP KMAI N+N
Sbjct: 121 VARSKSLGKSNGLGVLGSFLKRLTHRGRARKREIDGDGRKNDPRDGPPLPAKMAIEENEN 180
Query: 181 ENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDV 240
ENDSVS SNVT FDFC+SN CDSPFRFVLQSSPS GH+TPE +SPA+SPAR DHQ NDV
Sbjct: 181 ENDSVSRLSNVTGFDFCDSNLCDSPFRFVLQSSPSPGHQTPELASPASSPARLDHQANDV 240
Query: 241 ESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQ 300
E LKKLPVEDEEEEKEQSSPVSVLDPPFEDD+EGHYEDGEDEDDY+LERS+AIVQ+AKHQ
Sbjct: 241 EGLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQQAKHQ 300
Query: 301 LLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKH 360
LLKKLRRFERLAELDPVELETFLLKDE+++ D+DDDDIDHLKEEE+Y +K
Sbjct: 301 LLKKLRRFERLAELDPVELETFLLKDEDEDEDEDDDDIDHLKEEEDY----------KKD 360
Query: 361 IKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWK 420
IK+H++EAN SS FQIPHRPA+D LVCNL+ E+RD VVI++REEM K +YVRSDLWK
Sbjct: 361 IKEHDIEANDSSRFQIPHRPARDMTTLVCNLVTEEERDLVVIEKREEMMKGMYVRSDLWK 420
Query: 421 RVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELH 472
RVDS AI+VM GQDLK E+DGW RN EQR EIAIEIE+AIFSLLVEEMQ ELH
Sbjct: 421 RVDSNAINVMVGQDLKEEVDGWKRNKEQRREIAIEIEVAIFSLLVEEMQPELH 462
BLAST of Tan0003391 vs. NCBI nr
Match:
XP_022144766.1 (uncharacterized protein LOC111014376 [Momordica charantia])
HSP 1 Score: 677.6 bits (1747), Expect = 8.0e-191
Identity = 378/481 (78.59%), Postives = 399/481 (82.95%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPKS+ LHL +RKPIS + DFP KFCK
Sbjct: 2 MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSN-LHLKRRKPISETLDFPGKFCK 61
Query: 61 TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
+ACFFSF SPDLR SPLFEF SPV RNPN IFLHVPARTAG+LLEAALRIQKQST
Sbjct: 62 SACFFSFHESPDLRK-SPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQST 121
Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI----- 180
ARSK K+NGLGL GSFLKR THRGR+RKREI+GD RRND G PLP KMAI
Sbjct: 122 AARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENED 181
Query: 181 -NKNENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQ 240
N NEN SVS Q+N+TSF FCESNFCDSPFRFVLQSSPS+GHRTPEFSSPA SP R DHQ
Sbjct: 182 ENVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ 241
Query: 241 VNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQK 300
NDVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDD+EGHYEDGEDED YDLERSY IVQK
Sbjct: 242 DNDVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQK 301
Query: 301 AKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSN 360
AKHQLLKKLRRFE+LAELDPVELE+FLLK EEDEL DDDDDIDHLK EEEY SHNF+
Sbjct: 302 AKHQLLKKLRRFEKLAELDPVELESFLLKGEEDEL-DDDDDIDHLK-EEEYESHNFE--- 361
Query: 361 NEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRS 420
QH+VEANGSSSFQIPH RLV N I GE+RD V D REEM K VYVRS
Sbjct: 362 ------QHDVEANGSSSFQIPH-------RLVRNRITGEQRDQAVTDNREEMTKGVYVRS 421
Query: 421 DLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLT 476
DLWKRVDS AID GQDLK ELDGW RN +QRGE+AIEIELAIFSLLV EMQTEL CLT
Sbjct: 422 DLWKRVDSNAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLT 458
BLAST of Tan0003391 vs. NCBI nr
Match:
XP_011651995.1 (uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical protein Csa_002656 [Cucumis sativus])
HSP 1 Score: 674.5 bits (1739), Expect = 6.8e-190
Identity = 368/482 (76.35%), Postives = 399/482 (82.78%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFC 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL KPI HSSDF KFC
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPIPHSSDFSAKFC 60
Query: 61 KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 120
++ CFFSF HSPDL N SP F F SPVK+P RNPNP+F HVPARTAGLLLEAALRIQKQS
Sbjct: 61 RSTCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQS 120
Query: 121 TPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NK 180
T ARSKS KSNGLGL GSFLKR THR R+RKREI+GD R NDPR PPLP KMAI N+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENE 180
Query: 181 NENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVND 240
ENDSV SNVT FDFCESN CDSPFRFVLQSSPS GHRTPE SSPA+SPAR DHQ ND
Sbjct: 181 TENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAND 240
Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH 300
VESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EGH+EDGEDEDDY+LERS+AIVQKAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKH 300
Query: 301 QLLKKLRRFERLAELDPVELETFLLKDE---EDELDD-DDDDIDHLKEEEEYTSHNFDPS 360
QLLKKLRRFERLAELDP+ELETFLL DE EDEL D D DDIDHLKEE E
Sbjct: 301 QLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVE--------- 360
Query: 361 NNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVR 420
EK IKQHN E N SS FQIP+RP++DTK LVCNLI E+R+ VVI++ EE KRVY+R
Sbjct: 361 QYEKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMR 420
Query: 421 SDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCL 476
DLWKRVDS AID+M G+DLK E+DGW N E RGEIA+EIE+AIFSLLVEEMQ+ELHCL
Sbjct: 421 QDLWKRVDSNAIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCL 472
BLAST of Tan0003391 vs. NCBI nr
Match:
KAA0043909.1 (histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa] >TYK25228.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa])
HSP 1 Score: 657.9 bits (1696), Expect = 6.6e-185
Identity = 363/481 (75.47%), Postives = 396/481 (82.33%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFC 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL KPISHS DF KFC
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPISHSPDFSAKFC 60
Query: 61 KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 120
++ CFFSF HSPDL N SPLF F SPVK+P R+PNP+F HVPARTAGLLLEAALRIQKQS
Sbjct: 61 RSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQS 120
Query: 121 TPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NK 180
T ARSKS KSNGLGL GSFLKR THR RSRKREI+GD R NDPR PPLP KMAI N+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE 180
Query: 181 NENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVND 240
ENDSV SNVT FDFCESN CDSPFRFVLQSS S GHRTPE SSP +SPAR DHQ ND
Sbjct: 181 KENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAND 240
Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH 300
VESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EG++EDGEDEDDY+LERS+AIVQKAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKH 300
Query: 301 QLLKKLRRFERLAELDPVELETFLLKDEEDELDD--DDDDIDHLKEE-EEYTSHNFDPSN 360
QLLKKLRRFERLAELDP+ELETFLL DE+ + D+ D DDIDHLKEE EEY
Sbjct: 301 QLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY--------- 360
Query: 361 NEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRS 420
EK IKQHN E N SS FQ +RP++DTK LVCNLI E+R+ V I++REE KRVY+R
Sbjct: 361 -EKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRP 420
Query: 421 DLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLT 476
DLWKRVDS AIDVM G+DLK E+DGW RN E RGEI IEIE+AIFSLLVEEMQ+ELHCL
Sbjct: 421 DLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLA 468
BLAST of Tan0003391 vs. NCBI nr
Match:
XP_022945267.1 (uncharacterized protein LOC111449564 [Cucurbita moschata] >KAG7028088.1 hypothetical protein SDJN02_09268 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 609.8 bits (1571), Expect = 2.1e-170
Identity = 346/473 (73.15%), Postives = 369/473 (78.01%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPSPKSH LHLNKRKPISH SDFP FCK
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCK 60
Query: 61 TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
ACF SF SPDLRNPSPLF+F SPVKSP RN N +FLHVPA TAGLLLEAALRIQKQST
Sbjct: 61 GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST 120
Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNEN 180
AR SNG GL GSFLKRFTHRGRSRKREI+G CRRNDPR LPP NE
Sbjct: 121 AAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-----INEK 180
Query: 181 DSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVES 240
DSVS QSNVTS DFCE SPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDVES
Sbjct: 181 DSVSRQSNVTSSDFCE-----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVES 240
Query: 241 LKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLL 300
LKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY++ERSYAIV+KAKHQLL
Sbjct: 241 LKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLL 300
Query: 301 KKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIK 360
KKLRRFERLAELDPVELETFLLKDEE EL DDDDIDHLK EEE SHNFD SNNEK +K
Sbjct: 301 KKLRRFERLAELDPVELETFLLKDEEGEL--DDDDIDHLK-EEECESHNFDRSNNEKDMK 360
Query: 361 QHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRV 420
QH ++ N +RVY+R DLWK V
Sbjct: 361 QHGIDGN---------------------------------------VERVYMRWDLWKEV 414
Query: 421 DSRAIDVMAGQDLKAEL-DGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC 473
+S AIDVMAG+DL+AE+ DGW RNGE RG+IAIEIE+ IF LLVEEMQTE+ C
Sbjct: 421 ESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC 414
BLAST of Tan0003391 vs. ExPASy TrEMBL
Match:
A0A6J1CUE0 (uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014376 PE=4 SV=1)
HSP 1 Score: 677.6 bits (1747), Expect = 3.9e-191
Identity = 378/481 (78.59%), Postives = 399/481 (82.95%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPKS+ LHL +RKPIS + DFP KFCK
Sbjct: 2 MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSN-LHLKRRKPISETLDFPGKFCK 61
Query: 61 TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
+ACFFSF SPDLR SPLFEF SPV RNPN IFLHVPARTAG+LLEAALRIQKQST
Sbjct: 62 SACFFSFHESPDLRK-SPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQST 121
Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI----- 180
ARSK K+NGLGL GSFLKR THRGR+RKREI+GD RRND G PLP KMAI
Sbjct: 122 AARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENED 181
Query: 181 -NKNENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQ 240
N NEN SVS Q+N+TSF FCESNFCDSPFRFVLQSSPS+GHRTPEFSSPA SP R DHQ
Sbjct: 182 ENVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ 241
Query: 241 VNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQK 300
NDVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDD+EGHYEDGEDED YDLERSY IVQK
Sbjct: 242 DNDVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQK 301
Query: 301 AKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSN 360
AKHQLLKKLRRFE+LAELDPVELE+FLLK EEDEL DDDDDIDHLK EEEY SHNF+
Sbjct: 302 AKHQLLKKLRRFEKLAELDPVELESFLLKGEEDEL-DDDDDIDHLK-EEEYESHNFE--- 361
Query: 361 NEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRS 420
QH+VEANGSSSFQIPH RLV N I GE+RD V D REEM K VYVRS
Sbjct: 362 ------QHDVEANGSSSFQIPH-------RLVRNRITGEQRDQAVTDNREEMTKGVYVRS 421
Query: 421 DLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLT 476
DLWKRVDS AID GQDLK ELDGW RN +QRGE+AIEIELAIFSLLV EMQTEL CLT
Sbjct: 422 DLWKRVDSNAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLT 458
BLAST of Tan0003391 vs. ExPASy TrEMBL
Match:
A0A0A0LAR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1)
HSP 1 Score: 674.5 bits (1739), Expect = 3.3e-190
Identity = 368/482 (76.35%), Postives = 399/482 (82.78%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFC 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL KPI HSSDF KFC
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPIPHSSDFSAKFC 60
Query: 61 KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 120
++ CFFSF HSPDL N SP F F SPVK+P RNPNP+F HVPARTAGLLLEAALRIQKQS
Sbjct: 61 RSTCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQS 120
Query: 121 TPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NK 180
T ARSKS KSNGLGL GSFLKR THR R+RKREI+GD R NDPR PPLP KMAI N+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENE 180
Query: 181 NENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVND 240
ENDSV SNVT FDFCESN CDSPFRFVLQSSPS GHRTPE SSPA+SPAR DHQ ND
Sbjct: 181 TENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAND 240
Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH 300
VESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EGH+EDGEDEDDY+LERS+AIVQKAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKH 300
Query: 301 QLLKKLRRFERLAELDPVELETFLLKDE---EDELDD-DDDDIDHLKEEEEYTSHNFDPS 360
QLLKKLRRFERLAELDP+ELETFLL DE EDEL D D DDIDHLKEE E
Sbjct: 301 QLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVE--------- 360
Query: 361 NNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVR 420
EK IKQHN E N SS FQIP+RP++DTK LVCNLI E+R+ VVI++ EE KRVY+R
Sbjct: 361 QYEKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMR 420
Query: 421 SDLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCL 476
DLWKRVDS AID+M G+DLK E+DGW N E RGEIA+EIE+AIFSLLVEEMQ+ELHCL
Sbjct: 421 QDLWKRVDSNAIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCL 472
BLAST of Tan0003391 vs. ExPASy TrEMBL
Match:
A0A5D3DNQ5 (Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003580 PE=4 SV=1)
HSP 1 Score: 657.9 bits (1696), Expect = 3.2e-185
Identity = 363/481 (75.47%), Postives = 396/481 (82.33%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFC 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHF HL KPISHS DF KFC
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHF-HLKNPKPISHSPDFSAKFC 60
Query: 61 KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 120
++ CFFSF HSPDL N SPLF F SPVK+P R+PNP+F HVPARTAGLLLEAALRIQKQS
Sbjct: 61 RSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQS 120
Query: 121 TPARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAI--NK 180
T ARSKS KSNGLGL GSFLKR THR RSRKREI+GD R NDPR PPLP KMAI N+
Sbjct: 121 TAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE 180
Query: 181 NENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVND 240
ENDSV SNVT FDFCESN CDSPFRFVLQSS S GHRTPE SSP +SPAR DHQ ND
Sbjct: 181 KENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAND 240
Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKH 300
VESL+KLP EDEEEEKEQSSPVSVLDPPFEDD+EG++EDGEDEDDY+LERS+AIVQKAKH
Sbjct: 241 VESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKH 300
Query: 301 QLLKKLRRFERLAELDPVELETFLLKDEEDELDD--DDDDIDHLKEE-EEYTSHNFDPSN 360
QLLKKLRRFERLAELDP+ELETFLL DE+ + D+ D DDIDHLKEE EEY
Sbjct: 301 QLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY--------- 360
Query: 361 NEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRS 420
EK IKQHN E N SS FQ +RP++DTK LVCNLI E+R+ V I++REE KRVY+R
Sbjct: 361 -EKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRP 420
Query: 421 DLWKRVDSRAIDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLT 476
DLWKRVDS AIDVM G+DLK E+DGW RN E RGEI IEIE+AIFSLLVEEMQ+ELHCL
Sbjct: 421 DLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLA 468
BLAST of Tan0003391 vs. ExPASy TrEMBL
Match:
A0A6J1G0G0 (uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC111449564 PE=4 SV=1)
HSP 1 Score: 609.8 bits (1571), Expect = 1.0e-170
Identity = 346/473 (73.15%), Postives = 369/473 (78.01%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPSPKSH LHLNKRKPISH SDFP FCK
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCK 60
Query: 61 TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
ACF SF SPDLRNPSPLF+F SPVKSP RN N +FLHVPA TAGLLLEAALRIQKQST
Sbjct: 61 GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST 120
Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNEN 180
AR SNG GL GSFLKRFTHRGRSRKREI+G CRRNDPR LPP NE
Sbjct: 121 AAR------SNGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPP-----INEK 180
Query: 181 DSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDVES 240
DSVS QSNVTS DFCE SPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDVES
Sbjct: 181 DSVSRQSNVTSSDFCE-----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVES 240
Query: 241 LKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQLL 300
LKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY++ERSYAIV+KAKHQLL
Sbjct: 241 LKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQLL 300
Query: 301 KKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKHIK 360
KKLRRFERLAELDPVELETFLLKDEE EL DDDDIDHLK EEE SHNFD SNNEK +K
Sbjct: 301 KKLRRFERLAELDPVELETFLLKDEEGEL--DDDDIDHLK-EEECESHNFDRSNNEKDMK 360
Query: 361 QHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWKRV 420
QH ++ N +RVY+R DLWK V
Sbjct: 361 QHGIDGN---------------------------------------VERVYMRWDLWKEV 414
Query: 421 DSRAIDVMAGQDLKAEL-DGWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC 473
+S AIDVMAG+DL+AE+ DGW RNGE RG+IAIEIE+ IF LLVEEMQTE+ C
Sbjct: 421 ESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC 414
BLAST of Tan0003391 vs. ExPASy TrEMBL
Match:
A0A6J1L3C1 (uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735 PE=4 SV=1)
HSP 1 Score: 599.4 bits (1544), Expect = 1.3e-167
Identity = 343/475 (72.21%), Postives = 372/475 (78.32%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKFCK 60
MAQKHLHELLKEDQEPFLLTNFIA+RR +LKRPSPKSH LHLNK KPISH +DFP FCK
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIANRR-VLKRPSPKSHLLHLNKPKPISHFADFPASFCK 60
Query: 61 TACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
ACF SF HSPDLRNPSPLF+F SPVKSP RN N +FLHVPA TA LLLEAALRIQKQST
Sbjct: 61 GACFLSFNHSPDLRNPSPLFQFQSPVKSPCRNSNAMFLHVPATTARLLLEAALRIQKQST 120
Query: 121 PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRRNDPRGSPPLPPKMAINKNE- 180
PAR SNG GL GSFLKRFT+RGRSRKREI+G CRRNDP + KMAIN+NE
Sbjct: 121 PAR------SNGFGLLGSFLKRFTYRGRSRKREIDGGCRRNDPSTA-----KMAINENEN 180
Query: 181 -NDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSPSAGHRTPEFSSPATSPARNDHQVNDV 240
NDSVS QSNVTS S+FCDSPFRFVLQSSPSAGHRTPEFSSP +SPAR+DHQVNDV
Sbjct: 181 GNDSVSRQSNVTS-----SDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARDDHQVNDV 240
Query: 241 ESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYAIVQKAKHQ 300
ESLKKLPV+DEEEEKEQSSPVSVLDPPFEDD EG YEDGED+DDY +ERSYAIVQKAKHQ
Sbjct: 241 ESLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYKMERSYAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSHNFDPSNNEKH 360
LLKKLRRFERLAELDPVELETFLLKDEE +LDDD DHL EEEE SHNFD SNNEK
Sbjct: 301 LLKKLRRFERLAELDPVELETFLLKDEEGKLDDDG---DHL-EEEECKSHNFDRSNNEKD 360
Query: 361 IKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREEMRKRVYVRSDLWK 420
+KQH +E+N +RVY+R DLWK
Sbjct: 361 MKQHGIESN---------------------------------------VERVYMRWDLWK 415
Query: 421 RVDSRAIDVMAGQDLKAELD-GWIRNGEQRGEIAIEIELAIFSLLVEEMQTELHC 473
V+S AIDVMA +DL+AE+D GW RNGE+RG+IAIEIE+ IF LLVEEMQTE+ C
Sbjct: 421 EVESSAIDVMAEEDLRAEVDVGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDC 415
BLAST of Tan0003391 vs. TAIR 10
Match:
AT5G03670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins in 104 species: Archae - 0; Bacteria - 18; Metazoa - 333; Fungi - 60; Plants - 73; Viruses - 24; Other Eukaryotes - 192 (source: NCBI BLink). )
HSP 1 Score: 244.2 bits (622), Expect = 2.1e-64
Identity = 207/547 (37.84%), Postives = 295/547 (53.93%), Query Frame = 0
Query: 2 AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHL--NKRKPISHSSDFPRKFC 61
+Q+HL +LL+EDQEPF L ++I+DRR + +H HL KR+PIS ++ P +FC
Sbjct: 3 SQRHLKDLLEEDQEPFQLQSYISDRRCQI-----NAHVTHLQVKKRRPISQNAGLPSRFC 62
Query: 62 KTACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQS 121
+ ACFFS SPD + SPLFE +KSP+R+ N IF+++PARTA +LLEAA+RIQKQS
Sbjct: 63 RNACFFSLRESPDPKK-SPLFE----LKSPNRSQNAIFVNIPARTASILLEAAVRIQKQS 122
Query: 122 TP-ARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREING---------DCRRNDPRGSPPL 181
+ +++++ N G+FGS LK+ T+R +KREI+G ++ R P+
Sbjct: 123 SEVSKTRTRNAGNAFGIFGSVLKKLTNR---KKREISGGKEAGRVSSSSVKDMLRWESPV 182
Query: 182 PPKMAINK---NENDSVSPQS---------------------NVT--------------- 241
K+ K NE ++ S Q+ +VT
Sbjct: 183 VRKIVTRKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFETSIS 242
Query: 242 -------SFDFC----------ESNFCDSPFRFVLQSSPS-AGHRTPEFSSPATSPARND 301
S +F + FC+SPF FVLQ+ PS G RTP FSSPA SP +
Sbjct: 243 TSSRSNGSDEFAMMMNGQDLSEDKRFCESPFHFVLQTMPSNGGFRTPNFSSPAASPRHDC 302
Query: 302 HQVN----DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERS 361
H++ +VE LKKL +E+EEEEKEQSSPVSVLDPPF+DD+E + DD ++ S
Sbjct: 303 HEMEKESYEVEKLKKLEMEEEEEEKEQSSPVSVLDPPFQDDDEDIH-----MDDNNIPSS 362
Query: 362 YAIVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELDDDDDDIDHLKEEEEYTSH 421
+ VQKAKH LL+KL RFE+LA LDP+ELE + D+E E ++++ EEEE S
Sbjct: 363 FRSVQKAKHLLLQKLCRFEQLAGLDPMELEK-RMSDQETEEEEEE-------EEEEMKSL 422
Query: 422 NFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREE--- 471
++ +K + E ++P + + L+ +L A E P ID E
Sbjct: 423 YHCEIITQRVLKTYFEE-----MVEVP----EGVEALISDLAAEEL--PSDIDGEAEAAI 482
BLAST of Tan0003391 vs. TAIR 10
Match:
AT2G36420.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606 proteins in 440 species: Archae - 8; Bacteria - 365; Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses - 212; Other Eukaryotes - 4176 (source: NCBI BLink). )
HSP 1 Score: 202.2 bits (513), Expect = 9.2e-52
Identity = 180/488 (36.89%), Postives = 264/488 (54.10%), Query Frame = 0
Query: 3 QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKSHFLHLNKRKPISHSSDFPRKF-CKT 62
+KHLHE L++DQEPF L ++I + RS + + + KRK + ++ P F C+
Sbjct: 7 KKHLHEFLEDDQEPFHLNHYIGNLRSQMGCSD-----MRVKKRKSDNVATFPPGLFSCEN 66
Query: 63 ACFFSFTHSPDLRNPSPLFEFHSPVKSPSRNPNPIFLHVPARTAGLLLEAALRIQKQST- 122
+CFF+ SPD R SPLFE SP K R+ +FL +PARTA +LL+AA RIQKQ +
Sbjct: 67 SCFFAAHKSPDPRK-SPLFELRSPGKKKIRD-GRVFLQIPARTAAILLDAAARIQKQQSE 126
Query: 123 -PARSKSLPKSNGLGLFGSFLKRFTHRGRSRKREINGDCRR-NDPRGSPPLPPKMAINKN 182
+K+ + NG G+FGS LK T+R ++ R N D + RGS P
Sbjct: 127 KAKTNKARTRGNGFGMFGSVLKLLTYR-ITKPRLDNADGNAVSLERGSEP---------- 186
Query: 183 ENDSVSPQSNVTSFDFCESNFCDSPFRFVLQSSP-SAGHRTPEFSSPATSPAR---NDHQ 242
S + V D C FC+SPF FVLQ++P S+GH+TP F+S ATSPAR D
Sbjct: 187 -TSSSRRERIVEISDKC---FCESPFHFVLQTTPSSSGHQTPHFTSTATSPARRSTEDED 246
Query: 243 VNDVESLKKLPVED----EEEEKEQSSPVSVLDPPFEDDNEGHYEDGEDEDDYDLERSYA 302
++ ESL+K+ ++ EEE+KEQ SPVSVLDP E++ + + E + +L S+
Sbjct: 247 SDETESLEKVRGQEEEDKEEEDKEQCSPVSVLDPLEEEEEDEDHHQHEPDPPNNLSCSFE 306
Query: 303 IVQKAKHQLLKKLRRFERLAELDPVELETFLLKDEEDELD-----DDDDDIDHLKEEEEY 362
IVQ+AK +LLKKLRRFE+LA LDPVELE + ++E++E + ++DD+I +EEY
Sbjct: 307 IVQRAKRRLLKKLRRFEKLAGLDPVELEGKMSEEEDEEEEEYEESEEDDNIRIYDSDEEY 366
Query: 363 TSHNFDPSNNEKHIKQHNVEANGSSSFQIPHRPAKDTKRLVCNLIAGEKRDPVVIDEREE 422
+ EA S R A+D KR ++ +
Sbjct: 367 EDVD---------------EAMARES-----RCAEDEKR-----------------KKND 426
Query: 423 MRKRVYVRSDLWKRVDSRA---IDVMAGQDLKAELDGWIRNGEQRGEIAIEIELAIFSLL 471
R++ + + W RV A +D + +DL+ E W R+G + E ++E +IF +L
Sbjct: 427 ERQKKWRMMNAW-RVGLGAEEDVDAVVRKDLREEAGEWTRHGGEVEEAVSDLEHSIFFVL 434
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_038903007.1 | 9.1e-203 | 80.34 | uncharacterized protein LOC120089713 [Benincasa hispida] | [more] |
XP_022144766.1 | 8.0e-191 | 78.59 | uncharacterized protein LOC111014376 [Momordica charantia] | [more] |
XP_011651995.1 | 6.8e-190 | 76.35 | uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical ... | [more] |
KAA0043909.1 | 6.6e-185 | 75.47 | histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. mak... | [more] |
XP_022945267.1 | 2.1e-170 | 73.15 | uncharacterized protein LOC111449564 [Cucurbita moschata] >KAG7028088.1 hypothet... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1CUE0 | 3.9e-191 | 78.59 | uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014... | [more] |
A0A0A0LAR8 | 3.3e-190 | 76.35 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1 | [more] |
A0A5D3DNQ5 | 3.2e-185 | 75.47 | Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. m... | [more] |
A0A6J1G0G0 | 1.0e-170 | 73.15 | uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC1114495... | [more] |
A0A6J1L3C1 | 1.3e-167 | 72.21 | uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735... | [more] |
Match Name | E-value | Identity | Description | |
AT5G03670.1 | 2.1e-64 | 37.84 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT2G36420.1 | 9.2e-52 | 36.89 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |