Cla97C04G073010 (gene) Watermelon (97103) v2.5

Overview
NameCla97C04G073010
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
LocationCla97Chr04: 20400150 .. 20403242 (+)
RNA-Seq ExpressionCla97C04G073010
SyntenyCla97C04G073010
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCGCCGCCGTCCACCGTCGTCATTAGAAAAATGGCTCTCTTACGCACAAAATCCCGCTTGACAATCCCAGCTCCGCCGCCGTCTCCAATCCCAACCGCCACTGGATCGCGCTCGGCGGCCAATGAAACCTTCAAAACTTTCCTCGAGAACTCGATTCACCTGCCGCAGCTCTCTTTGCCGGAATCTCGCTTCGTCTCCGGCGCCAATCCTGCCCCCGCCGTCCTCGATTTCCGATCGCTCTTTTCTCCCGGTGGCGGTGATTTGGCGGCGCGGATGCTCCGGTCCGTGAATGAATTCGGCGCGTTTCGGATTGTTAATCACGGGATATCCGGCGAGGAGATTTTGTCGGTGGTGAATGAAGCTAAATCCGTGTTGGAAGATTGTAATAAGGGAGTTGATGATCGGAGCTGGGTCGACGACGACGGAAATCGGGAGGCGATCTTGCAGGTACGGCGGCGGAATGACAGCGAGCCGTCGGGAAATACAGTTGTACAGTCCGAAACGAACCGCCAAATCAGGTAAAAATCCGAGACCATCCGATGTCCTTGATAGTTTCCGATATTTTGAAATGGGCTGGGCTTAGCTGTAAGTGGCCCAAGTAAGGCCCACTTTTTACGGAAATTTTTTAAAAAGAAAATTTGTTTATTAGTATGAATTTTAATAATATAATCAAGTAGGTTGATCTAATTTCATTTTTTTCTAAATTTTTTGGTACGATTTTTTTGAGAAAAATTATTTCATGTCAAATTAACATATTTTTGTTTTTATGTCATAAAATTTACTTTAGCGGCTATTTATATATTATAGTACTTGTATACAATTGTACGTAGTATACAATTGTAATTCTTATAATTACATAGTGTAAATTTTTTTTTTCTTAATCTAAAATTAGTCAACTAATTTTAAAAAGAAACGGTTTCAAATATAGCAATCAAGCTCTTGAAAAGAATCTACAGACATAACAAAATTTAAATTCAGCTCTCGAAGTCTATCCATGATAGACTATATCGCGGATAGGAATCATTGATAGAGTTTGTCATCGATAAATTTGGCTATATTTGTAATTTTTTAAAAATGTTGTTATACACTTAATTATTAACCCAAAAATTACTACCCATTACAAGTACCCTTTTTAAAAACCTTTCGCAATTTTAGTCAATCTAGTCTTTACTTTTTTCTTTTTCCATGTTTCGTTATCATTTACTTCTCAGTTATTATTAATTTATTTTTTTTCGTAAATTTTTAATTAAGATAAAAAAAAAAAACCAAATCCAAATATTAATATTCTAAAGTTAGAAAGTATGTAGTTTGAAACTCAATTAAAAGAATATTCCTACCAACCCAATACATAATAACTATTTGAGTATGTGTTTAATTAATCAGTCGGGTTTGGTTTGGTTTGATTTGGTTTGACTTGATTTACAGCGAAAAGATGGAGAGAATAAGAAGCAAACTAGAAGTCATTGCAGAGAAATTAAGTGAACTTTTATGGGAATGCATGGGAGAGAAGGTGAAGAAAGGAGATAGAAAAGAGACAATTTTTAGCATTTACAAATACAATAATCAAAATATATATGAGAAGGAAAATAAAAATAAAATTTCAAAGAACGAGAAAGAAGAAAGAGAAAGTGATGATGGGAATGTGACGATGAGTCTTCACATCCCAGGAGAGCATTGCCAATTCTATGTTAATTCTCATCAACAACACTCTTTTTGCTTTGATGCTGCTGCTGATACCATTGTTGTCACCATTGCCAAACAACTTCAGGTAATTTTATCATACTCCTCTCTCTCATAACCTTTTGTTTTTTATTTTTTTAATTTGAATTTGGCTAAAAAATCAACTATTTTATTTAAGATTGATACAAACCATTTTAAGAAAATGAGAGTAAATACGCTTAATTTTCAAAAACTAAAAGTAAAAAATAAAATGAGTCACCGAACAAGATCTTAACTTTTCTTTTTTTTTTTTTTTTTTTTTTTGTATTTGGCTAAAAATTCAAAGTTTTCTTAAAAATAATAATAATAAAAAAGGTGGCAACCATAATAATGATAAAAAAAAAAAATCTTAAAGAATCAAGCAAAATTATATATATATTAAAAAGACGGAAGTCACAATTTGTTAACTATTTGATTTTTTATTTTTTAAAATTAAGTTTATTTTCTCTCAACTTTTTATAATGATTTTCATATATATATATTAAATAAAAGAGTTGATTGCTTAACTAAAATTTCAAAAACAATTCTAGTTTTCAATATCTGACTTGGTTTTTGAAAACATTAATAGAAAGTAAATAACGAAGCAAAAAATTTAGAGTTAAAAGAAGTATTTATAAGCTTGATTTTAAAAATTAAAAATAAAAAATCAAATTATCACTAAATGAGGTGAAAATTATTATGAAATAGAGCCTTAGATTCAATTTGTTGGGTTCTTACCAGCCTAAAAAAGATGTTCCAAATTGTGACTTATGCTCTTATCTTTTGAAATAAATAAAAATGAGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGCATTCTTCCCTAATTCTTTCCAATCAGATTATTACACACTTTTCTTTTTCCAATTACATACCCTCATTTCAAAAAAAGACTTACAAGATTCTTCAAACATTTTGGAATTAAAATGCTATTGAAAGGGTTAAAATAACTTTTATCTTGTTCAAAATTATTTAGAAACATATTTTTAATATTCAAATATAAGTTTAATGTTTGATTTTGGAAATCAAAATTACTATCAAACATGCATTTGGAAATGTCTATAGTAGAAATCGACTTCAAATTTTGCATAATAATAATTGATTGTGATATGGTTTTGGTAGGAATTGAGCTTGGGAAAATTGAAGAGTGCAAGAAGTGAGATGATATTTGTGCCAGATTTGCTTGGAAGCAAAACCTCTTTCTCTATTGAGCTCAAATTTTCAAACCCAAATTTATTGAATAATCATTCCCATTCTAAAATCATCTCCATTTCTGATCAAATCTTCATTGCCTTTCTTCTTATTTCCCTTTACTTTCTTTACACTTACATTTCTTCATTCTTTAAAGGCAAATAA

mRNA sequence

ATGCCGCCGCCGTCCACCGTCGTCATTAGAAAAATGGCTCTCTTACGCACAAAATCCCGCTTGACAATCCCAGCTCCGCCGCCGTCTCCAATCCCAACCGCCACTGGATCGCGCTCGGCGGCCAATGAAACCTTCAAAACTTTCCTCGAGAACTCGATTCACCTGCCGCAGCTCTCTTTGCCGGAATCTCGCTTCGTCTCCGGCGCCAATCCTGCCCCCGCCGTCCTCGATTTCCGATCGCTCTTTTCTCCCGGTGGCGGTGATTTGGCGGCGCGGATGCTCCGGTCCGTGAATGAATTCGGCGCGTTTCGGATTGTTAATCACGGGATATCCGGCGAGGAGATTTTGTCGGTGGTGAATGAAGCTAAATCCGTGTTGGAAGATTGTAATAAGGGAGTTGATGATCGGAGCTGGGTCGACGACGACGGAAATCGGGAGGCGATCTTGCAGGTACGGCGGCGGAATGACAGCGAGCCGTCGGGAAATACAGTTGTACAGTCCGAAACGAACCGCCAAATCAGCGAAAAGATGGAGAGAATAAGAAGCAAACTAGAAGTCATTGCAGAGAAATTAAGTGAACTTTTATGGGAATGCATGGGAGAGAAGGTGAAGAAAGGAGATAGAAAAGAGACAATTTTTAGCATTTACAAATACAATAATCAAAATATATATGAGAAGGAAAATAAAAATAAAATTTCAAAGAACGAGAAAGAAGAAAGAGAAAGTGATGATGGGAATGTGACGATGAGTCTTCACATCCCAGGAGAGCATTGCCAATTCTATGTTAATTCTCATCAACAACACTCTTTTTGCTTTGATGCTGCTGCTGATACCATTGTTGTCACCATTGCCAAACAACTTCAGGAATTGAGCTTGGGAAAATTGAAGAGTGCAAGAAGTGAGATGATATTTGTGCCAGATTTGCTTGGAAGCAAAACCTCTTTCTCTATTGAGCTCAAATTTTCAAACCCAAATTTATTGAATAATCATTCCCATTCTAAAATCATCTCCATTTCTGATCAAATCTTCATTGCCTTTCTTCTTATTTCCCTTTACTTTCTTTACACTTACATTTCTTCATTCTTTAAAGGCAAATAA

Coding sequence (CDS)

ATGCCGCCGCCGTCCACCGTCGTCATTAGAAAAATGGCTCTCTTACGCACAAAATCCCGCTTGACAATCCCAGCTCCGCCGCCGTCTCCAATCCCAACCGCCACTGGATCGCGCTCGGCGGCCAATGAAACCTTCAAAACTTTCCTCGAGAACTCGATTCACCTGCCGCAGCTCTCTTTGCCGGAATCTCGCTTCGTCTCCGGCGCCAATCCTGCCCCCGCCGTCCTCGATTTCCGATCGCTCTTTTCTCCCGGTGGCGGTGATTTGGCGGCGCGGATGCTCCGGTCCGTGAATGAATTCGGCGCGTTTCGGATTGTTAATCACGGGATATCCGGCGAGGAGATTTTGTCGGTGGTGAATGAAGCTAAATCCGTGTTGGAAGATTGTAATAAGGGAGTTGATGATCGGAGCTGGGTCGACGACGACGGAAATCGGGAGGCGATCTTGCAGGTACGGCGGCGGAATGACAGCGAGCCGTCGGGAAATACAGTTGTACAGTCCGAAACGAACCGCCAAATCAGCGAAAAGATGGAGAGAATAAGAAGCAAACTAGAAGTCATTGCAGAGAAATTAAGTGAACTTTTATGGGAATGCATGGGAGAGAAGGTGAAGAAAGGAGATAGAAAAGAGACAATTTTTAGCATTTACAAATACAATAATCAAAATATATATGAGAAGGAAAATAAAAATAAAATTTCAAAGAACGAGAAAGAAGAAAGAGAAAGTGATGATGGGAATGTGACGATGAGTCTTCACATCCCAGGAGAGCATTGCCAATTCTATGTTAATTCTCATCAACAACACTCTTTTTGCTTTGATGCTGCTGCTGATACCATTGTTGTCACCATTGCCAAACAACTTCAGGAATTGAGCTTGGGAAAATTGAAGAGTGCAAGAAGTGAGATGATATTTGTGCCAGATTTGCTTGGAAGCAAAACCTCTTTCTCTATTGAGCTCAAATTTTCAAACCCAAATTTATTGAATAATCATTCCCATTCTAAAATCATCTCCATTTCTGATCAAATCTTCATTGCCTTTCTTCTTATTTCCCTTTACTTTCTTTACACTTACATTTCTTCATTCTTTAAAGGCAAATAA

Protein sequence

MPPPSTVVIRKMALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSLPESRFVSGANPAPAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCNKGVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERIRSKLEVIAEKLSELLWECMGEKVKKGDRKETIFSIYKYNNQNIYEKENKNKISKNEKEERESDDGNVTMSLHIPGEHCQFYVNSHQQHSFCFDAAADTIVVTIAKQLQELSLGKLKSARSEMIFVPDLLGSKTSFSIELKFSNPNLLNNHSHSKIISISDQIFIAFLLISLYFLYTYISSFFKGK
Homology
BLAST of Cla97C04G073010 vs. NCBI nr
Match: XP_038881407.1 (uncharacterized protein LOC120072944, partial [Benincasa hispida])

HSP 1 Score: 544.7 bits (1402), Expect = 6.2e-151
Identity = 293/351 (83.48%), Postives = 314/351 (89.46%), Query Frame = 0

Query: 21  LTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSLPESRFVSGANPAPAVLDFRS 80
           + IPAPPPSPIPT TGSRSAANETFKTFLE SIHLPQLSLPESRF+SG NP PAV+DFRS
Sbjct: 1   IIIPAPPPSPIPTGTGSRSAANETFKTFLEKSIHLPQLSLPESRFLSGVNPTPAVVDFRS 60

Query: 81  LFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCNKGVDDRSWVD 140
           L SPGGG+ AARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCNKGVDDR W +
Sbjct: 61  LVSPGGGEAAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCNKGVDDRGWFE 120

Query: 141 DDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERIRSKLEVIAEKLSELLWECMG 200
           +DGNREAILQ+RRRNDS+ S NT+V +ETNRQIS KMERIRSKLE IAEKLSE+LW+CMG
Sbjct: 121 NDGNREAILQLRRRNDSKASENTIVPAETNRQISGKMERIRSKLEGIAEKLSEILWKCMG 180

Query: 201 EKVKKGDRKETIFSIYKY-NNQNIYEKE--NKNKISKNEKEERESDDGNVTMSLHIPGEH 260
           E VKK D+KE IFSIY+Y NNQNI E+E  N N I+KN+KEERE +DGN  M LHIPGEH
Sbjct: 181 ENVKKRDKKEAIFSIYRYNNNQNILERENNNNNNIAKNDKEERE-NDGNEMMRLHIPGEH 240

Query: 261 CQFYVNSHQQH--SFCFDAAADTIVVTIAKQLQELSLGKLKSARSEMIFVPDLLGSKTSF 320
           CQFYVNSHQQ   SFCFDAAADTIVVTI KQLQELSLGKLKSARSEMIF+ DLLGSK SF
Sbjct: 241 CQFYVNSHQQEQPSFCFDAAADTIVVTIGKQLQELSLGKLKSARSEMIFLTDLLGSKASF 300

Query: 321 SIELKFSNPNLLNNH-SHSKIISISDQIFIAFLLISLYFLYTYISSFFKGK 366
           SIELK SNPNLL NH SHSKIISISDQIFIAFL +SLYFLYT+ISSFFKGK
Sbjct: 301 SIELKISNPNLLKNHNSHSKIISISDQIFIAFLFLSLYFLYTHISSFFKGK 350

BLAST of Cla97C04G073010 vs. NCBI nr
Match: XP_008463252.1 (PREDICTED: uncharacterized protein LOC103501456 [Cucumis melo])

HSP 1 Score: 521.2 bits (1341), Expect = 7.4e-144
Identity = 288/376 (76.60%), Postives = 320/376 (85.11%), Query Frame = 0

Query: 1   MPPPSTVVIRKMALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSL 60
           MPPPSTVV+RKMALLRTKSRLTIPAPPPSPIPTATGSRSA NETFKTFLENS HLPQLSL
Sbjct: 1   MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSL 60

Query: 61  PESRFVSGANPAPAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVN 120
           PESRF SG N  PAV+DFRSL S G G+  ARMLRSVNEFGAFRIVNHGISGEE+LSVVN
Sbjct: 61  PESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGISGEEVLSVVN 120

Query: 121 EAKSVLEDCNKGVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERI 180
           EAKSVLED NKGVDDR W  DDGNREAILQVRR NDSE SGNTVV++ETNR+ISEKME+I
Sbjct: 121 EAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKI 180

Query: 181 RSKLEVIAEKLSELLWECMGEKVKK-GDRKETIFSIYKYNN--QNIYE--KENKNKISKN 240
           R KLE I EKLSE+L   +GE V+K G++KETIFSIY+Y++   +++E  K++  K SKN
Sbjct: 181 RRKLEGIGEKLSEILCGFVGENVEKLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKN 240

Query: 241 EKEERESDDGNVTMSLHIPGEHCQFYVN----SHQQHSFCFDAAADTIVVTIAKQLQELS 300
           E+E     D  V M L IPGEHCQFYVN      +Q+S CFDAAADTIVVTI KQ QE+S
Sbjct: 241 ERE----SDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQEMS 300

Query: 301 LGKLKSARSEMIFVPDLLGSKTSFSIELKFSNPNLL----NNHSHSKIISISDQIFIAFL 360
           +GKLKSARSEMIFVPDLLG++TSFSI+LKFSNPNLL    NN+SHSKIISISDQIF+AFL
Sbjct: 301 IGKLKSARSEMIFVPDLLGTQTSFSIDLKFSNPNLLLSNNNNNSHSKIISISDQIFVAFL 360

Query: 361 LISLYFLYTYISSFFK 364
           L+SLYFLYTYISSFFK
Sbjct: 361 LLSLYFLYTYISSFFK 372

BLAST of Cla97C04G073010 vs. NCBI nr
Match: XP_011653719.1 (uncharacterized protein LOC101207912 [Cucumis sativus] >KAE8649590.1 hypothetical protein Csa_012180 [Cucumis sativus])

HSP 1 Score: 505.0 bits (1299), Expect = 5.5e-139
Identity = 284/379 (74.93%), Postives = 320/379 (84.43%), Query Frame = 0

Query: 1   MPPPSTVVIRKMALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSL 60
           MPPPST+ IRKM+LLRT+S LTIPAPPPSPIPT TGSRSAANETFKTFL+NS HLPQLSL
Sbjct: 1   MPPPSTLAIRKMSLLRTQSHLTIPAPPPSPIPTGTGSRSAANETFKTFLDNSTHLPQLSL 60

Query: 61  PESRFVSGANPAPAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVN 120
           PESRF S  NPAPAVLDF+SL S G  D+ ARMLRSV+EFGAFRIVNHGISGEE+LSVVN
Sbjct: 61  PESRFFSAHNPAPAVLDFQSLVSSGCADVVARMLRSVHEFGAFRIVNHGISGEEVLSVVN 120

Query: 121 EAK--SVLEDCNKGVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKME 180
           +AK  SVLED NKGVDDRSW  DDGNREAILQVRR NDSE SGNTVV++ETNR+IS+KME
Sbjct: 121 QAKSVSVLEDSNKGVDDRSWDGDDGNREAILQVRRLNDSEVSGNTVVEAETNREISQKME 180

Query: 181 RIRSKLEVIAEKLSELLWECMGEKVKK-GDRKETIFSIYKYNNQN-----IYEKENKNKI 240
           +IR KLE I EKLSE+L   MGE V+K GD+KET+FSIY+YNN N     I+E+EN +  
Sbjct: 181 KIRRKLEGIGEKLSEILCGFMGENVEKLGDKKETMFSIYRYNNNNNRPNDIFERENDHN- 240

Query: 241 SKNEKEERESDDGNVTMSLHIPGEHCQFYV--NSHQQHSF--CFDAAADTIVVTIAKQLQ 300
           +K  K ERE D+ +V M L IPGEHCQFYV  + HQQ  +  CFDAAADTIVVTI KQ Q
Sbjct: 241 TKISKSEREGDE-SVMMKLEIPGEHCQFYVSYSCHQQKQYTRCFDAAADTIVVTIGKQFQ 300

Query: 301 ELSLGKLKSARSEMIFVPDLLGSKTSFSIELKFSNPNLL----NNHSHSKIISISDQIFI 360
           E+S+GKLKSARSEMIFVPDLLG++TSFSI+LKFSNPNLL    NN+SHSK+ISISDQIF 
Sbjct: 301 EMSMGKLKSARSEMIFVPDLLGTQTSFSIDLKFSNPNLLLNNNNNNSHSKVISISDQIFF 360

Query: 361 AFLLISLYFLYTYISSFFK 364
           AFLL+SL+FLYT ISSFFK
Sbjct: 361 AFLLLSLHFLYTCISSFFK 377

BLAST of Cla97C04G073010 vs. NCBI nr
Match: XP_022950423.1 (uncharacterized protein LOC111453527 [Cucurbita moschata])

HSP 1 Score: 470.3 bits (1209), Expect = 1.5e-128
Identity = 262/359 (72.98%), Postives = 285/359 (79.39%), Query Frame = 0

Query: 12  MALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSLPESRFVSGANP 71
           MAL RTKSRLTIPAPPPSPIPT TGSRSAANETFK FLE SIHLPQLSLPESRF+S  NP
Sbjct: 1   MALSRTKSRLTIPAPPPSPIPTGTGSRSAANETFKQFLEKSIHLPQLSLPESRFISTTNP 60

Query: 72  APAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCNK 131
           + AV+DFRSL SP GGD  ARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSV ED   
Sbjct: 61  SLAVIDFRSLASPSGGDATARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVWED--- 120

Query: 132 GVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERIRSKLEVIAEKL 191
             DDRSW  D GNRE + QVRRRNDSE S NTVVQ+ TNRQISEKME+IRSKLE I EK+
Sbjct: 121 --DDRSWAGDVGNREVVFQVRRRNDSEASENTVVQAATNRQISEKMEKIRSKLEGIEEKI 180

Query: 192 SELLWECMGEKVKKGDRKETIFSIYKYNNQ----NIYEKENKNKISKNEKEERESDDGNV 251
           SE LW+CMGE +KKGD+KETIFSIY+YNN     N  E+ENKNK                
Sbjct: 181 SERLWKCMGENMKKGDKKETIFSIYRYNNNHQNPNPCERENKNK---------------N 240

Query: 252 TMSLHIPGEHCQFYVNSHQQ-HSFCFDAAADTIVVTIAKQLQELSLGKLKSARSEMIFVP 311
            MSLHIP EHCQF +NSHQQ  SF FDAAADTIVVT+ +QL E SLGKLKSARSEM FVP
Sbjct: 241 MMSLHIPVEHCQFSINSHQQPSSFSFDAAADTIVVTLGEQLVEQSLGKLKSARSEMTFVP 300

Query: 312 DLLGSKTSFSIELKFSNPNLLNNHSHSKIISISDQIFIAFLLISLYFLYTYISSFFKGK 366
           DLLGS TSFSIEL+ SN NL+  HSHSKII+I DQI +AF +IS+Y LY ++ S FKGK
Sbjct: 301 DLLGSGTSFSIELEVSNTNLVKKHSHSKIITIYDQILVAFYVISVYVLYNFMCSLFKGK 339

BLAST of Cla97C04G073010 vs. NCBI nr
Match: XP_023544633.1 (uncharacterized protein LOC111804157 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 462.6 bits (1189), Expect = 3.1e-126
Identity = 256/355 (72.11%), Postives = 287/355 (80.85%), Query Frame = 0

Query: 12  MALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSLPESRFVSGANP 71
           MAL RTKSRLTIPAPPPSPIPT TGSRSAANETFK FLE SIHLPQLSLPESRF+S  NP
Sbjct: 1   MALSRTKSRLTIPAPPPSPIPTGTGSRSAANETFKQFLEKSIHLPQLSLPESRFISTTNP 60

Query: 72  APAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCNK 131
           + AV+DFRSL SP GGD  ARMLRS NEFGAFRIVNHGISGEEILSVVNEAKSV ED   
Sbjct: 61  SLAVIDFRSLASPSGGDATARMLRSANEFGAFRIVNHGISGEEILSVVNEAKSVWED--- 120

Query: 132 GVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERIRSKLEVIAEKL 191
             DD+SW  D GNRE + QVRRRNDSE S NTVVQ+ TNRQISEKME++RSKLE I EK+
Sbjct: 121 --DDQSWAGDVGNREVVFQVRRRNDSEASENTVVQAATNRQISEKMEKLRSKLEGIEEKI 180

Query: 192 SELLWECMGEKVKKGDRKETIFSIYKYNNQNIYEKENKNKISKNEKEERESDDGNVTMSL 251
           SE LW+CMGE +KKGD+KE+IFS+Y+YNN +    +N N        ERE+   N+ MSL
Sbjct: 181 SERLWKCMGENMKKGDKKESIFSVYRYNNNH----QNPNPC------ERENKTKNM-MSL 240

Query: 252 HIPGEHCQFYVNSHQQ-HSFCFDAAADTIVVTIAKQLQELSLGKLKSARSEMIFVPDLLG 311
           HIPGEHCQF +NS+QQ  SF FDAAADTIVVT+ +QL E SLGKLKSARSEM FVPDLLG
Sbjct: 241 HIPGEHCQFSINSYQQPSSFTFDAAADTIVVTLGEQLVERSLGKLKSARSEMTFVPDLLG 300

Query: 312 SKTSFSIELKFSNPNLLNNHSHSKIISISDQIFIAFLLISLYFLYTYISSFFKGK 366
           S+TSFSIEL+ SN NL+  HSHSKII+I DQI +AF +IS+Y LY  + S FKGK
Sbjct: 301 SRTSFSIELEVSNRNLVKKHSHSKIITIYDQILVAFYVISVYVLYNCMCSLFKGK 339

BLAST of Cla97C04G073010 vs. ExPASy TrEMBL
Match: A0A1S3CIU6 (uncharacterized protein LOC103501456 OS=Cucumis melo OX=3656 GN=LOC103501456 PE=4 SV=1)

HSP 1 Score: 521.2 bits (1341), Expect = 3.6e-144
Identity = 288/376 (76.60%), Postives = 320/376 (85.11%), Query Frame = 0

Query: 1   MPPPSTVVIRKMALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSL 60
           MPPPSTVV+RKMALLRTKSRLTIPAPPPSPIPTATGSRSA NETFKTFLENS HLPQLSL
Sbjct: 1   MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSL 60

Query: 61  PESRFVSGANPAPAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVN 120
           PESRF SG N  PAV+DFRSL S G G+  ARMLRSVNEFGAFRIVNHGISGEE+LSVVN
Sbjct: 61  PESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGISGEEVLSVVN 120

Query: 121 EAKSVLEDCNKGVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERI 180
           EAKSVLED NKGVDDR W  DDGNREAILQVRR NDSE SGNTVV++ETNR+ISEKME+I
Sbjct: 121 EAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKI 180

Query: 181 RSKLEVIAEKLSELLWECMGEKVKK-GDRKETIFSIYKYNN--QNIYE--KENKNKISKN 240
           R KLE I EKLSE+L   +GE V+K G++KETIFSIY+Y++   +++E  K++  K SKN
Sbjct: 181 RRKLEGIGEKLSEILCGFVGENVEKLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKN 240

Query: 241 EKEERESDDGNVTMSLHIPGEHCQFYVN----SHQQHSFCFDAAADTIVVTIAKQLQELS 300
           E+E     D  V M L IPGEHCQFYVN      +Q+S CFDAAADTIVVTI KQ QE+S
Sbjct: 241 ERE----SDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQEMS 300

Query: 301 LGKLKSARSEMIFVPDLLGSKTSFSIELKFSNPNLL----NNHSHSKIISISDQIFIAFL 360
           +GKLKSARSEMIFVPDLLG++TSFSI+LKFSNPNLL    NN+SHSKIISISDQIF+AFL
Sbjct: 301 IGKLKSARSEMIFVPDLLGTQTSFSIDLKFSNPNLLLSNNNNNSHSKIISISDQIFVAFL 360

Query: 361 LISLYFLYTYISSFFK 364
           L+SLYFLYTYISSFFK
Sbjct: 361 LLSLYFLYTYISSFFK 372

BLAST of Cla97C04G073010 vs. ExPASy TrEMBL
Match: A0A6J1GEV0 (uncharacterized protein LOC111453527 OS=Cucurbita moschata OX=3662 GN=LOC111453527 PE=4 SV=1)

HSP 1 Score: 470.3 bits (1209), Expect = 7.2e-129
Identity = 262/359 (72.98%), Postives = 285/359 (79.39%), Query Frame = 0

Query: 12  MALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSLPESRFVSGANP 71
           MAL RTKSRLTIPAPPPSPIPT TGSRSAANETFK FLE SIHLPQLSLPESRF+S  NP
Sbjct: 1   MALSRTKSRLTIPAPPPSPIPTGTGSRSAANETFKQFLEKSIHLPQLSLPESRFISTTNP 60

Query: 72  APAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCNK 131
           + AV+DFRSL SP GGD  ARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSV ED   
Sbjct: 61  SLAVIDFRSLASPSGGDATARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVWED--- 120

Query: 132 GVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERIRSKLEVIAEKL 191
             DDRSW  D GNRE + QVRRRNDSE S NTVVQ+ TNRQISEKME+IRSKLE I EK+
Sbjct: 121 --DDRSWAGDVGNREVVFQVRRRNDSEASENTVVQAATNRQISEKMEKIRSKLEGIEEKI 180

Query: 192 SELLWECMGEKVKKGDRKETIFSIYKYNNQ----NIYEKENKNKISKNEKEERESDDGNV 251
           SE LW+CMGE +KKGD+KETIFSIY+YNN     N  E+ENKNK                
Sbjct: 181 SERLWKCMGENMKKGDKKETIFSIYRYNNNHQNPNPCERENKNK---------------N 240

Query: 252 TMSLHIPGEHCQFYVNSHQQ-HSFCFDAAADTIVVTIAKQLQELSLGKLKSARSEMIFVP 311
            MSLHIP EHCQF +NSHQQ  SF FDAAADTIVVT+ +QL E SLGKLKSARSEM FVP
Sbjct: 241 MMSLHIPVEHCQFSINSHQQPSSFSFDAAADTIVVTLGEQLVEQSLGKLKSARSEMTFVP 300

Query: 312 DLLGSKTSFSIELKFSNPNLLNNHSHSKIISISDQIFIAFLLISLYFLYTYISSFFKGK 366
           DLLGS TSFSIEL+ SN NL+  HSHSKII+I DQI +AF +IS+Y LY ++ S FKGK
Sbjct: 301 DLLGSGTSFSIELEVSNTNLVKKHSHSKIITIYDQILVAFYVISVYVLYNFMCSLFKGK 339

BLAST of Cla97C04G073010 vs. ExPASy TrEMBL
Match: A0A6J1ITI2 (uncharacterized protein LOC111478267 OS=Cucurbita maxima OX=3661 GN=LOC111478267 PE=4 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 2.8e-125
Identity = 256/359 (71.31%), Postives = 280/359 (77.99%), Query Frame = 0

Query: 12  MALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSLPESRFVSGANP 71
           MAL RTKSRLTIPAPPPSPIPT TGSRSAANETFK FLE SIHLPQLSLPESRF+S  NP
Sbjct: 1   MALSRTKSRLTIPAPPPSPIPTGTGSRSAANETFKQFLEKSIHLPQLSLPESRFISTTNP 60

Query: 72  APAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCNK 131
           + AV+DFRSL SP GGD  ARMLRS NEFGAFRIVNHGISGEEILSVVNEAKSV ED   
Sbjct: 61  SLAVIDFRSLASPSGGDATARMLRSANEFGAFRIVNHGISGEEILSVVNEAKSVWED--- 120

Query: 132 GVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERIRSKLEVIAEKL 191
             DDRSW  D GNR+ + QVRRRNDS+ S  TVVQ+ TNRQISEKME+IRSKLE I EK+
Sbjct: 121 --DDRSWAGDVGNRDVVFQVRRRNDSDASAITVVQAATNRQISEKMEKIRSKLEGIEEKI 180

Query: 192 SELLWECMGEKVKKGDRKETIFSIYKYNNQ---NIYEKENKNKISKNEKEERESDDGNVT 251
           SE LWECMGE +KKGD+KETIFSIY+YNN    N  E++NKNK                 
Sbjct: 181 SERLWECMGENMKKGDKKETIFSIYRYNNHQNPNPCERDNKNK---------------NM 240

Query: 252 MSLHIPGEHCQFYVNSHQQ--HSFCFDAAADTIVVTIAKQLQELSLGKLKSARSEMIFVP 311
           MSLHIPGEHCQF +NSHQQ   SF FDAAADTIVVT+ +QL E S  KLKSARSEM FVP
Sbjct: 241 MSLHIPGEHCQFSINSHQQPSSSFTFDAAADTIVVTLGEQLVERSSEKLKSARSEMTFVP 300

Query: 312 DLLGSKTSFSIELKFSNPNLLNNHSHSKIISISDQIFIAFLLISLYFLYTYISSFFKGK 366
           DLLGS TSFSIEL+ SN NL+  HSHS II+I DQI +AF +IS+Y LY  + S FKGK
Sbjct: 301 DLLGSGTSFSIELQVSNTNLVKKHSHSNIITIYDQIMVAFYVISVYVLYNCMCSLFKGK 339

BLAST of Cla97C04G073010 vs. ExPASy TrEMBL
Match: A0A6J1CSG7 (uncharacterized protein LOC111013795 OS=Momordica charantia OX=3673 GN=LOC111013795 PE=4 SV=1)

HSP 1 Score: 387.9 bits (995), Expect = 4.7e-104
Identity = 236/362 (65.19%), Postives = 267/362 (73.76%), Query Frame = 0

Query: 12  MALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLE-NSIHLPQLSLPESRFVSGAN 71
           MALLRTKSRLTIPAPPPSPIPT TGSRSAANETFK FLE  SI LPQLSLPESRFVSGAN
Sbjct: 1   MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGAN 60

Query: 72  PAPAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLEDCN 131
           P PA+LD+R L SP  GD  ARMLRS  EFGAFRIVNHGISGEEILSVV +AKS+LE   
Sbjct: 61  PLPALLDYRLLASP-DGDAVARMLRSAGEFGAFRIVNHGISGEEILSVVKDAKSILE--- 120

Query: 132 KGVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERIRSKLEVIAEK 191
               D S   +DG R AI+QVRRR     S ++V + E  R  S +ME++  K+E I EK
Sbjct: 121 ----DSSSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEK 180

Query: 192 LSELLWECMGEK-------VKKGDRKETIFSIYKYNNQNIYEKENKNKISKNEKEERESD 251
           LSE+L E MGE+        KK   KE I SI++YNN    ++   +   + E EERESD
Sbjct: 181 LSEILSESMGEEWGEDQKVKKKRGEKEAILSIFRYNNN---QQNQFDDGGERENEERESD 240

Query: 252 DGNVTMSLHIPGEHCQFYVNSHQQHSFCFDAAADTIVVTIAKQLQELSLGKLKSARSEMI 311
           + +V MSLHIP EHCQF VN H Q SFCFD+AADTIVVTI KQLQE S+GKLKSARS+MI
Sbjct: 241 E-SVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARSKMI 300

Query: 312 FVPDLLGSKTSFSIELKFSNPNLLNNHSHSKIISISDQIFIAFLLISLYFLYTYISSFFK 366
           FVP+  GS++ FSIELK S+P LL+N  HS IISISDQIFIA LL SLY LYTY SS FK
Sbjct: 301 FVPN--GSQSPFSIELKISHPKLLHN-PHSNIISISDQIFIALLLFSLYLLYTYTSSLFK 347

BLAST of Cla97C04G073010 vs. ExPASy TrEMBL
Match: A0A314Y0U8 (Uncharacterized protein OS=Prunus yedoensis var. nudiflora OX=2094558 GN=Pyn_34865 PE=4 SV=1)

HSP 1 Score: 216.1 bits (549), Expect = 2.5e-52
Identity = 159/364 (43.68%), Postives = 214/364 (58.79%), Query Frame = 0

Query: 12  MALLRTKSRLTIPAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSLP---ESRFVSG 71
           MA++R +SRLT  APPPSPIPTA GSRSA+NE F  FL+  + +P L+ P      F   
Sbjct: 1   MAVMRGRSRLTSGAPPPSPIPTAKGSRSASNENFTQFLDKCLQIPDLAWPPQFHPYFSGT 60

Query: 72  ANPAPAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVNEAKSVLED 131
            +P PA +D RSL S    D  AR+L S  EFGAFRI NHGIS EE+ SVV EA+SV   
Sbjct: 61  RHPVPADVDLRSLSS----DAIARLLVSAREFGAFRIANHGISAEELGSVVLEAESVFG- 120

Query: 132 CNKGVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKMERIRSKLEVIA 191
            N G   R +V+  GNRE I  VR       SG  VV+ E  R   + ME++ SK+E IA
Sbjct: 121 -NDGNLSRRFVERTGNREEIKWVRE------SGQKVVEDEKYRVFCKSMEKVASKVEAIA 180

Query: 192 EKLSELLWECMGEKVKKGDRKET-IFSIYKYNNQNIYEKEN------KNKISKNEKEERE 251
           E++SE+L+    + V+K  R E     +Y+YN+++   ++N      +N+I  N    RE
Sbjct: 181 EQVSEVLFANAEKHVEKTMRSELGKVRLYRYNHEDHSMEQNPSSNYLQNEII-NGNNLRE 240

Query: 252 SDDGNVTMSLHIPGEHCQFYVNSH-QQHSFCFDAAADTIVVTIAKQLQELSLGKLKSARS 311
            +D    + LH+P EH QF + S  +  S CFDA  +T+VVT+  QL+       K    
Sbjct: 241 CEDH--ALCLHLPLEHSQFNIRSEGEGGSLCFDAGPETLVVTVGNQLE-----GFKCVSG 300

Query: 312 EMIFVPDLLGSKTSFSIELKFSNPNLLNNHSHSKIISISDQIFIAFLLISLY----FLYT 361
           EMIFVPD++ S+ SFSI+LK   P L N+   S  +SI+DQ  IA +L  LY    F+YT
Sbjct: 301 EMIFVPDIIRSQASFSIQLKV--PLLSNSRKKSNTVSIADQFVIAVILCLLYMIFVFVYT 342

BLAST of Cla97C04G073010 vs. TAIR 10
Match: AT2G38500.1 (2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein )

HSP 1 Score: 198.4 bits (503), Expect = 1.0e-50
Identity = 147/358 (41.06%), Postives = 207/358 (57.82%), Query Frame = 0

Query: 12  MALLRTKSRLTI-----PAPPPSPIPTATGSRSAANETFKTFLENSIHLPQLSLPESRFV 71
           MAL+RT+S+L +     P PPPSPIP A GSR AA+E     +E SI +P+L+LPES   
Sbjct: 1   MALMRTRSQLNVSSLTPPPPPPSPIPRARGSRCAASEILTEIIERSIQVPELTLPESH-- 60

Query: 72  SGANPA------PAVLDFRSLFSPGGGDLAARMLRSVNEFGAFRIVNHGISGEEILSVVN 131
           SG          PA +DFR L S   G +  R++RS  EFGAFR+  HGISGEE+ S+V 
Sbjct: 61  SGGESCGSRHLIPAEIDFRLLASRREGSV-DRLVRSAREFGAFRVSYHGISGEELRSLVR 120

Query: 132 EAK---SVLEDCNKGVDDRSWVDDDGNREAILQVRRRNDSEPSGNTVVQSETNRQISEKM 191
           E+     VLE  + G   RS V   GNR+ I+ VR   +        +  E  R  S++M
Sbjct: 121 ESGRVFGVLEGRDTGF-HRSVV---GNRDEIVWVRSWKERMEWAREYIGPERYRCFSQEM 180

Query: 192 ERIRSKLEVIAEKLSELLWE-CMGEKVKKGDRKETIFSIYKYNNQNIYEKENKNKISKNE 251
           E +  KLE IA KL +++ E       KK  R E++ S+Y+YN++N+ E+      S   
Sbjct: 181 ENVADKLEDIARKLGQIMVENSRRPNDKKIQRGESVLSVYRYNHENVTEQ------SPPL 240

Query: 252 KEERESDDGNVTMSLHIPGEHCQFYVNSHQQHSFCFDAAADTIVVTIAKQLQELSLGKLK 311
            +ER  +  + T+SLH+P ++C+F VNS  +    F A  DTI+VT  +QL+E SLG+ K
Sbjct: 241 PKERTEEMLHYTLSLHLPAKNCEFRVNS-GKGPLSFHADPDTILVTFGRQLEEWSLGEFK 300

Query: 312 SARSEMIFVPDLLGSKTSFSIELKFSNPNLLNNH--SHSKIISISDQIFIAFLLISLY 353
             + E+I+ PD  GS TSFS+ELK  +  L +    + SK  S++ QIF AFLL+  +
Sbjct: 301 CRQGEIIYHPDAYGSPTSFSVELKCMSLFLSHTSIATTSKTFSLTHQIFTAFLLLFFF 344

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038881407.16.2e-15183.48uncharacterized protein LOC120072944, partial [Benincasa hispida][more]
XP_008463252.17.4e-14476.60PREDICTED: uncharacterized protein LOC103501456 [Cucumis melo][more]
XP_011653719.15.5e-13974.93uncharacterized protein LOC101207912 [Cucumis sativus] >KAE8649590.1 hypothetica... [more]
XP_022950423.11.5e-12872.98uncharacterized protein LOC111453527 [Cucurbita moschata][more]
XP_023544633.13.1e-12672.11uncharacterized protein LOC111804157 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A1S3CIU63.6e-14476.60uncharacterized protein LOC103501456 OS=Cucumis melo OX=3656 GN=LOC103501456 PE=... [more]
A0A6J1GEV07.2e-12972.98uncharacterized protein LOC111453527 OS=Cucurbita moschata OX=3662 GN=LOC1114535... [more]
A0A6J1ITI22.8e-12571.31uncharacterized protein LOC111478267 OS=Cucurbita maxima OX=3661 GN=LOC111478267... [more]
A0A6J1CSG74.7e-10465.19uncharacterized protein LOC111013795 OS=Momordica charantia OX=3673 GN=LOC111013... [more]
A0A314Y0U82.5e-5243.68Uncharacterized protein OS=Prunus yedoensis var. nudiflora OX=2094558 GN=Pyn_348... [more]
Match NameE-valueIdentityDescription
AT2G38500.11.0e-5041.062-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 221..241
NoneNo IPR availableCOILSCoilCoilcoord: 170..190
NoneNo IPR availablePANTHERPTHR34945:SF42-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEINcoord: 12..358
NoneNo IPR availablePANTHERPTHR349452-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEINcoord: 12..358
NoneNo IPR availableSUPERFAMILY51197Clavaminate synthase-likecoord: 75..298
IPR027443Isopenicillin N synthase-like superfamilyGENE3D2.60.120.330coord: 74..326
e-value: 5.2E-8
score: 34.5

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C04G073010.1Cla97C04G073010.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane