Cla97C08G151030 (gene) Watermelon (97103) v2

Name: Cla97C08G151030
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Hemerythrin HHE cation-binding domain protein
Location: Cla97Chr08 : 19421943 .. 19425357 (-)
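
The location string can be read as chromosome, start, end and strand. A minimal sketch of that reading follows, assuming the coordinates are 1-based and inclusive (a common convention for genome feature pages, but not stated here); `parse_location` is a hypothetical helper name, not part of any database tooling.

```python
import re

def parse_location(text):
    """Parse 'Chr : start .. end (strand)' into its parts (hypothetical helper)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Cla97Chr08 : 19421943 .. 19425357 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3415 bp on the minus strand of Cla97Chr08.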
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGAATAATTGCCTCGGGAGTTCGATGAAATCGGCGGCGGAGATTGTGCCTCAGGAGTTCATTAGAGGCTGTGGCGATACTGCGGCGGCTGCTAATCCGATCGTGCGACTTTACGGCCCTCCGAATAATGCTCTCACCTGCTACATCCGATTCGCTTTGCTATACAAGTCTGTGAAACTCAGTTTCATCCCTTCTGAGACTCCGCATTTCGGTTCCGATTCGCCGGCCATTCGGATCGGGACCGAGACTATTTCCGGTTCACGTGAAATGTTGCTTCGGTACATAGACAATAGGTTTCCTCATCCGCCGCTAGCGTTGTCGAGCCGCCGCGTTGACGACGACGAAACGACTTCGTTGGTTGCCGTGAGGGTGGTGGCTCTGCAGCACAAGAGCGTGTTATGGCATTTGGAGAGGATGTTGAGATGGGCGAAGGATCTGGCGACTCGTGGAGGGAGAACGACCGTCGATCCGGCGGTGGGAACGCCGAGGATGGAGCTGAGGAAGTTCGGGAAGAGCTACTCTCAGCTGCTGGAAGTGATGCTGGAACACGCTCAAATGGAGGAGAGAGTCCTCTTCCCGATATTGGAGAAGGCTGATCGAGGTAAGTTCCATGGCTCCGATTCTGCTCTGCGTTACATTGAATAGAATTTGAGTTTATGTTTATGCAATCCAAACCCTAAAAATTTTCGCATCATGAAAGAAAATACATGGAAGAAGAAGTGTTTGTGATTGCGAGTTTGCAGCAGATTTAATGCAATTTTCATAAACGTTATTTATTCTAGAAAGGTTGTTTAGATCTACCAAACTAGCCCAGAAGATTGAGATGATCAGTAAAACAGTGATGAACTTCAATAATTTTCTTGGTTATCCTATCTTTTCAATTTGGGGGAAATTAAGAGGTGGTTTGGCTGGTTTTTTGGTACTAACAAGTGGGGTATTGAGAATTTGAACTTTCGATCTAAGAGAAGGAGTACTGCTGAACTCAACCACTTTGACATTTGGCTTCTTTTACTTAGTTTTATGGAATTGTGCAACTATTCACATGCTTAAAGGAAAAACTGTTTTCAGTTATTTAAAAAACAAATCACATGAGCGGAAACTAGGGACAGCAACTAGAAGACAACCTGTACTCAGAAAAAGGAAAAAAGAAAAAAAGACAACCCATGCTAATGATATATGTTTCTAATTTTTTTATTTAAAATTTATTTTGTTCCATTAATTTCTATATTTATTTTGTTTTTGTGATCAAAATTTCAACTGTATTATTTCAGTCTTTAAACTTTTTTTTTTTTTTGAAATGTAGTTTATTTCTAAAAGGTTTTGAAATGTAGCCTTGAAACTTTTTAGTTTTTAAATAAATGTATTGTTTTAGACCATAAACTATACAAAAGATATATTTAAATAATTTTTGTTAACATGCTTATTCTATTAAATATTTAACCAGAATTTTCATTAATATATATATATATATATATATATATATATATAAAACCAGAATTTTGATTTATTTCTTTGTGATTTATAAAAAAATTAATGGCAAAAGTCTAAAGTACTTTCTAATATGGAATTTCAGAATTAAAATAATTTTTTAAAACAAAATAGAATAAATAGGGTAGCTTAAATATGATTTCAACTGAATTGAGCATGCCTTACCTACTAATCTATGTTCACCTAGACAAAAATTTGGTCTATTCTAAACAAATATACTTGTGATGGTTTTGTGCAATGGGAATTTGGGATTTATTCAAACAGAGCCCATGTTGTATATATAGTAGATGAAGGGACTAATTGCTAGATGCCATAATGCAGGCTTATGTAAAGTTTCAAACGAGGAGCATGCAAGGGATCTACCCATCATGAATGGCATCAAAGAAGACATTAAGTCCGCGGTCGTTTTGGACTTGGGAAGTTCAGTTTGTCAAGAAGCGCTCTCCAACCTTTCCAAACGTCTCAAGCTGTTGCAGGTAAACTTCCTTCAGGCCTTCTATAATCCTAAC
ATGTTCAAAATACATAGGGATTAAAGCCTGATACAGATGTAAGACCTTATGCTTAAGATTCCAGAGAAGTATTGACCACAGAGCAGAATCGGATGAACGAATGACAATGGATGAGTGATAAATTGTCAAACCATAACAATGTGATCTGTTTTTCTTCATTGGTTATGGGAAGAGACCAAAGGTCTTTTAACTGCCGATCCCATAGAAGTAGAACAACTAAAAAGATGTAAAAGAAAACCTTGGACAAACCACTCAAGAAAAAGAGATAGAGTGGTTGCCATCTAAGCATAGGTCTAGTTAAAATATTTGTCATCGAACAAGAGGTCAAAAGTTCGAATCTTTCCACACCCCATATGTTTGTTGAGGATGAAAAAGATTAAGTGATTCGTAGCTGCTTTCTATAATTCAATTACCAAATTCTAGTCGTTTCAGGATTGTTGTTTTAAGACTTAAATCTCTTCCCTCACATTTAAATCTTTGTCAGAAACGTTTATTTACTCTGCCGTCATTTAGGTCGACCCCACTAAGTTAGACTGAAACAAAGAACACTCATCGTGCCTTTATGGAATAGATAGAGGCAAAGTAGGAAATAAAGATAACCATCTAAACCATGTGGACTGATGGAGCATCAAATAACTCTCTGCCACACGACACCTTGTGACTACAGTCACACTATAAGACTAGCTGCTAGCATAACCCTTTTTCTGAGGTTTTTAAATTCAGGCTAATTGATCTACATTATGACATTGTGGAGAATGTATTAAATTTATCACACTTAATAAGTTAAGCTTTTGAACATATTGATGATTTAGTAGACTTCAATAGCCAAACCACTTCTACTTCATATTCAACTACTTTCATGTTTCATTATTACACAATATTGATAACAGGTAGAGTTTTAGAGCCATTCAGTTATTCTTACTAAAATATCATGTGGAATCCACTTGACTTTTAAAATAAAAATTTAAGTTTCATGTTTTGATGCTTGGTATTTTTCTTGACTATTAATCATCAAAATCATTTATATGTACTTCACACTTCACATGTATGTTCTTTATTGCACTCTAAGTTTGTCTGCTCATCGTAAGCCGATTTCTTTTTCCTTACCTTTTTTCTTTTTTCCAATCTCTTCCTTTACATTTTTTTGGGGGCAGGAACACTGTAAACATCACTTTTTGGATGAAGAGAAAAATCTACTACCTTGGCTTGAAGCTGTAGAGCTGAACAAAGAGCAACAGGACAAAATGTTAGAGCAGCTCTTGGATGTGATGAAACAAACTCATTCTCATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCTGGAAGCTCTGCAGTATTTGGATCTGATTACAAACAGTAGCGATAAAATCCGAACGAGCTTAGGCTCAATGCTCCTGATGAATGTTAAGTAA

mRNA sequence

ATGGGGAATAATTGCCTCGGGAGTTCGATGAAATCGGCGGCGGAGATTGTGCCTCAGGAGTTCATTAGAGGCTGTGGCGATACTGCGGCGGCTGCTAATCCGATCGTGCGACTTTACGGCCCTCCGAATAATGCTCTCACCTGCTACATCCGATTCGCTTTGCTATACAAGTCTGTGAAACTCAGTTTCATCCCTTCTGAGACTCCGCATTTCGGTTCCGATTCGCCGGCCATTCGGATCGGGACCGAGACTATTTCCGGTTCACGTGAAATGTTGCTTCGGTACATAGACAATAGGTTTCCTCATCCGCCGCTAGCGTTGTCGAGCCGCCGCGTTGACGACGACGAAACGACTTCGTTGGTTGCCGTGAGGGTGGTGGCTCTGCAGCACAAGAGCGTGTTATGGCATTTGGAGAGGATGTTGAGATGGGCGAAGGATCTGGCGACTCGTGGAGGGAGAACGACCGTCGATCCGGCGGTGGGAACGCCGAGGATGGAGCTGAGGAAGTTCGGGAAGAGCTACTCTCAGCTGCTGGAAGTGATGCTGGAACACGCTCAAATGGAGGAGAGAGTCCTCTTCCCGATATTGGAGAAGGCTGATCGAGGCTTATGTAAAGTTTCAAACGAGGAGCATGCAAGGGATCTACCCATCATGAATGGCATCAAAGAAGACATTAAGTCCGCGGTCGTTTTGGACTTGGGAAGTTCAGTTTGTCAAGAAGCGCTCTCCAACCTTTCCAAACGTCTCAAGCTGTTGCAGGAACACTGTAAACATCACTTTTTGGATGAAGAGAAAAATCTACTACCTTGGCTTGAAGCTGTAGAGCTGAACAAAGAGCAACAGGACAAAATGTTAGAGCAGCTCTTGGATGTGATGAAACAAACTCATTCTCATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCTGGAAGCTCTGCAGTATTTGGATCTGATTACAAACAGTAGCGATAAAATCCGAACGAGCTTAGGCTCAATGCTCCTGATGAATGTTAAGTAA

Coding sequence (CDS)

ATGGGGAATAATTGCCTCGGGAGTTCGATGAAATCGGCGGCGGAGATTGTGCCTCAGGAGTTCATTAGAGGCTGTGGCGATACTGCGGCGGCTGCTAATCCGATCGTGCGACTTTACGGCCCTCCGAATAATGCTCTCACCTGCTACATCCGATTCGCTTTGCTATACAAGTCTGTGAAACTCAGTTTCATCCCTTCTGAGACTCCGCATTTCGGTTCCGATTCGCCGGCCATTCGGATCGGGACCGAGACTATTTCCGGTTCACGTGAAATGTTGCTTCGGTACATAGACAATAGGTTTCCTCATCCGCCGCTAGCGTTGTCGAGCCGCCGCGTTGACGACGACGAAACGACTTCGTTGGTTGCCGTGAGGGTGGTGGCTCTGCAGCACAAGAGCGTGTTATGGCATTTGGAGAGGATGTTGAGATGGGCGAAGGATCTGGCGACTCGTGGAGGGAGAACGACCGTCGATCCGGCGGTGGGAACGCCGAGGATGGAGCTGAGGAAGTTCGGGAAGAGCTACTCTCAGCTGCTGGAAGTGATGCTGGAACACGCTCAAATGGAGGAGAGAGTCCTCTTCCCGATATTGGAGAAGGCTGATCGAGGCTTATGTAAAGTTTCAAACGAGGAGCATGCAAGGGATCTACCCATCATGAATGGCATCAAAGAAGACATTAAGTCCGCGGTCGTTTTGGACTTGGGAAGTTCAGTTTGTCAAGAAGCGCTCTCCAACCTTTCCAAACGTCTCAAGCTGTTGCAGGAACACTGTAAACATCACTTTTTGGATGAAGAGAAAAATCTACTACCTTGGCTTGAAGCTGTAGAGCTGAACAAAGAGCAACAGGACAAAATGTTAGAGCAGCTCTTGGATGTGATGAAACAAACTCATTCTCATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCTGGAAGCTCTGCAGTATTTGGATCTGATTACAAACAGTAGCGATAAAATCCGAACGAGCTTAGGCTCAATGCTCCTGATGAATGTTAAGTAA

Protein sequence

MGNNCLGSSMKSAAEIVPQEFIRGCGDTAAAANPIVRLYGPPNNALTCYIRFALLYKSVKLSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETTSLVAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLLEVMLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSHLLNFFLEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMNVK
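
The relationship between the coding sequence and the protein sequence above can be spot-checked with a short translation sketch. It assumes the standard nuclear genetic code and uses only the first 30 and last 12 bases of the CDS listed above; `translate` and the variable names are hypothetical, chosen for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGGAATAATTGCCTCGGGAGTTCGATG"  # first 30 bases of the CDS
cds_end = "AATGTTAAGTAA"                      # last 12 bases of the CDS

print(translate(cds_start))  # MGNNCLGSSM
print(translate(cds_end))    # NVK
```

The two fragments translate to the first ten and last three residues of the protein sequence shown above, with `TAA` as the terminal stop codon.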
BLAST of Cla97C08G151030 vs. NCBI nr
Match: XP_008456602.1 (PREDICTED: uncharacterized protein LOC103496512 [Cucumis melo])

HSP 1 Score: 615.9 bits (1587), Expect = 8.1e-173
Identity = 309/339 (91.15%), Positives = 323/339 (95.28%), Query Frame = 0

Query: 1   MGNNCLGSSMKSAAEIVPQEFIRGCGD--TAAAANPIVRLYGPPNNALTCYIRFALLYKS 60
           MGNNC GSS KSAAEIVPQE  R C +   AAA+NPIVRLYGPPNNALTCYIRFALLYKS
Sbjct: 1   MGNNCYGSSNKSAAEIVPQELFRSCNNDSAAAASNPIVRLYGPPNNALTCYIRFALLYKS 60

Query: 61  VKLSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETT 120
           VKLSFIPSETPHFGSDSPAIRIG+ETISGSRE +LR+IDNRFPHPPL LSSRRVDDDET+
Sbjct: 61  VKLSFIPSETPHFGSDSPAIRIGSETISGSRERMLRFIDNRFPHPPLPLSSRRVDDDETS 120

Query: 121 SLVAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLL 180
           SLVA+RVVALQHKSVLWHLERMLRW KDLA RGGRTT DPAVGTPRMELRKFGKSYSQLL
Sbjct: 121 SLVALRVVALQHKSVLWHLERMLRWGKDLANRGGRTTFDPAVGTPRMELRKFGKSYSQLL 180

Query: 181 EVMLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC 240
           EVMLEHAQMEERVLFPIL++ADRGLCK SNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC
Sbjct: 181 EVMLEHAQMEERVLFPILDRADRGLCKASNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC 240

Query: 241 QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSH 300
           QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSH
Sbjct: 241 QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSH 300

Query: 301 LLNFFLEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMN 338
           LLNFFLEGLLPLEALQYLDLIT+SSD+IRTS G+ML+M+
Sbjct: 301 LLNFFLEGLLPLEALQYLDLITSSSDRIRTSFGTMLMMD 339

BLAST of Cla97C08G151030 vs. NCBI nr
Match: XP_022956168.1 (uncharacterized protein LOC111457939 [Cucurbita moschata])

HSP 1 Score: 610.9 bits (1574), Expect = 2.6e-171
Identity = 304/339 (89.68%), Positives = 320/339 (94.40%), Query Frame = 0

Query: 1   MGNNCLGSSMKSAAEIVPQEFIRGCGDTAAAANPIVRLYGPPNNALTCYIRFALLYKSVK 60
           MGNNCLGS  KS AEIVPQEFIRGC D + A+NP+VRLYGPPNNALTCYIRFALLYKSVK
Sbjct: 1   MGNNCLGSKTKSTAEIVPQEFIRGCSD-STASNPVVRLYGPPNNALTCYIRFALLYKSVK 60

Query: 61  LSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETTSL 120
            SFIPSET HFGSDSPAIRIGTET+SGSR+ LLRYIDN+FPHPPLA+SSRRVDDDETT L
Sbjct: 61  HSFIPSETTHFGSDSPAIRIGTETVSGSRDRLLRYIDNKFPHPPLAISSRRVDDDETTQL 120

Query: 121 VAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLLEV 180
           VA+ VV+LQHKSVLWHLERMLRWAKDLA RGGRT VDP +GTPRMELRKFGKSYSQLLEV
Sbjct: 121 VALTVVSLQHKSVLWHLERMLRWAKDLAARGGRTAVDPTMGTPRMELRKFGKSYSQLLEV 180

Query: 181 MLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVCQE 240
           MLEHAQMEERVLFPILE ADRGLCK SNEEHARDLPIMNGIKEDIKS VVLD+GSSVCQE
Sbjct: 181 MLEHAQMEERVLFPILEMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDVGSSVCQE 240

Query: 241 ALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSHLL 300
           ALSNLSKRLKLLQEHCKHHF++EEKNLLPW EAVELNKEQQDK LEQLLDVMKQTHSHLL
Sbjct: 241 ALSNLSKRLKLLQEHCKHHFMEEEKNLLPWFEAVELNKEQQDKTLEQLLDVMKQTHSHLL 300

Query: 301 NFFLEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMNVK 340
           NFFLEGLLPLEALQYLDLIT+SSDKIRTSLG+MLLMNV+
Sbjct: 301 NFFLEGLLPLEALQYLDLITSSSDKIRTSLGTMLLMNVE 338

BLAST of Cla97C08G151030 vs. NCBI nr
Match: XP_023525990.1 (uncharacterized protein LOC111789551 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 610.1 bits (1572), Expect = 4.4e-171
Identity = 303/339 (89.38%), Positives = 320/339 (94.40%), Query Frame = 0

Query: 1   MGNNCLGSSMKSAAEIVPQEFIRGCGDTAAAANPIVRLYGPPNNALTCYIRFALLYKSVK 60
           MGNNCLGS  KS AEIVPQEFIRGC D + A+NP+VRLYGPPNNALTCYIRFALLYKSVK
Sbjct: 1   MGNNCLGSKTKSTAEIVPQEFIRGCSD-STASNPVVRLYGPPNNALTCYIRFALLYKSVK 60

Query: 61  LSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETTSL 120
            SFIPSET HFGSDSPAIRIGTET+SGSR+ LLRYIDN+FPHPPLA+SSRRVDDDETT L
Sbjct: 61  HSFIPSETTHFGSDSPAIRIGTETVSGSRDRLLRYIDNKFPHPPLAISSRRVDDDETTQL 120

Query: 121 VAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLLEV 180
           VA+ VV+LQHKSVLWHLERMLRWAKDLA RGGRT VDP +GTPRMELRKFGKSYSQLLEV
Sbjct: 121 VALTVVSLQHKSVLWHLERMLRWAKDLAARGGRTAVDPTMGTPRMELRKFGKSYSQLLEV 180

Query: 181 MLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVCQE 240
           MLEHAQMEERVLFPILE ADRGLCK SNEEHARDLPIMNGIKEDIKS VVLD+GSSVCQE
Sbjct: 181 MLEHAQMEERVLFPILEMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDVGSSVCQE 240

Query: 241 ALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSHLL 300
           ALSNLSKRLKLLQEHCKHHF++EEKNLLPW EAVE+NKEQQDK LEQLLDVMKQTHSHLL
Sbjct: 241 ALSNLSKRLKLLQEHCKHHFMEEEKNLLPWFEAVEMNKEQQDKTLEQLLDVMKQTHSHLL 300

Query: 301 NFFLEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMNVK 340
           NFFLEGLLPLEALQYLDLIT+SSDKIRTSLG+MLLMNV+
Sbjct: 301 NFFLEGLLPLEALQYLDLITSSSDKIRTSLGTMLLMNVE 338

BLAST of Cla97C08G151030 vs. NCBI nr
Match: XP_022990403.1 (uncharacterized protein LOC111487271 [Cucurbita maxima])

HSP 1 Score: 607.8 bits (1566), Expect = 2.2e-170
Identity = 303/339 (89.38%), Positives = 319/339 (94.10%), Query Frame = 0

Query: 1   MGNNCLGSSMKSAAEIVPQEFIRGCGDTAAAANPIVRLYGPPNNALTCYIRFALLYKSVK 60
           MGNNCLGS  KS AEIVPQEFIRGC D + A+N +VRLYGPPNNALTCYIRFALLYKSVK
Sbjct: 1   MGNNCLGSKTKSTAEIVPQEFIRGCSD-STASNSVVRLYGPPNNALTCYIRFALLYKSVK 60

Query: 61  LSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETTSL 120
            SFIPSET HFGSDSPAIRIGTET+SGSR+ LLRYIDN+FPHPPLA+SSRRVDDDETT L
Sbjct: 61  HSFIPSETTHFGSDSPAIRIGTETVSGSRDRLLRYIDNKFPHPPLAISSRRVDDDETTQL 120

Query: 121 VAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLLEV 180
           VA+ VV+LQHKSVLWHLERMLRWAKDLA RGGRT VDP +GTPRMELRKFGKSYSQLLEV
Sbjct: 121 VALTVVSLQHKSVLWHLERMLRWAKDLAARGGRTAVDPTMGTPRMELRKFGKSYSQLLEV 180

Query: 181 MLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVCQE 240
           MLEHAQMEERVLFPILE ADRGLCK SNEEHARDLPIMNGIKEDIKS VVLD+GSSVCQE
Sbjct: 181 MLEHAQMEERVLFPILEMADRGLCKTSNEEHARDLPIMNGIKEDIKSTVVLDVGSSVCQE 240

Query: 241 ALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSHLL 300
           ALSNLSKRLKLLQEHCKHHF++EEKNLLPW EAVELNKEQQDK LEQLLDVMKQTHSHLL
Sbjct: 241 ALSNLSKRLKLLQEHCKHHFMEEEKNLLPWFEAVELNKEQQDKTLEQLLDVMKQTHSHLL 300

Query: 301 NFFLEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMNVK 340
           NFFLEGLLPLEALQYLDLIT+SSDKIRTSLG+MLLMNV+
Sbjct: 301 NFFLEGLLPLEALQYLDLITSSSDKIRTSLGTMLLMNVE 338

BLAST of Cla97C08G151030 vs. NCBI nr
Match: XP_004140904.2 (PREDICTED: uncharacterized protein LOC101208874 [Cucumis sativus] >KGN46040.1 hypothetical protein Csa_6G045190 [Cucumis sativus])

HSP 1 Score: 603.2 bits (1554), Expect = 5.4e-169
Identity = 301/339 (88.79%), Positives = 319/339 (94.10%), Query Frame = 0

Query: 1   MGNNCLGSSMKSAAEIVPQEFIRGCGD--TAAAANPIVRLYGPPNNALTCYIRFALLYKS 60
           MGNNC GSS KSAAEIVPQE  R C +    AA+NP VRLYGPPNNA TCYIRFALLYKS
Sbjct: 1   MGNNCYGSSNKSAAEIVPQELFRSCNNDSVTAASNPTVRLYGPPNNAFTCYIRFALLYKS 60

Query: 61  VKLSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETT 120
           VKLSFIPS+ PHFGSDSPAIRIG+ETISGSRE +LR+IDN+FPHPPL LSSRRVD+DET+
Sbjct: 61  VKLSFIPSDAPHFGSDSPAIRIGSETISGSRERMLRFIDNKFPHPPLPLSSRRVDEDETS 120

Query: 121 SLVAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLL 180
           SLVAVRVVALQHKSVLWHLERMLRW KDLA RGGRTT DPAVGTPRMELRKFGKSYSQLL
Sbjct: 121 SLVAVRVVALQHKSVLWHLERMLRWGKDLANRGGRTTFDPAVGTPRMELRKFGKSYSQLL 180

Query: 181 EVMLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC 240
           EVMLEHAQMEERVLFPIL++ADRGLCK SNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC
Sbjct: 181 EVMLEHAQMEERVLFPILDRADRGLCKASNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC 240

Query: 241 QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSH 300
           QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVEL+KEQQDKMLEQLLD+MKQTHSH
Sbjct: 241 QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELSKEQQDKMLEQLLDLMKQTHSH 300

Query: 301 LLNFFLEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMN 338
           LLNFFLEGLLPLEALQYLDLIT+SSD+IRTS G+ML+M+
Sbjct: 301 LLNFFLEGLLPLEALQYLDLITSSSDRIRTSFGTMLMMD 339

BLAST of Cla97C08G151030 vs. TrEMBL
Match: tr|A0A1S3C379|A0A1S3C379_CUCME (uncharacterized protein LOC103496512 OS=Cucumis melo OX=3656 GN=LOC103496512 PE=4 SV=1)

HSP 1 Score: 615.9 bits (1587), Expect = 5.3e-173
Identity = 309/339 (91.15%), Positives = 323/339 (95.28%), Query Frame = 0

Query: 1   MGNNCLGSSMKSAAEIVPQEFIRGCGD--TAAAANPIVRLYGPPNNALTCYIRFALLYKS 60
           MGNNC GSS KSAAEIVPQE  R C +   AAA+NPIVRLYGPPNNALTCYIRFALLYKS
Sbjct: 1   MGNNCYGSSNKSAAEIVPQELFRSCNNDSAAAASNPIVRLYGPPNNALTCYIRFALLYKS 60

Query: 61  VKLSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETT 120
           VKLSFIPSETPHFGSDSPAIRIG+ETISGSRE +LR+IDNRFPHPPL LSSRRVDDDET+
Sbjct: 61  VKLSFIPSETPHFGSDSPAIRIGSETISGSRERMLRFIDNRFPHPPLPLSSRRVDDDETS 120

Query: 121 SLVAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLL 180
           SLVA+RVVALQHKSVLWHLERMLRW KDLA RGGRTT DPAVGTPRMELRKFGKSYSQLL
Sbjct: 121 SLVALRVVALQHKSVLWHLERMLRWGKDLANRGGRTTFDPAVGTPRMELRKFGKSYSQLL 180

Query: 181 EVMLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC 240
           EVMLEHAQMEERVLFPIL++ADRGLCK SNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC
Sbjct: 181 EVMLEHAQMEERVLFPILDRADRGLCKASNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC 240

Query: 241 QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSH 300
           QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSH
Sbjct: 241 QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSH 300

Query: 301 LLNFFLEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMN 338
           LLNFFLEGLLPLEALQYLDLIT+SSD+IRTS G+ML+M+
Sbjct: 301 LLNFFLEGLLPLEALQYLDLITSSSDRIRTSFGTMLMMD 339

BLAST of Cla97C08G151030 vs. TrEMBL
Match: tr|A0A0A0K8J3|A0A0A0K8J3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G045190 PE=4 SV=1)

HSP 1 Score: 603.2 bits (1554), Expect = 3.6e-169
Identity = 301/339 (88.79%), Positives = 319/339 (94.10%), Query Frame = 0

Query: 1   MGNNCLGSSMKSAAEIVPQEFIRGCGD--TAAAANPIVRLYGPPNNALTCYIRFALLYKS 60
           MGNNC GSS KSAAEIVPQE  R C +    AA+NP VRLYGPPNNA TCYIRFALLYKS
Sbjct: 1   MGNNCYGSSNKSAAEIVPQELFRSCNNDSVTAASNPTVRLYGPPNNAFTCYIRFALLYKS 60

Query: 61  VKLSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETT 120
           VKLSFIPS+ PHFGSDSPAIRIG+ETISGSRE +LR+IDN+FPHPPL LSSRRVD+DET+
Sbjct: 61  VKLSFIPSDAPHFGSDSPAIRIGSETISGSRERMLRFIDNKFPHPPLPLSSRRVDEDETS 120

Query: 121 SLVAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLL 180
           SLVAVRVVALQHKSVLWHLERMLRW KDLA RGGRTT DPAVGTPRMELRKFGKSYSQLL
Sbjct: 121 SLVAVRVVALQHKSVLWHLERMLRWGKDLANRGGRTTFDPAVGTPRMELRKFGKSYSQLL 180

Query: 181 EVMLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC 240
           EVMLEHAQMEERVLFPIL++ADRGLCK SNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC
Sbjct: 181 EVMLEHAQMEERVLFPILDRADRGLCKASNEEHARDLPIMNGIKEDIKSAVVLDLGSSVC 240

Query: 241 QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSH 300
           QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVEL+KEQQDKMLEQLLD+MKQTHSH
Sbjct: 241 QEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELSKEQQDKMLEQLLDLMKQTHSH 300

Query: 301 LLNFFLEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMN 338
           LLNFFLEGLLPLEALQYLDLIT+SSD+IRTS G+ML+M+
Sbjct: 301 LLNFFLEGLLPLEALQYLDLITSSSDRIRTSFGTMLMMD 339

BLAST of Cla97C08G151030 vs. TrEMBL
Match: tr|A0A2N9FWA8|A0A2N9FWA8_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19171 PE=4 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 1.2e-111
Identity = 215/331 (64.95%), Positives = 258/331 (77.95%), Query Frame = 0

Query: 4   NCLG-SSMKSAAEIVPQEFIRGCGDTAAAANPIVRLYGPPNNALTCYIRFALLYKSVKLS 63
           NC G +S KS AEIVP + I+G           VRLYG P   +T YIRFALLYK+V L 
Sbjct: 3   NCFGKNSKKSTAEIVPHDNIKGI-SPXXXXXXXVRLYGSPTCTVTAYIRFALLYKTVSLR 62

Query: 64  FIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDD-----DET 123
           F+PSETP FGS++P ++IG+ET+SGSRE LL YI+ RFPHPPL +  RR DD       T
Sbjct: 63  FVPSETPSFGSETPVLQIGSETVSGSRETLLSYIEARFPHPPLVI--RRGDDVXXXXXXT 122

Query: 124 TSLVAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQL 183
           T LV VRV+ LQHKS+ WH+ER++RW  DL TRGG+ +VDPAVG+PRME+RKF +SYS+L
Sbjct: 123 TPLV-VRVIGLQHKSMTWHVERLVRWVDDLTTRGGKGSVDPAVGSPRMEVRKFARSYSEL 182

Query: 184 LEVMLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSV 243
           LEVMLEHAQMEE+V+FPIL+ ADRGLCK +N+EHARDLPIMNGIKEDIKS  VLD GS V
Sbjct: 183 LEVMLEHAQMEEKVVFPILDMADRGLCKAANQEHARDLPIMNGIKEDIKSIGVLDSGSPV 242

Query: 244 CQEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHS 303
            QEAL +LS RLK L EH K HF++E++ LLP +EAVEL+KEQQ + LEQ LDVM+ THS
Sbjct: 243 YQEALFSLSTRLKSLHEHSKQHFMEEDRVLLPLMEAVELSKEQQKRALEQCLDVMQGTHS 302

Query: 304 HLLNFFLEGLLPLEALQYLDLITNSSDKIRT 329
           HL NF LEGLLPLEA+QYLDL T+ +D+ RT
Sbjct: 303 HLFNFLLEGLLPLEAMQYLDLFTSCNDRERT 329

BLAST of Cla97C08G151030 vs. TrEMBL
Match: tr|A0A061FWR7|A0A061FWR7_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_044208 PE=4 SV=1)

HSP 1 Score: 391.7 bits (1005), Expect = 1.6e-105
Identity = 202/336 (60.12%), Positives = 252/336 (75.00%), Query Frame = 0

Query: 4   NCLGSSMKSAAEIVPQEFIRGCGDTAAAANPIVRLYGPPNNALTCYIRFALLYKSVKLSF 63
           NC   S KS AEI P + IR          P VRLYG  ++ L  YIRFALL+K++ L F
Sbjct: 3   NCFAQSKKSTAEIAPYDSIRRFKPVPVV--PTVRLYGSASSTLAAYIRFALLHKNLPLQF 62

Query: 64  IPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPPLALSSRRVDDDETTSLVAV 123
           +P++ P    +   + IG+ET+SG RE LL++I+++FPHPPL  +   VD    T+ + V
Sbjct: 63  VPTDKPPCDGEPLLLEIGSETVSGYRETLLQFIEDKFPHPPLGFN--MVD----TTPLTV 122

Query: 124 RVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTPRMELRKFGKSYSQLLEVMLE 183
           +V  LQH+S+ WHLERM+RWA+DL+TRGGR TVDPAVG+PRMELRKFGK+YSQLLE+M+E
Sbjct: 123 QVTWLQHRSITWHLERMVRWAEDLSTRGGRRTVDPAVGSPRMELRKFGKNYSQLLELMVE 182

Query: 184 HAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKEDIKSAVVLDLGSSVCQEALS 243
           HAQMEERV+FP+LE ADRGLCK +NEEHARDLP+MNGIKEDIKS  V+D G+    E LS
Sbjct: 183 HAQMEERVVFPVLEMADRGLCKSANEEHARDLPVMNGIKEDIKSIGVMDYGTPAYHEGLS 242

Query: 244 NLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDKMLEQLLDVMKQTHSHLLNFF 303
           NLS RLK LQ+HCK HF +EEK+LLP +EA EL++EQQ ++ EQ  D MK THSHLLNFF
Sbjct: 243 NLSTRLKSLQKHCKEHFDEEEKDLLPLIEATELSEEQQTRVFEQCFDAMKATHSHLLNFF 302

Query: 304 LEGLLPLEALQYLDLITNSSDKIRTSLGSMLLMNVK 340
           LEGLLP EA++Y+DLI   SDK RT+  SM+ M  K
Sbjct: 303 LEGLLPSEAMEYVDLINKCSDKERTA--SMIQMIAK 328

BLAST of Cla97C08G151030 vs. TrEMBL
Match: tr|A0A2P4LUH8|A0A2P4LUH8_QUESU (Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_22781 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 4.8e-105
Identity = 192/300 (64.00%), Positives = 238/300 (79.33%), Query Frame = 0

Query: 34  PIVRLYGPPNNALTCYIRFALLYKSVKLSFIPSETPHFGSDSPAIRIGTETISGSREMLL 93
           P VRLYG P + L  YIRFALL+K V + F+PS+TP+FGSD+P ++IG ET+SGS E +L
Sbjct: 54  PTVRLYGSPTSVLAAYIRFALLHKGVSVRFVPSDTPNFGSDAPVLQIGPETVSGSLETVL 113

Query: 94  RYIDNRFPHPPLALSS----RRVDDDETTSLVAVRVVALQHKSVLWHLERMLRWAKDLAT 153
           RYID RFPHPPL + S                 VR +ALQHKS+ WH+ER++RW +DLAT
Sbjct: 114 RYIDARFPHPPLGVRSCDXXXXXXXXXXXXXSVVRAIALQHKSMTWHVERLVRWVEDLAT 173

Query: 154 RGGRTTVDPAVGTPRMELRKFGKSYSQLLEVMLEHAQMEERVLFPILEKADRGLCKVSNE 213
           RGG+ +VDP VG+PRME++K  +SYS+LLEV+LEHAQMEERV+FPILE+ADRG+CKV+NE
Sbjct: 174 RGGKGSVDPTVGSPRMEMKKLARSYSELLEVLLEHAQMEERVVFPILERADRGICKVANE 233

Query: 214 EHARDLPIMNGIKEDIKSAVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFLDEEKNLLP 273
           EHARDLPIMNGIKE IKS  V+D GS    EALSNLS RLK LQE+ K HF++E+K+LLP
Sbjct: 234 EHARDLPIMNGIKEGIKSIGVMDSGSPDYHEALSNLSTRLKSLQENSKQHFMEEDKDLLP 293

Query: 274 WLEAVELNKEQQDKMLEQLLDVMKQTHSHLLNFFLEGLLPLEALQYLDLITNSSDKIRTS 330
           ++EAVELNKEQQ ++LEQ LDVM+ THSHL NF LEGLLP EA+QYLDL  + +D+ RT+
Sbjct: 294 FMEAVELNKEQQKRVLEQCLDVMQGTHSHLFNFLLEGLLPHEAMQYLDLFISCNDRERTA 353

BLAST of Cla97C08G151030 vs. TAIR10
Match: AT3G54290.1 (FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 343.6 bits (880), Expect = 1.4e-94
Identity = 178/341 (52.20%), Positives = 235/341 (68.91%), Query Frame = 0

Query: 5   CLGSSMKSAAEIVPQEFI--------------------RGCGDTAAAANPIVRLYGPPNN 64
           C  SS KS AEI P + +                         +  +    VRLYGPPN+
Sbjct: 4   CFSSSTKSTAEISPFDLVVXXXXXXXXXXXXXTQRIPTAKTETSTVSFTATVRLYGPPNS 63

Query: 65  ALTCYIRFALLYKSVKLSFIPSETPHFGSDSPAIRIGTETISGSREMLLRYIDNRFPHPP 124
            +T Y+RFALL+K V L F+PSE        P I++G+ET+SGSRE+LLRYI+++FP P 
Sbjct: 64  LVTSYLRFALLHKKVPLRFVPSE-----DQKPTIQVGSETVSGSREVLLRYIEDKFPEPR 123

Query: 125 LALSSRRVDD-DETTSLVAVRVVALQHKSVLWHLERMLRWAKDLATRGGRTTVDPAVGTP 184
           L +    ++  DE T L+ V+++ LQH+S+LWH+ERMLRW++DLA RGG+  VDP+VGTP
Sbjct: 124 LMIWKFNLEGFDEATPLI-VKMIWLQHRSMLWHMERMLRWSEDLAARGGKKAVDPSVGTP 183

Query: 185 RMELRKFGKSYSQLLEVMLEHAQMEERVLFPILEKADRGLCKVSNEEHARDLPIMNGIKE 244
           +ME+RKF KSY+ L E+MLEHAQMEER+LFP+LE  DRG+CK +NEEH R+LP+MNGIKE
Sbjct: 184 KMEIRKFAKSYTHLQELMLEHAQMEERILFPVLESVDRGMCKSANEEHGRELPMMNGIKE 243

Query: 245 DIKSAVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFLDEEKNLLPWLEAVELNKEQQDK 304
           DIKS  VLD  S +C EAL +L+ R K LQ  CK HF +EEK+LLP +EA E+ KE+Q K
Sbjct: 244 DIKSIGVLD--SGICSEALFSLASRFKSLQMMCKTHFEEEEKDLLPMVEAAEMGKEKQKK 303

Query: 305 MLEQLLDVMKQTHSHLLNFFLEGLLPLEALQYLDLITNSSD 325
           ++ Q L+VM  THS+  +F LEGL P EA+QY+DL+    D
Sbjct: 304 LMNQSLEVMSGTHSNSFDFLLEGLTPQEAMQYIDLLMTFGD 336

The following BLAST results are available for this feature:
NCBI nr:
Match Name                      E-value   Identity  Description
XP_008456602.1                  8.1e-173  91.15     PREDICTED: uncharacterized protein LOC103496512 [Cucumis melo]
XP_022956168.1                  2.6e-171  89.68     uncharacterized protein LOC111457939 [Cucurbita moschata]
XP_023525990.1                  4.4e-171  89.38     uncharacterized protein LOC111789551 isoform X2 [Cucurbita pepo subsp. pepo]
XP_022990403.1                  2.2e-170  89.38     uncharacterized protein LOC111487271 [Cucurbita maxima]
XP_004140904.2                  5.4e-169  88.79     PREDICTED: uncharacterized protein LOC101208874 [Cucumis sativus] >KGN46040.1 hy...

TrEMBL:
Match Name                      E-value   Identity  Description
tr|A0A1S3C379|A0A1S3C379_CUCME  5.3e-173  91.15     uncharacterized protein LOC103496512 OS=Cucumis melo OX=3656 GN=LOC103496512 PE=...
tr|A0A0A0K8J3|A0A0A0K8J3_CUCSA  3.6e-169  88.79     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G045190 PE=4 SV=1
tr|A0A2N9FWA8|A0A2N9FWA8_FAGSY  1.2e-111  64.95     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19171 PE=4 SV=1
tr|A0A061FWR7|A0A061FWR7_THECC  1.6e-105  60.12     Uncharacterized protein isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_044208 PE=4 ...
tr|A0A2P4LUH8|A0A2P4LUH8_QUESU  4.8e-105  64.00     Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_22781 PE=4 SV=1

TAIR10:
Match Name                      E-value   Identity  Description
AT3G54290.1                     1.4e-94   52.20     FUNCTIONS IN: molecular_function unknown
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR012312  Haemerythrin-like
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C08G151030.1  Cla97C08G151030.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description    Source   Source Term         Source Description  Alignment
None       No IPR available   COILS    Coil                Coil                coord: 271..291
None       No IPR available   GENE3D   G3DSA:1.20.120.520  -                   coord: 122..291; e-value: 1.9E-9; score: 39.9
None       No IPR available   PANTHER  PTHR35739           FAMILY NOT NAMED    coord: 12..334
None       No IPR available   CDD      cd12108             Hr-like             coord: 128..269; e-value: 6.00553E-15; score: 69.3816
IPR012312  Haemerythrin-like  PFAM     PF01814             Hemerythrin         coord: 122..271; e-value: 4.1E-17; score: 62.9

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cla97C08G151030  Wax gourd                  wgowmbB541
Cla97C08G151030  Watermelon (97103) v2      wmbwmbB143
Cla97C08G151030  Silver-seed gourd          carwmbB0864
Cla97C08G151030  Cucurbita maxima (Rimu)    cmawmbB317
Cla97C08G151030  Cucurbita maxima (Rimu)    cmawmbB788
Cla97C08G151030  Cucurbita moschata (Rifu)  cmowmbB300
Cla97C08G151030  Bottle gourd (USVL1VR-Ls)  lsiwmbB448