Clc10G15040 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc10G15040
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: UPF0739 protein C1orf74 homolog isoform X2
Location: ClcChr10: 28633942 .. 28637966 (-)
RNA-Seq Expression: Clc10G15040
Synteny: Clc10G15040
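
The location string can be split into chromosome, coordinates, and strand programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), so that the span length is end - start + 1.

```python
import re

def parse_location(text):
    """Parse 'Chr: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("ClcChr10: 28633942 .. 28637966 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # ClcChr10 - 4025
```

Under that assumption the gene spans 4025 bp on the minus strand of ClcChr10.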
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGACCAGTTGTGATGATAGACTATGGTGGGAAGATGCCTGAATTGCAACAGCGGCTTTGTGCACTTCTAAAACTAATTCAAAAGGTTCATGACTTACTTCTTGCTTGCATGTTGTATTGGATCATAAATTATGCTGCTCAATGTCTTTATATCGTTAACATCGTTTGGGCAGTTCATAAAATCAGTTTCTCTGTTTTATCAGTAAGGGGAATAAGACTGAAGCCTGTAAAAATTATCACCACTTTGAATACAAAATCTCACATAGCAACAACTAAGGCGGGGTCATATAAAGGTTTTAATCTTGTAAAATGACGTGGAGATTTTACCTCTATATAAATGTTATTCTGAGGTATTTACTCTAGAGAGCTGAACTACTTCTTGGAAACAATAATGAGAACTAGTTCATGCTCATGAATCTGCTAGAACAATTAAATTCCCTTTAAATTGCTTGTATTTTTTGTATGGTAAGTCGGAGTTGGAGGTAAAAATCTAGTTCGAGACTTAGTTCAGTGTGATTATTTCAAATATGAACTTACCAACACTAAGTGGAGGTACCTATTTGGGAGGTACTTACACCCATTCCATAGTGAATATCTGTCAACCTTTATATTTTCATTTTAGAAAACCTGAAATGTTGTGAGGATTTGATAGTGATCATATCCTACATTTTATTTGAAAGTAGAAGCTGCACTTTTACATATGCTATAAACTTGGAAATTTGAAAACAAAGTCATGTCAGTCAATTTACTGGAGTCGGGGGCATATTTTCTATGCACACTGTGTATGAATATCCCTTAAGCCAAAAATGCATGAATATTTCTATTCCTCATAATATTTTGGATAGGCTCCTTTTATTTGACATGACTATGCTGATCAATAGCTGATGCGGAAGTGTGGTCATTGAAGATATATGACCATGTTATCTATCTTAGTGTCCTCTGTTGCAACTTTTCTAATATTGATGTTCTTAAACTGATGCAGGAATTACGTATATTTGAGGATCTTAAGGTAATGGTCATAGAGGATATGATATATCTGATACATGTGCAAGGACTTGCCGAACATGTTCATTCAACTTTAAATTCCAAATTAACACTGCTCCTTGTGGACATTGAACGGGACCCCCCCAAGGTTCACAGTGATCTTTGATTATCATCTTTTAATTTTTGAGGTCCTACATTTACCCTCTTTGCTGATTGCCTTCTGTCTTCTTATTTGCTTCAGATGTTAATAGATGCTGAGAAAAGTTCACTGGGTTTGCAGCTTAAATCAATTCAAAAGTTGTTCTTATCTTTATTTTCTCAAGATGAAATGGAAAGTGACCCATCACCATCACTTGGGGAAACCTTTATAACGGATATCAGATCCTCCATTCATGGGATTAGTTCTCAGTCGTCTGTCATTGATCTTAGTAACTTTTTGGAACATACTGAAATCACTCTGCCTACTTTAAATGGGTAATTCTTTACTTTTGTGTACCCCTTTATTATTCATTGCTTCCATAAGACTCATAGACTACCAAGAGGCACTTTTGATCTTTGTAGTTGCCAGTTAGGTTTGATCATTTTAGTTTAGGAGGATGCTTGAGCTCTACAGCCTCTAAGTGAGTCCAGAATCCATCAGATTGCATCGTCCGCTTAAATGTTTGGATGTAGTAGTAACTCAACATGACATCAGTTAACATTATTGGAGAACTTTATTGGAGAAGTGTGGATTGAAGCAGATAGAGAGGCTAATTTATCCATACGTAGAGAAACCACCCTCTGAGTAGAGAGACAATAGGAGATGGAATTTGGATCCTTCAGTAGACCTTTTTTAGGAAATCTCTTCTCATACAATTAGTTGGAGAAAGATTTTTGTAGCTATAGCCCTTTTGTAAGTAGTTCTCATTGGAGCAGTTTTTTGTTAGCTCTGAGGTTTCAGAAGTTTCATTATGAGTTTTGAACTCTGACTCTATCAATGAGAATGAATTACCCATTTCAAATGTAAATTAGTTT
AAAGAAATCAATTATCACTTTTCCCTGGAAGGCTCTTGTTTAGCTCTTCCCAGCCTCTAGGCAGTTATGCTCTGCTCTTTTTTAATATATCTATTCCAATGTTTCTTATTAGAAGGAAAAAGGAAATGAATGAGCATTCAATGTGATTTGTATAGTATGATTTAACTTCATTCTTTGATGAATTGAGTTTATCTGGCCCAGATGGCTTCTTGGCTATCCAATTGTTTACCTTTTTGACAAGGAGCACATATCTGAGGCTACTTACAATCTTTCTGCTAAGCCCCTTCACATCTTCAGATTATCAGTTAGCTGGTACGCCATCCCAAATTTCTTTTCTATCAGCGAAATAATAATATTAGAAAGTTCATTCAAGTAAAATTGTTTCAACTCTAAACATGATAAGAAACACTCTTAAAGTAAAAGTAGGAAAATAATTATTTATAATTTATAATCTTATTAAATAGCATTTCTTCGTTTCTGCATGGTTTCCATACTTAAGACACTGGCATAACCCTTCCATAATTCAAATCTATAGACGGTCCAAAATCAATTTTGAAAGAACAATTATCTAGTTGGATGACAAGCTTGTCACAAGTCTAAAACCCAATCCTTATATTATTTGGACTACTCGACTGATGGATGATCTTTTCGTGTTGTTGAGCGACAGTTCTTCAATTTGTAAGCAAAATTCTTTCCCCCTCTAACATCACATTAAATGGAGTCGTAAACCCTTGTTGCATACCTATCAGATAAAGAAAAATTAACTGAACGCTAGCTAATTTTCTTAGAACGAGTTTAGAATGACTTTTGGAACGTTGAAAAATGTTATTAGCTGATTTATAAAAACAATTTTCAAAATTTTTTCATATGGTTTAGTTAAATAATCAAAAGAAAATTCGAAAGTAAAATATAAATGAACTTATAAAAAAAACTTTTTTAGAAGAAGCATGATTTGAAAGTTTTATGAAAAAAAACTTTTATCAAAAGAAATTTAAAAGTAAAATATAATTAAAGAAATTTTGAAAGCAGAAACTTTGAAGTACTTATAGAAGAAGCATGATTAAAGTACTTGCAGAAGAAAGGGTTCTTCTAAAAAGTACTTATTCTTATTGAAGAAGTACTTCTAAAAACATTTTTCTCAAGAGGCCTTTCAACTTACTACACTACAGGAAAGGTGCCTCCACGAAAGGATCTCAGCTTGAAGAGCTTCTAAGGTAATTTTGTAAGTAAGGTTCAAATAGCTGCCTACCATTTTTTTGAAATATAGATATCGAGTATCTTGATATAGATTTCGAGTATCTTGTTAGGCTGAGAAATTCTCAGAACATCACTATTAATCTTTCTATTTGTTAAATAAAAGATGTGAGAGAGAGGGTGGAAAGCTTTGGCTGTTGCTTCACTATATAAATTTGTACAACAGTTTCACAGTGCCTTATGAACTGAGCATGAGAGGGGCGAAGGAGGCATGGGCGGAGGCATTTTTGGAAAGTATGCAGCAAAAGTGGGAGAGATGCAGTCAAGTGTGGGGTTCGTTGAGGATGGACGTCACTGAATGTCATGCACAGGCCATTGTCTTGTAAATTGTGATTCTGCCAACCAAGAAAAAGAAATGAACGAATTGTAATCTGTACATAAACACTGAATGAAACGGCATCCTATCTTGGCTGCCTTTTACTCCAAACGTCGATCTGCATTTTCAGGTATTCTTCTCTCCAACACTTGAATACTTCACTTTTCCTTTTCAACCTCATGGTTTAATTTTGTTATTTTTTGAGTCTTAGGTTAAAGAAATTATGCTGAATCTTCTAAGATTTGGAGTTGGTGTCTATTTGGTTGATGGTTTTTGGGTTGGTGTCTATCTAGTATATGAGATTCCAAATTTTGGAGATTTGGAAAATAGTTTTAAATAGTTCAGGTTGACCATTAGTTGATGAATGGAAAAATGACATGGAGTTTTTGAGTGGAAAATTTTAAATCAGTTGGCGAACTTTATTCATTTT
ATAATTTGTTCTTTTCTTATGTGCC

mRNA sequence

ATGAGACCAGTTGTGATGATAGACTATGGTGGGAAGATGCCTGAATTGCAACAGCGGCTTTGTGCACTTCTAAAACTAATTCAAAAGGTTCATGACTTACTTCTTGCTTGCATGTTGTATTGGATCATAAATTATGCTGCTCAATGTCTTTATATCGTTAACATCGTTTGGGCAGTTCATAAAATCAGTTTCTCTGTTTTATCAGAATTACGTATATTTGAGGATCTTAAGGTAATGGTCATAGAGGATATGATATATCTGATACATGTGCAAGGACTTGCCGAACATGTTCATTCAACTTTAAATTCCAAATTAACACTGCTCCTTGTGGACATTGAACGGGACCCCCCCAAGATGTTAATAGATGCTGAGAAAAGTTCACTGGGTTTGCAGCTTAAATCAATTCAAAAGTTGTTCTTATCTTTATTTTCTCAAGATGAAATGGAAAGTGACCCATCACCATCACTTGGGGAAACCTTTATAACGGATATCAGATCCTCCATTCATGGGATTAGTTCTCAGTCGTCTGTCATTGATCTTAGTAACTTTTTGGAACATACTGAAATCACTCTGCCTACTTTAAATGGATGGCTTCTTGGCTATCCAATTGTTTACCTTTTTGACAAGGAGCACATATCTGAGGCTACTTACAATCTTTCTGCTAAGCCCCTTCACATCTTCAGATTATCAGTTAGCTGGTACGCCATCCCAAATTTCTTTTCTATCAGCGAAATAATAATATTAGAAAGTTCATTCAAGAAAGGTGCCTCCACGAAAGGATCTCAGCTTGAAGAGCTTCTAAGTTTCACAGTGCCTTATGAACTGAGCATGAGAGGGGCGAAGGAGGCATGGGCGGAGGCATTTTTGGAAAGTATGCAGCAAAAGTGGGAGAGATGCAGTCAAGTGTGGGGTTCGTTGAGGATGGACGTCACTGAATGTCATGCACAGGCCATTGTCTTGTAAATTGTGATTCTGCCAACCAAGAAAAAGAAATGAACGAATTGTAATCTGTACATAAACACTGAATGAAACGGCATCCTATCTTGGCTGCCTTTTACTCCAAACGTCGATCTGCATTTTCAGGTATTCTTCTCTCCAACACTTGAATACTTCACTTTTCCTTTTCAACCTCATGGTTTAATTTTGTTATTTTTTGAGTCTTAGGTTAAAGAAATTATGCTGAATCTTCTAAGATTTGGAGTTGGTGTCTATTTGGTTGATGGTTTTTGGGTTGGTGTCTATCTAGTATATGAGATTCCAAATTTTGGAGATTTGGAAAATAGTTTTAAATAGTTCAGGTTGACCATTAGTTGATGAATGGAAAAATGACATGGAGTTTTTGAGTGGAAAATTTTAAATCAGTTGGCGAACTTTATTCATTTTATAATTTGTTCTTTTCTTATGTGCC

Coding sequence (CDS)

ATGAGACCAGTTGTGATGATAGACTATGGTGGGAAGATGCCTGAATTGCAACAGCGGCTTTGTGCACTTCTAAAACTAATTCAAAAGGTTCATGACTTACTTCTTGCTTGCATGTTGTATTGGATCATAAATTATGCTGCTCAATGTCTTTATATCGTTAACATCGTTTGGGCAGTTCATAAAATCAGTTTCTCTGTTTTATCAGAATTACGTATATTTGAGGATCTTAAGGTAATGGTCATAGAGGATATGATATATCTGATACATGTGCAAGGACTTGCCGAACATGTTCATTCAACTTTAAATTCCAAATTAACACTGCTCCTTGTGGACATTGAACGGGACCCCCCCAAGATGTTAATAGATGCTGAGAAAAGTTCACTGGGTTTGCAGCTTAAATCAATTCAAAAGTTGTTCTTATCTTTATTTTCTCAAGATGAAATGGAAAGTGACCCATCACCATCACTTGGGGAAACCTTTATAACGGATATCAGATCCTCCATTCATGGGATTAGTTCTCAGTCGTCTGTCATTGATCTTAGTAACTTTTTGGAACATACTGAAATCACTCTGCCTACTTTAAATGGATGGCTTCTTGGCTATCCAATTGTTTACCTTTTTGACAAGGAGCACATATCTGAGGCTACTTACAATCTTTCTGCTAAGCCCCTTCACATCTTCAGATTATCAGTTAGCTGGTACGCCATCCCAAATTTCTTTTCTATCAGCGAAATAATAATATTAGAAAGTTCATTCAAGAAAGGTGCCTCCACGAAAGGATCTCAGCTTGAAGAGCTTCTAAGTTTCACAGTGCCTTATGAACTGAGCATGAGAGGGGCGAAGGAGGCATGGGCGGAGGCATTTTTGGAAAGTATGCAGCAAAAGTGGGAGAGATGCAGTCAAGTGTGGGGTTCGTTGAGGATGGACGTCACTGAATGTCATGCACAGGCCATTGTCTTGTAA

Protein sequence

MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVHKISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKMLIDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDLSNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFFSISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCSQVWGSLRMDVTECHAQAIVL
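
The protein sequence above corresponds to the coding sequence read codon by codon. The sketch below illustrates this with a small translator, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAGACCAGTTGTGATG"))  # first six codons of the CDS: MRPVVM
print(translate("GCCATTGTCTTGTAA"))     # last five codons, ending in TAA: AIVL
```

Both fragments agree with the start (MRPVVM) and end (AIVL) of the listed protein sequence.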
Homology
BLAST of Clc10G15040 vs. NCBI nr
Match: XP_038904583.1 (uncharacterized protein LOC120090945 isoform X1 [Benincasa hispida])

HSP 1 Score: 454.5 bits (1168), Expect = 7.5e-124
Identity = 248/320 (77.50%), Positives = 259/320 (80.94%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLCALLKLIQK                               
Sbjct: 47  MRPVVMIDYGGKMPELQQRLCALLKLIQK------------------------------- 106

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                   ELR+F++LKVMVIEDMIYLIHVQGLAEH+HSTLNSKLTLLLVDIERDPPKML
Sbjct: 107 --------ELRLFQNLKVMVIEDMIYLIHVQGLAEHIHSTLNSKLTLLLVDIERDPPKML 166

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           IDAEK+SLGLQLKSIQKLF SLFSQ+EMESDPSPSLGET  TD RSSIHG SSQSSVIDL
Sbjct: 167 IDAEKNSLGLQLKSIQKLFSSLFSQEEMESDPSPSLGETCTTDTRSSIHGFSSQSSVIDL 226

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           SN LEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS     N F
Sbjct: 227 SNILEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-STSKNIF 286

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
               + +L   +++GASTKGSQ EELLSFTVPYELSMRGAKEAWAEAFL SMQQKWERCS
Sbjct: 287 LKRPLNLLH--YRRGASTKGSQPEELLSFTVPYELSMRGAKEAWAEAFLGSMQQKWERCS 324

Query: 301 QVWGSLRMDVTECHAQAIVL 321
           QVWGSLRMDVTECHAQAIVL
Sbjct: 347 QVWGSLRMDVTECHAQAIVL 324
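
The percentages in an HSP header are simply the matched-position counts divided by the alignment length. The following is a minimal sketch that parses the two header lines of the first hit and recomputes them; `parse_hsp` and the regular expressions are hypothetical helpers written for this page's text layout, not part of any BLAST tool.

```python
import re

HSP_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
# "Posi?tives" also accepts the misspelling "Postives" found in some exports.
FRACTION_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def parse_hsp(score_line, stats_line):
    """Extract bit score, raw score, E-value and recomputed percentages."""
    m = HSP_RE.search(score_line)
    if m is None:
        raise ValueError(f"unrecognised HSP line: {score_line!r}")
    result = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, matched, aligned in FRACTION_RE.findall(stats_line):
        key = "identity_pct" if label == "Identity" else "positives_pct"
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result

hsp = parse_hsp(
    "HSP 1 Score: 454.5 bits (1168), Expect = 7.5e-124",
    "Identity = 248/320 (77.50%), Positives = 259/320 (80.94%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])  # 77.5 80.94
```

The recomputed values match the percentages printed in the header (248/320 and 259/320).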

BLAST of Clc10G15040 vs. NCBI nr
Match: XP_038904585.1 (uncharacterized protein LOC120090945 isoform X3 [Benincasa hispida])

HSP 1 Score: 451.1 bits (1159), Expect = 8.2e-123
Identity = 246/320 (76.88%), Positives = 257/320 (80.31%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLCALLKLIQK                               
Sbjct: 47  MRPVVMIDYGGKMPELQQRLCALLKLIQK------------------------------- 106

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                   ELR+F++LKVMVIEDMIYLIHVQGLAEH+HSTLNSKLTLLLVDIERDPPKML
Sbjct: 107 --------ELRLFQNLKVMVIEDMIYLIHVQGLAEHIHSTLNSKLTLLLVDIERDPPKML 166

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           IDAEK+SLGLQLKSIQKLF SLFSQ+EMESDPSPSLGET  TD RSSIHG SSQSSVIDL
Sbjct: 167 IDAEKNSLGLQLKSIQKLFSSLFSQEEMESDPSPSLGETCTTDTRSSIHGFSSQSSVIDL 226

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           SN LEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS        
Sbjct: 227 SNILEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-------- 286

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
               + +L   +++GASTKGSQ EELLSFTVPYELSMRGAKEAWAEAFL SMQQKWERCS
Sbjct: 287 --RPLNLLH--YRRGASTKGSQPEELLSFTVPYELSMRGAKEAWAEAFLGSMQQKWERCS 315

Query: 301 QVWGSLRMDVTECHAQAIVL 321
           QVWGSLRMDVTECHAQAIVL
Sbjct: 347 QVWGSLRMDVTECHAQAIVL 315

BLAST of Clc10G15040 vs. NCBI nr
Match: XP_038904586.1 (uncharacterized protein LOC120090945 isoform X4 [Benincasa hispida])

HSP 1 Score: 450.3 bits (1157), Expect = 1.4e-122
Identity = 245/320 (76.56%), Positives = 253/320 (79.06%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLCALLKLIQK                               
Sbjct: 47  MRPVVMIDYGGKMPELQQRLCALLKLIQK------------------------------- 106

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                   ELR+F++LKVMVIEDMIYLIHVQGLAEH+HSTLNSKLTLLLVDIERDPPKML
Sbjct: 107 --------ELRLFQNLKVMVIEDMIYLIHVQGLAEHIHSTLNSKLTLLLVDIERDPPKML 166

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           IDAEK+SLGLQLKSIQKLF SLFSQ+EMESDPSPSLGET  TD RSSIHG SSQSSVIDL
Sbjct: 167 IDAEKNSLGLQLKSIQKLFSSLFSQEEMESDPSPSLGETCTTDTRSSIHGFSSQSSVIDL 226

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           SN LEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS        
Sbjct: 227 SNILEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-------- 286

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
                       ++GASTKGSQ EELLSFTVPYELSMRGAKEAWAEAFL SMQQKWERCS
Sbjct: 287 ------------RRGASTKGSQPEELLSFTVPYELSMRGAKEAWAEAFLGSMQQKWERCS 307

Query: 301 QVWGSLRMDVTECHAQAIVL 321
           QVWGSLRMDVTECHAQAIVL
Sbjct: 347 QVWGSLRMDVTECHAQAIVL 307

BLAST of Clc10G15040 vs. NCBI nr
Match: XP_008443345.1 (PREDICTED: UPF0739 protein C1orf74 homolog isoform X2 [Cucumis melo])

HSP 1 Score: 443.7 bits (1140), Expect = 1.3e-120
Identity = 240/320 (75.00%), Positives = 252/320 (78.75%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLCALLKLIQ                                
Sbjct: 43  MRPVVMIDYGGKMPELQQRLCALLKLIQ-------------------------------- 102

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                  +EL IFE+LKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIE+DPPKML
Sbjct: 103 -------TELHIFENLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIEQDPPKML 162

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           +DAEKSSLGLQLKSIQKLF SLFSQDE E DP PS+GET +TDIRSSIHGISSQSSVIDL
Sbjct: 163 VDAEKSSLGLQLKSIQKLFSSLFSQDETEGDPLPSVGETCVTDIRSSIHGISSQSSVIDL 222

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           SNFL+HTEITLPTLNGWLLGYPIVYLFDK+HISEATYNLSAKPLHIFRLSV+        
Sbjct: 223 SNFLQHTEITLPTLNGWLLGYPIVYLFDKDHISEATYNLSAKPLHIFRLSVN-------- 282

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
                       ++G STK SQLEELLSF+VPYELSMRG KEAWAEAFLESMQQKWERCS
Sbjct: 283 ------------RRGGSTKESQLEELLSFSVPYELSMRGEKEAWAEAFLESMQQKWERCS 303

Query: 301 QVWGSLRMDVTECHAQAIVL 321
           QVWGSLRMDVTECHAQAIVL
Sbjct: 343 QVWGSLRMDVTECHAQAIVL 303

BLAST of Clc10G15040 vs. NCBI nr
Match: XP_038904584.1 (uncharacterized protein LOC120090945 isoform X2 [Benincasa hispida])

HSP 1 Score: 443.4 bits (1139), Expect = 1.7e-120
Identity = 244/320 (76.25%), Positives = 253/320 (79.06%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLCALLKLIQK                               
Sbjct: 47  MRPVVMIDYGGKMPELQQRLCALLKLIQK------------------------------- 106

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                         +LKVMVIEDMIYLIHVQGLAEH+HSTLNSKLTLLLVDIERDPPKML
Sbjct: 107 --------------NLKVMVIEDMIYLIHVQGLAEHIHSTLNSKLTLLLVDIERDPPKML 166

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           IDAEK+SLGLQLKSIQKLF SLFSQ+EMESDPSPSLGET  TD RSSIHG SSQSSVIDL
Sbjct: 167 IDAEKNSLGLQLKSIQKLFSSLFSQEEMESDPSPSLGETCTTDTRSSIHGFSSQSSVIDL 226

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           SN LEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS     N F
Sbjct: 227 SNILEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-STSKNIF 286

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
               + +L   +++GASTKGSQ EELLSFTVPYELSMRGAKEAWAEAFL SMQQKWERCS
Sbjct: 287 LKRPLNLLH--YRRGASTKGSQPEELLSFTVPYELSMRGAKEAWAEAFLGSMQQKWERCS 318

Query: 301 QVWGSLRMDVTECHAQAIVL 321
           QVWGSLRMDVTECHAQAIVL
Sbjct: 347 QVWGSLRMDVTECHAQAIVL 318

BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match: A0A1S3B7V2 (UPF0739 protein C1orf74 homolog isoform X2 OS=Cucumis melo OX=3656 GN=LOC103486956 PE=4 SV=1)

HSP 1 Score: 443.7 bits (1140), Expect = 6.4e-121
Identity = 240/320 (75.00%), Positives = 252/320 (78.75%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLCALLKLIQ                                
Sbjct: 43  MRPVVMIDYGGKMPELQQRLCALLKLIQ-------------------------------- 102

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                  +EL IFE+LKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIE+DPPKML
Sbjct: 103 -------TELHIFENLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIEQDPPKML 162

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           +DAEKSSLGLQLKSIQKLF SLFSQDE E DP PS+GET +TDIRSSIHGISSQSSVIDL
Sbjct: 163 VDAEKSSLGLQLKSIQKLFSSLFSQDETEGDPLPSVGETCVTDIRSSIHGISSQSSVIDL 222

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           SNFL+HTEITLPTLNGWLLGYPIVYLFDK+HISEATYNLSAKPLHIFRLSV+        
Sbjct: 223 SNFLQHTEITLPTLNGWLLGYPIVYLFDKDHISEATYNLSAKPLHIFRLSVN-------- 282

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
                       ++G STK SQLEELLSF+VPYELSMRG KEAWAEAFLESMQQKWERCS
Sbjct: 283 ------------RRGGSTKESQLEELLSFSVPYELSMRGEKEAWAEAFLESMQQKWERCS 303

Query: 301 QVWGSLRMDVTECHAQAIVL 321
           QVWGSLRMDVTECHAQAIVL
Sbjct: 343 QVWGSLRMDVTECHAQAIVL 303

BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match: A0A5D3DPY3 (UPF0739 protein C1orf74-like protein isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G007510 PE=4 SV=1)

HSP 1 Score: 442.2 bits (1136), Expect = 1.9e-120
Identity = 239/319 (74.92%), Positives = 251/319 (78.68%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLCALLKLIQ                                
Sbjct: 43  MRPVVMIDYGGKMPELQQRLCALLKLIQ-------------------------------- 102

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                  +EL IFE+LKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIE+DPPKML
Sbjct: 103 -------TELHIFENLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIEQDPPKML 162

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           +DAEKSSLGLQLKSIQKLF SLFSQDE E DP PS+GET +TDIRSSIHGISSQSSVIDL
Sbjct: 163 VDAEKSSLGLQLKSIQKLFSSLFSQDETEGDPLPSVGETCVTDIRSSIHGISSQSSVIDL 222

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           SNFL+HTEITLPTLNGWLLGYPIVYLFDK+HISEATYNLSAKPLHIFRLSV+        
Sbjct: 223 SNFLQHTEITLPTLNGWLLGYPIVYLFDKDHISEATYNLSAKPLHIFRLSVN-------- 282

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
                       ++G STK SQLEELLSF+VPYELSMRG KEAWAEAFLESMQQKWERCS
Sbjct: 283 ------------RRGGSTKESQLEELLSFSVPYELSMRGEKEAWAEAFLESMQQKWERCS 302

Query: 301 QVWGSLRMDVTECHAQAIV 320
           QVWGSLRMDVTECHAQAIV
Sbjct: 343 QVWGSLRMDVTECHAQAIV 302

BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match: A0A0A0LCP2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G823610 PE=4 SV=1)

HSP 1 Score: 441.8 bits (1135), Expect = 2.4e-120
Identity = 240/320 (75.00%), Positives = 251/320 (78.44%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLCALLKLIQ                                
Sbjct: 43  MRPVVMIDYGGKMPELQQRLCALLKLIQ-------------------------------- 102

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                  +EL IFE+LKVMV+EDMIYLIHVQGLAEHVHSTLNSK TLLLVDIE+DPPKM+
Sbjct: 103 -------TELHIFENLKVMVMEDMIYLIHVQGLAEHVHSTLNSKFTLLLVDIEQDPPKMI 162

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           +DAEKSSLGLQLKSIQKLF SLFSQDE ES P PS+GET  TDIRSSIHGISSQSSVIDL
Sbjct: 163 VDAEKSSLGLQLKSIQKLFSSLFSQDETESGPLPSVGETCTTDIRSSIHGISSQSSVIDL 222

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           SNFL+HTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS        
Sbjct: 223 SNFLQHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-------- 282

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
                       ++G STK SQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS
Sbjct: 283 ------------RRGGSTKESQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 303

Query: 301 QVWGSLRMDVTECHAQAIVL 321
           QVWGSLRM+VTECHAQAIVL
Sbjct: 343 QVWGSLRMEVTECHAQAIVL 303

BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match: A0A6J1FAY6 (uncharacterized protein LOC111442366 OS=Cucurbita moschata OX=3662 GN=LOC111442366 PE=4 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 5.1e-110
Identity = 223/320 (69.69%), Positives = 239/320 (74.69%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLC+LL+LIQK                               
Sbjct: 44  MRPVVMIDYGGKMPELQQRLCSLLELIQK------------------------------- 103

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                   EL IFE+LKVM+IEDMIYLIHVQGL EHV S+LNS LTLLLVDIE+DPPKML
Sbjct: 104 --------ELHIFENLKVMIIEDMIYLIHVQGLGEHVQSSLNSNLTLLLVDIEQDPPKML 163

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           +DA++S LGLQ KSIQKLF SLFS DE ++DPS SLGE  +T+ RSS HGI SQSSVIDL
Sbjct: 164 VDADQSPLGLQFKSIQKLFSSLFSLDETKNDPSSSLGENHVTNTRSSFHGIYSQSSVIDL 223

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           +N LEH+EITLPTLNGWLLGYPIVYLF KEHISEATYNLSAKPLHIFRLSVS        
Sbjct: 224 TNHLEHSEITLPTLNGWLLGYPIVYLFHKEHISEATYNLSAKPLHIFRLSVS-------- 283

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
                       +  ASTKGSQLEELLSFTVPYELSM GAKEAWAEAFL SMQQKWERCS
Sbjct: 284 ------------RNDASTKGSQLEELLSFTVPYELSMGGAKEAWAEAFLASMQQKWERCS 304

Query: 301 QVWGSLRMDVTECHAQAIVL 321
            VWGSLRMDVTECHAQAIVL
Sbjct: 344 GVWGSLRMDVTECHAQAIVL 304

BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match: A0A6J1IB39 (uncharacterized protein LOC111471896 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111471896 PE=4 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 5.2e-107
Identity = 219/320 (68.44%), Positives = 236/320 (73.75%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQQRLC+LL+LIQK                               
Sbjct: 44  MRPVVMIDYGGKMPELQQRLCSLLELIQK------------------------------- 103

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                   EL IFE+LKVM+IEDMIYLIHVQGL EHV S+LNS LTLLLVDIE+DPPK+L
Sbjct: 104 --------ELHIFENLKVMIIEDMIYLIHVQGLGEHVQSSLNSNLTLLLVDIEQDPPKIL 163

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
           +DA++S LGLQ KSIQKLF SLFS DE ++DPS SLGE  +T+  SS HGI SQSSVIDL
Sbjct: 164 VDADQSPLGLQFKSIQKLFSSLFSLDETKNDPSSSLGENRVTNTTSSFHGIYSQSSVIDL 223

Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
           +N LEH+EITLPTLNGWLLGYPIVYLF KEHISEATYNLSAKPLHIFRLSVS        
Sbjct: 224 TNHLEHSEITLPTLNGWLLGYPIVYLFHKEHISEATYNLSAKPLHIFRLSVS-------- 283

Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
                       +  ASTKGSQLEELLSFTVP ELSM GAKEAWAEAFL  MQQKWERCS
Sbjct: 284 ------------RNDASTKGSQLEELLSFTVPCELSMGGAKEAWAEAFLARMQQKWERCS 304

Query: 301 QVWGSLRMDVTECHAQAIVL 321
            VWGSLRMDVTECHAQAIVL
Sbjct: 344 GVWGSLRMDVTECHAQAIVL 304

BLAST of Clc10G15040 vs. TAIR 10
Match: AT3G59490.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 251.1 bits (640), Expect = 1.2e-66
Identity = 146/321 (45.48%), Positives = 192/321 (59.81%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQ RL +LL+LI++                               
Sbjct: 43  MRPVVMIDYGGKMPELQNRLLSLLELIRE------------------------------- 102

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                    L +F+DLKVMVIEDMIYLI+V+ L + V S+L+S+  L  +D+E+DPPKM+
Sbjct: 103 --------GLPVFKDLKVMVIEDMIYLINVRSLPKFVSSSLDSEPELFFIDLEQDPPKMV 162

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSV-ID 180
             +++S+LG+QL+SIQKLF S F  D+  +D +  L E             SSQ+S+ ID
Sbjct: 163 TQSKESNLGMQLRSIQKLFSSTFPLDDSNTDTTTVLDE-----------ANSSQTSLCID 222

Query: 181 LSNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNF 240
           LS  L+ T++T+PTLNGWLL YP+VYLF  +HI EA YNLS K L +F++ V        
Sbjct: 223 LSCCLQDTKVTIPTLNGWLLDYPVVYLFGTDHIEEAIYNLSTKSLRLFKVLVC------- 282

Query: 241 FSISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERC 300
                        + G + K S LEEL SF+VPY+LSM G+KE WAE FLE M  +WE C
Sbjct: 283 -------------RNGTTEKDSHLEELTSFSVPYDLSMEGSKEVWAEKFLERMSSRWEEC 293

Query: 301 SQVWGSLRMDVTECHAQAIVL 321
             +W SL + V+EC+ QAIVL
Sbjct: 343 KHIWRSLDLQVSECYPQAIVL 293

BLAST of Clc10G15040 vs. TAIR 10
Match: AT3G59490.1 (unknown protein; Has 30 Blast hits to 30 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 28; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 181.4 bits (459), Expect = 1.1e-45
Identity = 115/269 (42.75%), Positives = 153/269 (56.88%), Query Frame = 0

Query: 1   MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
           MRPVVMIDYGGKMPELQ RL +LL+LI++                               
Sbjct: 43  MRPVVMIDYGGKMPELQNRLLSLLELIRE------------------------------- 102

Query: 61  KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
                    L +F+DLKVMVIEDMIYLI+V+ L + V S+L+S+  L  +D+E+DPPKM+
Sbjct: 103 --------GLPVFKDLKVMVIEDMIYLINVRSLPKFVSSSLDSEPELFFIDLEQDPPKMV 162

Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSV-ID 180
             +++S+LG+QL+SIQKLF S F  D+  +D +  L E             SSQ+S+ ID
Sbjct: 163 TQSKESNLGMQLRSIQKLFSSTFPLDDSNTDTTTVLDE-----------ANSSQTSLCID 222

Query: 181 LSNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNF 240
           LS  L+ T++T+PTLNGWLL YP+VYLF  +HI EA YNLS K L +F++ V        
Sbjct: 223 LSCCLQDTKVTIPTLNGWLLDYPVVYLFGTDHIEEAIYNLSTKSLRLFKVLVC------- 241

Query: 241 FSISEIIILESSFKKGASTKGSQLEELLS 269
                        + G + K S LEEL S
Sbjct: 283 -------------RNGTTEKDSHLEELTS 241

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038904583.1  7.5e-124  77.50         uncharacterized protein LOC120090945 isoform X1 [Benincasa hispida]
XP_038904585.1  8.2e-123  76.88         uncharacterized protein LOC120090945 isoform X3 [Benincasa hispida]
XP_038904586.1  1.4e-122  76.56         uncharacterized protein LOC120090945 isoform X4 [Benincasa hispida]
XP_008443345.1  1.3e-120  75.00         PREDICTED: UPF0739 protein C1orf74 homolog isoform X2 [Cucumis melo]
XP_038904584.1  1.7e-120  76.25         uncharacterized protein LOC120090945 isoform X2 [Benincasa hispida]

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A1S3B7V2      6.4e-121  75.00         UPF0739 protein C1orf74 homolog isoform X2 OS=Cucumis melo OX=3656 GN=LOC1034869...
A0A5D3DPY3      1.9e-120  74.92         UPF0739 protein C1orf74-like protein isoform X2 OS=Cucumis melo var. makuwa OX=1...
A0A0A0LCP2      2.4e-120  75.00         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G823610 PE=4 SV=1
A0A6J1FAY6      5.1e-110  69.69         uncharacterized protein LOC111442366 OS=Cucurbita moschata OX=3662 GN=LOC1114423...
A0A6J1IB39      5.2e-107  68.44         uncharacterized protein LOC111471896 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT3G59490.2     1.2e-66   45.48         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G59490.1     1.1e-45   42.75         unknown protein; Has 30 Blast hits to 30 proteins in 12 species: Archae - 0; Bac...

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                      Source   Source Term  Source Description       Alignment
IPR027850  Protein of unknown function DUF4504  PFAM     PF14953      DUF4504                  coord: 1..320; e-value: 5.5E-70; score: 235.8
IPR027850  Protein of unknown function DUF4504  PANTHER  PTHR31366    UPF0739 PROTEIN C1ORF74  coord: 1..320

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc10G15040.2  Clc10G15040.2  mRNA