Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS, three_prime_UTR
ATGAGACCAGTTGTGATGATAGACTATGGTGGGAAGATGCCTGAATTGCAACAGCGGCTTTGTGCACTTCTAAAACTAATTCAAAAGGTTCATGACTTACTTCTTGCTTGCATGTTGTATTGGATCATAAATTATGCTGCTCAATGTCTTTATATCGTTAACATCGTTTGGGCAGTTCATAAAATCAGTTTCTCTGTTTTATCAGTAAGGGGAATAAGACTGAAGCCTGTAAAAATTATCACCACTTTGAATACAAAATCTCACATAGCAACAACTAAGGCGGGGTCATATAAAGGTTTTAATCTTGTAAAATGACGTGGAGATTTTACCTCTATATAAATGTTATTCTGAGGTATTTACTCTAGAGAGCTGAACTACTTCTTGGAAACAATAATGAGAACTAGTTCATGCTCATGAATCTGCTAGAACAATTAAATTCCCTTTAAATTGCTTGTATTTTTTGTATGGTAAGTCGGAGTTGGAGGTAAAAATCTAGTTCGAGACTTAGTTCAGTGTGATTATTTCAAATATGAACTTACCAACACTAAGTGGAGGTACCTATTTGGGAGGTACTTACACCCATTCCATAGTGAATATCTGTCAACCTTTATATTTTCATTTTAGAAAACCTGAAATGTTGTGAGGATTTGATAGTGATCATATCCTACATTTTATTTGAAAGTAGAAGCTGCACTTTTACATATGCTATAAACTTGGAAATTTGAAAACAAAGTCATGTCAGTCAATTTACTGGAGTCGGGGGCATATTTTCTATGCACACTGTGTATGAATATCCCTTAAGCCAAAAATGCATGAATATTTCTATTCCTCATAATATTTTGGATAGGCTCCTTTTATTTGACATGACTATGCTGATCAATAGCTGATGCGGAAGTGTGGTCATTGAAGATATATGACCATGTTATCTATCTTAGTGTCCTCTGTTGCAACTTTTCTAATATTGATGTTCTTAAACTGATGCAGGAATTACGTATATTTGAGGATCTTAAGGTAATGGTCATAGAGGATATGATATATCTGATACATGTGCAAGGACTTGCCGAACATGTTCATTCAACTTTAAATTCCAAATTAACACTGCTCCTTGTGGACATTGAACGGGACCCCCCCAAGGTTCACAGTGATCTTTGATTATCATCTTTTAATTTTTGAGGTCCTACATTTACCCTCTTTGCTGATTGCCTTCTGTCTTCTTATTTGCTTCAGATGTTAATAGATGCTGAGAAAAGTTCACTGGGTTTGCAGCTTAAATCAATTCAAAAGTTGTTCTTATCTTTATTTTCTCAAGATGAAATGGAAAGTGACCCATCACCATCACTTGGGGAAACCTTTATAACGGATATCAGATCCTCCATTCATGGGATTAGTTCTCAGTCGTCTGTCATTGATCTTAGTAACTTTTTGGAACATACTGAAATCACTCTGCCTACTTTAAATGGGTAATTCTTTACTTTTGTGTACCCCTTTATTATTCATTGCTTCCATAAGACTCATAGACTACCAAGAGGCACTTTTGATCTTTGTAGTTGCCAGTTAGGTTTGATCATTTTAGTTTAGGAGGATGCTTGAGCTCTACAGCCTCTAAGTGAGTCCAGAATCCATCAGATTGCATCGTCCGCTTAAATGTTTGGATGTAGTAGTAACTCAACATGACATCAGTTAACATTATTGGAGAACTTTATTGGAGAAGTGTGGATTGAAGCAGATAGAGAGGCTAATTTATCCATACGTAGAGAAACCACCCTCTGAGTAGAGAGACAATAGGAGATGGAATTTGGATCCTTCAGTAGACCTTTTTTAGGAAATCTCTTCTCATACAATTAGTTGGAGAAAGATTTTTGTAGCTATAGCCCTTTTGTAAGTAGTTCTCATTGGAGCAGTTTTTTGTTAGCTCTGAGGTTTCAGAAGTTTCATTATGAGTTTTGAACTCTGACTCTATCAATGAGAATGAATTACCCATTTCAAATGTAAATTAGTTT
AAAGAAATCAATTATCACTTTTCCCTGGAAGGCTCTTGTTTAGCTCTTCCCAGCCTCTAGGCAGTTATGCTCTGCTCTTTTTTAATATATCTATTCCAATGTTTCTTATTAGAAGGAAAAAGGAAATGAATGAGCATTCAATGTGATTTGTATAGTATGATTTAACTTCATTCTTTGATGAATTGAGTTTATCTGGCCCAGATGGCTTCTTGGCTATCCAATTGTTTACCTTTTTGACAAGGAGCACATATCTGAGGCTACTTACAATCTTTCTGCTAAGCCCCTTCACATCTTCAGATTATCAGTTAGCTGGTACGCCATCCCAAATTTCTTTTCTATCAGCGAAATAATAATATTAGAAAGTTCATTCAAGTAAAATTGTTTCAACTCTAAACATGATAAGAAACACTCTTAAAGTAAAAGTAGGAAAATAATTATTTATAATTTATAATCTTATTAAATAGCATTTCTTCGTTTCTGCATGGTTTCCATACTTAAGACACTGGCATAACCCTTCCATAATTCAAATCTATAGACGGTCCAAAATCAATTTTGAAAGAACAATTATCTAGTTGGATGACAAGCTTGTCACAAGTCTAAAACCCAATCCTTATATTATTTGGACTACTCGACTGATGGATGATCTTTTCGTGTTGTTGAGCGACAGTTCTTCAATTTGTAAGCAAAATTCTTTCCCCCTCTAACATCACATTAAATGGAGTCGTAAACCCTTGTTGCATACCTATCAGATAAAGAAAAATTAACTGAACGCTAGCTAATTTTCTTAGAACGAGTTTAGAATGACTTTTGGAACGTTGAAAAATGTTATTAGCTGATTTATAAAAACAATTTTCAAAATTTTTTCATATGGTTTAGTTAAATAATCAAAAGAAAATTCGAAAGTAAAATATAAATGAACTTATAAAAAAAACTTTTTTAGAAGAAGCATGATTTGAAAGTTTTATGAAAAAAAACTTTTATCAAAAGAAATTTAAAAGTAAAATATAATTAAAGAAATTTTGAAAGCAGAAACTTTGAAGTACTTATAGAAGAAGCATGATTAAAGTACTTGCAGAAGAAAGGGTTCTTCTAAAAAGTACTTATTCTTATTGAAGAAGTACTTCTAAAAACATTTTTCTCAAGAGGCCTTTCAACTTACTACACTACAGGAAAGGTGCCTCCACGAAAGGATCTCAGCTTGAAGAGCTTCTAAGGTAATTTTGTAAGTAAGGTTCAAATAGCTGCCTACCATTTTTTTGAAATATAGATATCGAGTATCTTGATATAGATTTCGAGTATCTTGTTAGGCTGAGAAATTCTCAGAACATCACTATTAATCTTTCTATTTGTTAAATAAAAGATGTGAGAGAGAGGGTGGAAAGCTTTGGCTGTTGCTTCACTATATAAATTTGTACAACAGTTTCACAGTGCCTTATGAACTGAGCATGAGAGGGGCGAAGGAGGCATGGGCGGAGGCATTTTTGGAAAGTATGCAGCAAAAGTGGGAGAGATGCAGTCAAGTGTGGGGTTCGTTGAGGATGGACGTCACTGAATGTCATGCACAGGCCATTGTCTTGTAAATTGTGATTCTGCCAACCAAGAAAAAGAAATGAACGAATTGTAATCTGTACATAAACACTGAATGAAACGGCATCCTATCTTGGCTGCCTTTTACTCCAAACGTCGATCTGCATTTTCAGGTATTCTTCTCTCCAACACTTGAATACTTCACTTTTCCTTTTCAACCTCATGGTTTAATTTTGTTATTTTTTGAGTCTTAGGTTAAAGAAATTATGCTGAATCTTCTAAGATTTGGAGTTGGTGTCTATTTGGTTGATGGTTTTTGGGTTGGTGTCTATCTAGTATATGAGATTCCAAATTTTGGAGATTTGGAAAATAGTTTTAAATAGTTCAGGTTGACCATTAGTTGATGAATGGAAAAATGACATGGAGTTTTTGAGTGGAAAATTTTAAATCAGTTGGCGAACTTTATTCATTTT
ATAATTTGTTCTTTTCTTATGTGCC
mRNA sequence
ATGAGACCAGTTGTGATGATAGACTATGGTGGGAAGATGCCTGAATTGCAACAGCGGCTTTGTGCACTTCTAAAACTAATTCAAAAGGTTCATGACTTACTTCTTGCTTGCATGTTGTATTGGATCATAAATTATGCTGCTCAATGTCTTTATATCGTTAACATCGTTTGGGCAGTTCATAAAATCAGTTTCTCTGTTTTATCAGAATTACGTATATTTGAGGATCTTAAGGTAATGGTCATAGAGGATATGATATATCTGATACATGTGCAAGGACTTGCCGAACATGTTCATTCAACTTTAAATTCCAAATTAACACTGCTCCTTGTGGACATTGAACGGGACCCCCCCAAGATGTTAATAGATGCTGAGAAAAGTTCACTGGGTTTGCAGCTTAAATCAATTCAAAAGTTGTTCTTATCTTTATTTTCTCAAGATGAAATGGAAAGTGACCCATCACCATCACTTGGGGAAACCTTTATAACGGATATCAGATCCTCCATTCATGGGATTAGTTCTCAGTCGTCTGTCATTGATCTTAGTAACTTTTTGGAACATACTGAAATCACTCTGCCTACTTTAAATGGATGGCTTCTTGGCTATCCAATTGTTTACCTTTTTGACAAGGAGCACATATCTGAGGCTACTTACAATCTTTCTGCTAAGCCCCTTCACATCTTCAGATTATCAGTTAGCTGGTACGCCATCCCAAATTTCTTTTCTATCAGCGAAATAATAATATTAGAAAGTTCATTCAAGAAAGGTGCCTCCACGAAAGGATCTCAGCTTGAAGAGCTTCTAAGTTTCACAGTGCCTTATGAACTGAGCATGAGAGGGGCGAAGGAGGCATGGGCGGAGGCATTTTTGGAAAGTATGCAGCAAAAGTGGGAGAGATGCAGTCAAGTGTGGGGTTCGTTGAGGATGGACGTCACTGAATGTCATGCACAGGCCATTGTCTTGTAAATTGTGATTCTGCCAACCAAGAAAAAGAAATGAACGAATTGTAATCTGTACATAAACACTGAATGAAACGGCATCCTATCTTGGCTGCCTTTTACTCCAAACGTCGATCTGCATTTTCAGGTATTCTTCTCTCCAACACTTGAATACTTCACTTTTCCTTTTCAACCTCATGGTTTAATTTTGTTATTTTTTGAGTCTTAGGTTAAAGAAATTATGCTGAATCTTCTAAGATTTGGAGTTGGTGTCTATTTGGTTGATGGTTTTTGGGTTGGTGTCTATCTAGTATATGAGATTCCAAATTTTGGAGATTTGGAAAATAGTTTTAAATAGTTCAGGTTGACCATTAGTTGATGAATGGAAAAATGACATGGAGTTTTTGAGTGGAAAATTTTAAATCAGTTGGCGAACTTTATTCATTTTATAATTTGTTCTTTTCTTATGTGCC
Coding sequence (CDS)
ATGAGACCAGTTGTGATGATAGACTATGGTGGGAAGATGCCTGAATTGCAACAGCGGCTTTGTGCACTTCTAAAACTAATTCAAAAGGTTCATGACTTACTTCTTGCTTGCATGTTGTATTGGATCATAAATTATGCTGCTCAATGTCTTTATATCGTTAACATCGTTTGGGCAGTTCATAAAATCAGTTTCTCTGTTTTATCAGAATTACGTATATTTGAGGATCTTAAGGTAATGGTCATAGAGGATATGATATATCTGATACATGTGCAAGGACTTGCCGAACATGTTCATTCAACTTTAAATTCCAAATTAACACTGCTCCTTGTGGACATTGAACGGGACCCCCCCAAGATGTTAATAGATGCTGAGAAAAGTTCACTGGGTTTGCAGCTTAAATCAATTCAAAAGTTGTTCTTATCTTTATTTTCTCAAGATGAAATGGAAAGTGACCCATCACCATCACTTGGGGAAACCTTTATAACGGATATCAGATCCTCCATTCATGGGATTAGTTCTCAGTCGTCTGTCATTGATCTTAGTAACTTTTTGGAACATACTGAAATCACTCTGCCTACTTTAAATGGATGGCTTCTTGGCTATCCAATTGTTTACCTTTTTGACAAGGAGCACATATCTGAGGCTACTTACAATCTTTCTGCTAAGCCCCTTCACATCTTCAGATTATCAGTTAGCTGGTACGCCATCCCAAATTTCTTTTCTATCAGCGAAATAATAATATTAGAAAGTTCATTCAAGAAAGGTGCCTCCACGAAAGGATCTCAGCTTGAAGAGCTTCTAAGTTTCACAGTGCCTTATGAACTGAGCATGAGAGGGGCGAAGGAGGCATGGGCGGAGGCATTTTTGGAAAGTATGCAGCAAAAGTGGGAGAGATGCAGTCAAGTGTGGGGTTCGTTGAGGATGGACGTCACTGAATGTCATGCACAGGCCATTGTCTTGTAA
Protein sequence
MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVHKISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKMLIDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDLSNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFFSISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCSQVWGSLRMDVTECHAQAIVL
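The protein sequence above can be cross-checked against the coding sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the short test fragments are illustrative and were copied from the start and end of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS (DNA, reading frame 0) up to the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons and last five codons of the CDS shown above.
print(translate("ATGAGACCAGTTGTGATG"))  # MRPVVM
print(translate("GCCATTGTCTTGTAA"))     # AIVL
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.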
Homology
BLAST of Clc10G15040 vs. NCBI nr
Match:
XP_038904583.1 (uncharacterized protein LOC120090945 isoform X1 [Benincasa hispida])
HSP 1 Score: 454.5 bits (1168), Expect = 7.5e-124
Identity = 248/320 (77.50%), Positives = 259/320 (80.94%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLCALLKLIQK
Sbjct: 47 MRPVVMIDYGGKMPELQQRLCALLKLIQK------------------------------- 106
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
ELR+F++LKVMVIEDMIYLIHVQGLAEH+HSTLNSKLTLLLVDIERDPPKML
Sbjct: 107 --------ELRLFQNLKVMVIEDMIYLIHVQGLAEHIHSTLNSKLTLLLVDIERDPPKML 166
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
IDAEK+SLGLQLKSIQKLF SLFSQ+EMESDPSPSLGET TD RSSIHG SSQSSVIDL
Sbjct: 167 IDAEKNSLGLQLKSIQKLFSSLFSQEEMESDPSPSLGETCTTDTRSSIHGFSSQSSVIDL 226
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
SN LEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS N F
Sbjct: 227 SNILEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-STSKNIF 286
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
+ +L +++GASTKGSQ EELLSFTVPYELSMRGAKEAWAEAFL SMQQKWERCS
Sbjct: 287 LKRPLNLLH--YRRGASTKGSQPEELLSFTVPYELSMRGAKEAWAEAFLGSMQQKWERCS 324
Query: 301 QVWGSLRMDVTECHAQAIVL 321
QVWGSLRMDVTECHAQAIVL
Sbjct: 347 QVWGSLRMDVTECHAQAIVL 324
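The identity and positives percentages reported for each HSP are simply the match counts divided by the alignment length. The sketch below, which assumes the statistics line has the layout shown in this page, parses such a line and recomputes both percentages; the function name `parse_hsp_stats` and the regular expression are illustrative, and the pattern accepts either spelling of "Positives" that may appear in exported output.

```python
import re

# Pattern for an HSP statistics line as it appears on this page.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + line)
    identical, length, _, positives, _, _ = match.groups()
    identical, length, positives = int(identical), int(length), int(positives)
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 248/320 (77.50%), Positives = 259/320 (80.94%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the first hit this reproduces the reported values: 248/320 gives 77.50% identity and 259/320 gives 80.94% positives.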
BLAST of Clc10G15040 vs. NCBI nr
Match:
XP_038904585.1 (uncharacterized protein LOC120090945 isoform X3 [Benincasa hispida])
HSP 1 Score: 451.1 bits (1159), Expect = 8.2e-123
Identity = 246/320 (76.88%), Positives = 257/320 (80.31%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLCALLKLIQK
Sbjct: 47 MRPVVMIDYGGKMPELQQRLCALLKLIQK------------------------------- 106
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
ELR+F++LKVMVIEDMIYLIHVQGLAEH+HSTLNSKLTLLLVDIERDPPKML
Sbjct: 107 --------ELRLFQNLKVMVIEDMIYLIHVQGLAEHIHSTLNSKLTLLLVDIERDPPKML 166
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
IDAEK+SLGLQLKSIQKLF SLFSQ+EMESDPSPSLGET TD RSSIHG SSQSSVIDL
Sbjct: 167 IDAEKNSLGLQLKSIQKLFSSLFSQEEMESDPSPSLGETCTTDTRSSIHGFSSQSSVIDL 226
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
SN LEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS
Sbjct: 227 SNILEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-------- 286
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
+ +L +++GASTKGSQ EELLSFTVPYELSMRGAKEAWAEAFL SMQQKWERCS
Sbjct: 287 --RPLNLLH--YRRGASTKGSQPEELLSFTVPYELSMRGAKEAWAEAFLGSMQQKWERCS 315
Query: 301 QVWGSLRMDVTECHAQAIVL 321
QVWGSLRMDVTECHAQAIVL
Sbjct: 347 QVWGSLRMDVTECHAQAIVL 315
BLAST of Clc10G15040 vs. NCBI nr
Match:
XP_038904586.1 (uncharacterized protein LOC120090945 isoform X4 [Benincasa hispida])
HSP 1 Score: 450.3 bits (1157), Expect = 1.4e-122
Identity = 245/320 (76.56%), Positives = 253/320 (79.06%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLCALLKLIQK
Sbjct: 47 MRPVVMIDYGGKMPELQQRLCALLKLIQK------------------------------- 106
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
ELR+F++LKVMVIEDMIYLIHVQGLAEH+HSTLNSKLTLLLVDIERDPPKML
Sbjct: 107 --------ELRLFQNLKVMVIEDMIYLIHVQGLAEHIHSTLNSKLTLLLVDIERDPPKML 166
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
IDAEK+SLGLQLKSIQKLF SLFSQ+EMESDPSPSLGET TD RSSIHG SSQSSVIDL
Sbjct: 167 IDAEKNSLGLQLKSIQKLFSSLFSQEEMESDPSPSLGETCTTDTRSSIHGFSSQSSVIDL 226
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
SN LEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS
Sbjct: 227 SNILEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-------- 286
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
++GASTKGSQ EELLSFTVPYELSMRGAKEAWAEAFL SMQQKWERCS
Sbjct: 287 ------------RRGASTKGSQPEELLSFTVPYELSMRGAKEAWAEAFLGSMQQKWERCS 307
Query: 301 QVWGSLRMDVTECHAQAIVL 321
QVWGSLRMDVTECHAQAIVL
Sbjct: 347 QVWGSLRMDVTECHAQAIVL 307
BLAST of Clc10G15040 vs. NCBI nr
Match:
XP_008443345.1 (PREDICTED: UPF0739 protein C1orf74 homolog isoform X2 [Cucumis melo])
HSP 1 Score: 443.7 bits (1140), Expect = 1.3e-120
Identity = 240/320 (75.00%), Positives = 252/320 (78.75%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLCALLKLIQ
Sbjct: 43 MRPVVMIDYGGKMPELQQRLCALLKLIQ-------------------------------- 102
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
+EL IFE+LKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIE+DPPKML
Sbjct: 103 -------TELHIFENLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIEQDPPKML 162
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
+DAEKSSLGLQLKSIQKLF SLFSQDE E DP PS+GET +TDIRSSIHGISSQSSVIDL
Sbjct: 163 VDAEKSSLGLQLKSIQKLFSSLFSQDETEGDPLPSVGETCVTDIRSSIHGISSQSSVIDL 222
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
SNFL+HTEITLPTLNGWLLGYPIVYLFDK+HISEATYNLSAKPLHIFRLSV+
Sbjct: 223 SNFLQHTEITLPTLNGWLLGYPIVYLFDKDHISEATYNLSAKPLHIFRLSVN-------- 282
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
++G STK SQLEELLSF+VPYELSMRG KEAWAEAFLESMQQKWERCS
Sbjct: 283 ------------RRGGSTKESQLEELLSFSVPYELSMRGEKEAWAEAFLESMQQKWERCS 303
Query: 301 QVWGSLRMDVTECHAQAIVL 321
QVWGSLRMDVTECHAQAIVL
Sbjct: 343 QVWGSLRMDVTECHAQAIVL 303
BLAST of Clc10G15040 vs. NCBI nr
Match:
XP_038904584.1 (uncharacterized protein LOC120090945 isoform X2 [Benincasa hispida])
HSP 1 Score: 443.4 bits (1139), Expect = 1.7e-120
Identity = 244/320 (76.25%), Positives = 253/320 (79.06%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLCALLKLIQK
Sbjct: 47 MRPVVMIDYGGKMPELQQRLCALLKLIQK------------------------------- 106
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
+LKVMVIEDMIYLIHVQGLAEH+HSTLNSKLTLLLVDIERDPPKML
Sbjct: 107 --------------NLKVMVIEDMIYLIHVQGLAEHIHSTLNSKLTLLLVDIERDPPKML 166
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
IDAEK+SLGLQLKSIQKLF SLFSQ+EMESDPSPSLGET TD RSSIHG SSQSSVIDL
Sbjct: 167 IDAEKNSLGLQLKSIQKLFSSLFSQEEMESDPSPSLGETCTTDTRSSIHGFSSQSSVIDL 226
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
SN LEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS N F
Sbjct: 227 SNILEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-STSKNIF 286
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
+ +L +++GASTKGSQ EELLSFTVPYELSMRGAKEAWAEAFL SMQQKWERCS
Sbjct: 287 LKRPLNLLH--YRRGASTKGSQPEELLSFTVPYELSMRGAKEAWAEAFLGSMQQKWERCS 318
Query: 301 QVWGSLRMDVTECHAQAIVL 321
QVWGSLRMDVTECHAQAIVL
Sbjct: 347 QVWGSLRMDVTECHAQAIVL 318
BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match:
A0A1S3B7V2 (UPF0739 protein C1orf74 homolog isoform X2 OS=Cucumis melo OX=3656 GN=LOC103486956 PE=4 SV=1)
HSP 1 Score: 443.7 bits (1140), Expect = 6.4e-121
Identity = 240/320 (75.00%), Positives = 252/320 (78.75%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLCALLKLIQ
Sbjct: 43 MRPVVMIDYGGKMPELQQRLCALLKLIQ-------------------------------- 102
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
+EL IFE+LKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIE+DPPKML
Sbjct: 103 -------TELHIFENLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIEQDPPKML 162
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
+DAEKSSLGLQLKSIQKLF SLFSQDE E DP PS+GET +TDIRSSIHGISSQSSVIDL
Sbjct: 163 VDAEKSSLGLQLKSIQKLFSSLFSQDETEGDPLPSVGETCVTDIRSSIHGISSQSSVIDL 222
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
SNFL+HTEITLPTLNGWLLGYPIVYLFDK+HISEATYNLSAKPLHIFRLSV+
Sbjct: 223 SNFLQHTEITLPTLNGWLLGYPIVYLFDKDHISEATYNLSAKPLHIFRLSVN-------- 282
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
++G STK SQLEELLSF+VPYELSMRG KEAWAEAFLESMQQKWERCS
Sbjct: 283 ------------RRGGSTKESQLEELLSFSVPYELSMRGEKEAWAEAFLESMQQKWERCS 303
Query: 301 QVWGSLRMDVTECHAQAIVL 321
QVWGSLRMDVTECHAQAIVL
Sbjct: 343 QVWGSLRMDVTECHAQAIVL 303
BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match:
A0A5D3DPY3 (UPF0739 protein C1orf74-like protein isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G007510 PE=4 SV=1)
HSP 1 Score: 442.2 bits (1136), Expect = 1.9e-120
Identity = 239/319 (74.92%), Positives = 251/319 (78.68%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLCALLKLIQ
Sbjct: 43 MRPVVMIDYGGKMPELQQRLCALLKLIQ-------------------------------- 102
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
+EL IFE+LKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIE+DPPKML
Sbjct: 103 -------TELHIFENLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIEQDPPKML 162
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
+DAEKSSLGLQLKSIQKLF SLFSQDE E DP PS+GET +TDIRSSIHGISSQSSVIDL
Sbjct: 163 VDAEKSSLGLQLKSIQKLFSSLFSQDETEGDPLPSVGETCVTDIRSSIHGISSQSSVIDL 222
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
SNFL+HTEITLPTLNGWLLGYPIVYLFDK+HISEATYNLSAKPLHIFRLSV+
Sbjct: 223 SNFLQHTEITLPTLNGWLLGYPIVYLFDKDHISEATYNLSAKPLHIFRLSVN-------- 282
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
++G STK SQLEELLSF+VPYELSMRG KEAWAEAFLESMQQKWERCS
Sbjct: 283 ------------RRGGSTKESQLEELLSFSVPYELSMRGEKEAWAEAFLESMQQKWERCS 302
Query: 301 QVWGSLRMDVTECHAQAIV 320
QVWGSLRMDVTECHAQAIV
Sbjct: 343 QVWGSLRMDVTECHAQAIV 302
BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match:
A0A0A0LCP2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G823610 PE=4 SV=1)
HSP 1 Score: 441.8 bits (1135), Expect = 2.4e-120
Identity = 240/320 (75.00%), Positives = 251/320 (78.44%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLCALLKLIQ
Sbjct: 43 MRPVVMIDYGGKMPELQQRLCALLKLIQ-------------------------------- 102
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
+EL IFE+LKVMV+EDMIYLIHVQGLAEHVHSTLNSK TLLLVDIE+DPPKM+
Sbjct: 103 -------TELHIFENLKVMVMEDMIYLIHVQGLAEHVHSTLNSKFTLLLVDIEQDPPKMI 162
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
+DAEKSSLGLQLKSIQKLF SLFSQDE ES P PS+GET TDIRSSIHGISSQSSVIDL
Sbjct: 163 VDAEKSSLGLQLKSIQKLFSSLFSQDETESGPLPSVGETCTTDIRSSIHGISSQSSVIDL 222
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
SNFL+HTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS
Sbjct: 223 SNFLQHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVS-------- 282
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
++G STK SQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS
Sbjct: 283 ------------RRGGSTKESQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 303
Query: 301 QVWGSLRMDVTECHAQAIVL 321
QVWGSLRM+VTECHAQAIVL
Sbjct: 343 QVWGSLRMEVTECHAQAIVL 303
BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match:
A0A6J1FAY6 (uncharacterized protein LOC111442366 OS=Cucurbita moschata OX=3662 GN=LOC111442366 PE=4 SV=1)
HSP 1 Score: 407.5 bits (1046), Expect = 5.1e-110
Identity = 223/320 (69.69%), Positives = 239/320 (74.69%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLC+LL+LIQK
Sbjct: 44 MRPVVMIDYGGKMPELQQRLCSLLELIQK------------------------------- 103
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
EL IFE+LKVM+IEDMIYLIHVQGL EHV S+LNS LTLLLVDIE+DPPKML
Sbjct: 104 --------ELHIFENLKVMIIEDMIYLIHVQGLGEHVQSSLNSNLTLLLVDIEQDPPKML 163
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
+DA++S LGLQ KSIQKLF SLFS DE ++DPS SLGE +T+ RSS HGI SQSSVIDL
Sbjct: 164 VDADQSPLGLQFKSIQKLFSSLFSLDETKNDPSSSLGENHVTNTRSSFHGIYSQSSVIDL 223
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
+N LEH+EITLPTLNGWLLGYPIVYLF KEHISEATYNLSAKPLHIFRLSVS
Sbjct: 224 TNHLEHSEITLPTLNGWLLGYPIVYLFHKEHISEATYNLSAKPLHIFRLSVS-------- 283
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
+ ASTKGSQLEELLSFTVPYELSM GAKEAWAEAFL SMQQKWERCS
Sbjct: 284 ------------RNDASTKGSQLEELLSFTVPYELSMGGAKEAWAEAFLASMQQKWERCS 304
Query: 301 QVWGSLRMDVTECHAQAIVL 321
VWGSLRMDVTECHAQAIVL
Sbjct: 344 GVWGSLRMDVTECHAQAIVL 304
BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match:
A0A6J1IB39 (uncharacterized protein LOC111471896 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111471896 PE=4 SV=1)
HSP 1 Score: 397.5 bits (1020), Expect = 5.2e-107
Identity = 219/320 (68.44%), Positives = 236/320 (73.75%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQQRLC+LL+LIQK
Sbjct: 44 MRPVVMIDYGGKMPELQQRLCSLLELIQK------------------------------- 103
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
EL IFE+LKVM+IEDMIYLIHVQGL EHV S+LNS LTLLLVDIE+DPPK+L
Sbjct: 104 --------ELHIFENLKVMIIEDMIYLIHVQGLGEHVQSSLNSNLTLLLVDIEQDPPKIL 163
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSVIDL 180
+DA++S LGLQ KSIQKLF SLFS DE ++DPS SLGE +T+ SS HGI SQSSVIDL
Sbjct: 164 VDADQSPLGLQFKSIQKLFSSLFSLDETKNDPSSSLGENRVTNTTSSFHGIYSQSSVIDL 223
Query: 181 SNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNFF 240
+N LEH+EITLPTLNGWLLGYPIVYLF KEHISEATYNLSAKPLHIFRLSVS
Sbjct: 224 TNHLEHSEITLPTLNGWLLGYPIVYLFHKEHISEATYNLSAKPLHIFRLSVS-------- 283
Query: 241 SISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERCS 300
+ ASTKGSQLEELLSFTVP ELSM GAKEAWAEAFL MQQKWERCS
Sbjct: 284 ------------RNDASTKGSQLEELLSFTVPCELSMGGAKEAWAEAFLARMQQKWERCS 304
Query: 301 QVWGSLRMDVTECHAQAIVL 321
VWGSLRMDVTECHAQAIVL
Sbjct: 344 GVWGSLRMDVTECHAQAIVL 304
BLAST of Clc10G15040 vs. TAIR 10
Match:
AT3G59490.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 251.1 bits (640), Expect = 1.2e-66
Identity = 146/321 (45.48%), Positives = 192/321 (59.81%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQ RL +LL+LI++
Sbjct: 43 MRPVVMIDYGGKMPELQNRLLSLLELIRE------------------------------- 102
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
L +F+DLKVMVIEDMIYLI+V+ L + V S+L+S+ L +D+E+DPPKM+
Sbjct: 103 --------GLPVFKDLKVMVIEDMIYLINVRSLPKFVSSSLDSEPELFFIDLEQDPPKMV 162
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSV-ID 180
+++S+LG+QL+SIQKLF S F D+ +D + L E SSQ+S+ ID
Sbjct: 163 TQSKESNLGMQLRSIQKLFSSTFPLDDSNTDTTTVLDE-----------ANSSQTSLCID 222
Query: 181 LSNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNF 240
LS L+ T++T+PTLNGWLL YP+VYLF +HI EA YNLS K L +F++ V
Sbjct: 223 LSCCLQDTKVTIPTLNGWLLDYPVVYLFGTDHIEEAIYNLSTKSLRLFKVLVC------- 282
Query: 241 FSISEIIILESSFKKGASTKGSQLEELLSFTVPYELSMRGAKEAWAEAFLESMQQKWERC 300
+ G + K S LEEL SF+VPY+LSM G+KE WAE FLE M +WE C
Sbjct: 283 -------------RNGTTEKDSHLEELTSFSVPYDLSMEGSKEVWAEKFLERMSSRWEEC 293
Query: 301 SQVWGSLRMDVTECHAQAIVL 321
+W SL + V+EC+ QAIVL
Sbjct: 343 KHIWRSLDLQVSECYPQAIVL 293
BLAST of Clc10G15040 vs. TAIR 10
Match:
AT3G59490.1 (unknown protein; Has 30 Blast hits to 30 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 28; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )
HSP 1 Score: 181.4 bits (459), Expect = 1.1e-45
Identity = 115/269 (42.75%), Positives = 153/269 (56.88%), Query Frame = 0
Query: 1 MRPVVMIDYGGKMPELQQRLCALLKLIQKVHDLLLACMLYWIINYAAQCLYIVNIVWAVH 60
MRPVVMIDYGGKMPELQ RL +LL+LI++
Sbjct: 43 MRPVVMIDYGGKMPELQNRLLSLLELIRE------------------------------- 102
Query: 61 KISFSVLSELRIFEDLKVMVIEDMIYLIHVQGLAEHVHSTLNSKLTLLLVDIERDPPKML 120
L +F+DLKVMVIEDMIYLI+V+ L + V S+L+S+ L +D+E+DPPKM+
Sbjct: 103 --------GLPVFKDLKVMVIEDMIYLINVRSLPKFVSSSLDSEPELFFIDLEQDPPKMV 162
Query: 121 IDAEKSSLGLQLKSIQKLFLSLFSQDEMESDPSPSLGETFITDIRSSIHGISSQSSV-ID 180
+++S+LG+QL+SIQKLF S F D+ +D + L E SSQ+S+ ID
Sbjct: 163 TQSKESNLGMQLRSIQKLFSSTFPLDDSNTDTTTVLDE-----------ANSSQTSLCID 222
Query: 181 LSNFLEHTEITLPTLNGWLLGYPIVYLFDKEHISEATYNLSAKPLHIFRLSVSWYAIPNF 240
LS L+ T++T+PTLNGWLL YP+VYLF +HI EA YNLS K L +F++ V
Sbjct: 223 LSCCLQDTKVTIPTLNGWLLDYPVVYLFGTDHIEEAIYNLSTKSLRLFKVLVC------- 241
Query: 241 FSISEIIILESSFKKGASTKGSQLEELLS 269
+ G + K S LEEL S
Sbjct: 283 -------------RNGTTEKDSHLEELTS 241
The following BLAST results are available for this feature:
BLAST of Clc10G15040 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038904583.1 | 7.5e-124 | 77.50 | uncharacterized protein LOC120090945 isoform X1 [Benincasa hispida]
XP_038904585.1 | 8.2e-123 | 76.88 | uncharacterized protein LOC120090945 isoform X3 [Benincasa hispida]
XP_038904586.1 | 1.4e-122 | 76.56 | uncharacterized protein LOC120090945 isoform X4 [Benincasa hispida]
XP_008443345.1 | 1.3e-120 | 75.00 | PREDICTED: UPF0739 protein C1orf74 homolog isoform X2 [Cucumis melo]
XP_038904584.1 | 1.7e-120 | 76.25 | uncharacterized protein LOC120090945 isoform X2 [Benincasa hispida]
BLAST of Clc10G15040 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A1S3B7V2 | 6.4e-121 | 75.00 | UPF0739 protein C1orf74 homolog isoform X2 OS=Cucumis melo OX=3656 GN=LOC1034869...
A0A5D3DPY3 | 1.9e-120 | 74.92 | UPF0739 protein C1orf74-like protein isoform X2 OS=Cucumis melo var. makuwa OX=1...
A0A0A0LCP2 | 2.4e-120 | 75.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G823610 PE=4 SV=1
A0A6J1FAY6 | 5.1e-110 | 69.69 | uncharacterized protein LOC111442366 OS=Cucurbita moschata OX=3662 GN=LOC1114423...
A0A6J1IB39 | 5.2e-107 | 68.44 | uncharacterized protein LOC111471896 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
BLAST of Clc10G15040 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G59490.2 | 1.2e-66 | 45.48 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G59490.1 | 1.1e-45 | 42.75 | unknown protein; Has 30 Blast hits to 30 proteins in 12 species: Archae - 0; Bac...
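For downstream filtering or ranking, rows of the pipe-delimited summary can be loaded into simple records. This is a minimal sketch, assuming each row carries the match name, E-value, identity percentage, and description in its first four fields; the function name `parse_hit_row` is hypothetical, and any trailing fields (such as a link column) are ignored.

```python
def parse_hit_row(line):
    """Split one pipe-delimited summary row into a typed record."""
    fields = [field.strip() for field in line.split("|")]
    if len(fields) < 4:
        raise ValueError("expected at least four fields: " + line)
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),          # e.g. "7.5e-124" parses directly
        "identity_pct": float(identity),
        "description": description,
    }


# Two rows copied from the summary above (descriptions kept as listed).
rows = [
    "AT3G59490.2 | 1.2e-66 | 45.48 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...",
    "XP_038904583.1 | 7.5e-124 | 77.50 | uncharacterized protein LOC120090945 isoform X1 [Benincasa hispida]",
]

# Rank hits from most to least significant (smallest E-value first).
hits = sorted((parse_hit_row(row) for row in rows), key=lambda hit: hit["evalue"])
for hit in hits:
    print(hit["name"], hit["evalue"], hit["identity_pct"])
```

Sorting on the numeric E-value rather than the raw string avoids misordering values such as `1.2e-66` and `7.5e-124`.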