Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCCAACGCCAATTCCTCCGCTAACAATGGTGCCCGAAATTTCAGTACCCCTCCACTCAATCAACTCCTAAATTAAATCACAACAATTAAGCTTGATCGAGGAAATTTTCTTATGTGGAAAAATTTGGCCCTACCAATCCTTCGGAGCTACTGATTGGAAGGCTACCTCTTTGGGCAAATTGAGTGCCCTCCTATGTTCGTGAATCCACCCGCCAGTGAAATCGGAGCCAAAGTTATGTCTGGAGCATCAAGCTCACAAGTTACTAGCAGCGAGCAACAGTCAGGGTTAAGTCAGACTATAAATCCCAAATATGAGGCTTGGCACTTGTCGTTGATCAACTACTACTTGGGTGGCTTTACAACTCCATGACACTAGAAATAGCCACTCAGGTAATGGGATATGAGAAACCGGAGGACCTTTTGGAAGCAATCCAGAAGTTGTTCGATGTTCAATCGAGAGCCGAAGAGGATTTTCTCAGGCAAACGTTCCAACATACCAGAAAAGGTAATCTTCTATGTTTGAATACCTACGGTTGATGAAAATGAATTTCGATAATCTAGGTCAAATGGGAAGTCATGTTCCCACAAGAGCACTTGTGTCTTAAGTCCTTTTAGGGCTTGTTGAAGATTATAATCCAGTAGTTGCTATACTACAAGGAAAACCTGAAATAACGTGGCTTGAGATGCAAACGAAGCTGTTGACCTATAAAAGACGACTGGACTACCAGAATGTTGTTTGTTCCAGCGGAGCCAGCAAAAATCCTACGGTGAACATGGGCAGTGGGAGAGGCACTAGTGGACAGCGATCCCAAAACATGAACTAAAACAATGGAGGGTGTACTTAGTTCAATGGGTAGCGTGGTGGATTCAATCCAAATGCAAGTAAAGGGGACGAGGAAATGGAAGAGAACATGGTGGAAATTGTCTAATTTGCCAAGTCTGTGGTAAGATGGGTCACATAGCCTTTGTTTGCTATCATAGATATGAAAAGGAATTTGTCCCCAACAACAATAGTAACAGAGGAACGAATGGGGGGAGCAACTCTACCACAACTAATAATGGCAAGAATGCTCCAACAACTATGATGGCTACACAAAATAGTAATCCTTTCATGACTAATACTGATGGTGTGCTCGACTCAAGCTGGTATGTAGATAGTGCTTCCAACCATGTTACAACCGAATACAATAACCTAAGCAATCTGATGGAGTATGGAGGTAATTAAATGGTAACTGTAGGTAATAGAGAACAATTACAAATAGACTCTGTTGGTAGCACTCTTTTGTCAAGTGGGAATTCTTTTCTTAAGCTTAAAAATATATTATATGTGCCTGATACTGCTCAAAATTAATTAGCGTGTCAAAGCTTGCTAAAGATAATCACGTTTACATTGAATTTCATGATAATTGTTGCCTTGTAAAGGACAAGGGTACAAGTCGAGTAATTTCGAAGGGAATTCTTAAGGATGGGTTGTACCAACTGGAAGATATTGCTGCTATCAAGAGTCTTGAAGTTGCTAAAGAGTCAAAGACGGTCATGAACATAAATAATAATAGTTTATTAGCATTGATTTTGTTTAACATTAGAGTTAACAATGCTGTTTCAAAAAATATCTAGCACTGTCGACTAGGTCATCCATCATCTAGAGTTTTTGAATTCATAGTCAAGAATCATGGTTTGCCAGTTAAAGATAATAAAATATCCAAAATTTGCTTATCCTGTCAATTGGGTAAATCTCACTCCCTCCCCTTCCCGAATTCTACTTCTCGAGCATCGAAACCGTTTGAATTGATCCATTCGGATGTTTGCGGCCCAACACCTTTGCTATCAACAGAAGGCTTTCATTATTACCTCTTATTTGTGGATGATTTCAGTAGATTTGTATGGCTCTATCCACTGTGATAGAAAAGTGATGCCCTCATAGCTTTCTAACATTTTTTGAATTTGATACAGAATCAGTTTAATTCTGGAATTAAAACTATACAAACAGACAATGGT
GGAGAGTACATTAAAATTCATCAACTATGCTCTCAATTGGGTATTCAAACCCATATGTCATGCCCTCACACATCAGAACAAAATGGCCAAGCTAAGCGGAAACACAGACGTGTTGTTGAAATAGGCCTTACTCTACCATTGCAATTCTACTGGGATGCTTTTACCACAGCAACACAGTTGCTCAATTGGCGGCCAACTCTAGTCCTTGCAGGTAAGTTTCTAATGGAGGTTCTTTTAAATAAAAAATTAGATGTTCTATAAATAAGGGTCTGGTAGAAGAGGTTGGTGTGTGCAAATCACTTGGGCGAATTGTGATTTTCTTCAACCAAATTATCCTAAATTGAGTATAGTTTCACCGAGAGGGCTGCCGAAAGGAAACTCCATGACGAAGAACTCGAATTTCATCAAAAACTCGAGCTTGCTCCGGCTATCGGGTAAATTGAAGCAGATCGGGAAGATCTGAAGATATCTGGCAATCAAAAAATAATTAGAAGCTGGAATTTTGAGGTAAGCTATCCAAGACATTTATCTACAACTTTGTAAGAGAAAAGCAACTTTAATTGTTGCTAAAAATATTACTTTTGACAGCTAGAAGTAAGGGTAGTGTGATTAAGGCTGTCGGGTGAGTTTTGGAGTGGTTGTGTGTGCGGTGTGGCTTGGGCCGTGGCCGGAAGGAAGAACCATTTGAGGTTATTTTGA
mRNA sequence
ATGGCCAACGCCAATTCCTCCGCTAACAATGGTGCCCGAAATTTCAGCTACCTCTTTGGGCAAATTGAGTGCCCTCCTATGTTCGTGAATCCACCCGCCAGTGAAATCGGAGCCAAAGTTATGTCTGGAGCATCAAGCTCACAAGTTACTAGCAGCGAGCAACAGTCAGGGTTAAGCTTGGCACTTGTCGTTGATCAACTACTACTTGGGTGGCTTTACAACTCCATGACACTAGAAATAGCCACTCAGGTAATGGGATATGAGAAACCGGAGGACCTTTTGGAAGCAATCCAGAAGTTGTTCGATGTTCAATCGAGAGCCGAAGAGGATTTTCTCAGGCAAACGTTCCAACATACCAGAAAAGTCCTTTTAGGGCTTGTTGAAGATTATAATCCAGTAGTTGCTATACTACAAGGAAAACCTGAAATAACGTGGCTTGAGATGCAAACGAAGCTGTTGACCTATAAAAGACGACTGGACTACCAGAATGTTGTTTGTTCCAGCGGAGCCAGCAAAAATCCTACGCGTGGTGGATTCAATCCAAATGCAAGTAAAGGGGACGAGGAAATGGAAGAGAACATGATGGGTCACATAGCCTTTGTTTGCTATCATAGATATGAAAAGGAATTTGTCCCCAACAACAATAGTAACAGAGGAACGAATGGGGGGAGCAACTCTACCACAACTAATAATGGCAAGAATGCTCCAACAACTATGATGGCTACACAAAATAGTAATCCTTTCATGACTAATACTGATGGTGTGCTCGACTCAAGCTGGTATGTAGATAGTGCTTCCAACCATGTTACAACCGAATACAATAACCTAAGCAATCTGATGGAGTATGGAGATAATCACGTTTACATTGAATTTCATGATAATTGTTGCCTTGTAAAGGACAAGGGTACAAGTCGAGTAATTTCGAAGGGAATTCTTAAGGATGGGTTGTACCAACTGGAAGATATTGCTGCTATCAAGAGTCTTGAAGTTGCTAAAGAGTCAAAGACGAATCAGTTTAATTCTGGAATTAAAACTATACAAACAGACAATGGTGGAGAGTACATTAAAATTCATCAACTATGCTCTCAATTGGGTATTCAAACCCATATGTCATGCCCTCACACATCAGAACAAAATGGCCAAGCTAAGCGGAAACACAGACGTGTTGTTGAAATAGGCCTTACTCTACCATTGCAATTCTACTGGGATGCTTTTACCACAGCAACACAGTTGCTCAATTGGCGGCCAACTCTAGTCCTTGCAGGTAAGTTTCTAATGGAGAGGGCTGCCGAAAGGAAACTCCATGACGAAGAACTCGAATTTCATCAAAAACTCGAGCTTGCTCCGGCTATCGGGCTGTCGGGTGAGTTTTGGAGTGGTTGTGTGTGCGGTGTGGCTTGGGCCGTGGCCGGAAGGAAGAACCATTTGAGGTTATTTTGA
Coding sequence (CDS)
ATGGCCAACGCCAATTCCTCCGCTAACAATGGTGCCCGAAATTTCAGCTACCTCTTTGGGCAAATTGAGTGCCCTCCTATGTTCGTGAATCCACCCGCCAGTGAAATCGGAGCCAAAGTTATGTCTGGAGCATCAAGCTCACAAGTTACTAGCAGCGAGCAACAGTCAGGGTTAAGCTTGGCACTTGTCGTTGATCAACTACTACTTGGGTGGCTTTACAACTCCATGACACTAGAAATAGCCACTCAGGTAATGGGATATGAGAAACCGGAGGACCTTTTGGAAGCAATCCAGAAGTTGTTCGATGTTCAATCGAGAGCCGAAGAGGATTTTCTCAGGCAAACGTTCCAACATACCAGAAAAGTCCTTTTAGGGCTTGTTGAAGATTATAATCCAGTAGTTGCTATACTACAAGGAAAACCTGAAATAACGTGGCTTGAGATGCAAACGAAGCTGTTGACCTATAAAAGACGACTGGACTACCAGAATGTTGTTTGTTCCAGCGGAGCCAGCAAAAATCCTACGCGTGGTGGATTCAATCCAAATGCAAGTAAAGGGGACGAGGAAATGGAAGAGAACATGATGGGTCACATAGCCTTTGTTTGCTATCATAGATATGAAAAGGAATTTGTCCCCAACAACAATAGTAACAGAGGAACGAATGGGGGGAGCAACTCTACCACAACTAATAATGGCAAGAATGCTCCAACAACTATGATGGCTACACAAAATAGTAATCCTTTCATGACTAATACTGATGGTGTGCTCGACTCAAGCTGGTATGTAGATAGTGCTTCCAACCATGTTACAACCGAATACAATAACCTAAGCAATCTGATGGAGTATGGAGATAATCACGTTTACATTGAATTTCATGATAATTGTTGCCTTGTAAAGGACAAGGGTACAAGTCGAGTAATTTCGAAGGGAATTCTTAAGGATGGGTTGTACCAACTGGAAGATATTGCTGCTATCAAGAGTCTTGAAGTTGCTAAAGAGTCAAAGACGAATCAGTTTAATTCTGGAATTAAAACTATACAAACAGACAATGGTGGAGAGTACATTAAAATTCATCAACTATGCTCTCAATTGGGTATTCAAACCCATATGTCATGCCCTCACACATCAGAACAAAATGGCCAAGCTAAGCGGAAACACAGACGTGTTGTTGAAATAGGCCTTACTCTACCATTGCAATTCTACTGGGATGCTTTTACCACAGCAACACAGTTGCTCAATTGGCGGCCAACTCTAGTCCTTGCAGGTAAGTTTCTAATGGAGAGGGCTGCCGAAAGGAAACTCCATGACGAAGAACTCGAATTTCATCAAAAACTCGAGCTTGCTCCGGCTATCGGGCTGTCGGGTGAGTTTTGGAGTGGTTGTGTGTGCGGTGTGGCTTGGGCCGTGGCCGGAAGGAAGAACCATTTGAGGTTATTTTGA
Protein sequence
MANANSSANNGARNFSYLFGQIECPPMFVNPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSSGASKNPTRGGFNPNASKGDEEMEENMMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDSASNHVTTEYNNLSNLMEYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDIAAIKSLEVAKESKTNQFNSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTLPLQFYWDAFTTATQLLNWRPTLVLAGKFLMERAAERKLHDEELEFHQKLELAPAIGLSGEFWSGCVCGVAWAVAGRKNHLRLF
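The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (the page does not state which translation table was used). The helper name `translate` is hypothetical, and the example input is the first 48 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear table applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    peptide = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        peptide.append(aa)
    return "".join(peptide)


# First 48 nt of the CDS above; the result matches the first 16 residues
# of the listed protein sequence.
print(translate("ATGGCCAACGCCAATTCCTCCGCTAACAATGGTGCCCGAAATTTCAGC"))
# prints MANANSSANNGARNFS
```

Running the same function over the full CDS is a quick way to confirm that the listed polypeptide and CDS are consistent.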
Homology
BLAST of ClCG08G003520 vs. NCBI nr
Match:
XP_022151683.1 (uncharacterized protein LOC111019598 [Momordica charantia])
HSP 1 Score: 212.6 bits (540), Expect = 7.4e-51
Identity = 168/539 (31.17%), Positives = 246/539 (45.64%), Query Frame = 0
Query: 15 FSYLFGQIECPPMFVNPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYN 74
F YL G CPP + P + + G++SSQ +S +VVD+LLLGWLYN
Sbjct: 61 FDYLTGDKPCPPTHLVPTDTPTN---IEGSTSSQ-SSPTLNPTYEAWIVVDKLLLGWLYN 120
Query: 75 SMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRK------------- 134
SM ++A QVMG+ +L A+Q+LF VQSRAE D+L+Q FQ T K
Sbjct: 121 SMAADVAMQVMGFSTSRELWTAVQELFGVQSRAEVDYLKQVFQQTCKGSLQMIEYLKLMK 180
Query: 135 ---------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLD 194
VL GL E+YNP+V +QGK ++W EM +LLTY++RL+
Sbjct: 181 SHADNLALAGSSVSVRDLVSQVLTGLDEEYNPIVVAVQGKVNLSWSEMHAELLTYEKRLE 240
Query: 195 YQNVVCSS---GASKNPTRGGFNPNASKGDEEMEENMMGHIAFVCYHR---YEKEFVPNN 254
YQN + S ++ P+ + + + ++ H + HR Y++
Sbjct: 241 YQNSLKSGIPINQTQTPSVNYVDGRSFQTNQRTNNGNNSHGSNT--HRGGGYQRGSFGQR 300
Query: 255 NSNRG--TNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTT 314
N RG N T +N+G N + A +++ +T + V+D SWY DS A++HVT
Sbjct: 301 NRGRGPQPTQHKNFTPSNSGPN----VFAAHHTSTTVTTPETVIDPSWYADSGATSHVTA 360
Query: 315 EYNNLSNLMEYGDNHVYIEFHDNCCLVK-----------------------------DKG 374
NN+ ++Y I + N + DK
Sbjct: 361 NPNNVEQKVDYSGTENVIVANGNKLSISHIGSTNIHASGGSLKLKDVLRVPDIAKNLDKA 420
Query: 375 TSRVISKGILKDGLYQLE-------------------DIAAIKSLEVAKESKTNQF---- 434
+ R + KG LKD LY+L+ + ++ + ++ E T F
Sbjct: 421 SGRTLLKGTLKDNLYRLDRSHRSPPATPTLTAPLFAHTVVSLSNNTLSSEKPTPSFPFAE 480
BLAST of ClCG08G003520 vs. NCBI nr
Match:
XP_016902197.1 (PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo])
HSP 1 Score: 178.3 bits (451), Expect = 1.5e-40
Identity = 134/378 (35.45%), Positives = 175/378 (46.30%), Query Frame = 0
Query: 17 YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLG 76
+L + CP FV N +E GA GASSS +T + D LLLG
Sbjct: 59 HLTAETPCPSHFVLSASSSNTTVTEEGADATIGASSS-ITPRIVNPLFEQWVTTDLLLLG 118
Query: 77 WLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRKVLLGLVEDY 136
WLYNSMT ++A Q+MG+ EDL +A Q F VQSRAEEDFLRQ Q TRK GL E Y
Sbjct: 119 WLYNSMTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQTTRK---GLDEVY 178
Query: 137 NPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA------------- 196
N V+ ++QGKP+I+WL+MQ+KLL +++RL +QN + S A
Sbjct: 179 NLVIVVIQGKPDISWLDMQSKLLIFEKRLKHQNTQKKNTGNITQSPALNMAQRFALNGQR 238
Query: 197 -SKNPTRGGFNPNASKGDEEMEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTN 256
N G+N G N GH A VCY+R+ KEF NR +
Sbjct: 239 NQSNKKFYGYNRQHFSGQRGNLNNGPTCQLCGKYGHSALVCYNRFNKEFSSPLVQNRNEH 298
Query: 257 GGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM 306
+ S + N P ++TQN+ PF T D V+D +WY+DS A+NHVT E +N++N
Sbjct: 299 SSNGSVSPN-----PAVFVSTQNATPFAT-PDTVVDPNWYIDSGATNHVTRECSNMTNPT 358
BLAST of ClCG08G003520 vs. NCBI nr
Match:
XP_016902203.1 (PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo])
HSP 1 Score: 174.1 bits (440), Expect = 2.9e-39
Identity = 130/342 (38.01%), Positives = 173/342 (50.58%), Query Frame = 0
Query: 17 YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLG 76
+L + CP FV N +E GA GASSS +T + D LLLG
Sbjct: 59 HLTAETPCPSHFVLSASSSNTTVTEEGADATIGASSS-ITPRIVNPLFEQWVTTDLLLLG 118
Query: 77 WLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRKVLLGLVEDY 136
WLYNSMT ++A Q+MG+ EDL +A Q F VQSRAEEDFLRQ Q TRK GL E Y
Sbjct: 119 WLYNSMTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQTTRK---GLDEVY 178
Query: 137 NPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA------------- 196
N V+ ++QGKP+I+WL+MQ+KLL +++RL +QN + S A
Sbjct: 179 NLVIVVIQGKPDISWLDMQSKLLIFEKRLKHQNTQKKNTGNITQSPALNMAQRFALNGQR 238
Query: 197 -SKNPTRGGFNPNASKGDEEMEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTN 256
N G+N G N GH A VCY+R+ KEF NR +
Sbjct: 239 NQSNKKFYGYNRQHFSGQRGNLNNGPTCQLCGKYGHSALVCYNRFNKEFSSPLVQNRNEH 298
Query: 257 GGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM 316
+ S + N P ++TQN+ PF T D V+D +WY+DS A+NHVT E +N++N
Sbjct: 299 SSNGSVSPN-----PAVFVSTQNATPFAT-PDTVVDPNWYIDSGATNHVTRECSNMTNPT 358
Query: 317 EYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDI 323
EY +Y E + +G L+DG YQLE +
Sbjct: 359 EY-SGQIYGE---------------TLLRGTLRDGFYQLERV 374
BLAST of ClCG08G003520 vs. NCBI nr
Match:
XP_038905161.1 (uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida])
HSP 1 Score: 169.9 bits (429), Expect = 5.5e-38
Identity = 120/331 (36.25%), Positives = 168/331 (50.76%), Query Frame = 0
Query: 42 SGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLF 101
SGASSS +T+ E + VDQLLLGWLYNSMT E+A QVMG E +DL +I +LF
Sbjct: 31 SGASSS-LTALEVNPQYESWMAVDQLLLGWLYNSMTPEVAIQVMGCECAKDLWTSIPQLF 90
Query: 102 DVQSRAEEDFLRQTFQHTRK----------------------------------VLLGLV 161
VQSR EED+LR FQ TRK VLLGL
Sbjct: 91 GVQSRVEEDYLRHVFQTTRKGNLKMEEYLQTMKMNTDNLEQAGSPMPPRTLVSQVLLGLD 150
Query: 162 EDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQ---------NVVCSSGASKNPTR-- 221
E+YN +VA++QG+ +++WL+MQ++LL Y+RRL++Q N + ++ + TR
Sbjct: 151 EEYNAIVAMIQGRVDMSWLDMQSELLLYERRLEHQSNQKTTVGFNQISNASVNMTNTRHV 210
Query: 222 --------------------GGFNPNASKGDEEMEE-----NMMGHIAFVCYHRYEKEFV 281
GG +G + +GHIAF C++RY ++FV
Sbjct: 211 NQNNKTNSSNQSIGGGQRGGGGHGRGRGRGRNNKKPVCQVCGKVGHIAFYCFNRYSRDFV 270
Query: 282 PNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDSASNHVTT 301
PN+ N+ +N T N + PT + SNPF+T + + D++WY ASNHVT+
Sbjct: 271 PNSPQNKVEPFPNNQ--TKNTQPHPTALAIAYGSNPFLTRQENMTDANWYDSGASNHVTS 330
BLAST of ClCG08G003520 vs. NCBI nr
Match:
XP_038905164.1 (uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida])
HSP 1 Score: 168.7 bits (426), Expect = 1.2e-37
Identity = 115/311 (36.98%), Positives = 162/311 (52.09%), Query Frame = 0
Query: 42 SGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLF 101
SGASSS +T+ E + VDQLLLGWLYNSMT E+A QVMG E +DL +I +LF
Sbjct: 31 SGASSS-LTALEVNPQYESWMAVDQLLLGWLYNSMTPEVAIQVMGCECAKDLWTSIPQLF 90
Query: 102 DVQSRAEEDFLRQTFQHTRK----------------------------------VLLGLV 161
VQSR EED+LR FQ TRK VLLGL
Sbjct: 91 GVQSRVEEDYLRHVFQTTRKGNLKMEEYLQTMKMNTDNLEQAGSPMPPRTLVSQVLLGLD 150
Query: 162 EDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQ---------NVVCSSGASKNPTR-- 221
E+YN +VA++QG+ +++WL+MQ++LL Y+RRL++Q N + ++ + TR
Sbjct: 151 EEYNAIVAMIQGRVDMSWLDMQSELLLYERRLEHQSNQKTTVGFNQISNASVNMTNTRHV 210
Query: 222 --------------------GGFNPNASKGDEEMEE-----NMMGHIAFVCYHRYEKEFV 281
GG +G + +GHIAF C++RY ++FV
Sbjct: 211 NQNNKTNSSNQSIGGGQRGGGGHGRGRGRGRNNKKPVCQVCGKVGHIAFYCFNRYSRDFV 270
Query: 282 PNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDSASNHVTT 283
PN+ N+ +N T N + PT + SNPF+T + + D++WY ASNHVT+
Sbjct: 271 PNSPQNKVEPFPNNQ--TKNTQPHPTALAIAYGSNPFLTRQENMTDANWYDSGASNHVTS 330
BLAST of ClCG08G003520 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 76.3 bits (186), Expect = 1.1e-12
Identity = 44/112 (39.29%), Positives = 65/112 (58.04%), Query Frame = 0
Query: 316 LYQLEDIAAIK-SLEVAKESKTNQFNSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPH 375
LY L+ + +K + + K N+F + I T+ +DNGGE++ + SQ GI S PH
Sbjct: 537 LYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPH 596
Query: 376 TSEQNGQAKRKHRRVVEIGLTL------PLQFYWDAFTTATQLLNWRPTLVL 421
T E NG ++RKHR +VE+GLTL P ++ AF+ A L+N PT +L
Sbjct: 597 TPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLL 648
BLAST of ClCG08G003520 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 73.6 bits (179), Expect = 7.0e-12
Identity = 44/112 (39.29%), Positives = 62/112 (55.36%), Query Frame = 0
Query: 316 LYQLEDIAAIKSLEVA-KESKTNQFNSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPH 375
LY L+ + +K + K N+F + I T +DNGGE++ + + SQ GI S PH
Sbjct: 558 LYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFSQHGISHLTSPPH 617
Query: 376 TSEQNGQAKRKHRRVVEIGLTL------PLQFYWDAFTTATQLLNWRPTLVL 421
T E NG ++RKHR +VE GLTL P ++ AF A L+N PT +L
Sbjct: 618 TPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLL 669
BLAST of ClCG08G003520 vs. ExPASy TrEMBL
Match:
A0A6J1DCW4 (uncharacterized protein LOC111019598 OS=Momordica charantia OX=3673 GN=LOC111019598 PE=4 SV=1)
HSP 1 Score: 212.6 bits (540), Expect = 3.6e-51
Identity = 168/539 (31.17%), Positives = 246/539 (45.64%), Query Frame = 0
Query: 15 FSYLFGQIECPPMFVNPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYN 74
F YL G CPP + P + + G++SSQ +S +VVD+LLLGWLYN
Sbjct: 61 FDYLTGDKPCPPTHLVPTDTPTN---IEGSTSSQ-SSPTLNPTYEAWIVVDKLLLGWLYN 120
Query: 75 SMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRK------------- 134
SM ++A QVMG+ +L A+Q+LF VQSRAE D+L+Q FQ T K
Sbjct: 121 SMAADVAMQVMGFSTSRELWTAVQELFGVQSRAEVDYLKQVFQQTCKGSLQMIEYLKLMK 180
Query: 135 ---------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLD 194
VL GL E+YNP+V +QGK ++W EM +LLTY++RL+
Sbjct: 181 SHADNLALAGSSVSVRDLVSQVLTGLDEEYNPIVVAVQGKVNLSWSEMHAELLTYEKRLE 240
Query: 195 YQNVVCSS---GASKNPTRGGFNPNASKGDEEMEENMMGHIAFVCYHR---YEKEFVPNN 254
YQN + S ++ P+ + + + ++ H + HR Y++
Sbjct: 241 YQNSLKSGIPINQTQTPSVNYVDGRSFQTNQRTNNGNNSHGSNT--HRGGGYQRGSFGQR 300
Query: 255 NSNRG--TNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTT 314
N RG N T +N+G N + A +++ +T + V+D SWY DS A++HVT
Sbjct: 301 NRGRGPQPTQHKNFTPSNSGPN----VFAAHHTSTTVTTPETVIDPSWYADSGATSHVTA 360
Query: 315 EYNNLSNLMEYGDNHVYIEFHDNCCLVK-----------------------------DKG 374
NN+ ++Y I + N + DK
Sbjct: 361 NPNNVEQKVDYSGTENVIVANGNKLSISHIGSTNIHASGGSLKLKDVLRVPDIAKNLDKA 420
Query: 375 TSRVISKGILKDGLYQLE-------------------DIAAIKSLEVAKESKTNQF---- 434
+ R + KG LKD LY+L+ + ++ + ++ E T F
Sbjct: 421 SGRTLLKGTLKDNLYRLDRSHRSPPATPTLTAPLFAHTVVSLSNNTLSSEKPTPSFPFAE 480
BLAST of ClCG08G003520 vs. ExPASy TrEMBL
Match:
A0A1S4E1U6 (uncharacterized protein LOC107991581 isoform X1 OS=Cucumis melo OX=3656 GN=LOC107991581 PE=4 SV=1)
HSP 1 Score: 178.3 bits (451), Expect = 7.5e-41
Identity = 134/378 (35.45%), Positives = 175/378 (46.30%), Query Frame = 0
Query: 17 YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLG 76
+L + CP FV N +E GA GASSS +T + D LLLG
Sbjct: 59 HLTAETPCPSHFVLSASSSNTTVTEEGADATIGASSS-ITPRIVNPLFEQWVTTDLLLLG 118
Query: 77 WLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRKVLLGLVEDY 136
WLYNSMT ++A Q+MG+ EDL +A Q F VQSRAEEDFLRQ Q TRK GL E Y
Sbjct: 119 WLYNSMTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQTTRK---GLDEVY 178
Query: 137 NPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA------------- 196
N V+ ++QGKP+I+WL+MQ+KLL +++RL +QN + S A
Sbjct: 179 NLVIVVIQGKPDISWLDMQSKLLIFEKRLKHQNTQKKNTGNITQSPALNMAQRFALNGQR 238
Query: 197 -SKNPTRGGFNPNASKGDEEMEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTN 256
N G+N G N GH A VCY+R+ KEF NR +
Sbjct: 239 NQSNKKFYGYNRQHFSGQRGNLNNGPTCQLCGKYGHSALVCYNRFNKEFSSPLVQNRNEH 298
Query: 257 GGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM 306
+ S + N P ++TQN+ PF T D V+D +WY+DS A+NHVT E +N++N
Sbjct: 299 SSNGSVSPN-----PAVFVSTQNATPFAT-PDTVVDPNWYIDSGATNHVTRECSNMTNPT 358
BLAST of ClCG08G003520 vs. ExPASy TrEMBL
Match:
A0A1S4E1V2 (uncharacterized protein LOC107991581 isoform X3 OS=Cucumis melo OX=3656 GN=LOC107991581 PE=4 SV=1)
HSP 1 Score: 174.1 bits (440), Expect = 1.4e-39
Identity = 130/342 (38.01%), Positives = 173/342 (50.58%), Query Frame = 0
Query: 17 YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLG 76
+L + CP FV N +E GA GASSS +T + D LLLG
Sbjct: 59 HLTAETPCPSHFVLSASSSNTTVTEEGADATIGASSS-ITPRIVNPLFEQWVTTDLLLLG 118
Query: 77 WLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRKVLLGLVEDY 136
WLYNSMT ++A Q+MG+ EDL +A Q F VQSRAEEDFLRQ Q TRK GL E Y
Sbjct: 119 WLYNSMTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQTTRK---GLDEVY 178
Query: 137 NPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA------------- 196
N V+ ++QGKP+I+WL+MQ+KLL +++RL +QN + S A
Sbjct: 179 NLVIVVIQGKPDISWLDMQSKLLIFEKRLKHQNTQKKNTGNITQSPALNMAQRFALNGQR 238
Query: 197 -SKNPTRGGFNPNASKGDEEMEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTN 256
N G+N G N GH A VCY+R+ KEF NR +
Sbjct: 239 NQSNKKFYGYNRQHFSGQRGNLNNGPTCQLCGKYGHSALVCYNRFNKEFSSPLVQNRNEH 298
Query: 257 GGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM 316
+ S + N P ++TQN+ PF T D V+D +WY+DS A+NHVT E +N++N
Sbjct: 299 SSNGSVSPN-----PAVFVSTQNATPFAT-PDTVVDPNWYIDSGATNHVTRECSNMTNPT 358
Query: 317 EYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDI 323
EY +Y E + +G L+DG YQLE +
Sbjct: 359 EY-SGQIYGE---------------TLLRGTLRDGFYQLERV 374
BLAST of ClCG08G003520 vs. ExPASy TrEMBL
Match:
A0A5A7SIT7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold19G00360 PE=4 SV=1)
HSP 1 Score: 166.8 bits (421), Expect = 2.2e-37
Identity = 132/378 (34.92%), Positives = 176/378 (46.56%), Query Frame = 0
Query: 17 YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLG 76
+L G+ CP FV N +E GA GASSS +T S + D LLLG
Sbjct: 59 HLTGETPCPSHFVLSASSSNTTVTEEGADATIGASSS-ITPRIVNSLFEQWVTTDLLLLG 118
Query: 77 WLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRK--------- 136
WLYNSMT ++A Q+MG+ EDL +A Q F VQSRAEEDFLRQ Q TRK
Sbjct: 119 WLYNSMTPDVAIQLMGFTNVEDLWDATQDFFGVQSRAEEDFLRQMLQTTRKGNTKMEEYL 178
Query: 137 -------------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYK 196
VLLGL E YN V+ ++QGKP+I+WL+MQ+KLL ++
Sbjct: 179 LVMKTNVDNLGQVGSPVPRRALISQVLLGLDEVYNLVIVVIQGKPDISWLDMQSKLLIFE 238
Query: 197 RRLDYQNVVCSSGASKNPTRG-----------------------GFNPNASKGDEEMEEN 256
+ L +QN N T+ G+N G N
Sbjct: 239 KILKHQNTQKKKKKKGNITQSPALNMAQRFALNGQRNHSNKKFYGYNRQHFSGQRGNLNN 298
Query: 257 --------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNS 316
GH A VCY+R+ KEF +R + + S + N P ++TQN+
Sbjct: 299 GPTCQLCGKYGHSALVCYNRFNKEFSSPLVQDRNEHSSNGSVSPN-----PAVFVSTQNA 358
Query: 317 NPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLMEYGDNHVYIEFHDNCCLVKDKGTS 323
PF T D V+D +WY+DS A+NHVT E +N++N EY +Y E
Sbjct: 359 TPFAT-PDTVVDPNWYIDSGATNHVTRECSNMTNPTEY-SGQIYGE-------------- 413
BLAST of ClCG08G003520 vs. ExPASy TrEMBL
Match:
A0A5D3CPY2 (Retrotransposon protein, putative, Ty1-copia subclass OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold90G00160 PE=4 SV=1)
HSP 1 Score: 166.0 bits (419), Expect = 3.8e-37
Identity = 112/288 (38.89%), Positives = 155/288 (53.82%), Query Frame = 0
Query: 65 DQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRKVLL 124
D LLLGW+YNSMT E+A Q+MG+ +DL EAIQ LF VQSR EEDFLR FQ TRK
Sbjct: 13 DLLLLGWIYNSMTAEVAFQLMGFNIAKDLWEAIQDLFGVQSRVEEDFLRHGFQTTRKG-N 72
Query: 125 GLVEDY-----NPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSSGASKNPTRGGF 184
+EDY V + Q KP+I+WL+MQ++LL +++RL++Q
Sbjct: 73 SKMEDYLRIMKTNVENLGQEKPDISWLDMQSELLIFEKRLEHQ----------------- 132
Query: 185 NPNASKGDEEMEENMMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTM 244
NSN+ + G ++ T +N T
Sbjct: 133 -----------------------------------NSNKKSKG--HTFTPSNSNQNLTAF 192
Query: 245 MATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLMEYG-----------DNHV 304
+ T NSN F+T + V+DS+WYVD+ A+NHVT +Y+NLSN ++Y DN+V
Sbjct: 193 VTTYNSNSFVT-PETVIDSNWYVDNGATNHVTADYSNLSNPLKYSGIEHVIVGNAQDNNV 244
Query: 305 YIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDIAAIKSLEVAKESK 336
Y+EFH + C V +K T R I +G+LKDGLY LE +A + L+ + K
Sbjct: 253 YLEFHGDYCFVNNKDTGRTIMRGVLKDGLYHLESVAVLADLKKSGSRK 244
The following BLAST results are available for this feature:
ClCG08G003520 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022151683.1 | 7.4e-51 | 31.17 | uncharacterized protein LOC111019598 [Momordica charantia]
XP_016902197.1 | 1.5e-40 | 35.45 | PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]
XP_016902203.1 | 2.9e-39 | 38.01 | PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo]
XP_038905161.1 | 5.5e-38 | 36.25 | uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]
XP_038905164.1 | 1.2e-37 | 36.98 | uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]
ClCG08G003520 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9ZT94 | 1.1e-12 | 39.29 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 7.0e-12 | 39.29 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
ClCG08G003520 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DCW4 | 3.6e-51 | 31.17 | uncharacterized protein LOC111019598 OS=Momordica charantia OX=3673 GN=LOC111019...
A0A1S4E1U6 | 7.5e-41 | 35.45 | uncharacterized protein LOC107991581 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S4E1V2 | 1.4e-39 | 38.01 | uncharacterized protein LOC107991581 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7SIT7 | 2.2e-37 | 34.92 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3CPY2 | 3.8e-37 | 38.89 | Retrotransposon protein, putative, Ty1-copia subclass OS=Cucumis melo var. makuw...