ClCG05G022220 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG05G022220
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Unknown protein
Location: CG_Chr05: 34210855 .. 34215115 (+)
RNA-Seq Expression: ClCG05G022220
Synteny: ClCG05G022220
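
The location field in the overview can be split into chromosome, start, end, and strand, and the span length follows from the two coordinates. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF3), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a string such as 'CG_Chr05: 34210855 .. 34215115 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end, strand = parse_location("CG_Chr05: 34210855 .. 34215115 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4261 bp on the plus strand of CG_Chr05.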
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGAGTCCCTTTCCCTCTCTCTTTTTCATCTTTTTTTCTCTCCACAGAGTTCTATAAATCTCCATTAACTTCGCTCTGAAAATCCTTGATCATAGTATAACCGGCCTTCTCTGTTCTTCACTCTTTGAGTTTTGAGGCCTTAATGCCTCTAGATTTTGGCAAATGAGGTAGTTTTATGTATTATGCGGTCTCTGTACCGGGATTTTCTTTGCCGATGAAGGGTTTTCCTGCTTCGTTTTTGCTTCTATTTTTGGTGGGTTTTACCACTTTCAGCTGGGTTTTGGCTTTGCCTCAGGATGTCATGCCGAAAGATTCTGGCAAATTCTTTCTAGGTTTGGACATCCTTCTCTATTGTTTGTAAATTCTTTCGGGATTTGATCGTATATTCTCATTTGCTTGTGAGTTTTCTTATGAATTTGATTGATTGATTCTGTTTCTTATGTTGGGCCTGTTTTGTTGTTGAAAATGATGGTGGGGAAATTATCGCAATTATAAGTTACGTAATTTGTAATTTCTGTTTGATTTGGTCTGAGGAAATGGGTGCTCTCGTCTTACTTTAAAGGAGGTTAAAAGTGTTTTGTGGCATATTCAGTCGCTTTGTTTCTTAATTCCATGAATCGACGAGTGGAATATTTTGGGCATCCTGAAATTTCCTTCTTCTCTTGTGTATTCTAAGTGGCCACATTGTGCAGAATATTTGTTCCCTGTTTGCTGGATTTTCTTTTAGTGGTCGGTTTCTTCTGGAAAATGCAAAATGATTCTTGGTTGGAAAAGAAAAGTTTCTCTCTCTGTTTCCTATGGAGCTTATAGAGCTTTGCTTCGTTAAGTAGTTTTATGTTCTTTATGGTCGCTTGTTTGTTAAGGAAGCCTTTTATGTGCAGGACAACAAAACTTGGGTCCTTGGAAGAATGAGATATTGGAGACTGCTGAGGGCCCAGGATCTGCAAATAACAACTCTCAAAACCCTCTTGTATTGGCAGCAAATAGAACGAAGCGCCCAGATATTCTTCATGGATTTAGGGTGTATGAAGGTGGCTGGGACATTGCCAATCCTAATTACTGGGCTGTAAGTGCTGTTACCATAATTTTATTTTACTTTTTGTGTTTGGTATATTTTGCCTTGAATTCTGCTAGTCAATACAATTTCTTGTAATTAGATTGTGCATTTGAGGTGGTTAGAAAATCTTTGGATGCTTATTGTAGAATTTTAGTCATACACTGTCCAATTTACAAGACTATTGAGTAACTCTTTAAGAAATGAGAAAAGATCTAACGACTATCATTCTACTTGGAGTATCCATTTCTCATTTGAGATTTTGTTATTTAAGTATTTAATTAAAGATGTTCTCTACTTGATAGATTGATTTATAGTTAACCACTTGAATTAAGGAAGTTCTTTTACTGTTGTGGCAATCTGCAAATTTATTGTTATAGACAAATGCCTGAAGTCAGGGGTAGGCTTCAAGTCTTACACCTGCAGCCTGAGAATATCTGAGAATTAAATAGGGACCAATTTCTCAGGAGCTTCATATAGCTTTAGTATGTTTTATTGTATGAAGCCCTTGAAAAACCTTCACATTATGTCTGCAGGGATCTCACATTTTCGTTTATGATCTTTGCATTTGTTGATCCTTGGCATTTAGTATCCTTTTATTACATTTGTAATGATTATTCTTATTTGAACTTTTCAACCAGTACTCATTTCTGTCATGTTAATATGTGTCTGCAGTCTGTTGGGTTTACTGGTGCAACTGGTTTCATTCTTTCTATCTTCTGGTTCATTTCCTTTGGCATTGCTCTTCTTATTCATCGTTGCTGTGGATGGAAATTAAACCTTAAAGGCGAAGAATCAAAGACTTCACAGTGGATTTGCCTAGCATTACTTGTTGTTTTCACATCTGCTGCAGCGTAATACTCTCCACCCTTGCCTTGATGATGTACTTAGATTTTGAAACCAAGCTTTCTTTCATTTAAAAGTTTATTAGAAATCTATTTTAT
TCATCGTCATTTTTCGGGGTTTTGTTGCAGCCATTATAGACCCAAATTTTACTTAGAATACATGAAGAAACAGCAACATCTTTTACTTAAATCAAGTTTCACAATTCTATTTAGGACATCTTTAAGAGCTGTTTTCGAAACAGTCTTGATTTCATGTTGACTTTTTATTTAGAACGAACAATTTTATTCTACTTTCGAAATCACAGATTTGTGAATTCAAACTTCTATGACCATCCTTAAAAATAGTTTTGAAAACATTTGCCAAACAAATTCTTTTATATCCCCTCCTCCCGAGGAAAAAATGGGTTTAGAAACCTAAGTAATAGGTGGTCCTCTGTTTTCACTGACCAACGGTTTTGTCGTTTTAATGTTGTTAATCATATTATTTTGATATCAATTTATGTTTAAATTGTCTTAACTTCTTTCTGACACAAGCTTATCTTTCTATTTCACAGCATTGGCTGCATACTTCTATGTATTGGACAGAATAATTTTTACAATGAAGGTTTGCATACTTTGAAGTATGTTGTAAACCAGTCGGACTACACTGTACACACGCTTAAAAATGTTACAGAATATCTCTCACTTGCAAAGACCATTAGTGTGGCCCAGGTGTTCCTTCCATCTGACGTAATGAACGACATTGATGAGTTAAATGTGGATTTAAATACTGCTGCAGATACAGTGGCAGACAAGACAAGCGTAAATTCTCGTAAAATAAGAAAAGTTTTCACTGCTATGTATGTATCTTCAACTTGTTCCTTTCAATCAAGTTTATGTTCTTTTATGATGTCGAAAAGCTAACATAAATCTTGTTAGTGCAGGCGTTCGGCATTAATTACAGTTGCTGCACTCATGCTTCTTCTGGCTCTCATTGGTCTTTGTATGTAATTCCTAAATTCTTAAATAATGGTGTAGTTTTCGTGTTTACTGGACTTGTTCATACCTTTTTACTCACTGGCTGGAATTGTCGTTGCAGTCCTGTCCTTCTTTGGATATCAACATGCAATCTATGTGTAAGTTGTCTGTCGAAATATTTATTTTACTTTTCGGATTCATCTATACATTTGTAGAATAACAGTATGTCATCATATGTTGCAGATTAATAATCAGTGGTTGGCTACTTGTGACATTTACATTTGTTCTTTGTGGATTGTTTGTAATTCTTGACAAGTAAGTTGGTTCGTTTACAGTTTGGATATATCTGTTACGTTGTCACATTAACTAATAATTGTTGTATTGGTTTATGAAAGGAAACTGCTAAATATACCGTAATCACTTCTAATTTAATAGACTGAGTTCATAACCTGTTTTTCTCTTTCAGTGCTGTTTCTGATACATGTATGGCAATGGAAGAATGGGTGGATAATCCACATGCAGAAACAGCTCTTAGCAACATCCTTCCATGTGTTGACCATAAAACCACAAACCAGACACTGATCCAAAGCAAAAAGATCGTTAACGACATCGTGAGTGTCGTCGATCAATTTGTCTACAACTTCGCCAATGCAAATCCACCCCCGGGTTCTCCCAACTACTGCAACCAGTCAGGACCTTCCATGCCAGCTCTCTGTTACCCATACAACTCTCAGCTGGAAGAAAGTAGATGTGGCGACAACGACGTGACTATTGAAAATGCATCAACTGTAAGATATATAAATTTTAACTTAGGAATGTCTAGTCTATACACCGAATATCAGGGCAATTCATATCAGTGATTATGGTGGCTATCTCCACTGCCAATCATCTTCCTGTGTTTATTTCAATAATGACCGATAATTTGGTTATAAATGCAGGTATGGCAGAAGTTTGTGTGTCAAGTATCGGAATCTGGGCTGTGCATCACAGTCGGAAGGGTCTCGCCAAACATCCACTCCCAGATGGTGGCTGCAGTGAATGAGAGTTATGCACTTCAACATTACACTCCCCCTTTGCTCAGCTTCCAGAATTGCAATTTTGTAAGGGAAACATTTCACAACATTACCACAGCTTACTGCCCT
CATCTGCATCGGCATCTTAAGATTGTGAATGTTGGACTTGCAATGATTTCAGTAGGAATATTGCTGTGTCTGTTGCTATGGATACTATATGCAAACCACCCCCAAAGGGAGGATGTGTCTGCAAAGCTATCGTTTTCGATAAACCGGAGGAGGAATGGTAACCAAAATACGAATAATAATAGCAGCGGAAACGACGAATCCACGACATCAAGCATCAGAAGCATCAGAAGTGGAGTTTAGAGAAGTAGAGAGAAAAAAGAA

mRNA sequence

ATGTGAGTCCCTTTCCCTCTCTCTTTTTCATCTTTTTTTCTCTCCACAGAGTTCTATAAATCTCCATTAACTTCGCTCTGAAAATCCTTGATCATAGTATAACCGGCCTTCTCTGTTCTTCACTCTTTGAGTTTTGAGGCCTTAATGCCTCTAGATTTTGGCAAATGAGGTAGTTTTATGTATTATGCGGTCTCTGTACCGGGATTTTCTTTGCCGATGAAGGGTTTTCCTGCTTCGTTTTTGCTTCTATTTTTGGTGGGTTTTACCACTTTCAGCTGGGTTTTGGCTTTGCCTCAGGATGTCATGCCGAAAGATTCTGGCAAATTCTTTCTAGGACAACAAAACTTGGGTCCTTGGAAGAATGAGATATTGGAGACTGCTGAGGGCCCAGGATCTGCAAATAACAACTCTCAAAACCCTCTTGTATTGGCAGCAAATAGAACGAAGCGCCCAGATATTCTTCATGGATTTAGGGTGTATGAAGGTGGCTGGGACATTGCCAATCCTAATTACTGGGCTTCTGTTGGGTTTACTGGTGCAACTGGTTTCATTCTTTCTATCTTCTGGTTCATTTCCTTTGGCATTGCTCTTCTTATTCATCGTTGCTGTGGATGGAAATTAAACCTTAAAGGCGAAGAATCAAAGACTTCACAGTGGATTTGCCTAGCATTACTTGTTGTTTTCACATCTGCTGCAGCCATTGGCTGCATACTTCTATGTATTGGACAGAATAATTTTTACAATGAAGGTTTGCATACTTTGAAGTATGTTGTAAACCAGTCGGACTACACTGTACACACGCTTAAAAATGTTACAGAATATCTCTCACTTGCAAAGACCATTAGTGTGGCCCAGGTGTTCCTTCCATCTGACGTAATGAACGACATTGATGAGTTAAATGTGGATTTAAATACTGCTGCAGATACAGTGGCAGACAAGACAAGCGTAAATTCTCGTAAAATAAGAAAAGTTTTCACTGCTATGCGTTCGGCATTAATTACAGTTGCTGCACTCATGCTTCTTCTGGCTCTCATTGGTCTTTGTATTTTTCGTGTTTACTGGACTTGTTCATACCTTTTTACTCACTGGCTGGAATTGTCGTTGCAGTCCTGTCCTTCTTTGGATATCAACATGCAATCTATATTAATAATCAGTGGTTGGCTACTTGTGACATTTACATTTGTTCTTTGTGGATTGTTTGTAATTCTTGACAATGCTGTTTCTGATACATGTATGGCAATGGAAGAATGGGTGGATAATCCACATGCAGAAACAGCTCTTAGCAACATCCTTCCATGTGTTGACCATAAAACCACAAACCAGACACTGATCCAAAGCAAAAAGATCGTTAACGACATCGTGAGTGTCGTCGATCAATTTGTCTACAACTTCGCCAATGCAAATCCACCCCCGGGTTCTCCCAACTACTGCAACCAGTCAGGACCTTCCATGCCAGCTCTCTGTTACCCATACAACTCTCAGCTGGAAGAAAGTAGATGTGGCGACAACGACGTGACTATTGAAAATGCATCAACTGTATGGCAGAAGTTTGTGTGTCAAGTATCGGAATCTGGGCTGTGCATCACAGTCGGAAGGGTCTCGCCAAACATCCACTCCCAGATGGTGGCTGCAGTGAATGAGAGTTATGCACTTCAACATTACACTCCCCCTTTGCTCAGCTTCCAGAATTGCAATTTTGTAAGGGAAACATTTCACAACATTACCACAGCTTACTGCCCTCATCTGCATCGGCATCTTAAGATTGTGAATGTTGGACTTGCAATGATTTCAGTAGGAATATTGCTGTGTCTGTTGCTATGGATACTATATGCAAACCACCCCCAAAGGGAGGATGTGTCTGCAAAGCTATCGTTTTCGATAAACCGGAGGAGGAATGGTAACCAAAATACGAATAATAATAGCAGCGGAAACGACGAATCCACGACATCAAGCATCAGAAGCATCAGAAGTGGAGTTTAGAGAAGTAGAGAGAAAAAAGA
A

Coding sequence (CDS)

ATGTATTATGCGGTCTCTGTACCGGGATTTTCTTTGCCGATGAAGGGTTTTCCTGCTTCGTTTTTGCTTCTATTTTTGGTGGGTTTTACCACTTTCAGCTGGGTTTTGGCTTTGCCTCAGGATGTCATGCCGAAAGATTCTGGCAAATTCTTTCTAGGACAACAAAACTTGGGTCCTTGGAAGAATGAGATATTGGAGACTGCTGAGGGCCCAGGATCTGCAAATAACAACTCTCAAAACCCTCTTGTATTGGCAGCAAATAGAACGAAGCGCCCAGATATTCTTCATGGATTTAGGGTGTATGAAGGTGGCTGGGACATTGCCAATCCTAATTACTGGGCTTCTGTTGGGTTTACTGGTGCAACTGGTTTCATTCTTTCTATCTTCTGGTTCATTTCCTTTGGCATTGCTCTTCTTATTCATCGTTGCTGTGGATGGAAATTAAACCTTAAAGGCGAAGAATCAAAGACTTCACAGTGGATTTGCCTAGCATTACTTGTTGTTTTCACATCTGCTGCAGCCATTGGCTGCATACTTCTATGTATTGGACAGAATAATTTTTACAATGAAGGTTTGCATACTTTGAAGTATGTTGTAAACCAGTCGGACTACACTGTACACACGCTTAAAAATGTTACAGAATATCTCTCACTTGCAAAGACCATTAGTGTGGCCCAGGTGTTCCTTCCATCTGACGTAATGAACGACATTGATGAGTTAAATGTGGATTTAAATACTGCTGCAGATACAGTGGCAGACAAGACAAGCGTAAATTCTCGTAAAATAAGAAAAGTTTTCACTGCTATGCGTTCGGCATTAATTACAGTTGCTGCACTCATGCTTCTTCTGGCTCTCATTGGTCTTTGTATTTTTCGTGTTTACTGGACTTGTTCATACCTTTTTACTCACTGGCTGGAATTGTCGTTGCAGTCCTGTCCTTCTTTGGATATCAACATGCAATCTATATTAATAATCAGTGGTTGGCTACTTGTGACATTTACATTTGTTCTTTGTGGATTGTTTGTAATTCTTGACAATGCTGTTTCTGATACATGTATGGCAATGGAAGAATGGGTGGATAATCCACATGCAGAAACAGCTCTTAGCAACATCCTTCCATGTGTTGACCATAAAACCACAAACCAGACACTGATCCAAAGCAAAAAGATCGTTAACGACATCGTGAGTGTCGTCGATCAATTTGTCTACAACTTCGCCAATGCAAATCCACCCCCGGGTTCTCCCAACTACTGCAACCAGTCAGGACCTTCCATGCCAGCTCTCTGTTACCCATACAACTCTCAGCTGGAAGAAAGTAGATGTGGCGACAACGACGTGACTATTGAAAATGCATCAACTGTATGGCAGAAGTTTGTGTGTCAAGTATCGGAATCTGGGCTGTGCATCACAGTCGGAAGGGTCTCGCCAAACATCCACTCCCAGATGGTGGCTGCAGTGAATGAGAGTTATGCACTTCAACATTACACTCCCCCTTTGCTCAGCTTCCAGAATTGCAATTTTGTAAGGGAAACATTTCACAACATTACCACAGCTTACTGCCCTCATCTGCATCGGCATCTTAAGATTGTGAATGTTGGACTTGCAATGATTTCAGTAGGAATATTGCTGTGTCTGTTGCTATGGATACTATATGCAAACCACCCCCAAAGGGAGGATGTGTCTGCAAAGCTATCGTTTTCGATAAACCGGAGGAGGAATGGTAACCAAAATACGAATAATAATAGCAGCGGAAACGACGAATCCACGACATCAAGCATCAGAAGCATCAGAAGTGGAGTTTAG

Protein sequence

MYYAVSVPGFSLPMKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGSANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFISFGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLHTLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVADKTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCIFRVYWTCSYLFTHWLELSLQSCPSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNILPCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPYNSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYALQHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYANHPQREDVSAKLSFSINRRRNGNQNTNNNSSGNDESTTSSIRSIRSGV
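
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first 13 codons of the CDS above can be translated as follows. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 39 nucleotides of the CDS shown above.
print(translate("ATGTATTATGCGGTCTCTGTACCGGGATTTTCTTTGCCG"))  # MYYAVSVPGFSLP
```

The result matches the first 13 residues of the protein sequence. Applied to the last 12 nucleotides of the CDS (`AGTGGAGTTTAG`), the same function returns `SGV` and stops at the terminal TAG codon.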
Homology
BLAST of ClCG05G022220 vs. NCBI nr
Match: XP_038893119.1 (uncharacterized protein LOC120081994 [Benincasa hispida])

HSP 1 Score: 1019.2 bits (2634), Expect = 1.4e-293
Identity = 525/586 (89.59%), Positives = 535/586 (91.30%), Query Frame = 0

Query: 14  MKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGS 73
           MKGFPASFLLLFLVGF TFSWVLA PQDV+PKDSGKF LGQ+NLGPWKNEILETAEG GS
Sbjct: 1   MKGFPASFLLLFLVGFATFSWVLAFPQDVLPKDSGKFKLGQENLGPWKNEILETAEGSGS 60

Query: 74  ANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIS 133
           A NNSQ+PLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFI 
Sbjct: 61  AYNNSQSPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIF 120

Query: 134 FGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 193
           FGIALLIHRCCGWK NL G+ESKTSQWICLALLVVFTSAA IGCILLCIGQNNFYNEGLH
Sbjct: 121 FGIALLIHRCCGWKFNLNGKESKTSQWICLALLVVFTSAATIGCILLCIGQNNFYNEGLH 180

Query: 194 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 253
           TLK+VVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD
Sbjct: 181 TLKFVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 240

Query: 254 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSC 313
           KT+VNSRK RKVFT MRSALITVAALMLLLALIGL + F  Y    Y             
Sbjct: 241 KTTVNSRKTRKVFTMMRSALITVAALMLLLALIGLFLSFFGYQHAIY------------- 300

Query: 314 PSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNIL 373
                    ILIISGWLLVT TFVLCGLFVILDNAVSDTCMAMEEWVDNP AETALSNIL
Sbjct: 301 ---------ILIISGWLLVTITFVLCGLFVILDNAVSDTCMAMEEWVDNPQAETALSNIL 360

Query: 374 PCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPY 433
           PCVDHKTTNQTLIQSKKIVNDIVSVVDQF+YNFANANPPPGSPNYCNQSGP MPALCYPY
Sbjct: 361 PCVDHKTTNQTLIQSKKIVNDIVSVVDQFIYNFANANPPPGSPNYCNQSGPPMPALCYPY 420

Query: 434 NSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYAL 493
           NSQLEESRC DNDVTIENASTVWQKFVCQVSESGLCITVGRVSP+IHSQMVAAVNESY L
Sbjct: 421 NSQLEESRCSDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPDIHSQMVAAVNESYGL 480

Query: 494 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYA 553
           QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVN GLAMISVGILLCLLLWILYA
Sbjct: 481 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNTGLAMISVGILLCLLLWILYA 540

Query: 554 NHPQREDVSAKLSFSINRRRNGNQNTNNNSSGNDESTTSSIRSIRS 599
           NHPQREDVSAKLSFSIN RR+ N NTNNN SGNDESTTSSIRSIRS
Sbjct: 541 NHPQREDVSAKLSFSINCRRSSNPNTNNNGSGNDESTTSSIRSIRS 564
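
The percentages in each HSP summary line are the identity and positive counts divided by the aligned length. The following sketch parses a summary line in the format used on this page and recomputes both values. The helper name `parse_hsp_stats` is hypothetical, and the pattern accepts either spelling of the "Positives" label, since both occur in output of this kind.

```python
import re

# Matches e.g. "Identity = 525/586 (89.59%), Positives = 535/586 (91.30%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, length, positives = int(m.group(1)), int(m.group(2)), int(m.group(4))
    return {
        "identities": identities,
        "positives": positives,
        "aligned_length": length,
        "identity_pct": round(100 * identities / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 525/586 (89.59%), Positives = 535/586 (91.30%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Benincasa hispida hit, 525 identical and 535 positive positions over 586 aligned columns reproduce the reported 89.59% and 91.30%.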

BLAST of ClCG05G022220 vs. NCBI nr
Match: XP_008446693.1 (PREDICTED: uncharacterized protein LOC103489338 [Cucumis melo])

HSP 1 Score: 1007.7 bits (2604), Expect = 4.3e-290
Identity = 519/589 (88.12%), Positives = 537/589 (91.17%), Query Frame = 0

Query: 14  MKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGS 73
           MKGFPASF LLF VG  TFSWVLALP DV+PKDSGKF LGQ+NLGPWKNEILETAEGPGS
Sbjct: 1   MKGFPASFFLLFFVGLATFSWVLALPHDVLPKDSGKFILGQENLGPWKNEILETAEGPGS 60

Query: 74  ANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIS 133
           ANNNSQ+PLVLAANRT+RPDILHGFRVYEGGWDIAN NYWASVGFTGATGFILS FWFIS
Sbjct: 61  ANNNSQSPLVLAANRTRRPDILHGFRVYEGGWDIANRNYWASVGFTGATGFILSFFWFIS 120

Query: 134 FGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 193
           FG ALL+HRCCGWKLNLKGEESKTS WICLALLVVFTSAA IGCILLCIGQN+FYNEGLH
Sbjct: 121 FGFALLVHRCCGWKLNLKGEESKTSHWICLALLVVFTSAATIGCILLCIGQNDFYNEGLH 180

Query: 194 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 253
           TLKYVVNQSDYTV TLKNVTEYLSLAKTI+VAQVFLPSDVMNDIDELNVDLNTAADTVAD
Sbjct: 181 TLKYVVNQSDYTVDTLKNVTEYLSLAKTINVAQVFLPSDVMNDIDELNVDLNTAADTVAD 240

Query: 254 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSC 313
           KTS+NSRKIRKVF AMRSALITVAA+MLLLALIGL + F  Y    Y             
Sbjct: 241 KTSLNSRKIRKVFAAMRSALITVAAIMLLLALIGLFLSFFGYQHAIY------------- 300

Query: 314 PSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNIL 373
                    ILIISGWLLVT TFVLCGLFVILDNAVSDTCMAMEEWV+N HAETALSNIL
Sbjct: 301 ---------ILIISGWLLVTITFVLCGLFVILDNAVSDTCMAMEEWVENTHAETALSNIL 360

Query: 374 PCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPY 433
           PCVDHKTTNQTLIQSKKIVNDIV+VVDQFVYNFANANPP GSPNY NQSGP MPALCYPY
Sbjct: 361 PCVDHKTTNQTLIQSKKIVNDIVNVVDQFVYNFANANPPSGSPNYRNQSGPPMPALCYPY 420

Query: 434 NSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYAL 493
           NSQLEESRCGDNDVTI+NASTVWQKFVC+VSESG+CITVGRVSP+IHSQMVAAVNESYAL
Sbjct: 421 NSQLEESRCGDNDVTIDNASTVWQKFVCEVSESGICITVGRVSPDIHSQMVAAVNESYAL 480

Query: 494 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYA 553
           QHYTPPLLSFQNCNFVRETFHNITTAYCPHLH HLKIVNVGLAMISVGILLCLLLWILYA
Sbjct: 481 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHHHLKIVNVGLAMISVGILLCLLLWILYA 540

Query: 554 NHPQREDVSAKLSFSINRRRNGNQNTNNNS-SGNDESTTSSIRSIRSGV 601
           NH QREDVSAKLSFS+NRRRN NQNTNNN+ SGNDESTTSSIRSIRSGV
Sbjct: 541 NHSQREDVSAKLSFSLNRRRNSNQNTNNNNGSGNDESTTSSIRSIRSGV 567

BLAST of ClCG05G022220 vs. NCBI nr
Match: XP_004135062.1 (uncharacterized protein LOC101211567 [Cucumis sativus] >KGN52138.1 hypothetical protein Csa_009105 [Cucumis sativus])

HSP 1 Score: 996.9 bits (2576), Expect = 7.6e-287
Identity = 516/590 (87.46%), Positives = 532/590 (90.17%), Query Frame = 0

Query: 14  MKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGS 73
           MKGFPASF LLF VGF TFSWVLALP DV+PKDSGKF LGQ+NL PWKNEILETAEGPGS
Sbjct: 1   MKGFPASFFLLFFVGFATFSWVLALPHDVLPKDSGKFILGQENLVPWKNEILETAEGPGS 60

Query: 74  ANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIS 133
           A NNSQ+PLVLAANRTKRPDILHGFRVYEGGWDIAN NYWASVGFTGATGFILSIFWFIS
Sbjct: 61  AKNNSQSPLVLAANRTKRPDILHGFRVYEGGWDIANQNYWASVGFTGATGFILSIFWFIS 120

Query: 134 FGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 193
           FG ALL+HRCCGWKLNLKGEESKTS WICLALLVVFTSAA IGCILLCIGQNNFYNEGLH
Sbjct: 121 FGCALLVHRCCGWKLNLKGEESKTSHWICLALLVVFTSAATIGCILLCIGQNNFYNEGLH 180

Query: 194 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 253
           TLKYVVNQSDYTV TL+NVTEYLSLAKTI+VAQVFLPSDVMN+IDELNV LNTAADTVAD
Sbjct: 181 TLKYVVNQSDYTVDTLRNVTEYLSLAKTINVAQVFLPSDVMNEIDELNVGLNTAADTVAD 240

Query: 254 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSC 313
           KTS+NSRKIRKVFT MRSALITVAA+MLLLALIGL + F  Y    Y             
Sbjct: 241 KTSLNSRKIRKVFTVMRSALITVAAIMLLLALIGLFLSFFGYQHAIY------------- 300

Query: 314 PSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNIL 373
                    ILIISGWLLVT TFVLCGLFVILDNAVSDTCMAMEEWV+N HAETALSNIL
Sbjct: 301 ---------ILIISGWLLVTITFVLCGLFVILDNAVSDTCMAMEEWVENTHAETALSNIL 360

Query: 374 PCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPY 433
           PCVDHKTTNQTLIQSKKIVNDIV+VVDQFVYNFANANP P SPNY NQSGP MPALCYPY
Sbjct: 361 PCVDHKTTNQTLIQSKKIVNDIVNVVDQFVYNFANANPSPDSPNYRNQSGPPMPALCYPY 420

Query: 434 NSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYAL 493
           NSQLEESRCGDNDVTI+NASTVWQKFVCQVSESG C+TVGRVSP+IHSQMVAAVNESYAL
Sbjct: 421 NSQLEESRCGDNDVTIDNASTVWQKFVCQVSESGTCVTVGRVSPDIHSQMVAAVNESYAL 480

Query: 494 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYA 553
           QHYTPPLLSFQNCNFVRETFHNITTAYCPHLH HLKIVNVGLAMISVGILLCLLLWILYA
Sbjct: 481 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHHHLKIVNVGLAMISVGILLCLLLWILYA 540

Query: 554 NHPQREDVSAKLSFSINRRRNGNQNTNNNS--SGNDESTTSSIRSIRSGV 601
           NH QRE VS KLSFS+NRRRN NQNTNNNS  SGNDESTTSSIRSIRSGV
Sbjct: 541 NHSQREAVSVKLSFSLNRRRNSNQNTNNNSNGSGNDESTTSSIRSIRSGV 568

BLAST of ClCG05G022220 vs. NCBI nr
Match: XP_022150603.1 (uncharacterized protein LOC111018699 [Momordica charantia])

HSP 1 Score: 925.6 bits (2391), Expect = 2.1e-265
Identity = 472/582 (81.10%), Positives = 512/582 (87.97%), Query Frame = 0

Query: 14  MKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGS 73
           MKGFPAS LLLFL+ F +FSWVLALPQ  + + SGKF LG++NLGPWKNEILE+AEGPGS
Sbjct: 1   MKGFPASLLLLFLLAFASFSWVLALPQHEVHEASGKFILGEENLGPWKNEILESAEGPGS 60

Query: 74  ANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIS 133
           ANN+SQ PLVLAANRTKRPDILHGFRVYE GWD  N NYWASVGFTGATGFILSIFWFIS
Sbjct: 61  ANNDSQPPLVLAANRTKRPDILHGFRVYEAGWDFTNRNYWASVGFTGATGFILSIFWFIS 120

Query: 134 FGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 193
           FGIALL+H CCGWK+NLKGEESK SQW+CLALLVVFT AA IGCILL IGQNNFYNE ++
Sbjct: 121 FGIALLVHHCCGWKINLKGEESKASQWVCLALLVVFTCAATIGCILLSIGQNNFYNEAMN 180

Query: 194 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 253
           TLKYVVNQSDYTV TLKNVTEYLSLAKTI+VAQVFLP DVMN+IDELNV+LNTAADTVA+
Sbjct: 181 TLKYVVNQSDYTVDTLKNVTEYLSLAKTINVAQVFLPFDVMNEIDELNVNLNTAADTVAE 240

Query: 254 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSC 313
           KT+ NS KI++VF A+RSALITVAALMLLLALIGL + F  Y    Y             
Sbjct: 241 KTTTNSHKIKRVFIAVRSALITVAALMLLLALIGLFLSFFGYQHAIY------------- 300

Query: 314 PSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNIL 373
                    ILIISGWLLV FTFVLCGLFVILDNAVSDTCMAMEEWVD+PHAETALSNIL
Sbjct: 301 ---------ILIISGWLLVAFTFVLCGLFVILDNAVSDTCMAMEEWVDHPHAETALSNIL 360

Query: 374 PCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPY 433
           PCVDH+TTNQTLIQSKKIVNDIV VVDQFVYNFANANPPPGSPNY NQSGP MPALCYPY
Sbjct: 361 PCVDHRTTNQTLIQSKKIVNDIVGVVDQFVYNFANANPPPGSPNYRNQSGPQMPALCYPY 420

Query: 434 NSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYAL 493
           NSQL+ESRCGDNDVTIENA+TVWQKFVCQ SESG+C TVGRV P+ ++++VAAVNESYAL
Sbjct: 421 NSQLQESRCGDNDVTIENAATVWQKFVCQASESGVCTTVGRVPPDFYAELVAAVNESYAL 480

Query: 494 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYA 553
           QHYTPPLLSFQNCNFVR+TFHNITTAYCPHLH HLK+VN+GLAM+SVGILLCLLLWILYA
Sbjct: 481 QHYTPPLLSFQNCNFVRDTFHNITTAYCPHLHHHLKMVNIGLAMVSVGILLCLLLWILYA 540

Query: 554 NHPQREDVSAKLSFSINRRRNGNQNTNNNSSGNDESTTSSIR 595
           NHPQ E+VSAKLS SINRRRN N+NT N + GNDE T+SSIR
Sbjct: 541 NHPQWEEVSAKLSLSINRRRNANRNT-NETGGNDEPTSSSIR 559

BLAST of ClCG05G022220 vs. NCBI nr
Match: KAG7012983.1 (hypothetical protein SDJN02_25737, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 901.4 bits (2328), Expect = 4.3e-258
Identity = 472/573 (82.37%), Positives = 499/573 (87.09%), Query Frame = 0

Query: 19  ASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGSANNNS 78
           AS  LLFLV F  F  VLAL  D++PKDSGKF LGQ+NLGPWKNEILETAE PGSANN+S
Sbjct: 8   ASISLLFLVDFAAFFLVLALSHDLVPKDSGKFVLGQENLGPWKNEILETAEAPGSANNDS 67

Query: 79  QNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFISFGIAL 138
           Q PL+LAANRTKRPDILHGFRVYEGGWDIAN +YWASV FTGATGFILSI WFISFGIAL
Sbjct: 68  QGPLLLAANRTKRPDILHGFRVYEGGWDIANRDYWASVAFTGATGFILSILWFISFGIAL 127

Query: 139 LIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLHTLKYV 198
            IH CCGWKLN+KGEESKTSQ ICLALLVV T AA IGCILLCIGQN+FYNEGLHTLKYV
Sbjct: 128 FIHLCCGWKLNIKGEESKTSQRICLALLVVLTCAATIGCILLCIGQNDFYNEGLHTLKYV 187

Query: 199 VNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVADKTSVN 258
           VNQSDYTV TLKNVTEYLSLAKTISVA+VFLP DV+NDIDELNVDLNTAADTVA+KTS+N
Sbjct: 188 VNQSDYTVDTLKNVTEYLSLAKTISVAEVFLPIDVINDIDELNVDLNTAADTVAEKTSIN 247

Query: 259 SRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSCPSLDI 318
           S KI +VF AMRSALITVAALMLLLAL+GL + F  Y    Y                  
Sbjct: 248 SHKITRVFIAMRSALITVAALMLLLALVGLFLSFFGYQHAMY------------------ 307

Query: 319 NMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNILPCVDH 378
               ILI+SGWLLVT TFVL GLFVILD+AVSDTCMAMEEWVDNPHAETALSNILPCVDH
Sbjct: 308 ----ILILSGWLLVTITFVLYGLFVILDSAVSDTCMAMEEWVDNPHAETALSNILPCVDH 367

Query: 379 KTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPYNSQLE 438
           KTTN+TLIQSKKIVNDIVSVVDQFVYNFANANPPPG PNYCNQSGP MPALCYPYNSQLE
Sbjct: 368 KTTNRTLIQSKKIVNDIVSVVDQFVYNFANANPPPGLPNYCNQSGPPMPALCYPYNSQLE 427

Query: 439 ESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYALQHYTP 498
           ESRCGDNDVTI+NASTVWQKFVCQVSES LC TVGRV+P+I+SQMVAAVNESYALQHYTP
Sbjct: 428 ESRCGDNDVTIDNASTVWQKFVCQVSESKLCTTVGRVTPDIYSQMVAAVNESYALQHYTP 487

Query: 499 PLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYANHPQ- 558
           PLLS QNCNFVRETFHNITT YCPHLH HLKIVNVGLAMISVG+LLCLLLWILYANH Q 
Sbjct: 488 PLLSLQNCNFVRETFHNITTGYCPHLHHHLKIVNVGLAMISVGVLLCLLLWILYANHLQR 547

Query: 559 REDVSAKLSFSINRRRNGNQNTNNNSSGNDEST 590
           R DVSAK+S SINR RN +QN  +NS GNDES+
Sbjct: 548 RSDVSAKISLSINRWRNTSQNL-SNSGGNDESS 557

BLAST of ClCG05G022220 vs. ExPASy TrEMBL
Match: A0A1S3BGC4 (uncharacterized protein LOC103489338 OS=Cucumis melo OX=3656 GN=LOC103489338 PE=4 SV=1)

HSP 1 Score: 1007.7 bits (2604), Expect = 2.1e-290
Identity = 519/589 (88.12%), Positives = 537/589 (91.17%), Query Frame = 0

Query: 14  MKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGS 73
           MKGFPASF LLF VG  TFSWVLALP DV+PKDSGKF LGQ+NLGPWKNEILETAEGPGS
Sbjct: 1   MKGFPASFFLLFFVGLATFSWVLALPHDVLPKDSGKFILGQENLGPWKNEILETAEGPGS 60

Query: 74  ANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIS 133
           ANNNSQ+PLVLAANRT+RPDILHGFRVYEGGWDIAN NYWASVGFTGATGFILS FWFIS
Sbjct: 61  ANNNSQSPLVLAANRTRRPDILHGFRVYEGGWDIANRNYWASVGFTGATGFILSFFWFIS 120

Query: 134 FGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 193
           FG ALL+HRCCGWKLNLKGEESKTS WICLALLVVFTSAA IGCILLCIGQN+FYNEGLH
Sbjct: 121 FGFALLVHRCCGWKLNLKGEESKTSHWICLALLVVFTSAATIGCILLCIGQNDFYNEGLH 180

Query: 194 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 253
           TLKYVVNQSDYTV TLKNVTEYLSLAKTI+VAQVFLPSDVMNDIDELNVDLNTAADTVAD
Sbjct: 181 TLKYVVNQSDYTVDTLKNVTEYLSLAKTINVAQVFLPSDVMNDIDELNVDLNTAADTVAD 240

Query: 254 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSC 313
           KTS+NSRKIRKVF AMRSALITVAA+MLLLALIGL + F  Y    Y             
Sbjct: 241 KTSLNSRKIRKVFAAMRSALITVAAIMLLLALIGLFLSFFGYQHAIY------------- 300

Query: 314 PSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNIL 373
                    ILIISGWLLVT TFVLCGLFVILDNAVSDTCMAMEEWV+N HAETALSNIL
Sbjct: 301 ---------ILIISGWLLVTITFVLCGLFVILDNAVSDTCMAMEEWVENTHAETALSNIL 360

Query: 374 PCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPY 433
           PCVDHKTTNQTLIQSKKIVNDIV+VVDQFVYNFANANPP GSPNY NQSGP MPALCYPY
Sbjct: 361 PCVDHKTTNQTLIQSKKIVNDIVNVVDQFVYNFANANPPSGSPNYRNQSGPPMPALCYPY 420

Query: 434 NSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYAL 493
           NSQLEESRCGDNDVTI+NASTVWQKFVC+VSESG+CITVGRVSP+IHSQMVAAVNESYAL
Sbjct: 421 NSQLEESRCGDNDVTIDNASTVWQKFVCEVSESGICITVGRVSPDIHSQMVAAVNESYAL 480

Query: 494 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYA 553
           QHYTPPLLSFQNCNFVRETFHNITTAYCPHLH HLKIVNVGLAMISVGILLCLLLWILYA
Sbjct: 481 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHHHLKIVNVGLAMISVGILLCLLLWILYA 540

Query: 554 NHPQREDVSAKLSFSINRRRNGNQNTNNNS-SGNDESTTSSIRSIRSGV 601
           NH QREDVSAKLSFS+NRRRN NQNTNNN+ SGNDESTTSSIRSIRSGV
Sbjct: 541 NHSQREDVSAKLSFSLNRRRNSNQNTNNNNGSGNDESTTSSIRSIRSGV 567

BLAST of ClCG05G022220 vs. ExPASy TrEMBL
Match: A0A0A0KRA9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G611690 PE=4 SV=1)

HSP 1 Score: 996.9 bits (2576), Expect = 3.7e-287
Identity = 516/590 (87.46%), Positives = 532/590 (90.17%), Query Frame = 0

Query: 14  MKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGS 73
           MKGFPASF LLF VGF TFSWVLALP DV+PKDSGKF LGQ+NL PWKNEILETAEGPGS
Sbjct: 1   MKGFPASFFLLFFVGFATFSWVLALPHDVLPKDSGKFILGQENLVPWKNEILETAEGPGS 60

Query: 74  ANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIS 133
           A NNSQ+PLVLAANRTKRPDILHGFRVYEGGWDIAN NYWASVGFTGATGFILSIFWFIS
Sbjct: 61  AKNNSQSPLVLAANRTKRPDILHGFRVYEGGWDIANQNYWASVGFTGATGFILSIFWFIS 120

Query: 134 FGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 193
           FG ALL+HRCCGWKLNLKGEESKTS WICLALLVVFTSAA IGCILLCIGQNNFYNEGLH
Sbjct: 121 FGCALLVHRCCGWKLNLKGEESKTSHWICLALLVVFTSAATIGCILLCIGQNNFYNEGLH 180

Query: 194 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 253
           TLKYVVNQSDYTV TL+NVTEYLSLAKTI+VAQVFLPSDVMN+IDELNV LNTAADTVAD
Sbjct: 181 TLKYVVNQSDYTVDTLRNVTEYLSLAKTINVAQVFLPSDVMNEIDELNVGLNTAADTVAD 240

Query: 254 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSC 313
           KTS+NSRKIRKVFT MRSALITVAA+MLLLALIGL + F  Y    Y             
Sbjct: 241 KTSLNSRKIRKVFTVMRSALITVAAIMLLLALIGLFLSFFGYQHAIY------------- 300

Query: 314 PSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNIL 373
                    ILIISGWLLVT TFVLCGLFVILDNAVSDTCMAMEEWV+N HAETALSNIL
Sbjct: 301 ---------ILIISGWLLVTITFVLCGLFVILDNAVSDTCMAMEEWVENTHAETALSNIL 360

Query: 374 PCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPY 433
           PCVDHKTTNQTLIQSKKIVNDIV+VVDQFVYNFANANP P SPNY NQSGP MPALCYPY
Sbjct: 361 PCVDHKTTNQTLIQSKKIVNDIVNVVDQFVYNFANANPSPDSPNYRNQSGPPMPALCYPY 420

Query: 434 NSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYAL 493
           NSQLEESRCGDNDVTI+NASTVWQKFVCQVSESG C+TVGRVSP+IHSQMVAAVNESYAL
Sbjct: 421 NSQLEESRCGDNDVTIDNASTVWQKFVCQVSESGTCVTVGRVSPDIHSQMVAAVNESYAL 480

Query: 494 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYA 553
           QHYTPPLLSFQNCNFVRETFHNITTAYCPHLH HLKIVNVGLAMISVGILLCLLLWILYA
Sbjct: 481 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHHHLKIVNVGLAMISVGILLCLLLWILYA 540

Query: 554 NHPQREDVSAKLSFSINRRRNGNQNTNNNS--SGNDESTTSSIRSIRSGV 601
           NH QRE VS KLSFS+NRRRN NQNTNNNS  SGNDESTTSSIRSIRSGV
Sbjct: 541 NHSQREAVSVKLSFSLNRRRNSNQNTNNNSNGSGNDESTTSSIRSIRSGV 568

BLAST of ClCG05G022220 vs. ExPASy TrEMBL
Match: A0A6J1D9V7 (uncharacterized protein LOC111018699 OS=Momordica charantia OX=3673 GN=LOC111018699 PE=4 SV=1)

HSP 1 Score: 925.6 bits (2391), Expect = 1.0e-265
Identity = 472/582 (81.10%), Positives = 512/582 (87.97%), Query Frame = 0

Query: 14  MKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGS 73
           MKGFPAS LLLFL+ F +FSWVLALPQ  + + SGKF LG++NLGPWKNEILE+AEGPGS
Sbjct: 1   MKGFPASLLLLFLLAFASFSWVLALPQHEVHEASGKFILGEENLGPWKNEILESAEGPGS 60

Query: 74  ANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIS 133
           ANN+SQ PLVLAANRTKRPDILHGFRVYE GWD  N NYWASVGFTGATGFILSIFWFIS
Sbjct: 61  ANNDSQPPLVLAANRTKRPDILHGFRVYEAGWDFTNRNYWASVGFTGATGFILSIFWFIS 120

Query: 134 FGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 193
           FGIALL+H CCGWK+NLKGEESK SQW+CLALLVVFT AA IGCILL IGQNNFYNE ++
Sbjct: 121 FGIALLVHHCCGWKINLKGEESKASQWVCLALLVVFTCAATIGCILLSIGQNNFYNEAMN 180

Query: 194 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 253
           TLKYVVNQSDYTV TLKNVTEYLSLAKTI+VAQVFLP DVMN+IDELNV+LNTAADTVA+
Sbjct: 181 TLKYVVNQSDYTVDTLKNVTEYLSLAKTINVAQVFLPFDVMNEIDELNVNLNTAADTVAE 240

Query: 254 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSC 313
           KT+ NS KI++VF A+RSALITVAALMLLLALIGL + F  Y    Y             
Sbjct: 241 KTTTNSHKIKRVFIAVRSALITVAALMLLLALIGLFLSFFGYQHAIY------------- 300

Query: 314 PSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNIL 373
                    ILIISGWLLV FTFVLCGLFVILDNAVSDTCMAMEEWVD+PHAETALSNIL
Sbjct: 301 ---------ILIISGWLLVAFTFVLCGLFVILDNAVSDTCMAMEEWVDHPHAETALSNIL 360

Query: 374 PCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPY 433
           PCVDH+TTNQTLIQSKKIVNDIV VVDQFVYNFANANPPPGSPNY NQSGP MPALCYPY
Sbjct: 361 PCVDHRTTNQTLIQSKKIVNDIVGVVDQFVYNFANANPPPGSPNYRNQSGPQMPALCYPY 420

Query: 434 NSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYAL 493
           NSQL+ESRCGDNDVTIENA+TVWQKFVCQ SESG+C TVGRV P+ ++++VAAVNESYAL
Sbjct: 421 NSQLQESRCGDNDVTIENAATVWQKFVCQASESGVCTTVGRVPPDFYAELVAAVNESYAL 480

Query: 494 QHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYA 553
           QHYTPPLLSFQNCNFVR+TFHNITTAYCPHLH HLK+VN+GLAM+SVGILLCLLLWILYA
Sbjct: 481 QHYTPPLLSFQNCNFVRDTFHNITTAYCPHLHHHLKMVNIGLAMVSVGILLCLLLWILYA 540

Query: 554 NHPQREDVSAKLSFSINRRRNGNQNTNNNSSGNDESTTSSIR 595
           NHPQ E+VSAKLS SINRRRN N+NT N + GNDE T+SSIR
Sbjct: 541 NHPQWEEVSAKLSLSINRRRNANRNT-NETGGNDEPTSSSIR 559

BLAST of ClCG05G022220 vs. ExPASy TrEMBL
Match: A0A6J1G0F9 (uncharacterized protein LOC111449579 OS=Cucurbita moschata OX=3662 GN=LOC111449579 PE=4 SV=1)

HSP 1 Score: 901.4 bits (2328), Expect = 2.1e-258
Identity = 472/573 (82.37%), Positives = 499/573 (87.09%), Query Frame = 0

Query: 19  ASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGSANNNS 78
           AS  LLFLV F  F  VLAL  D++PKDSGKF LGQ+NLGPWKNEILETAE PGSANN+S
Sbjct: 8   ASISLLFLVDFAAFFLVLALSHDLVPKDSGKFVLGQENLGPWKNEILETAEAPGSANNDS 67

Query: 79  QNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFISFGIAL 138
           Q PL+LAANRTKRPDILHGFRVYEGGWDIAN +YWASV FTGATGFILSI WFISFGIAL
Sbjct: 68  QGPLLLAANRTKRPDILHGFRVYEGGWDIANRDYWASVAFTGATGFILSILWFISFGIAL 127

Query: 139 LIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLHTLKYV 198
            IH CCGWKLN+KGEESKTSQ ICLALLVV T AA IGCILLCIGQN+FYNEGLHTLKYV
Sbjct: 128 FIHLCCGWKLNIKGEESKTSQRICLALLVVLTCAATIGCILLCIGQNDFYNEGLHTLKYV 187

Query: 199 VNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVADKTSVN 258
           VNQSDYTV TLKNVTEYLSLAKTISVA+VFLP DV+NDIDELNVDLNTAADTVA+KTS+N
Sbjct: 188 VNQSDYTVDTLKNVTEYLSLAKTISVAEVFLPIDVINDIDELNVDLNTAADTVAEKTSIN 247

Query: 259 SRKIRKVFTAMRSALITVAALMLLLALIGLCI-FRVYWTCSYLFTHWLELSLQSCPSLDI 318
           S KI +VF AMRSALITVAALMLLLAL+GL + F  Y    Y                  
Sbjct: 248 SHKITRVFIAMRSALITVAALMLLLALVGLFLSFFGYQHAMY------------------ 307

Query: 319 NMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNILPCVDH 378
               ILI+SGWLLVT TFVL GLFVILD+AVSDTCMAMEEWVDNPHAETALSNILPCVDH
Sbjct: 308 ----ILILSGWLLVTITFVLYGLFVILDSAVSDTCMAMEEWVDNPHAETALSNILPCVDH 367

Query: 379 KTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPYNSQLE 438
           KTTN+TLIQSKKIVNDIVSVVDQFVYNFANANPPPG PNYCNQSGP MPALCYPYNSQLE
Sbjct: 368 KTTNRTLIQSKKIVNDIVSVVDQFVYNFANANPPPGLPNYCNQSGPPMPALCYPYNSQLE 427

Query: 439 ESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYALQHYTP 498
           ESRCGDNDVTI+NASTVWQKFVCQVSES LC TVGRV+P+I+SQMVAAVNESYALQHYTP
Sbjct: 428 ESRCGDNDVTIDNASTVWQKFVCQVSESKLCTTVGRVTPDIYSQMVAAVNESYALQHYTP 487

Query: 499 PLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYANHPQ- 558
           PLLS QNCNFVRETFHNITT YCPHLH HLKIVNVGLAMISVG+LLCLLLWILYANH Q 
Sbjct: 488 PLLSLQNCNFVRETFHNITTGYCPHLHHHLKIVNVGLAMISVGVLLCLLLWILYANHLQR 547

Query: 559 REDVSAKLSFSINRRRNGNQNTNNNSSGNDEST 590
           R DVSAK+S SINR RN +QN  +NS GNDES+
Sbjct: 548 RSDVSAKISLSINRWRNTSQNL-SNSGGNDESS 557

BLAST of ClCG05G022220 vs. ExPASy TrEMBL
Match: A0A6J1H021 (uncharacterized protein LOC111458797 OS=Cucurbita moschata OX=3662 GN=LOC111458797 PE=4 SV=1)

HSP 1 Score: 896.3 bits (2315), Expect = 6.7e-257
Identity = 468/581 (80.55%), Positives = 501/581 (86.23%), Query Frame = 0

Query: 14  MKGFPASFLLLFLVGFTTFSWVLALPQDVMPKDSGKFFLGQQNLGPWKNEILETAEGPGS 73
           MKGFPAS  LLFLVGF TFSWVLALP DV+ +DSG F LGQ N GPW+N+IL+TA+  GS
Sbjct: 1   MKGFPAS--LLFLVGFATFSWVLALPHDVVREDSGNFILGQNNFGPWENQILQTAKASGS 60

Query: 74  ANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFIS 133
           + N+SQ+PL+LAANRTKRPDI HGFRVYEGGWDIANPNYWASVGFTGATGFILSI WFIS
Sbjct: 61  SKNDSQSPLLLAANRTKRPDIRHGFRVYEGGWDIANPNYWASVGFTGATGFILSILWFIS 120

Query: 134 FGIALLIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 193
           FGIALLIHR CGWKLNLKGEESKTSQWICLALLVVFT  A+IG ILLCIGQNNFY+E L 
Sbjct: 121 FGIALLIHRFCGWKLNLKGEESKTSQWICLALLVVFTCVASIGSILLCIGQNNFYHESLD 180

Query: 194 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 253
           TLKYVVNQSDY V TLKNVTEYLSLAKTISVAQVFLPSDVM+DIDELNVDLNTAADTVAD
Sbjct: 181 TLKYVVNQSDYIVDTLKNVTEYLSLAKTISVAQVFLPSDVMDDIDELNVDLNTAADTVAD 240

Query: 254 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCIFRVYWTCSYLFTHWLELSLQSCP 313
           K  +NS KIRK F AMRSALIT+A +MLLLAL+GL +         LF +          
Sbjct: 241 KMRINSHKIRKYFAAMRSALITIAVVMLLLALVGLFL--------SLFGY---------- 300

Query: 314 SLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNILP 373
               ++  IL+ISGWLLVT TFVLCGLFVILDN+VSDTCMAMEEWVDNP AETALSNILP
Sbjct: 301 ---RHVVYILMISGWLLVTITFVLCGLFVILDNSVSDTCMAMEEWVDNPQAETALSNILP 360

Query: 374 CVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPYN 433
           CVD KTTNQTLIQSKKIVNDIVSV +QF+YNFANANP PGSPN  NQSGP MPALCYPYN
Sbjct: 361 CVDPKTTNQTLIQSKKIVNDIVSVANQFIYNFANANPSPGSPNDHNQSGPPMPALCYPYN 420

Query: 434 SQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYALQ 493
           SQLEE+RCGDNDVTI NASTVWQKFVCQVSE G C +VGRV+P+I+SQMVAAVNESYALQ
Sbjct: 421 SQLEETRCGDNDVTIGNASTVWQKFVCQVSEPGKCTSVGRVTPDIYSQMVAAVNESYALQ 480

Query: 494 HYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYAN 553
           HYTPPLLS QNCNFVRETFHNITT YCP+LH+HLKIVN+GLAM SVG LLCLLLWILYAN
Sbjct: 481 HYTPPLLSLQNCNFVRETFHNITTGYCPYLHQHLKIVNIGLAMTSVGTLLCLLLWILYAN 540

Query: 554 HPQREDVSAKLSFSINRRRNGNQNTNNNSSGNDESTTSSIR 595
           HPQ  DVSAKLSFSI RRRNG QN  NN S NDE TTSSIR
Sbjct: 541 HPQMGDVSAKLSFSIQRRRNGTQNI-NNPSRNDELTTSSIR 557
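
The percentages in each HSP header follow directly from the counts. Identity is the number of identical positions divided by the alignment length. Positives is the number of positions with a positive substitution score (identical residues included) divided by the same length. Below is a minimal Python sketch, checked against the A0A6J1H021 hit above. The helper name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(count: int, length: int) -> float:
    """Return count/length as a percentage rounded to two decimals."""
    if length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / length, 2)


# Counts taken from the HSP header of the A0A6J1H021 match:
# Identity = 468/581, Positives = 501/581
identity = percent(468, 581)
positives = percent(501, 581)
print(f"Identity = {identity}%, Positives = {positives}%")
```

The same helper can be applied to the counts in any other HSP header on this page.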

BLAST of ClCG05G022220 vs. TAIR 10
Match: AT1G71110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G12400.1); Has 173 Blast hits to 169 proteins in 21 species: Archaea - 0; Bacteria - 0; Metazoa - 3; Fungi - 0; Plants - 165; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink).)

HSP 1 Score: 580.5 bits (1495), Expect = 1.6e-165
Identity = 302/549 (55.01%), Positives = 402/549 (73.22%), Query Frame = 0

Query: 20  SFLLLFLVGFTTFSWVLALPQDV-----MPKDSGKFFLGQQNLGPWKNEILETAEGPGSA 79
           SF +L +V F + ++  +LP  V       +D  +  LG  N G WK  I   A GP S 
Sbjct: 2   SFFILSVVVFVSLAF-FSLPHSVDSSVSASQDPLRLILGSPNFGTWKGGI-SLAPGPES- 61

Query: 80  NNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFISF 139
           ++   + L+LAA+RTKRPDIL  F+ Y GGW+I N +YWASVGFTGA GFIL++ W +SF
Sbjct: 62  DDVVSDYLLLAAHRTKRPDILRAFKPYHGGWNITNNHYWASVGFTGAPGFILAVIWLLSF 121

Query: 140 GIALLIHRCCGWKLNLKGEESK-TSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLH 199
           G  L+++ C  W++  K + S   ++ IC  LL+VFT  AA+GCILL +GQ+ F+ E +H
Sbjct: 122 GSLLVVYHCFKWRICDKAKGSSFDTRRICFILLIVFTCVAAVGCILLSVGQDKFHTEAMH 181

Query: 200 TLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVAD 259
           TLKYVVNQSDYTV  L+NVT+YLSLAKTI+V Q+ +PSDVM +ID+LNV+LNTAA T+ +
Sbjct: 182 TLKYVVNQSDYTVEILQNVTQYLSLAKTINVTQIVIPSDVMGEIDKLNVNLNTAAVTLGE 241

Query: 260 KTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCIFRVYWTCSYLFTHWLELSLQSCP 319
            T+ N+ KI++VF A+RSALITVA +ML+L+ +GL +         +  H          
Sbjct: 242 TTTDNAAKIKRVFYAVRSALITVATVMLILSFVGLLL--------SVLRHQ--------- 301

Query: 320 SLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNILP 379
               ++  I ++SGW+LV  TFVLCG+F+IL+NA+SDTC+AM+EWVDNPHAETALS+ILP
Sbjct: 302 ----HVVHIFVVSGWILVAVTFVLCGVFLILNNAISDTCVAMKEWVDNPHAETALSSILP 361

Query: 380 CVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYPYN 439
           CVD +TTNQTL QSK ++N IV+VV+ FVY  AN NP PG   Y NQSGP MP LC P++
Sbjct: 362 CVDQQTTNQTLSQSKVVINSIVTVVNTFVYAVANTNPAPGQDRYYNQSGPPMPPLCIPFD 421

Query: 440 SQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESYALQ 499
           + +E+ +C   +++IENAS+VW+ + C+V+ SG+C TVGRV+P+   Q+VAAVNESYAL+
Sbjct: 422 ANMEDRQCSPWELSIENASSVWENYKCEVTPSGICTTVGRVTPDTFGQLVAAVNESYALE 481

Query: 500 HYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWILYAN 559
           HYTPPLLSF++CNFVRETF +IT+ YCP L R+L+IVN GL +ISVG+LLCL+LWI YAN
Sbjct: 482 HYTPPLLSFRDCNFVRETFMSITSDYCPPLVRNLRIVNAGLGLISVGVLLCLVLWIFYAN 526

Query: 560 HPQREDVSA 563
            PQRE+V A
Sbjct: 542 RPQREEVFA 526

BLAST of ClCG05G022220 vs. TAIR 10
Match: AT2G12400.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G25270.1); Has 177 Blast hits to 172 proteins in 23 species: Archaea - 0; Bacteria - 2; Metazoa - 3; Fungi - 0; Plants - 164; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink).)

HSP 1 Score: 366.7 bits (940), Expect = 3.6e-101
Identity = 202/498 (40.56%), Positives = 295/498 (59.24%), Query Frame = 0

Query: 60  WKNEILETAEGPGSANNNSQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFT 119
           W+  ++E      S  N+S   L+LAA RT+R D    F++Y GGW+I+N +Y  SVG+T
Sbjct: 46  WRTSVIERVIAEESGENSS---LILAAKRTRRKDPADNFKLYTGGWNISNSHYLTSVGYT 105

Query: 120 GATGFILSIFWFISFGIAL----LIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAI 179
            A   I+++ WF+ FG++L    L + CC          S+ +  + L LL+ FT AA I
Sbjct: 106 AAPFIIIALVWFVFFGLSLSLICLCYCCCA---RQSYGYSRVAYALSLILLISFTIAAII 165

Query: 180 GCILLCIGQNNFYNEGLHTLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMN 239
           GC+ L  GQ  F+     TL YVV+Q++ T   L+NV++YL+ AK + V    LP DV++
Sbjct: 166 GCVFLYTGQGKFHASTTDTLDYVVSQANLTSENLRNVSDYLNAAKKVDVQSSILPQDVLS 225

Query: 240 DIDELNVDLNTAADTVADKTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCIFRVYW 299
            ID +   +N++A T++ KT  N  KI+ V   MR AL+ +AA+ML LA IG  +  ++ 
Sbjct: 226 SIDNIQGKINSSATTLSVKTMENQDKIQNVLDIMRLALVIIAAVMLFLAFIGF-LLSIFG 285

Query: 300 TCSYLFTHWLELSLQSCPSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAM 359
               ++T                    L+I GW+LVT TFVLCG F++L N V DTC+AM
Sbjct: 286 LQCLVYT--------------------LVILGWILVTVTFVLCGGFLLLHNVVGDTCVAM 345

Query: 360 EEWVDNPHAETALSNILPCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANAN-PPPGS 419
           ++WV NP A TAL +ILPCVD+ T  +TL ++K +   +V+++D  + N  N N PP   
Sbjct: 346 DQWVQNPTAHTALDDILPCVDNATARETLTRTKLVTYQLVNLLDNAISNMTNRNFPPQFR 405

Query: 420 PNYCNQSGPSMPALCYPYNSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRV 479
           P Y NQSGP MP LC P+N+ L + +C    V + NA+ VW+ F CQ+   G C T GR+
Sbjct: 406 PLYYNQSGPLMPLLCNPFNADLSDRQCQPGQVHLNNATEVWKNFTCQIVTPGTCSTPGRL 465

Query: 480 SPNIHSQMVAAVNESYALQHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGL 539
           +P ++SQM AAVN SY L  Y P L   Q C+FVR TF +I   +CP L R+ + + VGL
Sbjct: 466 TPKLYSQMAAAVNVSYGLYKYGPFLADLQGCDFVRSTFTDIERDHCPGLKRYTQWIYVGL 516

Query: 540 AMISVGILLCLLLWILYA 553
            ++S  ++  L+ W++YA
Sbjct: 526 VVVSASVMSSLVFWVIYA 516

BLAST of ClCG05G022220 vs. TAIR 10
Match: AT2G25270.1 (unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G12400.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archaea - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).)

HSP 1 Score: 342.4 bits (877), Expect = 7.2e-94
Identity = 189/489 (38.65%), Positives = 294/489 (60.12%), Query Frame = 0

Query: 70  GPGSANNN---SQNPLVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFIL 129
           GP   NN        + LAA RT R D L+GF  Y GGW+I+N +YWASV +T    F+L
Sbjct: 55  GPAGFNNPQVIEVASVALAAQRTYRKDPLNGFEKYTGGWNISNQHYWASVSYTAVPLFVL 114

Query: 130 SIFWFISFGIALLIHRCCG--WKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQ 189
           +  WF+ FGI LL+   C    + N  G  SK +  + L  L++FT  A IGC+LL  GQ
Sbjct: 115 AAVWFLGFGICLLVICMCHICHRTNSVG-YSKVAYVVSLIFLLIFTVIAIIGCVLLYSGQ 174

Query: 190 NNFYNEGLHTLKYVVNQSDYTVHTLKNVTEYLSLAKTISVAQVFLPSDVMNDIDELNVDL 249
             +      TL+YV++Q+D T+  L+ +++YL+ AK  +V QV LP++V  +ID++ V L
Sbjct: 175 IRYNKSTTETLEYVMSQADSTISQLRAISDYLASAKQAAVLQVLLPANVQTEIDQIGVKL 234

Query: 250 NTAADTVADKTSVNSRKIRKVFTAMRSALITVAALMLLLALIGLCIFRVYWTCSYLFTHW 309
           +++  T+ +K++ +S  IR    ++R ALI V+ +ML++  +GL +  ++     ++T  
Sbjct: 235 DSSVATITEKSTNSSNHIRHFLDSVRVALIVVSIVMLVVTFLGL-VSSIFGMQVIVYT-- 294

Query: 310 LELSLQSCPSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHA 369
                             L+I GW+LVT TF+L G F++L NA +DTC+AM EWV+ P +
Sbjct: 295 ------------------LVILGWILVTGTFILSGTFLVLHNATADTCVAMSEWVERPSS 354

Query: 370 ETALSNILPCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANAN-PPPGSPNYCNQSGP 429
            TAL  ILPC D+ T  +TL++S+++   +V +++  + N +N N  P   P Y NQSGP
Sbjct: 355 NTALDEILPCTDNATAQETLMRSREVTGQLVELINTVITNVSNINFSPVFVPMYYNQSGP 414

Query: 430 SMPALCYPYNSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMV 489
            +P LC P+N  L +  C   D+ + NA+  W  FVCQVS++G C T GR++P ++SQM 
Sbjct: 415 LLPLLCNPFNHDLTDRSCSPGDLDLNNATEAWTSFVCQVSQNGTCTTTGRLTPALYSQMA 474

Query: 490 AAVNESYALQHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILL 549
           + VN S  L    P L+  Q+C++ ++TF +IT  +CP L R+   V VGLA+++  ++L
Sbjct: 475 SGVNISTGLIRDAPFLVQLQDCSYAKQTFRDITNDHCPGLQRYGYWVYVGLAILATAVML 521

Query: 550 CLLLWILYA 553
            L+ WI+Y+
Sbjct: 535 SLMFWIIYS 521

BLAST of ClCG05G022220 vs. TAIR 10
Match: AT1G80540.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G12400.1); Has 175 Blast hits to 171 proteins in 20 species: Archaea - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 171; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink).)

HSP 1 Score: 274.6 bits (701), Expect = 1.8e-73
Identity = 168/500 (33.60%), Positives = 265/500 (53.00%), Query Frame = 0

Query: 82  LVLAANRTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFISFGIAL--- 141
           LVLAA RT+RPD L+ F +Y  GW++ N +Y ASVGF+     +++I WF+  G+ L   
Sbjct: 64  LVLAAERTQRPDPLNHFNIYVDGWNVTNSHYIASVGFSAVPFIVIAIAWFVLLGLFLICS 123

Query: 142 -LIHRCCGWKLNLKGEESKTSQWICLALLVVFTSAAAIGCILLCIGQNNFYNEGLHTLKY 201
            L   CCG      G  S+    + L  L++FT AA IG  +L  GQN FY     T  Y
Sbjct: 124 CLCCCCCGCGRRNYG-YSRVCYTLSLVFLLLFTIAAVIGSAMLYTGQNEFYGSVERTFMY 183

Query: 202 VVNQSDYTVHTLKNVTEYLSLAKTISV-AQVFLPSDVMNDIDELNVDLNTAADTVADKTS 261
           +V Q+   +  L ++ + +  AK I +      P +   +ID  N  +  +  T  D+ +
Sbjct: 184 IVKQATGVLTKLTSLWDSIQSAKDIQLDGHNLFPPEFRGNIDHFNNMIKMSNITYPDRVA 243

Query: 262 VNS-RKIRKVFTAMRSALITVAALMLLLALIGL----CIFRVYWTCSYLFTHWLELSLQS 321
             + R +      +R  L  +A +ML +A +GL    C  RV                  
Sbjct: 244 NQTIRYLTGALNPVRYVLNVIAGVMLAVAFLGLLFSFCGLRV------------------ 303

Query: 322 CPSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNI 381
                  +  +L+I GW+LVT T +L  +F++  N V+DTCMAM++WV +P A++ALS +
Sbjct: 304 -------LVYLLVILGWILVTATILLSAVFLVFHNVVADTCMAMDQWVHDPAADSALSQL 363

Query: 382 LPCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANP-PPGSPNYCNQSGPSMPALCY 441
           LPC+D KT  +TL  +K +    V + + +  N +N +  PP +P Y NQSGP +P LC 
Sbjct: 364 LPCLDPKTIGETLDITKTMTATAVDMTNAYTVNVSNHDQFPPNAPFYHNQSGPLVPLLCN 423

Query: 442 PYNSQLEESRCGDNDVTIENASTVWQKFVCQVSESGLCITVGRVSPNIHSQMVAAVNESY 501
           P +   +   C  ++V + NAS V++ ++CQV+  G+C T GR++   + QM+ A+N ++
Sbjct: 424 PLDQNHKPRPCAPDEVLLANASQVYKGYICQVNAEGICTTQGRLTQGSYDQMMGAINVAF 483

Query: 502 ALQHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCLLLWIL 561
            L HY P L S  +C FVR+TF +ITT  CP L    + +  GLA +S  ++  L+ W++
Sbjct: 484 TLDHYGPFLASIADCTFVRDTFRDITTKNCPGLSITSQWIYAGLASLSGAVMFSLIFWLI 537

Query: 562 YANHPQREDVSAKLSFSINR 571
           +    +    + K    +NR
Sbjct: 544 FVRERRHRSQTKKSMIQMNR 537

BLAST of ClCG05G022220 vs. TAIR 10
Match: AT5G67550.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: flower; EXPRESSED DURING: 4 anthesis; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G71110.1); Has 161 Blast hits to 154 proteins in 16 species: Archaea - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).)

HSP 1 Score: 80.5 bits (197), Expect = 5.1e-15
Identity = 100/487 (20.53%), Positives = 198/487 (40.66%), Query Frame = 0

Query: 88  RTKRPDILHGFRVYEGGWDIANPNYWASVGFTGATGFILSIFWFISFGIALLIHRCCGWK 147
           R KR D L+ FR Y+GG+++ N +YWA+  FTG  G+ ++       G+ +++  C G  
Sbjct: 36  RFKRRDPLNSFRYYDGGFNVRNKHYWAATAFTGIHGYAVA-------GVLIIVGICLGLY 95

Query: 148 LNLKGEESKTSQ------------WICLALLVVFTSAAAIGCILLCIGQNNFYNEGLHTL 207
           +    +  + S                L LL +F S    G +   I  N         +
Sbjct: 96  VAFSDKRRRVSSTRRRYLDRYYLPLFLLLLLFMFLSVVTTGIV---IAANQRSKNRTEEM 155

Query: 208 KYVVNQSDYTVHTLKNV-TEYLSLAKTISVAQVFLPSDVMNDIDELNVDLNTAADTVADK 267
           K  ++++   V+  +N+ T  +SL K   +  + LP D  N    LNV  +         
Sbjct: 156 KETIDKAGEDVN--QNIRTVIVSLTK---IQYLLLPYD-QNTTHLLNVTTHRLGKGSRLI 215

Query: 268 TSVNSRKIRKVFTAMRSALIT---VAALMLLLALIGLCIFRVYWTCSYLFTHWLELSLQS 327
            S    K R +  A++ + ++   + +  L L L+      ++W   ++           
Sbjct: 216 QSFLHHKGRSIDLAIKISYVSHLMITSTNLFLLLLAFLPLLLHWHPGFI----------- 275

Query: 328 CPSLDINMQSILIISGWLLVTFTFVLCGLFVILDNAVSDTCMAMEEWVDNPHAETALSNI 387
                     ++I   W++ T  +VL G    +     D C A   +V NP   T L+N+
Sbjct: 276 ----------MVIFLCWIITTLCWVLTGFDFFIHTFAEDLCSAFNGFVQNPRNST-LTNL 335

Query: 388 LPCVDHKTTNQTLIQSKKIVNDIVSVVDQFVYNFANANPPPGSPNYCNQSGPSMPALCYP 447
            PC+D   +++TLI+   ++++ ++ ++  V     +N      N  + + P    +C P
Sbjct: 336 FPCMDPLHSDKTLIEISLMIHNFITQLNSKVAESMRSNALTDRSNTVSWA-PESGIICDP 395

Query: 448 YNSQLEES----RCGDNDVTIENASTVWQKFVCQVSE-SGLCITVGRVSPN-IHSQMVAA 507
           +  Q   S     C +  + I     +  +F C   +    C   G+  P   + ++ A 
Sbjct: 396 FVGQQINSYTPQSCSNGAIPIGEFPNILSRFTCHDKDPPETCRITGKFIPEAAYLKVYAY 455

Query: 508 VNESYALQHYTPPLLSFQNCNFVRETFHNITTAYCPHLHRHLKIVNVGLAMISVGILLCL 553
            N +  +    P   +   C  V++T  +I +  C      +  +   +  +S+ +++ +
Sbjct: 456 SNSAQGMLDILPSFQNLTECLAVKDTLSSIVSNQCDPFRASMYRLWASILALSLIMVVLV 483

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038893119.1  1.4e-293  89.59         uncharacterized protein LOC120081994 [Benincasa hispida]
XP_008446693.1  4.3e-290  88.12         PREDICTED: uncharacterized protein LOC103489338 [Cucumis melo]
XP_004135062.1  7.6e-287  87.46         uncharacterized protein LOC101211567 [Cucumis sativus] >KGN52138.1 hypothetical ...
XP_022150603.1  2.1e-265  81.10         uncharacterized protein LOC111018699 [Momordica charantia]
KAG7012983.1    4.3e-258  82.37         hypothetical protein SDJN02_25737, partial [Cucurbita argyrosperma subsp. argyro...

Match Name  E-value   Identity (%)  Description
A0A1S3BGC4  2.1e-290  88.12         uncharacterized protein LOC103489338 OS=Cucumis melo OX=3656 GN=LOC103489338 PE=...
A0A0A0KRA9  3.7e-287  87.46         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G611690 PE=4 SV=1
A0A6J1D9V7  1.0e-265  81.10         uncharacterized protein LOC111018699 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1G0F9  2.1e-258  82.37         uncharacterized protein LOC111449579 OS=Cucurbita moschata OX=3662 GN=LOC1114495...
A0A6J1H021  6.7e-257  80.55         uncharacterized protein LOC111458797 OS=Cucurbita moschata OX=3662 GN=LOC1114587...

Match Name   E-value   Identity (%)  Description
AT1G71110.1  1.6e-165  55.01         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G12400.1  3.6e-101  40.56         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G25270.1  7.2e-94   38.65         unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 18 plant structures;...
AT1G80540.1  1.8e-73   33.60         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G67550.1  5.1e-15   20.53         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                          Source       Source Term     Source Description                  Alignment
None       No IPR available                         MOBIDB_LITE  mobidb-lite     disorder_prediction                 coord: 571..600
None       No IPR available                         PANTHER      PTHR31414:SF16  TRANSMEMBRANE PROTEIN               coord: 20..582
IPR040283  Transmembrane protein DDB_G0292058-like  PANTHER      PTHR31414       TRANSMEMBRANE PROTEIN DDB_G0292058  coord: 20..582

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG05G022220.1  ClCG05G022220.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane