Clc01G18010.1 (mRNA) Watermelon (cordophanus) v2

Overview
NameClc01G18010.1
TypemRNA
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionCys-Gly metallodipeptidase DUG1
LocationClcChr01: 30598516 .. 30604942 (-)
Sequence length2139
RNA-Seq ExpressionClc01G18010.1
SyntenyClc01G18010.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTAATTGCTTCTATCCCCTTTCCAGCCCTTCAAATATTCTCTTTTTTTCTTTTTTAATTTCAGTTTAAGTGATGGCATTATGCATCCGTATAAGAAGCTTATAGGCAGGCTTGTTTAAAGCTAAAGAAGTTGCTTTACTTTTTGGATTCTGAACTTCCAGTAGCTGTTGTCTTTTCCAATGCGATGGAAAAGGAAGAACAACCCAACTCTGCTTCTACCCCTAATCTTGAACACCAACCCAATGGTGTCACAGTATGTTTCTTCTTATTTTTCTGAAATGGGTTTCCTCTTTTTGGTTCGTTTTTGCATTCCTCCCATCCACTTTTGATTGATTTCGTCACCCGAGTCTCTGTTGAAACGCGATGGGGACAGTAAAAGTTTGTGGGGTTTCGCTTTTCTTTTTCGATCTCTTGTGTTTAATTTGTTCTTGTATTTGGGGATTGCATTATGCTAATTACAGATACTGCTGTTGCTAACAGAGCAAGAATGAGAAAAGCGTATCCGATAGGACAGATGAAGCAAAAACCGCCAAATCTGGATGCCAATTTTTGGAGAATGCGGCTCGCCAAAATCAGCAGTACACTGCGCTTCTTCAAAGGGCACTCAACCCTCAACATGCAGGGGAAAGGTCTTCACCGTCAAATGCCCCAGCCGCCGTGAATGAGCGGCTTCAGCTGCCACAGAATCTGGCTAACCTTCAGCACCAGCTCAGCCCGCCTCCACCGCCGCAGCCGCAGCAATTTGTCCTTTCTTCACAACCCTTTTGGGTACAGCCGCAGCCCAACATTTCTTTTGGAGCAACTGAAGGTAGCTGGCACGCCGCCGGAGCCTCGCCGAGATGTCAACCCCAAGCTCCTAATTTCTATTACCCTGTTGGATATCCGACATATCCAGGCTTTCCAGGTGAGTTTCACTTTTATTACTGTTTGGTCAAGTAATTCTGTGGAGATTATTTGATCAATTTGTGGGTGTTTTTATACACTTATCTGTGACTGAATGTGATGTATATTGATCGGTTTGAGTTTTGAAATTTCAAATTCGGGTCTGGTCTGAACTTAATAACAGGTTCCAGGGATGCTTCAATTTGGTGGGGTCAAGCACAACCATTATTGTTTCCTGGATTATCCAATTACCCAAGGGCATCATGTGGTTTTGCCTCTTCTCAATCTTGGCCGATGCCAATTCCTAGTTGTGTAACATCTTCCTCTGGACAACCCCTTTTAAGAGGAGTCATCAAACCCCCTGAAAAGCTTTCTCAGAAGCATCAAAGACTTTGGGAAGCTCAGGTTTGACATGAGATTTCTCTTTAGTTTATGTGATTTTTTCAGATATTAATGTAATGCTTTTTTTGTTCCTGTTTGATGCAGTCTGCGGAAAATGTACAATTGTGGAGTATGATAGGAGAGTTGCAAGGGGAATTAGCAGACTACAAGGGCCGCTTGAGTAAGCTTGAAGCTGAAATTTCATCTTTAAGATCAGCAGCTACGGATGAGCCTGCTGTGGAAGTTGGAAATGGTAGCATTACAGTGAGGGGACAACCAACGAAGCGAGGAAGGTCGAAACGAGCAATAGCCCCAGTTGGTTCACAGCCTCCATTGCAAGCTCGGACACGGGGACGAAAGCGAGCAATTGCAAGGACAAAAGTAGAAGAAGCAAAACCAACTTTTCTTGGAAAAGATAGCTTGAATAAGGTGGATGATAAACATAAAGATTTTACTTCTTTTGACATTACAGAGCAAGAAAAGAATGGTATTTCACCTGCCATCAATCAAAACAATGGGATCATGGAGATTGACGATGGCACTCTCAAGATGCCTGCCTCTGAAGACAATCAAGTTATTCAACAATGCCCTGAAAGTCAATCACGTGGAATTGAATTCAAGCCAGCATCATTATTGAAATCCAATTATGAAGGTAATCAAAGTTCAAAAAACACTTGCTTAATTCTGTCTCAGGATACCAAGGGTTCCTCAACGATCGCGATGTCAGTTATACTAATGCATGTAAAAAAAAAATTTATATGGTCTTTAGGAATCATCTCTCAAGACTCCGAACAAAACGACTTTAGTATAGCGTCCCCATCAATATACACAAATGGAAATGTTAGCAGACAAGGAATAACTAGGTGGAACTTTAAACACGAAGACGAAGCTGCCGAATTGGGATTCCCTGCAGTAGGACACAAAAGGGAGAACGAAGAAATGGCAGATGAATTCAGCTCAGGACCTGAAGAGATTGAAACACAAAATGGCTCCTCGTGGTGTTGAAAGTAAGTTCTAAATGATCAAATGTTCCTGTGTTCTACCTTCAATAACAGGCATTACTGGATAATTCTGCCCCATTTGGCAAGCTACTGCCAATTGATGGAGATGGTTTGACCAAAATTTTGTACAAAGGAAGAATGGATTCAGCTTGCTTAACATGGTACGACTTTGACCCAAATTTCTTATACAAGAAAATCGTGCATGCTCAAAATTTTTCTGTGTTGACTTGTTTTTTTGTTCTGAACTAATATCGAAGATTCAATTTACCACAAAAAAATTGCAATGTCACAAGAGTAATACTGCATGGAGAATAATGGACTGAGGGTGCAATCTCATACATTCTGGACACATGGAAGAAAAAACATTACAACTAAGTTTTGGCACTACATGTGAATATGGCATATCAAATAGTGGGGTCATTTTCGTAAGTACATAAATTCTGGAAAAAGTTGTCTAGTTCTAGTGTCTGTGATCCGGATACGGCGGAAGAATCTTTACTGATTTCATTTTAGGAAAAATTACATTTTTTGTCCTGAATTTTCAAAGATAAGTGCGTTTGGTTGGTTCTATATTTTAAATTAGCTATTTTAGTCCTTAAATTTTACATAATAGATTTAAAAAGTTCCTTTAATTAAAAAAAAAAAAAAAGTTAAATTTAAACATAAAATTGACTATAGAAGTTGATGTGGCAACATGACTTGGCAAAAAAGTTATATTTTTTTCCCCTCCTTTCCCCACTTTTTTTCGCTCCTCTCTCTTCCCTCTTCCTCTCCTCTTTCCCTCTCTTCTTCTCTTCTCCTCCATTACCAAATCAGAGAAACTACAAAAAAATAAAAAAAATAAAAAACCCAAAATGTTGTTGTTCTCTCTCCTCATCCTCCATGGTTGTGAACCCCATTAATCAATTATATTTTTATTCTCAATTTTGAGGAAAATGACAATGAGTTTCTTCTTATTCTCTTCAATTCTGCAATTAATTAATCAAGTCTTAATACCATGTGTGGACTTTGAACCACCGGGGGAAAGCAGCGGCAAACCATGGCGTAGCGATGGTGATAATGTGGAGGAAGATGAGGGAAAGGCCGCAGCGAGAGGGGGAGAAGACGATGGGAGAGGTTTAGTAAGATGAAGTTGAAAGGGGCGCCAAAAGGATGTGGTAGTGGAATCGACTCGTTCAGATTGATGGGGTTCGGACTTATGGAGAAATCCATTGTATTTCTTTTTGAGTTTGATGGTTTCACAACTTGATGGATTTATTTTTTATTTTTTATTTTGGAAGATTTGGTAATGGTGAGAGAACGGAGAGAACAAGGGGAGAGGAGAGAGAGGGAAAGGGAATGAAAAGATTAAACAAAAATATATATATTTTTTTTCTAAGTCATCTTGTTACTTAATATGTTTAGTCAAAATTTTGTTTAAATTTAATGTTTCTTTTTGTGGAGTAATCTTTTAAATCTATTCTTGAAAATTTAAGGACTAAAGTGGCTCATCTGAAATATCAGTGACCAAATGCACTTATTCTTGAAAACTTAGGGATCAAAATGATATTTTTTTCCTTCATTTTACATAGAAGATGACGATGGATAATTTAATCAAAATCTTGAACGAAGGAAACTCCTATGGTTTAAAAATAATTATAAACTTAATTAATGGTGGATGAAATGTTTCGCAATTGGAATCAAACCCTTAATTTGTCTTGCTCTAAAAGTCTTTCATAAGTCATGTAGAACCCTACATTTAGGTCCTACCTTGCTAACAACTCTCCCATGGTTGGAATTCACTCTGGAAAAACTGTGGTTGATTTGGATGGCATCAAACCAAGTAGGGGTTTTAATTGGAATTTGAAAGGAGTTTCCTTTTTAGAAAAAGAAACTTTGAATAGATACCCATATTGTAGAAAGAGAACTTATCTTATCTTTGCACATCTTGAAAAATAAAAAGGAAGCCTTTTTCTTTACCCATTTTGAAAAATGAAATTTTTATTAATACCCATATTGTAAAATAAGAAAAGATGGAGCCTTTCTCTTTCTTAGTTTTCTGAAGTCCCATGGTTCATGAAGAAAATAAGGCATAACCAAGTCCAAAGAATAGAACTATTCTATAGAGAGAAAAATAGAATGTGTGGATTGAAAAAAGTAGAACCAAAGATTCAAAATTTTCCAACTGTGTGGACCGTCAAACGTCAGAGAAAAAAACAGGGAAACCCATTTTTTTTTTTTTTTTGTTGGTTTTAAAAAAGAATCCCTTAACCCAACCCTACCATTCATATCATACTATCTATCATAGTACAGACTTGAGAAGAGTCAAATTATGACTCTCTCACAAGTCATAATTTATGAGAACTTGCCATTCTTATAAATCTTGGCGCTTTTTTCGGTCCAAATCTGTTAGGCAAACTCCATGCCTCTCCTCCAAATCCGGACCTCCAAATCTCTACATTAACCAAAATTGACCCTTTTTTTTTTTTTTTTATTTTTTATTTTACCCTCTCTAATGGCCACCGCCACCGTCAAGCCTCCCCGGCCAAAACTAGCCTGTTTCTCCTTCGCCGCCTATGCCAAAACCGTCATCGACCATCTCAAATCTCTCCAAATTCCCGTCCTTCCTGGCCTCTCAGACCCTGAATTTGCATCCGTCGAGTCCACCTTCCGATTCTCTTTCCCGCCGGACCTCCGTTCAATCCTACGAGAGGGTCTCCCAATTGGGTCTGGTTTCCCCAATTGGCGATCCTCCTCCATTCAACAGCTTAACATTCTGATCAATCTCCCCAAATTCTGCCTCCTCAAGGAAATTTCTCAAAGAAAATTCTGGTGTCAATCTTGGGGGACCCAACCTGATGACGCAAACGACGCCGTTGCTTTGGCCAAGCAATTCTTGGACAGAGCCCCTGTTCTTGTCCCCATTTACAGAAACTGTTACATCCCCTCCGCGCCGAACATGGCCGGAAACCCTGTATTTCACCTCGACGGTGGAGAGATTCGTGTTTCCAGCTTTGACTTGGCTGGATTCTTTCAAACCCATGAATATTCTCAACTGAGCAAGGCTGAATCGGACCGATTGGTGATCGACTCACCAGCTTGGGCCGCGACAGAGGCCCGAGCGGTGGAATTCTGGACGGAAGTGGCTTCGGGAAAGAAAGCGGAGGCGGCGCGTGAGGTAACGGAAGGGTGGTGGAATGAAGGGGAATTTGAAATGGGGTTGGAGGGATGCCTTGAACACGTGTTTTGGAAGTTGCGACAAGGCGGGTGGAGGGATGAAGATGTGAGAGATATGATGATGATGGACGGCCATGATCGGAGCTTAGAACAGAATGGAGCAACGATGGAGAAACAGAGAGTATCAGTATGTGAGATTTTATTGAGTGGGGGGTGGAGCAGGGACGATGTAGTGTACTCTCTTGATCTTGAAGACAAATCCGCCATTGTTATTCCTGAAGAAGAATCAACATTTGAAATCAATCTTCATCGTCATCCGCCGATTAGAATCCCCCGAGTAGAACGCAAGAAAAAGCCTCGTAGTACCACCACCAATCACCTTAAAATGCCCCCTTTTTTCTTTGCACCACATCGAAATTTAATCCTCTAACTACTTTTTGCTTTTATTTTGTTTGTAAATTATACAATGTAAATAAACACACATTCTTTTTTCTTTGAGTTAATAAAAAAGATACGATAAAGGTCCAATTTTGATCCACCTTCCACCGATTCATATTTCAATATTGCATTGTTTGTGTCATTATTATATATTAATATATCACTCTTTCTTTTTGTTGTATTCGAAAAGAAGGGAAAATGGGATGTCATATCATGTGGGCTAGGAAAATGACCTTATGAATGAAGTAATATGTCTTTTACGTTACAGAAATAATTACTTCCTCCCCGTCTATTGATGGGGCGGTGGCGTTCTGAAATGGGCAGCCGGCGTTGACAAATTGGAGAGGGAAATCGGTAACGGCGGTGACTGAGAGTCATCGGAGACCACCAGCTACGTTGACGGAGAAAAAAGGGCCTCAAAGTTGGAGTCGAAAACCGACTTTTGAAATGAAAAAGCCGACGAATCGGTCAAGATGTGGACTCTCTTACGCGTCGGTTTAAAAGGGGTATTAATGACAGTCTATAAAAGGGTTTTAATTAGAGGTGCTTCGAAAGGTAGAAAAAAAAA

mRNA sequence

CTTTAATTGCTTCTATCCCCTTTCCAGCCCTTCAAATATTCTCTTTTTTTCTTTTTTAATTTCAGTTTAAGTGATGGCATTATGCATCCGTATAAGAAGCTTATAGGCAGGCTTGTTTAAAGCTAAAGAAGTTGCTTTACTTTTTGGATTCTGAACTTCCAGTAGCTGTTGTCTTTTCCAATGCGATGGAAAAGGAAGAACAACCCAACTCTGCTTCTACCCCTAATCTTGAACACCAACCCAATGGTGTCACAAGCAAGAATGAGAAAAGCGTATCCGATAGGACAGATGAAGCAAAAACCGCCAAATCTGGATGCCAATTTTTGGAGAATGCGGCTCGCCAAAATCAGCAGTACACTGCGCTTCTTCAAAGGGCACTCAACCCTCAACATGCAGGGGAAAGGTCTTCACCGTCAAATGCCCCAGCCGCCGTGAATGAGCGGCTTCAGCTGCCACAGAATCTGGCTAACCTTCAGCACCAGCTCAGCCCGCCTCCACCGCCGCAGCCGCAGCAATTTGTCCTTTCTTCACAACCCTTTTGGGTACAGCCGCAGCCCAACATTTCTTTTGGAGCAACTGAAGGTAGCTGGCACGCCGCCGGAGCCTCGCCGAGATGTCAACCCCAAGCTCCTAATTTCTATTACCCTGTTGGATATCCGACATATCCAGGCTTTCCAGGTTCCAGGGATGCTTCAATTTGGTGGGGTCAAGCACAACCATTATTGTTTCCTGGATTATCCAATTACCCAAGGGCATCATGTGGTTTTGCCTCTTCTCAATCTTGGCCGATGCCAATTCCTAGTTGTGTAACATCTTCCTCTGGACAACCCCTTTTAAGAGGAGTCATCAAACCCCCTGAAAAGCTTTCTCAGAAGCATCAAAGACTTTGGGAAGCTCAGTCTGCGGAAAATGTACAATTGTGGAGTATGATAGGAGAGTTGCAAGGGGAATTAGCAGACTACAAGGGCCGCTTGAGTAAGCTTGAAGCTGAAATTTCATCTTTAAGATCAGCAGCTACGGATGAGCCTGCTGTGGAAGTTGGAAATGGTAGCATTACAGTGAGGGGACAACCAACGAAGCGAGGAAGGTCGAAACGAGCAATAGCCCCAGTTGGTTCACAGCCTCCATTGCAAGCTCGGACACGGGGACGAAAGCGAGCAATTGCAAGGACAAAAGTAGAAGAAGCAAAACCAACTTTTCTTGGAAAAGATAGCTTGAATAAGGTGGATGATAAACATAAAGATTTTACTTCTTTTGACATTACAGAGCAAGAAAAGAATGGTATTTCACCTGCCATCAATCAAAACAATGGGATCATGGAGATTGACGATGGCACTCTCAAGATGCCTGCCTCTGAAGACAATCAAGTTATTCAACAATGCCCTGAAAGTCAATCACGTGGAATTGAATTCAAGCCAGCATCATTATTGAAATCCAATTATGAAGGAATCATCTCTCAAGACTCCGAACAAAACGACTTTAGTATAGCGTCCCCATCAATATACACAAATGGAAATGTTAGCAGACAAGGAATAACTAGGTGGAACTTTAAACACGAAGACGAAGCTGCCGAATTGGGATTCCCTGCAGTAGGACACAAAAGGGAGAACGAAGAAATGGCAGATGAATTCAGCTCAGGACCTGAAGAGATTGAAACACAAAATGGCTCCTCGTGGTGTTGAAAGTAAGTTCTAAATGATCAAATGTTCCTGTGTTCTACCTTCAATAACAGGCATTACTGGATAATTCTGCCCCATTTGGCAAGCTACTGCCAATTGATGGAGATGGTTTGACCAAAATTTTGTACAAAGGAAGAATGGATTCAGCTTGCTTAACATGAAATAATTACTTCCTCCCCGTCTATTGATGGGGCGGTGGCGTTCTGAAATGGGCAGCCGGCGTTGACAAATTGGAGAGGGAAATCGGTAACGGCGGTGACTGAGAGTCATCGGAGACCACCAGCTACGTTGACGGAGAAAAAAGGGCCTCAAAGTTGGAGTCGAAAACCGACTTTTGAAATGAAAAAGCCGACGAATCGGTCAAGATGTGGACTCTCTTACGCGTCGGTTTAAAAGGGGTATTAATGACAGTCTATAAAAGGGTTTTAATTAGAGGTGCTTCGAAAGGTAGAAAAAAAAA

Coding sequence (CDS)

ATGGAAAAGGAAGAACAACCCAACTCTGCTTCTACCCCTAATCTTGAACACCAACCCAATGGTGTCACAAGCAAGAATGAGAAAAGCGTATCCGATAGGACAGATGAAGCAAAAACCGCCAAATCTGGATGCCAATTTTTGGAGAATGCGGCTCGCCAAAATCAGCAGTACACTGCGCTTCTTCAAAGGGCACTCAACCCTCAACATGCAGGGGAAAGGTCTTCACCGTCAAATGCCCCAGCCGCCGTGAATGAGCGGCTTCAGCTGCCACAGAATCTGGCTAACCTTCAGCACCAGCTCAGCCCGCCTCCACCGCCGCAGCCGCAGCAATTTGTCCTTTCTTCACAACCCTTTTGGGTACAGCCGCAGCCCAACATTTCTTTTGGAGCAACTGAAGGTAGCTGGCACGCCGCCGGAGCCTCGCCGAGATGTCAACCCCAAGCTCCTAATTTCTATTACCCTGTTGGATATCCGACATATCCAGGCTTTCCAGGTTCCAGGGATGCTTCAATTTGGTGGGGTCAAGCACAACCATTATTGTTTCCTGGATTATCCAATTACCCAAGGGCATCATGTGGTTTTGCCTCTTCTCAATCTTGGCCGATGCCAATTCCTAGTTGTGTAACATCTTCCTCTGGACAACCCCTTTTAAGAGGAGTCATCAAACCCCCTGAAAAGCTTTCTCAGAAGCATCAAAGACTTTGGGAAGCTCAGTCTGCGGAAAATGTACAATTGTGGAGTATGATAGGAGAGTTGCAAGGGGAATTAGCAGACTACAAGGGCCGCTTGAGTAAGCTTGAAGCTGAAATTTCATCTTTAAGATCAGCAGCTACGGATGAGCCTGCTGTGGAAGTTGGAAATGGTAGCATTACAGTGAGGGGACAACCAACGAAGCGAGGAAGGTCGAAACGAGCAATAGCCCCAGTTGGTTCACAGCCTCCATTGCAAGCTCGGACACGGGGACGAAAGCGAGCAATTGCAAGGACAAAAGTAGAAGAAGCAAAACCAACTTTTCTTGGAAAAGATAGCTTGAATAAGGTGGATGATAAACATAAAGATTTTACTTCTTTTGACATTACAGAGCAAGAAAAGAATGGTATTTCACCTGCCATCAATCAAAACAATGGGATCATGGAGATTGACGATGGCACTCTCAAGATGCCTGCCTCTGAAGACAATCAAGTTATTCAACAATGCCCTGAAAGTCAATCACGTGGAATTGAATTCAAGCCAGCATCATTATTGAAATCCAATTATGAAGGAATCATCTCTCAAGACTCCGAACAAAACGACTTTAGTATAGCGTCCCCATCAATATACACAAATGGAAATGTTAGCAGACAAGGAATAACTAGGTGGAACTTTAAACACGAAGACGAAGCTGCCGAATTGGGATTCCCTGCAGTAGGACACAAAAGGGAGAACGAAGAAATGGCAGATGAATTCAGCTCAGGACCTGAAGAGATTGAAACACAAAATGGCTCCTCGTGGTGTTGA

Protein sequence

MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTALLQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWVQPQPNISFGATEGSWHAAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQAQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLWEAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQPTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDFTSFDITEQEKNGISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPASLLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGFPAVGHKRENEEMADEFSSGPEEIETQNGSSWC
Homology
BLAST of Clc01G18010.1 vs. NCBI nr
Match: XP_038883396.1 (uncharacterized protein LOC120074371 [Benincasa hispida])

HSP 1 Score: 848.2 bits (2190), Expect = 3.6e-242
Identity = 445/512 (86.91%), Postives = 461/512 (90.04%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTS-KNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTA 60
           MEKEEQP  ASTPNLEHQ NGV S KNEKSVSD TD AK AKSGCQFLENA  QNQQ TA
Sbjct: 1   MEKEEQPKFASTPNLEHQANGVFSGKNEKSVSDGTDAAKNAKSGCQFLENAPLQNQQCTA 60

Query: 61  LLQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFW 120
            LQRALNPQHAGE+ SPS APAAVNERLQLPQNLANLQHQLS  PPPQPQQFV+SSQPFW
Sbjct: 61  FLQRALNPQHAGEK-SPSTAPAAVNERLQLPQNLANLQHQLS--PPPQPQQFVISSQPFW 120

Query: 121 VQPQPNISFGATEGSWHA-----AGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWG 180
           VQPQP+ISFGATEGSW A     AGASPRCQPQAPNFYYPVGYPTY GFPG RDASIWWG
Sbjct: 121 VQPQPSISFGATEGSWQAPVAFGAGASPRCQPQAPNFYYPVGYPTYSGFPGPRDASIWWG 180

Query: 181 QAQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRL 240
           Q QPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRL
Sbjct: 181 QTQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRL 240

Query: 241 WEAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRG 300
           WEAQSAENVQLWS+IGELQGELADYKGRLSKLE EISSLRSAATDEPAVEVGN  ITVRG
Sbjct: 241 WEAQSAENVQLWSLIGELQGELADYKGRLSKLEVEISSLRSAATDEPAVEVGNDGITVRG 300

Query: 301 QPTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDF 360
           QP KRGRSKRAIAPVGSQPPLQ RTRGRK A ARTKVEEAKPTFLGKDSLNKV+DKHKDF
Sbjct: 301 QPAKRGRSKRAIAPVGSQPPLQPRTRGRKPAFARTKVEEAKPTFLGKDSLNKVNDKHKDF 360

Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
           TS DITEQ+KN GIS  INQNNG MEI++GTLKMPA  DNQV+QQCPE QS GIEFKP+S
Sbjct: 361 TSLDITEQDKNEGISATINQNNGSMEINEGTLKMPAPLDNQVLQQCPEIQSCGIEFKPSS 420

Query: 421 LLKSNYE-------GIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF 480
           LLKSNYE       GIIS+DSEQN+FSIASP+IYTNGNVSRQGI RWNFKHEDEAAELGF
Sbjct: 421 LLKSNYEENFIWSLGIISEDSEQNNFSIASPTIYTNGNVSRQGIARWNFKHEDEAAELGF 480

Query: 481 PAVGHKRENEEMADEFSSGPEEIETQNGSSWC 499
           PAV HK+E+EEM DEFSSGPEEIET+NGSSWC
Sbjct: 481 PAVEHKKEDEEMVDEFSSGPEEIETKNGSSWC 509

BLAST of Clc01G18010.1 vs. NCBI nr
Match: XP_031742364.1 (uncharacterized protein LOC101204298 isoform X2 [Cucumis sativus] >KGN48653.1 hypothetical protein Csa_003715 [Cucumis sativus])

HSP 1 Score: 777.3 bits (2006), Expect = 7.8e-221
Identity = 414/506 (81.82%), Postives = 436/506 (86.17%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKEEQP   STP+LEHQ NG++SKNEKSVSD TD AK AKSG QFLEN A  NQ YTAL
Sbjct: 1   MEKEEQPEFCSTPDLEHQANGISSKNEKSVSDGTDAAKKAKSGSQFLENGAPHNQHYTAL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQRA  PQHA + SSP+ APAAVNERLQLPQN ANL HQLS   PPQPQQFVLSSQPFWV
Sbjct: 61  LQRAHYPQHAEKPSSPT-APAAVNERLQLPQNAANLPHQLS--QPPQPQQFVLSSQPFWV 120

Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQP+ISFGATEGSW      +AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPVAISAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
            QP+LFPGLSNYPRASCGF SSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQ+LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQKLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRSAAT+EPAVEVGN  I +RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSAATNEPAVEVGNDDIILRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
           P KRGRSKRA APVGSQPPLQ RTR RK A+ARTKVEEAK T LGKDSLNK DD KHK F
Sbjct: 301 PAKRGRSKRATAPVGSQPPLQPRTRVRKPAVARTKVEEAKQTLLGKDSLNKADDNKHKYF 360

Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
           TS DIT+Q+KN  IS +INQNNGI+EIDD TLKMP S D QV++QC E    GIEFKP S
Sbjct: 361 TSLDITKQDKNEDISASINQNNGIVEIDDDTLKMPVSLDTQVLEQCSEIHPCGIEFKPPS 420

Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
           +LKSNYEGIIS+DSE NDFSIASP+IYTNGNV+RQGITRWNFK E   AELGF PAV HK
Sbjct: 421 VLKSNYEGIISKDSEPNDFSIASPTIYTNGNVTRQGITRWNFKLEGGTAELGFPPAVVHK 480

Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
             NEEMADEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEEMADEFSSGPEEIETQNGSSWC 503

BLAST of Clc01G18010.1 vs. NCBI nr
Match: XP_031742362.1 (uncharacterized protein LOC101204298 isoform X1 [Cucumis sativus] >XP_031742363.1 uncharacterized protein LOC101204298 isoform X1 [Cucumis sativus])

HSP 1 Score: 770.4 bits (1988), Expect = 9.5e-219
Identity = 414/513 (80.70%), Postives = 436/513 (84.99%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKEEQP   STP+LEHQ NG++SKNEKSVSD TD AK AKSG QFLEN A  NQ YTAL
Sbjct: 1   MEKEEQPEFCSTPDLEHQANGISSKNEKSVSDGTDAAKKAKSGSQFLENGAPHNQHYTAL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQRA  PQHA + SSP+ APAAVNERLQLPQN ANL HQLS   PPQPQQFVLSSQPFWV
Sbjct: 61  LQRAHYPQHAEKPSSPT-APAAVNERLQLPQNAANLPHQLS--QPPQPQQFVLSSQPFWV 120

Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQP+ISFGATEGSW      +AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPVAISAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
            QP+LFPGLSNYPRASCGF SSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQ+LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQKLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRSAAT+EPAVEVGN  I +RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSAATNEPAVEVGNDDIILRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
           P KRGRSKRA APVGSQPPLQ RTR RK A+ARTKVEEAK T LGKDSLNK DD KHK F
Sbjct: 301 PAKRGRSKRATAPVGSQPPLQPRTRVRKPAVARTKVEEAKQTLLGKDSLNKADDNKHKYF 360

Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
           TS DIT+Q+KN  IS +INQNNGI+EIDD TLKMP S D QV++QC E    GIEFKP S
Sbjct: 361 TSLDITKQDKNEDISASINQNNGIVEIDDDTLKMPVSLDTQVLEQCSEIHPCGIEFKPPS 420

Query: 421 LLKSNYE-------GIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF 480
           +LKSNYE       GIIS+DSE NDFSIASP+IYTNGNV+RQGITRWNFK E   AELGF
Sbjct: 421 VLKSNYEENFIWSLGIISKDSEPNDFSIASPTIYTNGNVTRQGITRWNFKLEGGTAELGF 480

Query: 481 -PAVGHKRENEEMADEFSSGPEEIETQNGSSWC 499
            PAV HK  NEEMADEFSSGPEEIETQNGSSWC
Sbjct: 481 PPAVVHKTGNEEMADEFSSGPEEIETQNGSSWC 510

BLAST of Clc01G18010.1 vs. NCBI nr
Match: XP_016899165.1 (PREDICTED: uncharacterized protein LOC103484887 [Cucumis melo] >XP_016899166.1 PREDICTED: uncharacterized protein LOC103484887 [Cucumis melo] >KAA0036381.1 Cys-Gly metallodipeptidase DUG1 [Cucumis melo var. makuwa] >TYK12777.1 Cys-Gly metallodipeptidase DUG1 [Cucumis melo var. makuwa])

HSP 1 Score: 765.8 bits (1976), Expect = 2.4e-217
Identity = 404/506 (79.84%), Postives = 434/506 (85.77%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKEEQP  +STPNLEHQ NGV+SKNEKSVSD TD AK AKSGCQ LEN +  NQ YTAL
Sbjct: 1   MEKEEQPKFSSTPNLEHQANGVSSKNEKSVSDGTDAAKNAKSGCQLLENESPHNQHYTAL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQ A  P+HA E+ SPS AP+AVNER QLPQ+ ANLQHQLS   PPQPQQFVLSSQPFW+
Sbjct: 61  LQSAHYPEHA-EKPSPSTAPSAVNERHQLPQSPANLQHQLS--QPPQPQQFVLSSQPFWI 120

Query: 121 QPQPNISFGATEGSWHA-----AGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQP+ISFGATEGSW +     AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPAAFGAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
            QP+LFPGLSNYPRASCGF SSQSWPMPIPSC TSSSGQPLLRGVIKPPEKLSQKH++LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCATSSSGQPLLRGVIKPPEKLSQKHKKLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRS+AT+EPAVEVGNG IT+RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSSATNEPAVEVGNGDITLRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
           PTKRGR KR  APVGSQ PLQ  TR RK A+ RTKVE+AK T LGKDSLNK DD KHK F
Sbjct: 301 PTKRGRLKRGTAPVGSQSPLQPHTRVRKPAVGRTKVEDAKQTLLGKDSLNKADDNKHKYF 360

Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
           TS DIT+Q+KN   S  INQNNGI+EIDD TLKMPAS DNQV++QC E QS GIEFKP S
Sbjct: 361 TSLDITKQDKNEDSSTTINQNNGIVEIDDDTLKMPASLDNQVLEQCSEIQSCGIEFKPPS 420

Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
           +LKSNYEGIIS+DSE+NDF IAS +IYTNGNV+RQGI+RWNFK  DEAAELGF P V HK
Sbjct: 421 VLKSNYEGIISEDSERNDFRIASSTIYTNGNVTRQGISRWNFKLVDEAAELGFPPPVVHK 480

Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
             NE+M DEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEDMTDEFSSGPEEIETQNGSSWC 503

BLAST of Clc01G18010.1 vs. NCBI nr
Match: KAG7034256.1 (hypothetical protein SDJN02_03983, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 735.3 bits (1897), Expect = 3.4e-208
Identity = 395/504 (78.37%), Postives = 420/504 (83.33%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKE++   AST  L+H PNGV+ KNEKSV D TD AK  KSGCQFLENAA QNQQYT L
Sbjct: 1   MEKEDELKFASTAKLQHHPNGVSRKNEKSVFDGTDSAKNVKSGCQFLENAAPQNQQYTEL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQRALNP+HAGE+SS   APAAVNERLQ P+NL  LQHQL PPP PQPQQFVLSSQPFWV
Sbjct: 61  LQRALNPRHAGEKSSLPAAPAAVNERLQPPENLPKLQHQLIPPPQPQPQQFVLSSQPFWV 120

Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQP+IS GATEGSW        GASPRCQPQAPNF YPVGYPTYPGF GS DASIWWGQ
Sbjct: 121 QPQPSISLGATEGSWQTPAAFGTGASPRCQPQAPNFCYPVGYPTYPGFQGSWDASIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
             PLLFPGLSNYPRAS GFASSQS PMPIP+CVT SSGQPLLRGVIKPPE+LSQKHQRLW
Sbjct: 181 TPPLLFPGLSNYPRASYGFASSQSCPMPIPNCVTFSSGQPLLRGVIKPPERLSQKHQRLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIG+ QGELAD KGRL KLEAEISSLRS AT+EPAVEVGNG ITVRGQ
Sbjct: 241 EAQSAENVQLWSMIGQFQGELADCKGRLIKLEAEISSLRSVATNEPAVEVGNGGITVRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDFT 360
           P+KRGRSKRAIAPVGS    Q+RTR RK A+  TKV E KPT LGKDSLNKVDD HKDFT
Sbjct: 301 PSKRGRSKRAIAPVGS----QSRTRARKPAVGGTKV-EVKPTLLGKDSLNKVDDTHKDFT 360

Query: 361 SFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPASL 420
             DITEQ+KN GIS  I    GIMEID+GTLKMP S  NQ +QQ P+ QS GIEFK  S 
Sbjct: 361 PLDITEQDKNEGISATI----GIMEIDEGTLKMPISFGNQDLQQFPDIQSCGIEFKSPSS 420

Query: 421 LKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGFPAVGHKRE 480
           LKSNYEGII  DS+ ND SIASP+IYTNGNVSRQGITRWNF+ E EAAE GFP VG+K+E
Sbjct: 421 LKSNYEGIICGDSKLNDLSIASPTIYTNGNVSRQGITRWNFEDEVEAAESGFPVVGNKKE 480

Query: 481 NEEMADEFSSGPEEIETQNGSSWC 499
           N+EMADEFSSG EEIETQNGSSWC
Sbjct: 481 NKEMADEFSSGAEEIETQNGSSWC 495

BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match: A0A0A0KGK4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G497040 PE=4 SV=1)

HSP 1 Score: 777.3 bits (2006), Expect = 3.8e-221
Identity = 414/506 (81.82%), Postives = 436/506 (86.17%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKEEQP   STP+LEHQ NG++SKNEKSVSD TD AK AKSG QFLEN A  NQ YTAL
Sbjct: 1   MEKEEQPEFCSTPDLEHQANGISSKNEKSVSDGTDAAKKAKSGSQFLENGAPHNQHYTAL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQRA  PQHA + SSP+ APAAVNERLQLPQN ANL HQLS   PPQPQQFVLSSQPFWV
Sbjct: 61  LQRAHYPQHAEKPSSPT-APAAVNERLQLPQNAANLPHQLS--QPPQPQQFVLSSQPFWV 120

Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQP+ISFGATEGSW      +AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPVAISAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
            QP+LFPGLSNYPRASCGF SSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQ+LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQKLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRSAAT+EPAVEVGN  I +RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSAATNEPAVEVGNDDIILRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
           P KRGRSKRA APVGSQPPLQ RTR RK A+ARTKVEEAK T LGKDSLNK DD KHK F
Sbjct: 301 PAKRGRSKRATAPVGSQPPLQPRTRVRKPAVARTKVEEAKQTLLGKDSLNKADDNKHKYF 360

Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
           TS DIT+Q+KN  IS +INQNNGI+EIDD TLKMP S D QV++QC E    GIEFKP S
Sbjct: 361 TSLDITKQDKNEDISASINQNNGIVEIDDDTLKMPVSLDTQVLEQCSEIHPCGIEFKPPS 420

Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
           +LKSNYEGIIS+DSE NDFSIASP+IYTNGNV+RQGITRWNFK E   AELGF PAV HK
Sbjct: 421 VLKSNYEGIISKDSEPNDFSIASPTIYTNGNVTRQGITRWNFKLEGGTAELGFPPAVVHK 480

Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
             NEEMADEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEEMADEFSSGPEEIETQNGSSWC 503

BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match: A0A1S4DT48 (uncharacterized protein LOC103484887 OS=Cucumis melo OX=3656 GN=LOC103484887 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 1.1e-217
Identity = 404/506 (79.84%), Postives = 434/506 (85.77%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKEEQP  +STPNLEHQ NGV+SKNEKSVSD TD AK AKSGCQ LEN +  NQ YTAL
Sbjct: 1   MEKEEQPKFSSTPNLEHQANGVSSKNEKSVSDGTDAAKNAKSGCQLLENESPHNQHYTAL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQ A  P+HA E+ SPS AP+AVNER QLPQ+ ANLQHQLS   PPQPQQFVLSSQPFW+
Sbjct: 61  LQSAHYPEHA-EKPSPSTAPSAVNERHQLPQSPANLQHQLS--QPPQPQQFVLSSQPFWI 120

Query: 121 QPQPNISFGATEGSWHA-----AGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQP+ISFGATEGSW +     AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPAAFGAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
            QP+LFPGLSNYPRASCGF SSQSWPMPIPSC TSSSGQPLLRGVIKPPEKLSQKH++LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCATSSSGQPLLRGVIKPPEKLSQKHKKLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRS+AT+EPAVEVGNG IT+RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSSATNEPAVEVGNGDITLRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
           PTKRGR KR  APVGSQ PLQ  TR RK A+ RTKVE+AK T LGKDSLNK DD KHK F
Sbjct: 301 PTKRGRLKRGTAPVGSQSPLQPHTRVRKPAVGRTKVEDAKQTLLGKDSLNKADDNKHKYF 360

Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
           TS DIT+Q+KN   S  INQNNGI+EIDD TLKMPAS DNQV++QC E QS GIEFKP S
Sbjct: 361 TSLDITKQDKNEDSSTTINQNNGIVEIDDDTLKMPASLDNQVLEQCSEIQSCGIEFKPPS 420

Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
           +LKSNYEGIIS+DSE+NDF IAS +IYTNGNV+RQGI+RWNFK  DEAAELGF P V HK
Sbjct: 421 VLKSNYEGIISEDSERNDFRIASSTIYTNGNVTRQGISRWNFKLVDEAAELGFPPPVVHK 480

Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
             NE+M DEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEDMTDEFSSGPEEIETQNGSSWC 503

BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match: A0A5D3CLF7 (Cys-Gly metallodipeptidase DUG1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G003660 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 1.1e-217
Identity = 404/506 (79.84%), Postives = 434/506 (85.77%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKEEQP  +STPNLEHQ NGV+SKNEKSVSD TD AK AKSGCQ LEN +  NQ YTAL
Sbjct: 1   MEKEEQPKFSSTPNLEHQANGVSSKNEKSVSDGTDAAKNAKSGCQLLENESPHNQHYTAL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQ A  P+HA E+ SPS AP+AVNER QLPQ+ ANLQHQLS   PPQPQQFVLSSQPFW+
Sbjct: 61  LQSAHYPEHA-EKPSPSTAPSAVNERHQLPQSPANLQHQLS--QPPQPQQFVLSSQPFWI 120

Query: 121 QPQPNISFGATEGSWHA-----AGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQP+ISFGATEGSW +     AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPAAFGAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
            QP+LFPGLSNYPRASCGF SSQSWPMPIPSC TSSSGQPLLRGVIKPPEKLSQKH++LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCATSSSGQPLLRGVIKPPEKLSQKHKKLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRS+AT+EPAVEVGNG IT+RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSSATNEPAVEVGNGDITLRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
           PTKRGR KR  APVGSQ PLQ  TR RK A+ RTKVE+AK T LGKDSLNK DD KHK F
Sbjct: 301 PTKRGRLKRGTAPVGSQSPLQPHTRVRKPAVGRTKVEDAKQTLLGKDSLNKADDNKHKYF 360

Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
           TS DIT+Q+KN   S  INQNNGI+EIDD TLKMPAS DNQV++QC E QS GIEFKP S
Sbjct: 361 TSLDITKQDKNEDSSTTINQNNGIVEIDDDTLKMPASLDNQVLEQCSEIQSCGIEFKPPS 420

Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
           +LKSNYEGIIS+DSE+NDF IAS +IYTNGNV+RQGI+RWNFK  DEAAELGF P V HK
Sbjct: 421 VLKSNYEGIISEDSERNDFRIASSTIYTNGNVTRQGISRWNFKLVDEAAELGFPPPVVHK 480

Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
             NE+M DEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEDMTDEFSSGPEEIETQNGSSWC 503

BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match: A0A6J1IVR4 (uncharacterized protein LOC111478840 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478840 PE=4 SV=1)

HSP 1 Score: 732.6 bits (1890), Expect = 1.1e-207
Identity = 394/504 (78.17%), Postives = 418/504 (82.94%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKE++   AST NL+H PNGV+ KNEKSV D TD AK AKSGCQFLENAA QNQQYT L
Sbjct: 1   MEKEDELKCASTANLQHHPNGVSRKNEKSVFDGTDSAKNAKSGCQFLENAAPQNQQYTEL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQRALNP+HAGE+SS   APAAVNERLQ P+NL   QHQL PPP PQPQQFVLSSQPFWV
Sbjct: 61  LQRALNPRHAGEKSSLPAAPAAVNERLQPPENLPKFQHQLIPPPQPQPQQFVLSSQPFWV 120

Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQ +IS GATEGSW       AGASPRCQPQAPNF YPVGYPTYPGF GS DASIWWGQ
Sbjct: 121 QPQSSISLGATEGSWQTPAAFGAGASPRCQPQAPNFCYPVGYPTYPGFQGSWDASIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
             PLLFPGLSNYPRAS GFASSQS PMPIPSCV SSSGQPLLRGVIKPPE+LSQKHQRLW
Sbjct: 181 TPPLLFPGLSNYPRASYGFASSQSCPMPIPSCVASSSGQPLLRGVIKPPERLSQKHQRLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIG+LQ ELAD KGRL KLEAEISSLRS ATDE AVEVGNG ITVRGQ
Sbjct: 241 EAQSAENVQLWSMIGQLQVELADCKGRLIKLEAEISSLRSVATDEAAVEVGNGGITVRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDFT 360
           P KRGRSKRAIAPVGS    Q+RTR RK  +  TKV E KPT LGKDSLNKVDD H+DFT
Sbjct: 301 PAKRGRSKRAIAPVGS----QSRTRARKPTVGGTKVGEVKPTLLGKDSLNKVDDTHEDFT 360

Query: 361 SFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPASL 420
             DITEQ+KN GIS  I    GIMEID+GTLK+P S  NQ +QQ P+ QS GIEFK  S 
Sbjct: 361 PLDITEQDKNEGISATI----GIMEIDEGTLKVPISFVNQDLQQFPDIQSCGIEFKSPSS 420

Query: 421 LKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGFPAVGHKRE 480
           LKSNYEGII  DS+ ND SIASP+IYTNGNVSRQGITRWNF+ E EAAE GFP VG+K+E
Sbjct: 421 LKSNYEGIICGDSKLNDLSIASPTIYTNGNVSRQGITRWNFEDEVEAAESGFPIVGNKKE 480

Query: 481 NEEMADEFSSGPEEIETQNGSSWC 499
           N+EMADEFSSG EEIETQNGSSWC
Sbjct: 481 NKEMADEFSSGAEEIETQNGSSWC 496

BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match: A0A6J1GDK6 (uncharacterized protein LOC111453031 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453031 PE=4 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 3.4e-206
Identity = 392/504 (77.78%), Postives = 418/504 (82.94%), Query Frame = 0

Query: 1   MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
           MEKE++   AST NL+H PNGV+ KNEKSV D TD AK  KSGCQFLENAA QNQQYT L
Sbjct: 1   MEKEDELKFASTANLQHHPNGVSRKNEKSVFDGTDSAKNVKSGCQFLENAAPQNQQYTEL 60

Query: 61  LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
           LQRALNP+HAGE+SS   APAAVNERLQ P+NL  LQHQL     PQPQQFVLSSQPFWV
Sbjct: 61  LQRALNPRHAGEKSSLPAAPAAVNERLQPPENLPKLQHQLI----PQPQQFVLSSQPFWV 120

Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
           QPQP+IS GATEGSW       AGASPRCQPQAPNF YPVGYPTYPGF GS DASIWWGQ
Sbjct: 121 QPQPSISLGATEGSWQTPAAFGAGASPRCQPQAPNFCYPVGYPTYPGFQGSWDASIWWGQ 180

Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
             PLLFPGLSNYPRAS G ASSQS PMPIP+CVTSSSGQPLLRGVIKPPE+LSQKHQRLW
Sbjct: 181 TPPLLFPGLSNYPRASYGLASSQSCPMPIPNCVTSSSGQPLLRGVIKPPERLSQKHQRLW 240

Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
           EAQSAENVQLWSMIG+LQGELAD KGRL KLEAEIS LRS AT+EPAVEVGNG ITVRGQ
Sbjct: 241 EAQSAENVQLWSMIGQLQGELADCKGRLIKLEAEISPLRSVATNEPAVEVGNGGITVRGQ 300

Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDFT 360
           P+KRGRSKRAIAPVGS    Q+RTR RK A+  TKV E KPT LGKDSLNKVDD HK+FT
Sbjct: 301 PSKRGRSKRAIAPVGS----QSRTRARKPAVGGTKVGEVKPTLLGKDSLNKVDDTHKNFT 360

Query: 361 SFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPASL 420
             DITEQ+KN GIS  I    GIMEID+GTLKMP S  NQ +QQ P+ QS GIEFK  S 
Sbjct: 361 PLDITEQDKNEGISTTI----GIMEIDEGTLKMPISFGNQDLQQFPDIQSCGIEFKSPSS 420

Query: 421 LKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGFPAVGHKRE 480
           LKSNYEGII  DS+ ND SIASP+IYTNGNVSRQGITRWNF+ E EAAE GFP VG+K+E
Sbjct: 421 LKSNYEGIICGDSKLNDLSIASPTIYTNGNVSRQGITRWNFEDEVEAAESGFPVVGNKKE 480

Query: 481 NEEMADEFSSGPEEIETQNGSSWC 499
           N+EMADEFSSG EEIETQNG SWC
Sbjct: 481 NKEMADEFSSGAEEIETQNGPSWC 492

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038883396.13.6e-24286.91uncharacterized protein LOC120074371 [Benincasa hispida][more]
XP_031742364.17.8e-22181.82uncharacterized protein LOC101204298 isoform X2 [Cucumis sativus] >KGN48653.1 hy... [more]
XP_031742362.19.5e-21980.70uncharacterized protein LOC101204298 isoform X1 [Cucumis sativus] >XP_031742363.... [more]
XP_016899165.12.4e-21779.84PREDICTED: uncharacterized protein LOC103484887 [Cucumis melo] >XP_016899166.1 P... [more]
KAG7034256.13.4e-20878.37hypothetical protein SDJN02_03983, partial [Cucurbita argyrosperma subsp. argyro... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KGK43.8e-22181.82Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G497040 PE=4 SV=1[more]
A0A1S4DT481.1e-21779.84uncharacterized protein LOC103484887 OS=Cucumis melo OX=3656 GN=LOC103484887 PE=... [more]
A0A5D3CLF71.1e-21779.84Cys-Gly metallodipeptidase DUG1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_... [more]
A0A6J1IVR41.1e-20778.17uncharacterized protein LOC111478840 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1GDK63.4e-20677.78uncharacterized protein LOC111453031 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 249..276
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 7..27
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 46..81
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..81
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 287..323
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 468..498

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Clc01G18010Clc01G18010gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Clc01G18010.1-exonClc01G18010.1-exon-ClcChr01:30598516..30598815exon
Clc01G18010.1-exonClc01G18010.1-exon-ClcChr01:30602517..30602909exon
Clc01G18010.1-exonClc01G18010.1-exon-ClcChr01:30603027..30603573exon
Clc01G18010.1-exonClc01G18010.1-exon-ClcChr01:30603655..30603875exon
Clc01G18010.1-exonClc01G18010.1-exon-ClcChr01:30604038..30604461exon
Clc01G18010.1-exonClc01G18010.1-exon-ClcChr01:30604689..30604942exon


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Clc01G18010.1-three_prime_utrClc01G18010.1-three_prime_utr-ClcChr01:30598516..30598815three_prime_UTR
Clc01G18010.1-three_prime_utrClc01G18010.1-three_prime_utr-ClcChr01:30602517..30602673three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Clc01G18010.1-cdsClc01G18010.1-cds-ClcChr01:30602674..30602909CDS
Clc01G18010.1-cdsClc01G18010.1-cds-ClcChr01:30603027..30603573CDS
Clc01G18010.1-cdsClc01G18010.1-cds-ClcChr01:30603655..30603875CDS
Clc01G18010.1-cdsClc01G18010.1-cds-ClcChr01:30604038..30604461CDS
Clc01G18010.1-cdsClc01G18010.1-cds-ClcChr01:30604689..30604757CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Clc01G18010.1-five_prime_utrClc01G18010.1-five_prime_utr-ClcChr01:30604758..30604942five_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Clc01G18010.1Clc01G18010.1-proteinpolypeptide