Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTAATTGCTTCTATCCCCTTTCCAGCCCTTCAAATATTCTCTTTTTTTCTTTTTTAATTTCAGTTTAAGTGATGGCATTATGCATCCGTATAAGAAGCTTATAGGCAGGCTTGTTTAAAGCTAAAGAAGTTGCTTTACTTTTTGGATTCTGAACTTCCAGTAGCTGTTGTCTTTTCCAATGCGATGGAAAAGGAAGAACAACCCAACTCTGCTTCTACCCCTAATCTTGAACACCAACCCAATGGTGTCACAGTATGTTTCTTCTTATTTTTCTGAAATGGGTTTCCTCTTTTTGGTTCGTTTTTGCATTCCTCCCATCCACTTTTGATTGATTTCGTCACCCGAGTCTCTGTTGAAACGCGATGGGGACAGTAAAAGTTTGTGGGGTTTCGCTTTTCTTTTTCGATCTCTTGTGTTTAATTTGTTCTTGTATTTGGGGATTGCATTATGCTAATTACAGATACTGCTGTTGCTAACAGAGCAAGAATGAGAAAAGCGTATCCGATAGGACAGATGAAGCAAAAACCGCCAAATCTGGATGCCAATTTTTGGAGAATGCGGCTCGCCAAAATCAGCAGTACACTGCGCTTCTTCAAAGGGCACTCAACCCTCAACATGCAGGGGAAAGGTCTTCACCGTCAAATGCCCCAGCCGCCGTGAATGAGCGGCTTCAGCTGCCACAGAATCTGGCTAACCTTCAGCACCAGCTCAGCCCGCCTCCACCGCCGCAGCCGCAGCAATTTGTCCTTTCTTCACAACCCTTTTGGGTACAGCCGCAGCCCAACATTTCTTTTGGAGCAACTGAAGGTAGCTGGCACGCCGCCGGAGCCTCGCCGAGATGTCAACCCCAAGCTCCTAATTTCTATTACCCTGTTGGATATCCGACATATCCAGGCTTTCCAGGTGAGTTTCACTTTTATTACTGTTTGGTCAAGTAATTCTGTGGAGATTATTTGATCAATTTGTGGGTGTTTTTATACACTTATCTGTGACTGAATGTGATGTATATTGATCGGTTTGAGTTTTGAAATTTCAAATTCGGGTCTGGTCTGAACTTAATAACAGGTTCCAGGGATGCTTCAATTTGGTGGGGTCAAGCACAACCATTATTGTTTCCTGGATTATCCAATTACCCAAGGGCATCATGTGGTTTTGCCTCTTCTCAATCTTGGCCGATGCCAATTCCTAGTTGTGTAACATCTTCCTCTGGACAACCCCTTTTAAGAGGAGTCATCAAACCCCCTGAAAAGCTTTCTCAGAAGCATCAAAGACTTTGGGAAGCTCAGGTTTGACATGAGATTTCTCTTTAGTTTATGTGATTTTTTCAGATATTAATGTAATGCTTTTTTTGTTCCTGTTTGATGCAGTCTGCGGAAAATGTACAATTGTGGAGTATGATAGGAGAGTTGCAAGGGGAATTAGCAGACTACAAGGGCCGCTTGAGTAAGCTTGAAGCTGAAATTTCATCTTTAAGATCAGCAGCTACGGATGAGCCTGCTGTGGAAGTTGGAAATGGTAGCATTACAGTGAGGGGACAACCAACGAAGCGAGGAAGGTCGAAACGAGCAATAGCCCCAGTTGGTTCACAGCCTCCATTGCAAGCTCGGACACGGGGACGAAAGCGAGCAATTGCAAGGACAAAAGTAGAAGAAGCAAAACCAACTTTTCTTGGAAAAGATAGCTTGAATAAGGTGGATGATAAACATAAAGATTTTACTTCTTTTGACATTACAGAGCAAGAAAAGAATGGTATTTCACCTGCCATCAATCAAAACAATGGGATCATGGAGATTGACGATGGCACTCTCAAGATGCCTGCCTCTGAAGACAATCAAGTTATTCAACAATGCCCTGAAAGTCAATCACGTGGAATTGAATTCAAGCCAGCATCATTATTGAAATCCAATTATGAAGGTAATCAAAGTTCAAAAAACACTTGCTTAATTCTGTCTCAGGATACCAAGGGTTCCTCAACGATCGCGATGTCAGTTATACTAATGCATGTAAAAAAAAAATTTATATGGTCTTTAGGAATCATCTCTCAAGACTCCGAACAAAACGACTTTAGTATAGCGTCCCCATCAATATACACAAATGGAAATGTTAGCAGACAAGGAATAACTAGGTGGAACTTTAAACACGAAGACGAAGCTGCCGAATTGGGATTCCCTGCAGTAGGACACAAAAGGGAGAACGAAGAAATGGCAGATGAATTCAGCTCAGGACCTGAAGAGATTGAAACACAAAATGGCTCCTCGTGGTGTTGAAAGTAAGTTCTAAATGATCAAATGTTCCTGTGTTCTACCTTCAATAACAGGCATTACTGGATAATTCTGCCCCATTTGGCAAGCTACTGCCAATTGATGGAGATGGTTTGACCAAAATTTTGTACAAAGGAAGAATGGATTCAGCTTGCTTAACATGGTACGACTTTGACCCAAATTTCTTATACAAGAAAATCGTGCATGCTCAAAATTTTTCTGTGTTGACTTGTTTTTTTGTTCTGAACTAATATCGAAGATTCAATTTACCACAAAAAAATTGCAATGTCACAAGAGTAATACTGCATGGAGAATAATGGACTGAGGGTGCAATCTCATACATTCTGGACACATGGAAGAAAAAACATTACAACTAAGTTTTGGCACTACATGTGAATATGGCATATCAAATAGTGGGGTCATTTTCGTAAGTACATAAATTCTGGAAAAAGTTGTCTAGTTCTAGTGTCTGTGATCCGGATACGGCGGAAGAATCTTTACTGATTTCATTTTAGGAAAAATTACATTTTTTGTCCTGAATTTTCAAAGATAAGTGCGTTTGGTTGGTTCTATATTTTAAATTAGCTATTTTAGTCCTTAAATTTTACATAATAGATTTAAAAAGTTCCTTTAATTAAAAAAAAAAAAAAAGTTAAATTTAAACATAAAATTGACTATAGAAGTTGATGTGGCAACATGACTTGGCAAAAAAGTTATATTTTTTTCCCCTCCTTTCCCCACTTTTTTTCGCTCCTCTCTCTTCCCTCTTCCTCTCCTCTTTCCCTCTCTTCTTCTCTTCTCCTCCATTACCAAATCAGAGAAACTACAAAAAAATAAAAAAAATAAAAAACCCAAAATGTTGTTGTTCTCTCTCCTCATCCTCCATGGTTGTGAACCCCATTAATCAATTATATTTTTATTCTCAATTTTGAGGAAAATGACAATGAGTTTCTTCTTATTCTCTTCAATTCTGCAATTAATTAATCAAGTCTTAATACCATGTGTGGACTTTGAACCACCGGGGGAAAGCAGCGGCAAACCATGGCGTAGCGATGGTGATAATGTGGAGGAAGATGAGGGAAAGGCCGCAGCGAGAGGGGGAGAAGACGATGGGAGAGGTTTAGTAAGATGAAGTTGAAAGGGGCGCCAAAAGGATGTGGTAGTGGAATCGACTCGTTCAGATTGATGGGGTTCGGACTTATGGAGAAATCCATTGTATTTCTTTTTGAGTTTGATGGTTTCACAACTTGATGGATTTATTTTTTATTTTTTATTTTGGAAGATTTGGTAATGGTGAGAGAACGGAGAGAACAAGGGGAGAGGAGAGAGAGGGAAAGGGAATGAAAAGATTAAACAAAAATATATATATTTTTTTTCTAAGTCATCTTGTTACTTAATATGTTTAGTCAAAATTTTGTTTAAATTTAATGTTTCTTTTTGTGGAGTAATCTTTTAAATCTATTCTTGAAAATTTAAGGACTAAAGTGGCTCATCTGAAATATCAGTGACCAAATGCACTTATTCTTGAAAACTTAGGGATCAAAATGATATTTTTTTCCTTCATTTTACATAGAAGATGACGATGGATAATTTAATCAAAATCTTGAACGAAGGAAACTCCTATGGTTTAAAAATAATTATAAACTTAATTAATGGTGGATGAAATGTTTCGCAATTGGAATCAAACCCTTAATTTGTCTTGCTCTAAAAGTCTTTCATAAGTCATGTAGAACCCTACATTTAGGTCCTACCTTGCTAACAACTCTCCCATGGTTGGAATTCACTCTGGAAAAACTGTGGTTGATTTGGATGGCATCAAACCAAGTAGGGGTTTTAATTGGAATTTGAAAGGAGTTTCCTTTTTAGAAAAAGAAACTTTGAATAGATACCCATATTGTAGAAAGAGAACTTATCTTATCTTTGCACATCTTGAAAAATAAAAAGGAAGCCTTTTTCTTTACCCATTTTGAAAAATGAAATTTTTATTAATACCCATATTGTAAAATAAGAAAAGATGGAGCCTTTCTCTTTCTTAGTTTTCTGAAGTCCCATGGTTCATGAAGAAAATAAGGCATAACCAAGTCCAAAGAATAGAACTATTCTATAGAGAGAAAAATAGAATGTGTGGATTGAAAAAAGTAGAACCAAAGATTCAAAATTTTCCAACTGTGTGGACCGTCAAACGTCAGAGAAAAAAACAGGGAAACCCATTTTTTTTTTTTTTTTGTTGGTTTTAAAAAAGAATCCCTTAACCCAACCCTACCATTCATATCATACTATCTATCATAGTACAGACTTGAGAAGAGTCAAATTATGACTCTCTCACAAGTCATAATTTATGAGAACTTGCCATTCTTATAAATCTTGGCGCTTTTTTCGGTCCAAATCTGTTAGGCAAACTCCATGCCTCTCCTCCAAATCCGGACCTCCAAATCTCTACATTAACCAAAATTGACCCTTTTTTTTTTTTTTTTATTTTTTATTTTACCCTCTCTAATGGCCACCGCCACCGTCAAGCCTCCCCGGCCAAAACTAGCCTGTTTCTCCTTCGCCGCCTATGCCAAAACCGTCATCGACCATCTCAAATCTCTCCAAATTCCCGTCCTTCCTGGCCTCTCAGACCCTGAATTTGCATCCGTCGAGTCCACCTTCCGATTCTCTTTCCCGCCGGACCTCCGTTCAATCCTACGAGAGGGTCTCCCAATTGGGTCTGGTTTCCCCAATTGGCGATCCTCCTCCATTCAACAGCTTAACATTCTGATCAATCTCCCCAAATTCTGCCTCCTCAAGGAAATTTCTCAAAGAAAATTCTGGTGTCAATCTTGGGGGACCCAACCTGATGACGCAAACGACGCCGTTGCTTTGGCCAAGCAATTCTTGGACAGAGCCCCTGTTCTTGTCCCCATTTACAGAAACTGTTACATCCCCTCCGCGCCGAACATGGCCGGAAACCCTGTATTTCACCTCGACGGTGGAGAGATTCGTGTTTCCAGCTTTGACTTGGCTGGATTCTTTCAAACCCATGAATATTCTCAACTGAGCAAGGCTGAATCGGACCGATTGGTGATCGACTCACCAGCTTGGGCCGCGACAGAGGCCCGAGCGGTGGAATTCTGGACGGAAGTGGCTTCGGGAAAGAAAGCGGAGGCGGCGCGTGAGGTAACGGAAGGGTGGTGGAATGAAGGGGAATTTGAAATGGGGTTGGAGGGATGCCTTGAACACGTGTTTTGGAAGTTGCGACAAGGCGGGTGGAGGGATGAAGATGTGAGAGATATGATGATGATGGACGGCCATGATCGGAGCTTAGAACAGAATGGAGCAACGATGGAGAAACAGAGAGTATCAGTATGTGAGATTTTATTGAGTGGGGGGTGGAGCAGGGACGATGTAGTGTACTCTCTTGATCTTGAAGACAAATCCGCCATTGTTATTCCTGAAGAAGAATCAACATTTGAAATCAATCTTCATCGTCATCCGCCGATTAGAATCCCCCGAGTAGAACGCAAGAAAAAGCCTCGTAGTACCACCACCAATCACCTTAAAATGCCCCCTTTTTTCTTTGCACCACATCGAAATTTAATCCTCTAACTACTTTTTGCTTTTATTTTGTTTGTAAATTATACAATGTAAATAAACACACATTCTTTTTTCTTTGAGTTAATAAAAAAGATACGATAAAGGTCCAATTTTGATCCACCTTCCACCGATTCATATTTCAATATTGCATTGTTTGTGTCATTATTATATATTAATATATCACTCTTTCTTTTTGTTGTATTCGAAAAGAAGGGAAAATGGGATGTCATATCATGTGGGCTAGGAAAATGACCTTATGAATGAAGTAATATGTCTTTTACGTTACAGAAATAATTACTTCCTCCCCGTCTATTGATGGGGCGGTGGCGTTCTGAAATGGGCAGCCGGCGTTGACAAATTGGAGAGGGAAATCGGTAACGGCGGTGACTGAGAGTCATCGGAGACCACCAGCTACGTTGACGGAGAAAAAAGGGCCTCAAAGTTGGAGTCGAAAACCGACTTTTGAAATGAAAAAGCCGACGAATCGGTCAAGATGTGGACTCTCTTACGCGTCGGTTTAAAAGGGGTATTAATGACAGTCTATAAAAGGGTTTTAATTAGAGGTGCTTCGAAAGGTAGAAAAAAAAA
mRNA sequence
CTTTAATTGCTTCTATCCCCTTTCCAGCCCTTCAAATATTCTCTTTTTTTCTTTTTTAATTTCAGTTTAAGTGATGGCATTATGCATCCGTATAAGAAGCTTATAGGCAGGCTTGTTTAAAGCTAAAGAAGTTGCTTTACTTTTTGGATTCTGAACTTCCAGTAGCTGTTGTCTTTTCCAATGCGATGGAAAAGGAAGAACAACCCAACTCTGCTTCTACCCCTAATCTTGAACACCAACCCAATGGTGTCACAAGCAAGAATGAGAAAAGCGTATCCGATAGGACAGATGAAGCAAAAACCGCCAAATCTGGATGCCAATTTTTGGAGAATGCGGCTCGCCAAAATCAGCAGTACACTGCGCTTCTTCAAAGGGCACTCAACCCTCAACATGCAGGGGAAAGGTCTTCACCGTCAAATGCCCCAGCCGCCGTGAATGAGCGGCTTCAGCTGCCACAGAATCTGGCTAACCTTCAGCACCAGCTCAGCCCGCCTCCACCGCCGCAGCCGCAGCAATTTGTCCTTTCTTCACAACCCTTTTGGGTACAGCCGCAGCCCAACATTTCTTTTGGAGCAACTGAAGGTAGCTGGCACGCCGCCGGAGCCTCGCCGAGATGTCAACCCCAAGCTCCTAATTTCTATTACCCTGTTGGATATCCGACATATCCAGGCTTTCCAGGTTCCAGGGATGCTTCAATTTGGTGGGGTCAAGCACAACCATTATTGTTTCCTGGATTATCCAATTACCCAAGGGCATCATGTGGTTTTGCCTCTTCTCAATCTTGGCCGATGCCAATTCCTAGTTGTGTAACATCTTCCTCTGGACAACCCCTTTTAAGAGGAGTCATCAAACCCCCTGAAAAGCTTTCTCAGAAGCATCAAAGACTTTGGGAAGCTCAGTCTGCGGAAAATGTACAATTGTGGAGTATGATAGGAGAGTTGCAAGGGGAATTAGCAGACTACAAGGGCCGCTTGAGTAAGCTTGAAGCTGAAATTTCATCTTTAAGATCAGCAGCTACGGATGAGCCTGCTGTGGAAGTTGGAAATGGTAGCATTACAGTGAGGGGACAACCAACGAAGCGAGGAAGGTCGAAACGAGCAATAGCCCCAGTTGGTTCACAGCCTCCATTGCAAGCTCGGACACGGGGACGAAAGCGAGCAATTGCAAGGACAAAAGTAGAAGAAGCAAAACCAACTTTTCTTGGAAAAGATAGCTTGAATAAGGTGGATGATAAACATAAAGATTTTACTTCTTTTGACATTACAGAGCAAGAAAAGAATGGTATTTCACCTGCCATCAATCAAAACAATGGGATCATGGAGATTGACGATGGCACTCTCAAGATGCCTGCCTCTGAAGACAATCAAGTTATTCAACAATGCCCTGAAAGTCAATCACGTGGAATTGAATTCAAGCCAGCATCATTATTGAAATCCAATTATGAAGGAATCATCTCTCAAGACTCCGAACAAAACGACTTTAGTATAGCGTCCCCATCAATATACACAAATGGAAATGTTAGCAGACAAGGAATAACTAGGTGGAACTTTAAACACGAAGACGAAGCTGCCGAATTGGGATTCCCTGCAGTAGGACACAAAAGGGAGAACGAAGAAATGGCAGATGAATTCAGCTCAGGACCTGAAGAGATTGAAACACAAAATGGCTCCTCGTGGTGTTGAAAGTAAGTTCTAAATGATCAAATGTTCCTGTGTTCTACCTTCAATAACAGGCATTACTGGATAATTCTGCCCCATTTGGCAAGCTACTGCCAATTGATGGAGATGGTTTGACCAAAATTTTGTACAAAGGAAGAATGGATTCAGCTTGCTTAACATGAAATAATTACTTCCTCCCCGTCTATTGATGGGGCGGTGGCGTTCTGAAATGGGCAGCCGGCGTTGACAAATTGGAGAGGGAAATCGGTAACGGCGGTGACTGAGAGTCATCGGAGACCACCAGCTACGTTGACGGAGAAAAAAGGGCCTCAAAGTTGGAGTCGAAAACCGACTTTTGAAATGAAAAAGCCGACGAATCGGTCAAGATGTGGACTCTCTTACGCGTCGGTTTAAAAGGGGTATTAATGACAGTCTATAAAAGGGTTTTAATTAGAGGTGCTTCGAAAGGTAGAAAAAAAAA
Coding sequence (CDS)
ATGGAAAAGGAAGAACAACCCAACTCTGCTTCTACCCCTAATCTTGAACACCAACCCAATGGTGTCACAAGCAAGAATGAGAAAAGCGTATCCGATAGGACAGATGAAGCAAAAACCGCCAAATCTGGATGCCAATTTTTGGAGAATGCGGCTCGCCAAAATCAGCAGTACACTGCGCTTCTTCAAAGGGCACTCAACCCTCAACATGCAGGGGAAAGGTCTTCACCGTCAAATGCCCCAGCCGCCGTGAATGAGCGGCTTCAGCTGCCACAGAATCTGGCTAACCTTCAGCACCAGCTCAGCCCGCCTCCACCGCCGCAGCCGCAGCAATTTGTCCTTTCTTCACAACCCTTTTGGGTACAGCCGCAGCCCAACATTTCTTTTGGAGCAACTGAAGGTAGCTGGCACGCCGCCGGAGCCTCGCCGAGATGTCAACCCCAAGCTCCTAATTTCTATTACCCTGTTGGATATCCGACATATCCAGGCTTTCCAGGTTCCAGGGATGCTTCAATTTGGTGGGGTCAAGCACAACCATTATTGTTTCCTGGATTATCCAATTACCCAAGGGCATCATGTGGTTTTGCCTCTTCTCAATCTTGGCCGATGCCAATTCCTAGTTGTGTAACATCTTCCTCTGGACAACCCCTTTTAAGAGGAGTCATCAAACCCCCTGAAAAGCTTTCTCAGAAGCATCAAAGACTTTGGGAAGCTCAGTCTGCGGAAAATGTACAATTGTGGAGTATGATAGGAGAGTTGCAAGGGGAATTAGCAGACTACAAGGGCCGCTTGAGTAAGCTTGAAGCTGAAATTTCATCTTTAAGATCAGCAGCTACGGATGAGCCTGCTGTGGAAGTTGGAAATGGTAGCATTACAGTGAGGGGACAACCAACGAAGCGAGGAAGGTCGAAACGAGCAATAGCCCCAGTTGGTTCACAGCCTCCATTGCAAGCTCGGACACGGGGACGAAAGCGAGCAATTGCAAGGACAAAAGTAGAAGAAGCAAAACCAACTTTTCTTGGAAAAGATAGCTTGAATAAGGTGGATGATAAACATAAAGATTTTACTTCTTTTGACATTACAGAGCAAGAAAAGAATGGTATTTCACCTGCCATCAATCAAAACAATGGGATCATGGAGATTGACGATGGCACTCTCAAGATGCCTGCCTCTGAAGACAATCAAGTTATTCAACAATGCCCTGAAAGTCAATCACGTGGAATTGAATTCAAGCCAGCATCATTATTGAAATCCAATTATGAAGGAATCATCTCTCAAGACTCCGAACAAAACGACTTTAGTATAGCGTCCCCATCAATATACACAAATGGAAATGTTAGCAGACAAGGAATAACTAGGTGGAACTTTAAACACGAAGACGAAGCTGCCGAATTGGGATTCCCTGCAGTAGGACACAAAAGGGAGAACGAAGAAATGGCAGATGAATTCAGCTCAGGACCTGAAGAGATTGAAACACAAAATGGCTCCTCGTGGTGTTGA
Protein sequence
MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTALLQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWVQPQPNISFGATEGSWHAAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQAQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLWEAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQPTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDFTSFDITEQEKNGISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPASLLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGFPAVGHKRENEEMADEFSSGPEEIETQNGSSWC
Homology
BLAST of Clc01G18010.1 vs. NCBI nr
Match:
XP_038883396.1 (uncharacterized protein LOC120074371 [Benincasa hispida])
HSP 1 Score: 848.2 bits (2190), Expect = 3.6e-242
Identity = 445/512 (86.91%), Postives = 461/512 (90.04%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTS-KNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTA 60
MEKEEQP ASTPNLEHQ NGV S KNEKSVSD TD AK AKSGCQFLENA QNQQ TA
Sbjct: 1 MEKEEQPKFASTPNLEHQANGVFSGKNEKSVSDGTDAAKNAKSGCQFLENAPLQNQQCTA 60
Query: 61 LLQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFW 120
LQRALNPQHAGE+ SPS APAAVNERLQLPQNLANLQHQLS PPPQPQQFV+SSQPFW
Sbjct: 61 FLQRALNPQHAGEK-SPSTAPAAVNERLQLPQNLANLQHQLS--PPPQPQQFVISSQPFW 120
Query: 121 VQPQPNISFGATEGSWHA-----AGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWG 180
VQPQP+ISFGATEGSW A AGASPRCQPQAPNFYYPVGYPTY GFPG RDASIWWG
Sbjct: 121 VQPQPSISFGATEGSWQAPVAFGAGASPRCQPQAPNFYYPVGYPTYSGFPGPRDASIWWG 180
Query: 181 QAQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRL 240
Q QPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRL
Sbjct: 181 QTQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRL 240
Query: 241 WEAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRG 300
WEAQSAENVQLWS+IGELQGELADYKGRLSKLE EISSLRSAATDEPAVEVGN ITVRG
Sbjct: 241 WEAQSAENVQLWSLIGELQGELADYKGRLSKLEVEISSLRSAATDEPAVEVGNDGITVRG 300
Query: 301 QPTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDF 360
QP KRGRSKRAIAPVGSQPPLQ RTRGRK A ARTKVEEAKPTFLGKDSLNKV+DKHKDF
Sbjct: 301 QPAKRGRSKRAIAPVGSQPPLQPRTRGRKPAFARTKVEEAKPTFLGKDSLNKVNDKHKDF 360
Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
TS DITEQ+KN GIS INQNNG MEI++GTLKMPA DNQV+QQCPE QS GIEFKP+S
Sbjct: 361 TSLDITEQDKNEGISATINQNNGSMEINEGTLKMPAPLDNQVLQQCPEIQSCGIEFKPSS 420
Query: 421 LLKSNYE-------GIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF 480
LLKSNYE GIIS+DSEQN+FSIASP+IYTNGNVSRQGI RWNFKHEDEAAELGF
Sbjct: 421 LLKSNYEENFIWSLGIISEDSEQNNFSIASPTIYTNGNVSRQGIARWNFKHEDEAAELGF 480
Query: 481 PAVGHKRENEEMADEFSSGPEEIETQNGSSWC 499
PAV HK+E+EEM DEFSSGPEEIET+NGSSWC
Sbjct: 481 PAVEHKKEDEEMVDEFSSGPEEIETKNGSSWC 509
BLAST of Clc01G18010.1 vs. NCBI nr
Match:
XP_031742364.1 (uncharacterized protein LOC101204298 isoform X2 [Cucumis sativus] >KGN48653.1 hypothetical protein Csa_003715 [Cucumis sativus])
HSP 1 Score: 777.3 bits (2006), Expect = 7.8e-221
Identity = 414/506 (81.82%), Postives = 436/506 (86.17%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKEEQP STP+LEHQ NG++SKNEKSVSD TD AK AKSG QFLEN A NQ YTAL
Sbjct: 1 MEKEEQPEFCSTPDLEHQANGISSKNEKSVSDGTDAAKKAKSGSQFLENGAPHNQHYTAL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQRA PQHA + SSP+ APAAVNERLQLPQN ANL HQLS PPQPQQFVLSSQPFWV
Sbjct: 61 LQRAHYPQHAEKPSSPT-APAAVNERLQLPQNAANLPHQLS--QPPQPQQFVLSSQPFWV 120
Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQP+ISFGATEGSW +AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPVAISAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
QP+LFPGLSNYPRASCGF SSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQ+LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQKLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRSAAT+EPAVEVGN I +RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSAATNEPAVEVGNDDIILRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
P KRGRSKRA APVGSQPPLQ RTR RK A+ARTKVEEAK T LGKDSLNK DD KHK F
Sbjct: 301 PAKRGRSKRATAPVGSQPPLQPRTRVRKPAVARTKVEEAKQTLLGKDSLNKADDNKHKYF 360
Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
TS DIT+Q+KN IS +INQNNGI+EIDD TLKMP S D QV++QC E GIEFKP S
Sbjct: 361 TSLDITKQDKNEDISASINQNNGIVEIDDDTLKMPVSLDTQVLEQCSEIHPCGIEFKPPS 420
Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
+LKSNYEGIIS+DSE NDFSIASP+IYTNGNV+RQGITRWNFK E AELGF PAV HK
Sbjct: 421 VLKSNYEGIISKDSEPNDFSIASPTIYTNGNVTRQGITRWNFKLEGGTAELGFPPAVVHK 480
Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
NEEMADEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEEMADEFSSGPEEIETQNGSSWC 503
BLAST of Clc01G18010.1 vs. NCBI nr
Match:
XP_031742362.1 (uncharacterized protein LOC101204298 isoform X1 [Cucumis sativus] >XP_031742363.1 uncharacterized protein LOC101204298 isoform X1 [Cucumis sativus])
HSP 1 Score: 770.4 bits (1988), Expect = 9.5e-219
Identity = 414/513 (80.70%), Postives = 436/513 (84.99%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKEEQP STP+LEHQ NG++SKNEKSVSD TD AK AKSG QFLEN A NQ YTAL
Sbjct: 1 MEKEEQPEFCSTPDLEHQANGISSKNEKSVSDGTDAAKKAKSGSQFLENGAPHNQHYTAL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQRA PQHA + SSP+ APAAVNERLQLPQN ANL HQLS PPQPQQFVLSSQPFWV
Sbjct: 61 LQRAHYPQHAEKPSSPT-APAAVNERLQLPQNAANLPHQLS--QPPQPQQFVLSSQPFWV 120
Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQP+ISFGATEGSW +AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPVAISAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
QP+LFPGLSNYPRASCGF SSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQ+LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQKLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRSAAT+EPAVEVGN I +RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSAATNEPAVEVGNDDIILRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
P KRGRSKRA APVGSQPPLQ RTR RK A+ARTKVEEAK T LGKDSLNK DD KHK F
Sbjct: 301 PAKRGRSKRATAPVGSQPPLQPRTRVRKPAVARTKVEEAKQTLLGKDSLNKADDNKHKYF 360
Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
TS DIT+Q+KN IS +INQNNGI+EIDD TLKMP S D QV++QC E GIEFKP S
Sbjct: 361 TSLDITKQDKNEDISASINQNNGIVEIDDDTLKMPVSLDTQVLEQCSEIHPCGIEFKPPS 420
Query: 421 LLKSNYE-------GIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF 480
+LKSNYE GIIS+DSE NDFSIASP+IYTNGNV+RQGITRWNFK E AELGF
Sbjct: 421 VLKSNYEENFIWSLGIISKDSEPNDFSIASPTIYTNGNVTRQGITRWNFKLEGGTAELGF 480
Query: 481 -PAVGHKRENEEMADEFSSGPEEIETQNGSSWC 499
PAV HK NEEMADEFSSGPEEIETQNGSSWC
Sbjct: 481 PPAVVHKTGNEEMADEFSSGPEEIETQNGSSWC 510
BLAST of Clc01G18010.1 vs. NCBI nr
Match:
XP_016899165.1 (PREDICTED: uncharacterized protein LOC103484887 [Cucumis melo] >XP_016899166.1 PREDICTED: uncharacterized protein LOC103484887 [Cucumis melo] >KAA0036381.1 Cys-Gly metallodipeptidase DUG1 [Cucumis melo var. makuwa] >TYK12777.1 Cys-Gly metallodipeptidase DUG1 [Cucumis melo var. makuwa])
HSP 1 Score: 765.8 bits (1976), Expect = 2.4e-217
Identity = 404/506 (79.84%), Postives = 434/506 (85.77%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKEEQP +STPNLEHQ NGV+SKNEKSVSD TD AK AKSGCQ LEN + NQ YTAL
Sbjct: 1 MEKEEQPKFSSTPNLEHQANGVSSKNEKSVSDGTDAAKNAKSGCQLLENESPHNQHYTAL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQ A P+HA E+ SPS AP+AVNER QLPQ+ ANLQHQLS PPQPQQFVLSSQPFW+
Sbjct: 61 LQSAHYPEHA-EKPSPSTAPSAVNERHQLPQSPANLQHQLS--QPPQPQQFVLSSQPFWI 120
Query: 121 QPQPNISFGATEGSWHA-----AGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQP+ISFGATEGSW + AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPAAFGAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
QP+LFPGLSNYPRASCGF SSQSWPMPIPSC TSSSGQPLLRGVIKPPEKLSQKH++LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCATSSSGQPLLRGVIKPPEKLSQKHKKLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRS+AT+EPAVEVGNG IT+RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSSATNEPAVEVGNGDITLRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
PTKRGR KR APVGSQ PLQ TR RK A+ RTKVE+AK T LGKDSLNK DD KHK F
Sbjct: 301 PTKRGRLKRGTAPVGSQSPLQPHTRVRKPAVGRTKVEDAKQTLLGKDSLNKADDNKHKYF 360
Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
TS DIT+Q+KN S INQNNGI+EIDD TLKMPAS DNQV++QC E QS GIEFKP S
Sbjct: 361 TSLDITKQDKNEDSSTTINQNNGIVEIDDDTLKMPASLDNQVLEQCSEIQSCGIEFKPPS 420
Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
+LKSNYEGIIS+DSE+NDF IAS +IYTNGNV+RQGI+RWNFK DEAAELGF P V HK
Sbjct: 421 VLKSNYEGIISEDSERNDFRIASSTIYTNGNVTRQGISRWNFKLVDEAAELGFPPPVVHK 480
Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
NE+M DEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEDMTDEFSSGPEEIETQNGSSWC 503
BLAST of Clc01G18010.1 vs. NCBI nr
Match:
KAG7034256.1 (hypothetical protein SDJN02_03983, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 735.3 bits (1897), Expect = 3.4e-208
Identity = 395/504 (78.37%), Postives = 420/504 (83.33%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKE++ AST L+H PNGV+ KNEKSV D TD AK KSGCQFLENAA QNQQYT L
Sbjct: 1 MEKEDELKFASTAKLQHHPNGVSRKNEKSVFDGTDSAKNVKSGCQFLENAAPQNQQYTEL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQRALNP+HAGE+SS APAAVNERLQ P+NL LQHQL PPP PQPQQFVLSSQPFWV
Sbjct: 61 LQRALNPRHAGEKSSLPAAPAAVNERLQPPENLPKLQHQLIPPPQPQPQQFVLSSQPFWV 120
Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQP+IS GATEGSW GASPRCQPQAPNF YPVGYPTYPGF GS DASIWWGQ
Sbjct: 121 QPQPSISLGATEGSWQTPAAFGTGASPRCQPQAPNFCYPVGYPTYPGFQGSWDASIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
PLLFPGLSNYPRAS GFASSQS PMPIP+CVT SSGQPLLRGVIKPPE+LSQKHQRLW
Sbjct: 181 TPPLLFPGLSNYPRASYGFASSQSCPMPIPNCVTFSSGQPLLRGVIKPPERLSQKHQRLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIG+ QGELAD KGRL KLEAEISSLRS AT+EPAVEVGNG ITVRGQ
Sbjct: 241 EAQSAENVQLWSMIGQFQGELADCKGRLIKLEAEISSLRSVATNEPAVEVGNGGITVRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDFT 360
P+KRGRSKRAIAPVGS Q+RTR RK A+ TKV E KPT LGKDSLNKVDD HKDFT
Sbjct: 301 PSKRGRSKRAIAPVGS----QSRTRARKPAVGGTKV-EVKPTLLGKDSLNKVDDTHKDFT 360
Query: 361 SFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPASL 420
DITEQ+KN GIS I GIMEID+GTLKMP S NQ +QQ P+ QS GIEFK S
Sbjct: 361 PLDITEQDKNEGISATI----GIMEIDEGTLKMPISFGNQDLQQFPDIQSCGIEFKSPSS 420
Query: 421 LKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGFPAVGHKRE 480
LKSNYEGII DS+ ND SIASP+IYTNGNVSRQGITRWNF+ E EAAE GFP VG+K+E
Sbjct: 421 LKSNYEGIICGDSKLNDLSIASPTIYTNGNVSRQGITRWNFEDEVEAAESGFPVVGNKKE 480
Query: 481 NEEMADEFSSGPEEIETQNGSSWC 499
N+EMADEFSSG EEIETQNGSSWC
Sbjct: 481 NKEMADEFSSGAEEIETQNGSSWC 495
BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match:
A0A0A0KGK4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G497040 PE=4 SV=1)
HSP 1 Score: 777.3 bits (2006), Expect = 3.8e-221
Identity = 414/506 (81.82%), Postives = 436/506 (86.17%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKEEQP STP+LEHQ NG++SKNEKSVSD TD AK AKSG QFLEN A NQ YTAL
Sbjct: 1 MEKEEQPEFCSTPDLEHQANGISSKNEKSVSDGTDAAKKAKSGSQFLENGAPHNQHYTAL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQRA PQHA + SSP+ APAAVNERLQLPQN ANL HQLS PPQPQQFVLSSQPFWV
Sbjct: 61 LQRAHYPQHAEKPSSPT-APAAVNERLQLPQNAANLPHQLS--QPPQPQQFVLSSQPFWV 120
Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQP+ISFGATEGSW +AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPVAISAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
QP+LFPGLSNYPRASCGF SSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQ+LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQKLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRSAAT+EPAVEVGN I +RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSAATNEPAVEVGNDDIILRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
P KRGRSKRA APVGSQPPLQ RTR RK A+ARTKVEEAK T LGKDSLNK DD KHK F
Sbjct: 301 PAKRGRSKRATAPVGSQPPLQPRTRVRKPAVARTKVEEAKQTLLGKDSLNKADDNKHKYF 360
Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
TS DIT+Q+KN IS +INQNNGI+EIDD TLKMP S D QV++QC E GIEFKP S
Sbjct: 361 TSLDITKQDKNEDISASINQNNGIVEIDDDTLKMPVSLDTQVLEQCSEIHPCGIEFKPPS 420
Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
+LKSNYEGIIS+DSE NDFSIASP+IYTNGNV+RQGITRWNFK E AELGF PAV HK
Sbjct: 421 VLKSNYEGIISKDSEPNDFSIASPTIYTNGNVTRQGITRWNFKLEGGTAELGFPPAVVHK 480
Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
NEEMADEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEEMADEFSSGPEEIETQNGSSWC 503
BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match:
A0A1S4DT48 (uncharacterized protein LOC103484887 OS=Cucumis melo OX=3656 GN=LOC103484887 PE=4 SV=1)
HSP 1 Score: 765.8 bits (1976), Expect = 1.1e-217
Identity = 404/506 (79.84%), Postives = 434/506 (85.77%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKEEQP +STPNLEHQ NGV+SKNEKSVSD TD AK AKSGCQ LEN + NQ YTAL
Sbjct: 1 MEKEEQPKFSSTPNLEHQANGVSSKNEKSVSDGTDAAKNAKSGCQLLENESPHNQHYTAL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQ A P+HA E+ SPS AP+AVNER QLPQ+ ANLQHQLS PPQPQQFVLSSQPFW+
Sbjct: 61 LQSAHYPEHA-EKPSPSTAPSAVNERHQLPQSPANLQHQLS--QPPQPQQFVLSSQPFWI 120
Query: 121 QPQPNISFGATEGSWHA-----AGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQP+ISFGATEGSW + AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPAAFGAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
QP+LFPGLSNYPRASCGF SSQSWPMPIPSC TSSSGQPLLRGVIKPPEKLSQKH++LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCATSSSGQPLLRGVIKPPEKLSQKHKKLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRS+AT+EPAVEVGNG IT+RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSSATNEPAVEVGNGDITLRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
PTKRGR KR APVGSQ PLQ TR RK A+ RTKVE+AK T LGKDSLNK DD KHK F
Sbjct: 301 PTKRGRLKRGTAPVGSQSPLQPHTRVRKPAVGRTKVEDAKQTLLGKDSLNKADDNKHKYF 360
Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
TS DIT+Q+KN S INQNNGI+EIDD TLKMPAS DNQV++QC E QS GIEFKP S
Sbjct: 361 TSLDITKQDKNEDSSTTINQNNGIVEIDDDTLKMPASLDNQVLEQCSEIQSCGIEFKPPS 420
Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
+LKSNYEGIIS+DSE+NDF IAS +IYTNGNV+RQGI+RWNFK DEAAELGF P V HK
Sbjct: 421 VLKSNYEGIISEDSERNDFRIASSTIYTNGNVTRQGISRWNFKLVDEAAELGFPPPVVHK 480
Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
NE+M DEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEDMTDEFSSGPEEIETQNGSSWC 503
BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match:
A0A5D3CLF7 (Cys-Gly metallodipeptidase DUG1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G003660 PE=4 SV=1)
HSP 1 Score: 765.8 bits (1976), Expect = 1.1e-217
Identity = 404/506 (79.84%), Postives = 434/506 (85.77%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKEEQP +STPNLEHQ NGV+SKNEKSVSD TD AK AKSGCQ LEN + NQ YTAL
Sbjct: 1 MEKEEQPKFSSTPNLEHQANGVSSKNEKSVSDGTDAAKNAKSGCQLLENESPHNQHYTAL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQ A P+HA E+ SPS AP+AVNER QLPQ+ ANLQHQLS PPQPQQFVLSSQPFW+
Sbjct: 61 LQSAHYPEHA-EKPSPSTAPSAVNERHQLPQSPANLQHQLS--QPPQPQQFVLSSQPFWI 120
Query: 121 QPQPNISFGATEGSWHA-----AGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQP+ISFGATEGSW + AGASP CQPQAPNFYYPVGYPTYPGFPGSRD SIWWGQ
Sbjct: 121 QPQPSISFGATEGSWQSPAAFGAGASPICQPQAPNFYYPVGYPTYPGFPGSRDGSIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
QP+LFPGLSNYPRASCGF SSQSWPMPIPSC TSSSGQPLLRGVIKPPEKLSQKH++LW
Sbjct: 181 TQPILFPGLSNYPRASCGFVSSQSWPMPIPSCATSSSGQPLLRGVIKPPEKLSQKHKKLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIGELQGELA YKGRLSKLEAEIS LRS+AT+EPAVEVGNG IT+RGQ
Sbjct: 241 EAQSAENVQLWSMIGELQGELAVYKGRLSKLEAEISCLRSSATNEPAVEVGNGDITLRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDD-KHKDF 360
PTKRGR KR APVGSQ PLQ TR RK A+ RTKVE+AK T LGKDSLNK DD KHK F
Sbjct: 301 PTKRGRLKRGTAPVGSQSPLQPHTRVRKPAVGRTKVEDAKQTLLGKDSLNKADDNKHKYF 360
Query: 361 TSFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPAS 420
TS DIT+Q+KN S INQNNGI+EIDD TLKMPAS DNQV++QC E QS GIEFKP S
Sbjct: 361 TSLDITKQDKNEDSSTTINQNNGIVEIDDDTLKMPASLDNQVLEQCSEIQSCGIEFKPPS 420
Query: 421 LLKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGF-PAVGHK 480
+LKSNYEGIIS+DSE+NDF IAS +IYTNGNV+RQGI+RWNFK DEAAELGF P V HK
Sbjct: 421 VLKSNYEGIISEDSERNDFRIASSTIYTNGNVTRQGISRWNFKLVDEAAELGFPPPVVHK 480
Query: 481 RENEEMADEFSSGPEEIETQNGSSWC 499
NE+M DEFSSGPEEIETQNGSSWC
Sbjct: 481 TGNEDMTDEFSSGPEEIETQNGSSWC 503
BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match:
A0A6J1IVR4 (uncharacterized protein LOC111478840 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478840 PE=4 SV=1)
HSP 1 Score: 732.6 bits (1890), Expect = 1.1e-207
Identity = 394/504 (78.17%), Postives = 418/504 (82.94%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKE++ AST NL+H PNGV+ KNEKSV D TD AK AKSGCQFLENAA QNQQYT L
Sbjct: 1 MEKEDELKCASTANLQHHPNGVSRKNEKSVFDGTDSAKNAKSGCQFLENAAPQNQQYTEL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQRALNP+HAGE+SS APAAVNERLQ P+NL QHQL PPP PQPQQFVLSSQPFWV
Sbjct: 61 LQRALNPRHAGEKSSLPAAPAAVNERLQPPENLPKFQHQLIPPPQPQPQQFVLSSQPFWV 120
Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQ +IS GATEGSW AGASPRCQPQAPNF YPVGYPTYPGF GS DASIWWGQ
Sbjct: 121 QPQSSISLGATEGSWQTPAAFGAGASPRCQPQAPNFCYPVGYPTYPGFQGSWDASIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
PLLFPGLSNYPRAS GFASSQS PMPIPSCV SSSGQPLLRGVIKPPE+LSQKHQRLW
Sbjct: 181 TPPLLFPGLSNYPRASYGFASSQSCPMPIPSCVASSSGQPLLRGVIKPPERLSQKHQRLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIG+LQ ELAD KGRL KLEAEISSLRS ATDE AVEVGNG ITVRGQ
Sbjct: 241 EAQSAENVQLWSMIGQLQVELADCKGRLIKLEAEISSLRSVATDEAAVEVGNGGITVRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDFT 360
P KRGRSKRAIAPVGS Q+RTR RK + TKV E KPT LGKDSLNKVDD H+DFT
Sbjct: 301 PAKRGRSKRAIAPVGS----QSRTRARKPTVGGTKVGEVKPTLLGKDSLNKVDDTHEDFT 360
Query: 361 SFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPASL 420
DITEQ+KN GIS I GIMEID+GTLK+P S NQ +QQ P+ QS GIEFK S
Sbjct: 361 PLDITEQDKNEGISATI----GIMEIDEGTLKVPISFVNQDLQQFPDIQSCGIEFKSPSS 420
Query: 421 LKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGFPAVGHKRE 480
LKSNYEGII DS+ ND SIASP+IYTNGNVSRQGITRWNF+ E EAAE GFP VG+K+E
Sbjct: 421 LKSNYEGIICGDSKLNDLSIASPTIYTNGNVSRQGITRWNFEDEVEAAESGFPIVGNKKE 480
Query: 481 NEEMADEFSSGPEEIETQNGSSWC 499
N+EMADEFSSG EEIETQNGSSWC
Sbjct: 481 NKEMADEFSSGAEEIETQNGSSWC 496
BLAST of Clc01G18010.1 vs. ExPASy TrEMBL
Match:
A0A6J1GDK6 (uncharacterized protein LOC111453031 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453031 PE=4 SV=1)
HSP 1 Score: 727.6 bits (1877), Expect = 3.4e-206
Identity = 392/504 (77.78%), Postives = 418/504 (82.94%), Query Frame = 0
Query: 1 MEKEEQPNSASTPNLEHQPNGVTSKNEKSVSDRTDEAKTAKSGCQFLENAARQNQQYTAL 60
MEKE++ AST NL+H PNGV+ KNEKSV D TD AK KSGCQFLENAA QNQQYT L
Sbjct: 1 MEKEDELKFASTANLQHHPNGVSRKNEKSVFDGTDSAKNVKSGCQFLENAAPQNQQYTEL 60
Query: 61 LQRALNPQHAGERSSPSNAPAAVNERLQLPQNLANLQHQLSPPPPPQPQQFVLSSQPFWV 120
LQRALNP+HAGE+SS APAAVNERLQ P+NL LQHQL PQPQQFVLSSQPFWV
Sbjct: 61 LQRALNPRHAGEKSSLPAAPAAVNERLQPPENLPKLQHQLI----PQPQQFVLSSQPFWV 120
Query: 121 QPQPNISFGATEGSWH-----AAGASPRCQPQAPNFYYPVGYPTYPGFPGSRDASIWWGQ 180
QPQP+IS GATEGSW AGASPRCQPQAPNF YPVGYPTYPGF GS DASIWWGQ
Sbjct: 121 QPQPSISLGATEGSWQTPAAFGAGASPRCQPQAPNFCYPVGYPTYPGFQGSWDASIWWGQ 180
Query: 181 AQPLLFPGLSNYPRASCGFASSQSWPMPIPSCVTSSSGQPLLRGVIKPPEKLSQKHQRLW 240
PLLFPGLSNYPRAS G ASSQS PMPIP+CVTSSSGQPLLRGVIKPPE+LSQKHQRLW
Sbjct: 181 TPPLLFPGLSNYPRASYGLASSQSCPMPIPNCVTSSSGQPLLRGVIKPPERLSQKHQRLW 240
Query: 241 EAQSAENVQLWSMIGELQGELADYKGRLSKLEAEISSLRSAATDEPAVEVGNGSITVRGQ 300
EAQSAENVQLWSMIG+LQGELAD KGRL KLEAEIS LRS AT+EPAVEVGNG ITVRGQ
Sbjct: 241 EAQSAENVQLWSMIGQLQGELADCKGRLIKLEAEISPLRSVATNEPAVEVGNGGITVRGQ 300
Query: 301 PTKRGRSKRAIAPVGSQPPLQARTRGRKRAIARTKVEEAKPTFLGKDSLNKVDDKHKDFT 360
P+KRGRSKRAIAPVGS Q+RTR RK A+ TKV E KPT LGKDSLNKVDD HK+FT
Sbjct: 301 PSKRGRSKRAIAPVGS----QSRTRARKPAVGGTKVGEVKPTLLGKDSLNKVDDTHKNFT 360
Query: 361 SFDITEQEKN-GISPAINQNNGIMEIDDGTLKMPASEDNQVIQQCPESQSRGIEFKPASL 420
DITEQ+KN GIS I GIMEID+GTLKMP S NQ +QQ P+ QS GIEFK S
Sbjct: 361 PLDITEQDKNEGISTTI----GIMEIDEGTLKMPISFGNQDLQQFPDIQSCGIEFKSPSS 420
Query: 421 LKSNYEGIISQDSEQNDFSIASPSIYTNGNVSRQGITRWNFKHEDEAAELGFPAVGHKRE 480
LKSNYEGII DS+ ND SIASP+IYTNGNVSRQGITRWNF+ E EAAE GFP VG+K+E
Sbjct: 421 LKSNYEGIICGDSKLNDLSIASPTIYTNGNVSRQGITRWNFEDEVEAAESGFPVVGNKKE 480
Query: 481 NEEMADEFSSGPEEIETQNGSSWC 499
N+EMADEFSSG EEIETQNG SWC
Sbjct: 481 NKEMADEFSSGAEEIETQNGPSWC 492
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038883396.1 | 3.6e-242 | 86.91 | uncharacterized protein LOC120074371 [Benincasa hispida] | [more] |
XP_031742364.1 | 7.8e-221 | 81.82 | uncharacterized protein LOC101204298 isoform X2 [Cucumis sativus] >KGN48653.1 hy... | [more] |
XP_031742362.1 | 9.5e-219 | 80.70 | uncharacterized protein LOC101204298 isoform X1 [Cucumis sativus] >XP_031742363.... | [more] |
XP_016899165.1 | 2.4e-217 | 79.84 | PREDICTED: uncharacterized protein LOC103484887 [Cucumis melo] >XP_016899166.1 P... | [more] |
KAG7034256.1 | 3.4e-208 | 78.37 | hypothetical protein SDJN02_03983, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0KGK4 | 3.8e-221 | 81.82 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G497040 PE=4 SV=1 | [more] |
A0A1S4DT48 | 1.1e-217 | 79.84 | uncharacterized protein LOC103484887 OS=Cucumis melo OX=3656 GN=LOC103484887 PE=... | [more] |
A0A5D3CLF7 | 1.1e-217 | 79.84 | Cys-Gly metallodipeptidase DUG1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_... | [more] |
A0A6J1IVR4 | 1.1e-207 | 78.17 | uncharacterized protein LOC111478840 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1GDK6 | 3.4e-206 | 77.78 | uncharacterized protein LOC111453031 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
Relationships
This mRNA is a part of the following gene feature(s):
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Clc01G18010.1-exon | Clc01G18010.1-exon-ClcChr01:30598516..30598815 | exon |
Clc01G18010.1-exon | Clc01G18010.1-exon-ClcChr01:30602517..30602909 | exon |
Clc01G18010.1-exon | Clc01G18010.1-exon-ClcChr01:30603027..30603573 | exon |
Clc01G18010.1-exon | Clc01G18010.1-exon-ClcChr01:30603655..30603875 | exon |
Clc01G18010.1-exon | Clc01G18010.1-exon-ClcChr01:30604038..30604461 | exon |
Clc01G18010.1-exon | Clc01G18010.1-exon-ClcChr01:30604689..30604942 | exon |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Clc01G18010.1-three_prime_utr | Clc01G18010.1-three_prime_utr-ClcChr01:30598516..30598815 | three_prime_UTR |
Clc01G18010.1-three_prime_utr | Clc01G18010.1-three_prime_utr-ClcChr01:30602517..30602673 | three_prime_UTR |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Clc01G18010.1-cds | Clc01G18010.1-cds-ClcChr01:30602674..30602909 | CDS |
Clc01G18010.1-cds | Clc01G18010.1-cds-ClcChr01:30603027..30603573 | CDS |
Clc01G18010.1-cds | Clc01G18010.1-cds-ClcChr01:30603655..30603875 | CDS |
Clc01G18010.1-cds | Clc01G18010.1-cds-ClcChr01:30604038..30604461 | CDS |
Clc01G18010.1-cds | Clc01G18010.1-cds-ClcChr01:30604689..30604757 | CDS |
The following five_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Clc01G18010.1-five_prime_utr | Clc01G18010.1-five_prime_utr-ClcChr01:30604758..30604942 | five_prime_UTR |
The following polypeptide feature(s) derives from this mRNA:
Feature Name | Unique Name | Type |
Clc01G18010.1 | Clc01G18010.1-protein | polypeptide |