Clc01G19880 (gene) Watermelon (cordophanus) v2

Overview
NameClc01G19880
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionUnknown protein
LocationClcChr01: 31986023 .. 31991657 (+)
RNA-Seq ExpressionClc01G19880
SyntenyClc01G19880
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAATATTTGCGGAGAAGGTGTCTGCGTGTGCGTCAAGGCGAAGAGGAAGAGGCGGTCGTTGCGATTCTAAAAAAAAAAAAAATAAAAAAAAACTTGGAGAGGAGGATTTCAATTTGTCTCTTTCTCTCTCTTGTTTCTCTCTCTTCAGCCCTCCATTTATCTTCATGCTCTTTTTATGAAGGACGGGGAAGAGTCTAGAGAAATGGAATCGATACTGTAAAATGGGACTTTCTGTCCTTTCTATCAACTTACCGACTCCTCCCCGATCCTCTGCTTTCCTTCTTGCCTCTTGCGCTTAGGGTTTTTATTGCTACCTTCTTATTTTTAGAGGTACGTTCTGTTCTGGGTTGTTTTTGTTTCATTTTGTGGGATTTTGTTGGATGTTTGAATGAGGGGTTTTGGAGAATCGTGTTTTGGAAATTTTCTTTTTGGGTTTTTGAGGGTTGGGTGTGCGGGCCATTTCAGAATCTTTGATGGGTTTTATTTGGATTTTTTTTCTTTCTTCGTTTAATTTGGTTTTTTTCGAACGATTTATTTTAGGGGATTTTCTTTGTTTTGAAACCTTCTCTTGCTTTATTGGAGGTTGAGCTTTGTGGTTTTTTAAAGATTTAATCTGGTTTGAATTTTTTTCCCCTTGTAGATTCGTTTTTGGTGTGATTGTTTTGTGTAATAGCGAGTTCTTAAGCTGTTCTTATGGCTATCTATGATTTTTTATTTAAATTCTTTTATTTTATATTTTTAAGGATTTGGGTTATGCTTTGTTTTTAAAAATTTGGCTGAATGGGACTATTGTGGGATGTGTTTACTTTAGTATTTTTTTTTTGGTTTTAAATTAGTGCAATTCGATTGAGGTTTTGGCATTTCTTTAGCAATGGTGTATGCTTTGTTTTTTAAAATTTGGCTGAAAGTGACTATTATGGGATATGTTTACTGTAGCAATGTTTTGTTTTAAATTAGTGCAATTCGATTGAGGCTTCGGCATTTCTTTAGCAGGGGTTTATACTTTGTTTCTTAAAATTTGGTTGAATGCGACTTTTTCTGGTATGTGTTTTCTTTAGCAATTTCTTGTTTTAGATTAGTGCAATTTGTTTGAGGTTTTGGCATTACTTTATCAATGGTTTATACTTTGTTATCGAAAATTTGGCTGAATGGGACTATAATGGGATGTGTTCGCTGTAGCAATGTTTTGTTTTAAATTAGTGCAATTCAATTGAGGTTTTGGCATTCTTGTATACTCACTTGGTTTGCTGCTTCTTCGAATGCAGAATCATGCCAACATTTACTACAATTGCGTTGGACAGGTTGTTAGAACCTGGAACTTCCAAATCTGTTGATAAGTCCCTTCCTAAACCTAAGCCTGCTTTGACCTTTAACCGTGCTCCAAGCACTAAGTTGGAGAGGAGAAATAGCTCATCAGTTGCTGACAGAAAAGTTCAGCGGCCTCAAATTAAGCCAGCACTGTATACCACCCCAGAGGCAACTCCTCTTCCGGATTCACCATCTTCCTTTCCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGGCCTCGTCTCTTGAAGAGTTTCTCTGAGGACGATGTCTCTCTTAAAAAGATGAACGATGGGGATGTAGGAAATGAGACTGTGAAGGGTTCAGATAAAAATGATGTACAATTGACTGAGGGTGCTTCTGTTCCTGGTGACATACCTATTCAAGACAAAGATGAAGAAAGAAATGGTCTAGATTGTGCTAGTAGTAGTAATGTGGGTCAAAATGGGAGTGTTGGTGGTGATCATGGTGCTACAGCTGTTCAACCTGTGAGCAATCACAATAATCATGAAAGCAGTATAATGATGAGTAATGGTGTTGCTCGGGAAAAGGATTCATTGAAGGTTGTTGTGCCAAATTCAGAAAGTATTGGAGATGCTGAAGACTTCTTTGACCCAAATGATTCTTTGAGTGTTACGAGTAACACGGATGGAGAGGATAATGGTTATGAACGTTCAGCTAAGTTTGGTACTCCTATGGGTGAATTTTATGATGCTTGGGAAGGTAAGCATGACTCTGAATTGAAAAAGTTATTTCTGATATATACTTTATTCTATTCTCGTCTCTAATTATGTAATTTACTGTTCAGTAGTTGGTACTTCATTTTTCAGTTTTTTAAAAATTGTTTTGAACTTTTGTAGTGCAACTCATGCAGAGTTGTTGTTGTTGGTGTTTTTTTTTTTTTAATTAATTAATTATTTAAATTAATTATCATTATTGTTCTTTAAATTCTAGAATCTAGTGTAACTGGATCAGACATTAGGATGAGGGTGTTGAGGGTGGGTGTTCGTTATATCGTGTTTAATTTCTGTATGTTTTGGGTCGCTCTTACTCTTTGTTTCCTGGACTTCTTACCCACTTGCTAGTTGCTACTGCATCAGAGTTTTATGGAACTTCTTTATAATAGATAGAGAGAAATAGTGAATGGCTAGATTACTGTAATGACTAGATTAAAATTTTTTTTGAATTTAAATAAAGGTTTCCAATTAAGAGGGGCATGATAGTAATAATTATTAATTATTTTTCAGATAAGAATAGTAAGAACTGTTAATTAGGGGCGTGGTTTTTCAGAGGATCATTCTTACATGCAAAGTAATGGATGAAAGCCTGTAATGACCAGATTAAAATTTTTAAAAACTGTTTTGGACTTTTGTAGTGCAAGAGGATTTTTTTTTTTTTTCATTCATCATTTTTCATTAATCATTTTTCAATGTTCTAGCATCTAGTGTAATTGGATCTAGCACATGCTTCTTTTGTAGTTGGGTTGGATTTATGGATATGTTATGATGAAATTCAGAGATCATGCAAGTAGAACGCTAGGATATGGGATGAAATGTTGAATTTTTTAAATCTATGAGTTGCCTTTGAACTACTTGCCTAAATTATGATTGCTAGGTTTAAGTCGACTTCAGTACCAATGACCTAGTTTCATTCTGCAGAGCTTTCCTCTGAGGGCTTGCCACAACCATCTATTTCCGATATTGAAGCTGAATTACGTGAAATGAAACTAATGCTACTGATGGAGCTAGAGAAACGAAAGCAGGCTGAGGAAGCACTGAATAAATTGCAGGGCCAGTGGCAGAGGCTTAGAGAACAACTATGGCTGGTAGGATTGACCCTTCCCTCAGATCCCACAGTAGCCACAGAAGGGAAGCAGTTAGATTCTGACCCTGCTGAAGAATTGTGCCAACAAGTTCATCTTGCCAGGTTTGTGTCAGATGCTATTGGAAGGGGTATAGCGAGGGCAGAGGTGGAGACTGAGATGGAGGCGCAGCTTGAAGTCAAGAATTTTGAGATCGCTCGATTGCTGGACCGGCTCCGTTACTATGAGGCAGTGAATCATGAAATGTCCCAGAGGAATCAAGAAGCTGTAGGTAAGAACCCTGTAAACTCGACTGGTGTTCTGTAATTTCAATCACTCAATGTTGGTGTCTTGACCTCCCCCTCTCTTTCATTTTTGTTTTATTCTTCTTTAGGGTGGTGGCTGTTGAGGGGTTCGTTATCTTGTGTATTTAATTTCTGTATGTTTTGTCACTTTTATTCCTTTTTTTCCTGGACTTCTTACCCACTTGACAGTTGCCACTGCATCAGATTTTTATCGAACTTCTTTCTAATATATGGAGAGAAATAGTGAGTAGTTAGGTTACTGTAATGACTAGATTAGAAGTTTTGTGCGTATAAATAAAGGCTTCCAGTTAAGAAGGGCATGATAGAAATAATTATTAATTAGGAGGCGTGGTTTTTCAAAGGTCATTCTTACATGGAAAGTAAGGGATGAAGGCAGGACTGTAATGACTAGATTAGAATTTTCGTGCATATAAATAAAGGCTTCCGGTTAAGAAGGGCATGGTGGAAATGATTATTAGTTAGGAGGCGTGGTTTTTTCATGGGATTGTTCTTACATGCAAAGTAAGGGAGGAAGGCTTCGGCCAGTCATTGGATTCCATTTAGTTGACATGATCACATGCTTAAATGTCAACTATTTCAAACAGTGCCCTTGACTGGTGGAGCTTATATTATCAAGGTTGTAGTCCGTGGTTGACTTGAATTTTCAGAAACAATGCAAATGCAGTGAACAGATCAATGAGTACTGTTGTCTACTGTTGACGAATTTTTAATGAGTTATTACCATTATGATTACTCCAATATTGGTTTGAGGAGTGATCTACTGGTATTGATGTGATACTGTGTGCTTTACAGTTTGTCCCCTCTTATTGCCTTGCAAGTTCACTTGAAATAATGGATCTTGAAGGAATATAACATGGAAGTATTTGTATGTAAATCTTCAATTGCGTTTAATTTTCCATGGATTCTAGTGCCAGTATGGGGGGTTCTTGTTCTAATTAGTAGTGGGTTGTCAAAAATGTGCAAGTTGCCTCAAGCTATATAGCTCATTTTTACTAGGAGTTCCACTGGTCTAGTTGTGGAACAAATTCCTTTAGATCCATTGGTCTTTTTTGGATCCTACGGTCCTCCATTTGGCCTTGGTGGTTTTTTCTTTCATAGATTGTTAAGTCAAGTATTATAAAAATATTCAGACGGTGAGGAATAAAATCTTCTTTTCATACATCATTCCTGAAATCTTCTTTTCATACATCATTCCTGGACCCTATGATCTACCATTTTAAGCCTCAGTTCTCTGTTCTCCTTGGTTATGTGTCGAAAGTCTGTACTGTTGGCTAAGCTTTGTCTGTTTAATATTTGGTTGTCAGATTTGGCACGGCGCGAGAGGTTGAGAAGGAAAAGGAGGCAAAGATGGATCTGGGGTTCGGTTGCCACTGCAATCACACTCGGCACTGCAGTCTTAGCTTGGTCATACCTTCCATCAGGAAAAGATTTGCCATCCAGCAACAATTCGAAGGCCGAGCACGACGATGTGACAGATTGATAGCCGCTGACACAAGACTTAGCTTATGTACCGTGTTATAGAAAGGAAAAATAAAAGGAGCCATACATCCAATCATACATGTGGTATAATGTTGATATCAAATGTTGCTTTTGTCAGAAGTATTCGCATGGCCAAGGTCTAGTTCTTATAAATCGTCGGCCCTTATAATTAAAGGTGAAAAGGGTTGATTCTTGATATTCTTTCTTATTTTTCAGATCTAACAGGGTTATAACACCTGAAATTTTGGCAATTCATGCAATGATATACAGTCTGTTTACTGCTAACTGTAGTTGAAATATTTCTTTGAGCATCATCAAATAATTGTTTGAAGAGAGAACTTGGAACTCTAATGAAGTCTTTCCGAGTAATGCATATTCTAATCCTTCCATTACAAGTAGAGAGGGAGAATAATATCTTTGAGTTTGATGGCATGGTGTGGATATATCCAATAAGCTTTTAATGAATAAGCTTTTAATGTTTGCAGATGATGCAATAAATGGGAATTTTGTTCAATTAATTGAAGGTTAAGACAGTGATGTTTCCTAAAGTTGGGGACTGGCTTCATGCCTTGTTAGCGTGGAACTTTGACAATTCTCATTTAAAGAGTACGTTTAAAAAAAAAGTGAAAAAAAAAAAAGTGGGACTATTTAGGTCCATCTTTCTTGAATTGACCTATCAATTTATTAAAGAATTTGGAACTTTGACAATTCTCATTT

mRNA sequence

AAATATTTGCGGAGAAGGTGTCTGCGTGTGCGTCAAGGCGAAGAGGAAGAGGCGGTCGTTGCGATTCTAAAAAAAAAAAAAATAAAAAAAAACTTGGAGAGGAGGATTTCAATTTGTCTCTTTCTCTCTCTTGTTTCTCTCTCTTCAGCCCTCCATTTATCTTCATGCTCTTTTTATGAAGGACGGGGAAGAGTCTAGAGAAATGGAATCGATACTGTAAAATGGGACTTTCTGTCCTTTCTATCAACTTACCGACTCCTCCCCGATCCTCTGCTTTCCTTCTTGCCTCTTGCGCTTAGGGTTTTTATTGCTACCTTCTTATTTTTAGAGAATCATGCCAACATTTACTACAATTGCGTTGGACAGGTTGTTAGAACCTGGAACTTCCAAATCTGTTGATAAGTCCCTTCCTAAACCTAAGCCTGCTTTGACCTTTAACCGTGCTCCAAGCACTAAGTTGGAGAGGAGAAATAGCTCATCAGTTGCTGACAGAAAAGTTCAGCGGCCTCAAATTAAGCCAGCACTGTATACCACCCCAGAGGCAACTCCTCTTCCGGATTCACCATCTTCCTTTCCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGGCCTCGTCTCTTGAAGAGTTTCTCTGAGGACGATGTCTCTCTTAAAAAGATGAACGATGGGGATGTAGGAAATGAGACTGTGAAGGGTTCAGATAAAAATGATGTACAATTGACTGAGGGTGCTTCTGTTCCTGGTGACATACCTATTCAAGACAAAGATGAAGAAAGAAATGGTCTAGATTGTGCTAGTAGTAGTAATGTGGGTCAAAATGGGAGTGTTGGTGGTGATCATGGTGCTACAGCTGTTCAACCTGTGAGCAATCACAATAATCATGAAAGCAGTATAATGATGAGTAATGGTGTTGCTCGGGAAAAGGATTCATTGAAGGTTGTTGTGCCAAATTCAGAAAGTATTGGAGATGCTGAAGACTTCTTTGACCCAAATGATTCTTTGAGTGTTACGAGTAACACGGATGGAGAGGATAATGGTTATGAACGTTCAGCTAAGTTTGGTACTCCTATGGGTGAATTTTATGATGCTTGGGAAGAGCTTTCCTCTGAGGGCTTGCCACAACCATCTATTTCCGATATTGAAGCTGAATTACGTGAAATGAAACTAATGCTACTGATGGAGCTAGAGAAACGAAAGCAGGCTGAGGAAGCACTGAATAAATTGCAGGGCCAGTGGCAGAGGCTTAGAGAACAACTATGGCTGGTAGGATTGACCCTTCCCTCAGATCCCACAGTAGCCACAGAAGGGAAGCAGTTAGATTCTGACCCTGCTGAAGAATTGTGCCAACAAGTTCATCTTGCCAGGTTTGTGTCAGATGCTATTGGAAGGGGTATAGCGAGGGCAGAGGTGGAGACTGAGATGGAGGCGCAGCTTGAAGTCAAGAATTTTGAGATCGCTCGATTGCTGGACCGGCTCCGTTACTATGAGGCAGTGAATCATGAAATGTCCCAGAGGAATCAAGAAGCTGTAGATTTGGCACGGCGCGAGAGGTTGAGAAGGAAAAGGAGGCAAAGATGGATCTGGGGTTCGGTTGCCACTGCAATCACACTCGGCACTGCAGTCTTAGCTTGGTCATACCTTCCATCAGGAAAAGATTTGCCATCCAGCAACAATTCGAAGGCCGAGCACGACGATGTGACAGATTGATAGCCGCTGACACAAGACTTAGCTTATGTACCGTGTTATAGAAAGGAAAAATAAAAGGAGCCATACATCCAATCATACATGTGGTATAATGTTGATATCAAATGTTGCTTTTGTCAGAAGTATTCGCATGGCCAAGGTCTAGTTCTTATAAATCGTCGGCCCTTATAATTAAAGGTGAAAAGGGTTGATTCTTGATATTCTTTCTTATTTTTCAGATCTAACAGGGTTATAACACCTGAAATTTTGGCAATTCATGCAATGATATACAGTCTGTTTACTGCTAACTGTAGTTGAAATATTTCTTTGAGCATCATCAAATAATTGTTTGAAGAGAGAACTTGGAACTCTAATGAAGTCTTTCCGAGTAATGCATATTCTAATCCTTCCATTACAAGTAGAGAGGGAGAATAATATCTTTGAGTTTGATGGCATGGTGTGGATATATCCAATAAGCTTTTAATGAATAAGCTTTTAATGTTTGCAGATGATGCAATAAATGGGAATTTTGTTCAATTAATTGAAGGTTAAGACAGTGATGTTTCCTAAAGTTGGGGACTGGCTTCATGCCTTGTTAGCGTGGAACTTTGACAATTCTCATTTAAAGAGTACGTTTAAAAAAAAAGTGAAAAAAAAAAAAGTGGGACTATTTAGGTCCATCTTTCTTGAATTGACCTATCAATTTATTAAAGAATTTGGAACTTTGACAATTCTCATTT

Coding sequence (CDS)

ATGCCAACATTTACTACAATTGCGTTGGACAGGTTGTTAGAACCTGGAACTTCCAAATCTGTTGATAAGTCCCTTCCTAAACCTAAGCCTGCTTTGACCTTTAACCGTGCTCCAAGCACTAAGTTGGAGAGGAGAAATAGCTCATCAGTTGCTGACAGAAAAGTTCAGCGGCCTCAAATTAAGCCAGCACTGTATACCACCCCAGAGGCAACTCCTCTTCCGGATTCACCATCTTCCTTTCCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGGCCTCGTCTCTTGAAGAGTTTCTCTGAGGACGATGTCTCTCTTAAAAAGATGAACGATGGGGATGTAGGAAATGAGACTGTGAAGGGTTCAGATAAAAATGATGTACAATTGACTGAGGGTGCTTCTGTTCCTGGTGACATACCTATTCAAGACAAAGATGAAGAAAGAAATGGTCTAGATTGTGCTAGTAGTAGTAATGTGGGTCAAAATGGGAGTGTTGGTGGTGATCATGGTGCTACAGCTGTTCAACCTGTGAGCAATCACAATAATCATGAAAGCAGTATAATGATGAGTAATGGTGTTGCTCGGGAAAAGGATTCATTGAAGGTTGTTGTGCCAAATTCAGAAAGTATTGGAGATGCTGAAGACTTCTTTGACCCAAATGATTCTTTGAGTGTTACGAGTAACACGGATGGAGAGGATAATGGTTATGAACGTTCAGCTAAGTTTGGTACTCCTATGGGTGAATTTTATGATGCTTGGGAAGAGCTTTCCTCTGAGGGCTTGCCACAACCATCTATTTCCGATATTGAAGCTGAATTACGTGAAATGAAACTAATGCTACTGATGGAGCTAGAGAAACGAAAGCAGGCTGAGGAAGCACTGAATAAATTGCAGGGCCAGTGGCAGAGGCTTAGAGAACAACTATGGCTGGTAGGATTGACCCTTCCCTCAGATCCCACAGTAGCCACAGAAGGGAAGCAGTTAGATTCTGACCCTGCTGAAGAATTGTGCCAACAAGTTCATCTTGCCAGGTTTGTGTCAGATGCTATTGGAAGGGGTATAGCGAGGGCAGAGGTGGAGACTGAGATGGAGGCGCAGCTTGAAGTCAAGAATTTTGAGATCGCTCGATTGCTGGACCGGCTCCGTTACTATGAGGCAGTGAATCATGAAATGTCCCAGAGGAATCAAGAAGCTGTAGATTTGGCACGGCGCGAGAGGTTGAGAAGGAAAAGGAGGCAAAGATGGATCTGGGGTTCGGTTGCCACTGCAATCACACTCGGCACTGCAGTCTTAGCTTGGTCATACCTTCCATCAGGAAAAGATTTGCCATCCAGCAACAATTCGAAGGCCGAGCACGACGATGTGACAGATTGA

Protein sequence

MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQIKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETVKGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSNHNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERSAKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQGQWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGSVATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Homology
BLAST of Clc01G19880 vs. NCBI nr
Match: XP_038882592.1 (uncharacterized protein LOC120073808 [Benincasa hispida])

HSP 1 Score: 797.0 bits (2057), Expect = 8.8e-227
Identity = 418/458 (91.27%), Postives = 435/458 (94.98%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALT NRAPSTKLERRNS+SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTLNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETV 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KKMND DVGN +V
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSRKKMNDNDVGNGSV 120

Query: 121 KGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSN 180
           KGSD NDV+ TEG+SV  D+PI +KD +RNG DCASSSNV QNGSV GDHGATAVQ V+N
Sbjct: 121 KGSDSNDVKSTEGSSVTVDMPIPEKDGDRNGPDCASSSNVRQNGSVDGDHGATAVQLVNN 180

Query: 181 HNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERS 240
           H+NHES I++SNGVAREK+SLKVVV NSESIGD EDFFDP+DSLSVTSNTDGEDNG+ERS
Sbjct: 181 HSNHESRIVVSNGVAREKNSLKVVVSNSESIGDTEDFFDPHDSLSVTSNTDGEDNGFERS 240

Query: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQG 300
           AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKL LLMELEKRKQAEEALNKLQG
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLTLLMELEKRKQAEEALNKLQG 300

Query: 301 QWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEV 360
           QW RLREQL LVGLTLPSDP VATEG QLDSDPAEELCQQV+LARFVSD+IGRGIARAEV
Sbjct: 301 QWWRLREQLLLVGLTLPSDPPVATEGNQLDSDPAEELCQQVYLARFVSDSIGRGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           ETEMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 459
           VATAITLGTAVLAWSYLPSGKDLPSSNN+KAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNTKAEHDDVTD 458

BLAST of Clc01G19880 vs. NCBI nr
Match: TYK12610.1 (uncharacterized protein E5676_scaffold255G001960 [Cucumis melo var. makuwa])

HSP 1 Score: 767.3 bits (1980), Expect = 7.4e-218
Identity = 405/458 (88.43%), Postives = 429/458 (93.67%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNS+SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETV 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KKMND DVGN +V
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSN 180
           + SD NDV+LTEGASV    PI DK  +RNGLDCASSSN+G+NG V GDHGATAVQ VS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERS 240
           HNNHESSI+ S+G+A+EKDSLK VV NSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQG 300
           AKFGTPMGEFYDAWEELSSEG+PQPSISDIE + REM+  LLME+EKRKQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKRKQAEEALNKLQC 300

Query: 301 QWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEV 360
           QWQRLREQL LVGLTLPSDPTVATEGKQLDSDPAEELCQQV+LARFVS++IG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 459
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of Clc01G19880 vs. NCBI nr
Match: XP_008440744.1 (PREDICTED: uncharacterized protein LOC103485065 [Cucumis melo] >KAA0036213.1 uncharacterized protein E6C27_scaffold18G00100 [Cucumis melo var. makuwa])

HSP 1 Score: 765.8 bits (1976), Expect = 2.2e-217
Identity = 404/458 (88.21%), Postives = 429/458 (93.67%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNS+SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETV 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KKMND DVGN +V
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSN 180
           + SD NDV+LTEGASV    PI DK  +RNGLDCASSSN+G+NG V GDHGATAVQ VS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERS 240
           HNNHESSI+ S+G+A+EKDSLK VV NSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQG 300
           AKFGTPMGEFYDAWEELSSEG+PQPSISDIE + REM+  LLME+EK+KQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKQKQAEEALNKLQC 300

Query: 301 QWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEV 360
           QWQRLREQL LVGLTLPSDPTVATEGKQLDSDPAEELCQQV+LARFVS++IG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 459
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of Clc01G19880 vs. NCBI nr
Match: XP_004143521.1 (uncharacterized protein LOC101222171 [Cucumis sativus] >KGN48848.1 hypothetical protein Csa_002999 [Cucumis sativus])

HSP 1 Score: 763.1 bits (1969), Expect = 1.4e-216
Identity = 403/458 (87.99%), Postives = 429/458 (93.67%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPS+KLERRNS+SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSSKLERRNSTSVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETV 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KK ND DVGN +V
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSRKKKNDKDVGNGSV 120

Query: 121 KGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSN 180
           KGSD +DV+LTEGASV  + PI DKD +RNGLDCASSS+VG+NG VGGDHGATAVQ VS+
Sbjct: 121 KGSDGSDVKLTEGASVTVNTPIPDKDGDRNGLDCASSSSVGENGCVGGDHGATAVQLVSS 180

Query: 181 HNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERS 240
           HNNHESSIM SNG+A+EKDSLK VV NSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSIMTSNGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQG 300
           AKFGTPMGEFYDAWEELSSEG+ QPSISD E +LREM+  LLME+EKRKQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVLQPSISDTEPDLREMR--LLMEIEKRKQAEEALNKLQC 300

Query: 301 QWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEV 360
           QWQRLR +L LVGLTLPSDPTVATE KQLDSDPAEELCQQV+LARFVS++IG+GIARAEV
Sbjct: 301 QWQRLRARLLLVGLTLPSDPTVATEEKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           ETEMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 459
           VATAITLGTAVL WSYLPSGKDLPSSNNSK+EHDDVTD
Sbjct: 421 VATAITLGTAVLTWSYLPSGKDLPSSNNSKSEHDDVTD 455

BLAST of Clc01G19880 vs. NCBI nr
Match: KAG7034132.1 (hypothetical protein SDJN02_03859 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 726.1 bits (1873), Expect = 1.9e-205
Identity = 392/459 (85.40%), Postives = 409/459 (89.11%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIAL+RLLEPGTS+SVDKSLPKPKP+L  +RAPSTKLERRNS SVADRK+QRPQI
Sbjct: 1   MPTFTTIALERLLEPGTSRSVDKSLPKPKPSLNSDRAPSTKLERRNSPSVADRKIQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV-SLKKMNDGDVGNET 120
           KPALY TPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV S KKMND D+GN  
Sbjct: 61  KPALYATPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSSRKKMNDKDIGNGN 120

Query: 121 VKGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVS 180
           VKG+D NDV+LTEGASV  D+PI   D  RNGLDCASSS+VGQNGSV  DHGA  VQ  S
Sbjct: 121 VKGTDSNDVKLTEGASVVVDMPI--PDGHRNGLDCASSSHVGQNGSVDDDHGAAGVQLAS 180

Query: 181 NHNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYER 240
           NH+NH     MSNGV REKDSLKVVV NS  +GD EDFFDP DSLSVTSNTDGEDNG ER
Sbjct: 181 NHSNHG----MSNGVTREKDSLKVVVSNSGGVGDTEDFFDPQDSLSVTSNTDGEDNGIER 240

Query: 241 SAKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQ 300
           SAK GTP+GEFYDA E LSSEGLPQP ISDIEAEL EMKL L MELEKRKQAEE L+K +
Sbjct: 241 SAKIGTPVGEFYDALEALSSEGLPQPCISDIEAELCEMKLTLSMELEKRKQAEEILDKFR 300

Query: 301 GQWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAE 360
           GQWQRLREQL LVGLTLPSDPTVATEGKQLDSDPAEELCQQV+LARFVSD+IGRGIARAE
Sbjct: 301 GQWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVYLARFVSDSIGRGIARAE 360

Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420
           VETEMEAQLEVKNFEIARLLDRL YYEA NHEMSQRNQEAVDLARRERLRRKRRQRWIWG
Sbjct: 361 VETEMEAQLEVKNFEIARLLDRLHYYEAANHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420

Query: 421 SVATAITLGTAVLAWSYLPSGKDLPSSNNSKA-EHDDVT 458
           SVATAITLGT VLAWSYLPSGKDLPSSNNSKA EHDD T
Sbjct: 421 SVATAITLGTVVLAWSYLPSGKDLPSSNNSKAVEHDDAT 453

BLAST of Clc01G19880 vs. ExPASy TrEMBL
Match: A0A5D3CMF0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G001960 PE=4 SV=1)

HSP 1 Score: 767.3 bits (1980), Expect = 3.6e-218
Identity = 405/458 (88.43%), Postives = 429/458 (93.67%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNS+SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETV 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KKMND DVGN +V
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSN 180
           + SD NDV+LTEGASV    PI DK  +RNGLDCASSSN+G+NG V GDHGATAVQ VS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERS 240
           HNNHESSI+ S+G+A+EKDSLK VV NSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQG 300
           AKFGTPMGEFYDAWEELSSEG+PQPSISDIE + REM+  LLME+EKRKQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKRKQAEEALNKLQC 300

Query: 301 QWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEV 360
           QWQRLREQL LVGLTLPSDPTVATEGKQLDSDPAEELCQQV+LARFVS++IG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 459
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of Clc01G19880 vs. ExPASy TrEMBL
Match: A0A5A7T005 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold18G00100 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 1.0e-217
Identity = 404/458 (88.21%), Postives = 429/458 (93.67%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNS+SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETV 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KKMND DVGN +V
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSN 180
           + SD NDV+LTEGASV    PI DK  +RNGLDCASSSN+G+NG V GDHGATAVQ VS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERS 240
           HNNHESSI+ S+G+A+EKDSLK VV NSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQG 300
           AKFGTPMGEFYDAWEELSSEG+PQPSISDIE + REM+  LLME+EK+KQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKQKQAEEALNKLQC 300

Query: 301 QWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEV 360
           QWQRLREQL LVGLTLPSDPTVATEGKQLDSDPAEELCQQV+LARFVS++IG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 459
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of Clc01G19880 vs. ExPASy TrEMBL
Match: A0A1S3B1E0 (uncharacterized protein LOC103485065 OS=Cucumis melo OX=3656 GN=LOC103485065 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 1.0e-217
Identity = 404/458 (88.21%), Postives = 429/458 (93.67%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPSTKLERRNS+SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETV 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KKMND DVGN +V
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120

Query: 121 KGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSN 180
           + SD NDV+LTEGASV    PI DK  +RNGLDCASSSN+G+NG V GDHGATAVQ VS+
Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180

Query: 181 HNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERS 240
           HNNHESSI+ S+G+A+EKDSLK VV NSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQG 300
           AKFGTPMGEFYDAWEELSSEG+PQPSISDIE + REM+  LLME+EK+KQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKQKQAEEALNKLQC 300

Query: 301 QWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEV 360
           QWQRLREQL LVGLTLPSDPTVATEGKQLDSDPAEELCQQV+LARFVS++IG+GIARAEV
Sbjct: 301 QWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           E EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 EAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 459
           VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD
Sbjct: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455

BLAST of Clc01G19880 vs. ExPASy TrEMBL
Match: A0A0A0KH17 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G502840 PE=4 SV=1)

HSP 1 Score: 763.1 bits (1969), Expect = 6.8e-217
Identity = 403/458 (87.99%), Postives = 429/458 (93.67%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGT+KS+DKSLPKPKPALTFNRAPS+KLERRNS+SVADRKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSSKLERRNSTSVADRKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGDVGNETV 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVS KK ND DVGN +V
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSRKKKNDKDVGNGSV 120

Query: 121 KGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVSN 180
           KGSD +DV+LTEGASV  + PI DKD +RNGLDCASSS+VG+NG VGGDHGATAVQ VS+
Sbjct: 121 KGSDGSDVKLTEGASVTVNTPIPDKDGDRNGLDCASSSSVGENGCVGGDHGATAVQLVSS 180

Query: 181 HNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYERS 240
           HNNHESSIM SNG+A+EKDSLK VV NSES GD EDFFDP+DSLSV SNTDGEDNG+ERS
Sbjct: 181 HNNHESSIMTSNGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFERS 240

Query: 241 AKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQG 300
           AKFGTPMGEFYDAWEELSSEG+ QPSISD E +LREM+  LLME+EKRKQAEEALNKLQ 
Sbjct: 241 AKFGTPMGEFYDAWEELSSEGVLQPSISDTEPDLREMR--LLMEIEKRKQAEEALNKLQC 300

Query: 301 QWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAEV 360
           QWQRLR +L LVGLTLPSDPTVATE KQLDSDPAEELCQQV+LARFVS++IG+GIARAEV
Sbjct: 301 QWQRLRARLLLVGLTLPSDPTVATEEKQLDSDPAEELCQQVNLARFVSESIGKGIARAEV 360

Query: 361 ETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420
           ETEMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS
Sbjct: 361 ETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWGS 420

Query: 421 VATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 459
           VATAITLGTAVL WSYLPSGKDLPSSNNSK+EHDDVTD
Sbjct: 421 VATAITLGTAVLTWSYLPSGKDLPSSNNSKSEHDDVTD 455

BLAST of Clc01G19880 vs. ExPASy TrEMBL
Match: A0A6J1HDH8 (uncharacterized protein LOC111463164 OS=Cucurbita moschata OX=3662 GN=LOC111463164 PE=4 SV=1)

HSP 1 Score: 724.9 bits (1870), Expect = 2.0e-205
Identity = 387/460 (84.13%), Postives = 412/460 (89.57%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTFTTIALDRLLEPGTSKSVDK LPK  PALTFNRAP+T LERRNS+S A+RKVQRPQI
Sbjct: 1   MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQI 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLK-KMNDGDVGNET 120
           KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSED VS + KMND D+GN  
Sbjct: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDVVSSRQKMNDNDIGNVN 120

Query: 121 VKGSDKNDVQLTEGASVPGDIPIQDKDEERNGLDCASSSNVGQNGSVGGDHGATAVQPVS 180
           V GSD NDV+L+EGASV  D+PI +KD  RNGLDCA+SSNVGQNGSV GDHGATAVQ  S
Sbjct: 121 VNGSDSNDVKLSEGASVTVDLPIPNKDGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGS 180

Query: 181 NHNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGEDNGYER 240
           NH N+ S+IM+SN VAREKDSLKVVVP  +S+GDAEDFFDP DSLSV SNTDGEDNGYER
Sbjct: 181 NHTNNGSTIMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYER 240

Query: 241 SAKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEALNKLQ 300
           SAKF TPMGEFYDAWEE+SS+GLP PSIS IEAELREM+L LLMELEKRKQAEEAL+ L+
Sbjct: 241 SAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEALDNLR 300

Query: 301 GQWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGIARAE 360
           GQWQRLRE L LVGLTLPSDPTVAT G  L SDPAEELCQQV++ARFVS +IGRGIARAE
Sbjct: 301 GQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAE 360

Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420
           VETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRERLRRKRR RW+WG
Sbjct: 361 VETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLRWMWG 420

Query: 421 SVATAITLGTAVLAWSYLPSGKDLPSSNNSKA-EHDDVTD 459
           SVATAITLGTAVLAWSYLPSGKD  S N+SKA EHDD TD
Sbjct: 421 SVATAITLGTAVLAWSYLPSGKDSSSMNDSKATEHDDATD 460

BLAST of Clc01G19880 vs. TAIR 10
Match: AT3G50910.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G66480.1); Has 76 Blast hits to 75 proteins in 28 species: Archae - 0; Bacteria - 10; Metazoa - 7; Fungi - 2; Plants - 49; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 372.9 bits (956), Expect = 3.8e-103
Identity = 229/454 (50.44%), Postives = 304/454 (66.96%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQI 60
           MPTF+ IALDR+LEPG S SV+ S+P     L +++ P +KLE+       +R V RP +
Sbjct: 1   MPTFSAIALDRMLEPGASTSVE-SVPS-TTNLFYSKPPISKLEKGKGKLPNERTVTRPLM 60

Query: 61  KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRG-PRLLKSFSEDDVSLKKMNDGDVGNET 120
            PALY TP+A PLP+SPSSFPPSPYI+NHK RG PRLLKS SE +V +   +   +  ET
Sbjct: 61  SPALYATPDAIPLPNSPSSFPPSPYIINHKSRGPPRLLKSSSEANV-VSSSHQKTLEEET 120

Query: 121 VKGSDKNDVQLT-EGASVPGDIPIQD--KDEERNGLDCASSSNVGQNGSVGGDHGATAVQ 180
           +  + + DV+++    S     PI +  +D+  NG+   +  N   +G V G  G  +  
Sbjct: 121 I--TAETDVKVSPRRRSTSFSFPITEVTEDDYSNGVHARTVGNYNFDGIVDGPVGNWSPL 180

Query: 181 PVSNHNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSNTDGE-DN 240
              + N        +NG+ R     + V   ++   ++EDF+DP +S S TSNTD E D 
Sbjct: 181 DGKSGNGKSELDNAANGLERVNGLSEPVPIKTDKESESEDFYDPGESASFTSNTDVEGDA 240

Query: 241 GYERSAKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKRKQAEEAL 300
           G E S +  TP+GEFYDAW+ELS++   Q S+++IE+EL E++L LLME+EKRKQ EEAL
Sbjct: 241 GDESSQRLATPVGEFYDAWDELSTDSGMQSSVNNIESELSEIRLSLLMEIEKRKQTEEAL 300

Query: 301 NKLQGQWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVSDAIGRGI 360
            ++Q  WQRLREQ+  VGL +P DPT +T    L    +EEL  Q+ +ARFVSD++GRG+
Sbjct: 301 EQMQIHWQRLREQMAQVGLFVPIDPTASTNNMNL----SEELRCQLEIARFVSDSLGRGM 360

Query: 361 ARAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERLRRKRRQR 420
           A+AEVE EME+ LE KNFEI RL DRL YYEAVN EMSQRNQEA+++ARRER +RK+RQR
Sbjct: 361 AKAEVEMEMESMLETKNFEITRLSDRLHYYEAVNREMSQRNQEAIEVARRERQKRKKRQR 420

Query: 421 WIWGSVATAITLGTAVLAWSYLPSGKDLPSSNNS 450
           WIWGS+A  ITLG+A LAWSY+P+ K  PSS  S
Sbjct: 421 WIWGSIAATITLGSAALAWSYIPAAK--PSSEVS 443

BLAST of Clc01G19880 vs. TAIR 10
Match: AT5G66480.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50910.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 280.8 bits (717), Expect = 2.0e-75
Identity = 193/452 (42.70%), Postives = 262/452 (57.96%), Query Frame = 0

Query: 1   MPTFTTIALDRLLEPGTS-KSVDKSLPKPKPALTFNRAPSTKLERRNSSSVADRKVQRPQ 60
           MPTF+  AL R L  GTS  S   S  + KP++  + +   K          ++   RPQ
Sbjct: 1   MPTFSAAALGRSLNSGTSLSSKFPSTLQSKPSILNDESKQPK----------EKTFTRPQ 60

Query: 61  IKPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSLKKMNDGD---VG 120
           + P+LY T +  P P+SPSS+PPSPYI+NHK RGP L    SE D     +  G+    G
Sbjct: 61  MSPSLYATTKEIPHPNSPSSYPPSPYIINHKARGPVLFNRDSEVDGPSHPITSGEEKISG 120

Query: 121 NETVKG------SDKNDVQLTEGASVPGDIPIQDKD-EERNGLDCASSSNVGQNGSVGGD 180
           N  V+       S      +TE  +V     +  +   ER   DC+       N   G D
Sbjct: 121 NVDVEATASLSKSTSLSFPITEAIAVDHTNGVHTQGIHERPVWDCSPPLGTFLNEKSGRD 180

Query: 181 HGATAVQPVSNHNNHESSIMMSNGVAREKDSLKVVVPNSESIGDAEDFFDPNDSLSVTSN 240
                +      NN  S++   + +      L+ V   ++   + E+F++P + +S TSN
Sbjct: 181 ISNGGI----GSNNATSNLEWQSYL------LEPVRIKADKELEPENFYNPGELVSFTSN 240

Query: 241 TDGED-NGYERSAKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLMLLMELEKR 300
           T+ ED    E S    T +GEFYDA +ELS++   Q S ++IE+E+REM+L LLME+E+R
Sbjct: 241 TEVEDFERAESSHSLATHVGEFYDACDELSTDSGMQSSANNIESEVREMRLGLLMEIERR 300

Query: 301 KQAEEALNKLQGQWQRLREQLWLVGLTLPSDPTVATEGKQLDSDPAEELCQQVHLARFVS 360
           +QAE  L ++Q  W+RLR+QL  VG+ LP DPT      Q   + A+EL  Q+ + RFVS
Sbjct: 301 RQAEATLEQMQVHWRRLRDQLADVGMFLPLDPT----RSQYSMNLADELRCQLEVTRFVS 360

Query: 361 DAIGRGIARAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERL 420
           D +G  +A+ EVE EMEA+LE KNFEI RL DRL YYE VN EMSQRNQEA+++ARR+  
Sbjct: 361 DTLGSDLAKTEVEMEMEAELEAKNFEITRLSDRLHYYETVNQEMSQRNQEAIEVARRDGQ 420

Query: 421 RRKRRQRWIWGSVATAITLGTAVLAWSYLPSG 441
           +RKRRQRWIWGS+A  ITLG+ VLAWSYLP G
Sbjct: 421 KRKRRQRWIWGSIAATITLGSGVLAWSYLPPG 428

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038882592.18.8e-22791.27uncharacterized protein LOC120073808 [Benincasa hispida][more]
TYK12610.17.4e-21888.43uncharacterized protein E5676_scaffold255G001960 [Cucumis melo var. makuwa][more]
XP_008440744.12.2e-21788.21PREDICTED: uncharacterized protein LOC103485065 [Cucumis melo] >KAA0036213.1 unc... [more]
XP_004143521.11.4e-21687.99uncharacterized protein LOC101222171 [Cucumis sativus] >KGN48848.1 hypothetical ... [more]
KAG7034132.11.9e-20585.40hypothetical protein SDJN02_03859 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3CMF03.6e-21888.43Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A5A7T0051.0e-21788.21Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A1S3B1E01.0e-21788.21uncharacterized protein LOC103485065 OS=Cucumis melo OX=3656 GN=LOC103485065 PE=... [more]
A0A0A0KH176.8e-21787.99Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G502840 PE=4 SV=1[more]
A0A6J1HDH82.0e-20584.13uncharacterized protein LOC111463164 OS=Cucurbita moschata OX=3662 GN=LOC1114631... [more]
Match NameE-valueIdentityDescription
AT3G50910.13.8e-10350.44unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G66480.12.0e-7542.70unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 281..308
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 15..91
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 107..186
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 212..237
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 107..121
NoneNo IPR availablePANTHERPTHR35490:SF2BACTERIOPHAGE N4 ADSORPTION B PROTEINcoord: 1..455
NoneNo IPR availablePANTHERPTHR35490BACTERIOPHAGE N4 ADSORPTION B PROTEINcoord: 1..455

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc01G19880.1Clc01G19880.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane