CcUC01G007120.1 (mRNA) Watermelon (PI 537277) v1

Overview
NameCcUC01G007120.1
TypemRNA
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionGAGA-binding transcriptional activator
LocationCicolChr01: 7701245 .. 7709339 (-)
Sequence length1749
RNA-Seq ExpressionCcUC01G007120.1
SyntenyCcUC01G007120.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTCTCCCTTCTTTTTCCTTACTCTATTTGTTTGTCGCTGCAATCTTGCTCTTCCTCAACTTAGATCAGTCGCCATTGTCGTTTGCGCCTTGTTTTCTATAAATTGGAACTTTTTGAAGACGGATCGAACTGACCAGTTAGTTGATTGCAACAACAAAAAAAAATTGTCAATTTTCGTTTTCAGAATTGAAAATCGACAAGTTAAGTTTTAACGTTGGACTTTTATTAAAATCAACATAACGAACTAACTAATGAACACTTCATTAAAAACCAATATGAAATTGAACCGAGTTTGAAATTCAAAATTAAAACAAGATTTGAAAAAAAAATCAAAACAAGAATTTCATTTTAAAGAAAAAGTAAATGCAACCAAACTATTTTCTTTTATTAAACTTCCACCAAAATGCTATGATGATTTATGAAAAAAGAGAAGAAAAAGAAAAGAAAGAGAAGAAAGAACAAAAAGATATTACAAAGATAATGACAAAATTAAATTTTGTGTTGCAAAGGCACTTTTGGTTTAGTTTGGAAGATTTTTCTCACTACTTCTTCTACTTATAAATTATAATATAAATAAAAAGGAGCTCTCTTTTTTCACCCACTTTTCTCTTCCTTTTCCTTTTCCGCTGCTTTCATCTCTCTCTCTCTCTCTCTCTCTCTCTCCTTCCCTTTCAGTTTCTCTCTCCTCCGGCAAAACCCTAAAGTGCCGTTTACTCCCACCCAAGTAAGTCTTCTCTCTCCTCTCCTCTCCTCTCCCCTCTCTCTCTCTCTCTCTGTCTTTGTCTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCAATATATGGGGTCCTTGTGTGTGCTCTTTGACCATCACTGACATTTCATATATAAATATTGCTATCATTTGTTCTTTATACTTCTTCTATATCCACTTTTTTTATATATTTTTTTGTTTCTTTTTTGTTTCTGATGAACTGTGATTTCTGACGGTTTATTCCAAGATATTTTCTAACCCCTGATTTTGGTTTATGTAATCGAATGATGAGAATGTTTAATCTGAAATTTGGCTCCTTTTTAGGGTTTATGGTAGCAATCTATGTAAATGGATTACTTTATGTTTGCCCTTTATAATGCTTTTTTCCTTTTTGCGATATATGTATTGAAGAGGGGCTTTAATTAGATAGAAGTCCTAATTTTTAGTTGGTTGGTGGGATATTGGAATTTCCAATTTTGGAACTTTTGAATCAAGAGTGTGCTGTTAAATCCACCCATTTTCGTTATGTCGAGATCTTATTGGAATTAGTTTATAATCCTCACTGTTGGATTTGGAAAGAAAGTTAATTTGAGAGTATTTGATGAATTTCGTTTTCTTTAGGGCTAAATTGCAGGGGGTAGGCCAATTTTTTAATTTGTAGGAATATCGAAGCGATGCTGAAAATTCACTTGGTTTTTCCCTTTAAAATTTCTATGTTCCTTTTGTTTGTTGGGTGATTGAGGTTGATTTGAAACACCTGTTTTCTTCCTTCTCGGTGCATGTTTTGAGTGATTTTGAATTGTTTAATATCACTTTAAGATCACTCCAAAACATACCTTTAATCAATCAAAATTGACTCAATGTTTAATTTTACACTTTTAAATAAATTTTCATACTATTAAAATTGATTTTGTATGATTGAAAGCAATTTCTAAGTGATTTTGCAAACGACAGAAGTGATTTTATCCATTTTAGAATCACTTCCAACCATGTCATTAGTCGTACTTTTTGTGTTTTTAGTGTTTCTCATTATACTTCCAGGACTTTGATGACTATTTTAAGCTGTCACTCTTGTGTCTATGGTTGTTATGTTTAGAACATGGGAGTTTCTTGAGTCTAACTTGTAATCTCTGCTTATTGTGTTTACTCTGTAGTTCTAATGGATGGTGATGCATTGAACATGCGTAATTGGGGTTACTATGAACCATCGTTGAAAGCGCATCTTGGTCTGCAGCTCATGTCCACAATTGGTGAGCAAGATGTGAAACATTTTCTACCTGGACGGGACACTTCTGCTATTGTTAACATGAATGCTGCATTTCATTCACGGGAGTCTGTTGTTTCTGATGCATCAGTACCGACAAGCTGGGGGAGGGATGGTTGGATAAATCATCATAGAGACAAGCTTTTCAATATGTTAACCCCAACTACTAGTTATTCTCTGCTTGCAGAAACATCAGGCCCGCAACCCTTGCAAATGTTACAGCCGCATAACACATCAAGAGATGAAATGGTCCTCAAGATTGAAGAACCACCTGTGAAGAAGGGAACTAAACAACCGAAGAAAAGACAGAATGGGGGTGCTTCCAAAACACCAAAACCAAAGAAGCCGCGAAAGCCCAAAAATAATGATCCTTCAGTTCAGCAGGTGAAGGCACCAAAAAAGAAGATGGAACTTGTTATAAATGGGTTTGATATGGATATATCTAGCATCCCAATTCCAGTATGCTCTTGCACTGGAACTCCTCACCAATGTTATAGATGGGGCTATGGTGGCTGGCAATCAGCTTGTTGTACCACAAGTTTATCTCTACATCCTTTGCCCATGAGTGAGAAGCGGCGAGGTGCAAGAATTGCTGGTCGAAAAATGAGTCAAGGTGCTTTTAAGAAGGTTTTGGAGAAACTAGCAGCTCAAGGCTATAACTTTTCTAACCCAATTGATTTAAGATGCCATTGGGCAAGGCATGGGACTAATAAGTTTGTCACCATCAGGTAGATTAGCAGAGTATTATGATGGTAGGGGGCCGAGTCTGTTTTACATGTATATATCTGTGGCCTTTTGCAGCAATCTTGGGGTAGCTATAGCTGGTAATTTTTTCTCTTTATAGTTTATGGGTTCATAGAGTTTGGAAATTTCCATCTTCAATCTTGATTCGTTTTGTAGTAGTTTTTGATTTATTCGAATATCTTTTAAGGCTATTTTCTTTCAATCGTTGTCTTGGATCAGTTATCTAATTATTTTGTAACTTTTTCTTTTCTGCGTACTCCAAAATTGAGTAAAGAATGATGACAATGAATGATATGGCGGTAGTATTAAGTTGAAGGTCAAGTGGTCAACTGATTTGATACAGTTCAATATGCATTCTCTTTTATTTTGGTCGATGGATATGGATGACCTTGCCTTCTTGGAGAGGCAGAACTTGCAGTTAGTCGTTTTATTCTATTTCTTCACTGCCAACATTTTTTCTGATGTCCCACTTTCATCTCTCATTCATTTTGATTTTTTTATTTTAACTTTAGATAATTATAGTTCATTATAGTCCATATGCTTTAGAATTGTCTATTTTGGTCTCTGGGCTTTGAAAAAGTACATTTGATCTAATAGAATTATTTTGCTGTATCCATGGACTCTTAAAAATGAAAACAAAGGGACTAAAATGGACACTTTTTAAAAATATGAGAACAAAGTGGACATTTTAAAACTACATATACCAAAATGTGTTTAAGCTTTTTTGTTGTTACAGTATCACTAAAAGACTTGCATTTCATTGTCATCCGGCTTTATCCAAGTACATGATCCAAGTACATGGTCTATGAGATTGTCATCCTGATCTATCCAAGTACATGGTCTATGAGATGCCCTCTCGTCTATTTATAAATTTTCTTTTGTACATGTTCAACATTCGCTAAAGCAACAGTAGGAAAGAGTTTATGCAATCCTCAATTGCTCTCTATGAACTGATACTGAAAGACAACCGTAGGAAAGTTTTTAATAGTTTTTATTCAATTAGATTGCATCAGAAGATTATTTAGATTTTGGCATGTCTATATTGTCTTAATGCACGTTAGAAGTTGATACATGAATGCTGGACTTGATGTCTGGTCGTCATAACATAAGATTATACAGCTCTTTCTGCAATTTGTTCTAGTTTGTTATGAAGTCTTACAATATATTATCATCAGGCGGTCTTAACCTCGCATTAAGGCTCATTTTTGTCCATGCATTTGATTCACCTATGTCACTCGGAAACCTTGTCTCTGAAGTTTTTGGTTCCATTCGGTTCTTTTCCATTGGTTTTGTTATTTTGCAGAGAAACATTTGTAAAGTAGCAATTGATTCTTTTCTATTGATTTATAGAAATGAGATTGTCTTGAAACTGCCCGAGCATTAATGTCCTGTAGTTGATTGATTGATGAAATTCGAGGTTACAGGAGGATTAGAGAGTTCACAAAGTAAATTGTTACTTGTTTAGTGGAAACTTTCAAGGGCTATGTATAGTTGTTAGAACACTCTGTTATATGAGGTTGAAACTAAACTGCATATTGCATAAGGTGGATTTGCAAGGAAAGCAGATTCTGAACTCAGTTGTATTTTACGTTGTAAGTTCCAATGATAGTGTTCATCTAGTAATTATAAAGGTCAAGCAACAAAGTAGATGCCCTGTTATGCAATACTAGTTTTGAAGAACTGTGTTACTTTGAGTTACTTTAAAAAATAGGTTGCTTGACACTTGGAATAAACAAATGCTACATCTTAAGAACAGTTAACTAAGCATGTTCCTGAAGAAATCTTATTCTGGAAAATTACACAAACAGTTTCTGAAAAGCAATTTTCAAAGGTGTTGAGTTTTCTCAATTTTGAACCCCATGATTTGAACACAAAGAGAGCAAATGCAAATTCATGTTGATGAATCATTTGTTAGTTGAATAATTGATCTTGCACATGCTACTGTAGAACCTTGAACATATAAACTTACAATTCCTTTGCTTGTGTTTCATTGGTTAAGAAATCATGTTAATCTGATAATTACTCTAAAAGTCCAATTAATACCCCACTACAGCTCCATTATTACTAAACAAACGCACCAGCTTATTTCTAGATCAAAGATTATATGAAGAGGATCCCATCTATCATTTAATAATAGAAATTGCATCACTATGATATTAACATTACAAAATTAAAGGTCAAATCAATCTTTTGAACTTCTTTTTTCTTTAATTCAAACAATCCAATCATATAATAACTAAAGATGCATATTTAAAGCAATTGTATAATATCACAAAGGTGCCCTTTTTTAGAAAAAAAAAATGTTAAAAGCTATTGTATTAACATCAAGAAGGTACTTCATCGATACCAAGAGTGAGATATCATACTTTAGCAATGGTCACTATATCCTAAACTAGATATTTTTGTGGAGTTAACCTAATTTAAACTATAGGTTAAACTACAATTTTAGTTTCTAAACTTCGAGGTTTGTGTCCATTTAGTTTTTGAACTTTTAAAGTTACTGATGGTACCTAGATTTTGATATATATATTTTTTTGACACAGGTAAAAGTTCATGAACCAAATAGGTACGCTAGTTCGAGAACTAAACTTATAATTTAACTTAAAACATACAAATCTAACACGAGGTAGAGAAACAAACCTGATTACGTAGAACAAATAGATCTGGTCAAATCTCTTTGAATCAAAAGGGATGATTAAGAAGAAAAGTGGAGGGTAGAGAATGGAAATAATGATGGGGAAGAAATAGAATTAGGGTTTATAATGAGAAGAGTAGAATGAAGAAGGGAGAAAGTAGATAAGATTAGATAGACAATACTAACATATAGAAAAGGAAAAGAAGAAAAATGGAATGTGGAGCCCTCACTGAAAGAAAGAATAGGGGAATGAAATGAAACGAGGGATAAGAATAAGAAGAAATAATGAACAGATCTTTAAAAAATAATAATAGTAATAAAATAAAATAATAATAAATAAATATGGGGTTGTCTCATGTGCAAAATTTCCCTTAATCCCAAACATAAAAACATGGTATATAATATATATATATATATCACTTTTTTTTTTAATAAAAAATTTGTTAAATAAAGAGAAAAAAAAAAAGAAAAAAATTTATAAAATTGGTTTGGTGTTGCCCTTTCCCTCTCTCCTTATTTTAAAGCGACAACTACTTCTTTGATAGATACCCATTTTTAGCTCTCTCTTTCTCTCTCTCAGTTTTCTCTTTCCTCTCTCTCTTTCTGAGGTTTCTCTCTCCTCTCTCCTCTCTCCTCTCTCCTCTCTGTCTCTGTCTCTCTCTCTCTCTCTCTCTCCAGAAGGGCAAAACCCTAAAGAGCCGTTTACACCCACCCAAGTAAGTTTCTCTCTTTCTCTTTCTCTCTCTCTCTTACCCCTTCTTTCTCTCTCTCCTTCTCTTTCTCTCTCTTTCTCTCTTTCTCTCTCTCGAGTTTGAATATATGGGTCCTTCTTGTACTATTAGCCCTGTTATCTGCAACCCCATATAAATATGCATTTGCTTCTTCTTTTGTTTTTGTGATTCCCGGATTTTGAATCTACAATTTTGCTTAGATCTGTGGTTTTTGAGATTTGGGTGTTTTCAAAACCCTAATTTGTTGCAGTTTCTGCATGATGGATGTTTGAGATCTGATGTTTTCTTTACTTCGAAGTGGTTTTTGACTTTTTGGGATCTCTGTTTAGGGCTTCTATTTTGATTTCGATGTTGTCAATGAGAACAAGTTTTCATTTTGATGATTTCTTCTTTTGTTTAATTTTGGTTGTAATGTTTGAATATTCGAGTTAGGCTGCGGAAATGAGATCTGAACCCTAATTCATTTGATTGATCGAATTAAATGTTGGATCTGGAATCTTGTTTTATTACGAAATAATATCTTGAAAGAAGGTCGATCTACGAAATTAGTGTCTAAATTTTCATTGCTCCCTATCTCATGTAAGATTTCAGTTCCTCTTTGATTGTCCTGTTGTTTTTTTTTTCTTCATTTCCCTTCTCCTTTTAGGATATTTTTGTTGCTTCATTTGTTTTCATTCCCAAGTCATTATGATGTGGTTGAACAATTGCCATCCAAGTACTGTTATTGGGAAGATTCAAAACCTGAGAACTTTTCTCTTATTGTTATTATGTTTGATATGTAGTTCTCATGGATGACGATGCGTTAAACATGCGCAATTGGGGCTATTATGAGCCCTCTTTTAAAGGGCATCTCGGCCTGCAGCTCATGTCCTCCATTTCTGAGCGGGACATGAAACATTTCTTACCCGGCCGTGAACCTTCTGTTATGGTTAATGCTAATGGCTCGTTCCATCCACGGGATTGCGTTGTTTCGGAAGCACCGGTGCATATGAACTATGTGAGGGACAATTGGGGGGGCAACCCCAGAGATAGGTTTCTTAATATGTTGCCTGCCAATCATAACTATCCTGTCATTCCAGAAACTTCAGGAGCTCACTCCTTGCAGATCTTGCAACCACCCTCTTCATCTAGGGATGAAATAGCAGCAAGTAGAGTTGAAGAGCCTCCAGTGAAGAAGGAAGGTGGGAAAGCAAAGAAAAGACAGAATAGTGAGAGTGGCCCCAAAACCCCCAAAGCTAAAAAACCAAGGAAACCGAAAGATACTAGCATGGCTGTGCAGCGTGTGAAACCACCAAAGAAGAATATTGATCTTGTTATAAATGGGATTGATATGGACATTTCAAGTATTCCAATCCCAGTCTGCTCTTGCACTGGAGCTCCTCATCAATGCTATAGGTGGGGATGTGGTGGTTGGCAGTCTGCTTGTTGTACTACCAACATATCAACTTATCCTTTGCCCATGAGTGACAAAAGACGTGGGGCAAGGATAGCTGGGCGAAAAATGAGTCAGGGTGCGTTTAAGAAGGTACTTGAGAAACTAGCAGCTGATGGCTATAATTTTGCTAACCCGATCGATTTGAGGACTCACTGGGCGAGACATGGTACTAATAAGTTTGTCACAATCAGGTAGACTAATTCTCTTGGTACTTCATGGGTTCCCGGGTTATGTTGCATCACGTCTGCTTGAATTGTGTATATATCTGCACCCTTCTGCAGAGCTCTGCTGGTAGTTTGTAGTTGATGAGTTTTATTAACATTTCATATTTGTTTACGACATTTGGACACGTTGGTTCTTTTATTTATTTTCCCCTTCAGCTTATGTGATACGTGCAAACAAAAAAATTTATCTGGATGAACTTTCATTTATTTGAGATTCCTTTTGAAAGTATTTCTTATTCTTTCTTCCATGTTGAAGATTAGTAACAGATCCTTCCAAGTTCTAACTATTATTGGAGCATGTGACACTTCCCCTCCATGTTGA

mRNA sequence

ATGTCTCTCCCTTCTTTTTCCTTACTCTATTTGTTTGTCGCTGCAATCTTGCTCTTCCTCAACTTAGATCATGCCGTTTACTCCCACCCAATTCTAATGGATGGTGATGCATTGAACATGCGTAATTGGGGTTACTATGAACCATCGTTGAAAGCGCATCTTGGTCTGCAGCTCATGTCCACAATTGTACCGACAAGCTGGGGGAGGGATGGTTGGATAAATCATCATAGAGACAAGCTTTTCAATATGTTAACCCCAACTACTAGTTATTCTCTGCTTGCAGAAACATCAGGCCCGCAACCCTTGCAAATGTTACAGCCGCATAACACATCAAGAGATGAAATGGTCCTCAAGATTGAAGAACCACCTGTGAAGAAGGGAACTAAACAACCGAAGAAAAGACAGAATGGGGGTGCTTCCAAAACACCAAAACCAAAGAAGCCGCGAAAGCCCAAAAATAATGATCCTTCAGTTCAGCAGGTGAAGGCACCAAAAAAGAAGATGGAACTTGTTATAAATGGGTTTGATATGGATATATCTAGCATCCCAATTCCAGTATGCTCTTGCACTGGAACTCCTCACCAATGTTATAGATGGGGCTATGGTGGCTGGCAATCAGCTTGTTGTACCACAAGTTTATCTCTACATCCTTTGCCCATGAGTGAGAAGCGGCGAGGTGCAAGAATTGCTGGTCGAAAAATGAGTCAAGGTGCTTTTAAGAAGGTTTTGGAGAAACTAGCAGCTCAAGGCTATAACTTTTCTAACCCAATTGATTTAAGATGCCATTGGGCAAGGCATGGGACTAATAAGTTTGTCACCATCAGAGCCGTTTACACCCACCCAATTCTCATGGATGACGATGCGTTAAACATGCGCAATTGGGGCTATTATGAGCCCTCTTTTAAAGGGCATCTCGGCCTGCAGCTCATGTCCTCCATTTCTGAGCGGGACATGAAACATTTCTTACCCGGCCGTGAACCTTCTGTTATGGTTAATGCTAATGGCTCGTTCCATCCACGGGATTGCGTTGTTTCGGAAGCACCGGTGCATATGAACTATGTGAGGGACAATTGGGGGGGCAACCCCAGAGATAGGTTTCTTAATATGTTGCCTGCCAATCATAACTATCCTGTCATTCCAGAAACTTCAGGAGCTCACTCCTTGCAGATCTTGCAACCACCCTCTTCATCTAGGGATGAAATAGCAGCAAGTAGAGTTGAAGAGCCTCCAGTGAAGAAGGAAGGTGGGAAAGCAAAGAAAAGACAGAATAGTGAGAGTGGCCCCAAAACCCCCAAAGCTAAAAAACCAAGGAAACCGAAAGATACTAGCATGGCTGTGCAGCGTGTGAAACCACCAAAGAAGAATATTGATCTTGTTATAAATGGGATTGATATGGACATTTCAAGTATTCCAATCCCAGTCTGCTCTTGCACTGGAGCTCCTCATCAATGCTATAGGTGGGGATGTGGTGGTTGGCAGTCTGCTTGTTGTACTACCAACATATCAACTTATCCTTTGCCCATGAGTGACAAAAGACGTGGGGCAAGGATAGCTGGGCGAAAAATGAGTCAGGGTGCGTTTAAGAAGGTACTTGAGAAACTAGCAGCTGATGGCTATAATTTTGCTAACCCGATCGATTTGAGGACTCACTGGGCGAGACATGGTACTAATAAGTTTGTCACAATCAGTAACAGATCCTTCCAAGTTCTAACTATTATTGGAGCATGTGACACTTCCCCTCCATGTTGA

Coding sequence (CDS)

ATGTCTCTCCCTTCTTTTTCCTTACTCTATTTGTTTGTCGCTGCAATCTTGCTCTTCCTCAACTTAGATCATGCCGTTTACTCCCACCCAATTCTAATGGATGGTGATGCATTGAACATGCGTAATTGGGGTTACTATGAACCATCGTTGAAAGCGCATCTTGGTCTGCAGCTCATGTCCACAATTGTACCGACAAGCTGGGGGAGGGATGGTTGGATAAATCATCATAGAGACAAGCTTTTCAATATGTTAACCCCAACTACTAGTTATTCTCTGCTTGCAGAAACATCAGGCCCGCAACCCTTGCAAATGTTACAGCCGCATAACACATCAAGAGATGAAATGGTCCTCAAGATTGAAGAACCACCTGTGAAGAAGGGAACTAAACAACCGAAGAAAAGACAGAATGGGGGTGCTTCCAAAACACCAAAACCAAAGAAGCCGCGAAAGCCCAAAAATAATGATCCTTCAGTTCAGCAGGTGAAGGCACCAAAAAAGAAGATGGAACTTGTTATAAATGGGTTTGATATGGATATATCTAGCATCCCAATTCCAGTATGCTCTTGCACTGGAACTCCTCACCAATGTTATAGATGGGGCTATGGTGGCTGGCAATCAGCTTGTTGTACCACAAGTTTATCTCTACATCCTTTGCCCATGAGTGAGAAGCGGCGAGGTGCAAGAATTGCTGGTCGAAAAATGAGTCAAGGTGCTTTTAAGAAGGTTTTGGAGAAACTAGCAGCTCAAGGCTATAACTTTTCTAACCCAATTGATTTAAGATGCCATTGGGCAAGGCATGGGACTAATAAGTTTGTCACCATCAGAGCCGTTTACACCCACCCAATTCTCATGGATGACGATGCGTTAAACATGCGCAATTGGGGCTATTATGAGCCCTCTTTTAAAGGGCATCTCGGCCTGCAGCTCATGTCCTCCATTTCTGAGCGGGACATGAAACATTTCTTACCCGGCCGTGAACCTTCTGTTATGGTTAATGCTAATGGCTCGTTCCATCCACGGGATTGCGTTGTTTCGGAAGCACCGGTGCATATGAACTATGTGAGGGACAATTGGGGGGGCAACCCCAGAGATAGGTTTCTTAATATGTTGCCTGCCAATCATAACTATCCTGTCATTCCAGAAACTTCAGGAGCTCACTCCTTGCAGATCTTGCAACCACCCTCTTCATCTAGGGATGAAATAGCAGCAAGTAGAGTTGAAGAGCCTCCAGTGAAGAAGGAAGGTGGGAAAGCAAAGAAAAGACAGAATAGTGAGAGTGGCCCCAAAACCCCCAAAGCTAAAAAACCAAGGAAACCGAAAGATACTAGCATGGCTGTGCAGCGTGTGAAACCACCAAAGAAGAATATTGATCTTGTTATAAATGGGATTGATATGGACATTTCAAGTATTCCAATCCCAGTCTGCTCTTGCACTGGAGCTCCTCATCAATGCTATAGGTGGGGATGTGGTGGTTGGCAGTCTGCTTGTTGTACTACCAACATATCAACTTATCCTTTGCCCATGAGTGACAAAAGACGTGGGGCAAGGATAGCTGGGCGAAAAATGAGTCAGGGTGCGTTTAAGAAGGTACTTGAGAAACTAGCAGCTGATGGCTATAATTTTGCTAACCCGATCGATTTGAGGACTCACTGGGCGAGACATGGTACTAATAAGTTTGTCACAATCAGTAACAGATCCTTCCAAGTTCTAACTATTATTGGAGCATGTGACACTTCCCCTCCATGTTGA

Protein sequence

MSLPSFSLLYLFVAAILLFLNLDHAVYSHPILMDGDALNMRNWGYYEPSLKAHLGLQLMSTIVPTSWGRDGWINHHRDKLFNMLTPTTSYSLLAETSGPQPLQMLQPHNTSRDEMVLKIEEPPVKKGTKQPKKRQNGGASKTPKPKKPRKPKNNDPSVQQVKAPKKKMELVINGFDMDISSIPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLSLHPLPMSEKRRGARIAGRKMSQGAFKKVLEKLAAQGYNFSNPIDLRCHWARHGTNKFVTIRAVYTHPILMDDDALNMRNWGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPRDCVVSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEIAASRVEEPPVKKEGGKAKKRQNSESGPKTPKAKKPRKPKDTSMAVQRVKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTISNRSFQVLTIIGACDTSPPC
Homology
BLAST of CcUC01G007120.1 vs. NCBI nr
Match: KAG6579174.1 (Protein BASIC PENTACYSTEINE1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1018.1 bits (2631), Expect = 3.1e-293
Identity = 488/534 (91.39%), Postives = 509/534 (95.32%), Query Frame = 0

Query: 33  MDGDALNMRNWGYYEPSLKAHLGLQLMSTI---VPTSWGRDGWINHHRDKLFNMLTPTTS 92
           MDGDALNMRNWGYYEPSLK  LGLQL+STI   VPT+W RDGW+NH  DK FNML P TS
Sbjct: 110 MDGDALNMRNWGYYEPSLKPPLGLQLISTIAERVPTNWARDGWMNHQSDKFFNMLPPNTS 169

Query: 93  YSLLAETSGPQPLQMLQPHNTSRDEMVLKIEEPPVKKGTKQPKKRQNGGASKTPKPKKPR 152
           YSLLA+TS PQPL +LQPH+TSRDEMVL+IEEP VKKG KQ KKRQNGGA KTP PKKPR
Sbjct: 170 YSLLAQTSAPQPLPILQPHDTSRDEMVLRIEEPSVKKGAKQSKKRQNGGAPKTPNPKKPR 229

Query: 153 KPKNNDPSVQQVKAPKKKMELVINGFDMDISSIPIPVCSCTGTPHQCYRWGYGGWQSACC 212
           KPKN DPSVQQVKAPKKK++LVINGF+MDISSIPIPVCSCTGTPHQCYRWGYGGWQSACC
Sbjct: 230 KPKNKDPSVQQVKAPKKKLDLVINGFNMDISSIPIPVCSCTGTPHQCYRWGYGGWQSACC 289

Query: 213 TTSLSLHPLPMSEKRRGARIAGRKMSQGAFKKVLEKLAAQGYNFSNPIDLRCHWARHGTN 272
           TTSLS+HPLPMSEKRRGARIAGRKMSQGAFKKVLEKLAA+GYNFSNPIDLR HWARHGTN
Sbjct: 290 TTSLSVHPLPMSEKRRGARIAGRKMSQGAFKKVLEKLAAEGYNFSNPIDLRSHWARHGTN 349

Query: 273 KFVTIRAVYTHPILMDDDALNMRNWGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSV 332
           KFVTIR VYTHPILMDDDALNMRNWGYYEPSFKGHLGLQLMS++SERD+KH+LPGR+PSV
Sbjct: 350 KFVTIRTVYTHPILMDDDALNMRNWGYYEPSFKGHLGLQLMSTMSERDIKHYLPGRDPSV 409

Query: 333 MVNANGSFHPRDCVVSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQ 392
           MVNANGSFHPRDCVVSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQ
Sbjct: 410 MVNANGSFHPRDCVVSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQ 469

Query: 393 ILQPPSSSRDEIAASRVEEPPVKKEGGKAKKRQNSESGPKTPKAKKPRKPKDTS-MAVQR 452
           ILQPPSSSRDE+A SRVEEPPVKKEGGKAKKRQ++E+GPKTPKAKKPRKPKDTS  AVQR
Sbjct: 470 ILQPPSSSRDEMAGSRVEEPPVKKEGGKAKKRQSNEAGPKTPKAKKPRKPKDTSTAAVQR 529

Query: 453 VKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPM 512
           VKPPKKNIDLVINGIDMDIS IPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPM
Sbjct: 530 VKPPKKNIDLVINGIDMDISGIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPM 589

Query: 513 SDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           SDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI
Sbjct: 590 SDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 643

BLAST of CcUC01G007120.1 vs. NCBI nr
Match: KAG7016690.1 (Protein BASIC PENTACYSTEINE1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 989.2 bits (2556), Expect = 1.5e-284
Identity = 482/570 (84.56%), Postives = 508/570 (89.12%), Query Frame = 0

Query: 30  PILMDGDALNMRNWGYYEPSLKAHLGLQLMSTI--------------------------- 89
           P+ MDGDALNMRNWGYYEPSLK  LGLQL+STI                           
Sbjct: 94  PVPMDGDALNMRNWGYYEPSLKPPLGLQLISTIAERGVKHFLPGRDPSAIVNINGAFHSR 153

Query: 90  --------VPTSWGRDGWINHHRDKLFNMLTPTTSYSLLAETSGPQPLQMLQPHNTSRDE 149
                   VPT+W RDGW+NH  DK FNML P TSYSLLA+TS PQPL +LQPH+TSRDE
Sbjct: 154 ESVVSEASVPTNWARDGWMNHQSDKFFNMLPPNTSYSLLAQTSAPQPLPILQPHDTSRDE 213

Query: 150 MVLKIEEPPVKKGTKQPKKRQNGGASKTPKPKKPRKPKNNDPSVQQVKAPKKKMELVING 209
           MVL+IEEP VKKG KQ KKRQNGGA KTP PKKPRKPKNNDPSVQQVKAPKKK++L+ING
Sbjct: 214 MVLRIEEPSVKKGAKQSKKRQNGGAPKTPNPKKPRKPKNNDPSVQQVKAPKKKLDLIING 273

Query: 210 FDMDISSIPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLSLHPLPMSEKRRGARIAGRKM 269
           F+MDISSIPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLS+HPLPMSEKRRGARIAGRKM
Sbjct: 274 FNMDISSIPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLSVHPLPMSEKRRGARIAGRKM 333

Query: 270 SQGAFKKVLEKLAAQGYNFSNPIDLRCHWARHGTNKFVTIRAV-YTHPILMDDDALNMRN 329
           SQGAFKKVLEKLAA+GYNFSNPIDLR HWARHGTNKFVTI  +  +  +LMDDDALNMRN
Sbjct: 334 SQGAFKKVLEKLAAEGYNFSNPIDLRSHWARHGTNKFVTISNLGVSLAVLMDDDALNMRN 393

Query: 330 WGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPRDCVVSEAPVHMNY 389
           WGYYEPSFKGHLGLQLMS++SERD+KH+LPGR+PSVMVNANGSFHPRDCVVSEAPVHMNY
Sbjct: 394 WGYYEPSFKGHLGLQLMSTMSERDIKHYLPGRDPSVMVNANGSFHPRDCVVSEAPVHMNY 453

Query: 390 VRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEIAASRVEEPPVKK 449
           VRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDE+A SRVEEPPVKK
Sbjct: 454 VRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEMAGSRVEEPPVKK 513

Query: 450 EGGKAKKRQNSESGPKTPKAKKPRKPKDTS-MAVQRVKPPKKNIDLVINGIDMDISSIPI 509
           EGGKAKKRQ++E+GPKTPKAKKPRKPKDTS  AVQRVKPPKKNIDLVINGIDMDIS IPI
Sbjct: 514 EGGKAKKRQSNEAGPKTPKAKKPRKPKDTSTAAVQRVKPPKKNIDLVINGIDMDISGIPI 573

Query: 510 PVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFKKVLE 563
           PVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFKKVLE
Sbjct: 574 PVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFKKVLE 633

BLAST of CcUC01G007120.1 vs. NCBI nr
Match: KAG6601899.1 (Protein BASIC PENTACYSTEINE1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 894.4 bits (2310), Expect = 5.1e-256
Identity = 447/585 (76.41%), Postives = 475/585 (81.20%), Query Frame = 0

Query: 13  VAAILLFLNLDHAVYSHPILMDGDALNMRNWGYYEPSLKAHLGLQLMSTI---------- 72
           V  +++   L  A Y+HPILMDGDALNMRNWGYYEPS KA L LQLMSTI          
Sbjct: 26  VENVVIIKELTTAGYTHPILMDGDALNMRNWGYYEPSGKAQLRLQLMSTIFEQDMKHFLP 85

Query: 73  -------------------------VPTSWGRDGWINHHRDKLFNMLTPTTSYSLLAETS 132
                                    VPT+W RDGWINH+RDKL NML P  +YS  AETS
Sbjct: 86  GRNPSAMINMNGVFHSRGSGVSEPPVPTNWVRDGWINHNRDKLLNMLPPNPTYSTHAETS 145

Query: 133 GPQPLQMLQPHNTSRDEMVLKIEEPPVKKGTKQPKKRQNGGASKTPKPKKPRKPKNNDPS 192
             QP+QMLQ H+TS DEMV +I+EP  KK TKQ KKRQ+GGA K PKPKKPRK KNNDPS
Sbjct: 146 AAQPVQMLQAHDTSIDEMVGRIDEPSEKKETKQLKKRQHGGAPKVPKPKKPRKAKNNDPS 205

Query: 193 VQQVKAPKKKMELVINGFDMDISSIPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLSLHP 252
           VQQVKAPKKKM+LVING DMDIS IPIP+CSCTGTPHQCYRWGYGGWQSACCTT+LS+HP
Sbjct: 206 VQQVKAPKKKMDLVINGLDMDISGIPIPICSCTGTPHQCYRWGYGGWQSACCTTTLSVHP 265

Query: 253 LPMSEKRRGARIAGRKMSQGAFKKVLEKLAAQGYNFSNPIDLRCHWARHGTNKFVTIRAV 312
           LPMSEKRRGARIAGRKMS GAFKKVLEKLA Q                            
Sbjct: 266 LPMSEKRRGARIAGRKMSLGAFKKVLEKLAGQ---------------------------- 325

Query: 313 YTHPILMDDDALNMRNWGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSF 372
               +LMDDDALNMRNWGYYEPSFKGHLGLQLMS I+ERDMKHFLPGR+PSVMVNAN +F
Sbjct: 326 ----VLMDDDALNMRNWGYYEPSFKGHLGLQLMSPITERDMKHFLPGRDPSVMVNANATF 385

Query: 373 HPRDCVVSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSS 432
           HPRDCVVSEAPVHMNY+RDNWGGNPRDRFLNMLPANHNYPV+PETSGAHSLQILQPPSSS
Sbjct: 386 HPRDCVVSEAPVHMNYIRDNWGGNPRDRFLNMLPANHNYPVLPETSGAHSLQILQPPSSS 445

Query: 433 RDEIAASRVEEPPVKKEGGKAKKRQNSESGPKTPKAKKPRKPKDTSMAVQRVKPPKKNID 492
           RDE+ A+RVEEPPVKKEGGKAKKRQ+SE+GPKTPKAKKPRK KDTS A  R KPPKKNID
Sbjct: 446 RDELVANRVEEPPVKKEGGKAKKRQSSETGPKTPKAKKPRKLKDTSTAAPRAKPPKKNID 505

Query: 493 LVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARI 552
           LVINGIDMDIS IPIPVCSCTG+PHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARI
Sbjct: 506 LVINGIDMDISGIPIPVCSCTGSPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARI 565

Query: 553 AGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           AGRKMSQGAFKKVLEKLAADGYNFANPIDLR+HWARHGTNKFVTI
Sbjct: 566 AGRKMSQGAFKKVLEKLAADGYNFANPIDLRSHWARHGTNKFVTI 578

BLAST of CcUC01G007120.1 vs. NCBI nr
Match: KAG7032597.1 (Protein BASIC PENTACYSTEINE1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 881.3 bits (2276), Expect = 4.5e-252
Identity = 439/565 (77.70%), Postives = 463/565 (81.95%), Query Frame = 0

Query: 33  MDGDALNMRNWGYYEPSLKAHLGLQLMSTI------------------------------ 92
           MDGDALNMRNWGYYEPS KA L LQLMSTI                              
Sbjct: 1   MDGDALNMRNWGYYEPSGKAQLRLQLMSTIFEQDMKHFLPGRNPSAMINMNGVFHSRGSG 60

Query: 93  -----VPTSWGRDGWINHHRDKLFNMLTPTTSYSLLAETSGPQPLQMLQPHNTSRDEMVL 152
                VPT+W RDGWINH+RDKL NML P  +YS  AETS  QP+QMLQ H+TS DEMV 
Sbjct: 61  VSEPPVPTNWVRDGWINHNRDKLLNMLPPNPTYSTHAETSAAQPVQMLQAHDTSIDEMVG 120

Query: 153 KIEEPPVKKGTKQPKKRQNGGASKTPKPKKPRKPKNNDPSVQQVKAPKKKMELVINGFDM 212
           +I+EP  KK TKQ KKRQ+GGA K PKPKKPRK KNNDPSVQQVKAPKKKM+LVING DM
Sbjct: 121 RIDEPSEKKETKQLKKRQHGGAPKVPKPKKPRKAKNNDPSVQQVKAPKKKMDLVINGLDM 180

Query: 213 DISSIPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLSLHPLPMSEKRRGARIAGRKMSQG 272
           DIS IPIP+CSCTGTPHQCYRWGYGGWQSACCTT+LS+HPLPMSEKRRGARIAGRKMS G
Sbjct: 181 DISGIPIPICSCTGTPHQCYRWGYGGWQSACCTTTLSVHPLPMSEKRRGARIAGRKMSLG 240

Query: 273 AFKKVLEKLAAQGYNFSNPIDLRCHWARHGTNKFVTIRAVYTHPILMDDDALNMRNWGYY 332
           AFKKVLEKLA Q                                +LMDDDALNMRNWGYY
Sbjct: 241 AFKKVLEKLAGQ--------------------------------VLMDDDALNMRNWGYY 300

Query: 333 EPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPRDCVVSEAPVHMNYVRDN 392
           EPSFKGHLGLQLMS I+ERDMKHFLPGR+PSVMVNAN +FHPRDCVVSEAPVHMNY+RDN
Sbjct: 301 EPSFKGHLGLQLMSPITERDMKHFLPGRDPSVMVNANATFHPRDCVVSEAPVHMNYIRDN 360

Query: 393 WGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEIAASRVEEPPVKKEGGK 452
           WGGNPRDRFLNMLPANHNYPV+PETSGAHSLQILQPPSSSRDE+ A+RVEEPPVKKEGGK
Sbjct: 361 WGGNPRDRFLNMLPANHNYPVLPETSGAHSLQILQPPSSSRDELVANRVEEPPVKKEGGK 420

Query: 453 AKKRQNSESGPKTPKAKKPRKPKDTSMAVQRVKPPKKNIDLVINGIDMDISSIPIPVCSC 512
           AKKRQ+SE+GPKTPKAKKPRK KDTS A  R KPPKKNIDLVINGIDMDIS IPIPVCSC
Sbjct: 421 AKKRQSSETGPKTPKAKKPRKLKDTSTAAPRAKPPKKNIDLVINGIDMDISGIPIPVCSC 480

Query: 513 TGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFKKVLEKLAAD 563
           TG+PHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFKKVLEKLAAD
Sbjct: 481 TGSPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFKKVLEKLAAD 533

BLAST of CcUC01G007120.1 vs. NCBI nr
Match: PPS01424.1 (hypothetical protein GOBAR_AA19225 [Gossypium barbadense])

HSP 1 Score: 667.5 bits (1721), Expect = 1.0e-187
Identity = 356/574 (62.02%), Postives = 418/574 (72.82%), Query Frame = 0

Query: 33  MDGDALNMRNWGYYEPSLKAHLGLQLMSTIV----------------------------- 92
           MD +ALNMRNWGYYEPS K HL LQLMS++V                             
Sbjct: 1   MDENALNMRNWGYYEPSFKGHLSLQLMSSMVERDAKSFIPGRDSNLMVTTNTAFHQQDPV 60

Query: 93  ------PTSWGRDGWINHHRDKLFNMLTPTT-SYSLLAETSGPQPLQMLQ--PHNTSRDE 152
                 P ++ RD WI   R+K+F+M   TT +Y++L ET     L +LQ  P +++RDE
Sbjct: 61  VSEVHIPMNYVRDSWI-ADREKIFSMFPATTPNYAVLLETPAAYSLPILQPPPDSSTRDE 120

Query: 153 MVL-KIEEPPVKKGTKQPKKRQNGGASKTPKPKKPRKPKNN-DPSVQQVKAPKKKMELVI 212
            V   +EEPP  K   +PKKRQ G A K P+ KKP+KPK N + +VQ VK+ KK +   I
Sbjct: 121 RVASSVEEPPANKEGVEPKKRQGGAAPKMPEAKKPKKPKENANYTVQCVKSAKKSIVFKI 180

Query: 213 NGFDMDISSIPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLSLHPLPMSEKRRGARIAGR 272
           NG+DMDIS IPIPVCSCTGT  QCYRWG+GGWQSACCTT++S++PLPMS KRRGARIAGR
Sbjct: 181 NGYDMDISGIPIPVCSCTGTAQQCYRWGFGGWQSACCTTNVSMYPLPMSTKRRGARIAGR 240

Query: 273 KMSQGAFKKVLEKLAAQGYNFSNPIDLRCHWARHGTNKFVTIRAVYTHPILMDDDALNMR 332
           KMSQGAFKKVLEKLAA+ YNFS+PIDLR HWARHGTNKFVTI +       MD++ALNMR
Sbjct: 241 KMSQGAFKKVLEKLAAENYNFSSPIDLRSHWARHGTNKFVTISS------FMDENALNMR 300

Query: 333 NWGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPRDCVVSEAPVHMN 392
           NWGYYEPSFKGHL LQLMSS+ ERD K F+PGR+ ++MV  N +FH +D VVSE  + MN
Sbjct: 301 NWGYYEPSFKGHLSLQLMSSMVERDAKSFIPGRDSNLMVTTNTAFHQQDPVVSEVHIPMN 360

Query: 393 YVRDNWGGNPRDRFLNMLPA-NHNYPVIPETSGAHSLQILQPP--SSSRDEIAASRVEEP 452
           YVRD+W  + R++  +M PA   NY V+ ET  A+SL ILQPP  SS+RDE  AS VEEP
Sbjct: 361 YVRDSWIAD-REKIFSMFPATTPNYAVLLETPAAYSLPILQPPPDSSTRDERVASSVEEP 420

Query: 453 PVKKEGGKAKKRQNSESGPKTPKAKKPRKPKD-TSMAVQRVKPPKKNIDLVINGIDMDIS 512
           P  KEG + KKRQ   + PK P+AKKP+KPK+  +  VQ VK  KK+I   ING DMDIS
Sbjct: 421 PANKEGVEPKKRQGG-AAPKMPEAKKPKKPKENANYTVQCVKSAKKSIVFKINGYDMDIS 480

Query: 513 SIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFK 563
            IPIPVCSCTG   QCYRWG GGWQSACCTTN+S YPLPMS KRRGARIAGRKMSQGAFK
Sbjct: 481 GIPIPVCSCTGTAQQCYRWGFGGWQSACCTTNVSMYPLPMSTKRRGARIAGRKMSQGAFK 540

BLAST of CcUC01G007120.1 vs. ExPASy Swiss-Prot
Match: Q9LDE2 (Protein BASIC PENTACYSTEINE2 OS=Arabidopsis thaliana OX=3702 GN=BPC2 PE=1 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 4.3e-88
Identity = 177/297 (59.60%), Postives = 214/297 (72.05%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEP---SFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPR 343
           MDDD    RNWGYYEP   +FKG+LGLQLMS+I +R+ K FLPGR+P++M+  NGS+H +
Sbjct: 1   MDDD--GFRNWGYYEPAAATFKGNLGLQLMSTI-DRNTKPFLPGRDPNLMMGPNGSYHHQ 60

Query: 344 DCVVSEAPVHMNYVRDNWGGNPRDRFLNMLP---ANHNY-PVIPETSGAHSLQILQPPSS 403
                E P+HM+Y   NW    +D+F NMLP   A  NY  V+PETS A S+Q+      
Sbjct: 61  -----EPPIHMSY---NWINQQKDKFFNMLPVTTATPNYGNVLPETSSAPSMQM------ 120

Query: 404 SRDEIAASRVEEPPVKKEGG---KAKKRQNSESGPKTPKAKKPRKPKD--------TSMA 463
             +     + EE PVK E     + KKR+ +     TPKAKKPRKPKD         +  
Sbjct: 121 --NLHHHLQTEENPVKLEEEIVVQTKKRKTNAKAGSTPKAKKPRKPKDENSNNNNNNNTN 180

Query: 464 VQRVKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYP 523
           V RVKP KK++DLVING+ MDIS +P+P+C+CTGAP QCYRWGCGGWQSACCTTNIS +P
Sbjct: 181 VTRVKPAKKSVDLVINGVSMDISGLPVPICTCTGAPQQCYRWGCGGWQSACCTTNISMHP 240

Query: 524 LPMSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           LPMS KRRGARI+GRKMSQGAFKKVLEKLA+DG+NF NPIDL++HWARHGTNKFVTI
Sbjct: 241 LPMSTKRRGARISGRKMSQGAFKKVLEKLASDGFNFGNPIDLKSHWARHGTNKFVTI 278

BLAST of CcUC01G007120.1 vs. ExPASy Swiss-Prot
Match: Q9SKD0 (Protein BASIC PENTACYSTEINE1 OS=Arabidopsis thaliana OX=3702 GN=BPC1 PE=1 SV=1)

HSP 1 Score: 314.7 bits (805), Expect = 2.2e-84
Identity = 168/295 (56.95%), Postives = 211/295 (71.53%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEP----SFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHP 343
           MDDD    RNWGYYEP    SFKG+LGLQLMS+I +R+ K FLPGRE ++M+ +NGS+H 
Sbjct: 1   MDDD--GFRNWGYYEPAAASSFKGNLGLQLMSTI-DRNTKPFLPGRESNLMIGSNGSYHS 60

Query: 344 RDCVVSEAPVHMNYVRDNWGGNPRD-RFLNMLP-ANHNYP-VIPETSGAHSLQILQPPSS 403
           R+         MNY   +W   P+D +F NMLP +  +Y  V+ ETSG++S+Q++  P  
Sbjct: 61  RE-------QDMNY---SWINQPKDNKFFNMLPISTPSYSNVLSETSGSNSIQMIHQPVL 120

Query: 404 SRDEIAASRVEEP-PVKKEGGKAKKRQNSESGPKTPKAKKPRKPKD--------TSMAVQ 463
           +      + +  P P +++ GK +K + S + P  PKAKK RKPK+             Q
Sbjct: 121 NSSRFEENPIPPPAPCEEQTGKKRKMRGSIATPTVPKAKKMRKPKEERDVTNNNVQQQQQ 180

Query: 464 RVKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLP 523
           RVKP KK++DLVING+ MDIS +P+PVC+CTG P QCYRWGCGGWQSACCTTNIS YPLP
Sbjct: 181 RVKPVKKSVDLVINGVSMDISGLPVPVCTCTGTPQQCYRWGCGGWQSACCTTNISVYPLP 240

Query: 524 MSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           MS KRRGARI+GRKMSQGAFKKVLEKL+ +GY+F N IDL++HWARHGTNKFVTI
Sbjct: 241 MSTKRRGARISGRKMSQGAFKKVLEKLSTEGYSFGNAIDLKSHWARHGTNKFVTI 282

BLAST of CcUC01G007120.1 vs. ExPASy Swiss-Prot
Match: Q9C9X6 (Protein BASIC PENTACYSTEINE3 OS=Arabidopsis thaliana OX=3702 GN=BPC3 PE=1 SV=1)

HSP 1 Score: 256.1 bits (653), Expect = 9.3e-67
Identity = 148/296 (50.00%), Postives = 191/296 (64.53%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEPS-FKGHLGLQLMSSISERDMKHFLPGRE-------PSVMVNANG 343
           M++D LN RNWGYYEPS F+ +LG QL+ SI +R+ K FL           PS +     
Sbjct: 1   MEEDGLNNRNWGYYEPSQFRPNLGFQLIPSILDRNEKPFLSPHSQNLNFITPSNVYGGGS 60

Query: 344 S---FHPRDCVVSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSG-AHSLQIL 403
           S    +PRD  VS+AP  M+Y   +W        LN    +  +  +PE S    S+Q+L
Sbjct: 61  SSVVSYPRDYTVSDAP-FMSY---SW--------LNQHKDSKFFSNVPEVSRMTQSMQLL 120

Query: 404 QPPSSSRDEIAASRVEEPPVKKEGGKAKKRQNSESGPK--TPKAKKPRKPKDTSM-AVQR 463
           Q                P V  E  ++ KR++   G +   PK KK +K KD +M  VQR
Sbjct: 121 Q----------------PEVVTEVDESVKRRHCSGGQRGGVPKVKKEKKLKDNNMPRVQR 180

Query: 464 VKPP--KKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPL 523
            + P  +K I++VING+ MDI  +P+PVCSCTG P QCYRWGCGGWQSACCTTN+S YPL
Sbjct: 181 ERSPLLRKCIEMVINGVSMDIGGLPVPVCSCTGMPQQCYRWGCGGWQSACCTTNVSMYPL 240

Query: 524 PMSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           P++ KRRGARIAGRKMSQGAF+KVLEKL++DG++F+NPIDL++HWA+HGTNKFVTI
Sbjct: 241 PVNTKRRGARIAGRKMSQGAFRKVLEKLSSDGFDFSNPIDLKSHWAKHGTNKFVTI 268

BLAST of CcUC01G007120.1 vs. ExPASy Swiss-Prot
Match: Q8GUC3 (Protein Barley B recombinant OS=Hordeum vulgare OX=4513 GN=BBR PE=1 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 6.2e-55
Identity = 115/201 (57.21%), Postives = 137/201 (68.16%), Query Frame = 0

Query: 88  TSYSLLAETSGPQPLQMLQPHN---TSRDEMVLKIEEPPVKKGTKQP-KKRQNGGASKTP 147
           T + ++ +  G   LQM+QP        +++   + E     G+K P KKRQ G   K P
Sbjct: 153 TGFGMMPDARGAHTLQMMQPQEPPVPDEEKITPPLVEDHSVVGSKPPVKKRQQGRQPKVP 212

Query: 148 KPKKPRK---------PKNNDPSVQQVKAPKKKMELVINGFDMDISSIPIPVCSCTGTPH 207
           KPKKP+K         PK   P   + + P K +E+VING D DIS IP PVCSCTG P 
Sbjct: 213 KPKKPKKDATPGEDGAPKARAP---RSRGPLKPVEMVINGIDFDISRIPTPVCSCTGAPQ 272

Query: 208 QCYRWGYGGWQSACCTTSLSLHPLPMSEKRRGARIAGRKMSQGAFKKVLEKLAAQGYNFS 267
           QCYRWG GGWQSACCTTS+S +PLPM+ KRRGARIAGRKMSQGAFKKVLEKLA +GYN +
Sbjct: 273 QCYRWGAGGWQSACCTTSISTYPLPMNTKRRGARIAGRKMSQGAFKKVLEKLAGEGYNLN 332

Query: 268 NPIDLRCHWARHGTNKFVTIR 276
           NPIDL+  WA+HGTNKFVTIR
Sbjct: 333 NPIDLKTFWAKHGTNKFVTIR 350

BLAST of CcUC01G007120.1 vs. ExPASy Swiss-Prot
Match: P0DH88 (Barley B recombinant-like protein A OS=Oryza sativa subsp. japonica OX=39947 GN=Os10g0114500 PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 2.0e-53
Identity = 114/214 (53.27%), Postives = 144/214 (67.29%), Query Frame = 0

Query: 73  INHHRDKLFNMLTPTTSYSLLAETSGPQPLQMLQPHNTSRDEMV---LKIEEPPVKKGTK 132
           I HH    + M+  T +  ++ + + PQ LQ   P    ++E +   L  E  PV     
Sbjct: 130 IAHHDPVGYGMIPGTHTLQMMQQQTEPQ-LQPPPPPQQPKEECISSPLIEENVPVIDEPP 189

Query: 133 QPKKRQNGGASKTPKPKKPRK---PKN-----NDPSVQQVKAPKKKMELVINGFDMDISS 192
            PKKRQ G   K P+ KKP+K   P+      N P+ ++ + P+K + +VING D+D+S 
Sbjct: 190 PPKKRQQGRQPKVPRAKKPKKSAAPREDGAPPNAPAPRR-RGPRKNIGMVINGIDLDLSR 249

Query: 193 IPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLSLHPLPMSEKRRGARIAGRKMSQGAFKK 252
           IP P+CSCTG P QCYRWG GGWQSACCTT++S +PLPMS KRRGARIAGRKMS GAFKK
Sbjct: 250 IPTPICSCTGAPQQCYRWGAGGWQSACCTTTISTYPLPMSTKRRGARIAGRKMSHGAFKK 309

Query: 253 VLEKLAAQGYNFSNPIDLRCHWARHGTNKFVTIR 276
           VLEKLA +GYN +NPIDL+  WA+HGTNKFVTIR
Sbjct: 310 VLEKLAGEGYNLNNPIDLKTFWAKHGTNKFVTIR 341

BLAST of CcUC01G007120.1 vs. ExPASy TrEMBL
Match: A0A2P5XDL7 (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_AA19225 PE=3 SV=1)

HSP 1 Score: 667.5 bits (1721), Expect = 4.9e-188
Identity = 356/574 (62.02%), Postives = 418/574 (72.82%), Query Frame = 0

Query: 33  MDGDALNMRNWGYYEPSLKAHLGLQLMSTIV----------------------------- 92
           MD +ALNMRNWGYYEPS K HL LQLMS++V                             
Sbjct: 1   MDENALNMRNWGYYEPSFKGHLSLQLMSSMVERDAKSFIPGRDSNLMVTTNTAFHQQDPV 60

Query: 93  ------PTSWGRDGWINHHRDKLFNMLTPTT-SYSLLAETSGPQPLQMLQ--PHNTSRDE 152
                 P ++ RD WI   R+K+F+M   TT +Y++L ET     L +LQ  P +++RDE
Sbjct: 61  VSEVHIPMNYVRDSWI-ADREKIFSMFPATTPNYAVLLETPAAYSLPILQPPPDSSTRDE 120

Query: 153 MVL-KIEEPPVKKGTKQPKKRQNGGASKTPKPKKPRKPKNN-DPSVQQVKAPKKKMELVI 212
            V   +EEPP  K   +PKKRQ G A K P+ KKP+KPK N + +VQ VK+ KK +   I
Sbjct: 121 RVASSVEEPPANKEGVEPKKRQGGAAPKMPEAKKPKKPKENANYTVQCVKSAKKSIVFKI 180

Query: 213 NGFDMDISSIPIPVCSCTGTPHQCYRWGYGGWQSACCTTSLSLHPLPMSEKRRGARIAGR 272
           NG+DMDIS IPIPVCSCTGT  QCYRWG+GGWQSACCTT++S++PLPMS KRRGARIAGR
Sbjct: 181 NGYDMDISGIPIPVCSCTGTAQQCYRWGFGGWQSACCTTNVSMYPLPMSTKRRGARIAGR 240

Query: 273 KMSQGAFKKVLEKLAAQGYNFSNPIDLRCHWARHGTNKFVTIRAVYTHPILMDDDALNMR 332
           KMSQGAFKKVLEKLAA+ YNFS+PIDLR HWARHGTNKFVTI +       MD++ALNMR
Sbjct: 241 KMSQGAFKKVLEKLAAENYNFSSPIDLRSHWARHGTNKFVTISS------FMDENALNMR 300

Query: 333 NWGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPRDCVVSEAPVHMN 392
           NWGYYEPSFKGHL LQLMSS+ ERD K F+PGR+ ++MV  N +FH +D VVSE  + MN
Sbjct: 301 NWGYYEPSFKGHLSLQLMSSMVERDAKSFIPGRDSNLMVTTNTAFHQQDPVVSEVHIPMN 360

Query: 393 YVRDNWGGNPRDRFLNMLPA-NHNYPVIPETSGAHSLQILQPP--SSSRDEIAASRVEEP 452
           YVRD+W  + R++  +M PA   NY V+ ET  A+SL ILQPP  SS+RDE  AS VEEP
Sbjct: 361 YVRDSWIAD-REKIFSMFPATTPNYAVLLETPAAYSLPILQPPPDSSTRDERVASSVEEP 420

Query: 453 PVKKEGGKAKKRQNSESGPKTPKAKKPRKPKD-TSMAVQRVKPPKKNIDLVINGIDMDIS 512
           P  KEG + KKRQ   + PK P+AKKP+KPK+  +  VQ VK  KK+I   ING DMDIS
Sbjct: 421 PANKEGVEPKKRQGG-AAPKMPEAKKPKKPKENANYTVQCVKSAKKSIVFKINGYDMDIS 480

Query: 513 SIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMSQGAFK 563
            IPIPVCSCTG   QCYRWG GGWQSACCTTN+S YPLPMS KRRGARIAGRKMSQGAFK
Sbjct: 481 GIPIPVCSCTGTAQQCYRWGFGGWQSACCTTNVSMYPLPMSTKRRGARIAGRKMSQGAFK 540

BLAST of CcUC01G007120.1 vs. ExPASy TrEMBL
Match: A0A5A7VPX9 (GAGA-binding transcriptional activator OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold174G00040 PE=3 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 1.4e-158
Identity = 276/287 (96.17%), Postives = 282/287 (98.26%), Query Frame = 0

Query: 280 HPILMDDDALNMRNWGYYEPSFKG-HLGLQLMSSISERDMKHFLPGREPSVMVNANGSFH 339
           HPILMDDDALNMRNWGYYEPSFKG HLGLQLMS+ISERDMKHFLPGR+PSVMVNANGSFH
Sbjct: 9   HPILMDDDALNMRNWGYYEPSFKGNHLGLQLMSTISERDMKHFLPGRDPSVMVNANGSFH 68

Query: 340 PRDCVVSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSR 399
           PRDCVVSEAPVHMNYVRDNWGGN RDRFLNMLPANH+YPV+PETSGAHSLQILQPPSSSR
Sbjct: 69  PRDCVVSEAPVHMNYVRDNWGGN-RDRFLNMLPANHSYPVMPETSGAHSLQILQPPSSSR 128

Query: 400 DEIAASRVEEPPVKKEGGKAKKRQNSESGPKTPKAKKPRKPKDTSMAVQRVKPPKKNIDL 459
           DEIAASRVEEPPVKKEGGKAKKRQ+SE+GPKTPKAKKPRKPKDTS AVQRVKPPKKNIDL
Sbjct: 129 DEIAASRVEEPPVKKEGGKAKKRQSSEAGPKTPKAKKPRKPKDTSTAVQRVKPPKKNIDL 188

Query: 460 VINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIA 519
           VINGIDMDIS IPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIA
Sbjct: 189 VINGIDMDISCIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIA 248

Query: 520 GRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTISNR 566
           GRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTIS R
Sbjct: 249 GRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTISYR 294

BLAST of CcUC01G007120.1 vs. ExPASy TrEMBL
Match: A0A6J1JYQ0 (GAGA-binding transcriptional activator OS=Cucurbita maxima OX=3661 GN=LOC111489086 PE=3 SV=1)

HSP 1 Score: 560.5 bits (1443), Expect = 8.5e-156
Identity = 267/280 (95.36%), Postives = 276/280 (98.57%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPRDCV 343
           MDDDALNMRNWGYYEPSFKGHLGLQLMS++SERD+KH+LPGR+PSVMVNANGSFHPRDCV
Sbjct: 1   MDDDALNMRNWGYYEPSFKGHLGLQLMSTMSERDIKHYLPGRDPSVMVNANGSFHPRDCV 60

Query: 344 VSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEIAA 403
           VSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDE+A 
Sbjct: 61  VSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEMAG 120

Query: 404 SRVEEPPVKKEGGKAKKRQNSESGPKTPKAKKPRKPKDTS-MAVQRVKPPKKNIDLVING 463
           SRVEEPPVKKEGGKAKKRQ++E+GPKTPKAKKPRKPKDTS  AVQRVKPPKKNIDLVING
Sbjct: 121 SRVEEPPVKKEGGKAKKRQSNEAGPKTPKAKKPRKPKDTSTAAVQRVKPPKKNIDLVING 180

Query: 464 IDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKM 523
           IDMDIS IPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKM
Sbjct: 181 IDMDISGIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKM 240

Query: 524 SQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           SQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI
Sbjct: 241 SQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 280

BLAST of CcUC01G007120.1 vs. ExPASy TrEMBL
Match: A0A0A0KJH7 (GAGA-binding transcriptional activator OS=Cucumis sativus OX=3659 GN=Csa_5G092910 PE=3 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 3.2e-155
Identity = 268/279 (96.06%), Postives = 274/279 (98.21%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPRDCV 343
           MDDDALNMRNWGYYEPSFKGHLGLQLMS+ISERDMKHFLPGR+PSVMVNANGSFHPRDCV
Sbjct: 1   MDDDALNMRNWGYYEPSFKGHLGLQLMSTISERDMKHFLPGRDPSVMVNANGSFHPRDCV 60

Query: 344 VSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEIAA 403
           VSEAPVHMNYVRDNWGGN RDRFLNMLP NH+YPV+PETSGAHSLQILQPPSSSRDEIAA
Sbjct: 61  VSEAPVHMNYVRDNWGGN-RDRFLNMLPTNHSYPVMPETSGAHSLQILQPPSSSRDEIAA 120

Query: 404 SRVEEPPVKKEGGKAKKRQNSESGPKTPKAKKPRKPKDTSMAVQRVKPPKKNIDLVINGI 463
           SRVEEPPVKKEGGKAKKRQ+SE+GPK PKAKKPRKPKDTS AVQRVKPPKKNIDLVINGI
Sbjct: 121 SRVEEPPVKKEGGKAKKRQSSEAGPKAPKAKKPRKPKDTSTAVQRVKPPKKNIDLVINGI 180

Query: 464 DMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMS 523
           DMDIS IPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMS
Sbjct: 181 DMDISCIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKMS 240

Query: 524 QGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           QGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI
Sbjct: 241 QGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 278

BLAST of CcUC01G007120.1 vs. ExPASy TrEMBL
Match: A0A6J1FK68 (GAGA-binding transcriptional activator OS=Cucurbita moschata OX=3662 GN=LOC111444968 PE=3 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 3.2e-155
Identity = 266/280 (95.00%), Postives = 275/280 (98.21%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEPSFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPRDCV 343
           MDDDALNMRNWGYYEPSFKGHLGLQLMS++SERD+KH+LPGR+PSVMVNANGSFHPRDCV
Sbjct: 1   MDDDALNMRNWGYYEPSFKGHLGLQLMSTMSERDIKHYLPGRDPSVMVNANGSFHPRDCV 60

Query: 344 VSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEIAA 403
           VSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDE+A 
Sbjct: 61  VSEAPVHMNYVRDNWGGNPRDRFLNMLPANHNYPVIPETSGAHSLQILQPPSSSRDEMAG 120

Query: 404 SRVEEPPVKKEGGKAKKRQNSESGPKTPKAKKPRKPKDTS-MAVQRVKPPKKNIDLVING 463
           SRVEEPPVKKEGGKAKKRQ++E+GPKTPKAKKPRKPKDTS  AV RVKPPKKNIDLVING
Sbjct: 121 SRVEEPPVKKEGGKAKKRQSNEAGPKTPKAKKPRKPKDTSTAAVHRVKPPKKNIDLVING 180

Query: 464 IDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKM 523
           IDMDIS IPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKM
Sbjct: 181 IDMDISGIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLPMSDKRRGARIAGRKM 240

Query: 524 SQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           SQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI
Sbjct: 241 SQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 280

BLAST of CcUC01G007120.1 vs. TAIR 10
Match: AT1G14685.3 (basic pentacysteine 2 )

HSP 1 Score: 327.0 bits (837), Expect = 3.0e-89
Identity = 177/297 (59.60%), Postives = 214/297 (72.05%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEP---SFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPR 343
           MDDD    RNWGYYEP   +FKG+LGLQLMS+I +R+ K FLPGR+P++M+  NGS+H +
Sbjct: 1   MDDD--GFRNWGYYEPAAATFKGNLGLQLMSTI-DRNTKPFLPGRDPNLMMGPNGSYHHQ 60

Query: 344 DCVVSEAPVHMNYVRDNWGGNPRDRFLNMLP---ANHNY-PVIPETSGAHSLQILQPPSS 403
                E P+HM+Y   NW    +D+F NMLP   A  NY  V+PETS A S+Q+      
Sbjct: 61  -----EPPIHMSY---NWINQQKDKFFNMLPVTTATPNYGNVLPETSSAPSMQM------ 120

Query: 404 SRDEIAASRVEEPPVKKEGG---KAKKRQNSESGPKTPKAKKPRKPKD--------TSMA 463
             +     + EE PVK E     + KKR+ +     TPKAKKPRKPKD         +  
Sbjct: 121 --NLHHHLQTEENPVKLEEEIVVQTKKRKTNAKAGSTPKAKKPRKPKDENSNNNNNNNTN 180

Query: 464 VQRVKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYP 523
           V RVKP KK++DLVING+ MDIS +P+P+C+CTGAP QCYRWGCGGWQSACCTTNIS +P
Sbjct: 181 VTRVKPAKKSVDLVINGVSMDISGLPVPICTCTGAPQQCYRWGCGGWQSACCTTNISMHP 240

Query: 524 LPMSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           LPMS KRRGARI+GRKMSQGAFKKVLEKLA+DG+NF NPIDL++HWARHGTNKFVTI
Sbjct: 241 LPMSTKRRGARISGRKMSQGAFKKVLEKLASDGFNFGNPIDLKSHWARHGTNKFVTI 278

BLAST of CcUC01G007120.1 vs. TAIR 10
Match: AT1G14685.2 (basic pentacysteine 2 )

HSP 1 Score: 327.0 bits (837), Expect = 3.0e-89
Identity = 177/297 (59.60%), Postives = 214/297 (72.05%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEP---SFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPR 343
           MDDD    RNWGYYEP   +FKG+LGLQLMS+I +R+ K FLPGR+P++M+  NGS+H +
Sbjct: 1   MDDD--GFRNWGYYEPAAATFKGNLGLQLMSTI-DRNTKPFLPGRDPNLMMGPNGSYHHQ 60

Query: 344 DCVVSEAPVHMNYVRDNWGGNPRDRFLNMLP---ANHNY-PVIPETSGAHSLQILQPPSS 403
                E P+HM+Y   NW    +D+F NMLP   A  NY  V+PETS A S+Q+      
Sbjct: 61  -----EPPIHMSY---NWINQQKDKFFNMLPVTTATPNYGNVLPETSSAPSMQM------ 120

Query: 404 SRDEIAASRVEEPPVKKEGG---KAKKRQNSESGPKTPKAKKPRKPKD--------TSMA 463
             +     + EE PVK E     + KKR+ +     TPKAKKPRKPKD         +  
Sbjct: 121 --NLHHHLQTEENPVKLEEEIVVQTKKRKTNAKAGSTPKAKKPRKPKDENSNNNNNNNTN 180

Query: 464 VQRVKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYP 523
           V RVKP KK++DLVING+ MDIS +P+P+C+CTGAP QCYRWGCGGWQSACCTTNIS +P
Sbjct: 181 VTRVKPAKKSVDLVINGVSMDISGLPVPICTCTGAPQQCYRWGCGGWQSACCTTNISMHP 240

Query: 524 LPMSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           LPMS KRRGARI+GRKMSQGAFKKVLEKLA+DG+NF NPIDL++HWARHGTNKFVTI
Sbjct: 241 LPMSTKRRGARISGRKMSQGAFKKVLEKLASDGFNFGNPIDLKSHWARHGTNKFVTI 278

BLAST of CcUC01G007120.1 vs. TAIR 10
Match: AT1G14685.1 (basic pentacysteine 2 )

HSP 1 Score: 327.0 bits (837), Expect = 3.0e-89
Identity = 177/297 (59.60%), Postives = 214/297 (72.05%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEP---SFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHPR 343
           MDDD    RNWGYYEP   +FKG+LGLQLMS+I +R+ K FLPGR+P++M+  NGS+H +
Sbjct: 1   MDDD--GFRNWGYYEPAAATFKGNLGLQLMSTI-DRNTKPFLPGRDPNLMMGPNGSYHHQ 60

Query: 344 DCVVSEAPVHMNYVRDNWGGNPRDRFLNMLP---ANHNY-PVIPETSGAHSLQILQPPSS 403
                E P+HM+Y   NW    +D+F NMLP   A  NY  V+PETS A S+Q+      
Sbjct: 61  -----EPPIHMSY---NWINQQKDKFFNMLPVTTATPNYGNVLPETSSAPSMQM------ 120

Query: 404 SRDEIAASRVEEPPVKKEGG---KAKKRQNSESGPKTPKAKKPRKPKD--------TSMA 463
             +     + EE PVK E     + KKR+ +     TPKAKKPRKPKD         +  
Sbjct: 121 --NLHHHLQTEENPVKLEEEIVVQTKKRKTNAKAGSTPKAKKPRKPKDENSNNNNNNNTN 180

Query: 464 VQRVKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYP 523
           V RVKP KK++DLVING+ MDIS +P+P+C+CTGAP QCYRWGCGGWQSACCTTNIS +P
Sbjct: 181 VTRVKPAKKSVDLVINGVSMDISGLPVPICTCTGAPQQCYRWGCGGWQSACCTTNISMHP 240

Query: 524 LPMSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           LPMS KRRGARI+GRKMSQGAFKKVLEKLA+DG+NF NPIDL++HWARHGTNKFVTI
Sbjct: 241 LPMSTKRRGARISGRKMSQGAFKKVLEKLASDGFNFGNPIDLKSHWARHGTNKFVTI 278

BLAST of CcUC01G007120.1 vs. TAIR 10
Match: AT2G01930.1 (basic pentacysteine1 )

HSP 1 Score: 314.7 bits (805), Expect = 1.6e-85
Identity = 168/295 (56.95%), Postives = 211/295 (71.53%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEP----SFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHP 343
           MDDD    RNWGYYEP    SFKG+LGLQLMS+I +R+ K FLPGRE ++M+ +NGS+H 
Sbjct: 1   MDDD--GFRNWGYYEPAAASSFKGNLGLQLMSTI-DRNTKPFLPGRESNLMIGSNGSYHS 60

Query: 344 RDCVVSEAPVHMNYVRDNWGGNPRD-RFLNMLP-ANHNYP-VIPETSGAHSLQILQPPSS 403
           R+         MNY   +W   P+D +F NMLP +  +Y  V+ ETSG++S+Q++  P  
Sbjct: 61  RE-------QDMNY---SWINQPKDNKFFNMLPISTPSYSNVLSETSGSNSIQMIHQPVL 120

Query: 404 SRDEIAASRVEEP-PVKKEGGKAKKRQNSESGPKTPKAKKPRKPKD--------TSMAVQ 463
           +      + +  P P +++ GK +K + S + P  PKAKK RKPK+             Q
Sbjct: 121 NSSRFEENPIPPPAPCEEQTGKKRKMRGSIATPTVPKAKKMRKPKEERDVTNNNVQQQQQ 180

Query: 464 RVKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLP 523
           RVKP KK++DLVING+ MDIS +P+PVC+CTG P QCYRWGCGGWQSACCTTNIS YPLP
Sbjct: 181 RVKPVKKSVDLVINGVSMDISGLPVPVCTCTGTPQQCYRWGCGGWQSACCTTNISVYPLP 240

Query: 524 MSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           MS KRRGARI+GRKMSQGAFKKVLEKL+ +GY+F N IDL++HWARHGTNKFVTI
Sbjct: 241 MSTKRRGARISGRKMSQGAFKKVLEKLSTEGYSFGNAIDLKSHWARHGTNKFVTI 282

BLAST of CcUC01G007120.1 vs. TAIR 10
Match: AT2G01930.2 (basic pentacysteine1 )

HSP 1 Score: 314.7 bits (805), Expect = 1.6e-85
Identity = 168/295 (56.95%), Postives = 211/295 (71.53%), Query Frame = 0

Query: 284 MDDDALNMRNWGYYEP----SFKGHLGLQLMSSISERDMKHFLPGREPSVMVNANGSFHP 343
           MDDD    RNWGYYEP    SFKG+LGLQLMS+I +R+ K FLPGRE ++M+ +NGS+H 
Sbjct: 1   MDDD--GFRNWGYYEPAAASSFKGNLGLQLMSTI-DRNTKPFLPGRESNLMIGSNGSYHS 60

Query: 344 RDCVVSEAPVHMNYVRDNWGGNPRD-RFLNMLP-ANHNYP-VIPETSGAHSLQILQPPSS 403
           R+         MNY   +W   P+D +F NMLP +  +Y  V+ ETSG++S+Q++  P  
Sbjct: 61  RE-------QDMNY---SWINQPKDNKFFNMLPISTPSYSNVLSETSGSNSIQMIHQPVL 120

Query: 404 SRDEIAASRVEEP-PVKKEGGKAKKRQNSESGPKTPKAKKPRKPKD--------TSMAVQ 463
           +      + +  P P +++ GK +K + S + P  PKAKK RKPK+             Q
Sbjct: 121 NSSRFEENPIPPPAPCEEQTGKKRKMRGSIATPTVPKAKKMRKPKEERDVTNNNVQQQQQ 180

Query: 464 RVKPPKKNIDLVINGIDMDISSIPIPVCSCTGAPHQCYRWGCGGWQSACCTTNISTYPLP 523
           RVKP KK++DLVING+ MDIS +P+PVC+CTG P QCYRWGCGGWQSACCTTNIS YPLP
Sbjct: 181 RVKPVKKSVDLVINGVSMDISGLPVPVCTCTGTPQQCYRWGCGGWQSACCTTNISVYPLP 240

Query: 524 MSDKRRGARIAGRKMSQGAFKKVLEKLAADGYNFANPIDLRTHWARHGTNKFVTI 563
           MS KRRGARI+GRKMSQGAFKKVLEKL+ +GY+F N IDL++HWARHGTNKFVTI
Sbjct: 241 MSTKRRGARISGRKMSQGAFKKVLEKLSTEGYSFGNAIDLKSHWARHGTNKFVTI 282

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG6579174.13.1e-29391.39Protein BASIC PENTACYSTEINE1, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7016690.11.5e-28484.56Protein BASIC PENTACYSTEINE1, partial [Cucurbita argyrosperma subsp. argyrosperm... [more]
KAG6601899.15.1e-25676.41Protein BASIC PENTACYSTEINE1, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7032597.14.5e-25277.70Protein BASIC PENTACYSTEINE1, partial [Cucurbita argyrosperma subsp. argyrosperm... [more]
PPS01424.11.0e-18762.02hypothetical protein GOBAR_AA19225 [Gossypium barbadense][more]
Match NameE-valueIdentityDescription
Q9LDE24.3e-8859.60Protein BASIC PENTACYSTEINE2 OS=Arabidopsis thaliana OX=3702 GN=BPC2 PE=1 SV=1[more]
Q9SKD02.2e-8456.95Protein BASIC PENTACYSTEINE1 OS=Arabidopsis thaliana OX=3702 GN=BPC1 PE=1 SV=1[more]
Q9C9X69.3e-6750.00Protein BASIC PENTACYSTEINE3 OS=Arabidopsis thaliana OX=3702 GN=BPC3 PE=1 SV=1[more]
Q8GUC36.2e-5557.21Protein Barley B recombinant OS=Hordeum vulgare OX=4513 GN=BBR PE=1 SV=1[more]
P0DH882.0e-5353.27Barley B recombinant-like protein A OS=Oryza sativa subsp. japonica OX=39947 GN=... [more]
Match NameE-valueIdentityDescription
A0A2P5XDL74.9e-18862.02Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_AA19225 PE=3 SV... [more]
A0A5A7VPX91.4e-15896.17GAGA-binding transcriptional activator OS=Cucumis melo var. makuwa OX=1194695 GN... [more]
A0A6J1JYQ08.5e-15695.36GAGA-binding transcriptional activator OS=Cucurbita maxima OX=3661 GN=LOC1114890... [more]
A0A0A0KJH73.2e-15596.06GAGA-binding transcriptional activator OS=Cucumis sativus OX=3659 GN=Csa_5G09291... [more]
A0A6J1FK683.2e-15595.00GAGA-binding transcriptional activator OS=Cucurbita moschata OX=3662 GN=LOC11144... [more]
Match NameE-valueIdentityDescription
AT1G14685.33.0e-8959.60basic pentacysteine 2 [more]
AT1G14685.23.0e-8959.60basic pentacysteine 2 [more]
AT1G14685.13.0e-8959.60basic pentacysteine 2 [more]
AT2G01930.11.6e-8556.95basic pentacysteine1 [more]
AT2G01930.21.6e-8556.95basic pentacysteine1 [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR010409GAGA-binding transcriptional activatorSMARTSM01226GAGA_bind_2coord: 33..275
e-value: 2.7E-127
score: 438.9
coord: 284..563
e-value: 2.2E-166
score: 568.7
IPR010409GAGA-binding transcriptional activatorPFAMPF06217GAGA_bindcoord: 284..562
e-value: 3.1E-108
score: 362.6
coord: 69..275
e-value: 3.0E-84
score: 283.9
IPR010409GAGA-binding transcriptional activatorPANTHERPTHR31421FAMILY NOT NAMEDcoord: 284..563
IPR010409GAGA-binding transcriptional activatorPANTHERPTHR31421FAMILY NOT NAMEDcoord: 33..62
IPR010409GAGA-binding transcriptional activatorPANTHERPTHR31421FAMILY NOT NAMEDcoord: 67..275
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 389..451
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 401..444
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 120..165
NoneNo IPR availablePANTHERPTHR31421:SF0PROTEIN BASIC PENTACYSTEINE1-RELATEDcoord: 67..275
NoneNo IPR availablePANTHERPTHR31421:SF0PROTEIN BASIC PENTACYSTEINE1-RELATEDcoord: 284..563
NoneNo IPR availablePANTHERPTHR31421:SF0PROTEIN BASIC PENTACYSTEINE1-RELATEDcoord: 33..62

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CcUC01G007120CcUC01G007120gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CcUC01G007120.1-exonCcUC01G007120.1-exon-CicolChr01:7701245..7701305exon
CcUC01G007120.1-exonCcUC01G007120.1-exon-CicolChr01:7701600..7702443exon
CcUC01G007120.1-exonCcUC01G007120.1-exon-CicolChr01:7703241..7703260exon
CcUC01G007120.1-exonCcUC01G007120.1-exon-CicolChr01:7706608..7707244exon
CcUC01G007120.1-exonCcUC01G007120.1-exon-CicolChr01:7707350..7707445exon
CcUC01G007120.1-exonCcUC01G007120.1-exon-CicolChr01:7708613..7708632exon
CcUC01G007120.1-exonCcUC01G007120.1-exon-CicolChr01:7709269..7709339exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CcUC01G007120.1-cdsCcUC01G007120.1-cds-CicolChr01:7701245..7701305CDS
CcUC01G007120.1-cdsCcUC01G007120.1-cds-CicolChr01:7701600..7702443CDS
CcUC01G007120.1-cdsCcUC01G007120.1-cds-CicolChr01:7703241..7703260CDS
CcUC01G007120.1-cdsCcUC01G007120.1-cds-CicolChr01:7706608..7707244CDS
CcUC01G007120.1-cdsCcUC01G007120.1-cds-CicolChr01:7707350..7707445CDS
CcUC01G007120.1-cdsCcUC01G007120.1-cds-CicolChr01:7708613..7708632CDS
CcUC01G007120.1-cdsCcUC01G007120.1-cds-CicolChr01:7709269..7709339CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CcUC01G007120.1CcUC01G007120.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0009723 response to ethylene
cellular_component GO:0005634 nucleus
molecular_function GO:0003700 DNA-binding transcription factor activity
molecular_function GO:0043565 sequence-specific DNA binding