HG10003884 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10003884
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: glutamic acid-rich protein-like isoform X2
Location: Chr08: 11220440 .. 11231445 (-)
RNA-Seq Expression: HG10003884
Synteny: HG10003884
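The Location field packs the chromosome, start, end, and strand into one string. As a minimal sketch, the snippet below parses that string and derives the genomic span of the feature. It assumes the coordinates are 1-based and inclusive, which the record does not state; the helper names `parse_location` and `span_length` are hypothetical and used only for illustration.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr08: 11220440 .. 11231445 (-)'.

    Returns (seqid, start, end, strand).
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seqid, start, end, strand = parse_location("Chr08: 11220440 .. 11231445 (-)")
print(seqid, strand, span_length(start, end))  # Chr08 - 11006
```

Under that assumption the feature spans 11,006 bp on the minus strand of Chr08.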
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGAGGAATTACAGGACAACGATGCTCCAAACGAAGAAGCCGTGGATGTTGCTGTTGATATAGAGACGAAGATTCATAACGCTATGCGCTCTTGCTTCTCTCACATCAAGGAACAAGCCGAGTGAGCTTTTCTCTCTCACTTTCTCCTCATTTTCCGTTACCGTAATTCGTCTGTGTGGACTTGTGAAGCTTGTGCTGTAGTTTGTGTTCTTTTCGAGGAATTTCTCTTATTCTCTGTCATTTTTGGTGTTTTTGTTTTCAATTGCTGATAATGATGCCGTAATGTTTATCCTGCACCTGCCAAAATGCTTATTAGACCACCTCTAGGTGATTTTTTGTTCATATTTTTCTCTATCTTGGTAATATTTTCCTCGTTTAGTTTAATTATGTGTATGTATGTTTTGGGAGTTATGCAAGTTTTCGAACCTGTACTGTCAGCTGCGTTTCTTTTCTCTAAGTTCACGGTGGTTCTAGGGTTTCTTTTGGGCACGGCAATCTGTAAAAGGGAGGTGAATTTTATATCTCGTTCCATTCTTCTCGGTATGGTTAAATGTTTGGGCGGTTGTATTAAGCGGCTTTTAGTGTTTGGTATCAAATTGTAGGTTCGCGTCTAAACACAAGACAAGAGTAACGTTCACGTCACTTTCTTGCATGTAGATGCACGTGTGTAGCAACAGAACCTTTACATTTATTTGTGATTGTTAATTTTTGAATTGACATAATCTTATTGTTCCATTTCATTTTCTTTCTCTTTCATGCTCACTGGGGAAACATTATTGAATTAGGCCTTCACAATTTGGCTGCCTTTTATTAACCTGTTGGATGCCCTCTAATAACCACTAGATACTAAATTCTTGATGTTTTTGTAGTTTATCTCTCATACATCCAACAATTTTAACTGCTAGCCATGTTGTGGTGGTCTCCTAAATAATTTCTCGTCACTTGTAGTTGGTTTGGAAACCAATTTCATATCAGTTGCTGAATTTTGTTCTCAAAGCTAATTTTAATAACCATGGATGGGCGGCTGATGAAAATTTAAGTTATTTCTATAGTATTACATTGCAAATTCAGTGTAGGTACTCGAACCTACCATTTTTAGGAGGTAGAGATTACTTAACACTGAGCTCTACTTAACAATAACAATAAATTTGTTCTCTCTCACATCACCTTTCACAAGTCACGAGTACAACCCCATGAATTGAAAGAATCATATTCAACTGTTTCTGAACGCCACCAAGGACTCAAAAACAGGTCATCTAAGAAAGCAAGAGGGAAGGGGTTTTAAAATCAGTTACCAAACGTAATTTCACTTTTCTGTTTTCTCATAAGCACAAAACATGTTTGAAATGTCTCCCAAACAAGAATTTGGACTGACTTTTTGTTTCCTTGTGCACTTGCGTATTTTGTGCTCTTGACAAGTATTTTTGTGCGATTGATCTAAAATCAGCTGGAATTTGAACATCTGGTATAGAGCAATTGTCTTGACTGCAGGTTGTGTAAAATTATCAACATTGTGAGTGGATTCAAATAAACTTTTTTTTCTTTCTTTTAAACATACCTTCAGAAGATGGTGCGTCTTTATTTCAAGGATAAGGCACAAGGATGTTGCTCTGCTTCCTGCATTTGAAATTTTTTGCACTTTTCTATCAACTTATAAGAGCTGTTGATATCACACACAACGGCGGGAGATTATTATGGGGCTCAAGATACACGTCCTTGTATAAAAATCTTACAACATTGTTAACCACCATTATGTAGAGACTGGTTAAAAAGTTTGGATAATGTGAAAGTATAAGTGTATGACCTTACTAATTACTGGCAGCAGCATAACTGGAAGATTTGCCTCATTATAAGTTTTTTCAGTTACAAGAAATTTTATTTATTACTTTATTCTTAGACCTTAATTATAAGAGTAATGAAAAACATTGATACCTTTATCCAATCTTCTATTTATTTGTCTGTTTTTGGTTACAGCTCTTTAACTTTTGAGGGGGTT
AGAAGATTGTTAGAGAAGGACTTGTGTATGGAGATGTATACTTTAGATGTGCACAAAAGATATGTCAAGCAGTGTTTGGAGAAGGTAATGTTTTCTTCTTTCTCTATATGTGACATCTTTCTTTTCTTGGAGGTGGCATCCTATTTCTGTTATGCCTTCTGCTCAATTATATCATGTTCGGTTTTATTTTTGATATTTGTGGTGACATCTGATAAGTTCTTATATTTTAACGAAAAATGACAATTGAAAGAGTAGAAGAGCCTCAAGGAAAAAGTGCGTTTTATCTTTTCTTTTGTTAATATACTGTTTTATTAGAAGCTGAAGTCAAGTTGACAAAGATATATAATAAGAATGTAGACCTAAGAAATGCATTCTTGTAGGTTTATTGATAGAGTATCAGGATAATCCTCCCTTCCCCGATGAAAACGTTATATGCATACCATGACATCAACTTATTAATCCAGACTAATGAATCCATAAATCACAGAAGGAACCTATATCTTTTTGGCGTGAATATTGGATCACCTTCGTCAAATTAGCCCATTCATCTGCTTCTTGATCATTCAACCTCTTCATGGGTACCCTCCTCAAAAACTTATGAACATGATATCTTACACCAGTCTAGTCACTAGCATGTTACTCAACATGTGAATCTTCTATTTATAAAATTGAGTAAGTTTGCTGAACTCTATTGGGATTTCCTGTTGCTTCCATGTGTGTGAATTTGAAGGTAGAACAATTAATTGGTGGCATCCAACTTAAGCCTATCGTTTGGGTAGTGGCTATGAAAGCCACACTCGGGAAGATGTGGAATGAGAAAAATTCTAGAATTTTGGAGAGAGGAGGGTGAGACTGGGTGCAGTTTGGGACATCACTCATTTTTTGGCCTTTGCTTGGTGTACTCTCTTTGTCTCTATCTTGTTCTACAATTATACTCTTTGGTTAAAACATGAACTCAGGTCTCTCTTTGTAAGCCTTGGTGGCTCCTTTTGTATCTATTTAATATATGCTTAAAAGAAGTTCATTTCTTATTGAAGGAATATGGTCATAATTGAAAGTAATATGCTCCAAGTACTTTCTGAGGTGCCTTTGAAATTTAGAGTTCTCAATTTCTTGATCACAATGTTATTCTAATTTCTAGTTATTTCAGCACTCGGATGAAAATTTAGTTTGGACACCTAACATCATTCTTGAATTGAGTTTTTTATTTCCTTCTGTATCCTAGTGCATAGAAACTGCTTTGAAAGACAATGTATCCAAGGATTCTGAGGAGACTGGGAGAAAAAATGTAAATAAAGAAGAAGCAGCTGAGTCACCTGAAGGGCATCAGTCCAAGAAAGTTGTAAAGGAACCTTGCTTGGAAGATGAGGAAAACATGGAAGACTCTCCAGATATGGGCCTCATAGGACGTAACACAAAAAATGTTGAATCTGATGGAATCAAAGGAATCAAAGACAAAGATGACAAAGATATTCCTAGTGAGAGTAGAATAACAAAAGCTATTATAGAAAGAATTTCTTATCTTAAAGCTAATTCGGAGTAAGTTATTATTATTTTCTCCATTATGGATCACCGTAATGCTATGAGTTTTCATTCGTTTATGTTATATTTGTCTATGTACCTTTTCCTGTATTTTGATCCTTGTTGCTTTCTTAAAGCCCAAAATTGGTATGGAAGAGCATCATAATGGAGCCATATGTTTCTAGAAATTTATAATGTATTGTTTTACTTCTGGTTCACAAAAAACATTGTGGAATTGAAAAGAGTAATAGAGTAATTTTTTCAACTTTCAAGGATGGGTATCAGAAATGTTAATCAAGTACTCATAACTGAGGCTGTGGAAGCTGTGCAGCCTATTGAAGATGATGCAGCCGTAGGGCCTTTCCAACAAAGGGAAGGTAAGGCATATGTACAAGAAGAGGAATTTGATGACCTCTTCGATTATGAGGTTAGAGAGCAGACCCCAAAAGAGAGTTCTTTAGAGGGGCAGGAAGACTCTA
GTGCAGCTGAAGAGGTGGAGGAGGTAGTCAGGCAACTGAAAAGGTGACTCCAAAGAGGGTCTCTAGATCGGCAAGAAAGAGACAAGCCAAGGAAAAAAAGGAGAGGTCAGAGGTGAAGAGGCCCAAAATGAGAAACACCTCAAGAATTGTGATTACGTCATCCAGTGATGAAGTTGAGGAAGAGGAGGAGTTGGCTAAGAAGAGGGAAATTCTGAGGAGAAGGGAAATCTTTGAAGAGAATGGGTTCTCCAACAAAGCAGGAGAGCTGCCACCATTCATTACTTCTGTGGTGGCTTACTATGAGTGGGAGTCCCTGTGCAGGCAACCCTTGCCCACCAGTATTGCACTTGTGAGGGAGTTCTACCTCGGGATGCAGCCCGAGAGAAGCATCTCCATCGTTCGGGGGGTGGAAGTGGATTTTTCTGCTAGCACAATTAACTCCCTCTACTTGGTGTCGGACGAGCACGAGGAGGAAAGCTTTATGTGAATGTATGCTCCATCTGATGAGCAGATAGAGGATGCCTTGAGGATGGTGGGCACAAAAGGTGCTGCGTGGGTGATTTCATCAACTGGATGCCGCACCCTTCGGCCTGGGGACATTAAAGATAAGTTGGCTATCTGGTTGTACTTTATCAAGCATCGGGTAATGCCAACAACACGTGACACAACCATTTCCTTGGAAAGAGTCATGCTTTTCTACAGCATAAGGAGGATCCTCCCCATTAACTTAGGAGGAATCATTAGGCAGGAAATGGTGGAATGTGGCCCCAAGACGAGAGGGCGCTTGTTCTTCCCCTCCCTTATAGGCCAGTTATGTGCTGAGGTAGGTGTCGTGGTTGATGCTACTGAGGAATGTTGCAAGGTCAAGCTAGCAATCGATCTTGGTTTGATCAAGAGACTTCAAGAAAATATTGCTACCAGGAAGAGCAGGTCCTCAGCTTCAAAGTTGCCCTCAATGTCCTCCCCTCCATCGAGGACGCTTCCTCCTGTGATGATGCAAACTCCTATCAATGATGCCACATTCTCCTCGCCAGAACTAAACTTTGAAACTCCAATTCCAGATCATCAGACTCCTGTTTCTGCTCTTGCCTCAACTTGCCAAGTACCTTCTTCTGCTGCTCTTGCCACTATGGCTGGTGATGTGCCCGCAACTCCCTCTCAACAGCCATGTGGTGCCAATGCCCCTTCTATGCACGCCACCACTGTGGTGGGTTATATGGCTGAAAGACTTGAGAAATTTACGGCACAGTCACGTGCTTATTGGGCATATGCCAAAGAGAGAGATGATGCCTTCAAAAGATTCCTCAGCTCCGCCAAGCTTGACTATTATCCTGCAGTTCCTTCGTTTCCCGATCACATTCTTCATGACCCCAAGGATGAAGAACAAGGCAATGAAGTTGACGATGATGCTCCAGCTCATATCTAGGAGAGTATCATGTTCCTTCTCTTGTTCTTCATTTTCTATCTTAGAATTTGTGCTGATCTTAATTTGCTTTAGTTTCTTTTATTTTAGGTTCAGTTAGGTTTTGGCTTTTGAATAGGTTAGTTTTTGTTTTCTCTGCTGGAAATAAAATTTGTTTGACTGATGTTGATTTATGATAGTCTTGAGCATGCTTGTCTTGGTTTGTTGTGCAAAGTAGTTCTTAATGATGACATCTAGAGCATATCTTGCTTTCAACCCCAAAAGGATACAAAACCAAGAGTTTTATTTTGTCTATATGTACAAAATTTCCTTAGACTAGTTTCCTGCATGTTCCACTAGTTTAGCACACTAGGAGCGCCTAGCGTTACATGTTGCAGCCTAACTTTTGACCCATTTTATTTAGTTCTCTTCCTTAGTCTATGAGGACATTGCTTATTTTAGGTTGGGGGGAGAACTTTCACACATTCTATGTATCAAAGTAAGCTTCTGTTGTAGGAATGTAGTATATGCTATGTGCTACGCTGATTAAAAAAAGAAAGAAAGAAAGAAAAGAAGAAGAAAAAAAAAACTAAAATGA
CACTTCTCCTTTCCAAATTAACTAAAGTAGCCGATCATGACGCCCATGCTTTCTTTTGTAGCGGGTGTACATAAAGGTCTCGAAAACCGATCCGACTGATCGAATTAAGGTCAGTTGATTGGTTCTAGGGGTCGGTCGGTCGGTGTTAGTTTTGGGAGTCTCGGTTTTCTCCCAAAACCGACACCGATCGACGGTCGAAATTTTCAATATTTTCATGGTCGGGTTTCTCCCAAAATCGACTCTGACCGGCTTTTATACACCCGTAGAGGGAAGTGAGTGGTACCTAACGTGTGAGCCTTGCCCGTAGTCAGATGATCCGCATGACACCTGTGTGGGTAAGCTGGAACGAAAGTAAGGATGACATTGAGGATATGTCCTAGGCAAGAAAAAGGAAAAAGAAAATAACAAAGGAATAAAGGCCTAGCAAAAGTTCAAGTTTGGGGAAAGAGAGTTTTGTACTTCATATTAGATATGGTGTATTGCAAAGATGTAACTTGTTAGAGATCTCACTAGGTTCAATGACTAGTAGCTAGGTTAGTTGGGCATGAATAGGTAGATAGTTAAGGGATGTTGAGCCATTGGTCAAGTGAAACTCTTGTGAGGATTCTTTGGGAGCTTGTCACGTACTTATAAGGTCTAACATCCACCGGCCTGCCCCTTCCTTCAAAGTAATCCGATGATCCACCTCCATTTTTGGGGGTAATGTAGGTGGCCATTCAAAAATTGATTCCAATACATCTAGCAATGATTGGATTATCCTCGGAGTCTCATTAGATTTTGCTTGCTCCGATTGGCTTTGCTGTAGGTTTTGCTCCTCGGTTGCGATTGGTTGTTTGTAGTTCAACCAAAAAACCTTGATCCTCCTCAGAGCAAGCTCATGCCAACATCTTGAGAGAAAGCTCTGCTTTAGTCAAGGTCGGATCTCCTTTAATAGTTACCATATTCTCTTTCGAGGTGAATATCATAGTAAGGGCAGCCTAATTGACCCCCATAAATCCTATTGTTCTCAGCCATTGCATTCCCAATATCACGTCAACATTTTCCAAATCCAGAGGCAAAAGTCTTCTACAATTGACAACTCCGGTAGACTCACGTCAACCTTTACCTCTAATTGTCGTAAGTACCCCATAGTTAGAAGTCTCCATAATCAGAATATTTAGTTTCTCTTCTCTTTGCTTATGAATGGAGTTATGTGTAGCACCACAATCCATTACCACAACAACTTGTCTTCCCTTACCTCTCCTTTTCTTTTACTTTCATGGTTCTAGGAGCAAAAAACTCAAAACTGTTTGGCAATTTCGTCCACCTCTTGGGGTTTATTTTTCAGTTGCTATCTTCCAAACTTGCCAATTCAAACTCAGTTTCATCTTCCTCGTTCACTACGAACACTCTTAGCTTTCGTACTTCACAATTCTTGCAGTGATGGCCAATAAAGAATTCATCATAGCGAAAACACATTCCGTTTTCTTGTGTTGCCTGGTATTCTACATTATACAAACATTTCATGGGCACAGCCTTCCATTGAGTTTCGTGTCACTGTGCGAGTTATCAAAGGGTCTAAAGCCTTATTAAACAAGAGGCCATTGAGAAACATACTCTCCAAGACATCTTTAGCACCATTAGGCAACTAAGCAGTACTATTAAAAAACCTGCCTAAATTCGGCCACATCACCTTCTTGTTTTATCTTTAGAGATCAAACCATCGAACGTCTTGTGTAGGACAGAATCGTTCAAATAATGGGCCTTCAAGTCCGCCGAGGTGTCGAACTTACGACGATTGTTAGCCCATTGATACCAGTCGACTACATCTTTCTCAAAATGCACTATCGAAACTTTAATCTTTTCCGAATCTGTCATTTCTTGTCTTTCAGAATGCCTCTTTGCCCCAAAGCATGAGTTAGGATTTTCTTAAAAAAAAACCTAGCATCTCCAAGCTTTCCCTTATCCTTGATTGAAGTGGAATCTTCCTCTTCAATGATTTCTGTTTTCCTCTTAACGAA
ATTTCACGCACCTCGACTGTAAGTCTTTCCAAACATTTGTTCATATTTTGAAGCACATCTTTAATGCTTGATACTTCCTTCTCTGTTCCTTCCATTCTTTCTTCAAGTTATTTTGAGCCATTTTCCATGTTCTCTTTGGGATTAACATGCTCTGATACCAATTTGCAAGGTGTCTCTTACAAATTACTCAAAGAATCTTCCAAGCAAACCAAATAGTTCCAGATAACACAATTCAATGATATTCAAAAACTCAACTAAATGATAGTTTCAGAATTATAAGATAAAAGCAATCAAAGATGAACAAGAAAATGAAAATAAGCAAAGAATGTGATAACTTGGTTACTCGAGAGAATTGATACTCTCCCCCTCAAGCATTCAACTTCTACTCAAAACCTGAAAACTCTTCCCTCCTGCCAAACCCCTTTGGTGGTCCCCACCAGGCACACGCCACAACCACGTTTCTCTAACCAACTGAATTATGTGAAAGTTTCTTTTTGCTCTCCCTTTCATAGATATGAATAATCAGAGGTTTTATAGATAAACAATCAGAGGATTGCTCATGCTAGTTTCTTAATTTCCGAAATTACTATTGGGTTTTCCATTAATTCCCTATATTAATTAATTGATGCAAGAAAATGGTAAGTCTACCCTCCATTAAGAAGGTTAAATATAAGAACTCTTGTTAATTAATGGGAAACAATAGGCTTTCTGGAGTCAACTTTGAGGATCACCCCATAGTAATTTTTAATTCTCACCTCATCTAGTCCTTTTACAATAAGGATCACATCCAAATATGTGACTATCAACAATGATACGTAGGGGGATTTTAATATCAAATTTTTGTCATCAAAACACTTTAATTCTGGGAATGCATCAAAGTATTGATGGCCACCCTTAGGATTTAATATCCTATCGGTTTTCTTGGAAACTAAATGTATTTAATTAGGGTTTGTCCTGTGAGAACAAGACAAATGTCGAAAAAAAGGATTGAGTACACTACATGTAATTTTTCAATACATTGTACTGTTAGGGTTAGAAGACTCAAGTTTTATAGGGTTTCGGGTACACTACATGTACTCTATCATAATACTATTTTGTTTTGGGATGTGATGAGGGTGCTATGGGTGTGTCAACCTAATTGAGATAGTCGGTGCATGATCCCTAGTATTAGACTATTAATATTCGTATTTACTTAGTCTTTTTCTAATTCGAGCAATAGTCTCTTTTCATTTCATTAATGAAAAGTGTTGTTTCTGTTTATAAAAAAAGACTCAAATTTGATAGCATACTTAAAATGGTTTTGTTTGTTTCTTTTTTTTTTTTTTAATACTGACCTGTATCTTTTTTTTTTCCCTTTCTGATTATGGATATTGCAGGAACAAATGCTCCTCATAGTTCTACACTGGAAGTTCCAATATTTTGGTTCATTCATACAGAACCCTTATTAGTTGACAAACATTATCAGGCAAAGGCACTCTCCGACATGGTTATTGTAGTACAGTCGGAGATTTCATCCTGGGAAAGCCATTTGCAGTACAATGGGAAATCACTTATATGGGATATGAGGTTAGTTCAAGCATCTTTAGTTGCCTTGTGACCCTTATGATGTTTGGACTTTGAAAACATGATGGAATTGTATAACTTCCAGAGTTGCTGGTTTCCCATCATTCTAGGTGCTGGGCTCATCTGAATGAATTATTACTTGCCTTGGGTCGGGTGTCAAGTGCACGTCATTTTACTACCCATTTTCCTATTTTCTCCCCTTTCTGTAGAATCTGTCTCTGTCCCTCTCTCATTCTATCTCTATATTTATTTATTTTAATGAATAACGGAACTTTCATTGTGAAAAAATGAAAGAATACATGGACATAGGGTATCGGTTGGTCTACATCGATTTTGGGATTAAACTAGTGTCGACCACTCAAATGTTGGTTTCCAGCAGTCGATTGGAGTTGGTTTCGAAGAGGCTATGTATGTTGATTGACCAACCACATCGACGA
TCAGTCGGTTTTAGCTGGTTTTGTGAATCTTGAAAACTCAGGAAGAACAAACTAAAAAGAAAGAACGAAGAAAAGGAGAAGAAAAGGAGAAGAAAAAAGGACAGAACAAGAAGAAAATAAGGAGGAGGAAAGAGGGAGAAATGGAGGAGAATAACCTTGAGAAAGATTTATTTATTATTTTTTTCCCATAATAATTGTACAAACCTTGGGAAGGGCAACAAAGAGGTCAATATATTAGTCTATTGTTTTTATTGGTCAAATGATCCATTAATCAAGATGGAAATTTATAGTCAGAAAAGGGGAGCAAAATGTAATTTGTGTTTTATTCTATTTTGAAACAGAAAACTAATAATATAATCCTTTTGCTAACCCATGCATAATGCAATTAGTTTAGGGCTTTTAGGCAACTCTCGACTACAAGGTCGGTCAGATGTTCAAATTTCCCACATGTGGTTGAACTAAAAAAAAAAATTATAATCTTTTACTTCTCTATTTGGATCAATATGTATGATTGTGTGCTTGGATAATAAATACAAAATGCTCTCTGTAACTTATGGCATTCTGTAGAAACCTTAAGTTGTTGAAGGTAAACATGTAGAACCCAAACTTGTTGGTAATCATCAGTGACAGTGGTGACACCATATAAGTGGTAAACATGTGGACGACATTTCAATACGATTATCTCAGGAGTGCTGCTTGCTTCAAAATTTGCTTTTCTTTTCTGTTTTGAACCCTTTTTTGTAAATGAAATGTGATAAAATTAGTATGAAGGGCTTCATTTTCAATATATGTGGTTCACCACTTGTTTTAGCTGTTAACATTAAAGGCTCCTGTCATCAGAAATTATCTTAGAATATTAATTGTTTATTTCTTCTTATACTTTTCAATACGTAGGAAGCCAATTGAAGCTGCTCTTTTTGCTACTGCAGAACATCTTTCCGGTCTGCTCCCTCTTCATCCTGCAGACAGGTACCATAGTGAAATCCTTTGGGTTGGAGTGATGTAG

mRNA sequence

ATGGCGGAGGAATTACAGGACAACGATGCTCCAAACGAAGAAGCCGTGGATGTTGCTGTTGATATAGAGACGAAGATTCATAACGCTATGCGCTCTTGCTTCTCTCACATCAAGGAACAAGCCGACTCTTTAACTTTTGAGGGGGTTAGAAGATTGTTAGAGAAGGACTTGTGTATGGAGATGTATACTTTAGATGTGCACAAAAGATATGTCAAGCAGTGTTTGGAGAAGTGCATAGAAACTGCTTTGAAAGACAATGTATCCAAGGATTCTGAGGAGACTGGGAGAAAAAATGTAAATAAAGAAGAAGCAGCTGAGTCACCTGAAGGGCATCAGTCCAAGAAAGTTGTAAAGGAACCTTGCTTGGAAGATGAGGAAAACATGGAAGACTCTCCAGATATGGGCCTCATAGGACGTAACACAAAAAATGTTGAATCTGATGGAATCAAAGGAATCAAAGACAAAGATGACAAAGATATTCCTAGTGAGAGTAGAATAACAAAAGCTATTATAGAAAGAATTTCTTATCTTAAAGCTAATTCGGAGATGGGTATCAGAAATGTTAATCAAGTACTCATAACTGAGGCTGTGGAAGCTGTGCAGCCTATTGAAGATGATGCAGCCGTAGGGCCTTTCCAACAAAGGGAAGGAACAAATGCTCCTCATAGTTCTACACTGGAAGTTCCAATATTTTGGTTCATTCATACAGAACCCTTATTAGTTGACAAACATTATCAGGCAAAGGCACTCTCCGACATGGTTATTGTAGTACAGTCGGAGATTTCATCCTGGGAAAGCCATTTGCAGTACAATGGGAAATCACTTATATGGGATATGAGGAAGCCAATTGAAGCTGCTCTTTTTGCTACTGCAGAACATCTTTCCGGTCTGCTCCCTCTTCATCCTGCAGACAGGTACCATAGTGAAATCCTTTGGGTTGGAGTGATGTAG

Coding sequence (CDS)

ATGGCGGAGGAATTACAGGACAACGATGCTCCAAACGAAGAAGCCGTGGATGTTGCTGTTGATATAGAGACGAAGATTCATAACGCTATGCGCTCTTGCTTCTCTCACATCAAGGAACAAGCCGACTCTTTAACTTTTGAGGGGGTTAGAAGATTGTTAGAGAAGGACTTGTGTATGGAGATGTATACTTTAGATGTGCACAAAAGATATGTCAAGCAGTGTTTGGAGAAGTGCATAGAAACTGCTTTGAAAGACAATGTATCCAAGGATTCTGAGGAGACTGGGAGAAAAAATGTAAATAAAGAAGAAGCAGCTGAGTCACCTGAAGGGCATCAGTCCAAGAAAGTTGTAAAGGAACCTTGCTTGGAAGATGAGGAAAACATGGAAGACTCTCCAGATATGGGCCTCATAGGACGTAACACAAAAAATGTTGAATCTGATGGAATCAAAGGAATCAAAGACAAAGATGACAAAGATATTCCTAGTGAGAGTAGAATAACAAAAGCTATTATAGAAAGAATTTCTTATCTTAAAGCTAATTCGGAGATGGGTATCAGAAATGTTAATCAAGTACTCATAACTGAGGCTGTGGAAGCTGTGCAGCCTATTGAAGATGATGCAGCCGTAGGGCCTTTCCAACAAAGGGAAGGAACAAATGCTCCTCATAGTTCTACACTGGAAGTTCCAATATTTTGGTTCATTCATACAGAACCCTTATTAGTTGACAAACATTATCAGGCAAAGGCACTCTCCGACATGGTTATTGTAGTACAGTCGGAGATTTCATCCTGGGAAAGCCATTTGCAGTACAATGGGAAATCACTTATATGGGATATGAGGAAGCCAATTGAAGCTGCTCTTTTTGCTACTGCAGAACATCTTTCCGGTCTGCTCCCTCTTCATCCTGCAGACAGGTACCATAGTGAAATCCTTTGGGTTGGAGTGATGTAG

Protein sequence

MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCMEMYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEPCLEDEENMEDSPDMGLIGRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKANSEMGIRNVNQVLITEAVEAVQPIEDDAAVGPFQQREGTNAPHSSTLEVPIFWFIHTEPLLVDKHYQAKALSDMVIVVQSEISSWESHLQYNGKSLIWDMRKPIEAALFATAEHLSGLLPLHPADRYHSEILWVGVM
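The protein sequence should be the translation of the CDS listed above. The sketch below checks this for the first 45 nucleotides of the CDS, assuming the standard genetic code applies to this nuclear gene. The `translate` helper and the `cds_start` variable are hypothetical names; paste the full CDS in place of `cds_start` to check the whole protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 45 nt of the CDS shown in this record.
cds_start = "ATGGCGGAGGAATTACAGGACAACGATGCTCCAAACGAAGAAGCC"
print(translate(cds_start))  # MAEELQDNDAPNEEA
```

The output matches the first 15 residues of the protein sequence above.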
Homology
BLAST of HG10003884 vs. NCBI nr
Match: XP_004145363.1 (DNA ligase 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 281.6 bits (719), Expect = 8.5e-72
Identity = 151/183 (82.51%), Positives = 159/183 (86.89%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQ ND P EE +DVAV IETKIHNAMRS  SH KEQADSLTFEGVRRLLEKDLCME
Sbjct: 1   MAEELQGNDTPKEEPMDVAVGIETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
            YTLDVHKRYVKQCL KC+E  L+DNVSKDSE TGRK+VNKEEA ESPEGHQSKK  KEP
Sbjct: 61  TYTLDVHKRYVKQCLVKCLEADLEDNVSKDSELTGRKSVNKEEAPESPEGHQSKKGAKEP 120

Query: 121 CLEDEENMEDSPDMGLI-GRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           CLEDEE MEDSP MGL+ GR+TKNVESDGIKGIK KDDKD+PSES I KAI +R SYLKA
Sbjct: 121 CLEDEEKMEDSPVMGLLTGRSTKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183
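Each HSP summary line reports identities and positives as a count over the alignment length, followed by the percentage. In standard BLAST output, positives are the identical residues plus the conservative substitutions marked `+` in the midline. A minimal sketch of parsing such a line and re-deriving the percentages follows; `parse_hsp_stats` and `percent` are hypothetical helpers, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 151/183 (82.51%), Positives = 159/183 (86.89%)".
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return identity and positive counts, lengths, and reported percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(count, length):
    # Percentage rounded to two decimals, as printed in the report.
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 151/183 (82.51%), Positives = 159/183 (86.89%), Query Frame = 0"
)
for name, (count, length, reported) in stats.items():
    print(name, percent(count, length), reported)
```

For the first hit, 151/183 gives 82.51% and 159/183 gives 86.89%, which agrees with the reported values.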

BLAST of HG10003884 vs. NCBI nr
Match: XP_011649194.1 (DNA ligase 1 isoform X2 [Cucumis sativus] >KAE8651851.1 hypothetical protein Csa_006536 [Cucumis sativus])

HSP 1 Score: 281.6 bits (719), Expect = 8.5e-72
Identity = 151/183 (82.51%), Positives = 159/183 (86.89%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQ ND P EE +DVAV IETKIHNAMRS  SH KEQADSLTFEGVRRLLEKDLCME
Sbjct: 1   MAEELQGNDTPKEEPMDVAVGIETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
            YTLDVHKRYVKQCL KC+E  L+DNVSKDSE TGRK+VNKEEA ESPEGHQSKK  KEP
Sbjct: 61  TYTLDVHKRYVKQCLVKCLEADLEDNVSKDSELTGRKSVNKEEAPESPEGHQSKKGAKEP 120

Query: 121 CLEDEENMEDSPDMGLI-GRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           CLEDEE MEDSP MGL+ GR+TKNVESDGIKGIK KDDKD+PSES I KAI +R SYLKA
Sbjct: 121 CLEDEEKMEDSPVMGLLTGRSTKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. NCBI nr
Match: XP_038884710.1 (glutamic acid-rich protein isoform X2 [Benincasa hispida])

HSP 1 Score: 272.7 bits (696), Expect = 4.0e-69
Identity = 150/183 (81.97%), Positives = 159/183 (86.89%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQD DA N++A+DVAVDIETKI+NAMRS  S+ KE+ADSLTFEGVRRLLEKDLCME
Sbjct: 1   MAEELQDKDASNDKAMDVAVDIETKIYNAMRSRVSYFKEEADSLTFEGVRRLLEKDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
           MYTLDVHKR VKQCL KC E   +DNVSK SEETGRK+VNKEEAAE  EGHQSKK VKEP
Sbjct: 61  MYTLDVHKRLVKQCLVKCFEADWEDNVSKKSEETGRKSVNKEEAAEPLEGHQSKKGVKEP 120

Query: 121 CLEDEENMEDSPDMG-LIGRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           C EDEE MEDSP MG LI RNTKNVESDGIKGIKDKDDKDIPSES I KAI +R SYLKA
Sbjct: 121 CSEDEEKMEDSPVMGLLIPRNTKNVESDGIKGIKDKDDKDIPSESIIAKAIRKRTSYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. NCBI nr
Match: XP_038884709.1 (glutamic acid-rich protein isoform X1 [Benincasa hispida])

HSP 1 Score: 272.7 bits (696), Expect = 4.0e-69
Identity = 150/183 (81.97%), Positives = 159/183 (86.89%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQD DA N++A+DVAVDIETKI+NAMRS  S+ KE+ADSLTFEGVRRLLEKDLCME
Sbjct: 1   MAEELQDKDASNDKAMDVAVDIETKIYNAMRSRVSYFKEEADSLTFEGVRRLLEKDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
           MYTLDVHKR VKQCL KC E   +DNVSK SEETGRK+VNKEEAAE  EGHQSKK VKEP
Sbjct: 61  MYTLDVHKRLVKQCLVKCFEADWEDNVSKKSEETGRKSVNKEEAAEPLEGHQSKKGVKEP 120

Query: 121 CLEDEENMEDSPDMG-LIGRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           C EDEE MEDSP MG LI RNTKNVESDGIKGIKDKDDKDIPSES I KAI +R SYLKA
Sbjct: 121 CSEDEEKMEDSPVMGLLIPRNTKNVESDGIKGIKDKDDKDIPSESIIAKAIRKRTSYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. NCBI nr
Match: XP_022939456.1 (DNA ligase 1-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 260.0 bits (663), Expect = 2.7e-65
Identity = 142/183 (77.60%), Positives = 152/183 (83.06%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQDNDAPNEEA+DV V IETKI NAM S  SH KEQADSLTFEGVRRLLEKDLCME
Sbjct: 1   MAEELQDNDAPNEEAMDVDVGIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
            Y LDVHKRY+KQCL KC+E   +DN SK SEETG K+V++ EAAES EGHQSKK  KEP
Sbjct: 61  TYALDVHKRYIKQCLVKCLEGVEEDNASKSSEETGGKSVSRGEAAESLEGHQSKKGAKEP 120

Query: 121 CLEDEENMEDSPDMGLI-GRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           CLEDEE MEDSP MGL+ G  TKNVESD IKGIKDKDDKDIP+ES I KAI +R  YLKA
Sbjct: 121 CLEDEEKMEDSPVMGLLAGHKTKNVESDKIKGIKDKDDKDIPTESTIKKAIRKRTPYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. ExPASy TrEMBL
Match: A0A0A0LIS6 (CHZ domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G232480 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 4.1e-72
Identity = 151/183 (82.51%), Positives = 159/183 (86.89%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQ ND P EE +DVAV IETKIHNAMRS  SH KEQADSLTFEGVRRLLEKDLCME
Sbjct: 1   MAEELQGNDTPKEEPMDVAVGIETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
            YTLDVHKRYVKQCL KC+E  L+DNVSKDSE TGRK+VNKEEA ESPEGHQSKK  KEP
Sbjct: 61  TYTLDVHKRYVKQCLVKCLEADLEDNVSKDSELTGRKSVNKEEAPESPEGHQSKKGAKEP 120

Query: 121 CLEDEENMEDSPDMGLI-GRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           CLEDEE MEDSP MGL+ GR+TKNVESDGIKGIK KDDKD+PSES I KAI +R SYLKA
Sbjct: 121 CLEDEEKMEDSPVMGLLTGRSTKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. ExPASy TrEMBL
Match: A0A6J1FGV2 (glutamic acid-rich protein-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445354 PE=4 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 1.3e-65
Identity = 142/183 (77.60%), Positives = 152/183 (83.06%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQDNDAPNEEA+DV V IETKI NAM S  SH KEQADSLTFEGVRRLLEKDLCME
Sbjct: 1   MAEELQDNDAPNEEAMDVDVGIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
            Y LDVHKRY+KQCL KC+E   +DN SK SEETG K+V++ EAAES EGHQSKK  KEP
Sbjct: 61  TYALDVHKRYIKQCLVKCLEGVEEDNASKSSEETGGKSVSRGEAAESLEGHQSKKGAKEP 120

Query: 121 CLEDEENMEDSPDMGLI-GRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           CLEDEE MEDSP MGL+ G  TKNVESD IKGIKDKDDKDIP+ES I KAI +R  YLKA
Sbjct: 121 CLEDEEKMEDSPVMGLLAGHKTKNVESDKIKGIKDKDDKDIPTESTIKKAIRKRTPYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. ExPASy TrEMBL
Match: A0A6J1FFY5 (DNA ligase 1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445354 PE=4 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 1.3e-65
Identity = 142/183 (77.60%), Positives = 152/183 (83.06%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQDNDAPNEEA+DV V IETKI NAM S  SH KEQADSLTFEGVRRLLEKDLCME
Sbjct: 1   MAEELQDNDAPNEEAMDVDVGIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLEKDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
            Y LDVHKRY+KQCL KC+E   +DN SK SEETG K+V++ EAAES EGHQSKK  KEP
Sbjct: 61  TYALDVHKRYIKQCLVKCLEGVEEDNASKSSEETGGKSVSRGEAAESLEGHQSKKGAKEP 120

Query: 121 CLEDEENMEDSPDMGLI-GRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           CLEDEE MEDSP MGL+ G  TKNVESD IKGIKDKDDKDIP+ES I KAI +R  YLKA
Sbjct: 121 CLEDEEKMEDSPVMGLLAGHKTKNVESDKIKGIKDKDDKDIPTESTIKKAIRKRTPYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. ExPASy TrEMBL
Match: A0A6J1JTY1 (glutamic acid-rich protein-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489718 PE=4 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 9.2e-64
Identity = 138/183 (75.41%), Positives = 149/183 (81.42%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQD DAPNEEA+DV V IETKI NAM S  SH KEQADSLTFEGVRRLLE DLCME
Sbjct: 1   MAEELQDTDAPNEEAMDVDVGIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLENDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
            Y LDVHKRY+KQCL KC+E   +DN SK SEETG K+V++ EAAES EGHQSKK  KEP
Sbjct: 61  TYALDVHKRYIKQCLVKCLEGVEEDNASKSSEETGGKSVSRGEAAESLEGHQSKKGAKEP 120

Query: 121 CLEDEENMEDSPDMGLI-GRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           CLEDEE MEDSP MGL+ G  TKN ESD +KGIKDKDDKDIP+ES I KAI +R  YLKA
Sbjct: 121 CLEDEEKMEDSPVMGLLAGHKTKNAESDKVKGIKDKDDKDIPTESTIKKAIRKRTPYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. ExPASy TrEMBL
Match: A0A6J1K3E3 (glutamic acid-rich protein-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489718 PE=4 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 9.2e-64
Identity = 138/183 (75.41%), Positives = 149/183 (81.42%), Query Frame = 0

Query: 1   MAEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCME 60
           MAEELQD DAPNEEA+DV V IETKI NAM S  SH KEQADSLTFEGVRRLLE DLCME
Sbjct: 1   MAEELQDTDAPNEEAMDVDVGIETKIQNAMLSRVSHFKEQADSLTFEGVRRLLENDLCME 60

Query: 61  MYTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKNVNKEEAAESPEGHQSKKVVKEP 120
            Y LDVHKRY+KQCL KC+E   +DN SK SEETG K+V++ EAAES EGHQSKK  KEP
Sbjct: 61  TYALDVHKRYIKQCLVKCLEGVEEDNASKSSEETGGKSVSRGEAAESLEGHQSKKGAKEP 120

Query: 121 CLEDEENMEDSPDMGLI-GRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 180
           CLEDEE MEDSP MGL+ G  TKN ESD +KGIKDKDDKDIP+ES I KAI +R  YLKA
Sbjct: 121 CLEDEEKMEDSPVMGLLAGHKTKNAESDKVKGIKDKDDKDIPTESTIKKAIRKRTPYLKA 180

Query: 181 NSE 183
           NSE
Sbjct: 181 NSE 183

BLAST of HG10003884 vs. TAIR 10
Match: AT5G58100.1 (unknown protein; INVOLVED IN: pollen exine formation; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 8 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G28720.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 133.7 bits (335), Expect = 2.7e-31
Identity = 59/86 (68.60%), Positives = 74/86 (86.05%), Query Frame = 0

Query: 216 EGTNAPHSSTLEVPIFWFIHTEPLLVDKHYQAKALSDMVIVVQSEISSWESHLQYNGKSL 275
           +G +A   STLE+PIFW I  +PLL+DKHYQAKALS+MV+VVQSE SSWESHLQ NG+SL
Sbjct: 682 KGGHAHSRSTLEIPIFWLISGDPLLIDKHYQAKALSNMVVVVQSEASSWESHLQCNGRSL 741

Query: 276 IWDMRKPIEAALFATAEHLSGLLPLH 302
           +WD+R P++AA+ + AEHL+GLLPLH
Sbjct: 742 LWDLRSPVKAAMASVAEHLAGLLPLH 767

BLAST of HG10003884 vs. TAIR 10
Match: AT4G08310.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G44780.2); Has 53711 Blast hits to 33687 proteins in 1618 species: Archae - 142; Bacteria - 4400; Metazoa - 24303; Fungi - 6688; Plants - 2484; Viruses - 449; Other Eukaryotes - 15245 (source: NCBI BLink). )

HSP 1 Score: 109.0 bits (271), Expect = 7.1e-24
Identity = 72/164 (43.90%), Positives = 107/164 (65.24%), Query Frame = 0

Query: 21  DIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCMEMYTLDVHKRYVKQCLEKCIE 80
           DIE++I  AM+S  ++++++AD+ TFEGVRRLLE+DL +E + LDVHK +VKQ L +C+ 
Sbjct: 33  DIESQILAAMQSRVTYLRDKADNFTFEGVRRLLEEDLKLEKHALDVHKSFVKQHLVQCLA 92

Query: 81  TALKDNVSKDSEETGRKN--VNKEEAAESPEGHQSKKVVKEPCLEDEENMEDSPDMGLIG 140
            A  D  S++S ET +K+     +EAAE  + H +KK  KE    D+E  +DSP MGL+ 
Sbjct: 93  GAENDETSENSLETEKKDDVTPVKEAAELSKEHTTKKDGKEDMTGDDEKTKDSPVMGLL- 152

Query: 141 RNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKANSE 183
              +N      +  KD+D + + S+  I KA+ +R SY+KANSE
Sbjct: 153 -TEENTSKSVAEQTKDEDKEVLQSD--IKKALRKRSSYIKANSE 192

BLAST of HG10003884 vs. TAIR 10
Match: AT1G44780.1 (CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G08310.1); Has 18105 Blast hits to 11200 proteins in 808 species: Archae - 37; Bacteria - 1195; Metazoa - 7724; Fungi - 1727; Plants - 674; Viruses - 183; Other Eukaryotes - 6565 (source: NCBI BLink). )

HSP 1 Score: 78.2 bits (191), Expect = 1.3e-14
Identity = 63/183 (34.43%), Positives = 99/183 (54.10%), Query Frame = 0

Query: 2   AEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCMEM 61
           AE   ++   N + VD A +IE KI  A+RS  ++++ +AD  T   VRR+LE+D+ +E 
Sbjct: 4   AEMNHNSGKSNLKNVD-ATEIEFKILAALRSRVTYLRNEADCFTLVSVRRMLEEDIGLEK 63

Query: 62  YTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKN--VNKEEAAESPEGHQSKKVVKE 121
             LDV+K +VK+ L KC+E A  ++ S++S+ET R++  +  +E AE  E H        
Sbjct: 64  CDLDVYKSFVKEHLVKCLEEAGNNDTSENSQETEREDDEIPTKEVAEQSEEH-------- 123

Query: 122 PCLEDEENMEDSPDMGLIGRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 181
                 E M D+ +     R  K+V+  G K    +D         I +A+ +R SY+KA
Sbjct: 124 ------EPMNDAGEENTSKREAKDVKGKGNKETLQRD---------IKRALRKRASYIKA 162

Query: 182 NSE 183
           NSE
Sbjct: 184 NSE 162

BLAST of HG10003884 vs. TAIR 10
Match: AT1G44780.2 (INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 9 growth stages; CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G08310.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 78.2 bits (191), Expect = 1.3e-14
Identity = 63/183 (34.43%), Positives = 99/183 (54.10%), Query Frame = 0

Query: 2   AEELQDNDAPNEEAVDVAVDIETKIHNAMRSCFSHIKEQADSLTFEGVRRLLEKDLCMEM 61
           AE   ++   N + VD A +IE KI  A+RS  ++++ +AD  T   VRR+LE+D+ +E 
Sbjct: 4   AEMNHNSGKSNLKNVD-ATEIEFKILAALRSRVTYLRNEADCFTLVSVRRMLEEDIGLEK 63

Query: 62  YTLDVHKRYVKQCLEKCIETALKDNVSKDSEETGRKN--VNKEEAAESPEGHQSKKVVKE 121
             LDV+K +VK+ L KC+E A  ++ S++S+ET R++  +  +E AE  E H        
Sbjct: 64  CDLDVYKSFVKEHLVKCLEEAGNNDTSENSQETEREDDEIPTKEVAEQSEEH-------- 123

Query: 122 PCLEDEENMEDSPDMGLIGRNTKNVESDGIKGIKDKDDKDIPSESRITKAIIERISYLKA 181
                 E M D+ +     R  K+V+  G K    +D         I +A+ +R SY+KA
Sbjct: 124 ------EPMNDAGEENTSKREAKDVKGKGNKETLQRD---------IKRALRKRASYIKA 162

Query: 182 NSE 183
           NSE
Sbjct: 184 NSE 162

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value  Identity (%)  Description
XP_004145363.1  8.5e-72  82.51         DNA ligase 1 isoform X1 [Cucumis sativus]
XP_011649194.1  8.5e-72  82.51         DNA ligase 1 isoform X2 [Cucumis sativus] >KAE8651851.1 hypothetical protein Csa...
XP_038884710.1  4.0e-69  81.97         glutamic acid-rich protein isoform X2 [Benincasa hispida]
XP_038884709.1  4.0e-69  81.97         glutamic acid-rich protein isoform X1 [Benincasa hispida]
XP_022939456.1  2.7e-65  77.60         DNA ligase 1-like isoform X1 [Cucurbita moschata]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A0A0LIS6      4.1e-72  82.51         CHZ domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G232480 PE=4 SV...
A0A6J1FGV2      1.3e-65  77.60         glutamic acid-rich protein-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC1...
A0A6J1FFY5      1.3e-65  77.60         DNA ligase 1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445354 PE=4 ...
A0A6J1JTY1      9.2e-64  75.41         glutamic acid-rich protein-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111...
A0A6J1K3E3      9.2e-64  75.41         glutamic acid-rich protein-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT5G58100.1     2.7e-31  68.60         unknown protein; INVOLVED IN: pollen exine formation; EXPRESSED IN: 19 plant str...
AT4G08310.1     7.1e-24  43.90         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT1G44780.1     1.3e-14  34.43         CONTAINS InterPro DOMAIN/s: Histone chaperone domain CHZ (InterPro:IPR019098); B...
AT1G44780.2     1.3e-14  34.43         INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown;...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description             Source       Source Term  Source Description          Alignment
None       No IPR available            MOBIDB_LITE  mobidb-lite  disorder_prediction         coord: 87..122
None       No IPR available            MOBIDB_LITE  mobidb-lite  disorder_prediction         coord: 87..142
IPR037647  HIRA-interacting protein 3  PANTHER      PTHR15410    HIRA-INTERACTING PROTEIN 3  coord: 1..182

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10003884.1  HG10003884.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005634      nucleus
molecular_function  GO:0016874      ligase activity