HG10020033 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10020033
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: CSL zinc finger domain-containing protein
Location: Chr04: 28141998 .. 28144270 (+)
RNA-Seq Expression: HG10020033
Synteny: HG10020033
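The Location field above can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, but not stated on this page); the function name `parse_location` is hypothetical and used for illustration only.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr04: 28141998 .. 28144270 (+)'.

    Assumption: start and end are 1-based, inclusive coordinates,
    so the feature length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr04: 28141998 .. 28144270 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2273 bp on the plus strand of Chr04, introns included.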
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGATTTCCGATCTCCCACCGGCGTTCGCCGTGCTGCTTCTGTTGGTGATGGCGGTGGTTGTCGAAGCTAGCGACAACAACCGAGTTTTCTCACCTTGCACGGACACAACTGTTGAGAAGTCCGACGGCTTCACCTTAGGGTTTGCTTTTGCGACGGAGCAGAAGTTTCTCTTCAATAAAACCTTGCAGTTGTCTCCTTGCGACAGCAGGCTCGGTCTTACGAATGGAAATTCTCTGATCTCTGTGTTTAGACCTAAGGTTGATGAGATCTCCCTCCTTACCGTCAACACTTCTTCCTCTGTGTCTTCCTTCAATCCGGTTAGTTCTTCAAACTCCGTGTTCTTTGTTGATTTCGAATTTATGGATTTTCTGTTTGCTCTTCGTTTGAAGTTGAGACTAGTTCTAATTAGGTTTTGTGCGTTTATTTGTCTAATGGGACGATCAGTAGGTTTTGTGGAATTGGAGTAATAGGTTTGTTAACTCTCCAACTAAACAGGGCTAATCCTTATGGAATCACTTGGCTTACTTTGATTGCGTGATGCTAGCTCGATTTGATTTGGGATTTGTTGGTATGTTAATTGTTCTTTGAGGGAAATTTAGGATCAATCACTATTGGTTGACATTCTGCACGATCCAATTTTAACAAGGAACTCAAATCTCGAACTTGAGTAGGAAAAAAAGGGAAGAACTTGGTGTTTGATAGGGTAGAATTATATGAGAAGGAAGAAATTAATATCTCTGCTACTTTGATTTCTATTGCTCACATAAATCTGAATTTCTTTCTACTAGTTTATCTACTTTTACATGCTAGTTATCAGTTCTTACCAGTCATGGGCTCATGATTTCCTTTTCTTTTCATTCATTACCAGTCGTCAAATGGCTATATGGTTGCATTTGCTGGTCGAAAATATGCTGCAAGGTCCCCTCCAATTTTTGTCGCAGATGAACAACATACCGTAACCAGCTTTACTCTGGTAAGTTTATCAGTTTATGGCATTGAAAGTAGGAAGTAAAGTAAAGTAAAAGACACTAGACATGGACAAAATACTACACTGGCTTGGTGATATGTTGTTTTTAAGAACTCAATGGATATATGGCAGGGTCATATGTAACAAATTATTTTTTCTGTGAAATGTAAGAGATAGTGATGGAAAAATTATTGAATAAGGTCGTCATGATATGGTAGTCTCCAATACTTTTATAGTTTTCTCTCATGCATCCAACAGTATAAAATTACTCGTGACATTATGACAGCATCCTTGGGTGAGGAGGGGAGGAGTTATTGTTGGAGATGTGAGTGGGTAGGTTTGATAAATCATGATGTGTTCAACATTTTCAAAACAAAATTTTGATTGTTTAGGGTCTCGAAGAGATTTGAAAACAAGGGATTTAATGAAAATAAAGTCGTCTTTCATGTTTTCAAATGTGTGTTTGGTAACATATTTAGAAATTGGATTTCAGTTGAAACAATGGTTCAAAATGTGCAGGATATGATATTTATAAACCATAAAACTAATGTTACAATATTAGGTTGAGTATAAACTATTAATTATTAATTTGTGACATGATTTATTTCAATTTTTTTTGTATAATTAGTATTTTGTAATATCTTTTAATTTATAATACATTATATAATATATATTTTAGTTTCAAAATGAATTGAGTTTATAACCGAAATACTATATTTGTTAACATTATTATCTCTTTTTATATAGTTATGTTGTTTAAAATTTAAAATTCAAATTTTGGATATGCTTACAGCATAAAAATGTTGTTTTTCAAAATTTACACTATTTGAATCACATAATAAAAAATAGTTTTAAAAAACCAAATTTATATTGCCTACCAAACATGTATTCATTGAATCTAAAAATATAAAACATAATCTAGACTGACTGTGACTACTAAACAGTCCCTTCGAGATTGATAACGAATCCATTGAATTTTCTTGTGTATAGGTGCTTGAGTTTGAGAAAGGCAGGCTGCAAAACTTGTTC
TGGAAAAGGGATGGCTGTGCTAGATGTTCAAACAACAATACCTTTGTTTGCATCCAGAATCAGGATTGTGCAATAAGAACGAACAACTGCAAATATCGTGGTGGTTCTGTCGATTGCAGTCTTGCAATACAGTTAGCGTTCTCTGGCACGGATAAGCACCTTTCTGTCTTCAACTCTTGGTACGAAGTGTCGAAGCTTCGGCAATACTCACTCCTCAATCTGTATTCGAACCTCAGAGATTCTCTCACAAGTCAGTATAACAAGATCTTCTAA

mRNA sequence

ATGGGGATTTCCGATCTCCCACCGGCGTTCGCCGTGCTGCTTCTGTTGGTGATGGCGGTGGTTGTCGAAGCTAGCGACAACAACCGAGTTTTCTCACCTTGCACGGACACAACTGTTGAGAAGTCCGACGGCTTCACCTTAGGGTTTGCTTTTGCGACGGAGCAGAAGTTTCTCTTCAATAAAACCTTGCAGTTGTCTCCTTGCGACAGCAGGCTCGGTCTTACGAATGGAAATTCTCTGATCTCTGTGTTTAGACCTAAGGTTGATGAGATCTCCCTCCTTACCGTCAACACTTCTTCCTCTGTGTCTTCCTTCAATCCGTCGTCAAATGGCTATATGGTTGCATTTGCTGGTCGAAAATATGCTGCAAGGTCCCCTCCAATTTTTGTCGCAGATGAACAACATACCGTAACCAGCTTTACTCTGGTGCTTGAGTTTGAGAAAGGCAGGCTGCAAAACTTGTTCTGGAAAAGGGATGGCTGTGCTAGATGTTCAAACAACAATACCTTTGTTTGCATCCAGAATCAGGATTGTGCAATAAGAACGAACAACTGCAAATATCGTGGTGGTTCTGTCGATTGCAGTCTTGCAATACAGTTAGCGTTCTCTGGCACGGATAAGCACCTTTCTGTCTTCAACTCTTGGTACGAAGTGTCGAAGCTTCGGCAATACTCACTCCTCAATCTGTATTCGAACCTCAGAGATTCTCTCACAAGTCAGTATAACAAGATCTTCTAA

Coding sequence (CDS)

ATGGGGATTTCCGATCTCCCACCGGCGTTCGCCGTGCTGCTTCTGTTGGTGATGGCGGTGGTTGTCGAAGCTAGCGACAACAACCGAGTTTTCTCACCTTGCACGGACACAACTGTTGAGAAGTCCGACGGCTTCACCTTAGGGTTTGCTTTTGCGACGGAGCAGAAGTTTCTCTTCAATAAAACCTTGCAGTTGTCTCCTTGCGACAGCAGGCTCGGTCTTACGAATGGAAATTCTCTGATCTCTGTGTTTAGACCTAAGGTTGATGAGATCTCCCTCCTTACCGTCAACACTTCTTCCTCTGTGTCTTCCTTCAATCCGTCGTCAAATGGCTATATGGTTGCATTTGCTGGTCGAAAATATGCTGCAAGGTCCCCTCCAATTTTTGTCGCAGATGAACAACATACCGTAACCAGCTTTACTCTGGTGCTTGAGTTTGAGAAAGGCAGGCTGCAAAACTTGTTCTGGAAAAGGGATGGCTGTGCTAGATGTTCAAACAACAATACCTTTGTTTGCATCCAGAATCAGGATTGTGCAATAAGAACGAACAACTGCAAATATCGTGGTGGTTCTGTCGATTGCAGTCTTGCAATACAGTTAGCGTTCTCTGGCACGGATAAGCACCTTTCTGTCTTCAACTCTTGGTACGAAGTGTCGAAGCTTCGGCAATACTCACTCCTCAATCTGTATTCGAACCTCAGAGATTCTCTCACAAGTCAGTATAACAAGATCTTCTAA

Protein sequence

MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFNKTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRKYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAIRTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQYNKIF
Homology
BLAST of HG10020033 vs. NCBI nr
Match: XP_004136522.1 (uncharacterized protein LOC101209667 [Cucumis sativus] >KGN59226.1 hypothetical protein Csa_001956 [Cucumis sativus])

HSP 1 Score: 446.0 bits (1146), Expect = 2.0e-121
Identity = 222/245 (90.61%), Positives = 231/245 (94.29%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M IS L PA  VL+LLV AVV+EASDNNRVFSPCTDTTVE SDGFTLGFAFATEQKF FN
Sbjct: 1   MAISYLSPASGVLILLVTAVVIEASDNNRVFSPCTDTTVENSDGFTLGFAFATEQKFFFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLT+NTS SVSSF+PSSNGYMVAFAGRK
Sbjct: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTINTSRSVSSFDPSSNGYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVAD+QH VTSFTLVLEFEKGRLQNLFWKRDGCARCSNN+TFVCI NQDCAI
Sbjct: 121 YAARSPPIFVADQQHIVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNSTFVCIHNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RTN+CK  GGSVDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSL NLYSNL+DSLTSQ
Sbjct: 181 RTNSCKNNGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 245

BLAST of HG10020033 vs. NCBI nr
Match: XP_038903347.1 (uncharacterized protein LOC120089965 [Benincasa hispida])

HSP 1 Score: 445.7 bits (1145), Expect = 2.7e-121
Identity = 223/245 (91.02%), Positives = 230/245 (93.88%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M ISDL PA  VLLL V AVVVEA DNNRVFS CTDTTVEKSDGFTLGFAFATEQKF+FN
Sbjct: 1   MAISDLLPASVVLLLFVTAVVVEAGDNNRVFSTCTDTTVEKSDGFTLGFAFATEQKFVFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTL+LSPCDSRL LTNGNSLISVFRPKVDEISLLT+NTS SVSSF+PSSNGYMVAFAGRK
Sbjct: 61  KTLKLSPCDSRLALTNGNSLISVFRPKVDEISLLTINTSRSVSSFDPSSNGYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVAD+QHTVTSFTLVLEFEKGRLQNLFWKRDGCA+CSNNNTFVCI NQDCAI
Sbjct: 121 YAARSPPIFVADQQHTVTSFTLVLEFEKGRLQNLFWKRDGCAKCSNNNTFVCIHNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RTNNCK  GGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSL NLYSNL+ SLTSQ
Sbjct: 181 RTNNCKNNGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKGSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 245

BLAST of HG10020033 vs. NCBI nr
Match: XP_008442938.1 (PREDICTED: uncharacterized protein LOC103486692 [Cucumis melo] >KAA0043800.1 uncharacterized protein E6C27_scaffold236G001320 [Cucumis melo var. makuwa] >TYK25333.1 uncharacterized protein E5676_scaffold352G004720 [Cucumis melo var. makuwa])

HSP 1 Score: 441.8 bits (1135), Expect = 3.8e-120
Identity = 221/245 (90.20%), Positives = 230/245 (93.88%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M IS   PA AVL+LLV AV+VEA DNNRVFSPCTDTTVE SDGFTLGFAFAT+QKF FN
Sbjct: 1   MAISYPSPASAVLILLVTAVLVEAGDNNRVFSPCTDTTVETSDGFTLGFAFATQQKFFFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTS SVSSF+PSSNGYMVAFAGRK
Sbjct: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSRSVSSFDPSSNGYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVAD+QH VTSFTLVLEFEKGRLQNLFWKRDGCA+CSNNNTFVCI NQDCAI
Sbjct: 121 YAARSPPIFVADQQHIVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCIHNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RTN+CK  GGSVDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSL NLYSNL+DSLTSQ
Sbjct: 181 RTNSCKNNGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 245

BLAST of HG10020033 vs. NCBI nr
Match: XP_023538924.1 (uncharacterized protein LOC111799707 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 434.5 bits (1116), Expect = 6.1e-118
Identity = 217/245 (88.57%), Positives = 225/245 (91.84%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M I DL  A AVLLLL+ A  VEA D+NRVFSPC DTTVEKSDGFTLG AFAT QKF+FN
Sbjct: 1   MAIYDLSRASAVLLLLMTAAAVEARDSNRVFSPCADTTVEKSDGFTLGLAFATHQKFVFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTL LSPCDSRL LTNGNSLISVFRP VDEISLLTVNT+ SVS+FNPSSNGYMVAFAGRK
Sbjct: 61  KTLNLSPCDSRLALTNGNSLISVFRPMVDEISLLTVNTTPSVSNFNPSSNGYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGCA+CSNNNTFVCI NQDCAI
Sbjct: 121 YAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RT+NCKYRGG VDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSL NLYSNL+DSLTSQ
Sbjct: 181 RTSNCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 245

BLAST of HG10020033 vs. NCBI nr
Match: KAG6596627.1 (hypothetical protein SDJN03_09807, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 431.4 bits (1108), Expect = 5.2e-117
Identity = 214/245 (87.35%), Positives = 224/245 (91.43%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M I D   A AVLLLL+ A  VEA D+NR+FSPC DTTVEKSDGFTLG AFAT+QKF+FN
Sbjct: 31  MAIYDFSTASAVLLLLMTAAAVEARDSNRIFSPCADTTVEKSDGFTLGLAFATQQKFVFN 90

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTL LSPCDSRL LTNGNSLISVFRP VDEISLLTVNT+ SVS+FNPSSN YMVAFAGRK
Sbjct: 91  KTLNLSPCDSRLALTNGNSLISVFRPMVDEISLLTVNTTPSVSNFNPSSNSYMVAFAGRK 150

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGCA+CSNNNTFVCI NQDCAI
Sbjct: 151 YAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCAI 210

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RT+NCKYRGG VDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSL NLYSNL+DSLTSQ
Sbjct: 211 RTSNCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLFNLYSNLKDSLTSQ 270

Query: 241 YNKIF 246
           YNKIF
Sbjct: 271 YNKIF 275

BLAST of HG10020033 vs. ExPASy TrEMBL
Match: A0A0A0LGU1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G782730 PE=4 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 9.8e-122
Identity = 222/245 (90.61%), Positives = 231/245 (94.29%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M IS L PA  VL+LLV AVV+EASDNNRVFSPCTDTTVE SDGFTLGFAFATEQKF FN
Sbjct: 1   MAISYLSPASGVLILLVTAVVIEASDNNRVFSPCTDTTVENSDGFTLGFAFATEQKFFFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLT+NTS SVSSF+PSSNGYMVAFAGRK
Sbjct: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTINTSRSVSSFDPSSNGYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVAD+QH VTSFTLVLEFEKGRLQNLFWKRDGCARCSNN+TFVCI NQDCAI
Sbjct: 121 YAARSPPIFVADQQHIVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNSTFVCIHNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RTN+CK  GGSVDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSL NLYSNL+DSLTSQ
Sbjct: 181 RTNSCKNNGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 245

BLAST of HG10020033 vs. ExPASy TrEMBL
Match: A0A5A7TKL4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004720 PE=4 SV=1)

HSP 1 Score: 441.8 bits (1135), Expect = 1.9e-120
Identity = 221/245 (90.20%), Positives = 230/245 (93.88%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M IS   PA AVL+LLV AV+VEA DNNRVFSPCTDTTVE SDGFTLGFAFAT+QKF FN
Sbjct: 1   MAISYPSPASAVLILLVTAVLVEAGDNNRVFSPCTDTTVETSDGFTLGFAFATQQKFFFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTS SVSSF+PSSNGYMVAFAGRK
Sbjct: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSRSVSSFDPSSNGYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVAD+QH VTSFTLVLEFEKGRLQNLFWKRDGCA+CSNNNTFVCI NQDCAI
Sbjct: 121 YAARSPPIFVADQQHIVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCIHNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RTN+CK  GGSVDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSL NLYSNL+DSLTSQ
Sbjct: 181 RTNSCKNNGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 245

BLAST of HG10020033 vs. ExPASy TrEMBL
Match: A0A1S3B6Y8 (uncharacterized protein LOC103486692 OS=Cucumis melo OX=3656 GN=LOC103486692 PE=4 SV=1)

HSP 1 Score: 441.8 bits (1135), Expect = 1.9e-120
Identity = 221/245 (90.20%), Positives = 230/245 (93.88%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M IS   PA AVL+LLV AV+VEA DNNRVFSPCTDTTVE SDGFTLGFAFAT+QKF FN
Sbjct: 1   MAISYPSPASAVLILLVTAVLVEAGDNNRVFSPCTDTTVETSDGFTLGFAFATQQKFFFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTS SVSSF+PSSNGYMVAFAGRK
Sbjct: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSRSVSSFDPSSNGYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVAD+QH VTSFTLVLEFEKGRLQNLFWKRDGCA+CSNNNTFVCI NQDCAI
Sbjct: 121 YAARSPPIFVADQQHIVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCIHNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RTN+CK  GGSVDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSL NLYSNL+DSLTSQ
Sbjct: 181 RTNSCKNNGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 245

BLAST of HG10020033 vs. ExPASy TrEMBL
Match: A0A6J1FLU2 (uncharacterized protein LOC111445105 OS=Cucurbita moschata OX=3662 GN=LOC111445105 PE=4 SV=1)

HSP 1 Score: 431.0 bits (1107), Expect = 3.3e-117
Identity = 214/245 (87.35%), Positives = 225/245 (91.84%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M I DL  A AVLLLL+ A  VEA D+NRVFSPC DTTVEKSDGFTLG AFAT+QKF+FN
Sbjct: 1   MAIYDLSTASAVLLLLMTAAAVEARDSNRVFSPCADTTVEKSDGFTLGLAFATQQKFVFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTL LSPCDSRL LTNGNSLIS+FRP VDEISLLTVNT+ SVS+FNPSSN YMVAFAGRK
Sbjct: 61  KTLNLSPCDSRLALTNGNSLISMFRPMVDEISLLTVNTTPSVSNFNPSSNSYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFVADEQH VTSFTLVLEFEKGRLQNLFWKRDGCA+CSNNNTFVCI NQDCAI
Sbjct: 121 YAARSPPIFVADEQHAVTSFTLVLEFEKGRLQNLFWKRDGCAQCSNNNTFVCINNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RT+NCKYRGG VDCSLAIQLAFSGTDKHLSVFNSWYEVS+LRQYSL NLYSNL+DSLTSQ
Sbjct: 181 RTSNCKYRGGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSRLRQYSLFNLYSNLKDSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 245

BLAST of HG10020033 vs. ExPASy TrEMBL
Match: A0A6J1J6R6 (uncharacterized protein LOC111481736 OS=Cucurbita maxima OX=3661 GN=LOC111481736 PE=4 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 9.5e-117
Identity = 217/245 (88.57%), Positives = 227/245 (92.65%), Query Frame = 0

Query: 1   MGISDLPPAFAVLLLLVMAVVVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFN 60
           M ISDL P   VLLLLV AVVVEA DNNRVFSPC DTTVEKSDGFTLG AFATEQKF+FN
Sbjct: 1   MAISDLSPLSFVLLLLVTAVVVEARDNNRVFSPCIDTTVEKSDGFTLGLAFATEQKFVFN 60

Query: 61  KTLQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRK 120
           KTL+LSPCDSRL LTNGN+LISVFRPKVDEISLLTVNT+ SVS+FNPSSNGYMVAFAGRK
Sbjct: 61  KTLKLSPCDSRLALTNGNALISVFRPKVDEISLLTVNTTPSVSTFNPSSNGYMVAFAGRK 120

Query: 121 YAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAI 180
           YAARSPPIFV+D QH VTSFTLVLEFEKGRLQNLFWKRDGCARCSNN+TFVCI NQDCAI
Sbjct: 121 YAARSPPIFVSDGQHIVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNHTFVCINNQDCAI 180

Query: 181 RTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQ 240
           RTNNCK   G VDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSL++LYSNL+DSLTSQ
Sbjct: 181 RTNNCK-NSGPVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLVDLYSNLKDSLTSQ 240

Query: 241 YNKIF 246
           YNKIF
Sbjct: 241 YNKIF 244

BLAST of HG10020033 vs. TAIR 10
Match: AT3G11800.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G44150.1); Has 74 Blast hits to 73 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 72; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 297.0 bits (759), Expect = 1.4e-80
Identity = 151/244 (61.89%), Positives = 185/244 (75.82%), Query Frame = 0

Query: 10  FAVLLLLVMAV----VVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLF---NKT 69
           F +L  L  AV    + EA DNN+V+SPC+D+TV   DGFT G AFA +  F     +K+
Sbjct: 4   FFLLCCLFAAVLTSSLTEAGDNNQVYSPCSDSTVAIGDGFTFGIAFAAKDSFFSTNRSKS 63

Query: 70  LQLSPCDSRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNP-SSNGYMVAFAGRKY 129
           +Q SPCD R    NGNS ++VFRPKVDEI+LLT+NTSSS SSF P +S GYMVAFAG KY
Sbjct: 64  VQYSPCDHRHLSLNGNSEVAVFRPKVDEITLLTINTSSS-SSFRPDASKGYMVAFAGAKY 123

Query: 130 AARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAIR 189
           AARS PI VAD  H VTSFTLVLEF+KGRL+N+FWK+DGC++CS ++ FVC+  ++CAI+
Sbjct: 124 AARSLPIMVADSNHIVTSFTLVLEFQKGRLENMFWKKDGCSKCSGDSKFVCLNKEECAIK 183

Query: 190 TNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQY 246
             NCK +GG VDCSL IQLAFSGTDKH +  NSWYEV+ L+QYSL  LYSNL+DSLT+ +
Sbjct: 184 PQNCKNQGGQVDCSLGIQLAFSGTDKHYTALNSWYEVANLKQYSLYGLYSNLKDSLTNPF 243

BLAST of HG10020033 vs. TAIR 10
Match: AT3G44150.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cultured cell; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11800.1); Has 76 Blast hits to 75 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 74; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 295.0 bits (754), Expect = 5.4e-80
Identity = 146/236 (61.86%), Positives = 177/236 (75.00%), Query Frame = 0

Query: 11  AVLLLLVMAVVVEASDN-NRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFNKTLQLSPCD 70
           AV+L + +      S N N ++SPC+DT +++SDGFT G AF++   F  N+T+ LSPCD
Sbjct: 14  AVILTVALGGDSGGSGNTNTIYSPCSDTRIQRSDGFTFGIAFSSRPSFFINQTVLLSPCD 73

Query: 71  SRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPSSNGYMVAFAGRKYAARSPPIF 130
            RL L   NS  SVFRPK+DEISLL++NTS   + F  +  GYMVAFAGRKYAARS P F
Sbjct: 74  RRLSLAAMNSQFSVFRPKIDEISLLSINTS---AFFPDNYGGYMVAFAGRKYAARSIPAF 133

Query: 131 VADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCARCSNNNTFVCIQNQDCAIRTNNCKYRG 190
           +A+    VTSFTLV+EF+KGRLQNL+WKRDGCA C  N  FVC+  QDCAIRT +CK RG
Sbjct: 134 IANSTFIVTSFTLVMEFQKGRLQNLYWKRDGCASCKGNQNFVCLNKQDCAIRTPSCKGRG 193

Query: 191 GSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSLTSQYNKIF 246
           G+VDCSL IQLAFSGTDKHL+V NSWYEV  L+QYSL  LYSNL+ SLT+Q+N  F
Sbjct: 194 GAVDCSLGIQLAFSGTDKHLAVLNSWYEVENLKQYSLYGLYSNLKSSLTNQFNNFF 246

BLAST of HG10020033 vs. TAIR 10
Match: AT2G15910.1 (CSL zinc finger domain-containing protein )

HSP 1 Score: 235.7 bits (600), Expect = 3.9e-62
Identity = 123/241 (51.04%), Positives = 173/241 (71.78%), Query Frame = 0

Query: 12  VLLLLVMAV--VVEASDNNRVFSPCTDTTVEKSDGFTLGFAFATEQKFLFNKTLQLSPCD 71
           +++++VM V   V A+DNN V+SPC+DT + K DGFT+G A ++++ F  ++ +QLSPCD
Sbjct: 128 MIMMIVMMVDDWVGAADNNPVYSPCSDTQISKGDGFTIGIAISSKEAFFLDQ-VQLSPCD 187

Query: 72  SRLGLTNGNSLISVFRPKVDEISLLTVNTSSSVSSFNPS-SNGYMVAFAGRKYAARSPPI 131
           +RLGL    + +++FRPKVDEISLL+++T    S FNPS + G+MV FAG KYAARS P+
Sbjct: 188 TRLGLAAKMAQLALFRPKVDEISLLSIDT----SKFNPSEAGGFMVGFAGSKYAARSYPV 247

Query: 132 FVADEQHTVTSFT---------LVLEFEKGRLQNLFWKRDGCARC--SNNNTFVCIQNQD 191
            VAD  +T+T+FT         LVLEF+KG LQNLFWK  GC  C  + +++ VC+   D
Sbjct: 248 KVADGSNTITAFTLVMKLTLSPLVLEFQKGVLQNLFWKSFGCDLCKGTGSSSSVCLNGTD 307

Query: 192 CAIRTNNCKYRGGSVDCSLAIQLAFSGTDKHLSVFNSWYEVSKLRQYSLLNLYSNLRDSL 239
           CA+ T+ CK  GG  +C++ IQ+AFSGTD++L   N+WYEV+ LRQYSL +LY+N  DSL
Sbjct: 308 CAVPTSKCKANGGQANCNIGIQVAFSGTDRNLESLNTWYEVNNLRQYSLTDLYANAVDSL 363

BLAST of HG10020033 vs. TAIR 10
Match: AT3G48630.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G44150.1); Has 64 Blast hits to 64 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 64; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 60.8 bits (146), Expect = 1.7e-09
Identity = 27/52 (51.92%), Positives = 35/52 (67.31%), Query Frame = 0

Query: 112 YMVAFAGRKYAARSPPIFVADEQHTVTSFTLVLEFEKGRLQNLFWKRDGCAR 164
           Y V   G +  +   P F+A+    VTSFT V+EF+KGRLQNL+WKRD CA+
Sbjct: 2   YNVGTRGSEIRSEVDPAFIANSTFIVTSFTWVMEFQKGRLQNLYWKRDVCAK 53
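
The percentages in the HSP header lines above are ratios of a match count to the alignment length. The sketch below recomputes them from the fractions as a sanity check; the helper name `hsp_percentages` is hypothetical, and the parsing assumes each value is written as `Name = matches/length`, as in the headers shown here.

```python
import re

def hsp_percentages(header):
    """Recompute percentages from 'Name = matches/length' pairs in an HSP header.

    Fields without a fraction (for example 'Query Frame = 0') are ignored.
    """
    result = {}
    for name, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[name] = round(100 * int(matches) / int(length), 2)
    return result

# Identity fraction taken from the first NCBI nr hit above.
header = "Identity = 222/245 (90.61%), Query Frame = 0"
print(hsp_percentages(header))
```

Because the pattern keys on the field name as written, it works on any of the header lines above without modification.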

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
XP_004136522.1  2.0e-121  90.61  uncharacterized protein LOC101209667 [Cucumis sativus] >KGN59226.1 hypothetical ...
XP_038903347.1  2.7e-121  91.02  uncharacterized protein LOC120089965 [Benincasa hispida]
XP_008442938.1  3.8e-120  90.20  PREDICTED: uncharacterized protein LOC103486692 [Cucumis melo] >KAA0043800.1 unc...
XP_023538924.1  6.1e-118  88.57  uncharacterized protein LOC111799707 [Cucurbita pepo subsp. pepo]
KAG6596627.1  5.2e-117  87.35  hypothetical protein SDJN03_09807, partial [Cucurbita argyrosperma subsp. sorori...

Match Name  E-value  Identity (%)  Description
A0A0A0LGU1  9.8e-122  90.61  Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G782730 PE=4 SV=1
A0A5A7TKL4  1.9e-120  90.20  Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B6Y8  1.9e-120  90.20  uncharacterized protein LOC103486692 OS=Cucumis melo OX=3656 GN=LOC103486692 PE=...
A0A6J1FLU2  3.3e-117  87.35  uncharacterized protein LOC111445105 OS=Cucurbita moschata OX=3662 GN=LOC1114451...
A0A6J1J6R6  9.5e-117  88.57  uncharacterized protein LOC111481736 OS=Cucurbita maxima OX=3661 GN=LOC111481736...

Match Name  E-value  Identity (%)  Description
AT3G11800.1  1.4e-80  61.89  unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G44150.1  5.4e-80  61.86  unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G15910.1  3.9e-62  51.04  CSL zinc finger domain-containing protein
AT3G48630.1  1.7e-09  51.92  unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
None  No IPR available  PANTHER  PTHR21454:SF33  SUBFAMILY NOT NAMED  coord: 5..245
IPR044248  Diphthamide biosynthesis protein 3/4-like  PANTHER  PTHR21454  DPH3 HOMOLOG-RELATED  coord: 5..245

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
HG10020033.1  HG10020033.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category  Term Accession  Term Name
biological_process  GO:0017183  peptidyl-diphthamide biosynthetic process from peptidyl-histidine
biological_process  GO:0002098  tRNA wobble uridine modification
cellular_component  GO:0005829  cytosol
cellular_component  GO:0005634  nucleus
molecular_function  GO:0046872  metal ion binding