HG10008547 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10008547
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SANT domain-containing protein
Location: Chr10: 24076939 .. 24079302 (+)
RNA-Seq Expression: HG10008547
Synteny: HG10008547
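The Location field above can be turned into structured coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this kind of record, but not stated on the page). The names `Location` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed genomic interval (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# Matches strings shaped like "Chr10: 24076939 .. 24079302 (+)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Chr10: 24076939 .. 24079302 (+)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the interval spans 2364 bases, which can be compared against the length of the gene sequence listed in the record.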
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGGTTATTTTGTTTCGTTGATACTAGAATAGAACTTGCTATTATTAATAGGATATCAAAGGTACGAGTGTTTCAGTCATACACATGCAGTCGCATTGTGATAGAAGATTGAAGGAGAGACATCGATAATGCTCTCTAAGAAATTTAGCAAAGAAAAATTTCCAAACTTTACCTGTCTTTGATTAAGACCCTTTCAAGGTTTTGATGAGTTTTGATTTTCTCTAGAATGCTTTTGGTTTAAAGAAACTGTGGAAGTGTGGAAGATACTGTGTTTCCCCTCTTCCTACTTGTATGATTGAAGTAATATTTGTTATTTTGCCTCAATCTGTTTTTTTATTTGTCGATGAGATTTCTAGCAACTTTTTTAACATATTTTTTGCTCCTCAGATCACAATTGGTGCTGCTTTCAAAATGGTGCAGAAACAGCCTTTTGATGATGGAGAACCACTGGAGATCTCTTCAAAGCACTTGAAACAAGTGGTTGAACAGAGTAATCAAATACTGTCATTTTCAGAATCTGTTATCCCTGAAGATTCTCTACAATATCATTATGCTCTAGGTGCGAGTTTGGCCATAATATTATGTGGAAGTTTTTAATAAGGGTCTTAGACTAGCAGTCTGTAAAATTTGCTGTCTTGCAGGATGGTAAAATGGCAGCTGCAAATTGAGCAGTTGTAAATAATATATATAATATTTATTTTTGTATCCGGTACATTAAACTGTAAATGAAATCTGAAGATTTTAATCTATGAGGATTCAGGTAGTAGTGATTCTGATTTTCAACAATTTGGGAAGTGATAGATATATTTACTTGGGACAATTAATTTATGCAGGATTTGTGTGTTCTTGATAATTGATTTGATAAGATTAGCTGCAAAACAGTTTTTATCTGATCGTTTCCTAATTCGAGTTGTCTGGATGACTCTGACGGCAATTTCATTTTCCATTATGATTGATAGGGGATGAATTTCAAAAGAATGATACAGAAAGTGACGAAAAGCATTCAAGTGGAGTTTTCTCAGAGCCTCATGGAAGCAGTGATGACTTTGATACTAGTATTCCTCGTTGCTTATCCTTCTCCTCTGGCACGAACAACAATAAGACCCTTGAAGAGGGGTCCCCATCTAAATCTCCTCGACATTATTCTATTTCTTCTGAATATTTTAACCCTGTAAATCATCAGCGAAGAATACTGACGTACTGTGAAGAAATATATTCTCTACTATTGGACCACGCTCCTCAAAAATCTGTTTCCATTGGTCCTGACCATCAAGCTGTTGTTCCACCTTGGAGACCACGGGAGGTGGAGGTTATGTCATACGTGTCTGGGTCAGATTTGAAGTCTACAGGTGACGAATACGAGAAGAGGTTGATTGGTACCTGTGTTATTCCAATGCCAGATATGGATTCATCCATCAGTTGTGGTCAAGAAGTTGGGAGTGGAAGAGAAGCTTGCAACTGTGAGGATGATGGTTCTGTGGGATGTGTTAGTACGCACATTGTAGAAGCAAGAGAGCAGCTTAAAAGTTCTATTGGACCAGATAGGTTTGTGGAGCTAGGGTTTTCTGAAATGGGAGAGCAGGTAGCACAGAAATGGAGTGAAGAAGAAGAACGGTTGTTTTATGAGGTTGTCTTTTCTAATCCAGTCTCACTGGGGAAAAACTTCTGGAGTGATCTTTCAGTCGTGTTTGCTTCCAAATCTAAAAAGGAGATTGTTAGCTATTACTTCAATGTCTTTATGCTTCGGAGGCGAGCAGAGCAGAACCGATGTGACTCATTAAACATTGATAGTGACAACGATGAATGGCAAGGAACTGATGATTATGGCGATAATGAACCTGGAATGACAGAAGAGGACGATGACTCTGTTGTTGAATCACCACTACATGATGTTGGACCTGGTTTCAGTCGCAGCAGGGAAGTTGAATTGCAAGAGTATGACGAGGATATTGCTGATGGATTTGATGATAATGAAAGTGGGGGTATTGGTAATTGTTT
TAATAATTGTGGTTCCAGTCCCATGCTTCAGGAAAAAATTCCTTGCGATGAAAGAGGAGAACATGAAGTTCAAGACGATTCTTGCACATCATCTGACACATGTCCAGCAACCCAAGAACTTCCAGCCAAGACGGAGCATTGTGATCAATGGCTAGGTAGCTTTACGGGGCCAAACAATAGTGTTGGTCTTGGTCACGAGCCATCATCAGTTCAGGAGCATTGTGATGCTAAGGTGTGGGACGTTGGATACTTGACTTGTTCGAAAAGTGAAGTTGACTTCTTGCCAACTAGCAGTATGATAGAAGAAGTTTTTGGAGATGATTCGAGTAATTACAAGGCAAGGGATGGGAAGAACTTGAGTTAG

mRNA sequence

ATGAGGTTATTTTGTTTCGTTGATACTAGAATAGAACTTGCTATTATTAATAGGATATCAAAGATCACAATTGGTGCTGCTTTCAAAATGGTGCAGAAACAGCCTTTTGATGATGGAGAACCACTGGAGATCTCTTCAAAGCACTTGAAACAAGTGGTTGAACAGAGTAATCAAATACTGTCATTTTCAGAATCTGTTATCCCTGAAGATTCTCTACAATATCATTATGCTCTAGGGGATGAATTTCAAAAGAATGATACAGAAAGTGACGAAAAGCATTCAAGTGGAGTTTTCTCAGAGCCTCATGGAAGCAGTGATGACTTTGATACTAGTATTCCTCGTTGCTTATCCTTCTCCTCTGGCACGAACAACAATAAGACCCTTGAAGAGGGGTCCCCATCTAAATCTCCTCGACATTATTCTATTTCTTCTGAATATTTTAACCCTGTAAATCATCAGCGAAGAATACTGACGTACTGTGAAGAAATATATTCTCTACTATTGGACCACGCTCCTCAAAAATCTGTTTCCATTGGTCCTGACCATCAAGCTGTTGTTCCACCTTGGAGACCACGGGAGGTGGAGGTTATGTCATACGTGTCTGGGTCAGATTTGAAGTCTACAGGTGACGAATACGAGAAGAGGTTGATTGGTACCTGTGTTATTCCAATGCCAGATATGGATTCATCCATCAGTTGTGGTCAAGAAGTTGGGAGTGGAAGAGAAGCTTGCAACTGTGAGGATGATGGTTCTGTGGGATGTGTTAGTACGCACATTGTAGAAGCAAGAGAGCAGCTTAAAAGTTCTATTGGACCAGATAGGTTTGTGGAGCTAGGGTTTTCTGAAATGGGAGAGCAGGTAGCACAGAAATGGAGTGAAGAAGAAGAACGGTTGTTTTATGAGGTTGTCTTTTCTAATCCAGTCTCACTGGGGAAAAACTTCTGGAGTGATCTTTCAGTCGTGTTTGCTTCCAAATCTAAAAAGGAGATTGTTAGCTATTACTTCAATGTCTTTATGCTTCGGAGGCGAGCAGAGCAGAACCGATGTGACTCATTAAACATTGATAGTGACAACGATGAATGGCAAGGAACTGATGATTATGGCGATAATGAACCTGGAATGACAGAAGAGGACGATGACTCTGTTGTTGAATCACCACTACATGATGTTGGACCTGGTTTCAGTCGCAGCAGGGAAGTTGAATTGCAAGAGTATGACGAGGATATTGCTGATGGATTTGATGATAATGAAAGTGGGGGTATTGGTAATTGTTTTAATAATTGTGGTTCCAGTCCCATGCTTCAGGAAAAAATTCCTTGCGATGAAAGAGGAGAACATGAAGTTCAAGACGATTCTTGCACATCATCTGACACATGTCCAGCAACCCAAGAACTTCCAGCCAAGACGGAGCATTGTGATCAATGGCTAGGTAGCTTTACGGGGCCAAACAATAGTGTTGGTCTTGGTCACGAGCCATCATCAGTTCAGGAGCATTGTGATGCTAAGGTGTGGGACGTTGGATACTTGACTTGTTCGAAAAGTGAAGTTGACTTCTTGCCAACTAGCAGTATGATAGAAGAAGTTTTTGGAGATGATTCGAGTAATTACAAGGCAAGGGATGGGAAGAACTTGAGTTAG

Coding sequence (CDS)

ATGAGGTTATTTTGTTTCGTTGATACTAGAATAGAACTTGCTATTATTAATAGGATATCAAAGATCACAATTGGTGCTGCTTTCAAAATGGTGCAGAAACAGCCTTTTGATGATGGAGAACCACTGGAGATCTCTTCAAAGCACTTGAAACAAGTGGTTGAACAGAGTAATCAAATACTGTCATTTTCAGAATCTGTTATCCCTGAAGATTCTCTACAATATCATTATGCTCTAGGGGATGAATTTCAAAAGAATGATACAGAAAGTGACGAAAAGCATTCAAGTGGAGTTTTCTCAGAGCCTCATGGAAGCAGTGATGACTTTGATACTAGTATTCCTCGTTGCTTATCCTTCTCCTCTGGCACGAACAACAATAAGACCCTTGAAGAGGGGTCCCCATCTAAATCTCCTCGACATTATTCTATTTCTTCTGAATATTTTAACCCTGTAAATCATCAGCGAAGAATACTGACGTACTGTGAAGAAATATATTCTCTACTATTGGACCACGCTCCTCAAAAATCTGTTTCCATTGGTCCTGACCATCAAGCTGTTGTTCCACCTTGGAGACCACGGGAGGTGGAGGTTATGTCATACGTGTCTGGGTCAGATTTGAAGTCTACAGGTGACGAATACGAGAAGAGGTTGATTGGTACCTGTGTTATTCCAATGCCAGATATGGATTCATCCATCAGTTGTGGTCAAGAAGTTGGGAGTGGAAGAGAAGCTTGCAACTGTGAGGATGATGGTTCTGTGGGATGTGTTAGTACGCACATTGTAGAAGCAAGAGAGCAGCTTAAAAGTTCTATTGGACCAGATAGGTTTGTGGAGCTAGGGTTTTCTGAAATGGGAGAGCAGGTAGCACAGAAATGGAGTGAAGAAGAAGAACGGTTGTTTTATGAGGTTGTCTTTTCTAATCCAGTCTCACTGGGGAAAAACTTCTGGAGTGATCTTTCAGTCGTGTTTGCTTCCAAATCTAAAAAGGAGATTGTTAGCTATTACTTCAATGTCTTTATGCTTCGGAGGCGAGCAGAGCAGAACCGATGTGACTCATTAAACATTGATAGTGACAACGATGAATGGCAAGGAACTGATGATTATGGCGATAATGAACCTGGAATGACAGAAGAGGACGATGACTCTGTTGTTGAATCACCACTACATGATGTTGGACCTGGTTTCAGTCGCAGCAGGGAAGTTGAATTGCAAGAGTATGACGAGGATATTGCTGATGGATTTGATGATAATGAAAGTGGGGGTATTGGTAATTGTTTTAATAATTGTGGTTCCAGTCCCATGCTTCAGGAAAAAATTCCTTGCGATGAAAGAGGAGAACATGAAGTTCAAGACGATTCTTGCACATCATCTGACACATGTCCAGCAACCCAAGAACTTCCAGCCAAGACGGAGCATTGTGATCAATGGCTAGGTAGCTTTACGGGGCCAAACAATAGTGTTGGTCTTGGTCACGAGCCATCATCAGTTCAGGAGCATTGTGATGCTAAGGTGTGGGACGTTGGATACTTGACTTGTTCGAAAAGTGAAGTTGACTTCTTGCCAACTAGCAGTATGATAGAAGAAGTTTTTGGAGATGATTCGAGTAATTACAAGGCAAGGGATGGGAAGAACTTGAGTTAG

Protein sequence

MRLFCFVDTRIELAIINRISKITIGAAFKMVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTESDEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNPVNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKSTGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLKSSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSKKEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPLHDVGPGFSRSREVELQEYDEDIADGFDDNESGGIGNCFNNCGSSPMLQEKIPCDERGEHEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWDVGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS
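The protein sequence above is expected to be the conceptual translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame 0. The function name `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# at each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed in this record.
print(translate("ATGAGGTTATTTTGTTTCGTTGATACTAGA"))
print(translate("AACTTGAGTTAG"))
```

Running `translate` on the full CDS string and comparing the result with the listed protein is a quick consistency check for the record.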
Homology
BLAST of HG10008547 vs. NCBI nr
Match: XP_038879062.1 (uncharacterized protein LOC120071091 [Benincasa hispida] >XP_038879063.1 uncharacterized protein LOC120071091 [Benincasa hispida])

HSP 1 Score: 993.8 bits (2568), Expect = 5.8e-286
Identity = 495/520 (95.19%), Positives = 506/520 (97.31%), Query Frame = 0

Query: 30  MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 89
           MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES
Sbjct: 1   MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 60

Query: 90  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNP 149
           DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISS++FNP
Sbjct: 61  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSDFFNP 120

Query: 150 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKS-- 209
           VNHQRRILTYCEEIYSLLLDHAPQKSVSIGP+HQAVVPPWRPREVEV+SYVSGSD KS  
Sbjct: 121 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPEHQAVVPPWRPREVEVISYVSGSDSKSDL 180

Query: 210 TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLK 269
           TGDEYEKRLIGTCVIPMP MDSSISCGQEVGSGREAC+CED GSVGCV++HI EAREQL+
Sbjct: 181 TGDEYEKRLIGTCVIPMPYMDSSISCGQEVGSGREACSCEDGGSVGCVNSHIAEAREQLR 240

Query: 270 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 329
           SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASK+K
Sbjct: 241 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKTK 300

Query: 330 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 389
           KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL
Sbjct: 301 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 360

Query: 390 HDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQEKIPCDER-GE 449
           HDVGPGF+RSRE ELQEYDEDIAD  FDDNESGGIGNCFNNCGSSP LQ+KIP D R GE
Sbjct: 361 HDVGPGFNRSREDELQEYDEDIADERFDDNESGGIGNCFNNCGSSPTLQDKIPRDGRVGE 420

Query: 450 HEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWD 509
            EVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWD
Sbjct: 421 LEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWD 480

Query: 510 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS
Sbjct: 481 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 520
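The percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them from a header line, assuming the header layout used on this page. `hsp_percentages` is a hypothetical helper, and the pattern tolerates both the "Positives" spelling and the "Postives" variant that some exports contain.

```python
import re
from typing import Tuple

# Captures "Identity = a/b" and "Positives = c/d" counts from an HSP header.
_HEADER_RE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(header: str) -> Tuple[float, float]:
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = _HEADER_RE.search(header)
    if match is None:
        raise ValueError(f"not an HSP header line: {header!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


header = "Identity = 495/520 (95.19%), Positives = 506/520 (97.31%), Query Frame = 0"
print(hsp_percentages(header))
```

For the first hit this reproduces the reported 95.19% identity and 97.31% positives over a 520-column alignment.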

BLAST of HG10008547 vs. NCBI nr
Match: XP_004138186.1 (uncharacterized protein LOC101205795 [Cucumis sativus] >KGN63712.1 hypothetical protein Csa_013330 [Cucumis sativus])

HSP 1 Score: 954.1 bits (2465), Expect = 5.1e-274
Identity = 473/520 (90.96%), Positives = 491/520 (94.42%), Query Frame = 0

Query: 30  MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 89
           MVQKQPFDDGEPLEISSK LKQVVEQSNQILSFSESVIPEDSLQYHY LGDEFQKNDTES
Sbjct: 1   MVQKQPFDDGEPLEISSKRLKQVVEQSNQILSFSESVIPEDSLQYHYGLGDEFQKNDTES 60

Query: 90  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNP 149
           DEKHSSGVFSEPHGSSDDFDTS+P CLSFSSGTNNNKTLEEGSPSKSP HYSISS++FNP
Sbjct: 61  DEKHSSGVFSEPHGSSDDFDTSVPHCLSFSSGTNNNKTLEEGSPSKSPPHYSISSDFFNP 120

Query: 150 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKS-- 209
           VNHQRRILTYCEEIYSLLLDHAPQKSVSIGP+HQA+VPPWRPREV+V+ +  GSD KS  
Sbjct: 121 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPEHQAIVPPWRPREVDVILHAPGSDSKSNF 180

Query: 210 TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLK 269
           TGDEYEKRL GTCVIPMPD+DSSIS GQEVGSGR AC+CED GSVGCVSTHI EAREQLK
Sbjct: 181 TGDEYEKRLTGTCVIPMPDVDSSISSGQEVGSGRAACSCEDCGSVGCVSTHIAEAREQLK 240

Query: 270 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 329
           SSIGPDRF +LGFSEMGEQ+AQKWSEEEERLFYEVVFSNPVS+GKNFWSDLSVVFASKSK
Sbjct: 241 SSIGPDRFADLGFSEMGEQLAQKWSEEEERLFYEVVFSNPVSMGKNFWSDLSVVFASKSK 300

Query: 330 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 389
           +EIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEW GTDDYGDNEPGMTEEDDDSVVESPL
Sbjct: 301 REIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWPGTDDYGDNEPGMTEEDDDSVVESPL 360

Query: 390 HDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQEKIPCDER-GE 449
           HD+G  F RSRE ELQEYDEDIAD  FDD+ESGGIGNCFNNCGSSP LQEKIP DER G+
Sbjct: 361 HDIGSCFDRSREDELQEYDEDIADERFDDDESGGIGNCFNNCGSSPTLQEKIPHDERGGD 420

Query: 450 HEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWD 509
           HEVQDDSCTSSDTCPATQ LPAKTEHCDQWL SFTGPNN VGLGHEPSSVQEHCDAKVWD
Sbjct: 421 HEVQDDSCTSSDTCPATQVLPAKTEHCDQWLSSFTGPNNGVGLGHEPSSVQEHCDAKVWD 480

Query: 510 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGK+LS
Sbjct: 481 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKSLS 520

BLAST of HG10008547 vs. NCBI nr
Match: XP_008453251.1 (PREDICTED: uncharacterized protein LOC103494027 [Cucumis melo] >KAA0057981.1 uncharacterized protein E6C27_scaffold274G002850 [Cucumis melo var. makuwa])

HSP 1 Score: 951.0 bits (2457), Expect = 4.3e-273
Identity = 476/520 (91.54%), Positives = 488/520 (93.85%), Query Frame = 0

Query: 30  MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 89
           MVQKQPFDDGEPLEISSK LKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES
Sbjct: 1   MVQKQPFDDGEPLEISSKRLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 60

Query: 90  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNP 149
           DEKHSSGVFSEPHGSSDDFDTS+P CLSFSSGTNNNKTLEEGSPSKSP HYSISSE+FNP
Sbjct: 61  DEKHSSGVFSEPHGSSDDFDTSVPHCLSFSSGTNNNKTLEEGSPSKSPPHYSISSEFFNP 120

Query: 150 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKS-- 209
           VNHQRRILTYCEEIYSLLLDHAPQKSVSIGP+HQA+VPPWRPRE        GSD KS  
Sbjct: 121 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPEHQAIVPPWRPRE------APGSDSKSNF 180

Query: 210 TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLK 269
           TGDEYEKRL GTC+IPMPD+DSSIS GQEVGSGR AC+CED GSVGCVSTHI EAREQLK
Sbjct: 181 TGDEYEKRLTGTCIIPMPDLDSSISSGQEVGSGRAACSCEDSGSVGCVSTHIAEAREQLK 240

Query: 270 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 329
           SS+GPDRF +LGFSEMGEQ+AQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK
Sbjct: 241 SSVGPDRFADLGFSEMGEQLAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 300

Query: 330 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 389
           KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEW G+DDYGDNEPGMTEEDDDSVVESPL
Sbjct: 301 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWPGSDDYGDNEPGMTEEDDDSVVESPL 360

Query: 390 HDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQEKIPCDER-GE 449
           HDVG  F RSRE ELQEYDEDIAD  FDDNESGGIGNCFNNCGSSP LQ+KIP DER GE
Sbjct: 361 HDVGSCFDRSREDELQEYDEDIADERFDDNESGGIGNCFNNCGSSPTLQDKIPHDERGGE 420

Query: 450 HEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWD 509
           HEVQDDSCTSSDT PATQELPAKTEHCDQWL SFTGPNNSVGLGHEPSSVQEHCDAKVWD
Sbjct: 421 HEVQDDSCTSSDTFPATQELPAKTEHCDQWLSSFTGPNNSVGLGHEPSSVQEHCDAKVWD 480

Query: 510 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS
Sbjct: 481 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 514

BLAST of HG10008547 vs. NCBI nr
Match: KAG6589536.1 (Plant intracellular Ras-group-related LRR protein 7, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 941.8 bits (2433), Expect = 2.6e-270
Identity = 467/529 (88.28%), Positives = 487/529 (92.06%), Query Frame = 0

Query: 20  SKITIGAAFKMVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALG 79
           S+ITIGAAFKMVQ QPFDDGE +EISSKHLKQVVEQSNQILSFSESVIPED+LQYHYALG
Sbjct: 396 SEITIGAAFKMVQNQPFDDGESMEISSKHLKQVVEQSNQILSFSESVIPEDTLQYHYALG 455

Query: 80  DEFQKNDTESDEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRH 139
           DE QKNDTE DEKHSS VF E HGS  DFDTS P C+SFSSG NN+K  EEGSPSKSPRH
Sbjct: 456 DESQKNDTEIDEKHSSAVFLELHGS--DFDTSFPGCISFSSGANNSKAPEEGSPSKSPRH 515

Query: 140 YSISSEYFNPVNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSY 199
           YSISS++FNPVNHQRRILTYCEEIYSLLLDH PQKSVSIGPDHQA+VPPWR R+VEVMSY
Sbjct: 516 YSISSDFFNPVNHQRRILTYCEEIYSLLLDHPPQKSVSIGPDHQAIVPPWRSRDVEVMSY 575

Query: 200 VSGSDLKS--TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVST 259
           VSGSD KS  TGDEYEKRLIGTCVIPMPD DS ISC QE GSGR AC+CED GSVGCV+ 
Sbjct: 576 VSGSDSKSDLTGDEYEKRLIGTCVIPMPDEDSPISCSQEFGSGRAACSCEDGGSVGCVNM 635

Query: 260 HIVEAREQLKSSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSD 319
           HI EAREQLKSSIG +RFVELGF EMGEQVAQKWSEEEERLFYEVVFSNPVS+GKNFW D
Sbjct: 636 HIAEAREQLKSSIGQERFVELGFFEMGEQVAQKWSEEEERLFYEVVFSNPVSVGKNFWGD 695

Query: 320 LSVVFASKSKKEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEE 379
           LSVVFASKSK+EIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEE
Sbjct: 696 LSVVFASKSKREIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEE 755

Query: 380 DDDSVVESPLHDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQE 439
           DDDSVVESPLHDVG GF+R RE +LQEYDEDIAD  FDD+ESGGIGNCFNNCGS+P L  
Sbjct: 756 DDDSVVESPLHDVGSGFNRGREDDLQEYDEDIADERFDDDESGGIGNCFNNCGSTPTLPN 815

Query: 440 KIPCDERGEHEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQ 499
           KIPCDERGEHEVQDDSCTSSDTCP TQ LPAKTEHCDQWLGSFTGPNNS GLGHEPS+VQ
Sbjct: 816 KIPCDERGEHEVQDDSCTSSDTCPTTQALPAKTEHCDQWLGSFTGPNNSAGLGHEPSTVQ 875

Query: 500 EHCDAKVWDVGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           EHCDAKVWDVGYLTCSKSEVDFLPTSSMIEEVFGDDSS+YKA DGKNLS
Sbjct: 876 EHCDAKVWDVGYLTCSKSEVDFLPTSSMIEEVFGDDSSHYKAMDGKNLS 922

BLAST of HG10008547 vs. NCBI nr
Match: KAG7023221.1 (AT-rich interactive domain-containing protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 939.1 bits (2426), Expect = 1.7e-269
Identity = 467/532 (87.78%), Positives = 487/532 (91.54%), Query Frame = 0

Query: 17  NRISKITIGAAFKMVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHY 76
           N +S ITIGAAFKMVQ QPFDDGE +EISSKHLKQVVEQSNQILSFSESVIPED+LQYHY
Sbjct: 16  NILSFITIGAAFKMVQNQPFDDGESMEISSKHLKQVVEQSNQILSFSESVIPEDTLQYHY 75

Query: 77  ALGDEFQKNDTESDEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKS 136
           ALGDE QKNDTE DEKHSS VF E HGS  DFDTS P C+SFSSG NN+K  EEGSPSKS
Sbjct: 76  ALGDESQKNDTEIDEKHSSAVFLELHGS--DFDTSFPGCISFSSGANNSKAPEEGSPSKS 135

Query: 137 PRHYSISSEYFNPVNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEV 196
           PRHYSISS++FNPVNHQRRILTYCEEIYSLLLDH PQKSVSIGPDHQA+VPPWR R+VEV
Sbjct: 136 PRHYSISSDFFNPVNHQRRILTYCEEIYSLLLDHPPQKSVSIGPDHQAIVPPWRSRDVEV 195

Query: 197 MSYVSGSDLKS--TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGC 256
           MSYVSGSD KS  TG EYEKRLIGTCVIPMPD DS ISC QE GSGR AC+CED GSVGC
Sbjct: 196 MSYVSGSDSKSDLTGAEYEKRLIGTCVIPMPDEDSPISCSQEFGSGRAACSCEDGGSVGC 255

Query: 257 VSTHIVEAREQLKSSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNF 316
           V+ HI EAREQLKSSIG +RFVELGF EMGEQVAQKWSEEEERLFYEVVFSNPVS+GKNF
Sbjct: 256 VNMHIAEAREQLKSSIGQERFVELGFFEMGEQVAQKWSEEEERLFYEVVFSNPVSVGKNF 315

Query: 317 WSDLSVVFASKSKKEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGM 376
           W DLSVVFASKSK+EIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGM
Sbjct: 316 WGDLSVVFASKSKREIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGM 375

Query: 377 TEEDDDSVVESPLHDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPM 436
           TEEDDDSVVESPLHDVG GF+R RE +LQEYDEDIAD  FDD+ESGGIGNCFNNCGS+P 
Sbjct: 376 TEEDDDSVVESPLHDVGSGFNRGREDDLQEYDEDIADERFDDDESGGIGNCFNNCGSTPT 435

Query: 437 LQEKIPCDERGEHEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPS 496
           L  KIPCDERGEHEVQDDSCTSSDTCP TQ LPAKTEHCDQWLGSFTGPNNS GLGHEPS
Sbjct: 436 LPNKIPCDERGEHEVQDDSCTSSDTCPTTQALPAKTEHCDQWLGSFTGPNNSAGLGHEPS 495

Query: 497 SVQEHCDAKVWDVGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           +VQEHCDAKVWDVGYLTCSKSEVDFLPTSSMIEEVFGDDSS+YKA DGKNLS
Sbjct: 496 TVQEHCDAKVWDVGYLTCSKSEVDFLPTSSMIEEVFGDDSSHYKAMDGKNLS 545

BLAST of HG10008547 vs. ExPASy Swiss-Prot
Match: Q9LDD4 (AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=ARID2 PE=1 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 4.5e-23
Identity = 68/215 (31.63%), Positives = 116/215 (53.95%), Query Frame = 0

Query: 172 PQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKSTGDEYEKRLIGTCVIPMPDMDS-S 231
           P++ + +G  HQA V  W                  +G + + + +GT + P  + ++  
Sbjct: 370 PRRCIKVGHQHQAQVDEW----------------TESGVDSDSKWLGTRIWPPENSEALD 429

Query: 232 ISCGQE-VGSGR-EACNCEDDGSVGCVSTHIVEAREQLKSSIGPDRFVELGFSEMGEQVA 291
            + G + VG GR ++C+CE  G V C   HI E R +LK  +G D F    F++MGE+V 
Sbjct: 430 QTLGNDLVGKGRPDSCSCELSGFVECTRLHIAEKRMELKRELGDD-FFHWRFNQMGEEVC 489

Query: 292 QKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSKKEIVSYYFNVFMLRRRAEQNR 351
            +W+EEEE+ F +++ ++P    ++FW++ +  F  K ++E+VSYYFNVF++ RR  QNR
Sbjct: 490 LRWTEEEEKRFKDMIIADP----QSFWTNAAKNFPKKKREELVSYYFNVFLINRRRYQNR 549

Query: 352 CDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVV 384
               +IDSD++   G+         +T    D ++
Sbjct: 550 VTPKSIDSDDEGAFGSVGGSFGRDAVTSSGSDVMI 563

BLAST of HG10008547 vs. ExPASy Swiss-Prot
Match: Q84JT7 (AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=ARID1 PE=2 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.6e-15
Identity = 59/196 (30.10%), Positives = 92/196 (46.94%), Query Frame = 0

Query: 169 DHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKSTGDEYEKRLIGTCVIPMPDMD 228
           D   +    +G   QA VP W                     E + + +GT + P+    
Sbjct: 353 DEEDRPCALVGSKFQAKVPEW----------------TGITPESDSKWLGTRIWPLTKEQ 412

Query: 229 SSISC---GQEVGSGR-EACNCEDDGSVGCVSTHIVEAREQLKSSIGPDRFVELGFSEMG 288
           +  +       +G GR + C C + GS+ CV  HI   R++LK  +GP  F    F  MG
Sbjct: 413 TKANLLIERDRIGKGRQDPCGCHNPGSIECVKFHITAKRDKLKLELGP-AFYMWCFDVMG 472

Query: 289 EQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSKKEIVSYYFNVFMLRRRA 348
           E   Q W++ E +   + + S+P SL   F     ++  SKS+ +IVSY++NV +L+ RA
Sbjct: 473 ECTLQYWTDLELKKI-KSLMSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRA 530

Query: 349 EQNRCDSLNIDSDNDE 361
            Q+R    +IDSD D+
Sbjct: 533 SQSRITPHDIDSDTDQ 530

BLAST of HG10008547 vs. ExPASy TrEMBL
Match: A0A0A0LSE5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G012130 PE=4 SV=1)

HSP 1 Score: 954.1 bits (2465), Expect = 2.5e-274
Identity = 473/520 (90.96%), Positives = 491/520 (94.42%), Query Frame = 0

Query: 30  MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 89
           MVQKQPFDDGEPLEISSK LKQVVEQSNQILSFSESVIPEDSLQYHY LGDEFQKNDTES
Sbjct: 1   MVQKQPFDDGEPLEISSKRLKQVVEQSNQILSFSESVIPEDSLQYHYGLGDEFQKNDTES 60

Query: 90  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNP 149
           DEKHSSGVFSEPHGSSDDFDTS+P CLSFSSGTNNNKTLEEGSPSKSP HYSISS++FNP
Sbjct: 61  DEKHSSGVFSEPHGSSDDFDTSVPHCLSFSSGTNNNKTLEEGSPSKSPPHYSISSDFFNP 120

Query: 150 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKS-- 209
           VNHQRRILTYCEEIYSLLLDHAPQKSVSIGP+HQA+VPPWRPREV+V+ +  GSD KS  
Sbjct: 121 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPEHQAIVPPWRPREVDVILHAPGSDSKSNF 180

Query: 210 TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLK 269
           TGDEYEKRL GTCVIPMPD+DSSIS GQEVGSGR AC+CED GSVGCVSTHI EAREQLK
Sbjct: 181 TGDEYEKRLTGTCVIPMPDVDSSISSGQEVGSGRAACSCEDCGSVGCVSTHIAEAREQLK 240

Query: 270 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 329
           SSIGPDRF +LGFSEMGEQ+AQKWSEEEERLFYEVVFSNPVS+GKNFWSDLSVVFASKSK
Sbjct: 241 SSIGPDRFADLGFSEMGEQLAQKWSEEEERLFYEVVFSNPVSMGKNFWSDLSVVFASKSK 300

Query: 330 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 389
           +EIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEW GTDDYGDNEPGMTEEDDDSVVESPL
Sbjct: 301 REIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWPGTDDYGDNEPGMTEEDDDSVVESPL 360

Query: 390 HDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQEKIPCDER-GE 449
           HD+G  F RSRE ELQEYDEDIAD  FDD+ESGGIGNCFNNCGSSP LQEKIP DER G+
Sbjct: 361 HDIGSCFDRSREDELQEYDEDIADERFDDDESGGIGNCFNNCGSSPTLQEKIPHDERGGD 420

Query: 450 HEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWD 509
           HEVQDDSCTSSDTCPATQ LPAKTEHCDQWL SFTGPNN VGLGHEPSSVQEHCDAKVWD
Sbjct: 421 HEVQDDSCTSSDTCPATQVLPAKTEHCDQWLSSFTGPNNGVGLGHEPSSVQEHCDAKVWD 480

Query: 510 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGK+LS
Sbjct: 481 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKSLS 520

BLAST of HG10008547 vs. ExPASy TrEMBL
Match: A0A5A7UWW6 (SANT domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G002850 PE=4 SV=1)

HSP 1 Score: 951.0 bits (2457), Expect = 2.1e-273
Identity = 476/520 (91.54%), Positives = 488/520 (93.85%), Query Frame = 0

Query: 30  MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 89
           MVQKQPFDDGEPLEISSK LKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES
Sbjct: 1   MVQKQPFDDGEPLEISSKRLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 60

Query: 90  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNP 149
           DEKHSSGVFSEPHGSSDDFDTS+P CLSFSSGTNNNKTLEEGSPSKSP HYSISSE+FNP
Sbjct: 61  DEKHSSGVFSEPHGSSDDFDTSVPHCLSFSSGTNNNKTLEEGSPSKSPPHYSISSEFFNP 120

Query: 150 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKS-- 209
           VNHQRRILTYCEEIYSLLLDHAPQKSVSIGP+HQA+VPPWRPRE        GSD KS  
Sbjct: 121 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPEHQAIVPPWRPRE------APGSDSKSNF 180

Query: 210 TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLK 269
           TGDEYEKRL GTC+IPMPD+DSSIS GQEVGSGR AC+CED GSVGCVSTHI EAREQLK
Sbjct: 181 TGDEYEKRLTGTCIIPMPDLDSSISSGQEVGSGRAACSCEDSGSVGCVSTHIAEAREQLK 240

Query: 270 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 329
           SS+GPDRF +LGFSEMGEQ+AQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK
Sbjct: 241 SSVGPDRFADLGFSEMGEQLAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 300

Query: 330 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 389
           KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEW G+DDYGDNEPGMTEEDDDSVVESPL
Sbjct: 301 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWPGSDDYGDNEPGMTEEDDDSVVESPL 360

Query: 390 HDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQEKIPCDER-GE 449
           HDVG  F RSRE ELQEYDEDIAD  FDDNESGGIGNCFNNCGSSP LQ+KIP DER GE
Sbjct: 361 HDVGSCFDRSREDELQEYDEDIADERFDDNESGGIGNCFNNCGSSPTLQDKIPHDERGGE 420

Query: 450 HEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWD 509
           HEVQDDSCTSSDT PATQELPAKTEHCDQWL SFTGPNNSVGLGHEPSSVQEHCDAKVWD
Sbjct: 421 HEVQDDSCTSSDTFPATQELPAKTEHCDQWLSSFTGPNNSVGLGHEPSSVQEHCDAKVWD 480

Query: 510 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS
Sbjct: 481 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 514

BLAST of HG10008547 vs. ExPASy TrEMBL
Match: A0A1S3BV79 (uncharacterized protein LOC103494027 OS=Cucumis melo OX=3656 GN=LOC103494027 PE=4 SV=1)

HSP 1 Score: 951.0 bits (2457), Expect = 2.1e-273
Identity = 476/520 (91.54%), Positives = 488/520 (93.85%), Query Frame = 0

Query: 30  MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 89
           MVQKQPFDDGEPLEISSK LKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES
Sbjct: 1   MVQKQPFDDGEPLEISSKRLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 60

Query: 90  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNP 149
           DEKHSSGVFSEPHGSSDDFDTS+P CLSFSSGTNNNKTLEEGSPSKSP HYSISSE+FNP
Sbjct: 61  DEKHSSGVFSEPHGSSDDFDTSVPHCLSFSSGTNNNKTLEEGSPSKSPPHYSISSEFFNP 120

Query: 150 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKS-- 209
           VNHQRRILTYCEEIYSLLLDHAPQKSVSIGP+HQA+VPPWRPRE        GSD KS  
Sbjct: 121 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPEHQAIVPPWRPRE------APGSDSKSNF 180

Query: 210 TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLK 269
           TGDEYEKRL GTC+IPMPD+DSSIS GQEVGSGR AC+CED GSVGCVSTHI EAREQLK
Sbjct: 181 TGDEYEKRLTGTCIIPMPDLDSSISSGQEVGSGRAACSCEDSGSVGCVSTHIAEAREQLK 240

Query: 270 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 329
           SS+GPDRF +LGFSEMGEQ+AQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK
Sbjct: 241 SSVGPDRFADLGFSEMGEQLAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 300

Query: 330 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 389
           KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEW G+DDYGDNEPGMTEEDDDSVVESPL
Sbjct: 301 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWPGSDDYGDNEPGMTEEDDDSVVESPL 360

Query: 390 HDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQEKIPCDER-GE 449
           HDVG  F RSRE ELQEYDEDIAD  FDDNESGGIGNCFNNCGSSP LQ+KIP DER GE
Sbjct: 361 HDVGSCFDRSREDELQEYDEDIADERFDDNESGGIGNCFNNCGSSPTLQDKIPHDERGGE 420

Query: 450 HEVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWD 509
           HEVQDDSCTSSDT PATQELPAKTEHCDQWL SFTGPNNSVGLGHEPSSVQEHCDAKVWD
Sbjct: 421 HEVQDDSCTSSDTFPATQELPAKTEHCDQWLSSFTGPNNSVGLGHEPSSVQEHCDAKVWD 480

Query: 510 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS
Sbjct: 481 VGYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 514

BLAST of HG10008547 vs. ExPASy TrEMBL
Match: A0A6J1JIR0 (uncharacterized protein LOC111484951 OS=Cucurbita maxima OX=3661 GN=LOC111484951 PE=4 SV=1)

HSP 1 Score: 934.5 bits (2414), Expect = 2.0e-268
Identity = 463/519 (89.21%), Positives = 481/519 (92.68%), Query Frame = 0

Query: 30  MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 89
           MVQ QPFDDGE +EISSKHLKQVVEQSNQILSFSESVIPED+LQYHYALGDE QKNDTE 
Sbjct: 1   MVQNQPFDDGESMEISSKHLKQVVEQSNQILSFSESVIPEDTLQYHYALGDESQKNDTEI 60

Query: 90  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNP 149
           DEKHSS VFSEPHGS  DFDTSIP C+SFSSG NN+K  EEGSPSKSPRHYSISSE+FNP
Sbjct: 61  DEKHSSAVFSEPHGS--DFDTSIPGCISFSSGANNSKAPEEGSPSKSPRHYSISSEFFNP 120

Query: 150 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKS-- 209
           VNHQRRILTYCEEIYSLLLDH PQKSVSIGPDHQA+VPPWR R+VEVMSYVSGSD KS  
Sbjct: 121 VNHQRRILTYCEEIYSLLLDHPPQKSVSIGPDHQAIVPPWRSRDVEVMSYVSGSDSKSDL 180

Query: 210 TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLK 269
           TG+EYEKRLIGTCVIPMPD+DS ISC QE GSGR AC+CED GSVGCV+ HI EAREQLK
Sbjct: 181 TGNEYEKRLIGTCVIPMPDVDSPISCSQEFGSGRAACSCEDGGSVGCVNMHIAEAREQLK 240

Query: 270 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 329
           S+IG +RFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVS+GKNFW DLSVVFASKSK
Sbjct: 241 SNIGQERFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSVGKNFWGDLSVVFASKSK 300

Query: 330 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 389
           KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL
Sbjct: 301 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 360

Query: 390 HDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQEKIPCDERGEH 449
           HDVG GF R RE +LQEYDEDIAD  FDD+ESGGIG CFNNCGS+P L  KIPCDERGEH
Sbjct: 361 HDVGSGFDRGREDDLQEYDEDIADERFDDDESGGIGICFNNCGSTPTLPNKIPCDERGEH 420

Query: 450 EVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWDV 509
           EVQDDSCTSSDTCP TQ LPAKTEHCDQWLGSFTGPNNSVGLGHEPS+VQEHCDAKVWDV
Sbjct: 421 EVQDDSCTSSDTCPTTQALPAKTEHCDQWLGSFTGPNNSVGLGHEPSTVQEHCDAKVWDV 480

Query: 510 GYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           GYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKA DGKNLS
Sbjct: 481 GYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKAMDGKNLS 517

BLAST of HG10008547 vs. ExPASy TrEMBL
Match: A0A6J1E4L3 (uncharacterized protein LOC111429872 OS=Cucurbita moschata OX=3662 GN=LOC111429872 PE=4 SV=1)

HSP 1 Score: 929.9 bits (2402), Expect = 5.0e-267
Identity = 460/519 (88.63%), Positives = 479/519 (92.29%), Query Frame = 0

Query: 30  MVQKQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTES 89
           MVQ QPFDDGE +EISSKHLKQVVEQSNQILSFSESVIPED+LQYHYALGDE QKNDTE 
Sbjct: 1   MVQNQPFDDGESMEISSKHLKQVVEQSNQILSFSESVIPEDTLQYHYALGDESQKNDTEI 60

Query: 90  DEKHSSGVFSEPHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEYFNP 149
           DEKHSS VF EPHGS  DFDTSIP C+SFSSG NN+K  EEGSPSKSPRHYSISS++FNP
Sbjct: 61  DEKHSSAVFLEPHGS--DFDTSIPGCISFSSGANNSKAPEEGSPSKSPRHYSISSDFFNP 120

Query: 150 VNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKS-- 209
           VNHQRRILTYCEEIYSLLLDH PQKSVSIGPDHQA+VPPWR R+VEVMSY+SGSD KS  
Sbjct: 121 VNHQRRILTYCEEIYSLLLDHPPQKSVSIGPDHQAIVPPWRSRDVEVMSYMSGSDSKSDL 180

Query: 210 TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLK 269
           TGDEYEKRLIGTCVIPMPD DS ISC QE GSGR AC+CED GSVGCVS HI EAREQLK
Sbjct: 181 TGDEYEKRLIGTCVIPMPDEDSPISCSQEFGSGRAACSCEDGGSVGCVSMHIAEAREQLK 240

Query: 270 SSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSK 329
           SSIG +RFVELGF EMGEQVAQKWSEEEERLFYEVVFSNPVS+GKNFW DLSVVFASKSK
Sbjct: 241 SSIGQERFVELGFFEMGEQVAQKWSEEEERLFYEVVFSNPVSVGKNFWGDLSVVFASKSK 300

Query: 330 KEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 389
           +EIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL
Sbjct: 301 REIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPL 360

Query: 390 HDVGPGFSRSREVELQEYDEDIAD-GFDDNESGGIGNCFNNCGSSPMLQEKIPCDERGEH 449
           HDVG GF+R RE +LQEYDEDIAD  FDD+ESGGIGNCFNNCGS+P L  KIPCDERGEH
Sbjct: 361 HDVGSGFNRGREDDLQEYDEDIADERFDDDESGGIGNCFNNCGSTPTLPNKIPCDERGEH 420

Query: 450 EVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWDV 509
           EVQDDSCTSSDTCP TQ LPAKTEHCDQWLGSFTGPNNS GLGHEPS+VQEHCDAKVWDV
Sbjct: 421 EVQDDSCTSSDTCPTTQALPAKTEHCDQWLGSFTGPNNSAGLGHEPSTVQEHCDAKVWDV 480

Query: 510 GYLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARDGKNLS 546
           GYLTCSKSEVDFLPTSSMIEEVFGDDSS+YKA DGKNLS
Sbjct: 481 GYLTCSKSEVDFLPTSSMIEEVFGDDSSHYKAMDGKNLS 517

BLAST of HG10008547 vs. TAIR 10
Match: AT1G26580.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: ELM2 domain-containing protein (TAIR:AT2G03470.1); Has 161 Blast hits to 161 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 4; Plants - 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 232.3 bits (591), Expect = 9.5e-61
Identity = 177/523 (33.84%), Positives = 258/523 (49.33%), Query Frame = 0

Query: 33  KQPFDDGEPLEISSKHLKQVVEQSNQILSFSESVIPEDSLQYHYALGDEFQKNDTESDEK 92
           K+PF+D +  E+  KH +Q ++ +++   F E        Q   A  +E     ++  E 
Sbjct: 4   KRPFEDEKFHELPLKHSRQ-LDYNDKSTQFEEVSPHHAGFQKTVATVNEGNLCKSQGGES 63

Query: 93  HSSGVFSE------PHGSSDDFDTSIPRCLSFSSGTNNNKTLEEGSPSKSPRHYSISSEY 152
               +F E      P    DD         +F+  T       +G   +   +   S +Y
Sbjct: 64  SEGDMFDEESNYVYPGHDMDD---------TFTWDT-------QGCGGRDATYSPHSGKY 123

Query: 153 FNPVNHQRRILTYCEEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLK 212
           F  ++   R+    E  Y LLLD   +K V IGP HQA +P W   +   +   SG  ++
Sbjct: 124 FE-LDIPPRVFAPVETFYYLLLDQRAKKQVPIGPGHQAEIPEWEGSQTGNIE-TSGMSVQ 183

Query: 213 S--TGDEYEKRLIGTCVIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEARE 272
           +  +G    ++L GT VIPMP + +       VG GR+ C C D  SV CV  HI EARE
Sbjct: 184 NHISGCADGEKLFGTSVIPMPGLTTVAHIDDIVGKGRKFCVCRDRDSVRCVCQHIKEARE 243

Query: 273 QLKSSIGPDRFVELGFSEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFAS 332
           +L  + G + F ELG  EMGE+ A KWS+E+ +LF+EVV+SNPV+LG+NFW  L   F S
Sbjct: 244 ELVKTFGNETFKELGLCEMGEKGALKWSDEDAQLFHEVVYSNPVTLGQNFWRHLEAAFCS 303

Query: 333 KSKKEIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVE 392
           +++KEIVS+YFNVF+LRRRA QNR   L+IDSD+DEW G          + E+++DS +E
Sbjct: 304 RTQKEIVSFYFNVFVLRRRAIQNRAFILDIDSDDDEWHGCYGGSSGTRYVEEDEEDSAIE 363

Query: 393 SPLHDVGPGFSRSREVELQEYDEDIA-----DGFDDNESGGIGNCFNNCGSSPMLQEKIP 452
           SPLH    G  +   +  +E +ED++     +  DD + GG G    +  SS +      
Sbjct: 364 SPLHQ---GTKKVYPLHHEEGEEDVSHSSNDEDDDDTKEGGTGLYDEHKMSSTVEYMDRF 423

Query: 453 CDERGEH-EVQDDSCTSSDTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEH 512
               GE   V+DDSCTS        EL     +C +   +  G         +     + 
Sbjct: 424 SGNNGERLNVEDDSCTSF-------ELARNAVNCAEKDETVPGEQQK-----KLKDCNDP 483

Query: 513 CDAKVWDVG-YLTCSKSEVDFLPTSSMIEEVFGDDSSNYKARD 541
            D KVWD    L    +  D  PT  ++EE+FG+     KAR+
Sbjct: 484 IDTKVWDASRCLNVPTNGKDLQPTRRIMEEIFGNGCWENKARN 492

BLAST of HG10008547 vs. TAIR 10
Match: AT2G03470.1 (ELM2 domain-containing protein )

HSP 1 Score: 210.7 bits (535), Expect = 3.0e-54
Identity = 142/388 (36.60%), Positives = 202/388 (52.06%), Query Frame = 0

Query: 161 EEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKSTGDEYEKRLIGTC 220
           E++Y+ L++  P+K V +G +HQA +P +   E+   S         T ++ E +L+  C
Sbjct: 108 EDVYTCLMNQPPRKQVLVGSNHQADIPEFVKEEILDQSEA------RTKEDLEGKLMRKC 167

Query: 221 VIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLKSSIGPDRFVELGF 280
           VIPM D D    CG   G GR+ C C D GS+ CV  HI+EARE L  +IG +RF+ELG 
Sbjct: 168 VIPMSDSD---LCG--TGQGRKECLCLDKGSIRCVRRHIIEARESLVETIGYERFMELGL 227

Query: 281 SEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSKKEIVSYYFNVFML 340
            EMGE+VA  W+EEEE LF++VV+SNP S G++FW  L   F S++ KE+VSYYFNVF+L
Sbjct: 228 CEMGEEVASLWTEEEEDLFHKVVYSNPFSAGRDFWKQLKGTFPSRTMKELVSYYFNVFIL 287

Query: 341 RRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPLHDVGPGFSRSREV 400
           RRR  QNR  +L++DSD+DEWQ   +  ++   + EED++    S   D         E 
Sbjct: 288 RRRGIQNRFKALDVDSDDDEWQVEYNIFNSTKSLDEEDNNG-NRSSYED------NEEEE 347

Query: 401 ELQEYDEDIADGFDDNESGGIGNCFNNCGSSPMLQEKIPCDERGEH-EVQDDSCTS---S 460
           E    D+D  +  +D+ S    +C +         +K   D  GE   V+DDSC S    
Sbjct: 348 ETSSNDDDEEEEEEDDSSSNDAHCVDT--------DKASRDGFGEEVNVEDDSCMSFELQ 407

Query: 461 DTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWDVGYLTCSKSEV 520
           D+       P K   C +              G +  S  +             C     
Sbjct: 408 DSNLIFSHNPIKNRECHR-------------SGEDSYSFDDQRFTS-------DCWNKNN 447

Query: 521 DFLPTSSMIEEVFGDDSSNYKARDGKNL 545
           D LPTS++IEE+FG D  ++  +D  NL
Sbjct: 468 DLLPTSNIIEEIFGQD--DWGDKDDNNL 447

BLAST of HG10008547 vs. TAIR 10
Match: AT2G03470.2 (ELM2 domain-containing protein )

HSP 1 Score: 210.7 bits (535), Expect = 3.0e-54
Identity = 142/388 (36.60%), Positives = 202/388 (52.06%), Query Frame = 0

Query: 161 EEIYSLLLDHAPQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKSTGDEYEKRLIGTC 220
           E++Y+ L++  P+K V +G +HQA +P +   E+   S         T ++ E +L+  C
Sbjct: 107 EDVYTCLMNQPPRKQVLVGSNHQADIPEFVKEEILDQSEA------RTKEDLEGKLMRKC 166

Query: 221 VIPMPDMDSSISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLKSSIGPDRFVELGF 280
           VIPM D D    CG   G GR+ C C D GS+ CV  HI+EARE L  +IG +RF+ELG 
Sbjct: 167 VIPMSDSD---LCG--TGQGRKECLCLDKGSIRCVRRHIIEARESLVETIGYERFMELGL 226

Query: 281 SEMGEQVAQKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSKKEIVSYYFNVFML 340
            EMGE+VA  W+EEEE LF++VV+SNP S G++FW  L   F S++ KE+VSYYFNVF+L
Sbjct: 227 CEMGEEVASLWTEEEEDLFHKVVYSNPFSAGRDFWKQLKGTFPSRTMKELVSYYFNVFIL 286

Query: 341 RRRAEQNRCDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPLHDVGPGFSRSREV 400
           RRR  QNR  +L++DSD+DEWQ   +  ++   + EED++    S   D         E 
Sbjct: 287 RRRGIQNRFKALDVDSDDDEWQVEYNIFNSTKSLDEEDNNG-NRSSYED------NEEEE 346

Query: 401 ELQEYDEDIADGFDDNESGGIGNCFNNCGSSPMLQEKIPCDERGEH-EVQDDSCTS---S 460
           E    D+D  +  +D+ S    +C +         +K   D  GE   V+DDSC S    
Sbjct: 347 ETSSNDDDEEEEEEDDSSSNDAHCVDT--------DKASRDGFGEEVNVEDDSCMSFELQ 406

Query: 461 DTCPATQELPAKTEHCDQWLGSFTGPNNSVGLGHEPSSVQEHCDAKVWDVGYLTCSKSEV 520
           D+       P K   C +              G +  S  +             C     
Sbjct: 407 DSNLIFSHNPIKNRECHR-------------SGEDSYSFDDQRFTS-------DCWNKNN 446

Query: 521 DFLPTSSMIEEVFGDDSSNYKARDGKNL 545
           D LPTS++IEE+FG D  ++  +D  NL
Sbjct: 467 DLLPTSNIIEEIFGQD--DWGDKDDNNL 446

BLAST of HG10008547 vs. TAIR 10
Match: AT1G13880.1 (ELM2 domain-containing protein )

HSP 1 Score: 182.6 bits (462), Expect = 8.7e-46
Identity = 107/263 (40.68%), Positives = 157/263 (59.70%), Query Frame = 0

Query: 171 APQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKSTGDEY-EKRLIGTCVIPMPDMDS 230
           +P+K+V IG D+QA +P     E         +D    G  Y E+++ G CVIPMPD ++
Sbjct: 123 SPRKTVPIGSDYQADIPECVKEE--------ANDQSGQGVGYDEEQVTGKCVIPMPDCET 182

Query: 231 SISCGQEVGSGREACNCEDDGSVGCVSTHIVEAREQLKSSIGPDRFVELGFSEMGEQVAQ 290
            +    ++G GR+ C C D GS+ CV  HI+E RE L ++IG DR +++G  EMGE+VA 
Sbjct: 183 EVC---KIGKGRKECICLDKGSIRCVQQHIMENREDLFATIGYDRCLDIGLCEMGEEVAA 242

Query: 291 KWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSKKEIVSYYFNVFMLRRRAEQNRC 350
           + +E+EE LF+E+V+SNPVS+ ++FW  L   F S++ KEIVSYYFNVF+LRRRA QNR 
Sbjct: 243 RLTEDEEDLFHEIVYSNPVSMDRDFWKHLKSAFPSRTMKEIVSYYFNVFILRRRAIQNRS 302

Query: 351 DSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVVESPLHDVGPGFSRSREVELQEYDEDI 410
            SL++DSD+DEWQ   +Y +   G     DD   +S   D G   + + +  +   ++  
Sbjct: 303 KSLDVDSDDDEWQ--VEYDNTFYGAETPSDDKAEKSLSRDEGEEVNANEDSYMSFENQSN 362

Query: 411 A----DGFDDNESGGIGNCFNNC 429
           A        + E   IGN + +C
Sbjct: 363 AIYSRCPVRNREESNIGNYWRHC 372

BLAST of HG10008547 vs. TAIR 10
Match: AT4G11400.1 (ARID/BRIGHT DNA-binding domain;ELM2 domain protein )

HSP 1 Score: 110.9 bits (276), Expect = 3.2e-24
Identity = 68/215 (31.63%), Positives = 116/215 (53.95%), Query Frame = 0

Query: 172 PQKSVSIGPDHQAVVPPWRPREVEVMSYVSGSDLKSTGDEYEKRLIGTCVIPMPDMDS-S 231
           P++ + +G  HQA V  W                  +G + + + +GT + P  + ++  
Sbjct: 370 PRRCIKVGHQHQAQVDEW----------------TESGVDSDSKWLGTRIWPPENSEALD 429

Query: 232 ISCGQE-VGSGR-EACNCEDDGSVGCVSTHIVEAREQLKSSIGPDRFVELGFSEMGEQVA 291
            + G + VG GR ++C+CE  G V C   HI E R +LK  +G D F    F++MGE+V 
Sbjct: 430 QTLGNDLVGKGRPDSCSCELSGFVECTRLHIAEKRMELKRELGDD-FFHWRFNQMGEEVC 489

Query: 292 QKWSEEEERLFYEVVFSNPVSLGKNFWSDLSVVFASKSKKEIVSYYFNVFMLRRRAEQNR 351
            +W+EEEE+ F +++ ++P    ++FW++ +  F  K ++E+VSYYFNVF++ RR  QNR
Sbjct: 490 LRWTEEEEKRFKDMIIADP----QSFWTNAAKNFPKKKREELVSYYFNVFLINRRRYQNR 549

Query: 352 CDSLNIDSDNDEWQGTDDYGDNEPGMTEEDDDSVV 384
               +IDSD++   G+         +T    D ++
Sbjct: 550 VTPKSIDSDDEGAFGSVGGSFGRDAVTSSGSDVMI 563

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038879062.1  5.8e-286  95.19     uncharacterized protein LOC120071091 [Benincasa hispida] >XP_038879063.1 unchara...
XP_004138186.1  5.1e-274  90.96     uncharacterized protein LOC101205795 [Cucumis sativus] >KGN63712.1 hypothetical ...
XP_008453251.1  4.3e-273  91.54     PREDICTED: uncharacterized protein LOC103494027 [Cucumis melo] >KAA0057981.1 unc...
KAG6589536.1    2.6e-270  88.28     Plant intracellular Ras-group-related LRR protein 7, partial [Cucurbita argyrosp...
KAG7023221.1    1.7e-269  87.78     AT-rich interactive domain-containing protein 2, partial [Cucurbita argyrosperma...
Match Name  E-value  Identity  Description
Q9LDD4      4.5e-23  31.63     AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 ...
Q84JT7      1.6e-15  30.10     AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 ...
Match Name  E-value   Identity  Description
A0A0A0LSE5  2.5e-274  90.96     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G012130 PE=4 SV=1
A0A5A7UWW6  2.1e-273  91.54     SANT domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_s...
A0A1S3BV79  2.1e-273  91.54     uncharacterized protein LOC103494027 OS=Cucumis melo OX=3656 GN=LOC103494027 PE=...
A0A6J1JIR0  2.0e-268  89.21     uncharacterized protein LOC111484951 OS=Cucurbita maxima OX=3661 GN=LOC111484951...
A0A6J1E4L3  5.0e-267  88.63     uncharacterized protein LOC111429872 OS=Cucurbita moschata OX=3662 GN=LOC1114298...
Match Name   E-value  Identity  Description
AT1G26580.1  9.5e-61  33.84     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT2G03470.1  3.0e-54  36.60     ELM2 domain-containing protein
AT2G03470.2  3.0e-54  36.60     ELM2 domain-containing protein
AT1G13880.1  8.7e-46  40.68     ELM2 domain-containing protein
AT4G11400.1  3.2e-24  31.63     ARID/BRIGHT DNA-binding domain;ELM2 domain protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description   Source       Source Term     Source Description   Alignment
IPR001005  SANT/Myb domain   SMART        SM00717         sant                 coord: 287..340; e-value: 0.0087; score: 25.3
IPR001005  SANT/Myb domain   CDD          cd00167         SANT                 coord: 290..338; e-value: 2.48652E-6; score: 42.5626
IPR017930  Myb domain        PFAM         PF00249         Myb_DNA-binding      coord: 289..337; e-value: 7.5E-6; score: 26.1
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 359..383
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 84..140
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 354..394
None       No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 98..140
None       No IPR available  PANTHER      PTHR46872       DNA BINDING PROTEIN  coord: 30..538
None       No IPR available  PANTHER      PTHR46872:SF10  DNA BINDING PROTEIN  coord: 30..538

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10008547.1  HG10008547.1  mRNA