HG10022895 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022895
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: WD_REPEATS_REGION domain-containing protein
Location: Chr05: 29392631 .. 29396659 (-)
RNA-Seq Expression: HG10022895
Synteny: HG10022895
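
The Location field packs chromosome, coordinates, and strand into one string. As a minimal sketch, the snippet below splits such a string and derives the gene span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and `parse_location` is a hypothetical helper name rather than part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr05: 29392631 .. 29396659 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr05 - 4029
```

Under that assumption the gene spans 4029 bp on the minus strand of Chr05.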
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGAAGGAACTAGGAAATGCGGTTGCTGAAAGGATTCTTCCTCCGACTGAACAGGTGCTGTATTTGTACTTTTTGTATTTTCAGTAAGTTGTATATATTTGGTAATACGCTTCTTATGCAACCTTAGATCTATAGAACTTCATGAGGTATCAAACTTGGTTGGAATGTGAATTCTAAGTTGCTGAATGACAATCGCATTTACTAATCTGAAACTTGTCTGTCATTCACATATTGTTGTGCAGGAGGTATCAAATGAGATAGATGTGAAAGTTAAAAAATATCTGCGAGGAGAAGGTGCTAATCTAGAGGTAGTATGGATATGGCCTATGGGCTTATTGATGTACATTTTTCTAATTGTATTTTCTTTTTCCCTTTGTGTTTGATGCTAACTGTTCCATTCCAGGTTCTGAAAGATAAAAAGTTGAAGGGTCAACTTTCTGTTATAGAAGATTTATATGGAAAATCTGCTAAAGCTGCCGCGAAGGTCGAAAAGGTAAATTCCCATTCTCCAAATCTCTTAAATTGCATTTCTGCTTCCTAAGCTTCTAATAGTTTACTGTAAAGATGCACTAATCATTTCTCCCTCTTTTCCATATAAGAAACATGAAATTTATCAATAGTGTCAAAGAAAGAAACACAAAAGTAGGTAAATTTGTGTCTATCAAATGAGCTGAAAGATTTTATTTAATGATACAAGTTCAACTAGACTTTGCTTTCCCTCTTTTAATTTGAATTCTTGGCGTTGCAGACACATTTTTTTCCCTTGATTTTTTTAACCTTTAATTTTATCTCTAGTGGCTTATGCCAAGTGAGGGAGGCTATTTGGAGGCTGAAGGATTGGAGAAGACATGGAGAATCAAACAGGAAACGATTTCTCACGAAGTAGATATCTTAAGTAGAAGGAATCAACATGATATTATTTTACCAGGTTTCTCTTACAGTTGCTATGCAATTTTTTCATACTTTAGGATACCACTTCCCTTGAAATCATTGTTATGAAATTTTTATATCTGTTCCTTTTGTTTGCTTGTGTTTCTTCTTCTTATTGTTGGCCATTGACTTTGCCAACATGTTTGTGCTCCGTGACATTGTAAAATGTAAATTTCGGCTTAGGTTCTGTTGTAAGCAATGTGGTTTTGATCGTATGTAGAAATTCATTCAATTTTCTGGACTTGGTGAGATTTTTAACAACATTGAGATGTACTCTTTCTTGAGATTTATGTAACTGGACTTCTATGCCATCGGCTTATCTTTCTTTTTGTATTCCCTTTTTAAGGAAATGACTTCTGTAACATAACTTCAATATTTTCTTGTACATTTAGGCATTTTATGTGATGATAATGGCCCCAAAATACACATCTTGTCATTTTTGTACCTCTTTGGCTTGTGCTTGCCTTTCCTCTCTTTCTCTCTCTTTTCTGATTTCATTTACTATTTAGTCTAGACAATTATTTTCTTCTATGTATATAAAAAATTCTGATTCCATTGCAAAATTTTGTTCATAATCTGTCTTAAATTTCAGGATTAATTTTTATTATTCGATTCATATGTGCAGCTCTTGGACCATATTCTCTTGACTATACTTCAAACGGTAGATATATGGCCATTGCTGGACGTAAGGGTCACCTGGCACTTGTAGACACAAAGGACCTCAATTTAATTAAAGAGTTTCAGGTAGTGATGGTGATCCCTACAAGATCAACTTGATTTGGTATAAGTTTATTTCCTAGAAAAAGGAGATTTTCCTCAATGCTTGAATAGGAATTGAATGGAAAATTTCTTTAGACTGATATGGTATGCAAACATGGAATGTGACGTACAAGATACAGTACATGTCTAGGAGCATCGGGTATACTTCATAGGGAGGCCAAAAGTGATCATCCAAAAAGGATTTCTAATTAGCAAAATAAAATAACATGGAATGACTATAAATTCTTGGTTTTCGTGTTAATCACATATCAGATTTTCTCCAAGAAGTCATTATTTACAGTGTACTT
CAATGCTTGTTTGTACCAATTAGAGTTATTCTAGAGGATGTGTTACGTTGAGTTTATAGTCATTGCTCACAGTTAACTGTTTTCAGGTTATGGAAACTGTTCGTGACGTGGTCTTCTTGCACAATGAGCTGTTCTTTGCTGCTGCACAAAAAAAGTATGCTTTTTTTTTTCTCTCCCCTCTCTCTTGATTGTTGATTGATTTTGTAGGCAATAATTAACTCCTAATTTGTACATTATTGGATGTCTAAATGGAGACATTTCATTTATCATTAGAAATTTAGGATTCTGGATTTACTATAGAATCTATCTGATGATTGCCTCCAAACTTCTCCAAGTGAAAAAGAAAAAGGATTCATATATTAGGCCTTTAAATCAAATCTTCATGCTTATAATACAAAAGGGGTACCTTGCATTGCTTAGATTATGCAATTTCACACTACACACAGCTGAATGTTTTGTATAAAAAAATGATAATATAATTGAAATTTTGAAATTTAACAGGTATCCGTATATTTATAATCGGGAGGGCACAGAGCTTCATTGCCTTAAGGTTGGTATATCATACTATTTGAGTGAGATCTTCCTGCTATATTTCTCCTATTTCTCATTCTGGCAGCTGTTCACAGTTTTCCTTCCTTCGAGTGTAGTTTATATTGAATGTCATCTTGATGCTGTTTGCTTAAGCTTCAATAAATTGATGGCAAGTTGTTAAATTGCGGACAAGTTTGAGTTCACGTGTTTTTTAAGCTTCACTTAGATGATGTTTGTTTCACTTGAGCTTCTGTCGAATGATAGCGTGCTCAAAGTTCTGGATCAATGTGCATAGGGACCATATGCTTGAATCTTATTTGAACTCTTTTAGTGTATTATGATGACATTAGAAGCGTTTCAATTCTTCCAGCGTAAATGTTCGTACAGTTAGTGTTATCGTTTTGTTCATTCTCTGAAGTATCTATTATCATGTATAAATTGAACTGACCAAATTCTGAAAACGTATTAGGAGCATGGATCAGTCTTGAGGCTTCAATTTCTGAAAAATCACTTCCTTTTGGCATCCATAAACAAGTTTGGACAGCTTCACTATCAAGATGTAACAACTGGTGGCATGGTCGGATCCTTCCGCACTGGGTTAGGTCGTACTGATGTGATGCAGGTAAATCCATTCAATGGGGTTATAGCAACTGGTCATTCAGGTGGTTCAGTCGCTATGTGGAAGCCTACAAGCTCTGCTCCTCTTGTAAAAATGCTTTGTCATCAAGGGCCTGTGTCAGCGCTAGCATTCCACCCGAATGGCCATCTCATGGCTACATCTGGTTCCGAGAGGAAAATTAAGCTCTGGGACTTGAGGAAGTTTGAGGTCCTTCAGACTCTGCCTGGGCATGCTAAGACCTTAGATTTCAGTCAGAAAGGGCTACTTGCCTATGGAATTGGGTCGTCCATACAGATTCTGGGTGATTTATGTGGAGCTCAGAATTACACTAGGTATATGGCTCACTCAATGGTAAAAGGTTACCAAATAGGAAAAATTTTGTTTCGACCATATGAAGATGTTTTAGGCATAGGTCATTCGATGGGTTGGTCAAGTATCCTCATTCCAGGATCTGGCGAACCCAACTTTGATACGTGGGTAGCAAACCCATTTGAGACATCGAAACAACGGAGAGAAAAGGAGGTTCGGTCTCTTCTTGATAAGCTTCCTCCTGAGACAATTTCGCTCAATCCCTCAAAAATTGGTACCGTCATGGCCGTAAAGAAGAAGGAAAAGAAGACGAAGAAGGAAAGAGATGCTGAAGAGGAGGCTGCAATTGACGCTGCCAAGGGTGTTACCATGAAGAAGAAAACCAAGGGAAGGAATAAGCCAACCAAGAGAGAAAAGAAGAAACATGAAATTATTGAGAAGGCCAAAAGGCCTTTCCTTCAGGAACATATGAAGGAAGAAGAATTGTCTCGAAAGAGGTCAAGGTTGAGCGAGGAAGTTGAACTTCCCAAGTCTTTGCAGA
GGTTTGCTCGTAAGAAAACTGCGACGTGA

mRNA sequence

ATGGAGAAGGAACTAGGAAATGCGGTTGCTGAAAGGATTCTTCCTCCGACTGAACAGGAGGTATCAAATGAGATAGATGTGAAAGTTAAAAAATATCTGCGAGGAGAAGGTGCTAATCTAGAGGTTCTGAAAGATAAAAAGTTGAAGGGTCAACTTTCTGTTATAGAAGATTTATATGGAAAATCTGCTAAAGCTGCCGCGAAGGTCGAAAAGTGGCTTATGCCAAGTGAGGGAGGCTATTTGGAGGCTGAAGGATTGGAGAAGACATGGAGAATCAAACAGGAAACGATTTCTCACGAAGTAGATATCTTAAGTAGAAGGAATCAACATGATATTATTTTACCAGGATTAATTTTTATTATTCGATTCATATGTGCAGCTCTTGGACCATATTCTCTTGACTATACTTCAAACGGTAGATATATGGCCATTGCTGGACGTAAGGGTCACCTGGCACTTGTAGACACAAAGGACCTCAATTTAATTAAAGAGTTTCAGGTTATGGAAACTGTTCGTGACGTGGTCTTCTTGCACAATGAGCTGTTCTTTGCTGCTGCACAAAAAAAGTATCCGTATATTTATAATCGGGAGGGCACAGAGCTTCATTGCCTTAAGGAGCATGGATCAGTCTTGAGGCTTCAATTTCTGAAAAATCACTTCCTTTTGGCATCCATAAACAAGTTTGGACAGCTTCACTATCAAGATGTAACAACTGGTGGCATGGTCGGATCCTTCCGCACTGGGTTAGGTCGTACTGATGTGATGCAGGTAAATCCATTCAATGGGGTTATAGCAACTGGTCATTCAGGTGGTTCAGTCGCTATGTGGAAGCCTACAAGCTCTGCTCCTCTTGTAAAAATGCTTTGTCATCAAGGGCCTGTGTCAGCGCTAGCATTCCACCCGAATGGCCATCTCATGGCTACATCTGGTTCCGAGAGGAAAATTAAGCTCTGGGACTTGAGGAAGTTTGAGGTCCTTCAGACTCTGCCTGGGCATGCTAAGACCTTAGATTTCAGTCAGAAAGGGCTACTTGCCTATGGAATTGGGTCGTCCATACAGATTCTGGGTGATTTATGTGGAGCTCAGAATTACACTAGGTATATGGCTCACTCAATGGTAAAAGGTTACCAAATAGGAAAAATTTTGTTTCGACCATATGAAGATGTTTTAGGCATAGGTCATTCGATGGGTTGGTCAAGTATCCTCATTCCAGGATCTGGCGAACCCAACTTTGATACGTGGGTAGCAAACCCATTTGAGACATCGAAACAACGGAGAGAAAAGGAGGTTCGGTCTCTTCTTGATAAGCTTCCTCCTGAGACAATTTCGCTCAATCCCTCAAAAATTGGTACCGTCATGGCCGTAAAGAAGAAGGAAAAGAAGACGAAGAAGGAAAGAGATGCTGAAGAGGAGGCTGCAATTGACGCTGCCAAGGGTGTTACCATGAAGAAGAAAACCAAGGGAAGGAATAAGCCAACCAAGAGAGAAAAGAAGAAACATGAAATTATTGAGAAGGCCAAAAGGCCTTTCCTTCAGGAACATATGAAGGAAGAAGAATTGTCTCGAAAGAGGTCAAGGTTGAGCGAGGAAGTTGAACTTCCCAAGTCTTTGCAGAGGTTTGCTCGTAAGAAAACTGCGACGTGA

Coding sequence (CDS)

ATGGAGAAGGAACTAGGAAATGCGGTTGCTGAAAGGATTCTTCCTCCGACTGAACAGGAGGTATCAAATGAGATAGATGTGAAAGTTAAAAAATATCTGCGAGGAGAAGGTGCTAATCTAGAGGTTCTGAAAGATAAAAAGTTGAAGGGTCAACTTTCTGTTATAGAAGATTTATATGGAAAATCTGCTAAAGCTGCCGCGAAGGTCGAAAAGTGGCTTATGCCAAGTGAGGGAGGCTATTTGGAGGCTGAAGGATTGGAGAAGACATGGAGAATCAAACAGGAAACGATTTCTCACGAAGTAGATATCTTAAGTAGAAGGAATCAACATGATATTATTTTACCAGGATTAATTTTTATTATTCGATTCATATGTGCAGCTCTTGGACCATATTCTCTTGACTATACTTCAAACGGTAGATATATGGCCATTGCTGGACGTAAGGGTCACCTGGCACTTGTAGACACAAAGGACCTCAATTTAATTAAAGAGTTTCAGGTTATGGAAACTGTTCGTGACGTGGTCTTCTTGCACAATGAGCTGTTCTTTGCTGCTGCACAAAAAAAGTATCCGTATATTTATAATCGGGAGGGCACAGAGCTTCATTGCCTTAAGGAGCATGGATCAGTCTTGAGGCTTCAATTTCTGAAAAATCACTTCCTTTTGGCATCCATAAACAAGTTTGGACAGCTTCACTATCAAGATGTAACAACTGGTGGCATGGTCGGATCCTTCCGCACTGGGTTAGGTCGTACTGATGTGATGCAGGTAAATCCATTCAATGGGGTTATAGCAACTGGTCATTCAGGTGGTTCAGTCGCTATGTGGAAGCCTACAAGCTCTGCTCCTCTTGTAAAAATGCTTTGTCATCAAGGGCCTGTGTCAGCGCTAGCATTCCACCCGAATGGCCATCTCATGGCTACATCTGGTTCCGAGAGGAAAATTAAGCTCTGGGACTTGAGGAAGTTTGAGGTCCTTCAGACTCTGCCTGGGCATGCTAAGACCTTAGATTTCAGTCAGAAAGGGCTACTTGCCTATGGAATTGGGTCGTCCATACAGATTCTGGGTGATTTATGTGGAGCTCAGAATTACACTAGGTATATGGCTCACTCAATGGTAAAAGGTTACCAAATAGGAAAAATTTTGTTTCGACCATATGAAGATGTTTTAGGCATAGGTCATTCGATGGGTTGGTCAAGTATCCTCATTCCAGGATCTGGCGAACCCAACTTTGATACGTGGGTAGCAAACCCATTTGAGACATCGAAACAACGGAGAGAAAAGGAGGTTCGGTCTCTTCTTGATAAGCTTCCTCCTGAGACAATTTCGCTCAATCCCTCAAAAATTGGTACCGTCATGGCCGTAAAGAAGAAGGAAAAGAAGACGAAGAAGGAAAGAGATGCTGAAGAGGAGGCTGCAATTGACGCTGCCAAGGGTGTTACCATGAAGAAGAAAACCAAGGGAAGGAATAAGCCAACCAAGAGAGAAAAGAAGAAACATGAAATTATTGAGAAGGCCAAAAGGCCTTTCCTTCAGGAACATATGAAGGAAGAAGAATTGTCTCGAAAGAGGTCAAGGTTGAGCGAGGAAGTTGAACTTCCCAAGTCTTTGCAGAGGTTTGCTCGTAAGAAAACTGCGACGTGA

Protein sequence

MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFIIRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNELFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGGMVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCGAQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFETSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGVTMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPKSLQRFARKKTAT
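
The protein sequence above corresponds to the CDS read in frame under the standard genetic code. The sketch below illustrates this on the first and last codons of the CDS only; `translate` and `CODON_TABLE` are illustrative names, and the table is the standard nuclear code, assumed here because the page does not name a translation table.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nt and last 21 nt of the CDS shown above.
print(translate("ATGGAGAAGGAACTAGGAAATGCG"))  # MEKELGNA
print(translate("CGTAAGAAAACTGCGACGTGA"))     # RKKTAT
```

Both fragments agree with the start (MEKELGNA) and end (RKKTAT) of the listed protein sequence.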
Homology
BLAST of HG10022895 vs. NCBI nr
Match: XP_038886607.1 (probable U3 small nucleolar RNA-associated protein 7 [Benincasa hispida])

HSP 1 Score: 1008.4 bits (2606), Expect = 2.3e-290
Identity = 518/547 (94.70%), Positives = 524/547 (95.80%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           MEKELGNAVAERILPPTEQEVSNEIDVKV+KYLRGEGANLEVLKDKKLKGQLSVIEDLYG
Sbjct: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVQKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILS+RNQHDIILP     
Sbjct: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSKRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYSLDYTSNGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSLDYTSNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNR GTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNRVGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH
Sbjct: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNG+LMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYG GS +QILGDL G
Sbjct: 301 PNGYLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQILGDLSG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
            QNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE
Sbjct: 361 TQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTK ERDAEEEAAIDAAKG+
Sbjct: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKMERDAEEEAAIDAAKGI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPKSLQRF 540
           TMKKKTKGRNKPTKREKKKHEIIEKAKRPFL E +KEEELSRKRSRLSEEVELPKSLQRF
Sbjct: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLHEQIKEEELSRKRSRLSEEVELPKSLQRF 536

Query: 541 ARKKTAT 548
           ARKKT T
Sbjct: 541 ARKKTVT 536
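
The percentages on each HSP summary line are the match counts divided by the alignment length. A rough sketch of how such a line can be parsed and the percentages recomputed is given below; `parse_hsp_stats` is a hypothetical helper, and the regular expression only assumes the `label = matches/length` layout seen in this report.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, percent)} for each 'x = a/b' field."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats

# Summary line of the first hit (XP_038886607.1, Benincasa hispida).
line = "Identity = 518/547 (94.70%), Positives = 524/547 (95.80%), Query Frame = 0"
for label, (matches, length, pct) in parse_hsp_stats(line).items():
    print(f"{label}: {matches}/{length} = {pct:.2f}%")
```

The alignment length (547) is one shorter than the 548-residue query because this HSP does not cover every query position; the denominator is the number of aligned columns, not the protein length.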

BLAST of HG10022895 vs. NCBI nr
Match: XP_008456565.1 (PREDICTED: probable U3 small nucleolar RNA-associated protein 7 [Cucumis melo] >KAA0057125.1 putative U3 small nucleolar RNA-associated protein 7 [Cucumis melo var. makuwa] >TYJ97106.1 putative U3 small nucleolar RNA-associated protein 7 [Cucumis melo var. makuwa])

HSP 1 Score: 1002.3 bits (2590), Expect = 1.6e-288
Identity = 509/547 (93.05%), Positives = 522/547 (95.43%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           MEKELGN VAERILPPTEQE+SNEIDVKVKKY+RGEGANLEVLKDKKLKGQLSVIEDLYG
Sbjct: 1   MEKELGNVVAERILPPTEQEISNEIDVKVKKYMRGEGANLEVLKDKKLKGQLSVIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAAKVEKWLMPSEGGYLE EGLEKTWRIKQETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAKVEKWLMPSEGGYLETEGLEKTWRIKQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYS+DYTSNGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSIDYTSNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLL SINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLVSINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCH GPVSALAFH
Sbjct: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHPGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG+ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYG GS +Q+LGDL G
Sbjct: 301 PNGHLMATSGAERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDLSG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           AQ+YTRYMAHSM KGYQIGK+LFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE
Sbjct: 361 AQSYTRYMAHSMAKGYQIGKVLFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETISLNPSKIGT++AVKKKEKKTKKERDAEEEAA+DAAKG+
Sbjct: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTLVAVKKKEKKTKKERDAEEEAAVDAAKGI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPKSLQRF 540
           TMKKKTKGRNKPTKREKKKHEIIEKAKRPFL E +KEEELSRKRSRLSEEVELPKSLQRF
Sbjct: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLHEQIKEEELSRKRSRLSEEVELPKSLQRF 536

Query: 541 ARKKTAT 548
           A KKTAT
Sbjct: 541 AHKKTAT 536

BLAST of HG10022895 vs. NCBI nr
Match: XP_004138833.1 (probable U3 small nucleolar RNA-associated protein 7 [Cucumis sativus] >XP_031737225.1 probable U3 small nucleolar RNA-associated protein 7 [Cucumis sativus] >KGN63037.1 hypothetical protein Csa_022263 [Cucumis sativus])

HSP 1 Score: 996.1 bits (2574), Expect = 1.2e-286
Identity = 507/547 (92.69%), Positives = 518/547 (94.70%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           MEKELGN V ERILPPTEQEVSNEIDVKVKKY+RGEGANLEVLKDKKLKGQLS IEDLYG
Sbjct: 1   MEKELGNVVTERILPPTEQEVSNEIDVKVKKYMRGEGANLEVLKDKKLKGQLSAIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAA+VEKWLMPSEGGYLE EGLEKTWRIKQETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAEVEKWLMPSEGGYLETEGLEKTWRIKQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYS+DYTSNGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSIDYTSNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNREGTELHCLKEHGSV RLQFLKNHFLL SINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVRRLQFLKNHFLLVSINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCH GPVSALAFH
Sbjct: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHPGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG+ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYG GS +QILGD  G
Sbjct: 301 PNGHLMATSGAERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQILGDFSG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           AQNY RYMAHSM KGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE
Sbjct: 361 AQNYNRYMAHSMAKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETISLNP+KIGT+MAVKKKEKKTKKERDAEEEAA+DAAKG+
Sbjct: 421 TSKQRREKEVRSLLDKLPPETISLNPTKIGTLMAVKKKEKKTKKERDAEEEAAVDAAKGI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPKSLQRF 540
           TMKKKTKGRNKPTKREKKKHEIIEKAKRPFL E +KEEELSRK+SRLSEEVELPKSLQRF
Sbjct: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLHEQIKEEELSRKKSRLSEEVELPKSLQRF 536

Query: 541 ARKKTAT 548
           ARKKTAT
Sbjct: 541 ARKKTAT 536

BLAST of HG10022895 vs. NCBI nr
Match: KAG6578785.1 (hypothetical protein SDJN03_23233, partial [Cucurbita argyrosperma subsp. sororia] >KAG7016316.1 utp7 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 972.6 bits (2513), Expect = 1.4e-279
Identity = 499/548 (91.06%), Positives = 515/548 (93.98%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           ME+ELG  VAERILPPTEQEV NE DVK+KKYLRGEGANLEVLKDKKLKGQLSVIEDLYG
Sbjct: 1   MEEELGKPVAERILPPTEQEVLNEEDVKIKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRI+QETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIRQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYSLDYT NGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSLDYTLNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNR+GTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNRDGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           M G FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSS+PLVKMLCHQGPVSALAFH
Sbjct: 241 MAGVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSSPLVKMLCHQGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLA G GS +QILGDL G
Sbjct: 301 PNGHLMATSGCERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLACGTGSHVQILGDLAG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           +QNYTRYM+HSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE
Sbjct: 361 SQNYTRYMSHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETI+LNPSKIGTVMAVKKKEKKTKKER+AEEE+AIDAAK +
Sbjct: 421 TSKQRREKEVRSLLDKLPPETIALNPSKIGTVMAVKKKEKKTKKEREAEEESAIDAAKNI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMK-EEELSRKRSRLSEEVELPKSLQR 540
           TMKKKTKGRNKPTKREKKK EII+K+K+PFLQE +K EEELSRKR R SEEVELPKSLQR
Sbjct: 481 TMKKKTKGRNKPTKREKKKREIIDKSKKPFLQEQIKEEEELSRKRPRSSEEVELPKSLQR 537

Query: 541 FARKKTAT 548
           FARKKTAT
Sbjct: 541 FARKKTAT 537

BLAST of HG10022895 vs. NCBI nr
Match: XP_022939750.1 (probable U3 small nucleolar RNA-associated protein 7 [Cucurbita moschata])

HSP 1 Score: 971.8 bits (2511), Expect = 2.4e-279
Identity = 498/548 (90.88%), Positives = 515/548 (93.98%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           ME+ELG  VAERILPPTEQEV NE DVK+KKYLRGEGANLEVLKDKKLKGQLSVIEDLYG
Sbjct: 1   MEEELGKPVAERILPPTEQEVLNEEDVKIKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRI+QETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIRQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYSLDYT NGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSLDYTLNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNR+GTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNRDGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           M G FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSS+PLVKMLCHQGPVSALAFH
Sbjct: 241 MAGVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSSPLVKMLCHQGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLA G GS +QILGDL G
Sbjct: 301 PNGHLMATSGCERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLACGTGSYVQILGDLAG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           +QNYTRYM+HSMVKGYQIGKILFRPYEDVLGIGHSMGWSSIL+PGSGEPNFDTWVANPFE
Sbjct: 361 SQNYTRYMSHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILVPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETI+LNPSKIGTVMAVKKKEKKTKKER+AEEE+AIDAAK +
Sbjct: 421 TSKQRREKEVRSLLDKLPPETIALNPSKIGTVMAVKKKEKKTKKEREAEEESAIDAAKNI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMK-EEELSRKRSRLSEEVELPKSLQR 540
           TMKKKTKGRNKPTKREKKK EII+K+K+PFLQE +K EEELSRKR R SEEVELPKSLQR
Sbjct: 481 TMKKKTKGRNKPTKREKKKREIIDKSKKPFLQEQIKEEEELSRKRPRSSEEVELPKSLQR 537

Query: 541 FARKKTAT 548
           FARKKTAT
Sbjct: 541 FARKKTAT 537

BLAST of HG10022895 vs. ExPASy Swiss-Prot
Match: Q9P4X3 (Probable U3 small nucleolar RNA-associated protein 7 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=utp7 PE=3 SV=1)

HSP 1 Score: 341.7 bits (875), Expect = 1.6e-92
Identity = 207/530 (39.06%), Positives = 292/530 (55.09%), Query Frame = 0

Query: 24  EIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYGKSAKAAAKVEKWLMPSEGGYLEA 83
           +++   KKY RG   N + +KDKKL+  +  IE+       + AK E  L     G LEA
Sbjct: 10  DVNASTKKYSRGRKLNPKKIKDKKLRSNIQKIEERIENVESSLAKTE-ILHEDNPGLLEA 69

Query: 84  EGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFIIRFICAALGPYSLDYTSNGRYMA 143
           EGLE+T++ +Q+ ++  V + +      + L              G YS DYT +GR + 
Sbjct: 70  EGLERTYKFRQDQLAPNVALETATKSFSLDLD-----------KFGGYSFDYTRDGRMIL 129

Query: 144 IAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNELFFAAAQKKYPYIYNREGTELHC 203
           + GRKGH++  D +   L+ E  + ETVRDV + HN  +FA AQKKY Y+Y+  GTE+HC
Sbjct: 130 LGGRKGHISAFDWRTGKLLTELHLRETVRDVKWFHNHQYFAVAQKKYVYVYDNMGTEIHC 189

Query: 204 LKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGGMVGSFRTGLGRTDVMQVNPFNGV 263
           LK H  V  L FL  H LL SI   G L YQDV+TG +V   RTG+G + V+  NP N V
Sbjct: 190 LKRHIEVNALDFLPYHLLLTSIGNAGYLKYQDVSTGQLVAEHRTGMGASHVLHQNPHNAV 249

Query: 264 IATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLRKF 323
              GH+ G V +W P+S+ PLVKML H+GPV  LA + +G  M T+G++  +K+WDLR +
Sbjct: 250 EHVGHANGQVTLWSPSSTTPLVKMLTHRGPVRDLAVNRDGRYMVTAGADSLLKVWDLRTY 309

Query: 324 EVLQT--LPGHAKTLDFSQKGLLAYGIGSSIQILGDLCGAQNYTRYMAHSMVKGYQIGKI 383
           + L +   P  A+ L  S +GLLA G G    I  D    +    YM H ++    +  +
Sbjct: 310 KELHSYYTPTPAQRLTLSDRGLLAVGWGPHATIWKDALRTKQNFPYMNH-LLPSSSVVDL 369

Query: 384 LFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFETSKQRREKEVRSLLDKLPPET 443
            + PYED+LGIGH+ G+ SI++PGSGEPN+D++  +PF + KQR+E EVR LL+KL PE 
Sbjct: 370 HYCPYEDILGIGHAKGFESIIVPGSGEPNYDSYENDPFASRKQRQETEVRQLLEKLRPEM 429

Query: 444 ISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGVTMKKKTKGRNKPTKREKKKHE 503
           ISLN   IG V      ++     R AE E      +    K K +G+N   +R  +KH 
Sbjct: 430 ISLNADFIGNV------DRAAPSLRKAEAEEEKPPEEKWVPKAKARGKNSALRRYLRKHA 489

Query: 504 IIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPK-------SLQRFARKK 545
                +R    E   E E   +  R+  E +LP+       +L RF  KK
Sbjct: 490 RNVVDQRRLKVEKSLEIEKKMRAQRVRREQKLPEEREKWGYALSRFVSKK 520

BLAST of HG10022895 vs. ExPASy Swiss-Prot
Match: P40055 (U3 small nucleolar RNA-associated protein 7 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=UTP7 PE=1 SV=1)

HSP 1 Score: 310.1 bits (793), Expect = 5.1e-83
Identity = 192/532 (36.09%), Positives = 292/532 (54.89%), Query Frame = 0

Query: 19  QEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYGKSAKAAAKVEKWLMPSEG 78
           +E  N+   +   Y      N    KDKKL+  L  I++ Y K+  +AA  + +L+P   
Sbjct: 13  KERENQNKFERSTYTNNAKNNHTQTKDKKLRAGLKKIDEQYKKAVSSAAATD-YLLPESN 72

Query: 79  GYLEAEG-LEKTWRIKQETISHEVDILSRRNQHDIILPGLIFIIRFICAALGPYSLDYTS 138
           GYLE E  LEKT++++Q  I   VD+ +     D+ L              GPY + Y  
Sbjct: 73  GYLEPENELEKTFKVQQSEIKSSVDVSTANKALDLSL-----------KEFGPYHIKYAK 132

Query: 139 NGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNELFFAAAQKKYPYIYNRE 198
           NG ++ I GRKGH+A +D +   L  E  + ET     +L NE +FA AQKKY +IY+ E
Sbjct: 133 NGTHLLITGRKGHVASMDWRKGQLRAELFLNETCHSATYLQNEQYFAVAQKKYTFIYDHE 192

Query: 199 GTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGGMVGSFRTGLGRTDVMQV 258
           GTELH LK+H     L FL  H+LL +  + G L Y DV+TG +V   RT  G T  M  
Sbjct: 193 GTELHRLKQHIEARHLDFLPYHYLLVTAGETGWLKYHDVSTGQLVSELRTKAGPTMAMAQ 252

Query: 259 NPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKL 318
           NP+N V+  GHS G+V++W P+   PLVK+L  +GPV+++A   +G+ MAT+G++R +K+
Sbjct: 253 NPWNAVMHLGHSNGTVSLWSPSMPEPLVKLLSARGPVNSIAIDRSGYYMATTGADRSMKI 312

Query: 319 WDLRKFEVL---QTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLC--------------- 378
           WD+R F+ L   ++LP     +  S  GLLA   G  + +  D                 
Sbjct: 313 WDIRNFKQLHSVESLPTPGTNVSISDTGLLALSRGPHVTLWKDALKLSGDSKPCFGSMGG 372

Query: 379 GAQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPF 438
                T YM+H +  G ++  + F P+ED+LG+GH  G +++++PG+GE N+D    NPF
Sbjct: 373 NPHRNTPYMSH-LFAGNKVENLGFVPFEDLLGVGHQTGITNLIVPGAGEANYDALELNPF 432

Query: 439 ETSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKG 498
           ET KQR+E+EVR+LL+KLP +TI+L+P+ IG+V       +   K+       A + AK 
Sbjct: 433 ETKKQRQEQEVRTLLNKLPADTITLDPNSIGSVDKRSSTIRLNAKDLAQTTMDANNKAKT 492

Query: 499 VT----MKKKTKGRNKPTKR--EKKKHEIIEKAKRPFLQEHMKEEELSRKRS 526
            +    +K   KG+N   +    KK   +I++ K   +Q+ + +E+  RKR+
Sbjct: 493 NSDIPDVKPDVKGKNSGLRSFLRKKTQNVIDERKLR-VQKQLDKEKNIRKRN 530

BLAST of HG10022895 vs. ExPASy Swiss-Prot
Match: Q9Z0H1 (WD repeat-containing protein 46 OS=Mus musculus OX=10090 GN=Wdr46 PE=2 SV=1)

HSP 1 Score: 302.4 bits (773), Expect = 1.1e-80
Identity = 194/522 (37.16%), Positives = 291/522 (55.75%), Query Frame = 0

Query: 28  KVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLE 87
           + +K+ R + +        K + +L   E    +++  AA+ E  L+  E G+L  E  E
Sbjct: 107 EARKFCRIDKSKTLPHSKPKTQSKLEKAEAQEEEASVRAARAE-LLLAEEPGFLVGEDGE 166

Query: 88  KTWRIKQETISHEVDILSRRNQHDIILPGLIFIIRFICAALGPYSLDYTSNGRYMAIAGR 147
            T +I Q  I   VDI S     D+ L              GPY L+Y+  GR++A+ GR
Sbjct: 167 DTAKILQTDIVEAVDIASAAKHFDLNL-----------RQFGPYRLNYSRTGRHLALGGR 226

Query: 148 KGHLALVDTKDLNLIKEFQVMETVRDVVFLHNELFFAAAQKKYPYIYNREGTELHCLKEH 207
           +GH+A +D     L+ E  VME VRD+ FLH+E   A AQ ++ YIY+ +G ELHC++  
Sbjct: 227 RGHVAALDWVTKKLMCEINVMEAVRDIHFLHSEALLAVAQNRWLYIYDNQGIELHCIRRC 286

Query: 208 GSVLRLQFLKNHFLLASINKFGQLHYQDVTTGGMVGSFRTGLGRTDVMQVNPFNGVIATG 267
             V RL+FL  HFLLA+ ++ G L Y DV+ G +V +     GR  VM  NP+N VI  G
Sbjct: 287 DRVTRLEFLPFHFLLATTSETGFLTYLDVSVGKIVTALNVRAGRLSVMAQNPYNAVIHLG 346

Query: 268 HSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLR-KFEVL 327
           HS G+V++W P    PL K+LCH+G V A+A    G  MATSG + ++K++DLR  F+ L
Sbjct: 347 HSNGTVSLWSPAVKEPLAKILCHRGGVRAVAVDSTGTYMATSGLDHQLKIFDLRGTFQPL 406

Query: 328 --QTLPGHAKTLDFSQKGLLAYGIGSSIQI---LGDLCGAQNYTRYMAHSMVKGYQIGKI 387
             +TLP  A  L FSQ+GLL  G+G  + I    G          Y+ H +  G+  G +
Sbjct: 407 SSRTLPQGAGHLAFSQRGLLVAGMGDVVNIWAGQGKASPPSLEQPYLTHRL-SGHVHG-L 466

Query: 388 LFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFETSKQRREKEVRSLLDKLPPET 447
            F P+EDVLG+GHS G++S+L+PG+ EPNFD    NP+ + KQR+E EV++LL+K+P E 
Sbjct: 467 QFCPFEDVLGVGHSGGFTSMLVPGAAEPNFDGLENNPYRSRKQRQEWEVKALLEKVPAEL 526

Query: 448 ISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGVTMKKKTKGRNKPTKREKKKHE 507
           I LNP  +  V  V  +++  KKER        DA      K K KGR+      K+K +
Sbjct: 527 ICLNPRALAEVDVVTLEQQ--KKERIERLGYDPDAKAAFQPKAKQKGRSSTASLVKRKKK 586

Query: 508 IIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPKSLQRFARK 544
           ++++  R  +++ +++++  +K+         P +L RF R+
Sbjct: 587 VMDQEHRDKVRQSLEQQQQKKKQDMAMPPGARPSALDRFVRR 612

BLAST of HG10022895 vs. ExPASy Swiss-Prot
Match: Q5TJE7 (WD repeat-containing protein 46 OS=Canis lupus familiaris OX=9615 GN=WDR46 PE=3 SV=1)

HSP 1 Score: 296.6 bits (758), Expect = 5.8e-79
Identity = 194/506 (38.34%), Positives = 286/506 (56.52%), Query Frame = 0

Query: 47  KLKGQLSVIEDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSR 106
           K + +L V E    + +  AA+ E  L+  E G+LE E  E T +I+Q  I   VDI S 
Sbjct: 128 KTRSRLEVAEAEEEEISIKAARSE-LLLAEEPGFLEGEDGEDTAKIRQAEIVEAVDIASA 187

Query: 107 RNQHDIILPGLIFIIRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQ 166
               D+ L              GPY L+Y+  GR++A  G +GH+A +D     L+ E  
Sbjct: 188 AKHFDLNL-----------RQFGPYRLNYSPVGRHLAFGGHRGHVATLDWVTKRLMCEIN 247

Query: 167 VMETVRDVVFLHNELFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASIN 226
           VME VRD+ FLH+E  FA AQ ++ +IY+ +G ELHC++    V RL+FL  HFLLA+ +
Sbjct: 248 VMEAVRDIRFLHSEALFAVAQNRWLHIYDNQGIELHCIRRCDRVTRLEFLPFHFLLATAS 307

Query: 227 KFGQLHYQDVTTGGMVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVK 286
           + G L Y DV+ G +V +     GR DVM  NP+N VI  GHS G+V++W P    PL K
Sbjct: 308 ETGFLTYLDVSVGKIVAALNARAGRLDVMTKNPYNAVIHLGHSNGTVSLWSPAMKEPLAK 367

Query: 287 MLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLR-KFEVL--QTLPGHAKTLDFSQKGL 346
           +LCH+G V A+A    G  MATSG + ++K++DLR  F+ L  +TLP  A  L FSQ+GL
Sbjct: 368 ILCHRGGVRAVAVDSTGTYMATSGLDHQLKIFDLRGMFQPLSARTLPQGAGHLAFSQRGL 427

Query: 347 LAYGIGSSIQI---LGDLCGAQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSS 406
           LA G+   + I    G          Y+ H +  G+  G + F P+EDVLG+GHS G +S
Sbjct: 428 LAAGMSDVVNIWMGQGMASPPSLEQPYLTHRL-SGHVHG-LHFCPFEDVLGLGHSGGITS 487

Query: 407 ILIPGSGEPNFDTWVANPFETSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVK-KKE 466
           +L+PG+ EPNFD    NP+ + KQR+E EV++LL+K+P E I L+P  +  V  +  ++E
Sbjct: 488 MLVPGAAEPNFDGLENNPYRSQKQRQEWEVKALLEKVPAELICLDPRALAEVDVISLEQE 547

Query: 467 KKTKKER---DAEEEAAIDAAKGVTMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMK 526
           KK + ER   D E +A          K K KGR+      K++ ++++K  R  +++ + 
Sbjct: 548 KKERIERLGYDPEAKAPFQP------KPKQKGRSSTASLVKRRRKVMDKEHRDKVRQSL- 607

Query: 527 EEELSRKRSRLSEEVELPKSLQRFAR 543
           E++  ++  +       P +L RF R
Sbjct: 608 EQQPQKQEKKAKPLKARPSALDRFVR 612

BLAST of HG10022895 vs. ExPASy Swiss-Prot
Match: O15213 (WD repeat-containing protein 46 OS=Homo sapiens OX=9606 GN=WDR46 PE=1 SV=3)

HSP 1 Score: 295.8 bits (756), Expect = 9.9e-79
Identity = 200/556 (35.97%), Positives = 300/556 (53.96%), Query Frame = 0

Query: 3   KELGNAVAERILPPTEQEVSNEIDVK---VKKYLRGEGANLEVLKDKKLKGQLSVIEDLY 62
           +E  N  ++R L  T+        V    V+K+ R + +        K + +L V E   
Sbjct: 80  REWKNPESQRGLSGTQDPFPGPAPVPVEVVQKFCRIDKSRKLPHSKAKTRSRLEVAEAEE 139

Query: 63  GKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIF 122
            +++  AA+ E  L+  E G+LE E  E T +I Q  I   VDI S     D+ L     
Sbjct: 140 EETSIKAARSE-LLLAEEPGFLEGEDGEDTAKICQADIVEAVDIASAAKHFDLNL----- 199

Query: 123 IIRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHN 182
                    GPY L+Y+  GR++A  GR+GH+A +D     L+ E  VME VRD+ FLH+
Sbjct: 200 ------RQFGPYRLNYSRTGRHLAFGGRRGHVAALDWVTKKLMCEINVMEAVRDIRFLHS 259

Query: 183 ELFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTG 242
           E   A AQ ++ +IY+ +G ELHC++    V RL+FL  HFLLA+ ++ G L Y DV+ G
Sbjct: 260 EALLAVAQNRWLHIYDNQGIELHCIRRCDRVTRLEFLPFHFLLATASETGFLTYLDVSVG 319

Query: 243 GMVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAF 302
            +V +     GR DVM  NP+N VI  GHS G+V++W P    PL K+LCH+G V A+A 
Sbjct: 320 KIVAALNARAGRLDVMSQNPYNAVIHLGHSNGTVSLWSPAMKEPLAKILCHRGGVRAVAV 379

Query: 303 HPNGHLMATSGSERKIKLWDLR-KFEVL--QTLPGHAKTLDFSQKGLLAYGIGSSIQILG 362
              G  MATSG + ++K++DLR  ++ L  +TLP  A  L FSQ+GLL  G+G  + I  
Sbjct: 380 DSTGTYMATSGLDHQLKIFDLRGTYQPLSTRTLPHGAGHLAFSQRGLLVAGMGDVVNIWA 439

Query: 363 DLCGA------QNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPN 422
               A      Q Y  +     V G Q     F P+EDVLG+GH+ G +S+L+PG+GEPN
Sbjct: 440 GQGKASPPSLEQPYLTHRLSGPVHGLQ-----FCPFEDVLGVGHTGGITSMLVPGAGEPN 499

Query: 423 FDTWVANPFETSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVK----KKEKKTKKER 482
           FD   +NP+ + KQR+E EV++LL+K+P E I L+P  +  V  +     KKE+  +   
Sbjct: 500 FDGLESNPYRSRKQRQEWEVKALLEKVPAELICLDPRALAEVDVISLEQGKKEQIERLGY 559

Query: 483 DAEEEAAIDAAKGVTMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSR 542
           D + +A          K K KGR+      K+K +++++  R  +++ ++++    K ++
Sbjct: 560 DPQAKAPFQP------KPKQKGRSSTASLVKRKRKVMDEEHRDKVRQSLQQQH--HKEAK 610

BLAST of HG10022895 vs. ExPASy TrEMBL
Match: A0A5D3BDN6 (Putative U3 small nucleolar RNA-associated protein 7 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold371G00120 PE=4 SV=1)

HSP 1 Score: 1002.3 bits (2590), Expect = 7.9e-289
Identity = 509/547 (93.05%), Positives = 522/547 (95.43%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           MEKELGN VAERILPPTEQE+SNEIDVKVKKY+RGEGANLEVLKDKKLKGQLSVIEDLYG
Sbjct: 1   MEKELGNVVAERILPPTEQEISNEIDVKVKKYMRGEGANLEVLKDKKLKGQLSVIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAAKVEKWLMPSEGGYLE EGLEKTWRIKQETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAKVEKWLMPSEGGYLETEGLEKTWRIKQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYS+DYTSNGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSIDYTSNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLL SINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLVSINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCH GPVSALAFH
Sbjct: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHPGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG+ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYG GS +Q+LGDL G
Sbjct: 301 PNGHLMATSGAERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDLSG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           AQ+YTRYMAHSM KGYQIGK+LFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE
Sbjct: 361 AQSYTRYMAHSMAKGYQIGKVLFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETISLNPSKIGT++AVKKKEKKTKKERDAEEEAA+DAAKG+
Sbjct: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTLVAVKKKEKKTKKERDAEEEAAVDAAKGI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPKSLQRF 540
           TMKKKTKGRNKPTKREKKKHEIIEKAKRPFL E +KEEELSRKRSRLSEEVELPKSLQRF
Sbjct: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLHEQIKEEELSRKRSRLSEEVELPKSLQRF 536

Query: 541 ARKKTAT 548
           A KKTAT
Sbjct: 541 AHKKTAT 536
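
The Identity and Positives counts reported for each HSP can be read off the middle line of the alignment: a letter marks an identical residue, a "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch of that tally, assuming the usual BLAST pairwise midline convention; the function name `count_midline` is hypothetical and not part of any BLAST tool.

```python
from typing import Tuple


def count_midline(midline: str) -> Tuple[int, int]:
    """Return (identities, positives) for one or more joined midline strings.

    Assumes the standard BLAST pairwise convention: letters are identical
    residues, '+' is a positive-scoring substitution, spaces are neither.
    """
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include the identities plus the '+' positions.
    positives = identities + midline.count("+")
    return identities, positives


# Fragments taken from the alignment above.
print(count_midline("A KKTAT"))      # last block of the alignment
print(count_midline("KIGT++AVKK"))   # fragment containing two '+' positions
```

Summing these counts over every block of an HSP should give the numerators of the Identity and Positives fractions, provided the midline whitespace has been preserved exactly.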

BLAST of HG10022895 vs. ExPASy TrEMBL
Match: A0A1S3C483 (probable U3 small nucleolar RNA-associated protein 7 OS=Cucumis melo OX=3656 GN=LOC103496480 PE=4 SV=1)

HSP 1 Score: 1002.3 bits (2590), Expect = 7.9e-289
Identity = 509/547 (93.05%), Positives = 522/547 (95.43%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           MEKELGN VAERILPPTEQE+SNEIDVKVKKY+RGEGANLEVLKDKKLKGQLSVIEDLYG
Sbjct: 1   MEKELGNVVAERILPPTEQEISNEIDVKVKKYMRGEGANLEVLKDKKLKGQLSVIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAAKVEKWLMPSEGGYLE EGLEKTWRIKQETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAKVEKWLMPSEGGYLETEGLEKTWRIKQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYS+DYTSNGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSIDYTSNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLL SINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLVSINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCH GPVSALAFH
Sbjct: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHPGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG+ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYG GS +Q+LGDL G
Sbjct: 301 PNGHLMATSGAERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDLSG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           AQ+YTRYMAHSM KGYQIGK+LFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE
Sbjct: 361 AQSYTRYMAHSMAKGYQIGKVLFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETISLNPSKIGT++AVKKKEKKTKKERDAEEEAA+DAAKG+
Sbjct: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTLVAVKKKEKKTKKERDAEEEAAVDAAKGI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPKSLQRF 540
           TMKKKTKGRNKPTKREKKKHEIIEKAKRPFL E +KEEELSRKRSRLSEEVELPKSLQRF
Sbjct: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLHEQIKEEELSRKRSRLSEEVELPKSLQRF 536

Query: 541 ARKKTAT 548
           A KKTAT
Sbjct: 541 AHKKTAT 536

BLAST of HG10022895 vs. ExPASy TrEMBL
Match: A0A0A0LQI6 (WD_REPEATS_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G385060 PE=4 SV=1)

HSP 1 Score: 996.1 bits (2574), Expect = 5.7e-287
Identity = 507/547 (92.69%), Positives = 518/547 (94.70%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           MEKELGN V ERILPPTEQEVSNEIDVKVKKY+RGEGANLEVLKDKKLKGQLS IEDLYG
Sbjct: 1   MEKELGNVVTERILPPTEQEVSNEIDVKVKKYMRGEGANLEVLKDKKLKGQLSAIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAA+VEKWLMPSEGGYLE EGLEKTWRIKQETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAEVEKWLMPSEGGYLETEGLEKTWRIKQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYS+DYTSNGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSIDYTSNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNREGTELHCLKEHGSV RLQFLKNHFLL SINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVRRLQFLKNHFLLVSINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCH GPVSALAFH
Sbjct: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHPGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG+ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYG GS +QILGD  G
Sbjct: 301 PNGHLMATSGAERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQILGDFSG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           AQNY RYMAHSM KGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE
Sbjct: 361 AQNYNRYMAHSMAKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETISLNP+KIGT+MAVKKKEKKTKKERDAEEEAA+DAAKG+
Sbjct: 421 TSKQRREKEVRSLLDKLPPETISLNPTKIGTLMAVKKKEKKTKKERDAEEEAAVDAAKGI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSRLSEEVELPKSLQRF 540
           TMKKKTKGRNKPTKREKKKHEIIEKAKRPFL E +KEEELSRK+SRLSEEVELPKSLQRF
Sbjct: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLHEQIKEEELSRKKSRLSEEVELPKSLQRF 536

Query: 541 ARKKTAT 548
           ARKKTAT
Sbjct: 541 ARKKTAT 536

BLAST of HG10022895 vs. ExPASy TrEMBL
Match: A0A6J1FNM3 (probable U3 small nucleolar RNA-associated protein 7 OS=Cucurbita moschata OX=3662 GN=LOC111445539 PE=4 SV=1)

HSP 1 Score: 971.8 bits (2511), Expect = 1.1e-279
Identity = 498/548 (90.88%), Positives = 515/548 (93.98%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           ME+ELG  VAERILPPTEQEV NE DVK+KKYLRGEGANLEVLKDKKLKGQLSVIEDLYG
Sbjct: 1   MEEELGKPVAERILPPTEQEVLNEEDVKIKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRI+QETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIRQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYSLDYT NGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSLDYTLNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNR+GTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNRDGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           M G FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSS+PLVKMLCHQGPVSALAFH
Sbjct: 241 MAGVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSSPLVKMLCHQGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLA G GS +QILGDL G
Sbjct: 301 PNGHLMATSGCERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLACGTGSYVQILGDLAG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           +QNYTRYM+HSMVKGYQIGKILFRPYEDVLGIGHSMGWSSIL+PGSGEPNFDTWVANPFE
Sbjct: 361 SQNYTRYMSHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILVPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETI+LNPSKIGTVMAVKKKEKKTKKER+AEEE+AIDAAK +
Sbjct: 421 TSKQRREKEVRSLLDKLPPETIALNPSKIGTVMAVKKKEKKTKKEREAEEESAIDAAKNI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMK-EEELSRKRSRLSEEVELPKSLQR 540
           TMKKKTKGRNKPTKREKKK EII+K+K+PFLQE +K EEELSRKR R SEEVELPKSLQR
Sbjct: 481 TMKKKTKGRNKPTKREKKKREIIDKSKKPFLQEQIKEEEELSRKRPRSSEEVELPKSLQR 537

Query: 541 FARKKTAT 548
           FARKKTAT
Sbjct: 541 FARKKTAT 537

BLAST of HG10022895 vs. ExPASy TrEMBL
Match: A0A6J1JVD7 (probable U3 small nucleolar RNA-associated protein 7 OS=Cucurbita maxima OX=3661 GN=LOC111489219 PE=4 SV=1)

HSP 1 Score: 969.9 bits (2506), Expect = 4.4e-279
Identity = 496/548 (90.51%), Positives = 514/548 (93.80%), Query Frame = 0

Query: 1   MEKELGNAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60
           ME+ELGN VAERILPP EQEVSNE D K+KKYLRGEGANLEVLKDKKLKGQLSVIEDLYG
Sbjct: 1   MEEELGNPVAERILPPAEQEVSNEEDAKIKKYLRGEGANLEVLKDKKLKGQLSVIEDLYG 60

Query: 61  KSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFI 120
           KSAKAAAKVEKWL+PSEGGYLEAEGLEKTWRI+QETISHEVDILSRRNQHDIILP     
Sbjct: 61  KSAKAAAKVEKWLLPSEGGYLEAEGLEKTWRIRQETISHEVDILSRRNQHDIILP----- 120

Query: 121 IRFICAALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNE 180
                 ALGPYSLDYT NGRYMAIAGRKGHLALVD KDLNLIKEFQV ETVRDVVFLHNE
Sbjct: 121 ------ALGPYSLDYTLNGRYMAIAGRKGHLALVDMKDLNLIKEFQVKETVRDVVFLHNE 180

Query: 181 LFFAAAQKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGG 240
           LFFAAAQKKYPYIYNR+GTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTG 
Sbjct: 181 LFFAAAQKKYPYIYNRDGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGS 240

Query: 241 MVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFH 300
           M G FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSS+PLVKMLCHQGPVSALAFH
Sbjct: 241 MAGVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSSPLVKMLCHQGPVSALAFH 300

Query: 301 PNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGIGSSIQILGDLCG 360
           PNGHLMATSG ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLA G GS +QILGDL G
Sbjct: 301 PNGHLMATSGCERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLACGTGSYVQILGDLAG 360

Query: 361 AQNYTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420
           +QNYTRYM+HS+VKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE
Sbjct: 361 SQNYTRYMSHSIVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFE 420

Query: 421 TSKQRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGV 480
           TSKQRREKEVRSLLDKLPPETI+LNPSKIGTVMAVKKKEKKTKKER+AEEE+AIDAAK +
Sbjct: 421 TSKQRREKEVRSLLDKLPPETIALNPSKIGTVMAVKKKEKKTKKEREAEEESAIDAAKSI 480

Query: 481 TMKKKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMK-EEELSRKRSRLSEEVELPKSLQR 540
           TMKKKTKGRNKPTKREKKK EI +K+K+PFLQE +K EEELSRKR RL EEVELPKSLQR
Sbjct: 481 TMKKKTKGRNKPTKREKKKREIFDKSKKPFLQEQIKEEEELSRKRPRLGEEVELPKSLQR 537

Query: 541 FARKKTAT 548
           FARKKTAT
Sbjct: 541 FARKKTAT 537

BLAST of HG10022895 vs. TAIR 10
Match: AT3G10530.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 730.3 bits (1884), Expect = 1.1e-210
Identity = 374/542 (69.00%), Positives = 436/542 (80.44%), Query Frame = 0

Query: 7   NAVAERILPPTEQEVSNEIDVKVKKYLRGEGANLEVLKDKKLKGQLSVIEDLYGKSAKAA 66
           N + E++LPP EQE   E++ KVKKYLRGEGANLE LKDKKLK QL+  E LYGKSAKAA
Sbjct: 8   NNLMEKVLPPVEQESDVELETKVKKYLRGEGANLETLKDKKLKTQLASREKLYGKSAKAA 67

Query: 67  AKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQHDIILPGLIFIIRFICA 126
           AK+EKWL+P+E GYLE EGLEKTWR+KQ  I++EVDILS RNQ+DI+LP           
Sbjct: 68  AKIEKWLLPAEAGYLETEGLEKTWRVKQTDIANEVDILSSRNQYDIVLPD---------- 127

Query: 127 ALGPYSLDYTSNGRYMAIAGRKGHLALVDTKDLNLIKEFQVMETVRDVVFLHNELFFAAA 186
             GPY LD+T++GR+M   GRKGHLAL+D  +++LIKE QV ETVRDV FLHN+ FFAAA
Sbjct: 128 -FGPYKLDFTASGRHMLAGGRKGHLALLDMMNMSLIKEIQVRETVRDVAFLHNDQFFAAA 187

Query: 187 QKKYPYIYNREGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQLHYQDVTTGGMVGSFR 246
           QKKY YIY R+GTELHCLKE G V RL+FLKNHFLLAS+N  GQLHYQDVT GGMV S R
Sbjct: 188 QKKYAYIYGRDGTELHCLKERGPVARLRFLKNHFLLASVNMSGQLHYQDVTHGGMVASIR 247

Query: 247 TGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLM 306
           TG GRTDVM+VNP+N V+  GHSGG+V MWKPTS APLV+M CH GPVS++AFHPNGHLM
Sbjct: 248 TGKGRTDVMEVNPYNSVVGLGHSGGTVTMWKPTSQAPLVQMQCHPGPVSSVAFHPNGHLM 307

Query: 307 ATSGSERKIKLWDLRKFEVLQTLPG-HAKTLDFSQKGLLAYGIGSSIQILGDLCG--AQN 366
           ATSG ERKIK+WDLRKFE +QT+   HAKTL FSQKGLLA G GS +QILGD  G  + N
Sbjct: 308 ATSGKERKIKIWDLRKFEEVQTIHSFHAKTLSFSQKGLLAAGTGSFVQILGDSSGGSSHN 367

Query: 367 YTRYMAHSMVKGYQIGKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFETSK 426
           YTRYM HSMVKGYQI K++FRPYEDV+GIGHSMGWSSILIPGSGEPNFD+WVANPFETSK
Sbjct: 368 YTRYMNHSMVKGYQIEKVMFRPYEDVIGIGHSMGWSSILIPGSGEPNFDSWVANPFETSK 427

Query: 427 QRREKEVRSLLDKLPPETISLNPSKIGTVMAVKKKEKKTKKERDAEEEAAIDAAKGVTMK 486
           QRREKEV SLLDKLPPETI L+PSKIG +   ++KEK ++ E +AE+E AI+AAK   +K
Sbjct: 428 QRREKEVHSLLDKLPPETIMLDPSKIGAMRPSRRKEKPSRGEIEAEKEVAIEAAKSTELK 487

Query: 487 KKTKGRNKPTKREKKKHEIIEKAKRPFLQEHMKEEELSRKRSRLSEE--VELPKSLQRFA 544
            KTKGRNKP+KR KKK E++E AKR F ++   E   + K+ R+ E+   ELP SL+RFA
Sbjct: 488 NKTKGRNKPSKRTKKKKEMVENAKRTFPEQ---EHNTAIKKRRIVEDAAAELPTSLKRFA 535

BLAST of HG10022895 vs. TAIR 10
Match: AT1G11160.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 62.8 bits (151), Expect = 1.0e-09
Identity = 31/107 (28.97%), Positives = 53/107 (49.53%), Query Frame = 0

Query: 222 LASINKFGQLHYQDVTTGGMVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSS 281
           LAS +    L   D    G + +++        ++ +P    + +G     V +W  T+ 
Sbjct: 115 LASGSSDTNLRVWDTRKKGCIQTYKGHTRGISTIEFSPDGRWVVSGGLDNVVKVWDLTAG 174

Query: 282 APLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLRKFEVLQT 329
             L +  CH+GP+ +L FHP   L+AT  ++R +K WDL  FE++ T
Sbjct: 175 KLLHEFKCHEGPIRSLDFHPLEFLLATGSADRTVKFWDLETFELIGT 221

BLAST of HG10022895 vs. TAIR 10
Match: AT5G23430.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 59.3 bits (142), Expect = 1.1e-08
Identity = 25/92 (27.17%), Positives = 48/92 (52.17%), Query Frame = 0

Query: 235 DVTTGGMVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPV 294
           D+   G + +++      +V++  P    + +G     V +W  T+   L +   H+G +
Sbjct: 129 DIRKKGCIHTYKGHTRGVNVLRFTPDGRWVVSGGEDNIVKVWDLTAGKLLTEFKSHEGQI 188

Query: 295 SALAFHPNGHLMATSGSERKIKLWDLRKFEVL 327
            +L FHP+  L+AT  ++R +K WDL  FE++
Sbjct: 189 QSLDFHPHEFLLATGSADRTVKFWDLETFELI 220

BLAST of HG10022895 vs. TAIR 10
Match: AT5G23430.2 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 59.3 bits (142), Expect = 1.1e-08
Identity = 25/92 (27.17%), Positives = 48/92 (52.17%), Query Frame = 0

Query: 235 DVTTGGMVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSSAPLVKMLCHQGPV 294
           D+   G + +++      +V++  P    + +G     V +W  T+   L +   H+G +
Sbjct: 129 DIRKKGCIHTYKGHTRGVNVLRFTPDGRWVVSGGEDNIVKVWDLTAGKLLTEFKSHEGQI 188

Query: 295 SALAFHPNGHLMATSGSERKIKLWDLRKFEVL 327
            +L FHP+  L+AT  ++R +K WDL  FE++
Sbjct: 189 QSLDFHPHEFLLATGSADRTVKFWDLETFELI 220

BLAST of HG10022895 vs. TAIR 10
Match: AT1G61210.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 58.9 bits (141), Expect = 1.5e-08
Identity = 35/137 (25.55%), Positives = 65/137 (47.45%), Query Frame = 0

Query: 222 LASINKFGQLHYQDVTTGGMVGSFRTGLGRTDVMQVNPFNGVIATGHSGGSVAMWKPTSS 281
           LAS +    L   D+   G + +++        ++  P    + +G     V +W  T+ 
Sbjct: 115 LASGSSDANLKIWDIRKKGCIQTYKGHSRGISTIRFTPDGRWVVSGGLDNVVKVWDLTAG 174

Query: 282 APLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLRKFEVLQTLPGHA---KTLDF 341
             L +   H+GP+ +L FHP   L+AT  ++R +K WDL  FE++ +    A   +++ F
Sbjct: 175 KLLHEFKFHEGPIRSLDFHPLEFLLATGSADRTVKFWDLETFELIGSTRPEATGVRSIKF 234

Query: 342 SQKG-LLAYGIGSSIQI 355
              G  L  G+  S+++
Sbjct: 235 HPDGRTLFCGLDDSLKV 251
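
Each hit above carries the same two statistics lines (bit score with raw score and E-value, then identity and positive counts). As a rough illustration of how those lines could be read programmatically, here is a minimal Python sketch; the function name `parse_hsp` and the dictionary keys are hypothetical, and the patterns are written only against the line layout shown in this report (either spelling of the Positives label is accepted).

```python
import re

# Patterns matched against the statistics lines as printed in this report.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
COUNT_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp(score_line, identity_line):
    """Parse one HSP's two statistics lines into a dictionary."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line")
    hsp = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "expect": float(m.group(3)),
    }
    for label, num, den, pct in COUNT_RE.findall(identity_line):
        key = "identity" if label == "Identity" else "positives"
        hsp[key] = (int(num), int(den))
        hsp[key + "_pct_reported"] = float(pct)
        # Recompute the percentage from the fraction as a consistency check.
        hsp[key + "_pct"] = round(100 * int(num) / int(den), 2)
    return hsp


hsp = parse_hsp(
    "HSP 1 Score: 1002.3 bits (2590), Expect = 7.9e-289",
    "Identity = 509/547 (93.05%), Positives = 522/547 (95.43%), Query Frame = 0",
)
print(hsp)
```

Recomputing the percentage from the fraction gives a quick check that the reported values and the counts agree.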

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038886607.1 | 2.3e-290 | 94.70 | probable U3 small nucleolar RNA-associated protein 7 [Benincasa hispida]
XP_008456565.1 | 1.6e-288 | 93.05 | PREDICTED: probable U3 small nucleolar RNA-associated protein 7 [Cucumis melo] >...
XP_004138833.1 | 1.2e-286 | 92.69 | probable U3 small nucleolar RNA-associated protein 7 [Cucumis sativus] >XP_03173...
KAG6578785.1 | 1.4e-279 | 91.06 | hypothetical protein SDJN03_23233, partial [Cucurbita argyrosperma subsp. sorori...
XP_022939750.1 | 2.4e-279 | 90.88 | probable U3 small nucleolar RNA-associated protein 7 [Cucurbita moschata]

Match Name | E-value | Identity | Description
Q9P4X3 | 1.6e-92 | 39.06 | Probable U3 small nucleolar RNA-associated protein 7 OS=Schizosaccharomyces pomb...
P40055 | 5.1e-83 | 36.09 | U3 small nucleolar RNA-associated protein 7 OS=Saccharomyces cerevisiae (strain ...
Q9Z0H1 | 1.1e-80 | 37.16 | WD repeat-containing protein 46 OS=Mus musculus OX=10090 GN=Wdr46 PE=2 SV=1
Q5TJE7 | 5.8e-79 | 38.34 | WD repeat-containing protein 46 OS=Canis lupus familiaris OX=9615 GN=WDR46 PE=3 ...
O15213 | 9.9e-79 | 35.97 | WD repeat-containing protein 46 OS=Homo sapiens OX=9606 GN=WDR46 PE=1 SV=3

Match Name | E-value | Identity | Description
A0A5D3BDN6 | 7.9e-289 | 93.05 | Putative U3 small nucleolar RNA-associated protein 7 OS=Cucumis melo var. makuwa...
A0A1S3C483 | 7.9e-289 | 93.05 | probable U3 small nucleolar RNA-associated protein 7 OS=Cucumis melo OX=3656 GN=...
A0A0A0LQI6 | 5.7e-287 | 92.69 | WD_REPEATS_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G...
A0A6J1FNM3 | 1.1e-279 | 90.88 | probable U3 small nucleolar RNA-associated protein 7 OS=Cucurbita moschata OX=36...
A0A6J1JVD7 | 4.4e-279 | 90.51 | probable U3 small nucleolar RNA-associated protein 7 OS=Cucurbita maxima OX=3661...

Match Name | E-value | Identity | Description
AT3G10530.1 | 1.1e-210 | 69.00 | Transducin/WD40 repeat-like superfamily protein
AT1G11160.1 | 1.0e-09 | 28.97 | Transducin/WD40 repeat-like superfamily protein
AT5G23430.1 | 1.1e-08 | 27.17 | Transducin/WD40 repeat-like superfamily protein
AT5G23430.2 | 1.1e-08 | 27.17 | Transducin/WD40 repeat-like superfamily protein
AT1G61210.1 | 1.5e-08 | 25.55 | Transducin/WD40 repeat-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR012952 | BING4, C-terminal domain | SMART | SM01033 | BING4CT_2 | coord: 366..446; e-value: 1.6E-53; score: 193.8
IPR012952 | BING4, C-terminal domain | PFAM | PF08149 | BING4CT | coord: 367..445; e-value: 7.4E-34; score: 115.4
IPR001680 | WD40 repeat | SMART | SM00320 | WD40_4 | coord: 158..195; e-value: 470.0; score: 0.1
IPR001680 | WD40 repeat | SMART | SM00320 | WD40_4 | coord: 116..155; e-value: 280.0; score: 1.5
IPR001680 | WD40 repeat | SMART | SM00320 | WD40_4 | coord: 238..277; e-value: 22.0; score: 8.5
IPR001680 | WD40 repeat | SMART | SM00320 | WD40_4 | coord: 280..319; e-value: 5.7E-8; score: 42.5
IPR001680 | WD40 repeat | PFAM | PF00400 | WD40 | coord: 289..319; e-value: 4.8E-5; score: 24.0
IPR001680 | WD40 repeat | PROSITE | PS50082 | WD_REPEATS_2 | coord: 287..328; score: 14.819399
IPR015943 | WD40/YVTN repeat-like-containing domain superfamily | GENE3D | 2.130.10.10 |  | coord: 122..249; e-value: 9.4E-8; score: 33.3
IPR015943 | WD40/YVTN repeat-like-containing domain superfamily | GENE3D | 2.130.10.10 |  | coord: 250..354; e-value: 6.1E-18; score: 67.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 497..539
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 476..547
None | No IPR available | PANTHER | PTHR14085:SF4 | BNAANNG36350D PROTEIN | coord: 9..543
None | No IPR available | PROSITE | PS50294 | WD_REPEATS_REGION | coord: 287..321; score: 11.444681
IPR040315 | WD repeat-containing protein WDR46/Utp7 | PANTHER | PTHR14085 | WD-REPEAT PROTEIN BING4 | coord: 9..543
IPR036322 | WD40-repeat-containing domain superfamily | SUPERFAMILY | 50978 | WD40 repeat-like | coord: 112..381
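
The InterPro hits above span a very wide range of e-values (from 1.6E-53 down to 470.0 for individual SMART WD40 matches), so it can help to separate the well-supported matches from the weak ones. The Python sketch below does this for the hits that report an e-value. The cutoff of 1e-3 is an illustrative assumption, not a threshold stated by this page or by InterPro, and the `Hit` type and `significant` function are hypothetical names.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    source: str
    term: str
    start: int
    end: int
    evalue: float


# Hits with an e-value, transcribed from the InterPro table above.
HITS = [
    Hit("SMART", "SM01033", 366, 446, 1.6e-53),
    Hit("PFAM", "PF08149", 367, 445, 7.4e-34),
    Hit("SMART", "SM00320", 158, 195, 470.0),
    Hit("SMART", "SM00320", 116, 155, 280.0),
    Hit("SMART", "SM00320", 238, 277, 22.0),
    Hit("SMART", "SM00320", 280, 319, 5.7e-8),
    Hit("PFAM", "PF00400", 289, 319, 4.8e-5),
    Hit("GENE3D", "2.130.10.10", 122, 249, 9.4e-8),
    Hit("GENE3D", "2.130.10.10", 250, 354, 6.1e-18),
]


def significant(hits: List[Hit], max_evalue: float = 1e-3) -> List[Hit]:
    """Keep hits whose e-value is at or below the (assumed) cutoff."""
    return [h for h in hits if h.evalue <= max_evalue]


for hit in significant(HITS):
    print(f"{hit.source:8s} {hit.term:12s} {hit.start}..{hit.end}  {hit.evalue:g}")
```

Under this assumed cutoff, only one of the four SMART WD40 repeat matches (280..319) is retained, which coincides with the region also reported by the PFAM and PROSITE WD40 entries.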

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10022895.1 | HG10022895.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0000462 | maturation of SSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA)
cellular_component | GO:0005730 | nucleolus
cellular_component | GO:0032040 | small-subunit processome
molecular_function | GO:0005515 | protein binding