HG10022860 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022860
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: CUE domain-containing protein
Location: Chr05: 29047376 .. 29051989 (+)
RNA-Seq Expression: HG10022860
Synteny: HG10022860
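
The Location field can be split into chromosome, start, end, and strand, and the genomic span derived from it. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names used for illustration only.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr05: 29047376 .. 29051989 (+)'.

    Returns (chromosome, start, end, strand).
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive,
    # so the span covers end - start + 1 bases.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr05: 29047376 .. 29051989 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 4614 bases on the plus strand of Chr05.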
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTTTGATAACGTTTACGACTGTCTGAAGGAACTCTTTCCGGAGGTATTATGTTGATATGATTTTAATTTTTTTTTTTTTCTTTTGCCAGTTATTATTGTTATTCTGTTGTCTTTGTGATAATTGAAATTTATGAAAATTCTGCTCTTAGTTTGTTAAGATGCCGAGGTTTTAAGTTCACTGGTGAAAATATGTTCATTCTCAGGTGGACCATCGGATACTAAGGGCTGTAGCCCGTGAACATCCTAAGGATGTTCATTTAGCTGTTAATGATGTTCTCACGGAGGTTATCCCTCGTTTTTGTGGAAAATTTAAATTACCCTCGCAGAATCCTCGTGTGGAGTTTGCATCTAAAACAGAAGGTATCCCTTCTGATTTGTATTGTCTTGTTATAACATTTTTCAATTTCTTGTCTTGTTTTCTAACTTCTAAGTGTGTGTTTGTAGTGTCATTGGAAAAAGAAGAATGTTTTTAAGGGAACACTTATATTACGAACACTTTTGAATGAAAATAGTAAAATGCTTCTTCTAAATACTCTTTATTAGTTTTTAGAATTTCTCAAACCACTATTGATTATTGGAAATTCTAACAAGTGTTTTTAAATGGTTTAAAACACTTCTTAGCACACTTTGAAATTTTAATAGAACACTACCAATTTCTAAAAAGGCACTTTATAAGACATGCCAAACACCCTGCCCAATCGTGTATTTATTTTTTGTTGGTTATGTGGAATGTGCAGTTGAGAACAAAATGCAAATGGATTCCTTGAGATGGGTGCAGAGGGGTATGGATACTTCAGAATCAGTTATAATTGTTGATGAAGCACGTGATGACACATTGAATCAATCTGTCTCTGTAAATAATTATGTCTCTGGTGATGATTGTGAACAATCACGTGAGCATACTGAAACTACAAGCCTCACTGTACCAGCCGACCAAGAAGACCGCAGTGAAGTAGAATTGAATCATGTAGCACCAGGAAAATCAAATGGTTTGATTCATGAAGATAATGAACATAATGATCACGAACAAAGTCCTCAAATTACCAAAATCTGGAACCAGGTTACTGAGGATATCGAACAGAATAAGACACTGCTTGATGGTGTTAATCATTCTCGTAACATTTGTGATTTGAACTCCTTTCTCGTGCATGAATTATACCATGATGATCGACCCGTTTCTGTGTATGAGAATTCTGATGGTCAGACTGCCAATCTATCTCTATCCGTAATCCATCTTCACTCAGAATCTGATCATCAGAAATCAAATGCAAATGGCACTTCAAATCCTAGTGCCAAGCAAGAGTGTTCTACTGGTGAGATGACTGCGATTGAAGATGGGTTGATGGGGCATACCATAATCACTCAGTCAGGCCAACCATGTAGTATCGATCTTCTTGACGAGATTATTGAGGATGCTAAAAGCAATAAGGTACTTAATAACTTGGTATTGATAAATCTCAGATTGAAAATTATAAGTATGGTAATGCTCAATGTTTTGGGTTCTTTGCACAACAGAGCCTTCACCATATTCCTTTAGTTTTCTTGGTTCTGTTCATTTATTAGGTTAAATGAAACCACTTTACTCTGGTGTGTCATTCTCATTTGGATTGTTTTTAGCCATCTGCCGGGGAGACTTTTTTATTTGAAATTGGGTGCTCAAACACATATTTGCACGTACCCTTTCCAAGGTGAAATTGTGCCTTGCTGCAGTGTTGTTAGTTGTTTCATATTGCAAAAACTTGTGCTAGTCTTTGAGAAATTATCAACTCACCAAAGATCATTGGGATATGAAGGTATTTGTTAGTTGGTTACTCTGTCCAGTTGCTTTTTCTGTCTGTCTTATGTGATGTTTTCTGACTAACCTTGGGAGAACTGTAGAAGTTAGATACTAACAAGAAAGTAAGTTGTGAATTGCCATGTATCTATGTTCTAGATTTTCAGAAATGACATGCACTCTATCGTATCATATTACAATATCCATGCTACTGTGTACTT
TTGCAAATATGAAGTATCACTATGTCATTAATTTGATTCTCCATGTCGTAAGATTTGGATCCTTGTAAGTTCTTTGTAATAAAGTTGAAATCTGCAGCTCTGATTAAGGTTAGGCTTGGTAGAAGACCGCACTGCAAGGGCTCATATGATGCATAGGTTTAGTGATATCAACTTATTGATGTGTTCTAAGCAATTAACCTTTCTGCAGATAACGTTATTCTCAGCAATGCAATTAGTTATCAGTAAGATGAAGGTATTGGAAGATATGGAGAAACATGTTGAAAAAGTCAAAGAAGACGCTGTCAATAGAGAGTCAGAAATTCTGGCAAAAGTGGAGGAAATGAAACAGACAGTTGCCCGTACTAAGGAAGCAAATGATATGGTGAAACATTTTCTTGTTAGATCCTATGGGCACTGTATGGATTATGCTTGTATGTTTAATTAACGAGTGAAATTTCTACTTTCATAGCATGCTGGAGAAGTTTATGGAGAAAAGGCGATTTTAGCAACGGAAACAAGGGAACTCCAATCTCGTCTGCTCAGCTTGTCAGATGAAAGAGACAGTTCTCTATCAATTCTTGATGAGGTGTTTCTCTACTTCAACTATACGTTATTGGTTACCATCTTATCTGAAACTCTTTTGGCCATTGGCTATCGTTTATAAGGTTAAGGATCCATGTCTTGTGAAATATTTGGTCTGTGTCAGCTTATATACAAGACAACAAATTAATTTTGGATCATTTTAATCAACTAGCATGTAATAGAAGTGTTATTCTTTTTATGCATCTTCTTTAATGAAAACATGTTAGACTATTTGATTTTTGTAGCTGCCTTTCAAAAACTATTTCAATTTTTCATTTTTCATGTTTAAAAATAAATTGTTCAGTTTTTAGTTTTGAAGTTTTGCAAAATAATTGTTGAACTGTTAGTAAATTATAAAGCCACATAAATAAGTTTTTTAATACGTATCTTTTGGTTTTTCGAAATCTCACTCTCTTTTTGACCCTTTATGAACGCCTTGTAGTTATAACCCCTGTGTCTTTTTGTCTCTATCTCTAACAGCTCTCCTGATCTTACCTTTCTCTATCTTACATATTTTTCTATTAACTTTTTTTTTCTTCACTAACTTATATCTCTCTGTAGTTATTTCCTATTAATTCTCTCTTCTGTATGAAAAATATAAATATAATGATAAAGATAGAAAGGTATATTTATTTTTTTAAAAAAAAACATTTGTTTTTTTTAATTAAAAAATGAAAATAAAATTTTTTAAATCATCATATCATTGATAAAATAAGTACTATCAAATAAAATATGTTTCTTTTTTAGGAAAAAGGAATAGAAAACATTATATAACATGTTTTGATTTTTGGGTGTTCAGAAAAATAGAAGACTAAAATTAGTTATCAATTATGTTTAGTTTTGTTTCCAAGGACAGAAAACTAAAGATAACTCAAAATTTGAATTATTTCTAGGCATGAGTGTCAATGTCCATCAAAGATTAGTAGTTGAATAATAACTGGTAAAAGTTCTTTATCATCCAGATGCATGCCACTCTTAAATCTAGAATGGCTGCAGTAGAAGCTATGCTGAAAACATTGGAGGATGAAAAGTTAGCTAAGGAAGAACATGCACGAAAGGCTCTTGCCGAGCAGGAGGCCCTCATGGAGAAGGTAGTCCAGGAATCAAGGATACTACAACAGGAAGCAAACGAGAATTCCAAGGTAACTAACCATAGTTGTTTGTTAGGGGTGGAACCAAAAATTCTCATTTCTGAAAATTTGAGGGGTAAACTAGAGATCTTTTAGAATACCTAAGACGAACGAGGAGGGGGCAGCTTCTTTGCACTGCCCTGTGTGTAATGCACCCTTCAGGGGGGAAGTTGATTATTGACTGACTACTGATGTTAATAATTTCTTTCAATGCAGTTACGAGAGTTTCTAATCGATCGTGGACAATTAGTTGACGTATTACAGTAAGTTTTTATCTTCTATAATCTT
GGACAACTAAATGATGTTGTAGACTAACCTTGTCCTTGTGCATTGTTGTACATCACATGAAAAACAAACAAGCCTGTATGTTATGATAAGCATTTGCATAACAAAGATGAACTTGTACCGTTAGTCATTCCTGTGTTCTCAAACTATCGTTACTGCTTTAAGATTTTGGAACGTCTATCTCGGTATGAGAAAATTATCAAATCACTACATGGTATGCTTGCAGGGGAGAAATTTCAGTTATTTGTCAAGACGTGAGGCATCTGAAGGAGAAGTTTGACTTGGAAGTACCACTAAGCAAATCTCTTTCCTCTAGTCAAACAAGCTTCATTCTAGCTTCATCAGGCTCATCTCTAAAAAGCGCAACTAGCGATTTGGCTCGTTTTGGCTCACCCCTAACGGACATAGCAGCTCATCTGGATGCTGACAAAGGTGTGTCATCAAATCTCGGAAAGGAAGGAAGCCAAGCATCCTCTGTCAGTAGCTCTAGCTTGGCATCGAACAATCTTGAAGAAGAAAGATCAGAAAGAAACCATTTCAAATCATCATTTTCAGATGATGGGTGGGATGTATTTGACAAAGATGCTGAGTTTGCTGAAGCTTCATATAAAGTTTGA

mRNA sequence

ATGGATTTTGATAACGTTTACGACTGTCTGAAGGAACTCTTTCCGGAGGTGGACCATCGGATACTAAGGGCTGTAGCCCGTGAACATCCTAAGGATGTTCATTTAGCTGTTAATGATGTTCTCACGGAGGTTATCCCTCGTTTTTGTGGAAAATTTAAATTACCCTCGCAGAATCCTCGTGTGGAGTTTGCATCTAAAACAGAAGTTGAGAACAAAATGCAAATGGATTCCTTGAGATGGGTGCAGAGGGGTATGGATACTTCAGAATCAGTTATAATTGTTGATGAAGCACGTGATGACACATTGAATCAATCTGTCTCTGTAAATAATTATGTCTCTGGTGATGATTGTGAACAATCACGTGAGCATACTGAAACTACAAGCCTCACTGTACCAGCCGACCAAGAAGACCGCAGTGAAGTAGAATTGAATCATGTAGCACCAGGAAAATCAAATGGTTTGATTCATGAAGATAATGAACATAATGATCACGAACAAAGTCCTCAAATTACCAAAATCTGGAACCAGGTTACTGAGGATATCGAACAGAATAAGACACTGCTTGATGGTGTTAATCATTCTCGTAACATTTGTGATTTGAACTCCTTTCTCGTGCATGAATTATACCATGATGATCGACCCGTTTCTGTGTATGAGAATTCTGATGGTCAGACTGCCAATCTATCTCTATCCGTAATCCATCTTCACTCAGAATCTGATCATCAGAAATCAAATGCAAATGGCACTTCAAATCCTAGTGCCAAGCAAGAGTGTTCTACTGGTGAGATGACTGCGATTGAAGATGGGTTGATGGGGCATACCATAATCACTCAGTCAGGCCAACCATGTAGTATCGATCTTCTTGACGAGATTATTGAGGATGCTAAAAGCAATAAGATAACGTTATTCTCAGCAATGCAATTAGTTATCAGTAAGATGAAGGTATTGGAAGATATGGAGAAACATGTTGAAAAAGTCAAAGAAGACGCTGTCAATAGAGAGTCAGAAATTCTGGCAAAAGTGGAGGAAATGAAACAGACAGTTGCCCGTACTAAGGAAGCAAATGATATGCATGCTGGAGAAGTTTATGGAGAAAAGGCGATTTTAGCAACGGAAACAAGGGAACTCCAATCTCGTCTGCTCAGCTTGTCAGATGAAAGAGACAGTTCTCTATCAATTCTTGATGAGATGCATGCCACTCTTAAATCTAGAATGGCTGCAGTAGAAGCTATGCTGAAAACATTGGAGGATGAAAAGTTAGCTAAGGAAGAACATGCACGAAAGGCTCTTGCCGAGCAGGAGGCCCTCATGGAGAAGGTAGTCCAGGAATCAAGGATACTACAACAGGAAGCAAACGAGAATTCCAAGTTACGAGAGTTTCTAATCGATCGTGGACAATTAGTTGACGTATTACAGGGAGAAATTTCAGTTATTTGTCAAGACGTGAGGCATCTGAAGGAGAAGTTTGACTTGGAAGTACCACTAAGCAAATCTCTTTCCTCTAGTCAAACAAGCTTCATTCTAGCTTCATCAGGCTCATCTCTAAAAAGCGCAACTAGCGATTTGGCTCGTTTTGGCTCACCCCTAACGGACATAGCAGCTCATCTGGATGCTGACAAAGGTGTGTCATCAAATCTCGGAAAGGAAGGAAGCCAAGCATCCTCTGTCAGTAGCTCTAGCTTGGCATCGAACAATCTTGAAGAAGAAAGATCAGAAAGAAACCATTTCAAATCATCATTTTCAGATGATGGGTGGGATGTATTTGACAAAGATGCTGAGTTTGCTGAAGCTTCATATAAAGTTTGA

Coding sequence (CDS)

ATGGATTTTGATAACGTTTACGACTGTCTGAAGGAACTCTTTCCGGAGGTGGACCATCGGATACTAAGGGCTGTAGCCCGTGAACATCCTAAGGATGTTCATTTAGCTGTTAATGATGTTCTCACGGAGGTTATCCCTCGTTTTTGTGGAAAATTTAAATTACCCTCGCAGAATCCTCGTGTGGAGTTTGCATCTAAAACAGAAGTTGAGAACAAAATGCAAATGGATTCCTTGAGATGGGTGCAGAGGGGTATGGATACTTCAGAATCAGTTATAATTGTTGATGAAGCACGTGATGACACATTGAATCAATCTGTCTCTGTAAATAATTATGTCTCTGGTGATGATTGTGAACAATCACGTGAGCATACTGAAACTACAAGCCTCACTGTACCAGCCGACCAAGAAGACCGCAGTGAAGTAGAATTGAATCATGTAGCACCAGGAAAATCAAATGGTTTGATTCATGAAGATAATGAACATAATGATCACGAACAAAGTCCTCAAATTACCAAAATCTGGAACCAGGTTACTGAGGATATCGAACAGAATAAGACACTGCTTGATGGTGTTAATCATTCTCGTAACATTTGTGATTTGAACTCCTTTCTCGTGCATGAATTATACCATGATGATCGACCCGTTTCTGTGTATGAGAATTCTGATGGTCAGACTGCCAATCTATCTCTATCCGTAATCCATCTTCACTCAGAATCTGATCATCAGAAATCAAATGCAAATGGCACTTCAAATCCTAGTGCCAAGCAAGAGTGTTCTACTGGTGAGATGACTGCGATTGAAGATGGGTTGATGGGGCATACCATAATCACTCAGTCAGGCCAACCATGTAGTATCGATCTTCTTGACGAGATTATTGAGGATGCTAAAAGCAATAAGATAACGTTATTCTCAGCAATGCAATTAGTTATCAGTAAGATGAAGGTATTGGAAGATATGGAGAAACATGTTGAAAAAGTCAAAGAAGACGCTGTCAATAGAGAGTCAGAAATTCTGGCAAAAGTGGAGGAAATGAAACAGACAGTTGCCCGTACTAAGGAAGCAAATGATATGCATGCTGGAGAAGTTTATGGAGAAAAGGCGATTTTAGCAACGGAAACAAGGGAACTCCAATCTCGTCTGCTCAGCTTGTCAGATGAAAGAGACAGTTCTCTATCAATTCTTGATGAGATGCATGCCACTCTTAAATCTAGAATGGCTGCAGTAGAAGCTATGCTGAAAACATTGGAGGATGAAAAGTTAGCTAAGGAAGAACATGCACGAAAGGCTCTTGCCGAGCAGGAGGCCCTCATGGAGAAGGTAGTCCAGGAATCAAGGATACTACAACAGGAAGCAAACGAGAATTCCAAGTTACGAGAGTTTCTAATCGATCGTGGACAATTAGTTGACGTATTACAGGGAGAAATTTCAGTTATTTGTCAAGACGTGAGGCATCTGAAGGAGAAGTTTGACTTGGAAGTACCACTAAGCAAATCTCTTTCCTCTAGTCAAACAAGCTTCATTCTAGCTTCATCAGGCTCATCTCTAAAAAGCGCAACTAGCGATTTGGCTCGTTTTGGCTCACCCCTAACGGACATAGCAGCTCATCTGGATGCTGACAAAGGTGTGTCATCAAATCTCGGAAAGGAAGGAAGCCAAGCATCCTCTGTCAGTAGCTCTAGCTTGGCATCGAACAATCTTGAAGAAGAAAGATCAGAAAGAAACCATTTCAAATCATCATTTTCAGATGATGGGTGGGATGTATTTGACAAAGATGCTGAGTTTGCTGAAGCTTCATATAAAGTTTGA

Protein sequence

MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPRVEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCEQSREHTETTSLTVPADQEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQITKIWNQVTEDIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTANLSLSVIHLHSESDHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSIDLLDEIIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQTVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMAAVEAMLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQLVDVLQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFGSPLTDIAAHLDADKGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWDVFDKDAEFAEASYKV
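
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates this with the standard genetic code, assuming an uppercase or lowercase DNA string with no ambiguous bases; `translate` is a hypothetical helper written for this example, not a function provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    A codon containing anything other than A, C, G, T raises KeyError.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGATTTTGATAACGTTTAC"))
print(translate("TATAAAGTTTGA"))
```

The two fragments translate to the first seven residues (MDFDNVY) and the last three residues (YKV) of the protein sequence listed above, with the final TGA acting as the stop codon.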
Homology
BLAST of HG10022860 vs. NCBI nr
Match: XP_008451385.1 (PREDICTED: uncharacterized protein LOC103492692 isoform X1 [Cucumis melo] >XP_008451386.1 PREDICTED: uncharacterized protein LOC103492692 isoform X1 [Cucumis melo])

HSP 1 Score: 832.4 bits (2149), Expect = 2.5e-237
Identity = 473/602 (78.57%), Positives = 513/602 (85.22%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           MDFD+VY  LKELFPEVDHRILRAVA E+PKDVHLAVND+LTEVIPR   +FK P Q+  
Sbjct: 1   MDFDSVYKSLKELFPEVDHRILRAVALENPKDVHLAVNDILTEVIPRCHPEFKSPLQDHC 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCEQS 120
           VEFASK E E+    D       G      V  +    D TLNQSVSVN+YV+ DDCE+ 
Sbjct: 61  VEFASKIEEESGTVGDE---ACNGTSYKSCVAHL----DGTLNQSVSVNSYVASDDCER- 120

Query: 121 REHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQITKIWNQVTE 180
            E+TETTSL+VPA+ QEDRSEVE+N VAP KSNGLI ED+ HNDHEQSPQITK W QVTE
Sbjct: 121 HENTETTSLSVPANIQEDRSEVEMNRVAPEKSNGLIQEDSGHNDHEQSPQITKTWTQVTE 180

Query: 181 DIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTANLSLSVIHLHSES 240
           DI+QNKTLLDGV+HS N+CDL+S LV EL HDDRP+ V ENSD QTAN S      HSES
Sbjct: 181 DIKQNKTLLDGVHHSLNVCDLDSLLVQELLHDDRPIPVDENSDSQTANPS---FENHSES 240

Query: 241 DHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSIDLLDEIIEDAKSNK 300
           D++KSNANGTSNP  KQE STGEMT IED LMG +I+T+SGQPCSID LDEIIE+AKSNK
Sbjct: 241 DYKKSNANGTSNPEPKQESSTGEMTTIEDRLMGPSILTRSGQPCSIDHLDEIIENAKSNK 300

Query: 301 ITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQTVARTKEANDMHA 360
           ITLFSAMQ V +KMK LEDMEK+ EKVKED  N ESEILAKVEEMKQTVARTKEANDMHA
Sbjct: 301 ITLFSAMQSVTNKMKELEDMEKYFEKVKEDTANSESEILAKVEEMKQTVARTKEANDMHA 360

Query: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMAAVEAMLKTLEDEK 420
           GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMH TLKSRMA++EA LK LE+EK
Sbjct: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHTTLKSRMASLEATLKALEEEK 420

Query: 421 LAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQLVDVLQGEISVIC 480
           LAKEEHARKALAEQEALMEKVVQES+ILQ EANEN+KLREFLI+RGQLVDVLQGEISVIC
Sbjct: 421 LAKEEHARKALAEQEALMEKVVQESKILQHEANENAKLREFLIERGQLVDVLQGEISVIC 480

Query: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFGSPLTDIAAHLDAD 540
           QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLK+ATSDL  F S L D  AHLDA+
Sbjct: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKTATSDLTHFSSILMDTVAHLDAE 540

Query: 541 KGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWDVFDKDAEFAEASY 600
           KG S NLGKEG+QASSVSSSSL+SNNL+EE SERNHFKSSFSDDGWDVFDKDAEF+EASY
Sbjct: 541 KGTSLNLGKEGNQASSVSSSSLSSNNLKEEGSERNHFKSSFSDDGWDVFDKDAEFSEASY 591

Query: 601 KV 602
            V
Sbjct: 601 FV 591
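
The percentages in each HSP header are the matched-position counts divided by the reported alignment length. The sketch below recomputes them from a header line; `hsp_percentages` is a hypothetical helper, and the regular expression assumes the `label = matched/length` layout used in the reports on this page.

```python
import re

# Matches fields of the form "Label = matched/length".
HSP_FIELD = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line):
    """Return {label: percentage} for every 'label = n/m' field in the line."""
    return {
        label: round(100 * int(matched) / int(length), 2)
        for label, matched, length in HSP_FIELD.findall(line)
    }


header = "Identity = 473/602 (78.57%), Positives = 513/602 (85.22%), Query Frame = 0"
print(hsp_percentages(header))
```

Fields without a fraction, such as `Query Frame = 0`, are skipped because they do not match the `matched/length` pattern.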

BLAST of HG10022860 vs. NCBI nr
Match: XP_008451387.1 (PREDICTED: uncharacterized protein LOC103492692 isoform X2 [Cucumis melo] >XP_008451388.1 PREDICTED: uncharacterized protein LOC103492692 isoform X2 [Cucumis melo])

HSP 1 Score: 828.2 bits (2138), Expect = 4.7e-236
Identity = 472/602 (78.41%), Positives = 514/602 (85.38%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           MDFD+VY  LKELFPEVDHRILRAVA E+PKDVHLAVND+LTEVIPR   +FK P Q+  
Sbjct: 1   MDFDSVYKSLKELFPEVDHRILRAVALENPKDVHLAVNDILTEVIPRCHPEFKSPLQDHC 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCEQS 120
           VEFASK E E  +  ++         TS    +     D TLNQSVSVN+YV+ DDCE+ 
Sbjct: 61  VEFASKIE-EGTVGDEACN------GTSYKSCVAH--LDGTLNQSVSVNSYVASDDCER- 120

Query: 121 REHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQITKIWNQVTE 180
            E+TETTSL+VPA+ QEDRSEVE+N VAP KSNGLI ED+ HNDHEQSPQITK W QVTE
Sbjct: 121 HENTETTSLSVPANIQEDRSEVEMNRVAPEKSNGLIQEDSGHNDHEQSPQITKTWTQVTE 180

Query: 181 DIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTANLSLSVIHLHSES 240
           DI+QNKTLLDGV+HS N+CDL+S LV EL HDDRP+ V ENSD QTAN S      HSES
Sbjct: 181 DIKQNKTLLDGVHHSLNVCDLDSLLVQELLHDDRPIPVDENSDSQTANPS---FENHSES 240

Query: 241 DHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSIDLLDEIIEDAKSNK 300
           D++KSNANGTSNP  KQE STGEMT IED LMG +I+T+SGQPCSID LDEIIE+AKSNK
Sbjct: 241 DYKKSNANGTSNPEPKQESSTGEMTTIEDRLMGPSILTRSGQPCSIDHLDEIIENAKSNK 300

Query: 301 ITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQTVARTKEANDMHA 360
           ITLFSAMQ V +KMK LEDMEK+ EKVKED  N ESEILAKVEEMKQTVARTKEANDMHA
Sbjct: 301 ITLFSAMQSVTNKMKELEDMEKYFEKVKEDTANSESEILAKVEEMKQTVARTKEANDMHA 360

Query: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMAAVEAMLKTLEDEK 420
           GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMH TLKSRMA++EA LK LE+EK
Sbjct: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHTTLKSRMASLEATLKALEEEK 420

Query: 421 LAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQLVDVLQGEISVIC 480
           LAKEEHARKALAEQEALMEKVVQES+ILQ EANEN+KLREFLI+RGQLVDVLQGEISVIC
Sbjct: 421 LAKEEHARKALAEQEALMEKVVQESKILQHEANENAKLREFLIERGQLVDVLQGEISVIC 480

Query: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFGSPLTDIAAHLDAD 540
           QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLK+ATSDL  F S L D  AHLDA+
Sbjct: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKTATSDLTHFSSILMDTVAHLDAE 540

Query: 541 KGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWDVFDKDAEFAEASY 600
           KG S NLGKEG+QASSVSSSSL+SNNL+EE SERNHFKSSFSDDGWDVFDKDAEF+EASY
Sbjct: 541 KGTSLNLGKEGNQASSVSSSSLSSNNLKEEGSERNHFKSSFSDDGWDVFDKDAEFSEASY 589

Query: 601 KV 602
            V
Sbjct: 601 FV 589

BLAST of HG10022860 vs. NCBI nr
Match: KAA0057604.1 (golgin candidate 5 [Cucumis melo var. makuwa] >TYK20989.1 golgin candidate 5 [Cucumis melo var. makuwa])

HSP 1 Score: 828.2 bits (2138), Expect = 4.7e-236
Identity = 472/602 (78.41%), Positives = 512/602 (85.05%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           MDFD+VY  LKELFPEVDHRILRAVA E+PKDVHLAVND+LTEVIPR   +FK P Q+  
Sbjct: 1   MDFDSVYKSLKELFPEVDHRILRAVALENPKDVHLAVNDILTEVIPRCHPEFKSPLQDHC 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCEQS 120
           VEFASK E E+    D       G      V  +    D TLNQSVSVN+YV+ DDCE+ 
Sbjct: 61  VEFASKIEEESGTVGDE---ACNGTSYESGVAHL----DGTLNQSVSVNSYVASDDCER- 120

Query: 121 REHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQITKIWNQVTE 180
            E+TETTSL+VPA+ QEDRSEVE+N VAP KSNGLI ED+ HNDHEQSPQITK WNQVTE
Sbjct: 121 HENTETTSLSVPANIQEDRSEVEMNRVAPEKSNGLIQEDSGHNDHEQSPQITKTWNQVTE 180

Query: 181 DIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTANLSLSVIHLHSES 240
           DI+QNKTLLDGV+HS N+ DL+S LV EL HDDRP+ V ENSD QTAN S      HSES
Sbjct: 181 DIKQNKTLLDGVHHSLNVRDLDSLLVQELLHDDRPIPVDENSDSQTANPS---FENHSES 240

Query: 241 DHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSIDLLDEIIEDAKSNK 300
           D++KSNANGTSNP  KQE STGEMT IED LMG +I+T+SGQPCSID LDEIIE+AKSNK
Sbjct: 241 DYKKSNANGTSNPEPKQESSTGEMTTIEDRLMGPSILTRSGQPCSIDHLDEIIENAKSNK 300

Query: 301 ITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQTVARTKEANDMHA 360
           ITLFSAMQ V +KMK LEDMEK+ EKVKED  N ESEILAKVEEMKQTVARTKEANDMHA
Sbjct: 301 ITLFSAMQSVTNKMKELEDMEKYFEKVKEDTANSESEILAKVEEMKQTVARTKEANDMHA 360

Query: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMAAVEAMLKTLEDEK 420
           GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMH TLKSRMA++EA LK LE+EK
Sbjct: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHTTLKSRMASLEATLKALEEEK 420

Query: 421 LAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQLVDVLQGEISVIC 480
           LAKEEHARKALAEQEAL EKVVQES+ILQ EANEN+KLREFLI+RGQLVDVLQGEISVIC
Sbjct: 421 LAKEEHARKALAEQEALTEKVVQESKILQHEANENAKLREFLIERGQLVDVLQGEISVIC 480

Query: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFGSPLTDIAAHLDAD 540
           QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLK+ATSDL  F S L D  AHLDA+
Sbjct: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKTATSDLTHFSSILMDTVAHLDAE 540

Query: 541 KGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWDVFDKDAEFAEASY 600
           KG S NLGKEG+QASSVSSSSL+SNNL+EE SERNHFKSSFSDDGWDVFDKDAEF+EASY
Sbjct: 541 KGTSLNLGKEGNQASSVSSSSLSSNNLKEEGSERNHFKSSFSDDGWDVFDKDAEFSEASY 591

Query: 601 KV 602
            V
Sbjct: 601 FV 591

BLAST of HG10022860 vs. NCBI nr
Match: XP_022992500.1 (uncharacterized protein LOC111488818 isoform X1 [Cucurbita maxima])

HSP 1 Score: 807.0 bits (2083), Expect = 1.1e-229
Identity = 461/623 (74.00%), Positives = 514/623 (82.50%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           M+ D+VY CL ELFPEVDHR+LRAVA E+PKDVH A+NDVLTEV+P F G+  LP Q+P+
Sbjct: 1   MNLDSVYKCLMELFPEVDHRMLRAVALENPKDVHAAINDVLTEVLPCFLGESILPPQDPK 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDT------------LNQSVSV 120
                  EVENKMQMDS  WV+R MD+ ES  I DEA D T            LNQSVS 
Sbjct: 61  -------EVENKMQMDSSTWVKREMDSLESGTIGDEASDVTSQASGVAHLDVALNQSVSE 120

Query: 121 NNYVSGDDCEQSREHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQS 180
            +YV+ DDC+QS E+TET SL V A  QEDRS+VELN VAPGK NGLIHED+E+NDHEQS
Sbjct: 121 TSYVASDDCKQSCENTETASLEVLASVQEDRSDVELNQVAPGKLNGLIHEDSEYNDHEQS 180

Query: 181 PQITKIWNQVTEDIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTAN 240
           PQITKI N VTEDI+QNKTL DGVNHS NI DL+ + VHE Y DDRP S  EN D QTAN
Sbjct: 181 PQITKILNPVTEDIKQNKTLFDGVNHSHNIHDLDGWHVHESYRDDRPFSADENCDSQTAN 240

Query: 241 LSL-------SVIHLHSESDHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSG 300
            S        SVIHLHSESD+QKSNANGTSNPS KQ+CSTGEM  IEDGL+GHTI+TQSG
Sbjct: 241 PSFGNHSPVQSVIHLHSESDYQKSNANGTSNPSLKQDCSTGEMIVIEDGLVGHTIVTQSG 300

Query: 301 QPCSIDLLDEIIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAK 360
           QPCSI+LL++ IEDAKSNKITLFSAMQ VI KMK LE +EK+VEKVKED+ N E EILAK
Sbjct: 301 QPCSIELLEKNIEDAKSNKITLFSAMQSVIDKMKALEHVEKYVEKVKEDSTNGELEILAK 360

Query: 361 VEEMKQTVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHAT 420
           VEEMKQ VARTKEANDMHAGEVYGEKAILATETRELQSRLLSLS+ERD SLSILDEMHAT
Sbjct: 361 VEEMKQIVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSEERDKSLSILDEMHAT 420

Query: 421 LKSRMAAVEAMLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREF 480
           + +RM AVEA+LK +++E LAKEE ARKALAEQEALMEKV++ESRILQQEA EN+KLREF
Sbjct: 421 IGARMTAVEAVLKVIQEENLAKEECARKALAEQEALMEKVLEESRILQQEAEENAKLREF 480

Query: 481 LIDRGQLVDVLQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATS 540
           L+  GQLVD+LQGEISVI QDVRHLKEKFDL+VPLSKSLSSSQTS ILASSGSSLKSA S
Sbjct: 481 LVHHGQLVDILQGEISVIVQDVRHLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAAS 540

Query: 541 DLARFGSPLTDIAAHLDADKGVSSNLGKEGSQASSV--SSSSLASNNLEEERSERNHFKS 600
           DLARF SPLTD A+HLDA+KG SSNL KEGS+ASS   S SS++SNNL+EERSERNH K+
Sbjct: 541 DLARFFSPLTDTASHLDAEKGSSSNLDKEGSRASSFISSKSSMSSNNLKEERSERNHLKA 600

Query: 601 SFSDDGWDVFDKDAEFAEASYKV 602
             SDDGWDVFDKDAEFA+A + V
Sbjct: 601 C-SDDGWDVFDKDAEFADAPHLV 615

BLAST of HG10022860 vs. NCBI nr
Match: XP_023548577.1 (uncharacterized protein LOC111807200 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 800.8 bits (2067), Expect = 8.0e-228
Identity = 458/623 (73.52%), Positives = 514/623 (82.50%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           MDFD+VY CL ELFPEVDHR+LRAVA E+PKDVH+A+NDVLTEVIP F G+++LP Q+P+
Sbjct: 1   MDFDSVYKCLVELFPEVDHRMLRAVALENPKDVHVAINDVLTEVIPCFLGEYQLPPQDPK 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDT------------LNQSVSV 120
                  EVENKMQMDS  WV+R MD+ +S  I DEA D T            LNQSVS 
Sbjct: 61  -------EVENKMQMDSSTWVKREMDSLKSGTIGDEASDVTSQGSGVAHLDFALNQSVSE 120

Query: 121 NNYVSGDDCEQSREHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQS 180
            +YV+ DD +QS E+TET SL   A  QEDRS+VELN VAPGK NGLIHED+E+NDHEQS
Sbjct: 121 TSYVASDDFKQSCENTETASLEALASVQEDRSDVELNQVAPGKLNGLIHEDSEYNDHEQS 180

Query: 181 PQITKIWNQVTEDIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTAN 240
            QITKI N VTEDI+QNKT  DGVNHS NI DL+ + VHE YHDDRP +  EN D QTAN
Sbjct: 181 LQITKILNLVTEDIKQNKTPFDGVNHSHNIHDLDGWHVHESYHDDRPFAADENCDSQTAN 240

Query: 241 LSL-------SVIHLHSESDHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSG 300
            S        SVIHLHSESD+QKSNANGTSNPS KQECSTGEM  IEDGL+GHTI+TQSG
Sbjct: 241 PSFGNHSPVQSVIHLHSESDYQKSNANGTSNPSPKQECSTGEMIVIEDGLVGHTIVTQSG 300

Query: 301 QPCSIDLLDEIIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAK 360
           QPCSI+LL++ IEDAKSNKITLFSAMQ VI KMK LE +EK+VEKVKED+ N E EILAK
Sbjct: 301 QPCSIELLEKNIEDAKSNKITLFSAMQSVIDKMKALEHVEKYVEKVKEDSANGELEILAK 360

Query: 361 VEEMKQTVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHAT 420
           VEEMKQ VARTKEANDMHAGEVYGEKAILATETRELQSRLLSLS+ERD SLSILDEMHAT
Sbjct: 361 VEEMKQIVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSEERDKSLSILDEMHAT 420

Query: 421 LKSRMAAVEAMLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREF 480
           + +RM AVEA+LK +++E LAKEE ARKALAEQEALMEKV++ESRIL++EA EN+KLREF
Sbjct: 421 IGARMTAVEAVLKVIQEENLAKEECARKALAEQEALMEKVLEESRILEREAKENAKLREF 480

Query: 481 LIDRGQLVDVLQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATS 540
           LI  GQLVD+LQGEISVI QDVRHLKEKFDL+VPLSKSLSSSQTS ILASSGSSLKSA S
Sbjct: 481 LIHHGQLVDILQGEISVIVQDVRHLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAAS 540

Query: 541 DLARFGSPLTDIAAHLDADKGVSSNLGKEGSQASSV--SSSSLASNNLEEERSERNHFKS 600
           DL RF SPLTD A+HLDA+KG SSNL KEGS+ASS   S SS++SNNL+EERSERNH K+
Sbjct: 541 DLTRFCSPLTDTASHLDAEKGSSSNLDKEGSRASSFISSKSSMSSNNLKEERSERNHLKA 600

Query: 601 SFSDDGWDVFDKDAEFAEASYKV 602
             SDDGWDVFDKDAEFA+A + V
Sbjct: 601 C-SDDGWDVFDKDAEFADAPHLV 615

BLAST of HG10022860 vs. ExPASy TrEMBL
Match: A0A1S3BQS2 (uncharacterized protein LOC103492692 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492692 PE=4 SV=1)

HSP 1 Score: 832.4 bits (2149), Expect = 1.2e-237
Identity = 473/602 (78.57%), Positives = 513/602 (85.22%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           MDFD+VY  LKELFPEVDHRILRAVA E+PKDVHLAVND+LTEVIPR   +FK P Q+  
Sbjct: 1   MDFDSVYKSLKELFPEVDHRILRAVALENPKDVHLAVNDILTEVIPRCHPEFKSPLQDHC 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCEQS 120
           VEFASK E E+    D       G      V  +    D TLNQSVSVN+YV+ DDCE+ 
Sbjct: 61  VEFASKIEEESGTVGDE---ACNGTSYKSCVAHL----DGTLNQSVSVNSYVASDDCER- 120

Query: 121 REHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQITKIWNQVTE 180
            E+TETTSL+VPA+ QEDRSEVE+N VAP KSNGLI ED+ HNDHEQSPQITK W QVTE
Sbjct: 121 HENTETTSLSVPANIQEDRSEVEMNRVAPEKSNGLIQEDSGHNDHEQSPQITKTWTQVTE 180

Query: 181 DIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTANLSLSVIHLHSES 240
           DI+QNKTLLDGV+HS N+CDL+S LV EL HDDRP+ V ENSD QTAN S      HSES
Sbjct: 181 DIKQNKTLLDGVHHSLNVCDLDSLLVQELLHDDRPIPVDENSDSQTANPS---FENHSES 240

Query: 241 DHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSIDLLDEIIEDAKSNK 300
           D++KSNANGTSNP  KQE STGEMT IED LMG +I+T+SGQPCSID LDEIIE+AKSNK
Sbjct: 241 DYKKSNANGTSNPEPKQESSTGEMTTIEDRLMGPSILTRSGQPCSIDHLDEIIENAKSNK 300

Query: 301 ITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQTVARTKEANDMHA 360
           ITLFSAMQ V +KMK LEDMEK+ EKVKED  N ESEILAKVEEMKQTVARTKEANDMHA
Sbjct: 301 ITLFSAMQSVTNKMKELEDMEKYFEKVKEDTANSESEILAKVEEMKQTVARTKEANDMHA 360

Query: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMAAVEAMLKTLEDEK 420
           GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMH TLKSRMA++EA LK LE+EK
Sbjct: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHTTLKSRMASLEATLKALEEEK 420

Query: 421 LAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQLVDVLQGEISVIC 480
           LAKEEHARKALAEQEALMEKVVQES+ILQ EANEN+KLREFLI+RGQLVDVLQGEISVIC
Sbjct: 421 LAKEEHARKALAEQEALMEKVVQESKILQHEANENAKLREFLIERGQLVDVLQGEISVIC 480

Query: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFGSPLTDIAAHLDAD 540
           QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLK+ATSDL  F S L D  AHLDA+
Sbjct: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKTATSDLTHFSSILMDTVAHLDAE 540

Query: 541 KGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWDVFDKDAEFAEASY 600
           KG S NLGKEG+QASSVSSSSL+SNNL+EE SERNHFKSSFSDDGWDVFDKDAEF+EASY
Sbjct: 541 KGTSLNLGKEGNQASSVSSSSLSSNNLKEEGSERNHFKSSFSDDGWDVFDKDAEFSEASY 591

Query: 601 KV 602
            V
Sbjct: 601 FV 591

BLAST of HG10022860 vs. ExPASy TrEMBL
Match: A0A1S3BRF4 (uncharacterized protein LOC103492692 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103492692 PE=4 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 2.3e-236
Identity = 472/602 (78.41%), Positives = 514/602 (85.38%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           MDFD+VY  LKELFPEVDHRILRAVA E+PKDVHLAVND+LTEVIPR   +FK P Q+  
Sbjct: 1   MDFDSVYKSLKELFPEVDHRILRAVALENPKDVHLAVNDILTEVIPRCHPEFKSPLQDHC 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCEQS 120
           VEFASK E E  +  ++         TS    +     D TLNQSVSVN+YV+ DDCE+ 
Sbjct: 61  VEFASKIE-EGTVGDEACN------GTSYKSCVAH--LDGTLNQSVSVNSYVASDDCER- 120

Query: 121 REHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQITKIWNQVTE 180
            E+TETTSL+VPA+ QEDRSEVE+N VAP KSNGLI ED+ HNDHEQSPQITK W QVTE
Sbjct: 121 HENTETTSLSVPANIQEDRSEVEMNRVAPEKSNGLIQEDSGHNDHEQSPQITKTWTQVTE 180

Query: 181 DIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTANLSLSVIHLHSES 240
           DI+QNKTLLDGV+HS N+CDL+S LV EL HDDRP+ V ENSD QTAN S      HSES
Sbjct: 181 DIKQNKTLLDGVHHSLNVCDLDSLLVQELLHDDRPIPVDENSDSQTANPS---FENHSES 240

Query: 241 DHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSIDLLDEIIEDAKSNK 300
           D++KSNANGTSNP  KQE STGEMT IED LMG +I+T+SGQPCSID LDEIIE+AKSNK
Sbjct: 241 DYKKSNANGTSNPEPKQESSTGEMTTIEDRLMGPSILTRSGQPCSIDHLDEIIENAKSNK 300

Query: 301 ITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQTVARTKEANDMHA 360
           ITLFSAMQ V +KMK LEDMEK+ EKVKED  N ESEILAKVEEMKQTVARTKEANDMHA
Sbjct: 301 ITLFSAMQSVTNKMKELEDMEKYFEKVKEDTANSESEILAKVEEMKQTVARTKEANDMHA 360

Query: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMAAVEAMLKTLEDEK 420
           GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMH TLKSRMA++EA LK LE+EK
Sbjct: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHTTLKSRMASLEATLKALEEEK 420

Query: 421 LAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQLVDVLQGEISVIC 480
           LAKEEHARKALAEQEALMEKVVQES+ILQ EANEN+KLREFLI+RGQLVDVLQGEISVIC
Sbjct: 421 LAKEEHARKALAEQEALMEKVVQESKILQHEANENAKLREFLIERGQLVDVLQGEISVIC 480

Query: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFGSPLTDIAAHLDAD 540
           QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLK+ATSDL  F S L D  AHLDA+
Sbjct: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKTATSDLTHFSSILMDTVAHLDAE 540

Query: 541 KGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWDVFDKDAEFAEASY 600
           KG S NLGKEG+QASSVSSSSL+SNNL+EE SERNHFKSSFSDDGWDVFDKDAEF+EASY
Sbjct: 541 KGTSLNLGKEGNQASSVSSSSLSSNNLKEEGSERNHFKSSFSDDGWDVFDKDAEFSEASY 589

Query: 601 KV 602
            V
Sbjct: 601 FV 589

BLAST of HG10022860 vs. ExPASy TrEMBL
Match: A0A5A7UR71 (Golgin candidate 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold328G00140 PE=4 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 2.3e-236
Identity = 472/602 (78.41%), Positives = 512/602 (85.05%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           MDFD+VY  LKELFPEVDHRILRAVA E+PKDVHLAVND+LTEVIPR   +FK P Q+  
Sbjct: 1   MDFDSVYKSLKELFPEVDHRILRAVALENPKDVHLAVNDILTEVIPRCHPEFKSPLQDHC 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCEQS 120
           VEFASK E E+    D       G      V  +    D TLNQSVSVN+YV+ DDCE+ 
Sbjct: 61  VEFASKIEEESGTVGDE---ACNGTSYESGVAHL----DGTLNQSVSVNSYVASDDCER- 120

Query: 121 REHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQITKIWNQVTE 180
            E+TETTSL+VPA+ QEDRSEVE+N VAP KSNGLI ED+ HNDHEQSPQITK WNQVTE
Sbjct: 121 HENTETTSLSVPANIQEDRSEVEMNRVAPEKSNGLIQEDSGHNDHEQSPQITKTWNQVTE 180

Query: 181 DIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTANLSLSVIHLHSES 240
           DI+QNKTLLDGV+HS N+ DL+S LV EL HDDRP+ V ENSD QTAN S      HSES
Sbjct: 181 DIKQNKTLLDGVHHSLNVRDLDSLLVQELLHDDRPIPVDENSDSQTANPS---FENHSES 240

Query: 241 DHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSIDLLDEIIEDAKSNK 300
           D++KSNANGTSNP  KQE STGEMT IED LMG +I+T+SGQPCSID LDEIIE+AKSNK
Sbjct: 241 DYKKSNANGTSNPEPKQESSTGEMTTIEDRLMGPSILTRSGQPCSIDHLDEIIENAKSNK 300

Query: 301 ITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQTVARTKEANDMHA 360
           ITLFSAMQ V +KMK LEDMEK+ EKVKED  N ESEILAKVEEMKQTVARTKEANDMHA
Sbjct: 301 ITLFSAMQSVTNKMKELEDMEKYFEKVKEDTANSESEILAKVEEMKQTVARTKEANDMHA 360

Query: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMAAVEAMLKTLEDEK 420
           GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMH TLKSRMA++EA LK LE+EK
Sbjct: 361 GEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHTTLKSRMASLEATLKALEEEK 420

Query: 421 LAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQLVDVLQGEISVIC 480
           LAKEEHARKALAEQEAL EKVVQES+ILQ EANEN+KLREFLI+RGQLVDVLQGEISVIC
Sbjct: 421 LAKEEHARKALAEQEALTEKVVQESKILQHEANENAKLREFLIERGQLVDVLQGEISVIC 480

Query: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFGSPLTDIAAHLDAD 540
           QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLK+ATSDL  F S L D  AHLDA+
Sbjct: 481 QDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKTATSDLTHFSSILMDTVAHLDAE 540

Query: 541 KGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWDVFDKDAEFAEASY 600
           KG S NLGKEG+QASSVSSSSL+SNNL+EE SERNHFKSSFSDDGWDVFDKDAEF+EASY
Sbjct: 541 KGTSLNLGKEGNQASSVSSSSLSSNNLKEEGSERNHFKSSFSDDGWDVFDKDAEFSEASY 591

Query: 601 KV 602
            V
Sbjct: 601 FV 591

BLAST of HG10022860 vs. ExPASy TrEMBL
Match: A0A6J1JVV4 (uncharacterized protein LOC111488818 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488818 PE=4 SV=1)

HSP 1 Score: 807.0 bits (2083), Expect = 5.4e-230
Identity = 461/623 (74.00%), Positives = 514/623 (82.50%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           M+ D+VY CL ELFPEVDHR+LRAVA E+PKDVH A+NDVLTEV+P F G+  LP Q+P+
Sbjct: 1   MNLDSVYKCLMELFPEVDHRMLRAVALENPKDVHAAINDVLTEVLPCFLGESILPPQDPK 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDT------------LNQSVSV 120
                  EVENKMQMDS  WV+R MD+ ES  I DEA D T            LNQSVS 
Sbjct: 61  -------EVENKMQMDSSTWVKREMDSLESGTIGDEASDVTSQASGVAHLDVALNQSVSE 120

Query: 121 NNYVSGDDCEQSREHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQS 180
            +YV+ DDC+QS E+TET SL V A  QEDRS+VELN VAPGK NGLIHED+E+NDHEQS
Sbjct: 121 TSYVASDDCKQSCENTETASLEVLASVQEDRSDVELNQVAPGKLNGLIHEDSEYNDHEQS 180

Query: 181 PQITKIWNQVTEDIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTAN 240
           PQITKI N VTEDI+QNKTL DGVNHS NI DL+ + VHE Y DDRP S  EN D QTAN
Sbjct: 181 PQITKILNPVTEDIKQNKTLFDGVNHSHNIHDLDGWHVHESYRDDRPFSADENCDSQTAN 240

Query: 241 LSL-------SVIHLHSESDHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSG 300
            S        SVIHLHSESD+QKSNANGTSNPS KQ+CSTGEM  IEDGL+GHTI+TQSG
Sbjct: 241 PSFGNHSPVQSVIHLHSESDYQKSNANGTSNPSLKQDCSTGEMIVIEDGLVGHTIVTQSG 300

Query: 301 QPCSIDLLDEIIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAK 360
           QPCSI+LL++ IEDAKSNKITLFSAMQ VI KMK LE +EK+VEKVKED+ N E EILAK
Sbjct: 301 QPCSIELLEKNIEDAKSNKITLFSAMQSVIDKMKALEHVEKYVEKVKEDSTNGELEILAK 360

Query: 361 VEEMKQTVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHAT 420
           VEEMKQ VARTKEANDMHAGEVYGEKAILATETRELQSRLLSLS+ERD SLSILDEMHAT
Sbjct: 361 VEEMKQIVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSEERDKSLSILDEMHAT 420

Query: 421 LKSRMAAVEAMLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREF 480
           + +RM AVEA+LK +++E LAKEE ARKALAEQEALMEKV++ESRILQQEA EN+KLREF
Sbjct: 421 IGARMTAVEAVLKVIQEENLAKEECARKALAEQEALMEKVLEESRILQQEAEENAKLREF 480

Query: 481 LIDRGQLVDVLQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATS 540
           L+  GQLVD+LQGEISVI QDVRHLKEKFDL+VPLSKSLSSSQTS ILASSGSSLKSA S
Sbjct: 481 LVHHGQLVDILQGEISVIVQDVRHLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAAS 540

Query: 541 DLARFGSPLTDIAAHLDADKGVSSNLGKEGSQASSV--SSSSLASNNLEEERSERNHFKS 600
           DLARF SPLTD A+HLDA+KG SSNL KEGS+ASS   S SS++SNNL+EERSERNH K+
Sbjct: 541 DLARFFSPLTDTASHLDAEKGSSSNLDKEGSRASSFISSKSSMSSNNLKEERSERNHLKA 600

Query: 601 SFSDDGWDVFDKDAEFAEASYKV 602
             SDDGWDVFDKDAEFA+A + V
Sbjct: 601 C-SDDGWDVFDKDAEFADAPHLV 615

BLAST of HG10022860 vs. ExPASy TrEMBL
Match: A0A6J1GNI7 (uncharacterized protein LOC111456040 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456040 PE=4 SV=1)

HSP 1 Score: 797.0 bits (2057), Expect = 5.6e-227
Identity = 457/623 (73.35%), Positives = 512/623 (82.18%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           M+F +VY CL ELFPEVDHR+LRAVA E+PKDVH A+NDVLTEV+P F G+  LP Q+P+
Sbjct: 1   MNFGSVYKCLVELFPEVDHRMLRAVALENPKDVHAAINDVLTEVLPCFLGESILPPQDPK 60

Query: 61  VEFASKTEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDT------------LNQSVSV 120
                  EVENKMQMDS  WV+R MD+ +S  I DEA D T            LNQSVS 
Sbjct: 61  -------EVENKMQMDSSTWVKREMDSLKSGTIGDEASDVTSQGSGVAHLDVALNQSVSE 120

Query: 121 NNYVSGDDCEQSREHTETTSLTVPAD-QEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQS 180
           N+YV+ DDC+QS E+TET SL   A  QEDRS+VELN VAPGK NGLI+ED+E+NDHEQS
Sbjct: 121 NSYVASDDCKQSCENTETASLEALASVQEDRSDVELNQVAPGKLNGLIYEDSEYNDHEQS 180

Query: 181 PQITKIWNQVTEDIEQNKTLLDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTAN 240
           PQITKI N VTEDI+QNKT  DGVNHS NI DL+ + VHE Y DDRP S  EN D QTAN
Sbjct: 181 PQITKILNLVTEDIKQNKTPFDGVNHSHNIHDLDGWHVHESYRDDRPFSADENCDSQTAN 240

Query: 241 LSL-------SVIHLHSESDHQKSNANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSG 300
            S        SVIHLHSESD+QKSNANGTSNPS KQECSTGEM  IEDGL+GHTI+TQSG
Sbjct: 241 PSFGNHSPVQSVIHLHSESDYQKSNANGTSNPSPKQECSTGEMIVIEDGLVGHTIVTQSG 300

Query: 301 QPCSIDLLDEIIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAK 360
           QPCSI+LL++ IEDAKSNKITLFSAMQ VI KMK LE  EK+VEKVKED+ N E EILAK
Sbjct: 301 QPCSIELLEKNIEDAKSNKITLFSAMQSVIDKMKALEHAEKYVEKVKEDSANGELEILAK 360

Query: 361 VEEMKQTVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHAT 420
           +EEMKQ VARTKEANDMHAGEVYGEKAILATETRELQSRLLSLS+ERD SLSILDEMHAT
Sbjct: 361 LEEMKQIVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSEERDKSLSILDEMHAT 420

Query: 421 LKSRMAAVEAMLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREF 480
           + +RM AVEA+LK +++E LAKEE ARKALAEQEALMEKV++ESRIL++EA EN+KLREF
Sbjct: 421 IGARMTAVEAVLKVIQEENLAKEECARKALAEQEALMEKVLEESRILEREAKENAKLREF 480

Query: 481 LIDRGQLVDVLQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATS 540
           LI  GQLVD+LQGEISVI QDVRHLKEKFDL+VPLSKSLSSSQTS ILASSGSSLKSA S
Sbjct: 481 LIHHGQLVDILQGEISVIVQDVRHLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAAS 540

Query: 541 DLARFGSPLTDIAAHLDADKGVSSNLGKEGSQASSV--SSSSLASNNLEEERSERNHFKS 600
           DLARF SPLTD A+HLDA+KG SSNL KEGS+ASS   S SS++SNNL+EERSERNH K+
Sbjct: 541 DLARFCSPLTDTASHLDAEKGSSSNLDKEGSRASSFISSKSSMSSNNLKEERSERNHLKA 600

Query: 601 SFSDDGWDVFDKDAEFAEASYKV 602
             SDDGWDVFDKDAEFA+A + V
Sbjct: 601 C-SDDGWDVFDKDAEFADAPHLV 615

BLAST of HG10022860 vs. TAIR 10
Match: AT4G02880.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G03290.2). )

HSP 1 Score: 286.6 bits (732), Expect = 4.7e-77
Identity = 243/612 (39.71%), Positives = 326/612 (53.27%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           M +  VY  L ELFP++D R+L+AVA EHPKDV+ A   V++E++P F       S  P 
Sbjct: 1   MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDVNEAAAVVVSEIVPFFYPNLADSSTQPE 60

Query: 61  VEFASK--TEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCE 120
            +      TEVEN ++ D    V  G +   S                            
Sbjct: 61  NKTPGNVPTEVENAVERDMPFSVLSGSEMGGSY----------------------SGSAS 120

Query: 121 QSREHTETTSLTVPADQEDRSEVELNHVAP---------GKSNGLIHEDNEHNDHEQSPQ 180
            + E+ ET +   P  +      +L HV P         GK  GL   D       + P 
Sbjct: 121 MAFEYHETRA---PVTESVSKRNQLTHVMPNVVVDIQRKGKI-GLSGSDESGVVSSEPPV 180

Query: 181 ITKIWNQVTEDIEQNKTLLDGVNHSRNICDLNS-FLVHELYHDDRPVSVYENSDG-QTAN 240
             +   + T D  Q        N +      +S   VH+L +    +++ +NS   Q   
Sbjct: 181 SCQAGAKSTGDDWQGVEFHSTGNQAEASTSADSEDAVHKLVYPADNLAITQNSHPLQIRF 240

Query: 241 LSLSVIHLHSESDHQKSNANG-TSNPSAKQECSTGEMTAIEDG---LMG--HTIITQSGQ 300
            S+ V++  S       N++   S  +   E S G + A E+G   L G   ++  +S Q
Sbjct: 241 GSIDVVNETSSGSLAVENSDAELSGSNLVDEISKGSL-ADENGDPELDGAVSSVGNRSTQ 300

Query: 301 PCSIDLLDEIIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKV 360
            C++  L++IIEDAKSNK TLF+ M+ +++ M+ +E  EK  EK KEDA     + L KV
Sbjct: 301 GCNMVHLEQIIEDAKSNKRTLFTVMESIMNLMREVELQEKEAEKAKEDASIGGFDTLDKV 360

Query: 361 EEMKQTVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATL 420
           EE+K+ +   KEANDM AGEVYGE++IL TE  EL++RL+SLS+ERD+SLS+LDEM   L
Sbjct: 361 EELKKMLEHAKEANDMAAGEVYGERSILTTEVNELENRLISLSEERDNSLSVLDEMRVDL 420

Query: 421 KSRMAAVEAMLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFL 480
           + R+A    +    E EK  KE  ARKA AEQEA+ME+VVQES++LQQEA ENSKLREFL
Sbjct: 421 EIRLATALGIKNAAEQEKQEKEGSARKAFAEQEAIMERVVQESKLLQQEAEENSKLREFL 480

Query: 481 IDRGQLVDVLQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSD 540
           +D G++VD LQGEISVICQD+RHLKEKFD  VPLS+S+SSSQTS  LASS SS+KS  ++
Sbjct: 481 MDHGRIVDSLQGEISVICQDIRHLKEKFDNRVPLSQSISSSQTSCKLASSASSMKSLLTE 540

Query: 541 LARFGSPLTDIAAHLDADKGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFS 594
                 PL                   E  +ASS + S  AS N   ER E         
Sbjct: 541 -----KPL---------------EASYETPEASSNNKSPKASVN---ERKE-------LL 555

BLAST of HG10022860 vs. TAIR 10
Match: AT4G02880.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G03290.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 286.2 bits (731), Expect = 6.1e-77
Identity = 239/603 (39.64%), Positives = 323/603 (53.57%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGKFKLPSQNPR 60
           M +  VY  L ELFP++D R+L+AVA EHPKDV+ A   V++E++P F       S  P 
Sbjct: 1   MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDVNEAAAVVVSEIVPFFYPNLADSSTQPE 60

Query: 61  VEFASK--TEVENKMQMDSLRWVQRGMDTSESVIIVDEARDDTLNQSVSVNNYVSGDDCE 120
            +      TEVE  M    L   + G   S S  +  E  +     + SV+         
Sbjct: 61  NKTPGNVPTEVERDMPFSVLSGSEMGGSYSGSASMAFEYHETRAPVTESVS--------- 120

Query: 121 QSREHTETTSLTVPADQEDRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQITKIWNQVT 180
             R         V  D + + ++           GL   D       + P   +   + T
Sbjct: 121 -KRNQLTHVMPNVVVDIQRKGKI-----------GLSGSDESGVVSSEPPVSCQAGAKST 180

Query: 181 EDIEQNKTLLDGVNHSRNICDLNS-FLVHELYHDDRPVSVYENSDG-QTANLSLSVIHLH 240
            D  Q        N +      +S   VH+L +    +++ +NS   Q    S+ V++  
Sbjct: 181 GDDWQGVEFHSTGNQAEASTSADSEDAVHKLVYPADNLAITQNSHPLQIRFGSIDVVNET 240

Query: 241 SESDHQKSNANG-TSNPSAKQECSTGEMTAIEDG---LMG--HTIITQSGQPCSIDLLDE 300
           S       N++   S  +   E S G + A E+G   L G   ++  +S Q C++  L++
Sbjct: 241 SSGSLAVENSDAELSGSNLVDEISKGSL-ADENGDPELDGAVSSVGNRSTQGCNMVHLEQ 300

Query: 301 IIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQTVAR 360
           IIEDAKSNK TLF+ M+ +++ M+ +E  EK  EK KEDA     + L KVEE+K+ +  
Sbjct: 301 IIEDAKSNKRTLFTVMESIMNLMREVELQEKEAEKAKEDASIGGFDTLDKVEELKKMLEH 360

Query: 361 TKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMAAVEA 420
            KEANDM AGEVYGE++IL TE  EL++RL+SLS+ERD+SLS+LDEM   L+ R+A    
Sbjct: 361 AKEANDMAAGEVYGERSILTTEVNELENRLISLSEERDNSLSVLDEMRVDLEIRLATALG 420

Query: 421 MLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQLVDV 480
           +    E EK  KE  ARKA AEQEA+ME+VVQES++LQQEA ENSKLREFL+D G++VD 
Sbjct: 421 IKNAAEQEKQEKEGSARKAFAEQEAIMERVVQESKLLQQEAEENSKLREFLMDHGRIVDS 480

Query: 481 LQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFGSPLT 540
           LQGEISVICQD+RHLKEKFD  VPLS+S+SSSQTS  LASS SS+KS  ++      PL 
Sbjct: 481 LQGEISVICQDIRHLKEKFDNRVPLSQSISSSQTSCKLASSASSMKSLLTE-----KPL- 540

Query: 541 DIAAHLDADKGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWDVFDK 594
                             E  +ASS + S  AS N   ER E         DDGWD FDK
Sbjct: 541 --------------EASYETPEASSNNKSPKASVN---ERKE-------LLDDGWDFFDK 551

BLAST of HG10022860 vs. TAIR 10
Match: AT1G03290.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02880.2); Has 13587 Blast hits to 10183 proteins in 1114 species: Archae - 257; Bacteria - 2402; Metazoa - 5637; Fungi - 960; Plants - 675; Viruses - 54; Other Eukaryotes - 3602 (source: NCBI BLink). )

HSP 1 Score: 268.1 bits (684), Expect = 1.7e-71
Identity = 236/607 (38.88%), Positives = 328/607 (54.04%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGK-FKLPSQNP 60
           M F +VY  L E+FP++D RILRAVA EHPKD   A   VL+E+IP F    F   +Q+ 
Sbjct: 1   MGFGSVYRSLTEIFPQIDARILRAVAIEHPKDADEAAAVVLSEIIPSFSSNLFHNFTQSS 60

Query: 61  RVEFASKTEVENKMQMDSLRWVQR---GMDTSESVIIVDEARDDTLNQSVSVNNYVSGDD 120
                S +E E +  ++ +    R   G   S++      +  +TL   V+ ++      
Sbjct: 61  YKSSGSISEREVEHGLEDVASRCRPFLGASGSKASTSSSSSSSETLPLVVTRDHNTRALS 120

Query: 121 CEQSREHTETTSLTVPADQE------DRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQI 180
            +      E T+L    D +      +  E++    A GK NG         D   +   
Sbjct: 121 TDLVSNMNELTTLQPNVDPDVCHKDLESEEIQSVKKARGKENGNYDLFGRCFDVTSN--- 180

Query: 181 TKIWNQVTE-DIEQNKTL--LDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTAN 240
            KI   V E DI    +L  LD V  + N  +   F +     ++    + +++ G T  
Sbjct: 181 AKIGLDVPEDDIASVVSLFSLDNVKLASNFWEDLGFDITWNQAENAVSKLVDSTPGDT-- 240

Query: 241 LSLSVIHLHSESDHQKSN-ANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSID 300
           ++ +      E  H  +N  + TSN S   E   G+ T I D     T +      CS+D
Sbjct: 241 MTTTQQGSCFEVGHGSTNLVDETSNRSLFSE--NGD-TEIGDAFSTSTHV------CSVD 300

Query: 301 LLDEIIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQ 360
            L++IIEDAKSNK  L + M+ V + M+ +E  EK  EK KE+A     + L KVEE+K+
Sbjct: 301 QLEDIIEDAKSNKKNLLTEMETVTNIMREVELKEKDAEKSKEEAARGGLDTLQKVEELKK 360

Query: 361 TVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMA 420
            +   KEANDMHAGEVYGEK+ILATE +EL++RLL+LS+ER+ SL+ILDEM  +L+ R+A
Sbjct: 361 MLEHAKEANDMHAGEVYGEKSILATEVKELENRLLNLSEERNKSLAILDEMRGSLEIRLA 420

Query: 421 AVEAMLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQ 480
           A   + KT E EK  KE+ A KALAEQEA MEKVVQES++LQQEA ENSKLR+FL+DRGQ
Sbjct: 421 AALELKKTAEKEKKDKEDSALKALAEQEANMEKVVQESKLLQQEAEENSKLRDFLMDRGQ 480

Query: 481 LVDVLQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFG 540
           +VD LQGEISVICQDV+ LKEKF+  VPL+KS+SSS TS    S GSS+KS   +     
Sbjct: 481 IVDTLQGEISVICQDVKLLKEKFENRVPLTKSISSSFTS----SCGSSMKSLVLE----- 540

Query: 541 SPLTDIAAHLDADKGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWD 594
                           S  L      +++      A+  + +E+ +         +DGWD
Sbjct: 541 --------------NPSERLNGVTETSNNNKFPEAAAFFMNKEKDDCR----DLLEDGWD 566

BLAST of HG10022860 vs. TAIR 10
Match: AT1G03290.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02880.2). )

HSP 1 Score: 268.1 bits (684), Expect = 1.7e-71
Identity = 236/607 (38.88%), Positives = 328/607 (54.04%), Query Frame = 0

Query: 1   MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIPRFCGK-FKLPSQNP 60
           M F +VY  L E+FP++D RILRAVA EHPKD   A   VL+E+IP F    F   +Q+ 
Sbjct: 1   MGFGSVYRSLTEIFPQIDARILRAVAIEHPKDADEAAAVVLSEIIPSFSSNLFHNFTQSS 60

Query: 61  RVEFASKTEVENKMQMDSLRWVQR---GMDTSESVIIVDEARDDTLNQSVSVNNYVSGDD 120
                S +E E +  ++ +    R   G   S++      +  +TL   V+ ++      
Sbjct: 61  YKSSGSISEREVEHGLEDVASRCRPFLGASGSKASTSSSSSSSETLPLVVTRDHNTRALS 120

Query: 121 CEQSREHTETTSLTVPADQE------DRSEVELNHVAPGKSNGLIHEDNEHNDHEQSPQI 180
            +      E T+L    D +      +  E++    A GK NG         D   +   
Sbjct: 121 TDLVSNMNELTTLQPNVDPDVCHKDLESEEIQSVKKARGKENGNYDLFGRCFDVTSN--- 180

Query: 181 TKIWNQVTE-DIEQNKTL--LDGVNHSRNICDLNSFLVHELYHDDRPVSVYENSDGQTAN 240
            KI   V E DI    +L  LD V  + N  +   F +     ++    + +++ G T  
Sbjct: 181 AKIGLDVPEDDIASVVSLFSLDNVKLASNFWEDLGFDITWNQAENAVSKLVDSTPGDT-- 240

Query: 241 LSLSVIHLHSESDHQKSN-ANGTSNPSAKQECSTGEMTAIEDGLMGHTIITQSGQPCSID 300
           ++ +      E  H  +N  + TSN S   E   G+ T I D     T +      CS+D
Sbjct: 241 MTTTQQGSCFEVGHGSTNLVDETSNRSLFSE--NGD-TEIGDAFSTSTHV------CSVD 300

Query: 301 LLDEIIEDAKSNKITLFSAMQLVISKMKVLEDMEKHVEKVKEDAVNRESEILAKVEEMKQ 360
            L++IIEDAKSNK  L + M+ V + M+ +E  EK  EK KE+A     + L KVEE+K+
Sbjct: 301 QLEDIIEDAKSNKKNLLTEMETVTNIMREVELKEKDAEKSKEEAARGGLDTLQKVEELKK 360

Query: 361 TVARTKEANDMHAGEVYGEKAILATETRELQSRLLSLSDERDSSLSILDEMHATLKSRMA 420
            +   KEANDMHAGEVYGEK+ILATE +EL++RLL+LS+ER+ SL+ILDEM  +L+ R+A
Sbjct: 361 MLEHAKEANDMHAGEVYGEKSILATEVKELENRLLNLSEERNKSLAILDEMRGSLEIRLA 420

Query: 421 AVEAMLKTLEDEKLAKEEHARKALAEQEALMEKVVQESRILQQEANENSKLREFLIDRGQ 480
           A   + KT E EK  KE+ A KALAEQEA MEKVVQES++LQQEA ENSKLR+FL+DRGQ
Sbjct: 421 AALELKKTAEKEKKDKEDSALKALAEQEANMEKVVQESKLLQQEAEENSKLRDFLMDRGQ 480

Query: 481 LVDVLQGEISVICQDVRHLKEKFDLEVPLSKSLSSSQTSFILASSGSSLKSATSDLARFG 540
           +VD LQGEISVICQDV+ LKEKF+  VPL+KS+SSS TS    S GSS+KS   +     
Sbjct: 481 IVDTLQGEISVICQDVKLLKEKFENRVPLTKSISSSFTS----SCGSSMKSLVLE----- 540

Query: 541 SPLTDIAAHLDADKGVSSNLGKEGSQASSVSSSSLASNNLEEERSERNHFKSSFSDDGWD 594
                           S  L      +++      A+  + +E+ +         +DGWD
Sbjct: 541 --------------NPSERLNGVTETSNNNKFPEAAAFFMNKEKDDCR----DLLEDGWD 566

BLAST of HG10022860 vs. TAIR 10
Match: AT5G64980.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G02880.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 47.8 bits (112), Expect = 3.7e-05
Identity = 22/46 (47.83%), Positives = 30/46 (65.22%), Query Frame = 0

Query: 1  MDFDNVYDCLKELFPEVDHRILRAVAREHPKDVHLAVNDVLTEVIP 47
          M F +VY  L ELFP++D +ILR VA EH  D   A + V++E+ P
Sbjct: 1  MGFRSVYQSLTELFPQIDPKILRGVAIEHQHDADEAASVVISEIFP 46
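
The percentages on each HSP statistics line follow directly from the residue counts (for example, 22 identical residues over 46 aligned positions gives 47.83%). A minimal sketch of that arithmetic, assuming the line layout shown above; `parse_hsp_stats` is a hypothetical helper written for illustration and is not part of the database or of any BLAST tool:

```python
import re

# Hypothetical pattern for a statistics line of the form
# "Identity = a/b (p%), Positives = c/b (q%), ...".
# The optional "i" tolerates a spelling variant of the "Positives" label.
STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return the counts from one HSP statistics line and recompute the percentages."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": ident_len,
        # Percentages are counts divided by the alignment length, to two decimals.
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positives / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 22/46 (47.83%), Positives = 30/46 (65.22%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the percentages from the counts, rather than trusting the printed values, is one way to sanity-check a record that has been copied out of a web page.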

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_008451385.1  2.5e-237  78.57     PREDICTED: uncharacterized protein LOC103492692 isoform X1 [Cucumis melo] >XP_00...
XP_008451387.1  4.7e-236  78.41     PREDICTED: uncharacterized protein LOC103492692 isoform X2 [Cucumis melo] >XP_00...
KAA0057604.1    4.7e-236  78.41     golgin candidate 5 [Cucumis melo var. makuwa] >TYK20989.1 golgin candidate 5 [Cu...
XP_022992500.1  1.1e-229  74.00     uncharacterized protein LOC111488818 isoform X1 [Cucurbita maxima]
XP_023548577.1  8.0e-228  73.52     uncharacterized protein LOC111807200 [Cucurbita pepo subsp. pepo]

Match Name      E-value   Identity  Description

Match Name      E-value   Identity  Description
A0A1S3BQS2      1.2e-237  78.57     uncharacterized protein LOC103492692 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3BRF4      2.3e-236  78.41     uncharacterized protein LOC103492692 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7UR71      2.3e-236  78.41     Golgin candidate 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold328G0...
A0A6J1JVV4      5.4e-230  74.00     uncharacterized protein LOC111488818 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1GNI7      5.6e-227  73.35     uncharacterized protein LOC111456040 isoform X1 OS=Cucurbita moschata OX=3662 GN...

Match Name      E-value   Identity  Description
AT4G02880.2     4.7e-77   39.71     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G02880.1     6.1e-77   39.64     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G03290.1     1.7e-71   38.88     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G03290.2     1.7e-71   38.88     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G64980.1     3.7e-05   47.83     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                 Source       Source Term      Source Description                         Alignment
None       No IPR available                COILS        Coil             Coil                                       coord: 398..450
None       No IPR available                COILS        Coil             Coil                                       coord: 316..336
None       No IPR available                MOBIDB_LITE  mobidb-lite      disorder_prediction                        coord: 545..565
None       No IPR available                MOBIDB_LITE  mobidb-lite      disorder_prediction                        coord: 242..259
None       No IPR available                MOBIDB_LITE  mobidb-lite      disorder_prediction                        coord: 238..259
None       No IPR available                MOBIDB_LITE  mobidb-lite      disorder_prediction                        coord: 543..586
None       No IPR available                MOBIDB_LITE  mobidb-lite      disorder_prediction                        coord: 113..168
None       No IPR available                MOBIDB_LITE  mobidb-lite      disorder_prediction                        coord: 566..586
None       No IPR available                PANTHER      PTHR16223        TRANSCRIPTION FACTOR BHLH83-RELATED        coord: 4..594
None       No IPR available                PANTHER      PTHR16223:SF163  ELKS/RAB6-INTERACTING/CAST FAMILY PROTEIN  coord: 4..594
None       No IPR available                CDD          cd14279          CUE                                        coord: 10..41; e-value: 0.00919216; score: 32.4402
IPR003892  Ubiquitin system component CUE  PROSITE      PS51140          CUE                                        coord: 2..45; score: 11.059128

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10022860.1  HG10022860.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006357      regulation of transcription by RNA polymerase II
cellular_component  GO:0005634      nucleus
molecular_function  GO:0000981      DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function  GO:0000978      RNA polymerase II cis-regulatory region sequence-specific DNA binding
molecular_function  GO:0043130      ubiquitin binding