HG10008451 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10008451
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioncarbon catabolite repressor protein 4 homolog 5
LocationChr10: 23257040 .. 23264655 (+)
RNA-Seq ExpressionHG10008451
SyntenyHG10008451
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACATTCGCGCCAAAACAGAGAAGAATAAGCGCAAACCCTCGACGAACGCCGCACACCGCGCTCGCAACGATCACCGGAAGAAGCGGCGGAGATTAGCATTCAGTTCAGAAACCACAATCCCAACACCTAGCCATCCTCAAAAGCTTGCCGAATCGAATAGCTTCAAGTCAATTCGTTCTTCCCCTCGAACTTCACGAAAGCACGAAAAAGGAAGGTCGAGTCAAACAGATGGTCATCGTCGATGGGTGTACTCTGCTCGTGATTGCTCGAGATTTATAGGTAATGGATTTTAATCAGTTCTTTGAGAATTTTCTCAAAGTGTTAGTAGTTACTCTAGAATTAATTGTAAGGCTCTTGCTGGATGATGGCTGTTATGCCATAATGTTCTAGAGTTGAATGTAGGTAGAGGGCCTCATGTTCTATGCTCTTTATTGAAGCCAATTTATCCTCCCATGCTTAATTTTTTGGATAGTTGCGTGCTATGAGCCTTACCTTTAATCCGGCTCTTGATGTTCGAATTACCTTAATTGGAAGAATTAGGGTATTCCAAATACTTCCTGCTTTTGATGTCCTTCAGTCTTGGTAGATATATTGAGAGCTGCAACTTAATCTAGTCCAATGTTTGTTTTGATTACTCTGGAGAAACTTGGAAGTTGTTCTCAGTTTAGATTCTGGTACCTGGTTCCTTTCTTTAGTTAACAGGCCTTGAAACTCAGAAACCTTCTATCAAATCAATATGGTTAGATTTTTTTTTTTTTCTCGGGAGGATTTTCAAAAGGCACTTGATGGTAACATTCTTCCTACCCCATTTCCCGTTAAAGAACTGAAGGAAAGGGGAGGGAAACAAGTGGTATTTCCATTTGTGCGGGTATTGCCATTGCATACTTAACAATTTATTGCCTTTATTTCAGATAAGATTATGGTTGCTTCATATAACATACTAGGAGTGGAAAATGCATTGAAGCATCCAGATTTGTATCATAGAGTGCCTTCCAAATTCTTGGATTGGAGTTTCCGGAAAGAGCTTATATGCAATGCAATTAAATTTTACAATGCAGGCATCTTATGCTTGCAGGTAATTTGTTAGACCTCCAATTTATTTTTTTATGTTTTAACTTTTAAGAATCAGATACGAGATGAAGTCTGTATACTGCTGTCACATTATGAGTTTTTATCCTCTTTGTCCTTCCTTATAAACGGTTTAATCCTCATTTCCTGTCCTTCTCACGTGTTCCTTAATTTCATTTAAGGGGCACAGGTATATCTTTGCTCATTAAAATTTCATAGCTTAATGTCATTGTGCAAAATTTAAACGAACTTTAGTTACCTTTTGATGTCTTTTGTTGTTCTGCAACTACTATAGTTGCTCATAGAAATGGGAAATAAGAGAAACATAACATAACAAGAAATAGATTTCGCCTACAAAAATAACATTCTGTCTTTCAACTGTCGTTCTTCTTTGTACATTTCACCATAATTTTCATTTTCCTACTAGACTGAAATTGCAAAACTACTGGAGCCTGGAGGTATAAGCTTGTTTCTGATGTAGGATTAGTAGTTAAAGAAAAACTAGATATCAAGAGTTTTGAGACTTGGCAAGCTAATGGCCTACTTTAAATGATTCTAAGCTTTACTCTAGGCTTTTTGGTGCTCCGAAGGCATAAAATACAAATGAAAACAGCCTGAGTTTGCTTACCAACTTATGGCCAATACTTTATGCACCAAATAGAAGACGTGAGAAATTAGGAAAGGTCAAGTCGCTGTTTTAGGGGGAGGGACATCTAGTGCCCCATTCTTATTCTCCTAGTCACTAAATAATATTGTTTCAAGTTCCCTTGTCTTTGAGGTCTTACTGTTCATCAAATCTCTTTTAACCATGGCATTTTATAGGTGCTAAGAATGATTTTTTACTCATGTAAGTGGAACTATATATCTGCAGGAGGTTGACCGTTTTAATGATTTAGATGAACTTTTCCAAAATTATGGCTACAAAGGTGTTTACAAGGTGTGTTTGAGATATTTTGCATAAGTTAGTAGCCAATAATCTACTTATGGATTATTTGTACTACATCAAGAAAACTTAGTGGCGATCTTTTTATAGGCTAGAACTGGTGAAGCAAATGATGGATGTGCTGTATTTTGGATCGACAAACTGTAAGGTCTCTCATCCTTCCAGATATCTTTGAGTTGTAGTTGTTGATTTATGAGATCACAGAATAGATTAATGATCTCCTGAATTGTTTGAAAATGACTCAGTACCTTGTAACTCTGCTGATATGTATGACTGTATGAGCACTTGGGCTTGAAATGTCAGAATTGAAAAGTTTTCCTATTATAAGCATACTGGACATTTAGATTTGTTGCAAATCGGATTCTCGTTTTCTATTAGGCCGCCCCTTACTATCTATATCTATGTGTATAGTAATTCAGCACATTTACAGGCCTTTATTGTTGTTTTACTTCTTATGTACTTTTCATATTTTCTATAGTGGTTCTCGCATTTTGCTGCTTGTTAGTTGAAAATGTTCTGGTTACAGATTTGCCCTTTTGCATCAAGAAACTATAGAGTTCCAGAGTTATGGGCTACGTAACAATGTTGCTCAACTATGTGTTTTGAAGGTATGAAAGAATTCATTTATATCATTCTGCTATATTTGTTATTCATTTGGGAAGTCAAGTTACCGTGCAAAACTAAAACTTCAATCGTTTTCGTGTATTCTTCCCTTTAATTGTTCATCATTTACATATCCATTAGATTCTTTTAATGGCTAGGTTGGTTTTTTTTAGGGGAGGGAATTTTTTTGGGAAGGCCTACTTTCTGTAGTTGGTCTCTGTCTCCTTAGAACTCGACAACAAAGTTGTACAACTTATAAAACATATCTTGTTCAAGCAAGGTCGTAACATTTGTTTGCTCTCAAAGTTGTAAGGAGTCATTTGAGGAGCTTATCCACCACCTGTCTGTTCGCGATAAAGGGCAGTTTTTGTGGTAGGCTGGTGCATGTGCTATTTTGTGGGAGATTTGGGGCAAGAGAAACAACAAAATATTTAGAGGGCATGAGAGTTTGTCTATCGATATTTGGTTCTTTGTTAGATTCTTTGTTTCTCTTTGGGTATCAGTTTCTAGTTTAGTTTTAATCTTTTTTTCTAAATTTTTTAAATCTTATCTCTTGATTGGATTTCTTTTCTATAGTTTGGCTCCTTTTTATGGATTTTTTCCTCATATTGTCTTTCATCTTCCTCTCAATGAAAGTCAAGTTTTCTCATTTAAAAAAGAAAAGAAAAAATAACTGTAGTTGGATGGGTAGATATATAATGTATTTCTTATTATTGTTAATTTAGTAGTGGTTCACATTTCATTTTCTTCTTAATTTGCAGCTACCTCCACTTTGAATGGAATTGTTATAAAGAAATATTGCTGCCTTTTATTTTTTATCTTTTATTTGTATGTTATTTGTATGGCATATTTCTGTATAATTGATCGTTGTATGTGCTTCAAATGATTTATTGTCAATCTATGGTTTTTGTAGAACATTTTACATTTTCTCTTCACTTTAAATTGACAGATGAATAAATCAAAATCGAAGTCCAAAACAAGGTACATTCTGCATTTCTGTAAATTTGTTTTTCACTTGTATGATATGAGTCAAATTTCTTCGTTGACCTCATTATTGAAAATGATAAACAAATTTCATGGATATGGTAAAAAAAACATTACATGCAAGGTTTTCTTGCCTTTGATTTTGCATAAATTATTTTGAGGAATAGTATCTAAAGTATTAGGGATATTATTTTCACTGCCTCTGCATGTATGTCATGTTTAGCAATTAAATAATACTCGATTAAAACTTGTTGTATTTTAGTAGTCTTGCTACTTGATCTTTCTTTAACTTCTATGCACGTGTTTATTAACTGTAGTCGAAGCTTTGTGATTGGGAATATACACGTTCTTTTCAACCCAAATCGTGGAGACATTAAGCTCGGGCAGGTATCGACTACTGTGATTCTTTGAATGTTTGTAGTTTTTGTTGCATAGTTAATAGTTTTATGTGATTTATGCCAACTATGTTTCTGTTTGAATCTTCTCGAATATCGGCTTAGTATTAGTTGCAGTCTCTTGGCTATTTCCCTCTCTTTTCTTTTTTCTTTTTTTTTTTTTCTCCTTTTTTTGGTATGGTTAAGTTAGGTAGTTATAAAATATTTTTTTATCTTCTATTTGAATTGTCCTTTAAGAAAAAAATGTTAGACATCTCAGTGCATTAATTTTGTTGGTATAGGTCAGACTGTTTCTTGAAAAAGCTCATAGTCTCTCCCAAAGATGGGGAAACATACCTGTTATTATTGCTGGGGATCTGAATAGCATACCAAAAGTAATTCTCCCTCGTTGGAGAGGATTCCTCCTAGCACATCAGCAACTATACTTACCCAATTACCCTGGCTAGCAATTTCACCATATTAGCAATTGATCTTGTTTTGTGTCTATCTTTTCATCTTTTTTTTTTTTTTTTTTTCAAATGATCAGAGTGCAATGTATCAGTTTTTGGCTTCATCAGATGTCAGTTTATTTTTAATTGATAATTTTCTATAATCTATTATCTTTTTTGGAGTGTTGAACACGATTTGCTTGAATTCCTCTGTTTTTTGTTTCCATGATTGGAATTTCACTTCAAGTTCTTTTGGTGGTTGAGCAGCTAGATATACAACTGCATGATCGCAGAAAGATTTCAGGGCAGCTTGATTTTTCATCATCACATGCAGCCTTCAGATTTTGTCGTGCGGGCACAAAATGGTATGACTAAACTTCATTGTAATATCTTTCTTCTTCTTCTTTTTTTATTTATTTTAAATCTCTTTTCTCAATCAATTATAGCTGATATTTCTATCTTGAGTAGCTACTAAGCATTGCATGTTTTCAGTACTATCACAATGTCCATAATATTTATGAGCAGTTCAATCCTAATTTATAAATAATACTAATATATTTAAGATTACTAAAAATATTTATAAATAAATTAAAATAAGTACCTCTTTTATTTTATTTTATTTCATCCGAAAAGTCCTTTCTTTTTAGGTGTCTATGTTATTTGGACCATATCGCAGATATGAATAGTGGTTAATATTGTGTGCCTTTTCTTTGATGTATTATATCATTTGTTGGCTAGTTCCAATGTTTCAGCCTCAAGGTCCTTCAGATGGAGCGATGAGGAAATAAGGATTGCATCTGGTAGCGAGCATGTTACCTGTCTTCAGCACCATTTAAAGCTTTCCAGTGCGTACTATGGGGTTCCTGTAAGATGTTTATCTTCTTGAATTCTCATTATTTATCGTGTAGACTTAGTGCATATCATATCGTATAAGTTAAAATGGAGAAAGAGGTGATTGATTCACATCGTGTTTGAATGAAATTTCATTGACTGGTATGTTTTAAAAAAGCAAATCACCAATGTAGACTTGGAATTTACTAATTTAAATATTAAAAAAGAATCACCCCAAGGCCCAAGGGAATCTTGAAAGAAGTTATACAGATGATACTACAATGTGTTGGAGGTAACGAGTTATTAGGAAATTGTTAGCCAGTGGAGATGGGTTCCAATTACTGTAAGGGGCTGTTTGGGGGGCTAGTAATGGAACGAGTTTGGATTATAATGTAATCCACAATCCATGTTTGGATTGAGCGGTTTGAGCCCAATTTGTAATACTAAACTCATTTCGTTCCGACAGTTTTTAACCAAACCCTCTTTTCTAATGTTTCATGTTCACGATCTTATTTCCGTTCTGCCCTCGATACCTCACGCATTCCAACACTCCTCTGATTCCCAGTAATCTTGATTACATTCCAACTTCCCAAACAACTCCTAAATGGTAGGAGACAGACACAACTTTATGAGTGAAAGGAACTTGAGAGGAAAGAGGCTGAAAAGTCAAAAGTATAAAAGATAGGTTAAAAAAGTGGAGTATTAGTTATGTCAAGCTTTGTAATAAAGCAAGATGCTTTGGCTGCAATGAATTCTAATACTTTGAAGTGTGACCTAAAAATAATATAACTTTTTTCTATATGATAAAGGAGAGCGGCATAGTTTAAATTCTTTATTTAAAGTATTTCCCTTTTTTTTAAAAAAAAAATGTGGTATATGGTTTATATATAATAATAAAGGGTCATATTTAAGATTTTTTTTTCCCTGTTTTCAACTCACACGTTTTAACAACATTGATTTGACTTTTTCTCGTTTTTGAAATAGAGGGTAAGAGTTATTTGCTGCTATTTAGTTTCGTTTACTCTCTCTCATCCAAGTCGGTTAAATTTTCAGGGAAGTTGTAAAACAAGAGATACTAATGGAGAACCTTTAGTAACTTCATTCCACTCCAAGTTTATGGGAACTGTTGATTATATATGGTATCCGATTTACTGTTGAAAGCAAATTTTGTGTTGCTTGGGCCTTGAGAACTTTGGTTACTAGCTTTGATTATTGTGGCAGGCACTCGGAAAAACTTGCCCCTGTTAGAGTTTTGGAAACATTGCCTGTTGATGCATTAAACAGGACTGGAGGACTTCCAAATGAGGTACTTATGAAATGTAACCATTAATTTTTTTAAAATATGTTTAAGTTGCGTTTTGTTTTTATCAGGAAAACTCTTCATTTGCCTCGGGATATCTTACCGTTATTGAAGTTTCGAAATGTCTACCTATTTTTCTTCTCTTTTCCTCCAATTTGTTATAAATCCTTAAAGAGATCGATGAGTTAACTAATAATCTTATAGAACCACGGTGGTGATGTAAGACAATTAATTAGATATAAAAATGAATCCTTTTAACATTGTACAGCTGGGTGTGGGTGGAATCTTAGTTTCGGAGAATAACGTAAAGCCATGTTTAGGAACGGGTCAGGCTTACTCAAAACATTGATACTTTCCTGATGGGGAAGATTTCAAGAGACGGGTTCTTTATGGTTCTGGCTATTGGCTATTCTCCCTTCAAAGTTTAAACTCTCTTCTTGCTGTCAATTAAAGAGTCATAAAATCATCCCTAAGAAACTCGTGGGAGAATTTGGGACGCAAAGCCCAAAAAGAATCAAGGACTTCCTTTGGTCTTATCCCCTAAAATACATCAAAAGTTATTGGTACGACCAAGAATAGAAATCCTCATTGTTTAATACAAAATCTCTTCTCCTTTTTTAAAAATATGCCCATCCATCTGTAACAAACCCACATCAACAATGCTGGATTAATCTTTTCATATTATAAAGAAGAAAAAACAGGATTCTTTTTTTCTCTCTATCAAAGTCTGTTATGTTATCTTTGCAGAAATGGGGGAGTGATCATCTTGCTCTTGTGTGTGAACTAGCCTTTGATGACGATGGAAATATGACTTGACTGACTATGTAATCTCAACAGATTGAATCATCTCCTTAACTTTTAGTTTAATGGAGGTTGTTTCTTCACCCTTTGCATGTGAAATTTGAGTACTTGAGTAGATTAGCTGTATTATTGTTGGGGGACATCTTTGAAGCTGCTACTTGTTTGGCAGGTAATGGTAAAAGACAGTAAAGAGTACTGTGGGAAAAAGCAGGAGCTGGCGTTGCCTGGTTTAGCTGTTACCCAGTCATTTCCAAATTGA

mRNA sequence

ATGGACATTCGCGCCAAAACAGAGAAGAATAAGCGCAAACCCTCGACGAACGCCGCACACCGCGCTCGCAACGATCACCGGAAGAAGCGGCGGAGATTAGCATTCAGTTCAGAAACCACAATCCCAACACCTAGCCATCCTCAAAAGCTTGCCGAATCGAATAGCTTCAAGTCAATTCGTTCTTCCCCTCGAACTTCACGAAAGCACGAAAAAGGAAGGTCGAGTCAAACAGATGGTCATCGTCGATGGGTGTACTCTGCTCGTGATTGCTCGAGATTTATAGATAAGATTATGGTTGCTTCATATAACATACTAGGAGTGGAAAATGCATTGAAGCATCCAGATTTGTATCATAGAGTGCCTTCCAAATTCTTGGATTGGAGTTTCCGGAAAGAGCTTATATGCAATGCAATTAAATTTTACAATGCAGGCATCTTATGCTTGCAGGAGGTTGACCGTTTTAATGATTTAGATGAACTTTTCCAAAATTATGGCTACAAAGGTGTTTACAAGGCTAGAACTGGTGAAGCAAATGATGGATGTGCTGTATTTTGGATCGACAAACTATTTGCCCTTTTGCATCAAGAAACTATAGAGTTCCAGAGTTATGGGCTACGTAACAATGTTGCTCAACTATGTGTTTTGAAGGTCAGACTGTTTCTTGAAAAAGCTCATAGTCTCTCCCAAAGATGGGGAAACATACCTGTTATTATTGCTGGGGATCTGAATAGCATACCAAAACTAGATATACAACTGCATGATCGCAGAAAGATTTCAGGGCAGCTTGATTTTTCATCATCACATGCAGCCTTCAGATTTTGTCGTGCGGGCACAAAATGTTCCAATGTTTCAGCCTCAAGGTCCTTCAGATGGAGCGATGAGGAAATAAGGATTGCATCTGGTAGCGAGCATGTTACCTGTCTTCAGCACCATTTAAAGCTTTCCAGTGCGTACTATGGGGTTCCTGGAAGTTGTAAAACAAGAGATACTAATGGAGAACCTTTAGTAACTTCATTCCACTCCAAGTTTATGGGAACTGTTGATTATATATGGCACTCGGAAAAACTTGCCCCTGTTAGAGTTTTGGAAACATTGCCTGTTGATGCATTAAACAGGACTGGAGGACTTCCAAATGAGGTAATGGTAAAAGACAGTAAAGAGTACTGTGGGAAAAAGCAGGAGCTGGCGTTGCCTGGTTTAGCTGTTACCCAGTCATTTCCAAATTGA

Coding sequence (CDS)

ATGGACATTCGCGCCAAAACAGAGAAGAATAAGCGCAAACCCTCGACGAACGCCGCACACCGCGCTCGCAACGATCACCGGAAGAAGCGGCGGAGATTAGCATTCAGTTCAGAAACCACAATCCCAACACCTAGCCATCCTCAAAAGCTTGCCGAATCGAATAGCTTCAAGTCAATTCGTTCTTCCCCTCGAACTTCACGAAAGCACGAAAAAGGAAGGTCGAGTCAAACAGATGGTCATCGTCGATGGGTGTACTCTGCTCGTGATTGCTCGAGATTTATAGATAAGATTATGGTTGCTTCATATAACATACTAGGAGTGGAAAATGCATTGAAGCATCCAGATTTGTATCATAGAGTGCCTTCCAAATTCTTGGATTGGAGTTTCCGGAAAGAGCTTATATGCAATGCAATTAAATTTTACAATGCAGGCATCTTATGCTTGCAGGAGGTTGACCGTTTTAATGATTTAGATGAACTTTTCCAAAATTATGGCTACAAAGGTGTTTACAAGGCTAGAACTGGTGAAGCAAATGATGGATGTGCTGTATTTTGGATCGACAAACTATTTGCCCTTTTGCATCAAGAAACTATAGAGTTCCAGAGTTATGGGCTACGTAACAATGTTGCTCAACTATGTGTTTTGAAGGTCAGACTGTTTCTTGAAAAAGCTCATAGTCTCTCCCAAAGATGGGGAAACATACCTGTTATTATTGCTGGGGATCTGAATAGCATACCAAAACTAGATATACAACTGCATGATCGCAGAAAGATTTCAGGGCAGCTTGATTTTTCATCATCACATGCAGCCTTCAGATTTTGTCGTGCGGGCACAAAATGTTCCAATGTTTCAGCCTCAAGGTCCTTCAGATGGAGCGATGAGGAAATAAGGATTGCATCTGGTAGCGAGCATGTTACCTGTCTTCAGCACCATTTAAAGCTTTCCAGTGCGTACTATGGGGTTCCTGGAAGTTGTAAAACAAGAGATACTAATGGAGAACCTTTAGTAACTTCATTCCACTCCAAGTTTATGGGAACTGTTGATTATATATGGCACTCGGAAAAACTTGCCCCTGTTAGAGTTTTGGAAACATTGCCTGTTGATGCATTAAACAGGACTGGAGGACTTCCAAATGAGGTAATGGTAAAAGACAGTAAAGAGTACTGTGGGAAAAAGCAGGAGCTGGCGTTGCCTGGTTTAGCTGTTACCCAGTCATTTCCAAATTGA

Protein sequence

MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLKVRLFLEKAHSLSQRWGNIPVIIAGDLNSIPKLDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLPNEVMVKDSKEYCGKKQELALPGLAVTQSFPN
Homology
BLAST of HG10008451 vs. NCBI nr
Match: XP_038901965.1 (carbon catabolite repressor protein 4 homolog 5 [Benincasa hispida])

HSP 1 Score: 679.9 bits (1753), Expect = 1.4e-191
Identity = 349/427 (81.73%), Postives = 359/427 (84.07%), Query Frame = 0

Query: 1   MDIRAKTEKNKRKPSTNA---AHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFK 60
           MDIRAK EKNKRKPSTNA   AH++ NDHRKKRRRLA  SETTIPT + PQKLAESNSF+
Sbjct: 10  MDIRAKPEKNKRKPSTNAAHCAHKSHNDHRKKRRRLASCSETTIPTSTEPQKLAESNSFE 69

Query: 61  SIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLY 120
           SIRS PRTSRKH+K RSSQTDGHRRWVYSARDCSRFID+IMVASYNILGVENALKHPDLY
Sbjct: 70  SIRSPPRTSRKHKKRRSSQTDGHRRWVYSARDCSRFIDRIMVASYNILGVENALKHPDLY 129

Query: 121 HRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEA 180
           HRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEA
Sbjct: 130 HRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEA 189

Query: 181 NDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK--------------------- 240
           NDGCA+FWIDK F+LLHQETIEFQS GLRNNVAQLCVLK                     
Sbjct: 190 NDGCAIFWIDKQFSLLHQETIEFQSCGLRNNVAQLCVLKMNKPESKSKTSRSFVIGNIHV 249

Query: 241 -------------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQL 300
                        VRLFLEKAH+LSQRWGNIPVIIAGDLNSIPK           LDIQL
Sbjct: 250 LFNPNRGDIKLGQVRLFLEKAHNLSQRWGNIPVIIAGDLNSIPKSAMYQFLASSELDIQL 309

Query: 301 HDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHL 360
           HDRRKISGQLDFSSSH   RFC  GTK SNVSASRSFRWSDEEIRIASGSE+VT LQHHL
Sbjct: 310 HDRRKISGQLDFSSSHRTLRFCSEGTKWSNVSASRSFRWSDEEIRIASGSENVTHLQHHL 369

Query: 361 KLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNR 380
           KLSSAYYGVPGSCKTRD NGEPL TSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDAL R
Sbjct: 370 KLSSAYYGVPGSCKTRDANGEPLATSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALKR 429

BLAST of HG10008451 vs. NCBI nr
Match: XP_008453389.1 (PREDICTED: carbon catabolite repressor protein 4 homolog 5 [Cucumis melo] >KAA0058071.1 carbon catabolite repressor protein 4-like protein 5 [Cucumis melo var. makuwa])

HSP 1 Score: 651.0 bits (1678), Expect = 6.9e-183
Identity = 337/422 (79.86%), Postives = 347/422 (82.23%), Query Frame = 0

Query: 3   IRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSS 62
           I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S+PQKLA SN  K+IRS 
Sbjct: 15  IPAKTDKYKRKPSTNA---ATNDHRKKRRRLAVSSETAVPKSSNPQKLAASNRLKTIRSP 74

Query: 63  PRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPS 122
            RTSRKH K RSSQTDGHRRWVYSARDCSRFID  MVASYNILGVENALKHPDLYHRVPS
Sbjct: 75  SRTSRKHGKRRSSQTDGHRRWVYSARDCSRFIDTFMVASYNILGVENALKHPDLYHRVPS 134

Query: 123 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 182
           KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA
Sbjct: 135 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 194

Query: 183 VFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK-------------------------- 242
           VFWIDKLF+LLHQETIEFQS+GLRNNVAQLCVLK                          
Sbjct: 195 VFWIDKLFSLLHQETIEFQSFGLRNNVAQLCVLKMNKSKSKSKTSRSFVIGNIHVLFNPN 254

Query: 243 --------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDRRK 302
                   VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK           LDIQLHDRRK
Sbjct: 255 RGDIKLGQVRLFLEKAHSLSQRWGNIPVIIAGDLNSIPKSAMYQFLASSELDIQLHDRRK 314

Query: 303 ISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSA 362
           ISGQLDFSSSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+VT LQH LKLSSA
Sbjct: 315 ISGQLDFSSSHGAFRFCSGGTKWSNVSTSKSFGWSDEEIRIASGSENVTRLQHQLKLSSA 374

Query: 363 YYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLP 380
           YYG+PGS KTRDTNGEPL TSFHSKF+GTVDYIWHSEKLAPVRVLETLPVDAL RTGGLP
Sbjct: 375 YYGIPGSYKTRDTNGEPLATSFHSKFLGTVDYIWHSEKLAPVRVLETLPVDALKRTGGLP 433

BLAST of HG10008451 vs. NCBI nr
Match: KAG6589550.1 (Carbon catabolite repressor protein 4-like 5, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 649.8 bits (1675), Expect = 1.5e-182
Identity = 330/424 (77.83%), Postives = 345/424 (81.37%), Query Frame = 0

Query: 1   MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIR 60
           MD+RAKT KN+R+ S N AH   ND RKKRRR AFSSETTIPT S PQKL  S   K +R
Sbjct: 1   MDVRAKTGKNRRERSANTAHCGHNDGRKKRRRFAFSSETTIPTSSEPQKLPSSKRSKPVR 60

Query: 61  SSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRV 120
           S  RTSRK+EK RSS++DGHRRWVYS RDCSRFIDKIMVASYNILGVENALKHPDLYHRV
Sbjct: 61  SPSRTSRKYEKRRSSESDGHRRWVYSTRDCSRFIDKIMVASYNILGVENALKHPDLYHRV 120

Query: 121 PSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDG 180
           PSKF+DWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYK VYKARTGEANDG
Sbjct: 121 PSKFMDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKSVYKARTGEANDG 180

Query: 181 CAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK------------------------ 240
           CA+FWI+KLF LLHQE+IEFQS+GLRNNVAQLCV K                        
Sbjct: 181 CALFWIEKLFTLLHQESIEFQSFGLRNNVAQLCVFKMNKPKSKSKTSRSFVIGNIHVLFN 240

Query: 241 ----------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDR 300
                     VRLFLEKAHSLSQRWGNIPVI+AGDLNSIPK           LDIQLHDR
Sbjct: 241 PNRGDIKLGQVRLFLEKAHSLSQRWGNIPVIMAGDLNSIPKSAIYQFLASSELDIQLHDR 300

Query: 301 RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLS 360
           RKISGQLDFSSS  AFRFC  GTK SNVSASRSFRWS EEIRIASG E+VTCLQHHLKLS
Sbjct: 301 RKISGQLDFSSSRGAFRFCSEGTKWSNVSASRSFRWSGEEIRIASGIENVTCLQHHLKLS 360

Query: 361 SAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGG 380
           SAYYGVPGSCKTRD NGEPLVTSFHS FMGTVDYIWHSEKLAPVRVLETLP+DAL RTGG
Sbjct: 361 SAYYGVPGSCKTRDANGEPLVTSFHSNFMGTVDYIWHSEKLAPVRVLETLPIDALKRTGG 420

BLAST of HG10008451 vs. NCBI nr
Match: XP_022921598.1 (carbon catabolite repressor protein 4 homolog 5 [Cucurbita moschata])

HSP 1 Score: 649.8 bits (1675), Expect = 1.5e-182
Identity = 330/424 (77.83%), Postives = 345/424 (81.37%), Query Frame = 0

Query: 1   MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIR 60
           MD+RAKT KN+R+ S N AH   ND RKKRRR AFSSETTIPT S PQKL  S   K +R
Sbjct: 12  MDVRAKTGKNRRERSANTAHCGHNDGRKKRRRFAFSSETTIPTSSEPQKLPSSKRSKPVR 71

Query: 61  SSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRV 120
           S  RTSRK+EK RSS++DGHRRWVYS RDCSRFIDKIMVASYNILGVENALKHPDLYHRV
Sbjct: 72  SPSRTSRKYEKRRSSESDGHRRWVYSTRDCSRFIDKIMVASYNILGVENALKHPDLYHRV 131

Query: 121 PSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDG 180
           PSKF+DWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYK VYKARTGEANDG
Sbjct: 132 PSKFMDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKSVYKARTGEANDG 191

Query: 181 CAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK------------------------ 240
           CA+FWI+KLF LLHQE+IEFQS+GLRNNVAQLCV K                        
Sbjct: 192 CALFWIEKLFTLLHQESIEFQSFGLRNNVAQLCVFKMNKPKSKSKTSRSFVIGNIHVLFN 251

Query: 241 ----------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDR 300
                     VRLFLEKAHSLSQRWGNIPVI+AGDLNSIPK           LDIQLHDR
Sbjct: 252 PNRGDIKLGQVRLFLEKAHSLSQRWGNIPVIMAGDLNSIPKSAIYQFLASSELDIQLHDR 311

Query: 301 RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLS 360
           RKISGQLDFSSS  AFRFC  GTK SNVSASRSFRWS EEIRIASG E+VTCLQHHLKLS
Sbjct: 312 RKISGQLDFSSSRGAFRFCSEGTKWSNVSASRSFRWSGEEIRIASGIENVTCLQHHLKLS 371

Query: 361 SAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGG 380
           SAYYGVPGSCKTRD NGEPLVTSFHS FMGTVDYIWHSEKLAPVRVLETLP+DAL RTGG
Sbjct: 372 SAYYGVPGSCKTRDANGEPLVTSFHSNFMGTVDYIWHSEKLAPVRVLETLPIDALKRTGG 431

BLAST of HG10008451 vs. NCBI nr
Match: TYK28421.1 (carbon catabolite repressor protein 4-like protein 5 [Cucumis melo var. makuwa])

HSP 1 Score: 648.7 bits (1672), Expect = 3.4e-182
Identity = 336/422 (79.62%), Postives = 345/422 (81.75%), Query Frame = 0

Query: 3   IRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSS 62
           I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S PQKLA SN  K+IRS 
Sbjct: 15  IPAKTDKYKRKPSTNA---ATNDHRKKRRRLAVSSETAVPKSSDPQKLAASNRLKTIRSP 74

Query: 63  PRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPS 122
            RTSRKH K RSSQTDGHRRWVYSARDCSRFID  MVASYNILGVENALKHPDLYHRVPS
Sbjct: 75  SRTSRKHGKRRSSQTDGHRRWVYSARDCSRFIDTFMVASYNILGVENALKHPDLYHRVPS 134

Query: 123 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 182
           KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA
Sbjct: 135 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 194

Query: 183 VFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK-------------------------- 242
           VFWIDKLF+LLHQETIEFQS+GLRNNVAQLCVLK                          
Sbjct: 195 VFWIDKLFSLLHQETIEFQSFGLRNNVAQLCVLKMNKSKSKSKTSRSFVIGNIHVLFNPN 254

Query: 243 --------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDRRK 302
                   VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK           LDIQLHDRRK
Sbjct: 255 RGDIKLGQVRLFLEKAHSLSQRWGNIPVIIAGDLNSIPKSAMYQFLASSELDIQLHDRRK 314

Query: 303 ISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSA 362
           ISGQLDF+SSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+ T LQH LKLSSA
Sbjct: 315 ISGQLDFTSSHGAFRFCSGGTKWSNVSTSKSFGWSDEEIRIASGSENATRLQHQLKLSSA 374

Query: 363 YYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLP 380
           YYG+PGS KTRDTNGEPL TSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDAL RTGGLP
Sbjct: 375 YYGIPGSYKTRDTNGEPLATSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALKRTGGLP 433

BLAST of HG10008451 vs. ExPASy Swiss-Prot
Match: Q0WKY2 (Carbon catabolite repressor protein 4 homolog 5 OS=Arabidopsis thaliana OX=3702 GN=CCR4-5 PE=2 SV=2)

HSP 1 Score: 360.1 bits (923), Expect = 3.2e-98
Identity = 196/424 (46.23%), Postives = 260/424 (61.32%), Query Frame = 0

Query: 12  RKPSTNAAHRARNDHRKKRRRLAFSSETTIP----TPSHPQKLAESNSFKSIRSSPRTSR 71
           ++   + + ++ N + K  R+    S T  P    TP   Q+  +    +  +SS R  R
Sbjct: 18  KRKRNSISEQSENVYEKSNRK---ESITLKPHRSFTPGFSQR--DCKPVRHSKSSLRRRR 77

Query: 72  KHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDW 131
           + ++  SS  +  R WV+SA +     DK+++ SYN+LGV+NA  H DLY+ VP K L+W
Sbjct: 78  RTKEKISSSVE--REWVFSANNFENLADKLVLVSYNLLGVDNASNHMDLYYNVPRKHLEW 137

Query: 132 SFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWID 191
           S RK LIC  I  YNA ILCLQEVDRF+DLD L +N G++GV+K+RTGEA+DGCA+FW +
Sbjct: 138 SRRKHLICKEISRYNASILCLQEVDRFDDLDVLLKNRGFRGVHKSRTGEASDGCAIFWKE 197

Query: 192 KLFALLHQETIEFQSYGLRNNVAQLCVL-------------------------------- 251
            LF LL  + IEF  +G+RNNVAQLCVL                                
Sbjct: 198 NLFELLDHQHIEFDKFGMRNNVAQLCVLEMNCEEDPKSKLRVRSSDPRRLVVGNIHVLFN 257

Query: 252 ---------KVRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDR 311
                    +VRLFLEKA+ LSQ WGNIPV IAGDLNS P+           LD QLHDR
Sbjct: 258 PKRGDIKLGQVRLFLEKAYKLSQEWGNIPVAIAGDLNSTPQSAIYDFIASADLDTQLHDR 317

Query: 312 RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLS 371
           R+ISGQ +      +FR   A +  +++S S    WS EE+++A+G +  T +QH LKL+
Sbjct: 318 RQISGQTEVEPKERSFRNHYAFSASASISGSLLNEWSQEELQLATGGQETTHVQHQLKLN 377

Query: 372 SAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGG 380
           SAY GVPG+ +TRD  GEPL T++HS+F+GTVDYIWH+++L PVRVLETLP D L RTGG
Sbjct: 378 SAYSGVPGTYRTRDQRGEPLATTYHSRFLGTVDYIWHTKELVPVRVLETLPADVLRRTGG 434

BLAST of HG10008451 vs. ExPASy Swiss-Prot
Match: Q9LS39 (Carbon catabolite repressor protein 4 homolog 3 OS=Arabidopsis thaliana OX=3702 GN=CCR4-3 PE=2 SV=2)

HSP 1 Score: 267.7 bits (683), Expect = 2.2e-70
Identity = 153/393 (38.93%), Postives = 223/393 (56.74%), Query Frame = 0

Query: 36  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRF 95
           SS T+ P+ S+P+  + + S+     +P   R+H ++  SSQ    R W+ S     S+ 
Sbjct: 49  SSSTSGPSDSNPES-SSNRSYSRRWQNPLPRRQHPDQIPSSQI--ARDWIDSDTTPVSQA 108

Query: 96  IDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDR 155
           +++  V SYNILG  N+  H +LY  V   +L W +RK LIC  +   N  I+ +QEVD+
Sbjct: 109 LERFTVVSYNILGDGNSSYHRELYSNVSVPYLKWGYRKRLICEELIRLNPDIISMQEVDK 168

Query: 156 FNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLC 215
           + DL  + +  GY G YK RTG+  DGCA+FW    F +L +E IEF  +G+R+NVAQL 
Sbjct: 169 YFDLFSMMEKAGYAGSYKRRTGDNVDGCAMFWKADRFGVLERENIEFSQFGMRDNVAQLA 228

Query: 216 VL------------------------------KVRLFLEKAHSLSQRWGNIPVIIAGDLN 275
           VL                              +VR    KAH LS++WG+IP+++ GD N
Sbjct: 229 VLELRKSNKSRKILLGNIHVLYNPNQGDVKLGQVRSLCSKAHLLSKKWGDIPIVLCGDFN 288

Query: 276 SIPK-----------LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW 335
           S PK           L++  HD++++SGQ +   +    +    G+K SN ++ S    W
Sbjct: 289 STPKSPLYNFLASSELNVMEHDKKELSGQKNCRPT----KVLETGSKSSNTITFSFCSSW 348

Query: 336 SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIW 385
           + EEIR+A+G E+     H LKL+S+Y  V GS  TRD+ GEPL TS+HSKF+GTVDY+W
Sbjct: 349 TKEEIRVATGQENSYWAAHPLKLNSSYASVKGSANTRDSVGEPLATSYHSKFLGTVDYLW 408

BLAST of HG10008451 vs. ExPASy Swiss-Prot
Match: Q8VYU4 (Carbon catabolite repressor protein 4 homolog 6 OS=Arabidopsis thaliana OX=3702 GN=CCR4-6 PE=2 SV=2)

HSP 1 Score: 159.5 bits (402), Expect = 8.3e-38
Identity = 141/601 (23.46%), Postives = 196/601 (32.61%), Query Frame = 0

Query: 42  PTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVAS 101
           P P +  +++     +S R  PR          S+   +R W Y+    S   +K +V S
Sbjct: 138 PPPFYQNQMSRPPPQQSFRQRPR----------SKPSDYREWEYAKTPPSPGSEKFVVLS 197

Query: 102 YNILGVENALKH-PDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDEL 161
           YNIL    A  H   LY  +P   L W +RK  +   +  ++A I+CLQEVD+F DL+E 
Sbjct: 198 YNILADYLANDHWRSLYFHIPRNMLSWGWRKSKLVFELSLWSADIMCLQEVDKFQDLEEE 257

Query: 162 FQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVL----- 221
            ++ GY  ++K RTG A DGCA+FW    F L+H+E+I+F   GLR+NVAQ+CVL     
Sbjct: 258 MKHRGYSAIWKMRTGNAVDGCAIFWRSNRFKLVHEESIQFNQLGLRDNVAQICVLETLLT 317

Query: 222 ---------------------------------------KVRLFLEKAHSLSQRWGNIPV 281
                                                  +VR  L+KAH++S+ W + P+
Sbjct: 318 SHTKENETPPPESSAGSHRVVICNIHVLFNPKRGDFKLGQVRTLLDKAHAVSKLWDDAPI 377

Query: 282 IIAGDLNSIP-----------KLDIQLHDRRKISGQLD---------------------- 341
           ++ GD N  P           KLD+    R K+SGQ+                       
Sbjct: 378 VLCGDFNCTPKSPLYNFISDRKLDLSGLARDKVSGQVSAEFRPPRPENYTTRYQSANKSP 437

Query: 342 ---------------------------------------FSSSHAAFRFCRAGTKCSNVS 378
                                                    + H A         C N++
Sbjct: 438 QGQVQPPNLITNAHMENNSNIDVGTAPSEKTSELPCGDTILAGHEATSSSDQVLPCENMA 497

BLAST of HG10008451 vs. ExPASy Swiss-Prot
Match: A6H7I3 (Protein angel homolog 2 OS=Bos taurus OX=9913 GN=ANGEL2 PE=2 SV=1)

HSP 1 Score: 87.8 bits (216), Expect = 3.1e-16
Identity = 88/325 (27.08%), Postives = 134/325 (41.23%), Query Frame = 0

Query: 99  VASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEV--DRF-N 158
           V SYNIL  +    +  LY       L WSFR   I   IK ++A +LCLQEV  D +  
Sbjct: 169 VMSYNILSQDLLEDNSHLYKHCRRPVLHWSFRFPNILKEIKHFDADVLCLQEVQEDHYGT 228

Query: 159 DLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGL----RNNVAQ 218
           ++    ++ GY   YK RTG   DGCA+ +    F+LL    +EF    +    R+NV  
Sbjct: 229 EIRPSLESLGYHCEYKMRTGRKPDGCAICFKHSKFSLLSVNPVEFYRRDVPLLDRDNVGL 288

Query: 219 LCVLKVR--------LFLEKAHSL-SQRWGNI------------------------PVII 278
           + +L+ +        + +   H L + R G+I                        P+++
Sbjct: 289 VLLLQPKIPSATSPAICVANTHLLYNPRRGDIKLTQLAMLLAEISSVAHQKDGRFCPIVM 348

Query: 279 AGDLNSIP-----------KLDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASR 338
            GD NS+P           KL+ +     K+SGQ     S    R         N+  S+
Sbjct: 349 CGDFNSVPGSPLYSFIKEGKLNYEGLAIGKVSGQ---EQSSRGQRILSIPIWPPNLGISQ 408

Query: 339 SFRWSDEEIR-------------------IASGSEHVTCLQHHLKLSSAYYGVPGSCKTR 354
           +  +  +++                    + +  +  + LQHH  LSS Y     S    
Sbjct: 409 NCVYEVQQVPKVEKPDGDLTQPELDKTEVLVTAEKLSSNLQHHFSLSSVY-----SHYLP 468

BLAST of HG10008451 vs. ExPASy Swiss-Prot
Match: Q5VTE6 (Protein angel homolog 2 OS=Homo sapiens OX=9606 GN=ANGEL2 PE=1 SV=1)

HSP 1 Score: 87.0 bits (214), Expect = 5.2e-16
Identity = 88/325 (27.08%), Postives = 134/325 (41.23%), Query Frame = 0

Query: 99  VASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEV--DRFN- 158
           V SYNIL  +    +  LY       L WSFR   I   IK ++A +LCLQEV  D +  
Sbjct: 169 VMSYNILSQDLLEDNSHLYRHCRRPVLHWSFRFPNILKEIKHFDADVLCLQEVQEDHYGA 228

Query: 159 DLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGL----RNNVAQ 218
           ++    ++ GY   YK RTG   DGCA+ +    F+LL    +EF    +    R+NV  
Sbjct: 229 EIRPSLESLGYHCEYKMRTGRKPDGCAICFKHSKFSLLSVNPVEFFRPDISLLDRDNVGL 288

Query: 219 LCVLKVR--------LFLEKAHSL-SQRWGNI------------------------PVII 278
           + +L+ +        + +   H L + R G+I                        P+++
Sbjct: 289 VLLLQPKIPYAACPAICVANTHLLYNPRRGDIKLTQLAMLLAEISSVAHQKDGSFCPIVM 348

Query: 279 AGDLNSIP-----------KLDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASR 338
            GD NS+P           KL+ +     K+SGQ     S    R         N+  S+
Sbjct: 349 CGDFNSVPGSPLYSFIKEGKLNYEGLPIGKVSGQ---EQSSRGQRILSIPIWPPNLGISQ 408

Query: 339 SFRWSDEEIR-------------------IASGSEHVTCLQHHLKLSSAYYGVPGSCKTR 354
           +  +  +++                    + +  +  + LQHH  LSS Y     S    
Sbjct: 409 NCVYEVQQVPKVEKTDSDLTQTQLKQTEVLVTAEKLSSNLQHHFSLSSVY-----SHYFP 468

BLAST of HG10008451 vs. ExPASy TrEMBL
Match: A0A5A7UX51 (Carbon catabolite repressor protein 4-like protein 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G003780 PE=4 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 3.3e-183
Identity = 337/422 (79.86%), Postives = 347/422 (82.23%), Query Frame = 0

Query: 3   IRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSS 62
           I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S+PQKLA SN  K+IRS 
Sbjct: 15  IPAKTDKYKRKPSTNA---ATNDHRKKRRRLAVSSETAVPKSSNPQKLAASNRLKTIRSP 74

Query: 63  PRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPS 122
            RTSRKH K RSSQTDGHRRWVYSARDCSRFID  MVASYNILGVENALKHPDLYHRVPS
Sbjct: 75  SRTSRKHGKRRSSQTDGHRRWVYSARDCSRFIDTFMVASYNILGVENALKHPDLYHRVPS 134

Query: 123 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 182
           KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA
Sbjct: 135 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 194

Query: 183 VFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK-------------------------- 242
           VFWIDKLF+LLHQETIEFQS+GLRNNVAQLCVLK                          
Sbjct: 195 VFWIDKLFSLLHQETIEFQSFGLRNNVAQLCVLKMNKSKSKSKTSRSFVIGNIHVLFNPN 254

Query: 243 --------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDRRK 302
                   VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK           LDIQLHDRRK
Sbjct: 255 RGDIKLGQVRLFLEKAHSLSQRWGNIPVIIAGDLNSIPKSAMYQFLASSELDIQLHDRRK 314

Query: 303 ISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSA 362
           ISGQLDFSSSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+VT LQH LKLSSA
Sbjct: 315 ISGQLDFSSSHGAFRFCSGGTKWSNVSTSKSFGWSDEEIRIASGSENVTRLQHQLKLSSA 374

Query: 363 YYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLP 380
           YYG+PGS KTRDTNGEPL TSFHSKF+GTVDYIWHSEKLAPVRVLETLPVDAL RTGGLP
Sbjct: 375 YYGIPGSYKTRDTNGEPLATSFHSKFLGTVDYIWHSEKLAPVRVLETLPVDALKRTGGLP 433

BLAST of HG10008451 vs. ExPASy TrEMBL
Match: A0A1S3BW46 (carbon catabolite repressor protein 4 homolog 5 OS=Cucumis melo OX=3656 GN=LOC103494121 PE=4 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 3.3e-183
Identity = 337/422 (79.86%), Postives = 347/422 (82.23%), Query Frame = 0

Query: 3   IRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSS 62
           I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S+PQKLA SN  K+IRS 
Sbjct: 15  IPAKTDKYKRKPSTNA---ATNDHRKKRRRLAVSSETAVPKSSNPQKLAASNRLKTIRSP 74

Query: 63  PRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPS 122
            RTSRKH K RSSQTDGHRRWVYSARDCSRFID  MVASYNILGVENALKHPDLYHRVPS
Sbjct: 75  SRTSRKHGKRRSSQTDGHRRWVYSARDCSRFIDTFMVASYNILGVENALKHPDLYHRVPS 134

Query: 123 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 182
           KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA
Sbjct: 135 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 194

Query: 183 VFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK-------------------------- 242
           VFWIDKLF+LLHQETIEFQS+GLRNNVAQLCVLK                          
Sbjct: 195 VFWIDKLFSLLHQETIEFQSFGLRNNVAQLCVLKMNKSKSKSKTSRSFVIGNIHVLFNPN 254

Query: 243 --------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDRRK 302
                   VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK           LDIQLHDRRK
Sbjct: 255 RGDIKLGQVRLFLEKAHSLSQRWGNIPVIIAGDLNSIPKSAMYQFLASSELDIQLHDRRK 314

Query: 303 ISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSA 362
           ISGQLDFSSSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+VT LQH LKLSSA
Sbjct: 315 ISGQLDFSSSHGAFRFCSGGTKWSNVSTSKSFGWSDEEIRIASGSENVTRLQHQLKLSSA 374

Query: 363 YYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLP 380
           YYG+PGS KTRDTNGEPL TSFHSKF+GTVDYIWHSEKLAPVRVLETLPVDAL RTGGLP
Sbjct: 375 YYGIPGSYKTRDTNGEPLATSFHSKFLGTVDYIWHSEKLAPVRVLETLPVDALKRTGGLP 433

BLAST of HG10008451 vs. ExPASy TrEMBL
Match: A0A6J1E4C1 (carbon catabolite repressor protein 4 homolog 5 OS=Cucurbita moschata OX=3662 GN=LOC111429815 PE=4 SV=1)

HSP 1 Score: 649.8 bits (1675), Expect = 7.5e-183
Identity = 330/424 (77.83%), Postives = 345/424 (81.37%), Query Frame = 0

Query: 1   MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIR 60
           MD+RAKT KN+R+ S N AH   ND RKKRRR AFSSETTIPT S PQKL  S   K +R
Sbjct: 12  MDVRAKTGKNRRERSANTAHCGHNDGRKKRRRFAFSSETTIPTSSEPQKLPSSKRSKPVR 71

Query: 61  SSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRV 120
           S  RTSRK+EK RSS++DGHRRWVYS RDCSRFIDKIMVASYNILGVENALKHPDLYHRV
Sbjct: 72  SPSRTSRKYEKRRSSESDGHRRWVYSTRDCSRFIDKIMVASYNILGVENALKHPDLYHRV 131

Query: 121 PSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDG 180
           PSKF+DWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYK VYKARTGEANDG
Sbjct: 132 PSKFMDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKSVYKARTGEANDG 191

Query: 181 CAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK------------------------ 240
           CA+FWI+KLF LLHQE+IEFQS+GLRNNVAQLCV K                        
Sbjct: 192 CALFWIEKLFTLLHQESIEFQSFGLRNNVAQLCVFKMNKPKSKSKTSRSFVIGNIHVLFN 251

Query: 241 ----------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDR 300
                     VRLFLEKAHSLSQRWGNIPVI+AGDLNSIPK           LDIQLHDR
Sbjct: 252 PNRGDIKLGQVRLFLEKAHSLSQRWGNIPVIMAGDLNSIPKSAIYQFLASSELDIQLHDR 311

Query: 301 RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLS 360
           RKISGQLDFSSS  AFRFC  GTK SNVSASRSFRWS EEIRIASG E+VTCLQHHLKLS
Sbjct: 312 RKISGQLDFSSSRGAFRFCSEGTKWSNVSASRSFRWSGEEIRIASGIENVTCLQHHLKLS 371

Query: 361 SAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGG 380
           SAYYGVPGSCKTRD NGEPLVTSFHS FMGTVDYIWHSEKLAPVRVLETLP+DAL RTGG
Sbjct: 372 SAYYGVPGSCKTRDANGEPLVTSFHSNFMGTVDYIWHSEKLAPVRVLETLPIDALKRTGG 431

BLAST of HG10008451 vs. ExPASy TrEMBL
Match: A0A5D3DXW9 (Carbon catabolite repressor protein 4-like protein 5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold629G00510 PE=4 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 1.7e-182
Identity = 336/422 (79.62%), Postives = 345/422 (81.75%), Query Frame = 0

Query: 3   IRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIRSS 62
           I AKT+K KRKPSTNA   A NDHRKKRRRLA SSET +P  S PQKLA SN  K+IRS 
Sbjct: 15  IPAKTDKYKRKPSTNA---ATNDHRKKRRRLAVSSETAVPKSSDPQKLAASNRLKTIRSP 74

Query: 63  PRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPS 122
            RTSRKH K RSSQTDGHRRWVYSARDCSRFID  MVASYNILGVENALKHPDLYHRVPS
Sbjct: 75  SRTSRKHGKRRSSQTDGHRRWVYSARDCSRFIDTFMVASYNILGVENALKHPDLYHRVPS 134

Query: 123 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 182
           KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA
Sbjct: 135 KFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCA 194

Query: 183 VFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK-------------------------- 242
           VFWIDKLF+LLHQETIEFQS+GLRNNVAQLCVLK                          
Sbjct: 195 VFWIDKLFSLLHQETIEFQSFGLRNNVAQLCVLKMNKSKSKSKTSRSFVIGNIHVLFNPN 254

Query: 243 --------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDRRK 302
                   VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK           LDIQLHDRRK
Sbjct: 255 RGDIKLGQVRLFLEKAHSLSQRWGNIPVIIAGDLNSIPKSAMYQFLASSELDIQLHDRRK 314

Query: 303 ISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLSSA 362
           ISGQLDF+SSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+ T LQH LKLSSA
Sbjct: 315 ISGQLDFTSSHGAFRFCSGGTKWSNVSTSKSFGWSDEEIRIASGSENATRLQHQLKLSSA 374

Query: 363 YYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGGLP 380
           YYG+PGS KTRDTNGEPL TSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDAL RTGGLP
Sbjct: 375 YYGIPGSYKTRDTNGEPLATSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALKRTGGLP 433

BLAST of HG10008451 vs. ExPASy TrEMBL
Match: A0A0A0LUW1 (Endo/exonuclease/phosphatase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G023040 PE=4 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 1.7e-182
Identity = 336/424 (79.25%), Postives = 349/424 (82.31%), Query Frame = 0

Query: 1   MDIRAKTEKNKRKPSTNAAHRARNDHRKKRRRLAFSSETTIPTPSHPQKLAESNSFKSIR 60
           MDI AKT+KNKRKPST+A   A NDHRKKRRRLA SSET IP  S PQKLA S+  K+I 
Sbjct: 13  MDIPAKTDKNKRKPSTSA---APNDHRKKRRRLAVSSETAIPKSSDPQKLAASSRLKTIC 72

Query: 61  SSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRV 120
           S  RTSRKH K RSSQTDGHRRWVYSARDCSRFIDK MVASYNILGVENAL HPDLYHRV
Sbjct: 73  SPSRTSRKHGKRRSSQTDGHRRWVYSARDCSRFIDKFMVASYNILGVENALNHPDLYHRV 132

Query: 121 PSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDG 180
           PSKFLDWSFRKELICNAIKFYNAGILCLQEVDRF+DLDELFQNYGYKGVYKARTGEANDG
Sbjct: 133 PSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFDDLDELFQNYGYKGVYKARTGEANDG 192

Query: 181 CAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVLK------------------------ 240
           CAVFWIDKLF+LLHQETIEFQ++GLRNNVAQLCVLK                        
Sbjct: 193 CAVFWIDKLFSLLHQETIEFQNFGLRNNVAQLCVLKMNKSKSKSKTSRSFVIGNIHVLFN 252

Query: 241 ----------VRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDR 300
                     VRLFLEKAHSLSQRWGN+PVIIAGDLNSIPK           LDIQLHDR
Sbjct: 253 PNRGDIKLGQVRLFLEKAHSLSQRWGNVPVIIAGDLNSIPKSAIYQFLASSELDIQLHDR 312

Query: 301 RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLS 360
           RKISGQLDFSSSH AFRFC  GTK SNVS S+SF WSDEEIRIASGSE+VT LQH LKLS
Sbjct: 313 RKISGQLDFSSSHGAFRFCSGGTKWSNVSTSKSFGWSDEEIRIASGSENVTRLQHPLKLS 372

Query: 361 SAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGG 380
           SAYYG+PGS KTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDAL +TGG
Sbjct: 373 SAYYGIPGSYKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALKKTGG 432

BLAST of HG10008451 vs. TAIR 10
Match: AT1G73875.1 (DNAse I-like superfamily protein )

HSP 1 Score: 360.1 bits (923), Expect = 2.3e-99
Identity = 196/424 (46.23%), Postives = 260/424 (61.32%), Query Frame = 0

Query: 12  RKPSTNAAHRARNDHRKKRRRLAFSSETTIP----TPSHPQKLAESNSFKSIRSSPRTSR 71
           ++   + + ++ N + K  R+    S T  P    TP   Q+  +    +  +SS R  R
Sbjct: 18  KRKRNSISEQSENVYEKSNRK---ESITLKPHRSFTPGFSQR--DCKPVRHSKSSLRRRR 77

Query: 72  KHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVASYNILGVENALKHPDLYHRVPSKFLDW 131
           + ++  SS  +  R WV+SA +     DK+++ SYN+LGV+NA  H DLY+ VP K L+W
Sbjct: 78  RTKEKISSSVE--REWVFSANNFENLADKLVLVSYNLLGVDNASNHMDLYYNVPRKHLEW 137

Query: 132 SFRKELICNAIKFYNAGILCLQEVDRFNDLDELFQNYGYKGVYKARTGEANDGCAVFWID 191
           S RK LIC  I  YNA ILCLQEVDRF+DLD L +N G++GV+K+RTGEA+DGCA+FW +
Sbjct: 138 SRRKHLICKEISRYNASILCLQEVDRFDDLDVLLKNRGFRGVHKSRTGEASDGCAIFWKE 197

Query: 192 KLFALLHQETIEFQSYGLRNNVAQLCVL-------------------------------- 251
            LF LL  + IEF  +G+RNNVAQLCVL                                
Sbjct: 198 NLFELLDHQHIEFDKFGMRNNVAQLCVLEMNCEEDPKSKLRVRSSDPRRLVVGNIHVLFN 257

Query: 252 ---------KVRLFLEKAHSLSQRWGNIPVIIAGDLNSIPK-----------LDIQLHDR 311
                    +VRLFLEKA+ LSQ WGNIPV IAGDLNS P+           LD QLHDR
Sbjct: 258 PKRGDIKLGQVRLFLEKAYKLSQEWGNIPVAIAGDLNSTPQSAIYDFIASADLDTQLHDR 317

Query: 312 RKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSFRWSDEEIRIASGSEHVTCLQHHLKLS 371
           R+ISGQ +      +FR   A +  +++S S    WS EE+++A+G +  T +QH LKL+
Sbjct: 318 RQISGQTEVEPKERSFRNHYAFSASASISGSLLNEWSQEELQLATGGQETTHVQHQLKLN 377

Query: 372 SAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIWHSEKLAPVRVLETLPVDALNRTGG 380
           SAY GVPG+ +TRD  GEPL T++HS+F+GTVDYIWH+++L PVRVLETLP D L RTGG
Sbjct: 378 SAYSGVPGTYRTRDQRGEPLATTYHSRFLGTVDYIWHTKELVPVRVLETLPADVLRRTGG 434

BLAST of HG10008451 vs. TAIR 10
Match: AT3G18500.3 (DNAse I-like superfamily protein )

HSP 1 Score: 271.6 bits (693), Expect = 1.1e-72
Identity = 155/394 (39.34%), Postives = 223/394 (56.60%), Query Frame = 0

Query: 36  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRF 95
           SS T+ P+ S+P+  + + S+     +P   R+H ++  SSQ    R W+ S     S+ 
Sbjct: 49  SSSTSGPSDSNPES-SSNRSYSRRWQNPLPRRQHPDQIPSSQI--ARDWIDSDTTPVSQA 108

Query: 96  IDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDR 155
           +++  V SYNILG  N+  H +LY  V   +L W +RK LIC  +   N  I+ +QEVD+
Sbjct: 109 LERFTVVSYNILGDGNSSYHRELYSNVSVPYLKWGYRKRLICEELIRLNPDIISMQEVDK 168

Query: 156 FNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLC 215
           + DL  + +  GY G YK RTG+  DGCA+FW    F +L +E IEF  +G+R+NVAQL 
Sbjct: 169 YFDLFSMMEKAGYAGSYKRRTGDNVDGCAMFWKADRFGVLERENIEFSQFGMRDNVAQLA 228

Query: 216 VL------------------------------KVRLFLEKAHSLSQRWGNIPVIIAGDLN 275
           VL                              +VR    KAH LS++WG+IP+++ GD N
Sbjct: 229 VLELRKSNKSRKILLGNIHVLYNPNQGDVKLGQVRSLCSKAHLLSKKWGDIPIVLCGDFN 288

Query: 276 SIPK-----------LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSNVSASRSF--R 335
           S PK           L++  HD++++SGQ +   +    +    G+K SN    RSF   
Sbjct: 289 STPKSPLYNFLASSELNVMEHDKKELSGQKNCRPT----KVLETGSKSSNTITFRSFCSS 348

Query: 336 WSDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYI 385
           W+ EEIR+A+G E+     H LKL+S+Y  V GS  TRD+ GEPL TS+HSKF+GTVDY+
Sbjct: 349 WTKEEIRVATGQENSYWAAHPLKLNSSYASVKGSANTRDSVGEPLATSYHSKFLGTVDYL 408

BLAST of HG10008451 vs. TAIR 10
Match: AT3G18500.2 (DNAse I-like superfamily protein )

HSP 1 Score: 267.7 bits (683), Expect = 1.5e-71
Identity = 153/393 (38.93%), Postives = 223/393 (56.74%), Query Frame = 0

Query: 36  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRF 95
           SS T+ P+ S+P+  + + S+     +P   R+H ++  SSQ    R W+ S     S+ 
Sbjct: 49  SSSTSGPSDSNPES-SSNRSYSRRWQNPLPRRQHPDQIPSSQI--ARDWIDSDTTPVSQA 108

Query: 96  IDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDR 155
           +++  V SYNILG  N+  H +LY  V   +L W +RK LIC  +   N  I+ +QEVD+
Sbjct: 109 LERFTVVSYNILGDGNSSYHRELYSNVSVPYLKWGYRKRLICEELIRLNPDIISMQEVDK 168

Query: 156 FNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLC 215
           + DL  + +  GY G YK RTG+  DGCA+FW    F +L +E IEF  +G+R+NVAQL 
Sbjct: 169 YFDLFSMMEKAGYAGSYKRRTGDNVDGCAMFWKADRFGVLERENIEFSQFGMRDNVAQLA 228

Query: 216 VL------------------------------KVRLFLEKAHSLSQRWGNIPVIIAGDLN 275
           VL                              +VR    KAH LS++WG+IP+++ GD N
Sbjct: 229 VLELRKSNKSRKILLGNIHVLYNPNQGDVKLGQVRSLCSKAHLLSKKWGDIPIVLCGDFN 288

Query: 276 SIPK-----------LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW 335
           S PK           L++  HD++++SGQ +   +    +    G+K SN ++ S    W
Sbjct: 289 STPKSPLYNFLASSELNVMEHDKKELSGQKNCRPT----KVLETGSKSSNTITFSFCSSW 348

Query: 336 SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIW 385
           + EEIR+A+G E+     H LKL+S+Y  V GS  TRD+ GEPL TS+HSKF+GTVDY+W
Sbjct: 349 TKEEIRVATGQENSYWAAHPLKLNSSYASVKGSANTRDSVGEPLATSYHSKFLGTVDYLW 408

BLAST of HG10008451 vs. TAIR 10
Match: AT3G18500.1 (DNAse I-like superfamily protein )

HSP 1 Score: 233.8 bits (595), Expect = 2.5e-61
Identity = 143/393 (36.39%), Postives = 209/393 (53.18%), Query Frame = 0

Query: 36  SSETTIPTPSHPQKLAESNSFKSIRSSPRTSRKH-EKGRSSQTDGHRRWVYS-ARDCSRF 95
           SS T+ P+ S+P+  + + S+     +P   R+H ++  SSQ    R W+ S     S+ 
Sbjct: 49  SSSTSGPSDSNPES-SSNRSYSRRWQNPLPRRQHPDQIPSSQI--ARDWIDSDTTPVSQA 108

Query: 96  IDKIMVASYNILGVENALKHPDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDR 155
           +++  V SYNILG  N+  H +LY  V   +L W +RK LIC  +   N  I+ +Q    
Sbjct: 109 LERFTVVSYNILGDGNSSYHRELYSNVSVPYLKWGYRKRLICEELIRLNPDIISMQR--- 168

Query: 156 FNDLDELFQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLC 215
                              RTG+  DGCA+FW    F +L +E IEF  +G+R+NVAQL 
Sbjct: 169 -------------------RTGDNVDGCAMFWKADRFGVLERENIEFSQFGMRDNVAQLA 228

Query: 216 VL------------------------------KVRLFLEKAHSLSQRWGNIPVIIAGDLN 275
           VL                              +VR    KAH LS++WG+IP+++ GD N
Sbjct: 229 VLELRKSNKSRKILLGNIHVLYNPNQGDVKLGQVRSLCSKAHLLSKKWGDIPIVLCGDFN 288

Query: 276 SIPK-----------LDIQLHDRRKISGQLDFSSSHAAFRFCRAGTKCSN-VSASRSFRW 335
           S PK           L++  HD++++SGQ +   +    +    G+K SN ++ S    W
Sbjct: 289 STPKSPLYNFLASSELNVMEHDKKELSGQKNCRPT----KVLETGSKSSNTITFSFCSSW 348

Query: 336 SDEEIRIASGSEHVTCLQHHLKLSSAYYGVPGSCKTRDTNGEPLVTSFHSKFMGTVDYIW 385
           + EEIR+A+G E+     H LKL+S+Y  V GS  TRD+ GEPL TS+HSKF+GTVDY+W
Sbjct: 349 TKEEIRVATGQENSYWAAHPLKLNSSYASVKGSANTRDSVGEPLATSYHSKFLGTVDYLW 408

BLAST of HG10008451 vs. TAIR 10
Match: AT5G11350.1 (DNAse I-like superfamily protein )

HSP 1 Score: 159.5 bits (402), Expect = 5.9e-39
Identity = 141/601 (23.46%), Postives = 196/601 (32.61%), Query Frame = 0

Query: 42  PTPSHPQKLAESNSFKSIRSSPRTSRKHEKGRSSQTDGHRRWVYSARDCSRFIDKIMVAS 101
           P P +  +++     +S R  PR          S+   +R W Y+    S   +K +V S
Sbjct: 138 PPPFYQNQMSRPPPQQSFRQRPR----------SKPSDYREWEYAKTPPSPGSEKFVVLS 197

Query: 102 YNILGVENALKH-PDLYHRVPSKFLDWSFRKELICNAIKFYNAGILCLQEVDRFNDLDEL 161
           YNIL    A  H   LY  +P   L W +RK  +   +  ++A I+CLQEVD+F DL+E 
Sbjct: 198 YNILADYLANDHWRSLYFHIPRNMLSWGWRKSKLVFELSLWSADIMCLQEVDKFQDLEEE 257

Query: 162 FQNYGYKGVYKARTGEANDGCAVFWIDKLFALLHQETIEFQSYGLRNNVAQLCVL----- 221
            ++ GY  ++K RTG A DGCA+FW    F L+H+E+I+F   GLR+NVAQ+CVL     
Sbjct: 258 MKHRGYSAIWKMRTGNAVDGCAIFWRSNRFKLVHEESIQFNQLGLRDNVAQICVLETLLT 317

Query: 222 ---------------------------------------KVRLFLEKAHSLSQRWGNIPV 281
                                                  +VR  L+KAH++S+ W + P+
Sbjct: 318 SHTKENETPPPESSAGSHRVVICNIHVLFNPKRGDFKLGQVRTLLDKAHAVSKLWDDAPI 377

Query: 282 IIAGDLNSIP-----------KLDIQLHDRRKISGQLD---------------------- 341
           ++ GD N  P           KLD+    R K+SGQ+                       
Sbjct: 378 VLCGDFNCTPKSPLYNFISDRKLDLSGLARDKVSGQVSAEFRPPRPENYTTRYQSANKSP 437

Query: 342 ---------------------------------------FSSSHAAFRFCRAGTKCSNVS 378
                                                    + H A         C N++
Sbjct: 438 QGQVQPPNLITNAHMENNSNIDVGTAPSEKTSELPCGDTILAGHEATSSSDQVLPCENMA 497

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038901965.11.4e-19181.73carbon catabolite repressor protein 4 homolog 5 [Benincasa hispida][more]
XP_008453389.16.9e-18379.86PREDICTED: carbon catabolite repressor protein 4 homolog 5 [Cucumis melo] >KAA00... [more]
KAG6589550.11.5e-18277.83Carbon catabolite repressor protein 4-like 5, partial [Cucurbita argyrosperma su... [more]
XP_022921598.11.5e-18277.83carbon catabolite repressor protein 4 homolog 5 [Cucurbita moschata][more]
TYK28421.13.4e-18279.62carbon catabolite repressor protein 4-like protein 5 [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
Q0WKY23.2e-9846.23Carbon catabolite repressor protein 4 homolog 5 OS=Arabidopsis thaliana OX=3702 ... [more]
Q9LS392.2e-7038.93Carbon catabolite repressor protein 4 homolog 3 OS=Arabidopsis thaliana OX=3702 ... [more]
Q8VYU48.3e-3823.46Carbon catabolite repressor protein 4 homolog 6 OS=Arabidopsis thaliana OX=3702 ... [more]
A6H7I33.1e-1627.08Protein angel homolog 2 OS=Bos taurus OX=9913 GN=ANGEL2 PE=2 SV=1[more]
Q5VTE65.2e-1627.08Protein angel homolog 2 OS=Homo sapiens OX=9606 GN=ANGEL2 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A5A7UX513.3e-18379.86Carbon catabolite repressor protein 4-like protein 5 OS=Cucumis melo var. makuwa... [more]
A0A1S3BW463.3e-18379.86carbon catabolite repressor protein 4 homolog 5 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1E4C17.5e-18377.83carbon catabolite repressor protein 4 homolog 5 OS=Cucurbita moschata OX=3662 GN... [more]
A0A5D3DXW91.7e-18279.62Carbon catabolite repressor protein 4-like protein 5 OS=Cucumis melo var. makuwa... [more]
A0A0A0LUW11.7e-18279.25Endo/exonuclease/phosphatase domain-containing protein OS=Cucumis sativus OX=365... [more]
Match NameE-valueIdentityDescription
AT1G73875.12.3e-9946.23DNAse I-like superfamily protein [more]
AT3G18500.31.1e-7239.34DNAse I-like superfamily protein [more]
AT3G18500.21.5e-7138.93DNAse I-like superfamily protein [more]
AT3G18500.12.5e-6136.39DNAse I-like superfamily protein [more]
AT5G11350.15.9e-3923.46DNAse I-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 219..384
e-value: 2.1E-22
score: 81.5
coord: 71..218
e-value: 9.3E-35
score: 122.2
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 96..359
IPR005135Endonuclease/exonuclease/phosphatasePFAMPF03372Exo_endo_phoscoord: 100..373
e-value: 3.7E-14
score: 52.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..78
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 36..61
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 63..78
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..21
NoneNo IPR availablePANTHERPTHR12121:SF85CARBON CATABOLITE REPRESSOR PROTEIN 4 HOMOLOG 6coord: 21..216
NoneNo IPR availablePANTHERPTHR12121CARBON CATABOLITE REPRESSOR PROTEIN 4coord: 21..216
NoneNo IPR availablePANTHERPTHR12121:SF85CARBON CATABOLITE REPRESSOR PROTEIN 4 HOMOLOG 6coord: 283..378
coord: 217..261
NoneNo IPR availablePANTHERPTHR12121CARBON CATABOLITE REPRESSOR PROTEIN 4coord: 283..378
coord: 217..261

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10008451.1HG10008451.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090503 RNA phosphodiester bond hydrolysis, exonucleolytic
molecular_function GO:0000175 3'-5'-exoribonuclease activity
molecular_function GO:0003824 catalytic activity