HG10010739 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10010739
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: integrator complex subunit 3 homolog
Location: Chr06: 25395692 .. 25403878 (+)
RNA-Seq Expression: HG10010739
Synteny: HG10010739
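The Location field can be turned into structured coordinates and a gene span. The sketch below assumes the coordinates are 1-based and inclusive (the GFF3 convention); the page itself does not state this, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr06: 25395692 .. 25403878 (+)' style string into fields.

    Assumption: start and end are 1-based, inclusive coordinates,
    so the span length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr06: 25395692 .. 25403878 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 8187 bp on the plus strand of Chr06.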
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCACACTCTCCTTGACTTTCTATTCCTTCTTGTGGATAACTATGATGTTGAAAGGAAGGATAAAATAGCTTTGGGTGTGTCTTCAGCTTTTAGTGCACTTATCGAAAAAGGAGTAATTTCCTCATTGGACACTTTGATTTCTTTTGACGGTCTTTCTCCATTTCTACGAGACAGGCTTAGGATACTTTCATCAGGTAAGAAGTTTCAGGTTCCAAATGAATTGCAATTAGTTATACCTAATCAATCTGTGAAGCCTCTGCCTTCTTCGAGTAAATCCTGTGTAGAAACTGGCATAATATATTCGGAAAGCCATCCTAACCGCGTTGTAGCCCATGCAAGTGCTACATCTGCTGGTGCTTCTGTTCCTATTGTAGTTGATGTATCTGCCTCTCGTCATTCAGTTGTGACGGATGTACAGCAATTTGACAATATAGAAATTTTGGTGAAAAATCTTGGTGAAGTTACTAGCAAATCCTATAAAATGGGCCTCAAAACTCTGGAAGAACTTCTAGTTTTATTTCTCTCGCTTGATGACAATGCACAAGCTGGCAGAACAATAAACACTGAAATACTGTCTTCCAGAATAGTAAATACCTATAACTTGAGTGGGTATAAACTATTCTGTGCTCTTGAATTACCTCCAAATGGTCCCGATTATGATGATGAGATAGAATCTGCTACTGCCTTAATAATCCGTACCTTCATCTTTCATCATGAAAGCAATATACAAGAATTGCTTCTATTTTGTTCTAGGAATGGTTTGCCTGTGGGAGCACGATTGTTATCTTATGTATCTCGTCTGGCTTATGAGGCGAACAAAGCAGGTTTAATAGGTAATGTTGAGTTTGAGAACAGTGATAGTGCGGAAATTGATTCGAAGCCGCAGTTATTGTTGTTCCATCTGAATGGGTATTTTTCTTTCAGGAATGGTATGGGAGAAAACCCCCAAGATAGAGTTCTCTCTTTTTCTGAAATAGACAAAGAGGTGATTTCTAAGTTGGTAACAAATGCCTTTTCTGCTTATAGGTGTTTCCTTTCTTATTCAAAATATATTTTGCACAAAGATGCAGATGTATCTTTGACCAAGATCTTCTATCTTGATTTGATGTCCTGTGTGGAATGGAATGGAAGGAGAGTGAAATTCTTATTCCATTGTGTTTTTATCTCTCTCAGATTTATGCATATGCAAGGAGGAGATTGTTAAATTATTTGTAACCCTGTTGGATGACACTGATCTTGTTAATATGCAGTTTGAGATTATTGCAAAGAAATTTTCTGTGTTTGGTAAAGACACTAAATCTATTTTTCTTTTACTTAAGAGCTCTCTGAATTGGAGTTGTCTCGAACAACGTAAACTCTGGGGCTTGATAAGGTCAGAGCTTATAGTTTCACAGGTTCGGGTCGAGAGCATAGTTTTGAAACTTTTCTGCTTGGGTGTCTTAGATGCAAGCAAGCATGCCATTGCCATTGAAGGTCTTCTAAACTTGTGCTGTTATAATGCACCATCACCTGAGCTTGTTGAGGCAATCATGTTATTACCCAATGATTCATTTCACGACTTCTCCGCTGCGGTCTTGGCTTCCTGGGTTGTATCAAACGAGTCAATGCTATTTCATAGCCTGGTTGATTTTGCTGAGAAACTTGGCAAGATGAGTGAGAGTGAGTTTGTGGTAAATCATTCTGCAATCTTATGGTTGGTGAATCATTTTAGGGCACAAGGACTGAGTGACTCAACCATTTACAGCAACATCATATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAATATGTGGGATGGAAGAATCAAACCTTTAATTTTGAGATTGATAATGCAAGCATTATGTTGATTGAACTATGTTCATTTTGGTTAGGTCTATCCATATTGTAGTAAGAGGAGAAAGGAAGATATATTTAATTTAGAACATTATGTAAGTGTATATTTATAATTAGAAACAATAGATTTGTTTCTTTAAAGAAAAAAAAACACGAGTAA
TACTCAAGTGCAGTCCAAAAAATTTTTTACTACGTGACAACTTGTGATTTGACACCTAGATTGCATAAAGATGGATGCAGTCCGAGATGTATCAAATATTTTTAAAAACACAATAATTTTTAAACATTCATGAACATGTGTATTACACGTGACAATGGTAGTATTTTTAAAAATTGAGATTGAGACCCTCACGGTAAGGAAAGCATTTTTTATGCCATCTTTGAAATTAAACCGAGAAAAATAATTTGTCTTTTGAGCCAACTTTATGAAGATTCCTTTCCCTTTTGAGGCTTATTTTTCTTTTAGTACATGCCATGGCTTCAATTCCATACATATATATATTTTTTATAGTCTCATTATATAATCATATATGGTAAATAATTTCTAAAATTTAAGGCATACATCTTTGATCGGGAGGTCAAATGTTCAAAACTTTCGTTCATTTCTTTAAAGAAAAAAAAAATAGACCATAAATACATATTATTTAAGAATAAAAAACACATATTATTTATAACTATAAAAAATTTATTTATGTATGTACTTCTAATCAAATATAGAATGACGTTGAACTAAGTAAGACATTATAATCTATCTCAAAAAGGATTAAATATTCAAATTTCTACTCTTAAATATCATTTTGGTCCTTATGCTTTGGGATTTATTCACTTTTAATCCTTATACTTTCAAACGCCCAACTTTAGTCCTTGTATTTTCAATTTCAATAAATCTTAAATTTTGGTTCTAATAATAATAATAATAATAATAATTTTCTTGCAAGAAATTACATTAAATGTGAATATGATTCCAAAATTTTTAGCGATAATGCTAATAAATAATAAAAAAAAAATTAAAAAAAAATCAACAACCAACTAGCAAATAGAGACTAAATTTAATATTTATTGAAAGTATAGAGATTAAAATTGAATATTTAAAATATAGGGACTAAAATTGAATAAATTCAAAATGATATTTTAACCCAAAAAAATTAAATAGACAAGAATTTAAGAGTGACATAAAAGGCATATCAATTGTAAAAGAAGTAAAGAGTCTAGAAAAGATCTTATTTTATTGTTAAAAAAGGAAAAAGATCTTATTTCATCAAAGTCATTTTAAAATTATACTCAATTCTTACTCATGTTTAAACCGAATAAAAAGGGACAAGGAAGAGAGAGAAAGAAATGTGTAAAAGGTGGAGTACATAATTGTCAAATTCGATTTGGGGTTAAGTAAATCAATTTAATTTATTATTTTATTTTAAATTTTTTTTAAAAAATATTGCCCACAAGAAATTACTCTAATAGGGTGTCGTCTAGGTTCTACCTAATTTCATGCATATTTTATTTATGTAAAAAAGAGCAATTATATAAATGAACAAAACAAAACGACTCGAATATTCTTCAAAGCGGCCAGCCTTAAAAATGAACAAAACCATATTATATATAAGGGTAGCTTTGAGAAGATCTTCAAGGGGAGCATAAATAAACTCAGTAGGGAAAAAATCAGCAGGCCATGAAATTTGTAGAATCCCGCTGATAGATAAGAAAAAAAAATTTTGCAGAATCAGATGTTTTTCACGTTACAATATCCTTAACAATATCACAACATTTTGCAAGACATGTTGTAGAGGATAGGAAGTCACCCAACAAAGATGCACTACACATATCAACATCTAAGAAATCAATCTCAAGGCAGAATCAACACAACACATAACATATTACAGAAACAAGATGTTCAAATGTTGCCATAATAAAATCAGCCAATCATTTGCTGAAACAAACTCCCCAACAAATTGAATTAGATAATAGTTAAAAAGGTATTAAAAATCAAACTTGTTGGTTACAAATTTGTACATCTCACTCTCCGTTGTCTCTTCGTATAGCTATCATTAGTTATAGAAAAAGGGATGCACCCATCTCTATGCTTTCCCTTCCATCATTAACACCAAAAAAAATATAAAATATTTAAAAAGTAAACAAAAATAGTTTGTCAAATGTCACATT
ATGATAGGACAATTTGGTGAGATGTATAAAGATAGATGCTCTTCGAGATGCACCTAAATATTTTTCATAGAAAAATCACGCTCGAGCTATATCACTATTAAAAAAAAAAAAAAAATTCCAAGAGAGCGAGGGTGAATTTTTTTTTCTTTTTAATTAGACTTTTTTTTTCTTTAAATTGCTTCAGCAAAATAGAGGTGAAATCATGAAGGTGAGGTTTAAAGAATTTTTGCACATCAAATGGAGGTTTTGTTGGATTCAATTCTCAAAGGTAGCCATGAAGAAATAGAAGAAACCATATCAATATAATCTTCGATTTCAAGATGATTAAGAGGTGGGTACACCGTACACATAATCAAGATATTGTAATTCAATTCATAGCTATATATTTGGTGGAATGAAATTACGAGTTTGGCATAATAAAAATAAAAAATAAAAAATAAGGCCCACCATTATTGATTCTCAACGTAAAAGTTCGGTAAACATTAATCCATGTTATATCCTCACTCACTAATTAATTTGTTAGAAGTAAATTAAGCCTTAATCTTTATTGTACTAAAATACAGCTAAAATTCAAATACCGAAAGTTTTAGACATTTTTTCAACTTTTTTTTTTTTAAATTTTTTTTAAGTTACTGGCCACAATTTCCCATCTACAAACACATTTATTCCTTAATTAATTTATGGATTTATATATATCATAAATTTAATTAATTTTTAATAACATTTTTAACGACAAAGACTAAATATTTACAAAATTTTAAATACAAGAAATAATTTGTTGCATATTAACTCTCATTGAGCATAGTTTAACTAGCATACAAATGTGCTAGTGATCAAGAGGTCTATAATTTAAATTCCCTACCTTTATTGTACTAAAAAAAAGTTACAATTGTCATGTCAACGTTTTGTTAGGTCGATGAATAATTGTGCTATTTAAAACCATTTTTCACACTTATTAAAGTGTTCATTTTGAAATTTCATGTACTAAATAGACATCAAACTCAAACGACCCAAAAAAATGTACTTTGTTAAAAAAAAAAAAAAAAGGGTAGGGTAGAATACCTTTTTGGTTCCTAAGTTTTGAGTCTAATTTCTATTTGGTCCCTAGATTTCAAAATGTTACCCATTTAGTCCTTAAGTTTTGATTTTATTTTCAATTTAGTTCCTAAGTTTCAAAATGTTATAATTTTACCCTTAAGATTTAATTTTTGTTTCAATTTGATCCTTAAGTTTCAAGATCTTACATTTTTAACCTCAATTTTTAATTAAATACTCCCTTCCAATCATTGGCGCCAATATCTAGTAATTAATTTAAAATGTTATGAAGTGAAATTTTAAATTTAAGTTTAAAAGTGATGAAAAATAGTGAAACTTAATTAATTATAATTCTCTTAATTCTTTTAAATTTATTAAGATACTTTAACACTAAAGATGAAAAATGAATATTTAGTAAAAAATCGAAGTTAAAAATGTAAAATCTTGAAATAAACAAAAATTAAATCTCAAGGGTAAAATTGTAATATTTTAAAATTTAGAGACTAAATTGAAATCAAAATCAAAATTTAAGGACTAAATGTGTAATATTTTAAAACTTAGGGACAAAATAGAAATTCGATCTAAAATTTAGGAACCAAAAATATTTTAGCGAAAAGTTGATGAGTGAGTATTATTTAAAAAGTATAATATTTTATGTAATTAGTCCAAGAGAATAATGTTGCGTGCAGTTGCAGGGTCTGGTGGTTACAGTTGCAAATTCGAATAAGAATGAAGCGACCCCCAACGGTCACTTACTCGGACAATATTACCAATTAAATTCAAATTAATAACTCAAAAAGAAAAGAAACCCATAAGCTCTCCGCTCCAAATTTCAAATTTCCAGATCTTTTCTTTTTTTCCCTTTCAAACGAGAAGAACAATCTTCCCGCAAATTCCCAATAAACTTCCATCAAAACCCATTATTAACTCAATCTTCTTCTTAATTCTTCTTCTTCTTCAAAGTA
ATTTGAATTACAAAGCAACAGAAGTAGTGGGTTTGTGAATTACGAAGACAGAGTGATGGGTTGCTTTTTTGCTTGCTTTGGTTCATCCAAAGATCCCAAACGCCTCAAGAATCATCACAAATCTCCTCATACTCACCACCCTTCCACCATTAAGGTACTTAATCTTCTTTCTTTTTTGACGATTTTTCTACAAAACAAAACCCTAATTTCTGTCTATTTTTCCTCTTTTCTTTATCTTATGAGCAATAGAATTTGTGGGGAAACTGCGATTGGATAATTTCGCTAATTTGAATGTTGATTTTTGGTGCAGAGGAATGCGATTCCTAAACCCAGCCAATCTTCTGTTCATACGTTGCTTGATCACTCTAACAACAAACTCTTGCATCCGGTTCAAGCTGGGTGAGTACTTCAAGTTTCGATTGGTAATTTTATAATTTTATATTCACATGATTATTAATCGGTAGAATTTTCTTGTAGTATAAGTTCTTTAGAAGTTTGAACAAAAATCAAGATATAAAACACTATGGTGATGGCCTAAAATGATGTGATTTTCGTTCTGTATCACCTACTGCACCATTTCATATCTCAATTTCCATCTAAATTCTAAGGAATTTCTACTCCAAGAATATTTTTGAAACATTTATGAAAGTTTAAGGGCAATATTGCGACTTTTGAAATTATAAGGGTGTTTTTTAACCAAATGACAAAGTTCAAGGATATTTTTATAATTTAACTTAATTTTTATTTGAATTTTAGGTTTCAAAATGTTACATTGAATTTTGAGTTTAGATTTCTTTGTAGGTTTCAACATTTACCCTTTTGCTTCGACTTTCACTCATTTTCGGTCTTTAATATTAAATTCTCAGTTAATTAATTTAAAATAATTTTGAATGAAATTTTTTAATTAATTTTAATAGCATTAATTCTAAGGACTAAAAGTGAGTCTTTAGTAAAAATTTAGGAATAACGGTGTAACATTTTAAAACCTATAATCTGTATTTTTGGAACAATTTCAAAGTTGAACTATAAAAATACAACTTTTTAAGAACATAAGTGAAAGTGTGGAACCTACCTTGGGAGTATTTGGTATAATTTAACTCTATTTTTGGTTAGAAATAAGAATAATGTGGCTTATGATCATATGAATAGGCGTGATGGGTCAGCAGTGACTGGGAAAGCCCATGAGGATGAGCATGTTGTTTTACCCCAAGAAGATGCTGCTTCAGTTCATGGCAAGAAACAAGTCCTGAAGGAGGAGGAAGACAGTGTGGAGAATTCAAGCAAGCCTGAATCTTCATCTGAAGATTTTGTTGTTCCTCTCAATCCCAACTTCAAATCATCTTGCCCTCCGATTCATCGATACAGGAACTGTAAGGATAGTGATGATGAAGATGAGGTTTTTGATAGTCATCTTGCTAGTGATGAAAATGATGAATTTGGAATGGTTGAATCAATGGGGGAAGATTCTTCTGTGGCTGAGAGTTCAATGAATGTTTGTAGGCTAAACCCAACAAATGTGAGGCATAGGACTGCTTATGTTTGCTCTGTACTGAATCCAGTTGAAAATCTTTCTCAATGGAATGCTGTGAAATCAAAGAAGGAATTTCCATCAACACTTCAGAAAGAGAATGTGGAGTTAGAACAAGAATCAAGTGTTGAGGAATTTCCATACAGCAGTTCCAAGCCGAGTCGAGAACTGTGTGTTGATGCCAGCCTTTCTAACTGGTTGGCTTCATCAGAAGCAACACCAGTCAGTAAGATTACTGCAACAACGGCTTTAGAAGCCACCATTACTCCGGTGAAAAGCAGCATATTGCAAGGATCCAGCTTGCCGAAAAGAAGCAGCCACAGGGAGATGCCTGAAGTCAGAACGGTTGGTATGTATTGTAGGCAAGGAGCGAGTGATAAGGATCGTGATTCAGCTTCTTCGTTTAAAGGAATACCGAATACAACTAGCAAGTATAGAGAGGTACAGATTTGATGTAGTTACTTATGGTCTG
TTTGGAATAACTTTCTAAGCCCTTAAAAAAATGTTAGCAAATAGGTCCTTAGTGTAAGATGTGTTGAAGCTTTCGAACTAAGTTGTTCTTTTGTTTGTGAGTGCAGGATAAGACAGTGAATTGGCACTCTACACCATTTGAAACAAGGTTGGAGAGAGCTTTGAATAGTAGAGGAGTTGTAGCTTGA

mRNA sequence

ATGGTTCACACTCTCCTTGACTTTCTATTCCTTCTTGTGGATAACTATGATGTTGAAAGGAAGGATAAAATAGCTTTGGGTGTGTCTTCAGCTTTTAGTGCACTTATCGAAAAAGGAGTAATTTCCTCATTGGACACTTTGATTTCTTTTGACGGTCTTTCTCCATTTCTACGAGACAGGCTTAGGATACTTTCATCAGGTAAGAAGTTTCAGGTTCCAAATGAATTGCAATTAGTTATACCTAATCAATCTGTGAAGCCTCTGCCTTCTTCGAGTAAATCCTGTGTAGAAACTGGCATAATATATTCGGAAAGCCATCCTAACCGCGTTGTAGCCCATGCAAGTGCTACATCTGCTGGTGCTTCTGTTCCTATTGTAGTTGATGTATCTGCCTCTCGTCATTCAGTTGTGACGGATGTACAGCAATTTGACAATATAGAAATTTTGGTGAAAAATCTTGGTGAAGTTACTAGCAAATCCTATAAAATGGGCCTCAAAACTCTGGAAGAACTTCTAGTTTTATTTCTCTCGCTTGATGACAATGCACAAGCTGGCAGAACAATAAACACTGAAATACTGTCTTCCAGAATAGTAAATACCTATAACTTGAGTGGGTATAAACTATTCTGTGCTCTTGAATTACCTCCAAATGGTCCCGATTATGATGATGAGATAGAATCTGCTACTGCCTTAATAATCCGTACCTTCATCTTTCATCATGAAAGCAATATACAAGAATTGCTTCTATTTTGTTCTAGGAATGGTTTGCCTGTGGGAGCACGATTGTTATCTTATGTATCTCGTCTGGCTTATGAGGCGAACAAAGCAGGTTTAATAGGTAATGTTGAGTTTGAGAACAGTGATAGTGCGGAAATTGATTCGAAGCCGCAGTTATTGTTGTTCCATCTGAATGGGTATTTTTCTTTCAGGAATGATTTATGCATATGCAAGGAGGAGATTGTTAAATTATTTGTAACCCTGTTGGATGACACTGATCTTGTTAATATGCAGTTTGAGATTATTGCAAAGAAATTTTCTGTGTTTGGTAAAGACACTAAATCTATTTTTCTTTTACTTAAGAGCTCTCTGAATTGGAGTTGTCTCGAACAACGTAAACTCTGGGGCTTGATAAGGTCAGAGCTTATAGTTTCACAGGTTCGGGTCGAGAGCATAGTTTTGAAACTTTTCTGCTTGGGTGTCTTAGATGCAAGCAAGCATGCCATTGCCATTGAAGGTCTTCTAAACTTGTGCTGTTATAATGCACCATCACCTGAGCTTGTTGAGGCAATCATGTTATTACCCAATGATTCATTTCACGACTTCTCCGCTGCGGTCTTGGCTTCCTGGGTTGTATCAAACGAGTCAATGCTATTTCATAGCCTGGTTGATTTTGCTGAGAAACTTGGCAAGATGAGTGAGAGTGAGTTTGTGAGGAATGCGATTCCTAAACCCAGCCAATCTTCTGTTCATACGTTGCTTGATCACTCTAACAACAAACTCTTGCATCCGGTTCAAGCTGGGCGTGATGGGTCAGCAGTGACTGGGAAAGCCCATGAGGATGAGCATGTTGTTTTACCCCAAGAAGATGCTGCTTCAGTTCATGGCAAGAAACAAGTCCTGAAGGAGGAGGAAGACAGTGTGGAGAATTCAAGCAAGCCTGAATCTTCATCTGAAGATTTTGTTGTTCCTCTCAATCCCAACTTCAAATCATCTTGCCCTCCGATTCATCGATACAGGAACTGTAAGGATAGTGATGATGAAGATGAGGTTTTTGATAGTCATCTTGCTAGTGATGAAAATGATGAATTTGGAATGGTTGAATCAATGGGGGAAGATTCTTCTGTGGCTGAGAGTTCAATGAATGTTTGTAGGCTAAACCCAACAAATGTGAGGCATAGGACTGCTTATGTTTGCTCTGTACTGAATCCAGTTGAAAATCTTTCTCAATGGAATGCTGTGAAATCAAAGAAGGAATTTCCATCAACACTTCAGAAAGAGAA
TGTGGAGTTAGAACAAGAATCAAGTGTTGAGGAATTTCCATACAGCAGTTCCAAGCCGAGTCGAGAACTGTGTGTTGATGCCAGCCTTTCTAACTGGTTGGCTTCATCAGAAGCAACACCAGTCAGTAAGATTACTGCAACAACGGCTTTAGAAGCCACCATTACTCCGGTGAAAAGCAGCATATTGCAAGGATCCAGCTTGCCGAAAAGAAGCAGCCACAGGGAGATGCCTGAAGTCAGAACGGTTGGTATGTATTGTAGGCAAGGAGCGAGTGATAAGGATCGTGATTCAGCTTCTTCGTTTAAAGGAATACCGAATACAACTAGCAAGTATAGAGAGGATAAGACAGTGAATTGGCACTCTACACCATTTGAAACAAGGTTGGAGAGAGCTTTGAATAGTAGAGGAGTTGTAGCTTGA

Coding sequence (CDS)

ATGGTTCACACTCTCCTTGACTTTCTATTCCTTCTTGTGGATAACTATGATGTTGAAAGGAAGGATAAAATAGCTTTGGGTGTGTCTTCAGCTTTTAGTGCACTTATCGAAAAAGGAGTAATTTCCTCATTGGACACTTTGATTTCTTTTGACGGTCTTTCTCCATTTCTACGAGACAGGCTTAGGATACTTTCATCAGGTAAGAAGTTTCAGGTTCCAAATGAATTGCAATTAGTTATACCTAATCAATCTGTGAAGCCTCTGCCTTCTTCGAGTAAATCCTGTGTAGAAACTGGCATAATATATTCGGAAAGCCATCCTAACCGCGTTGTAGCCCATGCAAGTGCTACATCTGCTGGTGCTTCTGTTCCTATTGTAGTTGATGTATCTGCCTCTCGTCATTCAGTTGTGACGGATGTACAGCAATTTGACAATATAGAAATTTTGGTGAAAAATCTTGGTGAAGTTACTAGCAAATCCTATAAAATGGGCCTCAAAACTCTGGAAGAACTTCTAGTTTTATTTCTCTCGCTTGATGACAATGCACAAGCTGGCAGAACAATAAACACTGAAATACTGTCTTCCAGAATAGTAAATACCTATAACTTGAGTGGGTATAAACTATTCTGTGCTCTTGAATTACCTCCAAATGGTCCCGATTATGATGATGAGATAGAATCTGCTACTGCCTTAATAATCCGTACCTTCATCTTTCATCATGAAAGCAATATACAAGAATTGCTTCTATTTTGTTCTAGGAATGGTTTGCCTGTGGGAGCACGATTGTTATCTTATGTATCTCGTCTGGCTTATGAGGCGAACAAAGCAGGTTTAATAGGTAATGTTGAGTTTGAGAACAGTGATAGTGCGGAAATTGATTCGAAGCCGCAGTTATTGTTGTTCCATCTGAATGGGTATTTTTCTTTCAGGAATGATTTATGCATATGCAAGGAGGAGATTGTTAAATTATTTGTAACCCTGTTGGATGACACTGATCTTGTTAATATGCAGTTTGAGATTATTGCAAAGAAATTTTCTGTGTTTGGTAAAGACACTAAATCTATTTTTCTTTTACTTAAGAGCTCTCTGAATTGGAGTTGTCTCGAACAACGTAAACTCTGGGGCTTGATAAGGTCAGAGCTTATAGTTTCACAGGTTCGGGTCGAGAGCATAGTTTTGAAACTTTTCTGCTTGGGTGTCTTAGATGCAAGCAAGCATGCCATTGCCATTGAAGGTCTTCTAAACTTGTGCTGTTATAATGCACCATCACCTGAGCTTGTTGAGGCAATCATGTTATTACCCAATGATTCATTTCACGACTTCTCCGCTGCGGTCTTGGCTTCCTGGGTTGTATCAAACGAGTCAATGCTATTTCATAGCCTGGTTGATTTTGCTGAGAAACTTGGCAAGATGAGTGAGAGTGAGTTTGTGAGGAATGCGATTCCTAAACCCAGCCAATCTTCTGTTCATACGTTGCTTGATCACTCTAACAACAAACTCTTGCATCCGGTTCAAGCTGGGCGTGATGGGTCAGCAGTGACTGGGAAAGCCCATGAGGATGAGCATGTTGTTTTACCCCAAGAAGATGCTGCTTCAGTTCATGGCAAGAAACAAGTCCTGAAGGAGGAGGAAGACAGTGTGGAGAATTCAAGCAAGCCTGAATCTTCATCTGAAGATTTTGTTGTTCCTCTCAATCCCAACTTCAAATCATCTTGCCCTCCGATTCATCGATACAGGAACTGTAAGGATAGTGATGATGAAGATGAGGTTTTTGATAGTCATCTTGCTAGTGATGAAAATGATGAATTTGGAATGGTTGAATCAATGGGGGAAGATTCTTCTGTGGCTGAGAGTTCAATGAATGTTTGTAGGCTAAACCCAACAAATGTGAGGCATAGGACTGCTTATGTTTGCTCTGTACTGAATCCAGTTGAAAATCTTTCTCAATGGAATGCTGTGAAATCAAAGAAGGAATTTCCATCAACACTTCAGAAAGAGAA
TGTGGAGTTAGAACAAGAATCAAGTGTTGAGGAATTTCCATACAGCAGTTCCAAGCCGAGTCGAGAACTGTGTGTTGATGCCAGCCTTTCTAACTGGTTGGCTTCATCAGAAGCAACACCAGTCAGTAAGATTACTGCAACAACGGCTTTAGAAGCCACCATTACTCCGGTGAAAAGCAGCATATTGCAAGGATCCAGCTTGCCGAAAAGAAGCAGCCACAGGGAGATGCCTGAAGTCAGAACGGTTGGTATGTATTGTAGGCAAGGAGCGAGTGATAAGGATCGTGATTCAGCTTCTTCGTTTAAAGGAATACCGAATACAACTAGCAAGTATAGAGAGGATAAGACAGTGAATTGGCACTCTACACCATTTGAAACAAGGTTGGAGAGAGCTTTGAATAGTAGAGGAGTTGTAGCTTGA

Protein sequence

MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDRLRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAGASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDDNAQAGRTINTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLLLFHLNGYFSFRNDLCICKEEIVKLFVTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCLGVLDASKHAIAIEGLLNLCCYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESMLFHSLVDFAEKLGKMSESEFVRNAIPKPSQSSVHTLLDHSNNKLLHPVQAGRDGSAVTGKAHEDEHVVLPQEDAASVHGKKQVLKEEEDSVENSSKPESSSEDFVVPLNPNFKSSCPPIHRYRNCKDSDDEDEVFDSHLASDENDEFGMVESMGEDSSVAESSMNVCRLNPTNVRHRTAYVCSVLNPVENLSQWNAVKSKKEFPSTLQKENVELEQESSVEEFPYSSSKPSRELCVDASLSNWLASSEATPVSKITATTALEATITPVKSSILQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTVNWHSTPFETRLERALNSRGVVA
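The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is an illustrative helper written here, not a function taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 0.

    Unknown codons (e.g. containing N) become 'X'. With to_stop=True the
    translation ends at the first stop codon, which is not emitted.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last five codons of the CDS shown above.
print(translate("ATGGTTCACACTCTCCTTGAC"))
print(translate("GGAGTTGTAGCTTGA"))
```

The two fragments translate to `MVHTLLD` and `GVVA`, matching the start and end of the listed protein. Passing the full CDS (with the line break removed) should give the whole polypeptide if the page is internally consistent.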
Homology
BLAST of HG10010739 vs. NCBI nr
Match: XP_038879691.1 (integrator complex subunit 3 homolog isoform X1 [Benincasa hispida] >XP_038879692.1 integrator complex subunit 3 homolog isoform X1 [Benincasa hispida] >XP_038879693.1 integrator complex subunit 3 homolog isoform X1 [Benincasa hispida] >XP_038879694.1 integrator complex subunit 3 homolog isoform X1 [Benincasa hispida] >XP_038879695.1 integrator complex subunit 3 homolog isoform X1 [Benincasa hispida] >XP_038879696.1 integrator complex subunit 3 homolog isoform X1 [Benincasa hispida] >XP_038879697.1 integrator complex subunit 3 homolog isoform X1 [Benincasa hispida])

HSP 1 Score: 784.6 bits (2025), Expect = 7.9e-223
Identity = 428/561 (76.29%), Positives = 446/561 (79.50%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYDVERKDKIALGVSSAFSALI+KGVISSLDTLISFDGLSP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDVERKDKIALGVSSAFSALIKKGVISSLDTLISFDGLSPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LR+LSSGKKFQVPNELQL IPN SV PLPSSSKSC   G IYSESHP+ +V + +AT  G
Sbjct: 453 LRVLSSGKKFQVPNELQLFIPNHSVNPLPSSSKSC--AGFIYSESHPSCIVGNVNATPVG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
           +SVPIVVDVSA  HSVVTDVQQ DNIEILVKNLG VT KSYKMGLKTLEELLVLFLSL+D
Sbjct: 513 SSVPIVVDVSAPHHSVVTDVQQCDNIEILVKNLGAVTRKSYKMGLKTLEELLVLFLSLND 572

Query: 181 NAQAGRTINTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHH 240
           N QAGRTINTEILSSRIVNTY+L GYKLFCALELPPNG  Y+DEIESATALIIRTFIF+H
Sbjct: 573 NGQAGRTINTEILSSRIVNTYDLCGYKLFCALELPPNGSSYNDEIESATALIIRTFIFYH 632

Query: 241 ESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLLL 300
           E NIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGL GNVEFEN DSAEIDSKPQLLL
Sbjct: 633 EKNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLTGNVEFENIDSAEIDSKPQLLL 692

Query: 301 FHLNGYFSFRN------------------------------------------------- 360
           FHLNGY+ FRN                                                 
Sbjct: 693 FHLNGYYCFRNGMGGNPQDTVVSFSEIDKEVIAKLVTNAFSAYKCFLVYSKDILHKDADV 752

Query: 361 ---------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQF 420
                                            DLCICKEEIVKL VTLLDDTDLVNMQF
Sbjct: 753 SLTKLFYLDLMSCVEWNARRVKFLFHCIFYLLSDLCICKEEIVKLLVTLLDDTDLVNMQF 812

Query: 421 EIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCL 480
           EIIAKKF VFGKDTKSIFLL+KSSLNW CLEQRKLWGLIRSELIVS+V+VES+VLKLFCL
Sbjct: 813 EIIAKKFCVFGKDTKSIFLLVKSSLNWGCLEQRKLWGLIRSELIVSKVQVESLVLKLFCL 872
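The percentages on each HSP statistics line are the identity and positive counts divided by the alignment length. The sketch below parses such a line and recomputes them; the regular expression and the `parse_hsp_stats` name are assumptions made for illustration, and the pattern tolerates both spellings of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 428/561 (76.29%), Positives = 446/561 (79.50%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identities, length = int(m.group(1)), int(m.group(2))
    positives = int(m.group(3))
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "pct_identity": round(100 * identities / length, 2),
        "pct_positives": round(100 * positives / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 428/561 (76.29%), Positives = 446/561 (79.50%), Query Frame = 0"
)
print(stats["pct_identity"], stats["pct_positives"])
```

For the Benincasa hispida hit, 428/561 and 446/561 recompute to 76.29% and 79.5%, in agreement with the reported values.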

BLAST of HG10010739 vs. NCBI nr
Match: XP_016900895.1 (PREDICTED: integrator complex subunit 3 homolog [Cucumis melo])

HSP 1 Score: 738.4 bits (1905), Expect = 6.5e-209
Identity = 410/578 (70.93%), Positives = 440/578 (76.12%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDVQRKDKIALGVSSAFSALIEKGVISSLDTLISFGGISPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGII  ESHP+R+V + ++TS G
Sbjct: 453 LRILSSCKNFQVSNEVQLFVPDHSAKPLPSSTKSC--TGIIDLESHPSRIVGNLNSTSGG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
           ASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDD
Sbjct: 513 ASVPIVEDASASHHSVATNVQQCDKIEILVKNLGEVTRKSYKMGLKTLEELLVLFLSLDD 572

Query: 181 NAQAGRTI-NTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFH 240
           NAQ   TI + EILSSRI+NTY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFH
Sbjct: 573 NAQDSSTIFSPEILSSRILNTYDSSGHKLFCALELPPNGPGYDDEIESATALIVRTFIFH 632

Query: 241 HESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL 300
           HE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Sbjct: 633 HEKNIQQLLLFCSRNGLPVGARLLSYVTRLAYEVNKAGLTENVEFENSEKAEIDSNAQLL 692

Query: 301 LFHLNGYFSFRN------------------------------------------------ 360
           LFH+NGYFSFRN                                                
Sbjct: 693 LFHVNGYFSFRNGMGENPQETVLSFSGTDKEEIAKLVTNAFSAYRCFLAYSKDILHKDAD 752

Query: 361 ----------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQ 420
                                             DLCICKEEIVKL VTLLDDTDLVNMQ
Sbjct: 753 VSLTKVFYRDLMSCVEWNARRVKFLFHCIFDLLSDLCICKEEIVKLLVTLLDDTDLVNMQ 812

Query: 421 FEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFC 480
           FEIIAKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFC
Sbjct: 813 FEIIAKKFSVFGKDIKSIFLLVKNSLNWGCLEQRKLWGLIRSELIVSQVRVENIVSKLFC 872

Query: 481 LGVLDASKHAIAIEGLLNLCCYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESML 496
           LGVLDASKHAIAIEGLLNLCCYNAPSPE VEAIMLLPND+F  FSAAVLASW VSNESML
Sbjct: 873 LGVLDASKHAIAIEGLLNLCCYNAPSPEFVEAIMLLPNDAFDGFSAAVLASWAVSNESML 932
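The identity count of an HSP can also be recovered directly from the aligned Query and Sbjct rows. The following sketch counts identical columns and gap columns for one pair of aligned strings; `count_identities` is a hypothetical helper, and it assumes gaps are written as `-` as in the alignments above.

```python
def count_identities(query_row, sbjct_row):
    """Count identical columns and gap columns in one aligned row pair.

    Returns (identities, aligned_length, gap_columns). A column counts as a
    gap if either sequence has '-' there; gap columns are never identities.
    """
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    identities = 0
    gaps = 0
    for q, s in zip(query_row, sbjct_row):
        if q == "-" or s == "-":
            gaps += 1
        elif q == s:
            identities += 1
    return identities, len(query_row), gaps

# First 20 aligned residues of the query and the Cucumis melo subject.
print(count_identities("MVHTLLDFLFLLVDNYDVER", "MVHTLLEFLFLLVDNYDVQR"))
```

Summing these counts over all row pairs of an HSP should reproduce the numerator and denominator of its Identity field, provided the rows are copied without the coordinate columns.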

BLAST of HG10010739 vs. NCBI nr
Match: TYK27714.1 (integrator complex subunit 3-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 737.6 bits (1903), Expect = 1.1e-208
Identity = 405/561 (72.19%), Positives = 433/561 (77.18%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDVQRKDKIALGVSSAFSALIEKGVISSLDTLISFGGISPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGII  ESHP+R+V + ++TS G
Sbjct: 453 LRILSSCKNFQVSNEVQLFVPDHSAKPLPSSTKSC--TGIIDLESHPSRIVGNLNSTSGG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
           ASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDD
Sbjct: 513 ASVPIVEDASASHHSVATNVQQCDKIEILVKNLGEVTRKSYKMGLKTLEELLVLFLSLDD 572

Query: 181 NAQAGRTI-NTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFH 240
           NAQ   TI + EILSSRI+NTY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFH
Sbjct: 573 NAQDSSTIFSPEILSSRILNTYDSSGHKLFCALELPPNGPGYDDEIESATALIVRTFIFH 632

Query: 241 HESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL 300
           HE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Sbjct: 633 HEKNIQQLLLFCSRNGLPVGARLLSYVTRLAYEVNKAGLTENVEFENSEKAEIDSNAQLL 692

Query: 301 LFHLNGYFSFRN------------------------------------------------ 360
           LFH+NGYFSFRN                                                
Sbjct: 693 LFHVNGYFSFRNGMGENPQETVLSFSGTDKEEIAKLVTNAFSAYRCFLAYSKDILHKDAD 752

Query: 361 ----------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQ 420
                                             DLCICKEEIVKL VTLLDDTDLVNMQ
Sbjct: 753 VSLTKVFYRDLMSCVEWNARRVKFLFHCIFDLLSDLCICKEEIVKLLVTLLDDTDLVNMQ 812

Query: 421 FEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFC 479
           FEIIAKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFC
Sbjct: 813 FEIIAKKFSVFGKDIKSIFLLVKNSLNWGCLEQRKLWGLIRSELIVSQVRVENIVSKLFC 872

BLAST of HG10010739 vs. NCBI nr
Match: KAA0056027.1 (integrator complex subunit 3-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 735.3 bits (1897), Expect = 5.5e-208
Identity = 404/561 (72.01%), Positives = 432/561 (77.01%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDVQRKDKIALGVSSAFSALIEKGVISSLDTLISFGGISPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGII  ESHP+R+V + ++TS G
Sbjct: 453 LRILSSCKNFQVSNEVQLFVPDHSAKPLPSSTKSC--TGIIDLESHPSRIVGNLNSTSGG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
           ASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDD
Sbjct: 513 ASVPIVEDASASHHSVATNVQQCDKIEILVKNLGEVTRKSYKMGLKTLEELLVLFLSLDD 572

Query: 181 NAQAGRTI-NTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFH 240
           NAQ   TI + EILSSRI+NTY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFH
Sbjct: 573 NAQDSSTIFSPEILSSRILNTYDSSGHKLFCALELPPNGPGYDDEIESATALIVRTFIFH 632

Query: 241 HESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL 300
           HE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Sbjct: 633 HEKNIQQLLLFCSRNGLPVGARLLSYVTRLAYEVNKAGLTENVEFENSEKAEIDSNAQLL 692

Query: 301 LFHLNGYFSFRN------------------------------------------------ 360
           LFH+NGYFSFRN                                                
Sbjct: 693 LFHVNGYFSFRNGMGENPQETVLSFSGTDKEEIAKLVTNAFSAYRCFLAYSKDILHKDAD 752

Query: 361 ----------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQ 420
                                             DLCICKEEIVKL VTLLDDTDLVNMQ
Sbjct: 753 VSLTKVFYRDLMSCVEWNARRVKFLFHCIFDLLSDLCICKEEIVKLLVTLLDDTDLVNMQ 812

Query: 421 FEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFC 479
           FEI AKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFC
Sbjct: 813 FEISAKKFSVFGKDIKSIFLLVKNSLNWGCLEQRKLWGLIRSELIVSQVRVENIVSKLFC 872

BLAST of HG10010739 vs. NCBI nr
Match: XP_038879698.1 (uncharacterized protein LOC120071465 isoform X2 [Benincasa hispida])

HSP 1 Score: 722.2 bits (1863), Expect = 4.8e-204
Identity = 402/561 (71.66%), Positives = 420/561 (74.87%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYDVERKDKIALGVSSAFSALI+KGVISSLDTLISFDGLSP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDVERKDKIALGVSSAFSALIKKGVISSLDTLISFDGLSPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LR+LSS                                G IYSESHP+ +V + +AT  G
Sbjct: 453 LRVLSS--------------------------------GFIYSESHPSCIVGNVNATPVG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
           +SVPIVVDVSA  HSVVTDVQQ DNIEILVKNLG VT KSYKMGLKTLEELLVLFLSL+D
Sbjct: 513 SSVPIVVDVSAPHHSVVTDVQQCDNIEILVKNLGAVTRKSYKMGLKTLEELLVLFLSLND 572

Query: 181 NAQAGRTINTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFHH 240
           N QAGRTINTEILSSRIVNTY+L GYKLFCALELPPNG  Y+DEIESATALIIRTFIF+H
Sbjct: 573 NGQAGRTINTEILSSRIVNTYDLCGYKLFCALELPPNGSSYNDEIESATALIIRTFIFYH 632

Query: 241 ESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLLL 300
           E NIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGL GNVEFEN DSAEIDSKPQLLL
Sbjct: 633 EKNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLTGNVEFENIDSAEIDSKPQLLL 692

Query: 301 FHLNGYFSFRN------------------------------------------------- 360
           FHLNGY+ FRN                                                 
Sbjct: 693 FHLNGYYCFRNGMGGNPQDTVVSFSEIDKEVIAKLVTNAFSAYKCFLVYSKDILHKDADV 752

Query: 361 ---------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQF 420
                                            DLCICKEEIVKL VTLLDDTDLVNMQF
Sbjct: 753 SLTKLFYLDLMSCVEWNARRVKFLFHCIFYLLSDLCICKEEIVKLLVTLLDDTDLVNMQF 812

Query: 421 EIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFCL 480
           EIIAKKF VFGKDTKSIFLL+KSSLNW CLEQRKLWGLIRSELIVS+V+VES+VLKLFCL
Sbjct: 813 EIIAKKFCVFGKDTKSIFLLVKSSLNWGCLEQRKLWGLIRSELIVSKVQVESLVLKLFCL 872

BLAST of HG10010739 vs. ExPASy Swiss-Prot
Match: F4IDQ5 (Protein JASON OS=Arabidopsis thaliana OX=3702 GN=JASON PE=2 SV=1)

HSP 1 Score: 52.8 bits (125), Expect = 2.2e-05
Identity = 53/178 (29.78%), Positives = 79/178 (44.38%), Query Frame = 0

Query: 631 RHRTAYVCSVLNPVENLSQWNAVKSKKEFPSTLQKENVELEQESSVEEFPYSSSKPSREL 690
           R R+ +V SV N +EN S +   K   E      +E +E E  SS E +     + S E 
Sbjct: 290 RIRSQFVHSVSNIMENASLYKVYKDSHE--GLDYEEQIEAETPSS-ETYGEKVEESSDEK 349

Query: 691 C--VDASLSNWLASSEATPVSKITATTALEATITPVKSSILQGSSLPKRSSHREMPEVRT 750
               +AS S WL       ++ +   T     ITP    I                    
Sbjct: 350 LSKFEASFSPWLNQINEN-IAALNERTPGVGVITPGDRPI-------------------- 409

Query: 751 VGMYCRQGASDKDRDSASSF---KGIPNTTSKYREDKTVNWHSTPFETRLERALNSRG 804
           +G+   Q   ++  + +       GIPN+T+KY+ED+ V+WH+TPFE RLE+AL+  G
Sbjct: 410 IGLVAAQWIENEQTEISPKMWDGNGIPNSTTKYKEDQKVSWHATPFEVRLEKALSEEG 443

BLAST of HG10010739 vs. ExPASy Swiss-Prot
Match: Q55EZ4 (Integrator complex subunit 3 homolog OS=Dictyostelium discoideum OX=44689 GN=ints3 PE=3 SV=1)

HSP 1 Score: 48.1 bits (113), Expect = 5.3e-04
Identity = 23/63 (36.51%), Positives = 42/63 (66.67%), Query Frame = 0

Query: 1   MVHTLLDFLF-LLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRD 60
           M   L++F+   L+D+YD++RKD I  G+ ++F+ ++EKGV+ SL  +   D L P L +
Sbjct: 447 MTIDLVEFIVGTLLDSYDIQRKDAIRQGIHNSFATVLEKGVVQSLSHVFPPDSLGPSLFE 506

Query: 61  RLR 63
           +++
Sbjct: 507 KVK 509

BLAST of HG10010739 vs. ExPASy TrEMBL
Match: A0A1S4DYU4 (integrator complex subunit 3 homolog OS=Cucumis melo OX=3656 GN=LOC103491988 PE=3 SV=1)

HSP 1 Score: 738.4 bits (1905), Expect = 3.2e-209
Identity = 410/578 (70.93%), Positives = 440/578 (76.12%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDVQRKDKIALGVSSAFSALIEKGVISSLDTLISFGGISPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGII  ESHP+R+V + ++TS G
Sbjct: 453 LRILSSCKNFQVSNEVQLFVPDHSAKPLPSSTKSC--TGIIDLESHPSRIVGNLNSTSGG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
           ASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDD
Sbjct: 513 ASVPIVEDASASHHSVATNVQQCDKIEILVKNLGEVTRKSYKMGLKTLEELLVLFLSLDD 572

Query: 181 NAQAGRTI-NTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFH 240
           NAQ   TI + EILSSRI+NTY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFH
Sbjct: 573 NAQDSSTIFSPEILSSRILNTYDSSGHKLFCALELPPNGPGYDDEIESATALIVRTFIFH 632

Query: 241 HESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL 300
           HE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Sbjct: 633 HEKNIQQLLLFCSRNGLPVGARLLSYVTRLAYEVNKAGLTENVEFENSEKAEIDSNAQLL 692

Query: 301 LFHLNGYFSFRN------------------------------------------------ 360
           LFH+NGYFSFRN                                                
Sbjct: 693 LFHVNGYFSFRNGMGENPQETVLSFSGTDKEEIAKLVTNAFSAYRCFLAYSKDILHKDAD 752

Query: 361 ----------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQ 420
                                             DLCICKEEIVKL VTLLDDTDLVNMQ
Sbjct: 753 VSLTKVFYRDLMSCVEWNARRVKFLFHCIFDLLSDLCICKEEIVKLLVTLLDDTDLVNMQ 812

Query: 421 FEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFC 480
           FEIIAKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFC
Sbjct: 813 FEIIAKKFSVFGKDIKSIFLLVKNSLNWGCLEQRKLWGLIRSELIVSQVRVENIVSKLFC 872

Query: 481 LGVLDASKHAIAIEGLLNLCCYNAPSPELVEAIMLLPNDSFHDFSAAVLASWVVSNESML 496
           LGVLDASKHAIAIEGLLNLCCYNAPSPE VEAIMLLPND+F  FSAAVLASW VSNESML
Sbjct: 873 LGVLDASKHAIAIEGLLNLCCYNAPSPEFVEAIMLLPNDAFDGFSAAVLASWAVSNESML 932

BLAST of HG10010739 vs. ExPASy TrEMBL
Match: A0A5D3DW40 (Integrator complex subunit 3-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold225G00540 PE=3 SV=1)

HSP 1 Score: 737.6 bits (1903), Expect = 5.4e-209
Identity = 405/561 (72.19%), Positives = 433/561 (77.18%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDVQRKDKIALGVSSAFSALIEKGVISSLDTLISFGGISPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGII  ESHP+R+V + ++TS G
Sbjct: 453 LRILSSCKNFQVSNEVQLFVPDHSAKPLPSSTKSC--TGIIDLESHPSRIVGNLNSTSGG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
           ASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDD
Sbjct: 513 ASVPIVEDASASHHSVATNVQQCDKIEILVKNLGEVTRKSYKMGLKTLEELLVLFLSLDD 572

Query: 181 NAQAGRTI-NTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFH 240
           NAQ   TI + EILSSRI+NTY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFH
Sbjct: 573 NAQDSSTIFSPEILSSRILNTYDSSGHKLFCALELPPNGPGYDDEIESATALIVRTFIFH 632

Query: 241 HESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL 300
           HE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Sbjct: 633 HEKNIQQLLLFCSRNGLPVGARLLSYVTRLAYEVNKAGLTENVEFENSEKAEIDSNAQLL 692

Query: 301 LFHLNGYFSFRN------------------------------------------------ 360
           LFH+NGYFSFRN                                                
Sbjct: 693 LFHVNGYFSFRNGMGENPQETVLSFSGTDKEEIAKLVTNAFSAYRCFLAYSKDILHKDAD 752

Query: 361 ----------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQ 420
                                             DLCICKEEIVKL VTLLDDTDLVNMQ
Sbjct: 753 VSLTKVFYRDLMSCVEWNARRVKFLFHCIFDLLSDLCICKEEIVKLLVTLLDDTDLVNMQ 812

Query: 421 FEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFC 479
           FEIIAKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFC
Sbjct: 813 FEIIAKKFSVFGKDIKSIFLLVKNSLNWGCLEQRKLWGLIRSELIVSQVRVENIVSKLFC 872

BLAST of HG10010739 vs. ExPASy TrEMBL
Match: A0A5A7UN45 (Integrator complex subunit 3-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold319G001810 PE=3 SV=1)

HSP 1 Score: 735.3 bits (1897), Expect = 2.7e-208
Identity = 404/561 (72.01%), Positives = 432/561 (77.01%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLDTLISF G+SP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDVQRKDKIALGVSSAFSALIEKGVISSLDTLISFGGISPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LRILSS K FQV NE+QL +P+ S KPLPSS+KSC  TGII  ESHP+R+V + ++TS G
Sbjct: 453 LRILSSCKNFQVSNEVQLFVPDHSAKPLPSSTKSC--TGIIDLESHPSRIVGNLNSTSGG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
           ASVPIV D SAS HSV T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDD
Sbjct: 513 ASVPIVEDASASHHSVATNVQQCDKIEILVKNLGEVTRKSYKMGLKTLEELLVLFLSLDD 572

Query: 181 NAQAGRTI-NTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFH 240
           NAQ   TI + EILSSRI+NTY+ SG+KLFCALELPPNGP YDDEIESATALI+RTFIFH
Sbjct: 573 NAQDSSTIFSPEILSSRILNTYDSSGHKLFCALELPPNGPGYDDEIESATALIVRTFIFH 632

Query: 241 HESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL 300
           HE NIQ+LLLFCSRNGLPVGARLLSYV+RLAYE NKAGL  NVEFENS+ AEIDS  QLL
Sbjct: 633 HEKNIQQLLLFCSRNGLPVGARLLSYVTRLAYEVNKAGLTENVEFENSEKAEIDSNAQLL 692

Query: 301 LFHLNGYFSFRN------------------------------------------------ 360
           LFH+NGYFSFRN                                                
Sbjct: 693 LFHVNGYFSFRNGMGENPQETVLSFSGTDKEEIAKLVTNAFSAYRCFLAYSKDILHKDAD 752

Query: 361 ----------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQ 420
                                             DLCICKEEIVKL VTLLDDTDLVNMQ
Sbjct: 753 VSLTKVFYRDLMSCVEWNARRVKFLFHCIFDLLSDLCICKEEIVKLLVTLLDDTDLVNMQ 812

Query: 421 FEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFC 479
           FEI AKKFSVFGKD KSIFLL+K+SLNW CLEQRKLWGLIRSELIVSQVRVE+IV KLFC
Sbjct: 813 FEISAKKFSVFGKDIKSIFLLVKNSLNWGCLEQRKLWGLIRSELIVSQVRVENIVSKLFC 872
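
The summary lines printed above each alignment (bit score, expect value, identity and positive counts) can be read programmatically. The following is a minimal sketch, assuming the two-line layout shown on this page; the function name `parse_hsp` and the returned dictionary keys are hypothetical and not part of any BLAST tool. The pattern tolerates both the "Positives" and "Postives" spellings of the label.

```python
import re

# Patterns for the two HSP summary lines as they appear in this record.
# These are illustrative assumptions about the layout, not a BLAST API.
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp(score_line, identity_line):
    """Return the numeric fields of one HSP summary as a dict."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP summary lines")
    ident, ident_len, pos, pos_len = (int(g) for g in i.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        # Percentages are recomputed from the counts rather than trusted as text.
        "identity_pct": round(100.0 * ident / ident_len, 2),
        "positives_pct": round(100.0 * pos / pos_len, 2),
        "aligned_length": ident_len,
    }


hsp = parse_hsp(
    "HSP 1 Score: 735.3 bits (1897), Expect = 2.7e-208",
    "Identity = 404/561 (72.01%), Postives = 432/561 (77.01%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages from the raw counts gives a simple consistency check against the values printed in the record.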

BLAST of HG10010739 vs. ExPASy TrEMBL
Match: A0A0A0LXR4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G627430 PE=3 SV=1)

HSP 1 Score: 721.1 bits (1860), Expect = 5.2e-204
Identity = 396/560 (70.71%), Positives = 427/560 (76.25%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           +VHTLL+FLFLLVDNYDV+RKDKIALGVSSAFSALIEKGVISSLD LISF G+SP LRDR
Sbjct: 393 IVHTLLEFLFLLVDNYDVQRKDKIALGVSSAFSALIEKGVISSLDNLISFGGISPLLRDR 452

Query: 61  LRILSSGKKFQVPNELQLVIPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSAG 120
           LR+LSS KKFQV NE+QL +P+ S KPLPS +KSC   G+I SESHP+ +V +A +TS G
Sbjct: 453 LRVLSSCKKFQVSNEVQLFVPDHSAKPLPSLTKSC--AGMIDSESHPSCIVGNADSTSVG 512

Query: 121 ASVPIVVDVSASRHSVVTDVQQFDNIEILVKNLGEVTSKSYKMGLKTLEELLVLFLSLDD 180
            SVPIV D SAS HS  T+VQQ D IEILVKNLGEVT KSYKMGLKTLEELLVLFLSLDD
Sbjct: 513 VSVPIVEDASASYHSFATNVQQCDKIEILVKNLGEVTRKSYKMGLKTLEELLVLFLSLDD 572

Query: 181 NAQAGRTI-NTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIESATALIIRTFIFH 240
           NAQ   TI   EILSSRI+NTYN SG+KLFCALELPPNGP YDDEIESATALIIRTFIFH
Sbjct: 573 NAQDSSTIFCPEILSSRILNTYNSSGHKLFCALELPPNGPSYDDEIESATALIIRTFIFH 632

Query: 241 HESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFENSDSAEIDSKPQLL 300
           HE NI +LLLFCSRNGLPVGARLLSYV+RLAYEANKAGL  NVEFENS+ AE+DS  QLL
Sbjct: 633 HEKNILQLLLFCSRNGLPVGARLLSYVTRLAYEANKAGLTENVEFENSEKAEMDSNTQLL 692

Query: 301 LFHLNGYFSFRN------------------------------------------------ 360
           LFH+NGYFSFRN                                                
Sbjct: 693 LFHVNGYFSFRNGMGEYPQETVLSFSGINKEEIAKLVTNAFSAYRCFLAYLKDILHKDAD 752

Query: 361 ----------------------------------DLCICKEEIVKLFVTLLDDTDLVNMQ 420
                                             DLC+CKEEIVKL VTLLDDTDLVNMQ
Sbjct: 753 VSLTKVFYRDLMSCVEWNARRVKFLFHCIFDLLSDLCLCKEEIVKLLVTLLDDTDLVNMQ 812

Query: 421 FEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVSQVRVESIVLKLFC 478
           FEIIAKKF VFGKD KSIFLL+KSSLNW CLEQRKLWGLIRSEL+VSQVRVE+IV KLFC
Sbjct: 813 FEIIAKKFCVFGKDIKSIFLLVKSSLNWGCLEQRKLWGLIRSELVVSQVRVENIVSKLFC 872

BLAST of HG10010739 vs. ExPASy TrEMBL
Match: A0A6J1ERY7 (integrator complex subunit 3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435309 PE=3 SV=1)

HSP 1 Score: 703.0 bits (1813), Expect = 1.5e-198
Identity = 388/575 (67.48%), Positives = 419/575 (72.87%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           MVHTLL+FLFLLVDNYD++RKDKIALGVSSAFSAL+EK VI SLD LISFDGLSP LRDR
Sbjct: 393 MVHTLLEFLFLLVDNYDIKRKDKIALGVSSAFSALVEKRVIFSLDALISFDGLSPILRDR 452

Query: 61  LRILSSGKKFQVPNELQLV-IPNQSVKPLPSSSKSCVETGIIYSESHPNRVVAHASATSA 120
           LRILSSG+K QVP E QL  +P+ S+KP    SKSC ETG+IYSE  P+ +VAH SATS 
Sbjct: 453 LRILSSGRKVQVPKESQLFGVPDHSIKPHSPPSKSCAETGVIYSERQPSSIVAHGSATSV 512

Query: 121 GASVPIVVDVSASRHSVV-------------TDVQQFDNIEILVKNLGEVTSKSYKMGLK 180
           GASVP+VVDVSAS HSVV              DV+Q DN+EILVK LGEV  KSYKMGLK
Sbjct: 513 GASVPVVVDVSASHHSVVMDDVCASHHSVVADDVRQCDNVEILVKKLGEVIRKSYKMGLK 572

Query: 181 TLEELLVLFLSLDDNAQAGRTINTEILSSRIVNTYNLSGYKLFCALELPPNGPDYDDEIE 240
           TLEELLVLFLSLDDNAQA RTINTEILSSRIVNTY LSGY LF ALEL PN P YDDEI 
Sbjct: 573 TLEELLVLFLSLDDNAQASRTINTEILSSRIVNTYELSGYNLFSALELLPNDPSYDDEIG 632

Query: 241 SATALIIRTFIFHHESNIQELLLFCSRNGLPVGARLLSYVSRLAYEANKAGLIGNVEFEN 300
           SATALIIRTFIF H   +QELLLFCSRNGLPVGARLLSYVSRLAYE NKAGL GN + +N
Sbjct: 633 SATALIIRTFIFDHGKKLQELLLFCSRNGLPVGARLLSYVSRLAYEVNKAGLTGNSDIKN 692

Query: 301 SDSAEIDSKPQLLLFHLNGYFSFRN----------------------------------- 360
           SD AEIDSK Q L+FH+NGY+SFRN                                   
Sbjct: 693 SDGAEIDSKNQFLMFHMNGYYSFRNGMKENPQEAVVSFSKIDKEVIAELVTNAFSAYRSF 752

Query: 361 -----------------------------------------------DLCICKEEIVKLF 420
                                                          D+CICKEEIVKL 
Sbjct: 753 LANSKDILYKDADVSLTKVFYLDLVSCVERNARRAKHLFYCVFDLLSDICICKEEIVKLL 812

Query: 421 VTLLDDTDLVNMQFEIIAKKFSVFGKDTKSIFLLLKSSLNWSCLEQRKLWGLIRSELIVS 480
           VT LDDTDLVNMQFEII KKF VFGKD +SIFLL+KSSLNW C EQ KLWGLIRSELIVS
Sbjct: 813 VTQLDDTDLVNMQFEIIKKKFCVFGKDAESIFLLVKSSLNWGCFEQHKLWGLIRSELIVS 872

BLAST of HG10010739 vs. TAIR 10
Match: AT1G04030.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G44040.1); Has 1835 Blast hits to 1511 proteins in 238 species: Archae - 7; Bacteria - 164; Metazoa - 377; Fungi - 135; Plants - 187; Viruses - 22; Other Eukaryotes - 943 (source: NCBI BLink). )

HSP 1 Score: 126.3 bits (316), Expect = 1.1e-28
Identity = 132/391 (33.76%), Positives = 178/391 (45.52%), Query Frame = 0

Query: 481 IPKPSQSSVHTLLDHSNNKLLHPVQAGRDGSAVTGKAHEDEHVVLPQEDAASVHGKKQVL 540
           IPK S   +  + D +  K   P    R       K    EHVV   E++  +  +    
Sbjct: 50  IPKASVIPITEICDEAEEK-CSPSTISRKRVTFDSKVKTYEHVV--SEESVELSEE---- 109

Query: 541 KEEEDSVENSSKPESSSEDFVVPLNPNFKSSCPPIHRYRNCKDSDD---EDEVFDSHLAS 600
           K EE   E  S   S ++D ++ +  N   S P  HRY+NC++SDD   EDE   S    
Sbjct: 110 KNEEVESEKRSLKSSKTDDQIIEVASNSSGSYPENHRYKNCRESDDDIEEDEFDCSDSDL 169

Query: 601 DENDEFGMVESMGEDS---------------SVAESSMNVCRLNPTNVRHRTAY-VCSVL 660
           DE++E+       EDS                  E    + R N T VR    Y    VL
Sbjct: 170 DEDEEYYSDVGFSEDSLHNPTKEVYTQDIGDKTEEIDSKLRRSNET-VRDGNHYDGQGVL 229

Query: 661 NPVENLSQWNAVKSK---KEFPSTLQKENVELEQESSVEEFPYS----------SSKP-- 720
           NPVENL+QW + KSK   K+  S  +  N   +QE   +   +           S KP  
Sbjct: 230 NPVENLTQWKSAKSKGRTKQKQSQKENSNFIADQEEKRDSSSFGTDPQIDDITLSVKPKC 289

Query: 721 --------SRELCVDASLSNWLASSEA------------TPVSKITATTALEATI----- 780
                   ++EL VDASLS WL++SE+            TP  K+ +T+     +     
Sbjct: 290 RIEPKKLRNQELAVDASLSTWLSTSESGSECNSASMYTLTP-EKLKSTSCYSKPLRINHD 349

Query: 781 -TPVKSSI-------LQGSSLPKRS---SHREMPEVRTVGMYCRQGASDKDRDSASSFKG 802
             PV  ++          +S P++S   S  E P + TVG Y    +   D  SASSFKG
Sbjct: 350 DRPVLCALTLEDIKQFSATSTPRKSPSKSPDETPIIGTVGGYWGNRSKAIDCGSASSFKG 409

BLAST of HG10010739 vs. TAIR 10
Match: AT5G44040.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G04030.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 71.6 bits (174), Expect = 3.2e-12
Identity = 74/226 (32.74%), Positives = 115/226 (50.88%), Query Frame = 0

Query: 540 LKEEEDSVENSSKPESSSEDFVVPLNPNFKSSCPPIHRYRNCKDSDDEDE----VFDSHL 599
           L EE+     S +   SSE   V    N   S P  HRY+NC++SDDE+E      DS L
Sbjct: 131 LFEEKKEEVKSRQARCSSEGSDV--TSNSSGSYPSNHRYQNCRESDDEEEDVTDCDDSDL 190

Query: 600 ASDENDEFGMVES--MGEDS---------------SVAESSMNVCRL---NPTNVRHRTA 659
              ++D+ G+++     +D+                +A++ M++ R+      +VR R+ 
Sbjct: 191 EDTDDDDCGLLDDDYYNDDNYEDKLHNWDKVVYTEEIADNVMDIERVEEKGSVSVRDRSG 250

Query: 660 YVCSVLNPVENLSQWNAVKSKKEFPSTLQ--KENV---ELEQESSVEEF--PYSSSKPSR 719
           YV +VLNP+ENLSQW AVK+K    +  Q  KENV       ES V++    +S ++ SR
Sbjct: 251 YVNAVLNPIENLSQWKAVKAKGRTTTQTQPRKENVIIASFSLESQVDDLSSTFSLNRKSR 310

Query: 720 ---------ELCVDASLSNWLASSEATPVSKITATTALEATITPVK 726
                    E+ VDASLS WL++S+ T     +  +++E T++  K
Sbjct: 311 DETEKQRTQEIAVDASLSTWLSTSQTT----TSGCSSVETTMSEKK 350

BLAST of HG10010739 vs. TAIR 10
Match: AT4G14590.1 (embryo defective 2739 )

HSP 1 Score: 57.0 bits (136), Expect = 8.1e-08
Identity = 36/98 (36.73%), Positives = 54/98 (55.10%), Query Frame = 0

Query: 1   MVHTLLDFLFLLVDNYDVERKDKIALGVSSAFSALIEKGVISSLDTLISFDGLSPFLRDR 60
           + H+LL+FL  LV+ YD+ R+D I  G++SAF  +  KGVI SLD  ++   L+P L+ +
Sbjct: 375 ITHSLLEFLLHLVETYDITRRDTIVRGLTSAFREIERKGVIRSLDIFLANPALAPDLKKK 434

Query: 61  LRILSS---GKKFQVPNELQLVIPNQSVKPLPSSSKSC 96
           L  L S    K   V      V+  Q+V    ++ K C
Sbjct: 435 LANLLSCHQEKTVHVNLNQVSVLSKQTVLSSEANLKDC 472

BLAST of HG10010739 vs. TAIR 10
Match: AT2G30820.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G06660.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 53.5 bits (127), Expect = 9.0e-07
Identity = 29/79 (36.71%), Positives = 44/79 (55.70%), Query Frame = 0

Query: 725 KSSILQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTV 784
           K  I + SS P   +  + P +  V  +  +    +         GIPN+T+KY+ED+ V
Sbjct: 311 KKQIGEISSSPLSINPGDRPIIGMVAAHWNEKEHSQISPKWWDGNGIPNSTNKYKEDQKV 370

Query: 785 NWHSTPFETRLERALNSRG 804
           +WH+TPFE RLE+AL+  G
Sbjct: 371 SWHATPFEERLEKALSEEG 389

BLAST of HG10010739 vs. TAIR 10
Match: AT2G30820.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G06660.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 53.5 bits (127), Expect = 9.0e-07
Identity = 29/79 (36.71%), Positives = 44/79 (55.70%), Query Frame = 0

Query: 725 KSSILQGSSLPKRSSHREMPEVRTVGMYCRQGASDKDRDSASSFKGIPNTTSKYREDKTV 784
           K  I + SS P   +  + P +  V  +  +    +         GIPN+T+KY+ED+ V
Sbjct: 311 KKQIGEISSSPLSINPGDRPIIGMVAAHWNEKEHSQISPKWWDGNGIPNSTNKYKEDQKV 370

Query: 785 NWHSTPFETRLERALNSRG 804
           +WH+TPFE RLE+AL+  G
Sbjct: 371 SWHATPFEERLEKALSEEG 389

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038879691.1  7.9e-223  76.29         integrator complex subunit 3 homolog isoform X1 [Benincasa hispida] >XP_03887969...
XP_016900895.1  6.5e-209  70.93         PREDICTED: integrator complex subunit 3 homolog [Cucumis melo]
TYK27714.1      1.1e-208  72.19         integrator complex subunit 3-like protein [Cucumis melo var. makuwa]
KAA0056027.1    5.5e-208  72.01         integrator complex subunit 3-like protein [Cucumis melo var. makuwa]
XP_038879698.1  4.8e-204  71.66         uncharacterized protein LOC120071465 isoform X2 [Benincasa hispida]

Match Name  E-value  Identity (%)  Description
F4IDQ5      2.2e-05  29.78         Protein JASON OS=Arabidopsis thaliana OX=3702 GN=JASON PE=2 SV=1
Q55EZ4      5.3e-04  36.51         Integrator complex subunit 3 homolog OS=Dictyostelium discoideum OX=44689 GN=int...

Match Name  E-value   Identity (%)  Description
A0A1S4DYU4  3.2e-209  70.93         integrator complex subunit 3 homolog OS=Cucumis melo OX=3656 GN=LOC103491988 PE=...
A0A5D3DW40  5.4e-209  72.19         Integrator complex subunit 3-like protein OS=Cucumis melo var. makuwa OX=1194695...
A0A5A7UN45  2.7e-208  72.01         Integrator complex subunit 3-like protein OS=Cucumis melo var. makuwa OX=1194695...
A0A0A0LXR4  5.2e-204  70.71         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G627430 PE=3 SV=1
A0A6J1ERY7  1.5e-198  67.48         integrator complex subunit 3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC1114...

Match Name   E-value  Identity (%)  Description
AT1G04030.1  1.1e-28  33.76         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G44040.1  3.2e-12  32.74         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G14590.1  8.1e-08  36.73         embryo defective 2739
AT2G30820.1  9.0e-07  36.71         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G30820.2  9.0e-07  36.71         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description               Source       Source Term  Source Description            Alignment
IPR019333  Integrator complex subunit 3  PFAM         PF10189      Ints3                         coord: 2..62; e-value: 2.0E-10; score: 40.4
IPR019333  Integrator complex subunit 3  PANTHER      PTHR13587    INTEGRATOR COMPLEX SUBUNIT 3  coord: 1..535
None       No IPR available              MOBIDB_LITE  mobidb-lite  disorder_prediction           coord: 756..787
None       No IPR available              MOBIDB_LITE  mobidb-lite  disorder_prediction           coord: 522..562
None       No IPR available              MOBIDB_LITE  mobidb-lite  disorder_prediction           coord: 534..552
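
Two of the three disorder predictions listed above (522..562 and 534..552) overlap, so counting disordered residues requires merging the intervals first. The following is a minimal sketch, assuming the `coord: start..end` notation used in this table with inclusive coordinates; the helper names `parse_coord` and `merge_intervals` are hypothetical.

```python
import re


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' string."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError("no coordinate pair found in: " + text)
    return int(m.group(1)), int(m.group(2))


def merge_intervals(intervals):
    """Merge overlapping inclusive intervals and return them sorted."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# The three MobiDB-lite disorder predictions from the table above.
disorder = [parse_coord(c) for c in
            ("coord: 756..787", "coord: 522..562", "coord: 534..552")]
regions = merge_intervals(disorder)
# Inclusive coordinates, so each region spans end - start + 1 residues.
total = sum(end - start + 1 for start, end in regions)
print(regions, total)
```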

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10010739.1  HG10010739.1  mRNA