HG10001431 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10001431
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: NOT2 / NOT3 / NOT5 family
Location: Chr09: 17098283 .. 17102722 (+)
RNA-Seq Expression: HG10001431
Synteny: HG10001431
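
The Location field above packs chromosome, start, end, and strand into one string. As a minimal sketch, it can be split into its parts and the genomic span computed as follows. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'Chr09: 17098283 .. 17102722 (+)' into its parts."""
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr09: 17098283 .. 17102722 (+)")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 4440 bp on the forward strand of Chr09.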
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGCTCTTGTTCTTTGCAGTCATCTCTTAATGGATCAGCTTCAAATCTTCCAGATGGTACTGGGCGATCTTTTACTACTTCATTTTCTGGTCAGTCTGGTGCTGCCTCCCCTGTTTTTCATCACTCTGGTGGGTGCATGTTCCTAGAGTTTGTAGTTTTCTTTCAAATTTATGAAACTTAGAGGTTTCATTTTCCAGGAGGAGGGTTGCATAACATTCATGGAAGCTTCAATATTCAAAACATGTCAGGTGCACTAACTTCAAGAAATTCAACGATAAATAATGTTCCATCTGGTGGGGTGCAGCAACCTACTGGAACACTTTCCAGTGGGCGTTTTGCATCGAACAACCTTCCTGTTGCTCTTTCTCAGGTATCTTTCATAGTCAGGGACTCAGGCTATTTATTCATCATTGAATGCGGTGTTTATTGACTGAGATCCTTGGTTTCAGTTTCCCCTGTAGTTTCTTTGAGTTGCATTTATGGTGGTCATGAAAGATTAACTGTTGAATGATAACATAATTTAGATATGATCACTTTGCTTAGTGAAGGAAAATACACTCGTTTATGATTTTGAAAAATTGTTGTATAGACATTTACTATCAGATAAATTTGTTGAAGAAATGATGGAGAGCTGCTTTACAAATTAGATACCAGATCATAATGTTCATAATTAGGCTTGTTATAGGATTTTTTTATCATTGAAGTCGATAAAATGACTGTCTTTTAAGAGATCTAGAAGCAATGAAGGTGGAACCTTAATCTATCCTCTCACTAACCCTTGTAATTGTGTGCTTCCACTCCATTTTAATTGTTTTTTTCTCTTCTGTAGTTGTCTCATGGCAGCTCCCATGGGCATTCAGGAGTCGCAAGTAGAGGAGGTATAAGTGTTGTTGGAAACCCTGGATTTAGTAGTAGCACAAACGCAGTTGGCGGTTCTATTCCTGGGATTCTGTCTACTTCTGCTGCTATTGGTAATCGAAATGCTGTTCCAGGATTGGGTGTATCCCCAATTTTGGGAAATGCAGGTCCTCGGATCACAAGTTCAATGGGAAATATGGTCAGTGGAGGCAACATAGGAAGGAGTATAACTACGGGTGGGGGATTGTCATTACCTGGTCTTGCTTCTCGTCTAAACCTTGGTGCAAATAGTGGATCCGGAAGCTTAACTGTGCAAGGACAAAACCGTCTAATGAGTGGGGTGCTACCACAAGGTATGATCCCTTGAATTTGTGGACGAATGTTTGCATTTATGTACATTATCCAGCATCTAATAGTAATGTTTTTATTCTGTACTTGGTTTTCTAACCACTACATGGGTATCTAGATGCATCTTACAATAATGGTGGTATTCTTTGCAAATGTATTTTGAGATTTTGTAGTTACTGAGCCACTATCTTGGAATTTGACTATTTCTTTCATTAATAATTTCATACAAACTAGACATTAGCTAGTGGGAGAACGTCATATCCTCCCCGTTATTACACCTTTTGCTCAAAGATGAAAATATCATATTAGATGTTAGCATAATTAAAATGAATCATTTTGTTCTGTTTGAAAGAGTAGTGTGCAGCATTGGAAACTATTTACTCATTTTACATGATTATGTAGTACTATTTTATTCTATACTGTTATTTATGTGTTATGTATTTGCTATTTATGATTTAATTTGTTATGTTCTGAAGTGTGCACATTATGTTTCCATGGATGTGCGTTTTTACATTTTTCCTACAACTATCTTATTTCTGAGGCTATATTGACTGGAAAATGGAAATACATTCATTACTATGTTGTTATATTTATTCTCACATCTCTATTCATGCTATATTTTAGTTGAAAATGTCATGTTCCATTTTGTCGTTGATGCCATGTTCTTGTTTCAGGATCTCAACAGGTCATTTCTATGTTAAGTAATTCTTATCCTAGTGCTGGAGGTCCCCTTTCCCAAAACCACATGCAGAGTGTGAATAGTTTGAATTCTTTGGGGATGTTGAATGATGTGAA
CACCAGTGACAATTCTCCTTTTGACATTAATGATTTTCCTCAGTTAACAAGTCGTCCAAGTTCAGCAGGAGGGCCTCAAGGACAATTAAGTTAGTAGCTAGATATTGTTGGGCTTTTCCTTTCCTTCTCTTCTTTTTCCTTTCTTTTCTCTTCTCTGTGCCAGTTTTGACCTATAGGGAGTCATGCTATTGTTTCAGGTTCGCTGAGAAAGCAGGGCCTGAGTCCTATTGTTCAACAAAACCAAGAGTTCAGCATTCAGAATGAAGACTTTCCAGCATTACCTAGATTTAAAGGTGCAAATCCCTTCTTTGCATTACCTAGTAGGTTTCATTCATGTAAATCATAGGCCAAAACTTCGTTTGCTGTACTCTAGCACTTTTTTTTTTTTTTTTTGGGAAAAAGGAGATTAAAATACAATAATGTTGGAGAACTGTTATTTGGAAATATTCTAGTTGACCTTGAATAGTTGCTGGATAGATGGAGTAGAATTTGAGATTTCGTTTTTCCCAGTAGGTCGTTGCAATTATTAAGCTCAGGTGCTTCTTTCATAGGGAGTTAGAATTGTTTCTGTTTTCTCTTCGGATTGTAAATTTATAGCATGGCACAAGTTAACGTTTTGCTTACTTTGAATATAAAATAGGACACTTTTTATCATTGATTATAGTTCTAGATCTTGTGCAGGTGGCAATGCTGATTATGGTATGGACATTCATCAGAAAGATCAACATGAAAATTCTGTGCCTATGATGCAGTCTCAGCAGTTCTCTGTGGGAACACGTTTTACTCAATCTTTTGCTAACTATCATCATGTGGCAATGGGGTGCTTATCAACTTTTCCCTTGTTTTGATCTTCAGATTGGAAGGTCTGCTGGATTTAACCTAGGGGGCACCTATACACATCGACCCCAGCAGCAGCAACAGCATTCTTCAGCCGTCAGTAACAGCACGGTCTCCTTTCCACCTGGAAATAATCAGGATCTCCTCCATTTACACGGATCAGATATATTCCCATCTTCACATCCTGCATCCTATCACCAGCAGGTGTGCGATTCTATTGCTCTAGTCTAAAGGAAGTAAATTTTTTTCCCCCTGGTATTTAAGCTTTTACTTCGCAGACATAAGGAATCAAACTGTTGCATTTTCAAATATTTCCACTCTTGGTTTTTCTTAACTGAAATGCAGTTTCAAGTTTCAACCTTTTGACGTCAATCATTCTCATGTATTAAAAAATTCAACGGGATTAAATGTTTTGTAGATTAGTAGCTTATATGTCTTAAATGTCTTCTCCATTGATAGTTTTCAATATCGATGTTTCTTTGTTCATGTTCAGTCTAGTGGGCCTCCTGGTATTGGTTTAAGACCTCTGAGCTCTCCTAATTCAGCTTCTGGAATGAGTTATGACCAACTTATCCAGCAATATCAGCAGCCTCACGGTCAATCTCAGTTTCGATTGCAACATATGTCTGGTGTTAGCCAGTCATTTAGAGACCAGGGCATGAAATCTATGCAGGCGGCTCAATCTTCTCCTGATCCATTTGGTTTACTTGGTTTGTTAAGTGTGATAAGGTTGAGTGATCCCGATCTTGCATCCCTTGCGCTCGGAATTGATTTGACCACGTTAGGATTAAATTTGAATTCAGCAGATAACCTTCACAAGACTTTTGGCTCCCCATGGTCTGATGAGCCTGCTAAGGGTGATCCCGATTTCAATGTACCTCAGTGCTACCTTATTAAACCACCACCTACACTACATGTGAGGTTCTTCACCTTTCATATCCCTCTCCTCGTCTAGGTGAAACTTCCGACAGTTTCGTCTCAACTGATCATTTTTTTGGTAATTGCAGCAAGGATACTTCTCAAAATTTACTCTGGAGACACTGTTTTATATATTTTTCAGGTGTGGTGTATTTAAATAATTTTTCGGTCTCTTTTTTCCTTTTTTAACTTTTCTTTTCTTAAACATAATTCCACTTTTTCTTGGTCTGCAGTATGCCAAAAGA
TGAAGCTCAGTTGTATGCTGCAAATGAACTGTAAGCTCTTATTATTTACTTACTGTGCAGTTTGTTATTTTTACCAAAGTCATAAACGTAATTTATGGTAAACTTTTTCCATTCATTGATTTTCTAATTTTTTGTAATAGATATAATAGAGGCTGGTTCTATCACAAAGAACAACGATTCTGGTTTATTCGGGTCTCTAACATGGAACCACTTGTGAAGACTAACACTTATGAGCGAGGATCGTACCTCTGTTTCGACCCCCAAACATTTGAAACTGTCCGCAAGGTTTGTACTTGTCCATATCTTTCATTGAAATTATTCATTCTGTCTGTTGCCCTGTTCCCATTGCTCATGTTACGTCTATCGTCTATTGGCTTCAGGATAATTTTGTTCTTCACTACGAGATGGTAGAAAAGAGACCAGCTCTACCGCAACATTAA

mRNA sequence

TGCTCTTGTTCTTTGCAGTCATCTCTTAATGGATCAGCTTCAAATCTTCCAGATGGTACTGGGCGATCTTTTACTACTTCATTTTCTGGTCAGTCTGGTGCTGCCTCCCCTGTTTTTCATCACTCTGGTGCACTAACTTCAAGAAATTCAACGATAAATAATGTTCCATCTGGTGGGGTGCAGCAACCTACTGGAACACTTTCCAGTGGGCGTTTTGCATCGAACAACCTTCCTGTTGCTCTTTCTCAGTTGTCTCATGGCAGCTCCCATGGGCATTCAGGAGTCGCAAGTAGAGGAGGTATAAGTGTTGTTGGAAACCCTGGATTTAGTAGTAGCACAAACGCAGTTGGCGGTTCTATTCCTGGGATTCTGTCTACTTCTGCTGCTATTGGTAATCGAAATGCTGTTCCAGGATTGGGTGTATCCCCAATTTTGGGAAATGCAGGTCCTCGGATCACAAGTTCAATGGGAAATATGGTCAGTGGAGGCAACATAGGAAGGAGTATAACTACGGGTGGGGGATTGTCATTACCTGGTCTTGCTTCTCGTCTAAACCTTGGTGCAAATAGTGGATCCGGAAGCTTAACTGTGCAAGGACAAAACCGTCTAATGAGTGGGGTGCTACCACAAGGATCTCAACAGGTCATTTCTATGTTAAGTAATTCTTATCCTAGTGCTGGAGGTCCCCTTTCCCAAAACCACATGCAGAGTGTGAATAGTTTGAATTCTTTGGGGATGTTGAATGATGTGAACACCAGTGACAATTCTCCTTTTGACATTAATGATTTTCCTCAGTTAACAAGTCGTCCAAGTTCAGCAGGAGGGCCTCAAGGACAATTAAGGAGTCATGCTATTGTTTCAGGTTCGCTGAGAAAGCAGGGCCTGAGTCCTATTGTTCAACAAAACCAAGAGTTCAGCATTCAGAATGAAGACTTTCCAGCATTACCTAGATTTAAAGGTGGCAATGCTGATTATGGTATGGACATTCATCAGAAAGATCAACATGAAAATTCTGTGCCTATGATGCAGTCTCAGCAGTTCTCTATTGGAAGGTCTGCTGGATTTAACCTAGGGGGCACCTATACACATCGACCCCAGCAGCAGCAACAGCATTCTTCAGCCGTCAGTAACAGCACGGTCTCCTTTCCACCTGGAAATAATCAGGATCTCCTCCATTTACACGGATCAGATATATTCCCATCTTCACATCCTGCATCCTATCACCAGCAGTCTAGTGGGCCTCCTGGTATTGGTTTAAGACCTCTGAGCTCTCCTAATTCAGCTTCTGGAATGAGTTATGACCAACTTATCCAGCAATATCAGCAGCCTCACGGTCAATCTCAGTTTCGATTGCAACATATGTCTGGTGTTAGCCAGTCATTTAGAGACCAGGGCATGAAATCTATGCAGGCGGCTCAATCTTCTCCTGATCCATTTGGTTTACTTGGTTTGTTAAGTGTGATAAGGTTGAGTGATCCCGATCTTGCATCCCTTGCGCTCGGAATTGATTTGACCACGTTAGGATTAAATTTGAATTCAGCAGATAACCTTCACAAGACTTTTGGCTCCCCATGGTCTGATGAGCCTGCTAAGGGTGATCCCGATTTCAATGTACCTCAGTGCTACCTTATTAAACCACCACCTACACTACATCAAGGATACTTCTCAAAATTTACTCTGGAGACACTGTTTTATATATTTTTCAGTATGCCAAAAGATGAAGCTCAGTTGTATGCTGCAAATGAACTATATAATAGAGGCTGGTTCTATCACAAAGAACAACGATTCTGGTTTATTCGGGTCTCTAACATGGAACCACTTGTGAAGACTAACACTTATGAGCGAGGATCGTACCTCTGTTTCGACCCCCAAACATTTGAAACTGTCCGCAAGGATAATTTTGTTCTTCACTACGAGATGGTAGAAAAGAGACCAGCTCTACCGCAACATTAA

Coding sequence (CDS)

TGCTCTTGTTCTTTGCAGTCATCTCTTAATGGATCAGCTTCAAATCTTCCAGATGGTACTGGGCGATCTTTTACTACTTCATTTTCTGGTCAGTCTGGTGCTGCCTCCCCTGTTTTTCATCACTCTGGTGCACTAACTTCAAGAAATTCAACGATAAATAATGTTCCATCTGGTGGGGTGCAGCAACCTACTGGAACACTTTCCAGTGGGCGTTTTGCATCGAACAACCTTCCTGTTGCTCTTTCTCAGTTGTCTCATGGCAGCTCCCATGGGCATTCAGGAGTCGCAAGTAGAGGAGGTATAAGTGTTGTTGGAAACCCTGGATTTAGTAGTAGCACAAACGCAGTTGGCGGTTCTATTCCTGGGATTCTGTCTACTTCTGCTGCTATTGGTAATCGAAATGCTGTTCCAGGATTGGGTGTATCCCCAATTTTGGGAAATGCAGGTCCTCGGATCACAAGTTCAATGGGAAATATGGTCAGTGGAGGCAACATAGGAAGGAGTATAACTACGGGTGGGGGATTGTCATTACCTGGTCTTGCTTCTCGTCTAAACCTTGGTGCAAATAGTGGATCCGGAAGCTTAACTGTGCAAGGACAAAACCGTCTAATGAGTGGGGTGCTACCACAAGGATCTCAACAGGTCATTTCTATGTTAAGTAATTCTTATCCTAGTGCTGGAGGTCCCCTTTCCCAAAACCACATGCAGAGTGTGAATAGTTTGAATTCTTTGGGGATGTTGAATGATGTGAACACCAGTGACAATTCTCCTTTTGACATTAATGATTTTCCTCAGTTAACAAGTCGTCCAAGTTCAGCAGGAGGGCCTCAAGGACAATTAAGGAGTCATGCTATTGTTTCAGGTTCGCTGAGAAAGCAGGGCCTGAGTCCTATTGTTCAACAAAACCAAGAGTTCAGCATTCAGAATGAAGACTTTCCAGCATTACCTAGATTTAAAGGTGGCAATGCTGATTATGGTATGGACATTCATCAGAAAGATCAACATGAAAATTCTGTGCCTATGATGCAGTCTCAGCAGTTCTCTATTGGAAGGTCTGCTGGATTTAACCTAGGGGGCACCTATACACATCGACCCCAGCAGCAGCAACAGCATTCTTCAGCCGTCAGTAACAGCACGGTCTCCTTTCCACCTGGAAATAATCAGGATCTCCTCCATTTACACGGATCAGATATATTCCCATCTTCACATCCTGCATCCTATCACCAGCAGTCTAGTGGGCCTCCTGGTATTGGTTTAAGACCTCTGAGCTCTCCTAATTCAGCTTCTGGAATGAGTTATGACCAACTTATCCAGCAATATCAGCAGCCTCACGGTCAATCTCAGTTTCGATTGCAACATATGTCTGGTGTTAGCCAGTCATTTAGAGACCAGGGCATGAAATCTATGCAGGCGGCTCAATCTTCTCCTGATCCATTTGGTTTACTTGGTTTGTTAAGTGTGATAAGGTTGAGTGATCCCGATCTTGCATCCCTTGCGCTCGGAATTGATTTGACCACGTTAGGATTAAATTTGAATTCAGCAGATAACCTTCACAAGACTTTTGGCTCCCCATGGTCTGATGAGCCTGCTAAGGGTGATCCCGATTTCAATGTACCTCAGTGCTACCTTATTAAACCACCACCTACACTACATCAAGGATACTTCTCAAAATTTACTCTGGAGACACTGTTTTATATATTTTTCAGTATGCCAAAAGATGAAGCTCAGTTGTATGCTGCAAATGAACTATATAATAGAGGCTGGTTCTATCACAAAGAACAACGATTCTGGTTTATTCGGGTCTCTAACATGGAACCACTTGTGAAGACTAACACTTATGAGCGAGGATCGTACCTCTGTTTCGACCCCCAAACATTTGAAACTGTCCGCAAGGATAATTTTGTTCTTCACTACGAGATGGTAGAAAAGAGACCAGCTCTACCGCAACATTAA

Protein sequence

CSCSLQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHHSGALTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAIVSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQFSIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPASYHQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQGMKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRPALPQH
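
The protein sequence above corresponds to a frame-0 translation of the CDS. The sketch below illustrates this on the first 36 bases and the last 9 bases of the CDS, assuming the standard nuclear genetic code. The helper name `translate` is hypothetical and is not part of the database.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "TGCTCTTGTTCTTTGCAGTCATCTCTTAATGGATCA"  # first 36 bases of the CDS
cds_end = "CAACATTAA"                               # last 9 bases of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The first fragment yields the opening residues of the protein (CSCSLQSSLNGS). The second ends in the stop codon TAA, which is why the listed protein stops at ...PQH.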
Homology
BLAST of HG10001431 vs. NCBI nr
Match: XP_038902637.1 (probable NOT transcription complex subunit VIP2 isoform X1 [Benincasa hispida])

HSP 1 Score: 1192.6 bits (3084), Expect = 0.0e+00
Identity = 618/663 (93.21%), Positives = 625/663 (94.27%), Query Frame = 0

Query: 5   LQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-----------------SGALTS 64
           L SSLNGSASNLPDGTGRSF TSFSGQSGAASPVFHH                 SGALTS
Sbjct: 5   LNSSLNGSASNLPDGTGRSFATSFSGQSGAASPVFHHSGGGLHNIHGSFSLQNMSGALTS 64

Query: 65  RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP 124
           RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP
Sbjct: 65  RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP 124

Query: 125 GFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGR 184
           GFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGR
Sbjct: 125 GFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGR 184

Query: 185 SITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAG 244
           SIT GGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAG
Sbjct: 185 SITAGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAG 244

Query: 245 GPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAIVS 304
           GPLSQNHMQSVNSLNSLGMLNDVNT+DNSPFDINDFPQLTSRPSSAGGPQGQL       
Sbjct: 245 GPLSQNHMQSVNSLNSLGMLNDVNTNDNSPFDINDFPQLTSRPSSAGGPQGQL------- 304

Query: 305 GSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQF 364
            SLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQH+NSVPMMQSQQF
Sbjct: 305 SSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHDNSVPMMQSQQF 364

Query: 365 SIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPASY 424
           SIGRSAGFNLGGTYTHRPQQQQQHS AVSNSTVSFPP NNQDLLHLHGSDIFPSSH ASY
Sbjct: 365 SIGRSAGFNLGGTYTHRPQQQQQHSPAVSNSTVSFPPANNQDLLHLHGSDIFPSSHAASY 424

Query: 425 HQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQGMK 484
           HQQSSGPPGIGLRPLSSPNSASGM YDQLI  YQQPHGQSQFRLQHMSGVSQSFRDQG+K
Sbjct: 425 HQQSSGPPGIGLRPLSSPNSASGMGYDQLIPPYQQPHGQSQFRLQHMSGVSQSFRDQGLK 484

Query: 485 SMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD 544
           SMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD
Sbjct: 485 SMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD 544

Query: 545 EPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG 604
           EPAKGDPDFNVPQCYLIKPPP+LH+GYFSKFTLETLFY+FFSMPKDEAQLYAANELYNRG
Sbjct: 545 EPAKGDPDFNVPQCYLIKPPPSLHRGYFSKFTLETLFYMFFSMPKDEAQLYAANELYNRG 604

Query: 605 WFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRPAL 651
           WFYHKE RFWFIRVSNMEPLVKT+TYERGSYLCFDP TFETVRKDNFVLHYEMVEKRP L
Sbjct: 605 WFYHKEHRFWFIRVSNMEPLVKTSTYERGSYLCFDPHTFETVRKDNFVLHYEMVEKRPVL 660
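
The percentages in each HSP header are the match counts divided by the alignment length. As a small sketch using the values reported for the first hit (XP_038902637.1), the figures can be reproduced as follows. The helper name `percent` is hypothetical. The denominator 663 is the alignment length as reported in that header and is taken to include gap columns.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts copied from the HSP header of the first NCBI nr hit.
identity = percent(618, 663)   # identical residues
positives = percent(625, 663)  # identical plus conservative substitutions

print(identity, positives)
```

The same calculation applies to the other hits by substituting the counts from their headers.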

BLAST of HG10001431 vs. NCBI nr
Match: XP_038901690.1 (probable NOT transcription complex subunit VIP2 isoform X1 [Benincasa hispida])

HSP 1 Score: 1179.5 bits (3050), Expect = 0.0e+00
Identity = 613/663 (92.46%), Positives = 622/663 (93.82%), Query Frame = 0

Query: 5   LQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-----------------SGALTS 64
           L SSLNGS SNLPDGTGRSF TSFSGQSGAASPVFHH                 SGALTS
Sbjct: 5   LNSSLNGSTSNLPDGTGRSFATSFSGQSGAASPVFHHSGGGLHNVHGSFSIQNMSGALTS 64

Query: 65  RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP 124
           RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGV +RGGISVVGNP
Sbjct: 65  RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVTNRGGISVVGNP 124

Query: 125 GFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGR 184
           GFSSS+NAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGR
Sbjct: 125 GFSSSSNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGR 184

Query: 185 SITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAG 244
           SITTGGGLSLPGLASRLNL ANSGSGSLT+QGQNRLMSGVLPQGSQQVISMLSNSYPSAG
Sbjct: 185 SITTGGGLSLPGLASRLNLSANSGSGSLTMQGQNRLMSGVLPQGSQQVISMLSNSYPSAG 244

Query: 245 GPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAIVS 304
           GPLSQNHMQSVNSLNSLGMLNDVNT+DNSPFDINDFPQLTSRPSSAGGPQGQL       
Sbjct: 245 GPLSQNHMQSVNSLNSLGMLNDVNTNDNSPFDINDFPQLTSRPSSAGGPQGQL------- 304

Query: 305 GSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQF 364
            SLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQF
Sbjct: 305 SSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQF 364

Query: 365 SIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPASY 424
           SIGRSAGFNLGGTY+HRPQQQQQHSSAVSN TVSFPP NNQDLLHLHGSDIFPSSH ASY
Sbjct: 365 SIGRSAGFNLGGTYSHRPQQQQQHSSAVSNGTVSFPPANNQDLLHLHGSDIFPSSHAASY 424

Query: 425 HQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQGMK 484
           HQQSSGPPGIGLRPLSSPNSASGM YDQL QQYQQ HGQSQFRLQH+SG SQSFRDQGMK
Sbjct: 425 HQQSSGPPGIGLRPLSSPNSASGMGYDQL-QQYQQHHGQSQFRLQHISGASQSFRDQGMK 484

Query: 485 SMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD 544
           S+QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD
Sbjct: 485 SLQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD 544

Query: 545 EPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG 604
           EPAKGDPDFNVPQCYLIKPPP+LHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG
Sbjct: 545 EPAKGDPDFNVPQCYLIKPPPSLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG 604

Query: 605 WFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRPAL 651
           WFYHKE RFWFIRVSNMEPLVKT+TYERGSYLCFDP TFETVRKDNFVLHYEMVEKRP L
Sbjct: 605 WFYHKEHRFWFIRVSNMEPLVKTSTYERGSYLCFDPHTFETVRKDNFVLHYEMVEKRPVL 659

BLAST of HG10001431 vs. NCBI nr
Match: XP_023532180.1 (probable NOT transcription complex subunit VIP2 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1178.7 bits (3048), Expect = 0.0e+00
Identity = 610/661 (92.28%), Positives = 622/661 (94.10%), Query Frame = 0

Query: 5   LQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH---------------SGALTSRN 64
           L SS+NGSASNLPDGTGRSF  SFSGQSGAASPVFHH               SGALTSRN
Sbjct: 5   LNSSVNGSASNLPDGTGRSFANSFSGQSGAASPVFHHSGLHNIHGNFNLQNMSGALTSRN 64

Query: 65  STINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNPGF 124
           STINNVPSGGVQQPTGT+SSGRFASNNLPVALSQLSHGSSHGHSGVA+RGGISVVGNPGF
Sbjct: 65  STINNVPSGGVQQPTGTISSGRFASNNLPVALSQLSHGSSHGHSGVANRGGISVVGNPGF 124

Query: 125 SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI 184
           SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI
Sbjct: 125 SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI 184

Query: 185 TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP 244
           TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP
Sbjct: 185 TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP 244

Query: 245 LSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAIVSGS 304
           LSQNH+Q+VNSLNSLGMLNDVN+SDNSPFDINDFPQLTSRPSSAGGPQGQL        S
Sbjct: 245 LSQNHIQNVNSLNSLGMLNDVNSSDNSPFDINDFPQLTSRPSSAGGPQGQL-------SS 304

Query: 305 LRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQFSI 364
           LRKQGLSPIVQQNQEFSIQ+EDFPAL RFKGGN DYGMDIHQKDQHENSVP+MQSQQFSI
Sbjct: 305 LRKQGLSPIVQQNQEFSIQSEDFPALSRFKGGNVDYGMDIHQKDQHENSVPIMQSQQFSI 364

Query: 365 GRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPASYHQ 424
           GRSAGFNLG TY+HRPQQQQQHS AVSNSTVSFPP NNQDLLHLHGSDIFPSSH ASYHQ
Sbjct: 365 GRSAGFNLGSTYSHRPQQQQQHSPAVSNSTVSFPPANNQDLLHLHGSDIFPSSHAASYHQ 424

Query: 425 QSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQGMKSM 484
           QSSGPPGIGLRPLSSPNS SGM YDQLIQQYQQ H Q QFRLQHMSGVSQSFRDQGMKSM
Sbjct: 425 QSSGPPGIGLRPLSSPNSVSGMGYDQLIQQYQQHHSQPQFRLQHMSGVSQSFRDQGMKSM 484

Query: 485 QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP 544
           QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP
Sbjct: 485 QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP 544

Query: 545 AKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF 604
           AKGDPDFNVPQCYLIKPPP+LHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF
Sbjct: 545 AKGDPDFNVPQCYLIKPPPSLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF 604

Query: 605 YHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRPALPQ 651
           YHKE RFWFIRVSNMEPLVKT++YERGSYLCFDP TFETVRKDNFVLHYEMVEKRPALPQ
Sbjct: 605 YHKEHRFWFIRVSNMEPLVKTSSYERGSYLCFDPHTFETVRKDNFVLHYEMVEKRPALPQ 658

BLAST of HG10001431 vs. NCBI nr
Match: XP_008448344.1 (PREDICTED: probable NOT transcription complex subunit VIP2 isoform X1 [Cucumis melo])

HSP 1 Score: 1178.3 bits (3047), Expect = 0.0e+00
Identity = 612/666 (91.89%), Positives = 624/666 (93.69%), Query Frame = 0

Query: 2   SCSLQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-----------------SGA 61
           SCSLQSSLNGS SNLPDGTGRSF TSFSGQSGAASPVFHH                 SGA
Sbjct: 58  SCSLQSSLNGSTSNLPDGTGRSFATSFSGQSGAASPVFHHSGGGLHNIHGSFNIQNMSGA 117

Query: 62  LTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVV 121
           LTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGI+VV
Sbjct: 118 LTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGINVV 177

Query: 122 GNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN 181
           GNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN
Sbjct: 178 GNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN 237

Query: 182 IGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYP 241
           IGRS+T GGGLSLPGLASRLNL +NSGSGSLTVQGQNRL+SGVLPQGSQQV+SML NSYP
Sbjct: 238 IGRSVTAGGGLSLPGLASRLNLNSNSGSGSLTVQGQNRLISGVLPQGSQQVLSMLGNSYP 297

Query: 242 SAGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHA 301
           SAGGPLSQNHMQSVNSLNSLGMLNDVN +DNSPFDINDFPQLTSRPSSAGGPQGQL    
Sbjct: 298 SAGGPLSQNHMQSVNSLNSLGMLNDVNANDNSPFDINDFPQLTSRPSSAGGPQGQL---- 357

Query: 302 IVSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQS 361
               SLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQS
Sbjct: 358 ---SSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQS 417

Query: 362 QQFSIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHP 421
           QQFSIGRSAGFNLGGT++HRPQQQQQHSSAVSNSTVSFPP NNQDLLHLHGSDIFPSSH 
Sbjct: 418 QQFSIGRSAGFNLGGTFSHRPQQQQQHSSAVSNSTVSFPPANNQDLLHLHGSDIFPSSHA 477

Query: 422 ASYHQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQ 481
           ASYHQQSSGPPGIGLRPLSSPNSASGM YDQL QQYQQ HGQSQFRLQHMSGVSQSFRDQ
Sbjct: 478 ASYHQQSSGPPGIGLRPLSSPNSASGMGYDQL-QQYQQHHGQSQFRLQHMSGVSQSFRDQ 537

Query: 482 GMKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSP 541
           G+KSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSP
Sbjct: 538 GIKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSP 597

Query: 542 WSDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELY 601
           WSDEPAKGDPDFNVPQCYLIKPP +LHQGYF KF+LETLFYIFFSMPKDEAQLYAANELY
Sbjct: 598 WSDEPAKGDPDFNVPQCYLIKPPASLHQGYFPKFSLETLFYIFFSMPKDEAQLYAANELY 657

Query: 602 NRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKR 651
           NRGWFYHKE RFWFIRVSNMEPLVKT+TYERGSYLCFDP TFETVRKDNFVLHYEMVEKR
Sbjct: 658 NRGWFYHKEHRFWFIRVSNMEPLVKTSTYERGSYLCFDPHTFETVRKDNFVLHYEMVEKR 715

BLAST of HG10001431 vs. NCBI nr
Match: XP_011649310.1 (probable NOT transcription complex subunit VIP2 isoform X2 [Cucumis sativus])

HSP 1 Score: 1177.5 bits (3045), Expect = 0.0e+00
Identity = 610/663 (92.01%), Positives = 622/663 (93.82%), Query Frame = 0

Query: 5   LQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-----------------SGALTS 64
           ++SSLNGSASNLPDGTGRSF TSFSGQSGAASPVFHH                 SGALTS
Sbjct: 1   MESSLNGSASNLPDGTGRSFATSFSGQSGAASPVFHHSGGGLHNIHGSFSIQNMSGALTS 60

Query: 65  RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP 124
           RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP
Sbjct: 61  RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP 120

Query: 125 GFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGR 184
           GFSSSTNAVGGSIPGILSTSAAIGNRN VPGLGVSPILGNAGPRITSSMGNM SGGNIGR
Sbjct: 121 GFSSSTNAVGGSIPGILSTSAAIGNRNTVPGLGVSPILGNAGPRITSSMGNMASGGNIGR 180

Query: 185 SITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAG 244
           SIT GGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAG
Sbjct: 181 SITAGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAG 240

Query: 245 GPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAIVS 304
           GPLSQNHMQSVNSLNSLGMLN+VNT+DNSPFDINDFPQLTSRPSSAGGPQGQL       
Sbjct: 241 GPLSQNHMQSVNSLNSLGMLNEVNTNDNSPFDINDFPQLTSRPSSAGGPQGQL------- 300

Query: 305 GSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQF 364
            SLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQH+NSVPMMQSQQF
Sbjct: 301 SSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHDNSVPMMQSQQF 360

Query: 365 SIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPASY 424
           SIGRSAGFNLGGTY+HRPQQQQQHS AVSNS+VSFPP NNQDLLHLHGSD+FPSSH ASY
Sbjct: 361 SIGRSAGFNLGGTYSHRPQQQQQHSPAVSNSSVSFPPANNQDLLHLHGSDMFPSSHAASY 420

Query: 425 HQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQGMK 484
           HQQSSGPPGIGLRPLSSPNSASGMSYDQLI QYQQ   QSQFRLQHMSGVSQSFRDQG+K
Sbjct: 421 HQQSSGPPGIGLRPLSSPNSASGMSYDQLIPQYQQHPSQSQFRLQHMSGVSQSFRDQGLK 480

Query: 485 SMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD 544
           SMQA QSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD
Sbjct: 481 SMQATQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD 540

Query: 545 EPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG 604
           EPAKGDPDFNVPQCYLIKPPP+LH+GYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG
Sbjct: 541 EPAKGDPDFNVPQCYLIKPPPSLHRGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG 600

Query: 605 WFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRPAL 651
           WFYHKE RFWFIRVSNMEPLVKT+TYERGSYLCFDP TFETVRKDNFVLHYEMVEKRP L
Sbjct: 601 WFYHKEHRFWFIRVSNMEPLVKTSTYERGSYLCFDPHTFETVRKDNFVLHYEMVEKRPVL 656

BLAST of HG10001431 vs. ExPASy Swiss-Prot
Match: Q52JK6 (Probable NOT transcription complex subunit VIP2 (Fragment) OS=Nicotiana benthamiana OX=4100 GN=VIP2 PE=1 SV=1)

HSP 1 Score: 893.6 bits (2308), Expect = 1.3e-258
Identity = 463/611 (75.78%), Positives = 524/611 (85.76%), Query Frame = 0

Query: 43  GALTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGIS 102
           G LTSRN+ INNVPS GVQQ    LS GRF  NNLP ALSQ+  G+SHGHSG+ SRGG S
Sbjct: 3   GTLTSRNTAINNVPSSGVQQSGNNLSGGRFVPNNLPSALSQIPQGNSHGHSGMTSRGGTS 62

Query: 103 VVGNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSG 162
           VVGNPG+SS+TN VGGSIPGIL T AAIGNR++VPGLGVSPILGNAGPR+T+S+GN+V G
Sbjct: 63  VVGNPGYSSNTNGVGGSIPGILPTFAAIGNRSSVPGLGVSPILGNAGPRMTNSVGNIVGG 122

Query: 163 GNIGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNS 222
           GNIGRSI++G GLS+PGLASRLN+ ANSGSG+L VQG NRLMSGVL Q S QV+SML NS
Sbjct: 123 GNIGRSISSGAGLSVPGLASRLNMNANSGSGNLNVQGPNRLMSGVLQQASPQVLSMLGNS 182

Query: 223 YPSAGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRS 282
           YP AGGPLSQNH+Q++ + NS+G+LNDVN++D SPFDINDFPQL+SRPSSAGGPQGQL  
Sbjct: 183 YP-AGGPLSQNHVQAIGNFNSMGLLNDVNSNDGSPFDINDFPQLSSRPSSAGGPQGQL-- 242

Query: 283 HAIVSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQ-HENSVPM 342
                GSLRKQGLSPIVQQNQEFSIQNEDFPALP FKGGNADY MD HQK+Q H+N++ M
Sbjct: 243 -----GSLRKQGLSPIVQQNQEFSIQNEDFPALPGFKGGNADYAMDPHQKEQLHDNTLSM 302

Query: 343 MQSQQFSIGRSAGFNLGGTY-THRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFP 402
           MQ Q FS+GRSAGFNLGGTY ++RPQQQ QH+ +VS+  VSF   NNQDLL LHGSD+F 
Sbjct: 303 MQQQHFSMGRSAGFNLGGTYSSNRPQQQLQHAPSVSSGGVSFSNINNQDLLSLHGSDVFQ 362

Query: 403 SSHPASYHQQSSGPPGIGLRPLSSPNSASGM-SYDQLIQQYQQPHGQSQFRLQHMSGVSQ 462
           SSH +SY QQ  GPPGIGLRPL+S  + SG+ SYDQLIQQYQQ  GQSQFRLQ MS + Q
Sbjct: 363 SSH-SSYQQQGGGPPGIGLRPLNSSGTVSGIGSYDQLIQQYQQHQGQSQFRLQQMSTLGQ 422

Query: 463 SFRDQGMKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHK 522
            FRDQ +KSMQ +Q +PDPFG+LGLLSVIR+SDPDL SLALGIDLTTLGLNLNSA+NL+K
Sbjct: 423 PFRDQSLKSMQ-SQVAPDPFGMLGLLSVIRMSDPDLTSLALGIDLTTLGLNLNSAENLYK 482

Query: 523 TFGSPWSDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYA 582
           TFGSPWSDEPAKGDP+F VPQCY  K PP L+Q YFSKF L+TLFYIF+SMPKDEAQLYA
Sbjct: 483 TFGSPWSDEPAKGDPEFTVPQCYYAKQPPPLNQAYFSKFQLDTLFYIFYSMPKDEAQLYA 542

Query: 583 ANELYNRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYE 642
           ANELYNRGWFYH+E R WF+RV+NMEPLVKTN YERGSY+CFDP T+ET+ KDNFVLH E
Sbjct: 543 ANELYNRGWFYHREHRLWFMRVANMEPLVKTNAYERGSYICFDPNTWETIHKDNFVLHCE 602

Query: 643 MVEKRPALPQH 651
           M+EKRP LPQH
Sbjct: 603 MLEKRPVLPQH 603

BLAST of HG10001431 vs. ExPASy Swiss-Prot
Match: Q9FPW4 (Probable NOT transcription complex subunit VIP2 OS=Arabidopsis thaliana OX=3702 GN=VIP2 PE=1 SV=2)

HSP 1 Score: 738.0 bits (1904), Expect = 9.0e-212
Identity = 417/669 (62.33%), Positives = 498/669 (74.44%), Query Frame = 0

Query: 4   SLQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHHS-------------------GA 63
           +L SSLNGSASNLPDG+GRSFT S+SGQSGA SP FHH+                   G 
Sbjct: 3   NLHSSLNGSASNLPDGSGRSFTASYSGQSGAPSPSFHHTGNLQGLHNIHGNYNVGNMQGT 62

Query: 64  LTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVV 123
           LTSRNS++N++PS GVQQP G+ SSGRFASNNLPV LSQLSHGSSHGHSG+ +R G++VV
Sbjct: 63  LTSRNSSMNSIPSAGVQQPNGSFSSGRFASNNLPVNLSQLSHGSSHGHSGIPNR-GLNVV 122

Query: 124 GNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN 183
           GNPGFSS+ N VGGSIPGILSTSA + NRN+VPG+G+S +LGN+GPRIT+SMGNMV GGN
Sbjct: 123 GNPGFSSNANGVGGSIPGILSTSAGLSNRNSVPGMGISQLLGNSGPRITNSMGNMVGGGN 182

Query: 184 IGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYP 243
           +GR+I++ GGLS+PGL+SRLNL ANSGSG L VQGQNR+M GVLPQGS QV+SML NSY 
Sbjct: 183 LGRNISS-GGLSIPGLSSRLNLAANSGSG-LNVQGQNRMMGGVLPQGS-QVMSMLGNSYH 242

Query: 244 SAGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDI-NDFPQLTSRPSSAGGPQGQLRSH 303
           + GGPLSQNH+QSVN++    ML+D + +D+S FDI NDFPQLTSRP SAGG QG L   
Sbjct: 243 TGGGPLSQNHVQSVNNM----MLSD-HPNDSSLFDINNDFPQLTSRPGSAGGTQGHL--- 302

Query: 304 AIVSGSLRKQGLS-PIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQ-HENSVPM 363
               GSLRKQGL  P+VQQNQEFSIQNEDFPALP +KGGN++Y MD+HQK+Q H+N++ M
Sbjct: 303 ----GSLRKQGLGVPLVQQNQEFSIQNEDFPALPGYKGGNSEYPMDLHQKEQLHDNAMSM 362

Query: 364 MQSQQFSIGRSAGFNLGGTY-THRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFP 423
           M SQ FS+GRS GFNLG TY +HRPQQQ QH+S+                          
Sbjct: 363 MHSQNFSMGRSGGFNLGATYSSHRPQQQPQHTSS-------------------------- 422

Query: 424 SSHPASYHQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQS 483
                     + G  G+GLRPLSSPN+ S + YDQLIQQYQQ   QSQF +Q MS ++Q 
Sbjct: 423 ----------TGGLQGLGLRPLSSPNAVSSIGYDQLIQQYQQHQNQSQFPVQQMSSINQ- 482

Query: 484 FRDQGMKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKT 543
           FRD  MKS    QS  DPF LLGLL V+  S+P+L SLALGIDLTTLGL+LNS  NL+KT
Sbjct: 483 FRDSEMKS---TQSEADPFCLLGLLDVLNRSNPELTSLALGIDLTTLGLDLNSTGNLYKT 542

Query: 544 FGSPWSDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAA 603
           F SPW++EPAK + +F VP CY    PP L +  F +F+ E LFY F+SMPKDEAQLYAA
Sbjct: 543 FASPWTNEPAKSEVEFTVPNCYYATEPPPLTRASFKRFSYELLFYTFYSMPKDEAQLYAA 602

Query: 604 NELYNRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEM 650
           +ELY RGWFYHKE R WF RV   EPLV+  TYERG+Y   DP +F+TVRK++FV+ YE+
Sbjct: 603 DELYERGWFYHKELRVWFFRVG--EPLVRAATYERGTYEYLDPNSFKTVRKEHFVIKYEL 613

BLAST of HG10001431 vs. ExPASy Swiss-Prot
Match: Q9NZN8 (CCR4-NOT transcription complex subunit 2 OS=Homo sapiens OX=9606 GN=CNOT2 PE=1 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 9.8e-33
Identity = 167/616 (27.11%), Positives = 249/616 (40.42%), Query Frame = 0

Query: 47  SRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGN 106
           SR   +  V S    +      S  F   +    L+  S        G +  G  S +G 
Sbjct: 25  SRKKFVEGVDSDYHDENMYYSQSSMFPHRSEKDMLASPSTSGQLSQFGASLYGQQSALGL 84

Query: 107 P--GFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN 166
           P  G S++T  +  S+       + +     VP + +      +   +  +  NM++   
Sbjct: 85  PMRGMSNNTPQLNRSLSQGTQLPSHVTPTTGVPTMSLHTPPSPSRGILPMNPRNMMNHSQ 144

Query: 167 IGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYP 226
           +G+ I         G+ SR N  ++SG GS      NR    ++    QQ          
Sbjct: 145 VGQGI---------GIPSRTNSMSSSGLGS-----PNRSSPSIICMPKQQPSRQPFTVNS 204

Query: 227 SAGGPLSQNHMQSVNSLNSLGMLNDVNTSDN-SPFDINDFPQLT--SRPSSAGGPQ---G 286
            +G  +++N    +N+  S  + N  + S+N +  D++DFP L   +R   +G P     
Sbjct: 205 MSGFGMNRNQAFGMNNSLSSNIFNGTDGSENVTGLDLSDFPALADRNRREGSGNPTPLIN 264

Query: 287 QLRSHAIVSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENS 346
            L   A   G + K    P  +Q+Q+FSI NEDFPALP      + Y       D  +++
Sbjct: 265 PLAGRAPYVGMVTK----PANEQSQDFSIHNEDFPALP-----GSSYKDPTSSNDDSKSN 324

Query: 347 VPMMQSQQFSIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDI 406
           +                               ++S  + S+   P               
Sbjct: 325 L-------------------------------NTSGKTTSSTDGPK-------------- 384

Query: 407 FPSSHPASYHQQSSGPPGIGLRPLSS-PNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGV 466
           FP    ++    +    GI + P     N   GM  DQ                      
Sbjct: 385 FPGDKSSTTQNNNQQKKGIQVLPDGRVTNIPQGMVTDQ---------------------- 444

Query: 467 SQSFRDQGMKSMQAAQSSPDPFGLLGLLSVIRL--SDPDLASLALGIDLTTLGLNLNSAD 526
                                FG++GLL+ IR   +DP +  LALG DLTTLGLNLNS +
Sbjct: 445 ---------------------FGMIGLLTFIRAAETDPGMVHLALGSDLTTLGLNLNSPE 504

Query: 527 NLHKTFGSPWSDEPAK-GDPDFNVPQCYL--IKPPPTLHQGYFSKFTLETLFYIFFSMPK 586
           NL+  F SPW+  P +  D DF+VP  YL  I     L      ++  + LFY+++    
Sbjct: 505 NLYPKFASPWASSPCRPQDIDFHVPSEYLTNIHIRDKLAAIKLGRYGEDLLFYLYYMNGG 528

Query: 587 DEAQLYAANELYNRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKD 646
           D  QL AA EL+NR W YHKE+R W  R   MEP +KTNTYERG+Y  FD   +  V K+
Sbjct: 565 DVLQLLAAVELFNRDWRYHKEERVWITRAPGMEPTMKTNTYERGTYYFFDCLNWRKVAKE 528

Query: 647 NFVLHYEMVEKRPALP 649
            F L Y+ +E+RP LP
Sbjct: 625 -FHLEYDKLEERPHLP 528

BLAST of HG10001431 vs. ExPASy Swiss-Prot
Match: Q8C5L3 (CCR4-NOT transcription complex subunit 2 OS=Mus musculus OX=10090 GN=Cnot2 PE=1 SV=2)

HSP 1 Score: 142.9 bits (359), Expect = 1.3e-32
Identity = 170/618 (27.51%), Positives = 245/618 (39.64%), Query Frame = 0

Query: 47  SRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGN 106
           SR   +  V S    +      S  F   +    L+  S        G +  G  S +G 
Sbjct: 25  SRKKFVEGVDSDYHDENMYYSQSSMFPHRSEKDMLASPSTSGQLSQFGASLYGQQSALGL 84

Query: 107 P--GFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN 166
           P  G S++T  +  S+       + +     VP + +      +   +  +  NM++   
Sbjct: 85  PMRGMSNNTPQLNRSLSQGTQLPSHVTPTTGVPTMSLHTPPSPSRGILPMNPRNMMNHSQ 144

Query: 167 IGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYP 226
           +G+ I         G+ SR N  ++SG GS      NR    ++    QQ          
Sbjct: 145 VGQGI---------GIPSRTNSMSSSGLGS-----PNRSSPSIICMPKQQPSRQPFTVNS 204

Query: 227 SAGGPLSQNHMQSVNSLNSLGMLNDVNTSDN-SPFDINDFPQLT--SRPSSAGGPQ---G 286
            +G  +++N    +N+  S  + N  + S+N +  D++DFP L   +R   +G P     
Sbjct: 205 MSGFGMNRNQAFGMNNSLSSNIFNGTDGSENVTGLDLSDFPALADRNRREGSGNPTPLIN 264

Query: 287 QLRSHAIVSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENS 346
            L   A   G + K    P  +Q+Q+FSI NEDFPALP                      
Sbjct: 265 PLAGRAPYVGMVTK----PANEQSQDFSIHNEDFPALP---------------------- 324

Query: 347 VPMMQSQQFSIGRSAGFNLGGTYTHRPQQQQQHSSAVSNS--TVSFPPGNNQDLLHLHGS 406
                              G +Y           S +S S  T S   G           
Sbjct: 325 -------------------GSSYKDPTSSNDDSKSNLSTSGKTTSSTDGPK--------- 384

Query: 407 DIFPSSHPASYHQQSSGPPGIGLRPLSS-PNSASGMSYDQLIQQYQQPHGQSQFRLQHMS 466
             FP    ++    +    GI + P     N   GM  DQ                    
Sbjct: 385 --FPGDKSSTTQNNNQQKKGIQVLPDGRVTNIPQGMVTDQ-------------------- 444

Query: 467 GVSQSFRDQGMKSMQAAQSSPDPFGLLGLLSVIRL--SDPDLASLALGIDLTTLGLNLNS 526
                                  FG++GLL+ IR   +DP +  LALG DLTTLGLNLNS
Sbjct: 445 -----------------------FGMIGLLTFIRAAETDPGMVHLALGSDLTTLGLNLNS 504

Query: 527 ADNLHKTFGSPWSDEPAK-GDPDFNVPQCYL--IKPPPTLHQGYFSKFTLETLFYIFFSM 586
            +NL+  F SPW+  P +  D DF+VP  YL  I     L      ++  + LFY+++  
Sbjct: 505 PENLYPKFASPWASSPCRPQDIDFHVPSEYLTNIHIRDKLAAIKLGRYGEDLLFYLYYMN 528

Query: 587 PKDEAQLYAANELYNRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVR 646
             D  QL AA EL+NR W YHKE+R W  R   MEP +KTNTYERG+Y  FD   +  V 
Sbjct: 565 GGDVLQLLAAVELFNRDWRYHKEERVWITRAPGMEPTMKTNTYERGTYYFFDCLNWRKVA 528

Query: 647 KDNFVLHYEMVEKRPALP 649
           K+ F L Y+ +E+RP LP
Sbjct: 625 KE-FHLEYDKLEERPHLP 528

BLAST of HG10001431 vs. ExPASy Swiss-Prot
Match: P87240 (General negative regulator of transcription subunit 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=not2 PE=1 SV=2)

HSP 1 Score: 125.2 bits (313), Expect = 2.8e-27
Identity = 71/186 (38.17%), Positives = 105/186 (56.45%), Query Frame = 0

Query: 472 AQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADN---LHKTFGSPWSDE 531
           A  +   + L  LL +IR+ D ++++L LG DL  LG +L   +    +     SPW++ 
Sbjct: 123 ADENAKQYMLESLLPIIRMEDSEMSTLQLGCDLAALGFDLAPVEEDRLISTNLFSPWAEL 182

Query: 532 PAK---GDPDFNVPQCYL-IKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELY 591
             K     P F +P CY  + PPP + + +  +F+ ETLFYIF++MP+D  Q  AA EL 
Sbjct: 183 NTKKPVSQPMFKLPACYKNVNPPPAISKIF--QFSDETLFYIFYTMPRDVMQEAAAQELT 242

Query: 592 NRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKR 651
           NR W +HKE R W   V  M+PL +T  +ERG Y+ FDP  ++ ++KD F+L Y  +E R
Sbjct: 243 NRNWRFHKELRVWLTPVPGMKPLQRTPQFERGYYMFFDPIHWKRIKKD-FLLMYAALEDR 302

BLAST of HG10001431 vs. ExPASy TrEMBL
Match: A0A1S3BJG5 (probable NOT transcription complex subunit VIP2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490568 PE=3 SV=1)

HSP 1 Score: 1178.3 bits (3047), Expect = 0.0e+00
Identity = 612/666 (91.89%), Positives = 624/666 (93.69%), Query Frame = 0

Query: 2   SCSLQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-----------------SGA 61
           SCSLQSSLNGS SNLPDGTGRSF TSFSGQSGAASPVFHH                 SGA
Sbjct: 58  SCSLQSSLNGSTSNLPDGTGRSFATSFSGQSGAASPVFHHSGGGLHNIHGSFNIQNMSGA 117

Query: 62  LTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVV 121
           LTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGI+VV
Sbjct: 118 LTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGINVV 177

Query: 122 GNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN 181
           GNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN
Sbjct: 178 GNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN 237

Query: 182 IGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYP 241
           IGRS+T GGGLSLPGLASRLNL +NSGSGSLTVQGQNRL+SGVLPQGSQQV+SML NSYP
Sbjct: 238 IGRSVTAGGGLSLPGLASRLNLNSNSGSGSLTVQGQNRLISGVLPQGSQQVLSMLGNSYP 297

Query: 242 SAGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHA 301
           SAGGPLSQNHMQSVNSLNSLGMLNDVN +DNSPFDINDFPQLTSRPSSAGGPQGQL    
Sbjct: 298 SAGGPLSQNHMQSVNSLNSLGMLNDVNANDNSPFDINDFPQLTSRPSSAGGPQGQL---- 357

Query: 302 IVSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQS 361
               SLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQS
Sbjct: 358 ---SSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQS 417

Query: 362 QQFSIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHP 421
           QQFSIGRSAGFNLGGT++HRPQQQQQHSSAVSNSTVSFPP NNQDLLHLHGSDIFPSSH 
Sbjct: 418 QQFSIGRSAGFNLGGTFSHRPQQQQQHSSAVSNSTVSFPPANNQDLLHLHGSDIFPSSHA 477

Query: 422 ASYHQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQ 481
           ASYHQQSSGPPGIGLRPLSSPNSASGM YDQL QQYQQ HGQSQFRLQHMSGVSQSFRDQ
Sbjct: 478 ASYHQQSSGPPGIGLRPLSSPNSASGMGYDQL-QQYQQHHGQSQFRLQHMSGVSQSFRDQ 537

Query: 482 GMKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSP 541
           G+KSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSP
Sbjct: 538 GIKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSP 597

Query: 542 WSDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELY 601
           WSDEPAKGDPDFNVPQCYLIKPP +LHQGYF KF+LETLFYIFFSMPKDEAQLYAANELY
Sbjct: 598 WSDEPAKGDPDFNVPQCYLIKPPASLHQGYFPKFSLETLFYIFFSMPKDEAQLYAANELY 657

Query: 602 NRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKR 651
           NRGWFYHKE RFWFIRVSNMEPLVKT+TYERGSYLCFDP TFETVRKDNFVLHYEMVEKR
Sbjct: 658 NRGWFYHKEHRFWFIRVSNMEPLVKTSTYERGSYLCFDPHTFETVRKDNFVLHYEMVEKR 715

BLAST of HG10001431 vs. ExPASy TrEMBL
Match: A0A6J1G5K7 (probable NOT transcription complex subunit VIP2 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111451091 PE=3 SV=1)

HSP 1 Score: 1177.2 bits (3044), Expect = 0.0e+00
Identity = 609/661 (92.13%), Positives = 621/661 (93.95%), Query Frame = 0

Query: 5   LQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH---------------SGALTSRN 64
           L SS+NGSASNLPDGTGRSF  SFSGQSGAASPVFHH               SGALTSRN
Sbjct: 5   LNSSVNGSASNLPDGTGRSFANSFSGQSGAASPVFHHSGLHNIHGNFNLQNMSGALTSRN 64

Query: 65  STINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNPGF 124
           STINNVPSGGVQQPTGT+SSGRFASNNLPVALSQLSHGSSHGHSGV +RGGISVVGNPGF
Sbjct: 65  STINNVPSGGVQQPTGTISSGRFASNNLPVALSQLSHGSSHGHSGVTNRGGISVVGNPGF 124

Query: 125 SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI 184
           SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI
Sbjct: 125 SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI 184

Query: 185 TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP 244
           TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP
Sbjct: 185 TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP 244

Query: 245 LSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAIVSGS 304
           LSQNH+Q+VNSLNSLGMLNDVN+SDNSPFDINDFPQLTSRPSSAGGPQGQL        S
Sbjct: 245 LSQNHIQNVNSLNSLGMLNDVNSSDNSPFDINDFPQLTSRPSSAGGPQGQL-------SS 304

Query: 305 LRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQFSI 364
           LRKQGLSPIVQQNQEFSIQ+EDFPAL RFKGGN DYGMDIHQKDQHENSVP+MQSQQFSI
Sbjct: 305 LRKQGLSPIVQQNQEFSIQSEDFPALSRFKGGNVDYGMDIHQKDQHENSVPIMQSQQFSI 364

Query: 365 GRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPASYHQ 424
           GRSAGFNLG TY+HRPQQQQQHS AVSNSTVSFPP NNQDLLHLHGSDIFPSSH ASYHQ
Sbjct: 365 GRSAGFNLGSTYSHRPQQQQQHSPAVSNSTVSFPPANNQDLLHLHGSDIFPSSHAASYHQ 424

Query: 425 QSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQGMKSM 484
           QSSGPPGIGLRPLSSPNS SGM YDQLIQQYQQ H Q QFRLQHMSGVSQSFRDQGMKSM
Sbjct: 425 QSSGPPGIGLRPLSSPNSVSGMGYDQLIQQYQQHHSQPQFRLQHMSGVSQSFRDQGMKSM 484

Query: 485 QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP 544
           QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP
Sbjct: 485 QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP 544

Query: 545 AKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF 604
           AKGDPDFNVPQCYLIKPPP+LHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF
Sbjct: 545 AKGDPDFNVPQCYLIKPPPSLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF 604

Query: 605 YHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRPALPQ 651
           YHKE RFWFIRVSNMEPLVKT++YERGSYLCFDP TFETVRKDNFVLHYEMVEKRPALPQ
Sbjct: 605 YHKEHRFWFIRVSNMEPLVKTSSYERGSYLCFDPHTFETVRKDNFVLHYEMVEKRPALPQ 658

BLAST of HG10001431 vs. ExPASy TrEMBL
Match: A0A6J1G5I6 (probable NOT transcription complex subunit VIP2 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111451091 PE=3 SV=1)

HSP 1 Score: 1175.6 bits (3040), Expect = 0.0e+00
Identity = 609/665 (91.58%), Positives = 621/665 (93.38%), Query Frame = 0

Query: 5   LQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-------------------SGAL 64
           L SS+NGSASNLPDGTGRSF  SFSGQSGAASPVFHH                   SGAL
Sbjct: 5   LNSSVNGSASNLPDGTGRSFANSFSGQSGAASPVFHHSGTIQGLHNIHGNFNLQNMSGAL 64

Query: 65  TSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVG 124
           TSRNSTINNVPSGGVQQPTGT+SSGRFASNNLPVALSQLSHGSSHGHSGV +RGGISVVG
Sbjct: 65  TSRNSTINNVPSGGVQQPTGTISSGRFASNNLPVALSQLSHGSSHGHSGVTNRGGISVVG 124

Query: 125 NPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNI 184
           NPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNI
Sbjct: 125 NPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNI 184

Query: 185 GRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPS 244
           GRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPS
Sbjct: 185 GRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPS 244

Query: 245 AGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAI 304
           AGGPLSQNH+Q+VNSLNSLGMLNDVN+SDNSPFDINDFPQLTSRPSSAGGPQGQL     
Sbjct: 245 AGGPLSQNHIQNVNSLNSLGMLNDVNSSDNSPFDINDFPQLTSRPSSAGGPQGQL----- 304

Query: 305 VSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQ 364
              SLRKQGLSPIVQQNQEFSIQ+EDFPAL RFKGGN DYGMDIHQKDQHENSVP+MQSQ
Sbjct: 305 --SSLRKQGLSPIVQQNQEFSIQSEDFPALSRFKGGNVDYGMDIHQKDQHENSVPIMQSQ 364

Query: 365 QFSIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPA 424
           QFSIGRSAGFNLG TY+HRPQQQQQHS AVSNSTVSFPP NNQDLLHLHGSDIFPSSH A
Sbjct: 365 QFSIGRSAGFNLGSTYSHRPQQQQQHSPAVSNSTVSFPPANNQDLLHLHGSDIFPSSHAA 424

Query: 425 SYHQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQG 484
           SYHQQSSGPPGIGLRPLSSPNS SGM YDQLIQQYQQ H Q QFRLQHMSGVSQSFRDQG
Sbjct: 425 SYHQQSSGPPGIGLRPLSSPNSVSGMGYDQLIQQYQQHHSQPQFRLQHMSGVSQSFRDQG 484

Query: 485 MKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPW 544
           MKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPW
Sbjct: 485 MKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPW 544

Query: 545 SDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYN 604
           SDEPAKGDPDFNVPQCYLIKPPP+LHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYN
Sbjct: 545 SDEPAKGDPDFNVPQCYLIKPPPSLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYN 604

Query: 605 RGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRP 651
           RGWFYHKE RFWFIRVSNMEPLVKT++YERGSYLCFDP TFETVRKDNFVLHYEMVEKRP
Sbjct: 605 RGWFYHKEHRFWFIRVSNMEPLVKTSSYERGSYLCFDPHTFETVRKDNFVLHYEMVEKRP 662

BLAST of HG10001431 vs. ExPASy TrEMBL
Match: A0A6J1L2Z0 (probable NOT transcription complex subunit VIP2 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111499941 PE=3 SV=1)

HSP 1 Score: 1174.8 bits (3038), Expect = 0.0e+00
Identity = 608/661 (91.98%), Positives = 620/661 (93.80%), Query Frame = 0

Query: 5   LQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH---------------SGALTSRN 64
           L SS+NGSASNLPDG+GRSF  SFSGQSGAASPVFHH               SGALTSRN
Sbjct: 5   LNSSVNGSASNLPDGSGRSFANSFSGQSGAASPVFHHSGLHNIHGNFNLQNMSGALTSRN 64

Query: 65  STINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNPGF 124
           STINNVPSGGVQQPTG +SSGRFASNNLPVALSQLSHGSSHGHSGVA+RGGISVVGNPGF
Sbjct: 65  STINNVPSGGVQQPTGAISSGRFASNNLPVALSQLSHGSSHGHSGVANRGGISVVGNPGF 124

Query: 125 SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI 184
           SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI
Sbjct: 125 SSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGRSI 184

Query: 185 TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP 244
           TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP
Sbjct: 185 TTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAGGP 244

Query: 245 LSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAIVSGS 304
           LSQNH+Q+VNSLNSLGMLNDVN SDNSPFDINDFPQLTSRPSSAGGPQGQL        S
Sbjct: 245 LSQNHIQNVNSLNSLGMLNDVNASDNSPFDINDFPQLTSRPSSAGGPQGQL-------SS 304

Query: 305 LRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQFSI 364
           LRKQGLSPIVQQNQEFSIQ+EDFPAL RFKGGN DYGMDIHQKDQHENSVP+MQSQQFSI
Sbjct: 305 LRKQGLSPIVQQNQEFSIQSEDFPALSRFKGGNVDYGMDIHQKDQHENSVPIMQSQQFSI 364

Query: 365 GRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPASYHQ 424
           GRSAGFNLG TY+HRPQQQQQHS AVSNSTVSFPP NNQDLLHLHGSDIFPSSH ASYHQ
Sbjct: 365 GRSAGFNLGSTYSHRPQQQQQHSPAVSNSTVSFPPANNQDLLHLHGSDIFPSSHAASYHQ 424

Query: 425 QSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQGMKSM 484
           QSSGPPGIGLRPLSSPNS SGM YDQLIQQYQQ H Q QFRLQHMSGVSQSFRDQGMKSM
Sbjct: 425 QSSGPPGIGLRPLSSPNSVSGMGYDQLIQQYQQHHSQPQFRLQHMSGVSQSFRDQGMKSM 484

Query: 485 QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP 544
           QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP
Sbjct: 485 QAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSDEP 544

Query: 545 AKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF 604
           AKGDPDFNVPQCYLIKPPP+LHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF
Sbjct: 545 AKGDPDFNVPQCYLIKPPPSLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWF 604

Query: 605 YHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRPALPQ 651
           YHKE RFWFIRVSNMEPLVKT++YERGSYLCFDP TFETVRKDNFVLHYEMVEKRPALPQ
Sbjct: 605 YHKEHRFWFIRVSNMEPLVKTSSYERGSYLCFDPHTFETVRKDNFVLHYEMVEKRPALPQ 658

BLAST of HG10001431 vs. ExPASy TrEMBL
Match: A0A1S4E1U2 (probable NOT transcription complex subunit VIP2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497394 PE=3 SV=1)

HSP 1 Score: 1173.7 bits (3035), Expect = 0.0e+00
Identity = 609/663 (91.86%), Positives = 621/663 (93.67%), Query Frame = 0

Query: 5   LQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-----------------SGALTS 64
           L SSLNGSASNLPDGTGRSF TSFSGQSGAASPVFHH                 SGALTS
Sbjct: 5   LNSSLNGSASNLPDGTGRSFATSFSGQSGAASPVFHHSGGGLHNIHGSFSIQNMSGALTS 64

Query: 65  RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP 124
           RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP
Sbjct: 65  RNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVVGNP 124

Query: 125 GFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGNIGR 184
           GFSSSTNAVGGSIPGILSTSAAIGNRN VPGLGVSPILGNAGPRITSSMGNM SGGNIGR
Sbjct: 125 GFSSSTNAVGGSIPGILSTSAAIGNRNTVPGLGVSPILGNAGPRITSSMGNMASGGNIGR 184

Query: 185 SITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYPSAG 244
           SIT GGGLSLPGLASRLNLGANSGSGSLT+QGQNRLMSGVLPQGSQQVISMLSNSYPSAG
Sbjct: 185 SITAGGGLSLPGLASRLNLGANSGSGSLTMQGQNRLMSGVLPQGSQQVISMLSNSYPSAG 244

Query: 245 GPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDINDFPQLTSRPSSAGGPQGQLRSHAIVS 304
           GPLSQ+HMQSVNSLNSLGMLN+VNT+DNSPFDINDFPQLTSRPSSAGGPQGQL       
Sbjct: 245 GPLSQSHMQSVNSLNSLGMLNEVNTNDNSPFDINDFPQLTSRPSSAGGPQGQL------- 304

Query: 305 GSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHENSVPMMQSQQF 364
            SLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQH+NSVPMMQSQQF
Sbjct: 305 SSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQHDNSVPMMQSQQF 364

Query: 365 SIGRSAGFNLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFPSSHPASY 424
           SIGRSAGFNLGGTY+HRPQQQQQHS AVSNS+VSFPP NNQDLLHLHGSD+FPSSH ASY
Sbjct: 365 SIGRSAGFNLGGTYSHRPQQQQQHSPAVSNSSVSFPPANNQDLLHLHGSDMFPSSHAASY 424

Query: 425 HQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQSFRDQGMK 484
           HQQSSGPPGIGLRPLSSP+SASGMSYDQLIQQYQQ   QSQFRLQHMSGVSQSFRDQGMK
Sbjct: 425 HQQSSGPPGIGLRPLSSPSSASGMSYDQLIQQYQQHPSQSQFRLQHMSGVSQSFRDQGMK 484

Query: 485 SMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD 544
           SMQA QSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD
Sbjct: 485 SMQATQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKTFGSPWSD 544

Query: 545 EPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG 604
           EPAKGDPDF VPQCYLIKPPPTLH+GYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG
Sbjct: 545 EPAKGDPDFIVPQCYLIKPPPTLHRGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRG 604

Query: 605 WFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEMVEKRPAL 651
           WFYHKE RFWFIRVSNMEPLVKT+TYERGSYLCFDP TFET+RKDNFVLHYEMVEKRP L
Sbjct: 605 WFYHKEHRFWFIRVSNMEPLVKTSTYERGSYLCFDPHTFETIRKDNFVLHYEMVEKRPVL 660

BLAST of HG10001431 vs. TAIR 10
Match: AT1G07705.2 (NOT2 / NOT3 / NOT5 family )

HSP 1 Score: 791.6 bits (2043), Expect = 4.9e-229
Identity = 438/673 (65.08%), Positives = 509/673 (75.63%), Query Frame = 0

Query: 2   SCSLQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-------------------S 61
           S  L SS+NGS SNL DG+GR+FT+SFSGQSGAASPVFHH                   +
Sbjct: 2   SSLLNSSINGSTSNLSDGSGRTFTSSFSGQSGAASPVFHHAGSIQGLHNIHGNFNVPNLA 61

Query: 62  GALTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGIS 121
           G+L SRNS++N VPS GVQQ  G++S+GRFAS+N+PVALSQ+SHGSSHGHSG+ +RG   
Sbjct: 62  GSLGSRNSSLNGVPSAGVQQQNGSISNGRFASSNIPVALSQMSHGSSHGHSGLTNRG--- 121

Query: 122 VVGNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSG 181
                                              GLGVSPILGN G R+TSSMGNMV G
Sbjct: 122 -----------------------------------GLGVSPILGNVGSRMTSSMGNMVGG 181

Query: 182 GNIGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNS 241
           G +GR++++GGGLS+P L SRLNL  NSGSG++   GQNR+M GVLPQGS QV+SML NS
Sbjct: 182 GTMGRTLSSGGGLSIPSLGSRLNLAVNSGSGNI---GQNRMMGGVLPQGSPQVLSMLGNS 241

Query: 242 YPSAGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDI-NDFPQLTSRPSSAGGPQGQLR 301
           YPSAGG LSQNH+Q++NSL+S+G+LND+N++D SPFDI NDFPQLTSRPSSAG  QGQL 
Sbjct: 242 YPSAGG-LSQNHVQAMNSLSSMGLLNDMNSNDTSPFDINNDFPQLTSRPSSAGS-QGQL- 301

Query: 302 SHAIVSGSLRKQGL--SPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQ-HENS 361
                 GS  KQGL  SPIVQQNQEFSIQNEDFPALP +KG +ADY MD+H K+Q HENS
Sbjct: 302 ------GSRLKQGLGISPIVQQNQEFSIQNEDFPALPGYKGSSADYPMDLHHKEQLHENS 361

Query: 362 VPMMQSQQFSIGRSAGFNLGGTYT-HRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSD 421
           V MMQSQQ S+GRS GFNLGG YT HRPQQQQQH+ AVS+S VS           LHGSD
Sbjct: 362 VLMMQSQQLSMGRSGGFNLGGAYTSHRPQQQQQHAQAVSSSGVS-----------LHGSD 421

Query: 422 IFPSSHPASYHQQSSGPPGIGLRPLSSPNSASGMSYD-QLIQQYQQPHGQSQFRLQHMSG 481
           IF SSHP  YH Q+ G PGIGLR ++S NS +GM YD QLIQQYQ     +Q+RLQ MS 
Sbjct: 422 IFSSSHP-PYHSQTGGAPGIGLRSMNSANSITGMGYDQQLIQQYQHQQNSAQYRLQQMSA 481

Query: 482 VSQSFRDQGMKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADN 541
            SQ FRD G+KSMQ+ QS+PD FGLLGLLSVI++SDPDL SLALGIDLTTLGLNLNS +N
Sbjct: 482 ASQPFRDVGLKSMQSTQSNPDRFGLLGLLSVIKMSDPDLTSLALGIDLTTLGLNLNSTEN 541

Query: 542 LHKTFGSPWSDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQ 601
           LHKTFGSPWS+EP+K DP+F+VPQCY  K PP LHQG F+K  +ETLFY+F+SMPKDEAQ
Sbjct: 542 LHKTFGSPWSNEPSKVDPEFSVPQCYYAKNPPPLHQGLFAKLLVETLFYVFYSMPKDEAQ 601

Query: 602 LYAANELYNRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVL 650
           LYAANELYNRGWFYHKE R WFIR+   EPLVKTN YERGSY CFDP +FE V+K+NFVL
Sbjct: 602 LYAANELYNRGWFYHKEHRLWFIRIG--EPLVKTNAYERGSYHCFDPNSFEIVQKENFVL 610

BLAST of HG10001431 vs. TAIR 10
Match: AT5G59710.1 (VIRE2 interacting protein 2 )

HSP 1 Score: 738.0 bits (1904), Expect = 6.4e-213
Identity = 417/669 (62.33%), Positives = 498/669 (74.44%), Query Frame = 0

Query: 4   SLQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHHS-------------------GA 63
           +L SSLNGSASNLPDG+GRSFT S+SGQSGA SP FHH+                   G 
Sbjct: 3   NLHSSLNGSASNLPDGSGRSFTASYSGQSGAPSPSFHHTGNLQGLHNIHGNYNVGNMQGT 62

Query: 64  LTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGISVV 123
           LTSRNS++N++PS GVQQP G+ SSGRFASNNLPV LSQLSHGSSHGHSG+ +R G++VV
Sbjct: 63  LTSRNSSMNSIPSAGVQQPNGSFSSGRFASNNLPVNLSQLSHGSSHGHSGIPNR-GLNVV 122

Query: 124 GNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSGGN 183
           GNPGFSS+ N VGGSIPGILSTSA + NRN+VPG+G+S +LGN+GPRIT+SMGNMV GGN
Sbjct: 123 GNPGFSSNANGVGGSIPGILSTSAGLSNRNSVPGMGISQLLGNSGPRITNSMGNMVGGGN 182

Query: 184 IGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNSYP 243
           +GR+I++ GGLS+PGL+SRLNL ANSGSG L VQGQNR+M GVLPQGS QV+SML NSY 
Sbjct: 183 LGRNISS-GGLSIPGLSSRLNLAANSGSG-LNVQGQNRMMGGVLPQGS-QVMSMLGNSYH 242

Query: 244 SAGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDI-NDFPQLTSRPSSAGGPQGQLRSH 303
           + GGPLSQNH+QSVN++    ML+D + +D+S FDI NDFPQLTSRP SAGG QG L   
Sbjct: 243 TGGGPLSQNHVQSVNNM----MLSD-HPNDSSLFDINNDFPQLTSRPGSAGGTQGHL--- 302

Query: 304 AIVSGSLRKQGLS-PIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQ-HENSVPM 363
               GSLRKQGL  P+VQQNQEFSIQNEDFPALP +KGGN++Y MD+HQK+Q H+N++ M
Sbjct: 303 ----GSLRKQGLGVPLVQQNQEFSIQNEDFPALPGYKGGNSEYPMDLHQKEQLHDNAMSM 362

Query: 364 MQSQQFSIGRSAGFNLGGTY-THRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSDIFP 423
           M SQ FS+GRS GFNLG TY +HRPQQQ QH+S+                          
Sbjct: 363 MHSQNFSMGRSGGFNLGATYSSHRPQQQPQHTSS-------------------------- 422

Query: 424 SSHPASYHQQSSGPPGIGLRPLSSPNSASGMSYDQLIQQYQQPHGQSQFRLQHMSGVSQS 483
                     + G  G+GLRPLSSPN+ S + YDQLIQQYQQ   QSQF +Q MS ++Q 
Sbjct: 423 ----------TGGLQGLGLRPLSSPNAVSSIGYDQLIQQYQQHQNQSQFPVQQMSSINQ- 482

Query: 484 FRDQGMKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADNLHKT 543
           FRD  MKS    QS  DPF LLGLL V+  S+P+L SLALGIDLTTLGL+LNS  NL+KT
Sbjct: 483 FRDSEMKS---TQSEADPFCLLGLLDVLNRSNPELTSLALGIDLTTLGLDLNSTGNLYKT 542

Query: 544 FGSPWSDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAA 603
           F SPW++EPAK + +F VP CY    PP L +  F +F+ E LFY F+SMPKDEAQLYAA
Sbjct: 543 FASPWTNEPAKSEVEFTVPNCYYATEPPPLTRASFKRFSYELLFYTFYSMPKDEAQLYAA 602

Query: 604 NELYNRGWFYHKEQRFWFIRVSNMEPLVKTNTYERGSYLCFDPQTFETVRKDNFVLHYEM 650
           +ELY RGWFYHKE R WF RV   EPLV+  TYERG+Y   DP +F+TVRK++FV+ YE+
Sbjct: 603 DELYERGWFYHKELRVWFFRVG--EPLVRAATYERGTYEYLDPNSFKTVRKEHFVIKYEL 613

BLAST of HG10001431 vs. TAIR 10
Match: AT1G07705.1 (NOT2 / NOT3 / NOT5 family )

HSP 1 Score: 685.6 bits (1768), Expect = 3.8e-197
Identity = 390/607 (64.25%), Positives = 453/607 (74.63%), Query Frame = 0

Query: 2   SCSLQSSLNGSASNLPDGTGRSFTTSFSGQSGAASPVFHH-------------------S 61
           S  L SS+NGS SNL DG+GR+FT+SFSGQSGAASPVFHH                   +
Sbjct: 2   SSLLNSSINGSTSNLSDGSGRTFTSSFSGQSGAASPVFHHAGSIQGLHNIHGNFNVPNLA 61

Query: 62  GALTSRNSTINNVPSGGVQQPTGTLSSGRFASNNLPVALSQLSHGSSHGHSGVASRGGIS 121
           G+L SRNS++N VPS GVQQ  G++S+GRFAS+N+PVALSQ+SHGSSHGHSG+ +RG   
Sbjct: 62  GSLGSRNSSLNGVPSAGVQQQNGSISNGRFASSNIPVALSQMSHGSSHGHSGLTNRG--- 121

Query: 122 VVGNPGFSSSTNAVGGSIPGILSTSAAIGNRNAVPGLGVSPILGNAGPRITSSMGNMVSG 181
                                              GLGVSPILGN G R+TSSMGNMV G
Sbjct: 122 -----------------------------------GLGVSPILGNVGSRMTSSMGNMVGG 181

Query: 182 GNIGRSITTGGGLSLPGLASRLNLGANSGSGSLTVQGQNRLMSGVLPQGSQQVISMLSNS 241
           G +GR++++GGGLS+P L SRLNL  NSGSG++   GQNR+M GVLPQGS QV+SML NS
Sbjct: 182 GTMGRTLSSGGGLSIPSLGSRLNLAVNSGSGNI---GQNRMMGGVLPQGSPQVLSMLGNS 241

Query: 242 YPSAGGPLSQNHMQSVNSLNSLGMLNDVNTSDNSPFDI-NDFPQLTSRPSSAGGPQGQLR 301
           YPSAGG LSQNH+Q++NSL+S+G+LND+N++D SPFDI NDFPQLTSRPSSAG  QGQL 
Sbjct: 242 YPSAGG-LSQNHVQAMNSLSSMGLLNDMNSNDTSPFDINNDFPQLTSRPSSAGS-QGQL- 301

Query: 302 SHAIVSGSLRKQGL--SPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQ-HENS 361
                 GS  KQGL  SPIVQQNQEFSIQNEDFPALP +KG +ADY MD+H K+Q HENS
Sbjct: 302 ------GSRLKQGLGISPIVQQNQEFSIQNEDFPALPGYKGSSADYPMDLHHKEQLHENS 361

Query: 362 VPMMQSQQFSIGRSAGFNLGGTYT-HRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLHGSD 421
           V MMQSQQ S+GRS GFNLGG YT HRPQQQQQH+ AVS+S VS           LHGSD
Sbjct: 362 VLMMQSQQLSMGRSGGFNLGGAYTSHRPQQQQQHAQAVSSSGVS-----------LHGSD 421

Query: 422 IFPSSHPASYHQQSSGPPGIGLRPLSSPNSASGMSYD-QLIQQYQQPHGQSQFRLQHMSG 481
           IF SSHP  YH Q+ G PGIGLR ++S NS +GM YD QLIQQYQ     +Q+RLQ MS 
Sbjct: 422 IFSSSHP-PYHSQTGGAPGIGLRSMNSANSITGMGYDQQLIQQYQHQQNSAQYRLQQMSA 481

Query: 482 VSQSFRDQGMKSMQAAQSSPDPFGLLGLLSVIRLSDPDLASLALGIDLTTLGLNLNSADN 541
            SQ FRD G+KSMQ+ QS+PD FGLLGLLSVI++SDPDL SLALGIDLTTLGLNLNS +N
Sbjct: 482 ASQPFRDVGLKSMQSTQSNPDRFGLLGLLSVIKMSDPDLTSLALGIDLTTLGLNLNSTEN 541

Query: 542 LHKTFGSPWSDEPAKGDPDFNVPQCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQ 584
           LHKTFGSPWS+EP+K DP+F+VPQCY  K PP LHQG F+K  +ETLFY+F+SMPKDEAQ
Sbjct: 542 LHKTFGSPWSNEPSKVDPEFSVPQCYYAKNPPPLHQGLFAKLLVETLFYVFYSMPKDEAQ 546

BLAST of HG10001431 vs. TAIR 10
Match: AT5G18230.1 (transcription regulator NOT2/NOT3/NOT5 family protein )

HSP 1 Score: 52.0 bits (123), Expect = 2.1e-06
Identity = 106/446 (23.77%), Positives = 172/446 (38.57%), Query Frame = 0

Query: 224 PSAGGPLSQNHMQSVNSLNSLGMLNDVN--TSDNSPFDI------NDFPQLTSRPSSAGG 283
           P+ G  +S      V   N +G+ ++V   TS  S   +      ND     S P     
Sbjct: 400 PANGSRISATSAAEVAKRNIMGVESNVQPLTSPLSKMVLPPTAKGNDGTASDSNPGDVAA 459

Query: 284 PQGQLRSHAIVSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQH 343
             G+  S +IVSGS  + G SP   QN+    + E  P                   DQ 
Sbjct: 460 SIGRAFSPSIVSGSQWRPG-SPFQSQNETVRGRTEIAP-------------------DQR 519

Query: 344 ENSVPMMQSQQFSIGRSAGF-NLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLH 403
           E  +  +Q  Q   G   G  +L G    +   QQQ+     +S++S P G+    +   
Sbjct: 520 EKFLQRLQQVQQGHGNLLGIPSLSGGNEKQFSSQQQNPLLQQSSSIS-PHGSLGIGVQAP 579

Query: 404 GSDIFPSSHPASYHQQSSG-PPGIGLRP---------------LSSPNSASGMSYDQLIQ 463
           G ++  S   AS  QQS+     +G +P                + P+ ++ ++  + IQ
Sbjct: 580 GFNVMSS---ASLQQQSNAMSQQLGQQPSVADVDHVRNDDQSQQNLPDDSASIAASKAIQ 639

Query: 464 Q-------YQQPHGQSQFRLQHMSGVSQSFRDQGMKSMQAAQSSPDPFGLLGLLSVIRLS 523
                   +  P G   + L  +   S      G + +Q  QSS    G++G     R S
Sbjct: 640 SEDDSKVLFDTPSGMPSYMLDPVQVSSGPDFSPG-QPIQPGQSS-SSLGVIG-----RRS 699

Query: 524 DPDLASLALGIDLTTLGLNLNSADNLHKTF-----------GSPWSD-EPAKGDPDFNVP 583
           + +L ++     +  +   +++   L   F             P+S   PA     F   
Sbjct: 700 NSELGAIGDPSAVGPMHDQMHNLQMLEAAFYKRPQPSDSERPRPYSPRNPAITPQTFPQT 759

Query: 584 QCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWFYHKEQRFWFI 626
           Q  +I  P    +     +  +TLF+ F+       Q  AA EL  + W YH++   WF 
Sbjct: 760 QAPIINNPLLWERLGSDAYGTDTLFFAFYYQQNSYQQYLAAKELKKQSWRYHRKFNTWFQ 812

BLAST of HG10001431 vs. TAIR 10
Match: AT5G18230.2 (transcription regulator NOT2/NOT3/NOT5 family protein )

HSP 1 Score: 52.0 bits (123), Expect = 2.1e-06
Identity = 106/446 (23.77%), Positives = 172/446 (38.57%), Query Frame = 0

Query: 224 PSAGGPLSQNHMQSVNSLNSLGMLNDVN--TSDNSPFDI------NDFPQLTSRPSSAGG 283
           P+ G  +S      V   N +G+ ++V   TS  S   +      ND     S P     
Sbjct: 402 PANGSRISATSAAEVAKRNIMGVESNVQPLTSPLSKMVLPPTAKGNDGTASDSNPGDVAA 461

Query: 284 PQGQLRSHAIVSGSLRKQGLSPIVQQNQEFSIQNEDFPALPRFKGGNADYGMDIHQKDQH 343
             G+  S +IVSGS  + G SP   QN+    + E  P                   DQ 
Sbjct: 462 SIGRAFSPSIVSGSQWRPG-SPFQSQNETVRGRTEIAP-------------------DQR 521

Query: 344 ENSVPMMQSQQFSIGRSAGF-NLGGTYTHRPQQQQQHSSAVSNSTVSFPPGNNQDLLHLH 403
           E  +  +Q  Q   G   G  +L G    +   QQQ+     +S++S P G+    +   
Sbjct: 522 EKFLQRLQQVQQGHGNLLGIPSLSGGNEKQFSSQQQNPLLQQSSSIS-PHGSLGIGVQAP 581

Query: 404 GSDIFPSSHPASYHQQSSG-PPGIGLRP---------------LSSPNSASGMSYDQLIQ 463
           G ++  S   AS  QQS+     +G +P                + P+ ++ ++  + IQ
Sbjct: 582 GFNVMSS---ASLQQQSNAMSQQLGQQPSVADVDHVRNDDQSQQNLPDDSASIAASKAIQ 641

Query: 464 Q-------YQQPHGQSQFRLQHMSGVSQSFRDQGMKSMQAAQSSPDPFGLLGLLSVIRLS 523
                   +  P G   + L  +   S      G + +Q  QSS    G++G     R S
Sbjct: 642 SEDDSKVLFDTPSGMPSYMLDPVQVSSGPDFSPG-QPIQPGQSS-SSLGVIG-----RRS 701

Query: 524 DPDLASLALGIDLTTLGLNLNSADNLHKTF-----------GSPWSD-EPAKGDPDFNVP 583
           + +L ++     +  +   +++   L   F             P+S   PA     F   
Sbjct: 702 NSELGAIGDPSAVGPMHDQMHNLQMLEAAFYKRPQPSDSERPRPYSPRNPAITPQTFPQT 761

Query: 584 QCYLIKPPPTLHQGYFSKFTLETLFYIFFSMPKDEAQLYAANELYNRGWFYHKEQRFWFI 626
           Q  +I  P    +     +  +TLF+ F+       Q  AA EL  + W YH++   WF 
Sbjct: 762 QAPIINNPLLWERLGSDAYGTDTLFFAFYYQQNSYQQYLAAKELKKQSWRYHRKFNTWFQ 814
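
The Identity and Positives percentages in the HSP statistics lines above are the ratio of matching positions to aligned length, so each line can be cross-checked by recomputing them. The following is a minimal sketch of such a check, not part of the database's own tooling; the function name `parse_hsp_stats` and the returned field names are hypothetical, and the pattern assumes the statistics line has exactly the layout shown on this page.

```python
import re

# Matches the statistics line printed under each HSP header on this page.
# The optional "i" accepts both "Positives" and the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus reported and recomputed percentages for one line.

    Hypothetical helper for illustration only.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed as matches / aligned length, rounded like the page.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed on the page, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }
```

For the Q8C5L3 hit, passing the line `Identity = 170/618 (27.51%), Positives = 245/618 (39.64%), Query Frame = 0` gives recomputed values that agree with the reported 27.51% and 39.64%.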

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038902637.1 | 0.0e+00 | 93.21 | probable NOT transcription complex subunit VIP2 isoform X1 [Benincasa hispida]
XP_038901690.1 | 0.0e+00 | 92.46 | probable NOT transcription complex subunit VIP2 isoform X1 [Benincasa hispida]
XP_023532180.1 | 0.0e+00 | 92.28 | probable NOT transcription complex subunit VIP2 isoform X2 [Cucurbita pepo subsp...
XP_008448344.1 | 0.0e+00 | 91.89 | PREDICTED: probable NOT transcription complex subunit VIP2 isoform X1 [Cucumis m...
XP_011649310.1 | 0.0e+00 | 92.01 | probable NOT transcription complex subunit VIP2 isoform X2 [Cucumis sativus]

Match Name | E-value | Identity (%) | Description
Q52JK6 | 1.3e-258 | 75.78 | Probable NOT transcription complex subunit VIP2 (Fragment) OS=Nicotiana benthami...
Q9FPW4 | 9.0e-212 | 62.33 | Probable NOT transcription complex subunit VIP2 OS=Arabidopsis thaliana OX=3702 ...
Q9NZN8 | 9.8e-33 | 27.11 | CCR4-NOT transcription complex subunit 2 OS=Homo sapiens OX=9606 GN=CNOT2 PE=1 S...
Q8C5L3 | 1.3e-32 | 27.51 | CCR4-NOT transcription complex subunit 2 OS=Mus musculus OX=10090 GN=Cnot2 PE=1 ...
P87240 | 2.8e-27 | 38.17 | General negative regulator of transcription subunit 2 OS=Schizosaccharomyces pom...

Match Name | E-value | Identity (%) | Description
A0A1S3BJG5 | 0.0e+00 | 91.89 | probable NOT transcription complex subunit VIP2 isoform X1 OS=Cucumis melo OX=36...
A0A6J1G5K7 | 0.0e+00 | 92.13 | probable NOT transcription complex subunit VIP2 isoform X2 OS=Cucurbita moschata...
A0A6J1G5I6 | 0.0e+00 | 91.58 | probable NOT transcription complex subunit VIP2 isoform X1 OS=Cucurbita moschata...
A0A6J1L2Z0 | 0.0e+00 | 91.98 | probable NOT transcription complex subunit VIP2 isoform X2 OS=Cucurbita maxima O...
A0A1S4E1U2 | 0.0e+00 | 91.86 | probable NOT transcription complex subunit VIP2 isoform X1 OS=Cucumis melo OX=36...

Match Name | E-value | Identity (%) | Description
AT1G07705.2 | 4.9e-229 | 65.08 | NOT2 / NOT3 / NOT5 family
AT5G59710.1 | 6.4e-213 | 62.33 | VIRE2 interacting protein 2
AT1G07705.1 | 3.8e-197 | 64.25 | NOT2 / NOT3 / NOT5 family
AT5G18230.1 | 2.1e-06 | 23.77 | transcription regulator NOT2/NOT3/NOT5 family protein
AT5G18230.2 | 2.1e-06 | 23.77 | transcription regulator NOT2/NOT3/NOT5 family protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR007282 | NOT2/NOT3/NOT5, C-terminal | PFAM | PF04153 | NOT2_3_5 | coord: 519..640; e-value: 6.5E-33; score: 113.6
IPR038635 | CCR4-NOT complex subunit 2/3/5, N-terminal domain superfamily | GENE3D | 2.30.30.1020 | - | coord: 469..650; e-value: 4.2E-66; score: 224.1
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 352..430
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 39..71
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 250..281
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 352..386
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 402..430
None | No IPR available | PANTHER | PTHR23326:SF15 | NOT TRANSCRIPTION COMPLEX SUBUNIT VIP2 ISOFORM X1-RELATED | coord: 6..650
IPR040168 | Not2/Not3/Not5 | PANTHER | PTHR23326 | CCR4 NOT-RELATED | coord: 6..650

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10001431.1 | HG10001431.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0000289 | nuclear-transcribed mRNA poly(A) tail shortening
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0030015 | CCR4-NOT core complex
cellular_component | GO:0000932 | P-body