Tan0019604 (gene) Snake gourd v1

Overview
NameTan0019604
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSAGA-Tad1 domain-containing protein
LocationLG02: 94755700 .. 94758503 (-)
RNA-Seq ExpressionTan0019604
SyntenyTan0019604
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGCAGAACTATCAAAACTCAAATAATAATCAGCAAAAACCCAACAAATAAAATCTAAAATTATAAGAGAGAGAGAGAGAAAAAAAAAAAAAAGAGAGAGAGAAAGAGAGACAGAGGTTGAAAGACAGAGAAAGAAAGCAAACGAAAGTTCGAAGCTCCGAAGATTCGGACCGAGCCATTCAGCTTCAATTTCTCTTCAATTCGTCAAGTTCGTCAAAGGTCAGAGCTCAAGTAATCCATATACCCTTGTTGTTCAAATGCTATTGAGCTTAGTTGGCGAGATTCAATGTGCTTCATATCTCTGCAATTTGTGCTCTGTTCCCTCTACTTAACTTCGCGCAACTCACTGATCTCTTTCTTCGACTACGAAATTTCGTGTTTCGAACTTCTGAGCTTGTGAATTCTGCTGAAAGAATGAGGGGTTTGTATTGGTTCGTCGACCCAATACGCGTTTTCGTTAGTTGTTGAAAAATTTTGCGTGTTCTGAACTCGGGTTTCGTTGTTCGAAGTTAGGGTTTTGATTTGCGATTGATTTGCAATAGGTTTGATTTGGTTTTGAAATTTTGGCGGGATTTAGTGAGCCATGTAGTGGGTTTAACTTCTAGGGTTGTGGTTTTGAGAAGTTGCGATTTGGAGTTGCATTTTGTGCTTTGTAGTTGGGTAAAAGAAAGGATCAGTTGGAAATATGATTGCTGCAATTCTGTTGAAATTCATTTCGGGATTTCCTTGATCTGGTGGGGTGGATTTTAGGAACTTGCTTAAATGATGCTGACCTTGAAAATTAATCTGCATTGATTGATATATGTGGGAGTGAGAACTTTGTGGTTATTTTTTTCTGTTTTTTTTGGTGAAGTTGAATAGTGATTGTGTTCCTCTTCCTGTTTGGAATTGAGTGATTTTGTAAAGGGTGTGTGAATCTGTCTGCTGAGAAATGCAACCTCAGCAGAGCTTGAGAATTGATTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACCGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTTCTTGAGTCGAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTGTCAAGCTAAGGCTGCACCACCAATAGCTGTAGCAGGCTATCCGAAAACTTCGGCGCAATCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGATGGTGGAGCTGCTTTTCCTACTCCCACTCAAAATATTCCTATATGGTCTAATGGAGGTTTTCCAATGTCTCCGAGAAAGAGCAGGTCTGGGATACGCGATCGCAAACTCAAGGACAGGCCGAGTCCACTCGGACCAAACGGGAAGGTTGAATGTATCTCACATCAATCAGTAGGCAAGGAAGATGGCAGCTACAAAATCACGATGGACAATGGCGATGCAGCTCTTTGTGACTATCAAAGGCCAGTGCAGCATCTGCAAGGAGTTGCTGAACTGCCTGAAAACAATGTTGAGGCTAGAGTTCAGCAACCAGCAGGAAAGCAAGTCCTACAGAATAAGATCCAGGTTGAGGGAACCAAGGTTGAAGACAGGGAAGAACCGGGACAGTCAAATCAGTCGTGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCCCGCAAAGCGAGGCCTGTGGATGGTGGGGGTGATTTTAGCTTTAGTGATATTGGTCATTTGTTGGATACTGAGTCATTGAGACGACGCATGGAACAAATTGCTACAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAGTATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGGCCTGCATATGAGCCCGAGAAACCTCTTGCCCACAAGCAGCAGATTCAGGGGAAAGTTATCAATGGCATGTTGCCAAATAATCAATTACATGGGCGATATAGCAATGGAAATGGAGAAGCTATGCACGAACACAGATTACAGTGCTCGATATCGTTGCTTGACTTCAAGGTAGCTATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCAGATGGAGAAAATTTGTATGTGTGCATCCGAGGAATGAAACAACTCTGATATATCTATTTATCCCATTCCACAGTTTGATTGCCCCATCATAGTTCAAAGGGGTCGAGAAGGGATCTCAAAGACCTCTCAGCTATGCTCATTTGTGTGAGTAAACAGATTGCTCAAAACTTGATAAAAGCTCTGGCTTGCAAAGATGACAAGCAGGTCAAATATCAAAATCGCATCATTTAACTGTGACTAAGCTTGGGCGCTGTCGGTCAGAATGCCCATCGGTGATCGATCTACATTTCGCATCTAAACTGGCTGTAGCCGCGATTCAAGGATCTCAAGCTTTTCATGTAAATTTAGATCACTGAGGAAATATACAAAATTTTGCAGAGATACAGTTTGTGTAGCTTGGTGGGAGGTAATTACTGGTGCAGGAGCGTAGTGTTGCTCTTTGTGTATTTAAATCTGAGAATTGATTTGAATTTCATATGAATAACTCATCAACAAACAACAATCTGCAGATAGTATCCTCTGTATTTCATGCTTACTCTTTTTTTTTTTTTTTTAGATAGAAAGATGCAGATGCTATAAAATGATATAGCATTTCTATTTAGAAAGATCTGTGTTCAAAATGTTCCACATTTGTAATGCAATAGTCAGCCA

mRNA sequence

CGCAGAACTATCAAAACTCAAATAATAATCAGCAAAAACCCAACAAATAAAATCTAAAATTATAAGAGAGAGAGAGAGAAAAAAAAAAAAAAGAGAGAGAGAAAGAGAGACAGAGGTTGAAAGACAGAGAAAGAAAGCAAACGAAAGTTCGAAGCTCCGAAGATTCGGACCGAGCCATTCAGCTTCAATTTCTCTTCAATTCGTCAAGTTCGTCAAAGGTCAGAGCTCAAGTAATCCATATACCCTTGTTGTTCAAATGCTATTGAGCTTAGTTGGCGAGATTCAATGTGCTTCATATCTCTGCAATTTGTGCTCTGTTCCCTCTACTTAACTTCGCGCAACTCACTGATCTCTTTCTTCGACTACGAAATTTCGTGTTTCGAACTTCTGAGCTTGTGAATTCTGCTGAAAGAATGAGGGGTTTGTATTGGTTCGTCGACCCAATACGCGTTTTCGTTAGTTGTTGAAAAATTTTGCGTGTTCTGAACTCGGGTTTCGTTGTTCGAAGTTAGGGTTTTGATTTGCGATTGATTTGCAATAGGTTTGATTTGGTTTTGAAATTTTGGCGGGATTTAGTGAGCCATGTAGTGGGTTTAACTTCTAGGGTTGTGGTTTTGAGAAGTTGCGATTTGGAGTTGCATTTTGTGCTTTGTAGTTGGGTAAAAGAAAGGATCAGTTGGAAATATGATTGCTGCAATTCTGTTGAAATTCATTTCGGGATTTCCTTGATCTGGTGGGGTGGATTTTAGGAACTTGCTTAAATGATGCTGACCTTGAAAATTAATCTGCATTGATTGATATATGTGGGAGTGAGAACTTTGTGGTTATTTTTTTCTGTTTTTTTTGGTGAAGTTGAATAGTGATTGTGTTCCTCTTCCTGTTTGGAATTGAGTGATTTTGTAAAGGGTGTGTGAATCTGTCTGCTGAGAAATGCAACCTCAGCAGAGCTTGAGAATTGATTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACCGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTTCTTGAGTCGAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTGTCAAGCTAAGGCTGCACCACCAATAGCTGTAGCAGGCTATCCGAAAACTTCGGCGCAATCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGATGGTGGAGCTGCTTTTCCTACTCCCACTCAAAATATTCCTATATGGTCTAATGGAGGTTTTCCAATGTCTCCGAGAAAGAGCAGGTCTGGGATACGCGATCGCAAACTCAAGGACAGGCCGAGTCCACTCGGACCAAACGGGAAGGTTGAATGTATCTCACATCAATCAGTAGGCAAGGAAGATGGCAGCTACAAAATCACGATGGACAATGGCGATGCAGCTCTTTGTGACTATCAAAGGCCAGTGCAGCATCTGCAAGGAGTTGCTGAACTGCCTGAAAACAATGTTGAGGCTAGAGTTCAGCAACCAGCAGGAAAGCAAGTCCTACAGAATAAGATCCAGGTTGAGGGAACCAAGGTTGAAGACAGGGAAGAACCGGGACAGTCAAATCAGTCGTGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCCCGCAAAGCGAGGCCTGTGGATGGTGGGGGTGATTTTAGCTTTAGTGATATTGGTCATTTGTTGGATACTGAGTCATTGAGACGACGCATGGAACAAATTGCTACAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAGTATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGGCCTGCATATGAGCCCGAGAAACCTCTTGCCCACAAGCAGCAGATTCAGGGGAAAGTTATCAATGGCATGTTGCCAAATAATCAATTACATGGGCGATATAGCAATGGAAATGGAGAAGCTATGCACGAACACAGATTACAGTGCTCGATATCGTTGCTTGACTTCAAGGTAGCTATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCAGATGGAGAAAATTTGTATGTGTGCATCCGAGGAATGAAACAACTCTGATATATCTATTTATCCCATTCCACAGTTTGATTGCCCCATCATAGTTCAAAGGGGTCGAGAAGGGATCTCAAAGACCTCTCAGCTATGCTCATTTGTGTGAGTAAACAGATTGCTCAAAACTTGATAAAAGCTCTGGCTTGCAAAGATGACAAGCAGGTCAAATATCAAAATCGCATCATTTAACTGTGACTAAGCTTGGGCGCTGTCGGTCAGAATGCCCATCGGTGATCGATCTACATTTCGCATCTAAACTGGCTGTAGCCGCGATTCAAGGATCTCAAGCTTTTCATGTAAATTTAGATCACTGAGGAAATATACAAAATTTTGCAGAGATACAGTTTGTGTAGCTTGGTGGGAGGTAATTACTGGTGCAGGAGCGTAGTGTTGCTCTTTGTGTATTTAAATCTGAGAATTGATTTGAATTTCATATGAATAACTCATCAACAAACAACAATCTGCAGATAGTATCCTCTGTATTTCATGCTTACTCTTTTTTTTTTTTTTTTAGATAGAAAGATGCAGATGCTATAAAATGATATAGCATTTCTATTTAGAAAGATCTGTGTTCAAAATGTTCCACATTTGTAATGCAATAGTCAGCCA

Coding sequence (CDS)

ATGCAACCTCAGCAGAGCTTGAGAATTGATTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACCGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTTCTTGAGTCGAAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTGTCAAGCTAAGGCTGCACCACCAATAGCTGTAGCAGGCTATCCGAAAACTTCGGCGCAATCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGATGGTGGAGCTGCTTTTCCTACTCCCACTCAAAATATTCCTATATGGTCTAATGGAGGTTTTCCAATGTCTCCGAGAAAGAGCAGGTCTGGGATACGCGATCGCAAACTCAAGGACAGGCCGAGTCCACTCGGACCAAACGGGAAGGTTGAATGTATCTCACATCAATCAGTAGGCAAGGAAGATGGCAGCTACAAAATCACGATGGACAATGGCGATGCAGCTCTTTGTGACTATCAAAGGCCAGTGCAGCATCTGCAAGGAGTTGCTGAACTGCCTGAAAACAATGTTGAGGCTAGAGTTCAGCAACCAGCAGGAAAGCAAGTCCTACAGAATAAGATCCAGGTTGAGGGAACCAAGGTTGAAGACAGGGAAGAACCGGGACAGTCAAATCAGTCGTGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCCCGCAAAGCGAGGCCTGTGGATGGTGGGGGTGATTTTAGCTTTAGTGATATTGGTCATTTGTTGGATACTGAGTCATTGAGACGACGCATGGAACAAATTGCTACAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAGTATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGGCCTGCATATGAGCCCGAGAAACCTCTTGCCCACAAGCAGCAGATTCAGGGGAAAGTTATCAATGGCATGTTGCCAAATAATCAATTACATGGGCGATATAGCAATGGAAATGGAGAAGCTATGCACGAACACAGATTACAGTGCTCGATATCGTTGCTTGACTTCAAGGTAGCTATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCAGATGGAGAAAATTTGTATGTGTGCATCCGAGGAATGA

Protein sequence

MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE
Homology
BLAST of Tan0019604 vs. NCBI nr
Match: XP_008445087.1 (PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo] >KAA0064996.1 SAGA-Tad1 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 715.3 bits (1845), Expect = 3.1e-202
Identity = 361/419 (86.16%), Postives = 373/419 (89.02%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAKAAPPI VAGYPKTS QSAKISP++EDGNEDGGA FPT TQNI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           P WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGD 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A LCDY+RPVQHLQGVAELPENN+E RV QP+GKQVL NKIQVE TKVEDREE GQSN S
Sbjct: 181 ATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSAS GG  K RPVD GGDFSF D+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
           NNQLHGR+SNGN E +HEHRLQCSISLLDFKVAMELNP QLGEDWPL +EKICM A  E
Sbjct: 361 NNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE 407

BLAST of Tan0019604 vs. NCBI nr
Match: XP_023546134.1 (uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 713.0 bits (1839), Expect = 1.5e-201
Identity = 361/419 (86.16%), Postives = 379/419 (90.45%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISPVIEDGNEDGGA FPT TQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           PIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I +DNG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMLDNGN 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A  CDYQRPVQHLQGV ELPENN+EARVQ+P+GKQVLQ  +QVEGTKVEDREE  QSN+S
Sbjct: 181 ATSCDYQRPVQHLQGVYELPENNIEARVQRPSGKQVLQ--MQVEGTKVEDREEARQSNRS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
           NNQLH  +SNGNGE +HE RL CSISLLDFKVAMELNPKQLGEDWPL +EKI M A  E
Sbjct: 361 NNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 417

BLAST of Tan0019604 vs. NCBI nr
Match: XP_004138880.1 (uncharacterized protein LOC101213741 [Cucumis sativus] >KGN62868.1 hypothetical protein Csa_022104 [Cucumis sativus])

HSP 1 Score: 708.8 bits (1828), Expect = 2.9e-200
Identity = 356/414 (85.99%), Postives = 372/414 (89.86%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAK APPI VAGYPKTS QSAKISP++EDGNEDGGA FPT TQNI
Sbjct: 61  WLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           P WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGD 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A LCDY+RPVQ+LQG+AELPENN+E RV QP+GKQ LQNKIQVE TKVEDREE GQSN S
Sbjct: 181 ATLCDYKRPVQNLQGIAELPENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSASIGGARK RPVD GGDFS SD+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICM 415
           NNQLHGR+SNG+ E +HEHRLQCSISLLDFKVAMELNP QLGEDWPL +EKICM
Sbjct: 361 NNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICM 402

BLAST of Tan0019604 vs. NCBI nr
Match: XP_022997521.1 (uncharacterized protein LOC111492414 [Cucurbita maxima])

HSP 1 Score: 703.0 bits (1813), Expect = 1.6e-198
Identity = 358/419 (85.44%), Postives = 374/419 (89.26%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISPVIEDGNEDGGA F T TQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFATSTQGI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           PIWSN GF MSPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I MDNG+
Sbjct: 121 PIWSNEGFSMSPRKCRSGIRDRKLKDRPSLLAPNLKVECISAQSACKEDGSCRIMMDNGN 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A  CDYQRPVQHLQGV ELPENN+EARVQ+P+GKQVLQ  +QVEGTKVEDREE  QSN+S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQ--MQVEGTKVEDREEARQSNRS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGPWPVFEPEKPLAHNQQIQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
           NNQLH  +SNGN E +HE RL CSISLLDFKVAMELNPKQLGEDWPL +EKI M A  E
Sbjct: 361 NNQLHRLHSNGNREVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 417

BLAST of Tan0019604 vs. NCBI nr
Match: KAG7029751.1 (hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 701.8 bits (1810), Expect = 3.5e-198
Identity = 357/419 (85.20%), Postives = 373/419 (89.02%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISPVIEDGNEDGGA FPT TQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           PIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I MDNG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGN 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A  CDYQRPVQHLQGV ELPENN+EARVQ+PAGKQVLQ        +VEDREE  QSN+S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
           NNQLH  +SNGNGE +HE RL CSISLLDFKVAMELNPKQLGEDWPL +EKI M A  E
Sbjct: 361 NNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 412

BLAST of Tan0019604 vs. ExPASy TrEMBL
Match: A0A5A7VF96 (SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003200 PE=4 SV=1)

HSP 1 Score: 715.3 bits (1845), Expect = 1.5e-202
Identity = 361/419 (86.16%), Postives = 373/419 (89.02%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAKAAPPI VAGYPKTS QSAKISP++EDGNEDGGA FPT TQNI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           P WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGD 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A LCDY+RPVQHLQGVAELPENN+E RV QP+GKQVL NKIQVE TKVEDREE GQSN S
Sbjct: 181 ATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSAS GG  K RPVD GGDFSF D+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
           NNQLHGR+SNGN E +HEHRLQCSISLLDFKVAMELNP QLGEDWPL +EKICM A  E
Sbjct: 361 NNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE 407

BLAST of Tan0019604 vs. ExPASy TrEMBL
Match: A0A1S3BCQ5 (uncharacterized protein LOC103488231 OS=Cucumis melo OX=3656 GN=LOC103488231 PE=4 SV=1)

HSP 1 Score: 715.3 bits (1845), Expect = 1.5e-202
Identity = 361/419 (86.16%), Postives = 373/419 (89.02%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAKAAPPI VAGYPKTS QSAKISP++EDGNEDGGA FPT TQNI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           P WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGD 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A LCDY+RPVQHLQGVAELPENN+E RV QP+GKQVL NKIQVE TKVEDREE GQSN S
Sbjct: 181 ATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSAS GG  K RPVD GGDFSF D+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
           NNQLHGR+SNGN E +HEHRLQCSISLLDFKVAMELNP QLGEDWPL +EKICM A  E
Sbjct: 361 NNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE 407

BLAST of Tan0019604 vs. ExPASy TrEMBL
Match: A0A0A0LM32 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G378510 PE=4 SV=1)

HSP 1 Score: 708.8 bits (1828), Expect = 1.4e-200
Identity = 356/414 (85.99%), Postives = 372/414 (89.86%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLS+KLSKNEFDK CCRVLGRENL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAK APPI VAGYPKTS QSAKISP++EDGNEDGGA FPT TQNI
Sbjct: 61  WLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           P WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH S           MDNGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGD 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A LCDY+RPVQ+LQG+AELPENN+E RV QP+GKQ LQNKIQVE TKVEDREE GQSN S
Sbjct: 181 ATLCDYKRPVQNLQGIAELPENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSASIGGARK RPVD GGDFS SD+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICM 415
           NNQLHGR+SNG+ E +HEHRLQCSISLLDFKVAMELNP QLGEDWPL +EKICM
Sbjct: 361 NNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICM 402

BLAST of Tan0019604 vs. ExPASy TrEMBL
Match: A0A6J1K7Q1 (uncharacterized protein LOC111492414 OS=Cucurbita maxima OX=3661 GN=LOC111492414 PE=4 SV=1)

HSP 1 Score: 703.0 bits (1813), Expect = 7.6e-199
Identity = 358/419 (85.44%), Postives = 374/419 (89.26%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISPVIEDGNEDGGA F T TQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFATSTQGI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           PIWSN GF MSPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I MDNG+
Sbjct: 121 PIWSNEGFSMSPRKCRSGIRDRKLKDRPSLLAPNLKVECISAQSACKEDGSCRIMMDNGN 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A  CDYQRPVQHLQGV ELPENN+EARVQ+P+GKQVLQ  +QVEGTKVEDREE  QSN+S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQ--MQVEGTKVEDREEARQSNRS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGPWPVFEPEKPLAHNQQIQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
           NNQLH  +SNGN E +HE RL CSISLLDFKVAMELNPKQLGEDWPL +EKI M A  E
Sbjct: 361 NNQLHRLHSNGNREVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 417

BLAST of Tan0019604 vs. ExPASy TrEMBL
Match: A0A6J1HF85 (uncharacterized protein LOC111463000 OS=Cucurbita moschata OX=3662 GN=LOC111463000 PE=4 SV=1)

HSP 1 Score: 700.7 bits (1807), Expect = 3.8e-198
Identity = 356/419 (84.96%), Postives = 373/419 (89.02%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLS+KLSKNEFDKLCCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
           WLHNQLIQSILKNACQAKAAPPI  AGYPKTS Q+AKISPVIEDGNEDGGA FPT TQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGD 180
           PIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QS  KEDGS +I MDNG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGN 180

Query: 181 AALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQS 240
           A  CDYQRPVQHLQGV ELPENN+EARVQ+P+GKQVLQ        +VEDREE  QSN+S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQ-------MQVEDREEARQSNRS 240

Query: 241 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQ 300
            LLRSRLLAPLGIPFCSASIGGA K RPVD GG+FSFSD+GHLLDTESLRRRMEQIA VQ
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQ 300

Query: 301 GLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLP 360
           GLGSVSADCA+ILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLP
Sbjct: 301 GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFEPEKPLAHNQQIQGKVINGMLP 360

Query: 361 NNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
           NNQLH  +SNGNGE +HE RL CSISLLDFKVAMELNPKQLGEDWPL +EKI M A  E
Sbjct: 361 NNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFTE 412

BLAST of Tan0019604 vs. TAIR 10
Match: AT2G24530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 348.6 bits (893), Expect = 7.0e-96
Identity = 201/422 (47.63%), Postives = 264/422 (62.56%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLS+KL+K+EFDK C R+LGRENL
Sbjct: 1   MQRSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGY-PKTSAQSAKISPVIEDGNEDGGAAFPTPTQN 120
            LHNQLI+SIL+NA  AK+ PP   AG+  K +A  ++      DG E  G   P  +Q+
Sbjct: 61  SLHNQLIRSILRNATVAKSPPPDHEAGHSTKANAFQSR-----GDGLEQSGTLIPNHSQH 120

Query: 121 IPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNG 180
            P+WSNG  P+SPRK RSG+++RK +DRPSPLG NGKVE + HQ V +ED    + M+NG
Sbjct: 121 EPVWSNGVLPISPRKVRSGMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENG 180

Query: 181 DAALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQ 240
                DYQR  +++        +  +    +P  K  + NK ++    + D +   +  +
Sbjct: 181 -----DYQRSGRYV-------ADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQAR 240

Query: 241 SCLLRSRLLAPLGIPFCSASIGGARKARPVDGGGD-FSFSDIGHLLDTESLRRRMEQIAT 300
             L  S L+APLGIPFCSAS+GG+ +  PV    +  S  D G L D E LR+RME IA 
Sbjct: 241 VNLSMSPLIAPLGIPFCSASVGGSPRTIPVSTNAELISCYDSGGLPDIEMLRKRMENIAV 300

Query: 301 VQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVING 360
            QGL  VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    KQQ Q K++NG
Sbjct: 301 AQGLEGVSMECAKTLNNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNG 360

Query: 361 MLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCAS 420
           + P N L  +  NG+ +   +H    S+S+LDF+ AMELNP+QLGEDWP   E+I + + 
Sbjct: 361 VWPTNSLKIQTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGEDWPTLRERISLRSF 402

BLAST of Tan0019604 vs. TAIR 10
Match: AT4G31440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2; Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 277.7 bits (709), Expect = 1.5e-74
Identity = 187/422 (44.31%), Postives = 240/422 (56.87%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENL 60
           MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLS+KL+K+EFDK C R+LGRENL
Sbjct: 1   MQRLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNI 120
            LHN+LI+SIL+NA  AK+ P +  +G+P  S    K     EDG E+  +  P   +N 
Sbjct: 61  SLHNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLGK-----EDGPEESRSLNPDHIRND 120

Query: 121 PIWSNGGFPMSPRKSRSG-IRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNG 180
              SNG       K R G   DR ++D+P PLG NGKV                     G
Sbjct: 121 LALSNGVL----AKVRPGTCDDRTIRDKPCPLGSNGKV--------------------LG 180

Query: 181 DAALCDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQ 240
             A   Y RP ++       P+   ++    PA ++ +  K QV      D E    +  
Sbjct: 181 PFA---YSRPGRY-------PDER-DSAFLCPAEQKAVSGKDQVAAPISRDDE----AQV 240

Query: 241 SCLLRSRLLAPLGIPFCSASIGGARKARPVD-GGGDFSFSDIGHLLDTESLRRRMEQIAT 300
             L    ++APLGIPFCSAS+GG R+  PV       S  D G L DTE LR+RME IA 
Sbjct: 241 RILSTPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLRKRMENIAV 300

Query: 301 VQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVING 360
            QGLG VSA+C+ +LN +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG
Sbjct: 301 TQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQSRDELVNG 360

Query: 361 MLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCAS 420
           +  NN  H + SN   +   E   Q S+SLLDF+VAMELNP QLGEDWPL  E+I +   
Sbjct: 361 VRTNNSFHIQTSNQPSDITRE---QHSVSLLDFRVAMELNPHQLGEDWPLLRERISISLF 375

BLAST of Tan0019604 vs. TAIR 10
Match: AT4G33890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 161.8 bits (408), Expect = 1.2e-39
Identity = 135/424 (31.84%), Postives = 195/424 (45.99%), Query Frame = 0

Query: 4   QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLH 63
           Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LH
Sbjct: 5   QGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLH 64

Query: 64  NQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNIPIW 123
           N+LI+SI+KNAC AK+ P I   G              +  GN D        +Q  P+ 
Sbjct: 65  NRLIRSIIKNACIAKSPPFIKKGG------------SFVRFGNGDS----KKNSQIQPLH 124

Query: 124 SNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAAL 183
            +  F  S RK RS    RKL+DRPSPLGP GK               + +T  N     
Sbjct: 125 GDSAFSPSTRKCRS----RKLRDRPSPLGPLGK--------------PHSLTTTN----- 184

Query: 184 CDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQ---SNQS 243
              +  +   Q   EL                 L ++  VE   VE+ EE  Q    + S
Sbjct: 185 ---EESMSKAQSATELLS---------------LGSRPPVEVVSVEEGEEVEQIAGGSPS 244

Query: 244 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFS-----DIGHLLDTESLRRRMEQ 303
              R  L APLG+   S   G  RK+         SF+     + G L DT +LR R+E+
Sbjct: 245 VQSRCPLTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLER 304

Query: 304 IATVQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVI 363
              ++GL  ++ D  S+LN  LDV++++LI  C+ L       +                
Sbjct: 305 RLEMEGL-KITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD---------------- 342

Query: 364 NGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMC 420
                      R    N +   + R    +S+ DF+  MELN + LGEDWP+ MEKIC  
Sbjct: 365 -----------RVREMNYQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSR 342

BLAST of Tan0019604 vs. TAIR 10
Match: AT4G33890.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 161.8 bits (408), Expect = 1.2e-39
Identity = 135/424 (31.84%), Postives = 195/424 (45.99%), Query Frame = 0

Query: 4   QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLH 63
           Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LH
Sbjct: 5   QGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLH 64

Query: 64  NQLIQSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNIPIW 123
           N+LI+SI+KNAC AK+ P I   G              +  GN D        +Q  P+ 
Sbjct: 65  NRLIRSIIKNACIAKSPPFIKKGG------------SFVRFGNGDS----KKNSQIQPLH 124

Query: 124 SNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAAL 183
            +  F  S RK RS    RKL+DRPSPLGP GK               + +T  N     
Sbjct: 125 GDSAFSPSTRKCRS----RKLRDRPSPLGPLGK--------------PHSLTTTN----- 184

Query: 184 CDYQRPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQ---SNQS 243
              +  +   Q   EL                 L ++  VE   VE+ EE  Q    + S
Sbjct: 185 ---EESMSKAQSATELLS---------------LGSRPPVEVVSVEEGEEVEQIAGGSPS 244

Query: 244 CLLRSRLLAPLGIPFCSASIGGARKARPVDGGGDFSFS-----DIGHLLDTESLRRRMEQ 303
              R  L APLG+   S   G  RK+         SF+     + G L DT +LR R+E+
Sbjct: 245 VQSRCPLTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLER 304

Query: 304 IATVQGLGSVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVI 363
              ++GL  ++ D  S+LN  LDV++++LI  C+ L       +                
Sbjct: 305 RLEMEGL-KITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD---------------- 342

Query: 364 NGMLPNNQLHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMC 420
                      R    N +   + R    +S+ DF+  MELN + LGEDWP+ MEKIC  
Sbjct: 365 -----------RVREMNYQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSR 342

BLAST of Tan0019604 vs. TAIR 10
Match: AT2G14850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 161.0 bits (406), Expect = 2.1e-39
Identity = 130/416 (31.25%), Postives = 189/416 (45.43%), Query Frame = 0

Query: 8   RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSRKLSKNEFDKLCCRVLGRENLWLHNQLI 67
           R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L+
Sbjct: 9   RLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRLV 68

Query: 68  QSILKNACQAKAAPPIAVAGYPKTSAQSAKISPVIEDGNEDGGAAFPTPTQNIPIWSNGG 127
           +SILKNA  AK+ PP     YPK S                             ++ +  
Sbjct: 69  RSILKNASVAKSPPP----RYPKKS-----------------------------LYGDPV 128

Query: 128 FPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSVGKEDGSYKITMDNGDAALCDYQ 187
           FP SPRK RS    RK +DRPSPLGP GK + ++                  D ++   Q
Sbjct: 129 FPPSPRKCRS----RKFRDRPSPLGPLGKPQSLT---------------TTNDESMSKAQ 188

Query: 188 RPVQHLQGVAELPENNVEARVQQPAGKQVLQNKIQVEGTKVEDREEPGQSNQSCLLRSR- 247
           R                                + +E   VED EE  Q   S  ++SR 
Sbjct: 189 R--------------------------------LPMEVVSVEDGEEVEQMTGSPSVQSRS 248

Query: 248 -LLAPLGIPFCSASIGGARKAR--PVDGGGDFSFSDIGHLLDTESLRRRMEQIATVQGLG 307
            L APLG+ F   S     KAR    +G    +    G L D  +LR R+E+   ++G+ 
Sbjct: 249 PLTAPLGVSFHLKS-----KARFSTYNGINRETCQSSGELPDMITLRARLEKKLEMEGI- 291

Query: 308 SVSADCASILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQ 367
            +S D A++LN+ L+ Y+++LI  C+ L                                
Sbjct: 309 KLSMDSANLLNRGLNAYMRRLIEPCLSL-------------------------------- 291

Query: 368 LHGRYSNGNGEAMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLQMEKICMCASEE 420
                      A  + R   ++S+LDF  AME+NP+ LGE+WP+Q+EKIC  ASEE
Sbjct: 369 -----------ASQQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_008445087.13.1e-20286.16PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo] >KAA0064996.1 SAG... [more]
XP_023546134.11.5e-20186.16uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo][more]
XP_004138880.12.9e-20085.99uncharacterized protein LOC101213741 [Cucumis sativus] >KGN62868.1 hypothetical ... [more]
XP_022997521.11.6e-19885.44uncharacterized protein LOC111492414 [Cucurbita maxima][more]
KAG7029751.13.5e-19885.20hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyro... [more]
Match NameE-valueIdentityDescription
A0A5A7VF961.5e-20286.16SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6... [more]
A0A1S3BCQ51.5e-20286.16uncharacterized protein LOC103488231 OS=Cucumis melo OX=3656 GN=LOC103488231 PE=... [more]
A0A0A0LM321.4e-20085.99Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G378510 PE=4 SV=1[more]
A0A6J1K7Q17.6e-19985.44uncharacterized protein LOC111492414 OS=Cucurbita maxima OX=3661 GN=LOC111492414... [more]
A0A6J1HF853.8e-19884.96uncharacterized protein LOC111463000 OS=Cucurbita moschata OX=3662 GN=LOC1114630... [more]
Match NameE-valueIdentityDescription
AT2G24530.17.0e-9647.63unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G31440.11.5e-7444.31unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G33890.11.2e-3931.84unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G33890.21.2e-3931.84unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G14850.12.1e-3931.25unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PFAMPF12767SAGA-Tad1coord: 5..332
e-value: 3.0E-63
score: 213.7
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PANTHERPTHR21277TRANSCRIPTIONAL ADAPTER 1coord: 1..419
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 122..158
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 135..149
NoneNo IPR availablePANTHERPTHR21277:SF38TRANSCRIPTIONAL REGULATOR OF RNA POLII, SAGA, SUBUNITcoord: 1..419

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0019604.1Tan0019604.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0000124 SAGA complex
cellular_component GO:0070461 SAGA-type complex
molecular_function GO:0003713 transcription coactivator activity