Tan0007643 (gene) Snake gourd v1

Overview
NameTan0007643
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA ligase 1 isoform X1
LocationLG02: 96616741 .. 96619109 (+)
RNA-Seq ExpressionTan0007643
SyntenyTan0007643
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCGTGAGAGGCATTTAGGTTTTCCAATCTGCCGCTTCATCGAAAGAGAAGTCTTCTTCGTTCTCCGCTAGGGTTTTCCATCGTGATCGTCTTTCCCTTCTGTTTTCTCAAATCCGAACTGCTTTTTCCTCTGTTTTCTTATAAATCTTCGACGAATCCTTCCCATAATAGACACAGATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTACGCGAGGAAGGTGGCTAGGACCGAGGCGGCCTTGATCGAATCGATTAAGGTTTGTTTGAGCTATTTGGTTGGTTTTCAATAATGTATGCTGGATTCAATTGTGTTCTGATTCCAATTGATTGCGAGATCTTCTTGGAATTGTAACTTTTTGGATGGGGGTATATCTATCATCAGGAACGGCTTTGTTTTTTTGTGGTTTTTACGGCGTCGGGGAAGGAATCGTAATGGGTTTCTTGCCTTTGACTGGGAGAGTTACAAGTTTTGTACCCTAATGATAGGAATCATGATTTTAGGTTATTGGGAGTTTAGATTCTAGTTTCAGTTTTGTTCTTCATGTGTATGTAGTAATTGACGTGTAATTTCTCCTTAAAGAATAAGAGATCTTTGAGAGTTCCAGTGGTATAGATAGTGGTTTTGGGACTCTTAGTTGCATGTCTTACAAATCTTTCTTGGATCTTGTGTGGATCTTCGGGTTTCACCTGTCTAAACACTTAAGTAAACATCTGAATCGTATAACTTTTTAATGAATCGTCTATTTTTCATGGGAACGTTATGGTTTTTTAAATATATTTTAAATATAGTTCCTGGGTTTTCAAGTGAATTGTTTTCTACCAAATTTAAGTGGAAGAGAATTAGGGTTTCTTCTAGTAGTTTGATAAATATGAGCGGCTTTGTATCAGTTTTTCGCAGAACCTTTTTTCCCCCTGATATCCTTAGGCATATGATTACATGATGTGAATGTACGAGTTTACTACTTAAAATGAATGATTATGAGTTGGGATGAACATTCTCCTTAACAATTGGAATGGCTACCCTTTTGAAGGGTTGTATTTTATTAGTTAGTTAGGGCATGGTTGATCTGTTTTGTGCTTGCGCCGTCTGCACTTTTTGAAATGTTATTCTGGTGCGTTATAGCTCGTATCTGAAAGACGAGAAAGCAGGACTGAGAGAAAGAAAGAGAAAAGTAAGCACAAGAAAGAGAAGAAGAGTAAGGACAAGAAACACAAAAGCAAAGAACATAAAGAAAAATCTTCTCGTAGCCGTGACTTGAGTGATCGGAAACGAAAGGAAGCCAAGGACCTTCCAAAGGAAGCCAAAGTTGAAGCAGAACAATTAGAAAAGAGTGGTCTCACTGAAGAGCATGGACAACCAGTATGGCCTCAAAGCCCTGGCTACTTGTCTGACGGATCTCAGATCAACCACAAGAGGAAAAGGGATGCTTCATTACTGCCTAATCAAGGTTCTAAACCTGGTGGGTTCTCTTTATAGATAATAGGATCTTTTATGAAATATAAGTTCTCTTGGCCATTTGAAAACATAGTTTTATATTATTGATCACGAACAGGAAAAATCATACGGATCAAACTGGGTTCTTCACTAAGCCGGCAAGAGTCAGTTGGCAGCCAACAAACGTGTTCTACATCTGGTCGTGATAGTTCTCTTGATCAAAAGAGAGATGAGAACAGACGTGGACCACCCATTCAGCAAAAACCTTGCCTCACAATTGCTGACACAGCTGGTTCTGTCAAGGATCCCATTTCTAAACCTGTGATCAAAGACCCTTCTTCGCATGCTGTCAAGGATCCCATTGCTAAACCTAAGATCAAAGTCTCTTCTTCGCATGCTGTCAAGGATCCCATTTCTAAACCTAAGATCGAAGTCCCTTCTTTGCATGCTGTCAAGGATCCCATTTCTAAACCTAAGATCAAAGTCCCTTCTTCGCATGCTGTCAAGGACATTGGTACTCATCAAGGTAATGTTGCGTCAGTGTCACTACCTCCCCGCACAAAAAGTCCTGCTGAATCTGCTTATGAGGCCTTATTTGAGAAGTGGGTACCACCTCCACTTCAGTTGGAGCAACAAATGAATGATGAAGAATGGCTCTTTGGAACAAGAAAACAGCAAGTGGGACAAACTACGAAGACCAACAACAAAGCTTTCAGTCATGTTCCCAGCTGTAGAAGTTTGAGTCTGTGGCCAAGAGGACAATATCTGCCGGATGCGGATGTTTATTCATTGCCTTATACGATCCCATTTTGATTTCGAATTCTTTTCTTCAGAGAGAGTAATATGTACAGTACTACGCCAGTAGGAACAATCTTTTTCCTTGGCGTTATAATTGTTTTAGAATCAATATAATTGTCATTCAATTTTGTTC

mRNA sequence

GCGTGAGAGGCATTTAGGTTTTCCAATCTGCCGCTTCATCGAAAGAGAAGTCTTCTTCGTTCTCCGCTAGGGTTTTCCATCGTGATCGTCTTTCCCTTCTGTTTTCTCAAATCCGAACTGCTTTTTCCTCTGTTTTCTTATAAATCTTCGACGAATCCTTCCCATAATAGACACAGATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTACGCGAGGAAGGTGGCTAGGACCGAGGCGGCCTTGATCGAATCGATTAAGCTCGTATCTGAAAGACGAGAAAGCAGGACTGAGAGAAAGAAAGAGAAAAGTAAGCACAAGAAAGAGAAGAAGAGTAAGGACAAGAAACACAAAAGCAAAGAACATAAAGAAAAATCTTCTCGTAGCCGTGACTTGAGTGATCGGAAACGAAAGGAAGCCAAGGACCTTCCAAAGGAAGCCAAAGTTGAAGCAGAACAATTAGAAAAGAGTGGTCTCACTGAAGAGCATGGACAACCAGTATGGCCTCAAAGCCCTGGCTACTTGTCTGACGGATCTCAGATCAACCACAAGAGGAAAAGGGATGCTTCATTACTGCCTAATCAAGGTTCTAAACCTGGAAAAATCATACGGATCAAACTGGGTTCTTCACTAAGCCGGCAAGAGTCAGTTGGCAGCCAACAAACGTGTTCTACATCTGGTCGTGATAGTTCTCTTGATCAAAAGAGAGATGAGAACAGACGTGGACCACCCATTCAGCAAAAACCTTGCCTCACAATTGCTGACACAGCTGGTTCTGTCAAGGATCCCATTTCTAAACCTGTGATCAAAGACCCTTCTTCGCATGCTGTCAAGGATCCCATTGCTAAACCTAAGATCAAAGTCTCTTCTTCGCATGCTGTCAAGGATCCCATTTCTAAACCTAAGATCGAAGTCCCTTCTTTGCATGCTGTCAAGGATCCCATTTCTAAACCTAAGATCAAAGTCCCTTCTTCGCATGCTGTCAAGGACATTGGTACTCATCAAGGTAATGTTGCGTCAGTGTCACTACCTCCCCGCACAAAAAGTCCTGCTGAATCTGCTTATGAGGCCTTATTTGAGAAGTGGGTACCACCTCCACTTCAGTTGGAGCAACAAATGAATGATGAAGAATGGCTCTTTGGAACAAGAAAACAGCAAGTGGGACAAACTACGAAGACCAACAACAAAGCTTTCAGTCATGTTCCCAGCTGTAGAAGTTTGAGTCTGTGGCCAAGAGGACAATATCTGCCGGATGCGGATGTTTATTCATTGCCTTATACGATCCCATTTTGATTTCGAATTCTTTTCTTCAGAGAGAGTAATATGTACAGTACTACGCCAGTAGGAACAATCTTTTTCCTTGGCGTTATAATTGTTTTAGAATCAATATAATTGTCATTCAATTTTGTTC

Coding sequence (CDS)

ATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTACGCGAGGAAGGTGGCTAGGACCGAGGCGGCCTTGATCGAATCGATTAAGCTCGTATCTGAAAGACGAGAAAGCAGGACTGAGAGAAAGAAAGAGAAAAGTAAGCACAAGAAAGAGAAGAAGAGTAAGGACAAGAAACACAAAAGCAAAGAACATAAAGAAAAATCTTCTCGTAGCCGTGACTTGAGTGATCGGAAACGAAAGGAAGCCAAGGACCTTCCAAAGGAAGCCAAAGTTGAAGCAGAACAATTAGAAAAGAGTGGTCTCACTGAAGAGCATGGACAACCAGTATGGCCTCAAAGCCCTGGCTACTTGTCTGACGGATCTCAGATCAACCACAAGAGGAAAAGGGATGCTTCATTACTGCCTAATCAAGGTTCTAAACCTGGAAAAATCATACGGATCAAACTGGGTTCTTCACTAAGCCGGCAAGAGTCAGTTGGCAGCCAACAAACGTGTTCTACATCTGGTCGTGATAGTTCTCTTGATCAAAAGAGAGATGAGAACAGACGTGGACCACCCATTCAGCAAAAACCTTGCCTCACAATTGCTGACACAGCTGGTTCTGTCAAGGATCCCATTTCTAAACCTGTGATCAAAGACCCTTCTTCGCATGCTGTCAAGGATCCCATTGCTAAACCTAAGATCAAAGTCTCTTCTTCGCATGCTGTCAAGGATCCCATTTCTAAACCTAAGATCGAAGTCCCTTCTTTGCATGCTGTCAAGGATCCCATTTCTAAACCTAAGATCAAAGTCCCTTCTTCGCATGCTGTCAAGGACATTGGTACTCATCAAGGTAATGTTGCGTCAGTGTCACTACCTCCCCGCACAAAAAGTCCTGCTGAATCTGCTTATGAGGCCTTATTTGAGAAGTGGGTACCACCTCCACTTCAGTTGGAGCAACAAATGAATGATGAAGAATGGCTCTTTGGAACAAGAAAACAGCAAGTGGGACAAACTACGAAGACCAACAACAAAGCTTTCAGTCATGTTCCCAGCTGTAGAAGTTTGAGTCTGTGGCCAAGAGGACAATATCTGCCGGATGCGGATGTTTATTCATTGCCTTATACGATCCCATTTTGA

Protein sequence

MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
Homology
BLAST of Tan0007643 vs. NCBI nr
Match: XP_023545923.1 (uncharacterized protein LOC111805212 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 400.6 bits (1028), Expect = 1.5e-107
Identity = 249/381 (65.35%), Postives = 279/381 (73.23%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKS
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKE-KSKDRKHKS 60

Query: 61  KEH---KEKSSRSRDLSDRKR----KEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSP 120
           KE    KEKSSRSRDL+D+K+    KE KD  +  KVEAEQLEKSGLTEEHGQPVWP SP
Sbjct: 61  KERKESKEKSSRSRDLNDQKQKACVKEVKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSP 120

Query: 121 GYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDS 180
           GYLSDG+QIN KRKRD SL P++G KPGK+IRIKL SSLS+QE  S GS+Q CS SGRD 
Sbjct: 121 GYLSDGTQINQKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGSEQMCSVSGRDC 180

Query: 181 SLDQKRDENRRGPPIQQKPCLTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVS 240
           S DQK DEN     +++  C   ++TA +VKD   SKP IKDP  HAVKD  +       
Sbjct: 181 SRDQKSDEN---SSVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTS------- 240

Query: 241 SSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS 300
                                     SKPKIK PS HAVK+I +  GNV S+   PRT+S
Sbjct: 241 --------------------------SKPKIKDPSPHAVKEISS-LGNVMSL---PRTRS 300

Query: 301 PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSL 360
           P ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS VPSCRS SL
Sbjct: 301 PVESAYEALFEKWVPPPLQLEQQMDDEEWLFRTEKQD-GRSTKT-NEAFSSVPSCRSSSL 338

Query: 361 WPRGQYLPDADVYSLPYTIPF 372
           WPRGQYL DADVYSLPYTIP+
Sbjct: 361 WPRGQYLADADVYSLPYTIPY 338

BLAST of Tan0007643 vs. NCBI nr
Match: XP_022997629.1 (uncharacterized protein LOC111492505 isoform X1 [Cucurbita maxima])

HSP 1 Score: 397.5 bits (1020), Expect = 1.3e-106
Identity = 247/381 (64.83%), Postives = 278/381 (72.97%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKS
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKE-KSKDRKHKS 60

Query: 61  ---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSP 120
              KE KEKSSRSRDL+D+K     KEAKD  +  KVEAEQLEKSGLTEEHGQPVWP SP
Sbjct: 61  NERKESKEKSSRSRDLNDQKHKVCVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSP 120

Query: 121 GYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDS 180
           GYLSDG+QINHKRKRD SL P++G KPGK+IRIKL SSLS+QE  S G + TCS SGRD 
Sbjct: 121 GYLSDGTQINHKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGCELTCSVSGRDI 180

Query: 181 SLDQKRDENRRGPPIQQKPCLTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVS 240
           S DQK DEN     +++  C   ++TA +VKD   SKP IKDP  HAVKD  +       
Sbjct: 181 SRDQKSDEN--SSVVRRSTCFANSETARAVKDCTSSKPKIKDPPPHAVKDRTS------- 240

Query: 241 SSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS 300
                                     SKPKIK PS HAVK+I +  GNV S+   PRT+S
Sbjct: 241 --------------------------SKPKIKDPSPHAVKEISS-LGNVMSL---PRTRS 300

Query: 301 PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSL 360
           P ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS +PSCR+ SL
Sbjct: 301 PVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQD-GRSTKT-NEAFSSIPSCRNSSL 339

Query: 361 WPRGQYLPDADVYSLPYTIPF 372
           WPRGQYL  ADVYSLPYTIP+
Sbjct: 361 WPRGQYLAVADVYSLPYTIPY 339

BLAST of Tan0007643 vs. NCBI nr
Match: XP_022144272.1 (chromatin assembly factor 1 subunit A-like [Momordica charantia])

HSP 1 Score: 394.4 bits (1012), Expect = 1.1e-105
Identity = 247/380 (65.00%), Postives = 271/380 (71.32%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK--KSKDKKH 60
           MSRCFPYPPPGYA KVARTEAALIESIKL SER++S+ +RKKEKSKH+KE+  KSK+KK 
Sbjct: 1   MSRCFPYPPPGYAGKVARTEAALIESIKLQSERQQSKHDRKKEKSKHRKERSEKSKEKKQ 60

Query: 61  KSKEHKEKSSRSRDLSDRKRKE----AKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPG 120
           + KE KEKSS S DL+D+K+KE    A+D  K  KVEAEQLEKSGLTEEHGQPVWPQSPG
Sbjct: 61  RRKERKEKSSCSCDLNDQKQKECAKQAEDRLKGTKVEAEQLEKSGLTEEHGQPVWPQSPG 120

Query: 121 YLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSS 180
           YLSDG+QINHKRKRDA L PN+ SKPGKIIRIKL SSLS QE  S  +QQTCSTSGR   
Sbjct: 121 YLSDGTQINHKRKRDAKLQPNEDSKPGKIIRIKLASSLSNQEDSSADTQQTCSTSGRYDC 180

Query: 181 LDQKRDENRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSS 240
           +DQKRDEN  GP  QQKPC T ++T  +V++   KP IKD S                  
Sbjct: 181 VDQKRDENSCGPN-QQKPCFTNSNTVVAVEEAPPKPRIKDHSR----------------- 240

Query: 241 HAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA 300
                                           S HAVKDI   QGNV  V  P RT+SPA
Sbjct: 241 --------------------------------SVHAVKDI-RPQGNV--VPFPTRTRSPA 300

Query: 301 ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTK-TNNKAFSHVPSCRSLSLW 360
           ES YEALFEKW+PPPLQLEQQM+DEEWLFGTRKQ  GQTTK T NKAFS VPSCRS SLW
Sbjct: 301 ESEYEALFEKWIPPPLQLEQQMDDEEWLFGTRKQD-GQTTKATTNKAFSPVPSCRSSSLW 326

Query: 361 PRGQYLPDADVYSLPYTIPF 372
           PRGQYLPDADVYSLPYTIPF
Sbjct: 361 PRGQYLPDADVYSLPYTIPF 326

BLAST of Tan0007643 vs. NCBI nr
Match: KAG7029449.1 (hypothetical protein SDJN02_07788 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 390.6 bits (1002), Expect = 1.5e-104
Identity = 245/379 (64.64%), Postives = 276/379 (72.82%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKS
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKE-KSKDRKHKS 60

Query: 61  KEHKE-KSSRSRDLSDRKR----KEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPGY 120
           KE KE K   SR L+D+K+    KEAKD  +  KVEAEQLEKSGLTEEHGQPVWP SPGY
Sbjct: 61  KERKERKEKSSRSLNDQKQKACVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSPGY 120

Query: 121 LSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSL 180
           LSDG+QINHKRKRD SL P++G KPGK+IRIKL SSLS+QE  S  S+QTCS SG D S 
Sbjct: 121 LSDGTQINHKRKRD-SLQPDEGCKPGKVIRIKLASSLSQQENSSADSEQTCSVSGCDCSR 180

Query: 181 DQKRDENRRGPPIQQKPCLTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSS 240
           DQKRDEN     +++  C   ++TA +VKD   SKP IKDP  HAVKD  +         
Sbjct: 181 DQKRDEN--SSVVRRSTCFANSETALAVKDCTSSKPKIKDPPPHAVKDRTS--------- 240

Query: 241 HAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA 300
                                   SKPKIK P  HAVK+I +  GNV S+   PRT+SP 
Sbjct: 241 ------------------------SKPKIKDPPPHAVKEISS-LGNVMSL---PRTRSPV 300

Query: 301 ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWP 360
           ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS +PSCRS SLWP
Sbjct: 301 ESAYEALFEKWVPPPLQLEQQMDDEEWLFQTEKQD-GRSTKT-NEAFSSIPSCRSSSLWP 336

Query: 361 RGQYLPDADVYSLPYTIPF 372
           RGQYL DADVYSLPYTIP+
Sbjct: 361 RGQYLADADVYSLPYTIPY 336

BLAST of Tan0007643 vs. NCBI nr
Match: XP_038885448.1 (DNA ligase 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 389.4 bits (999), Expect = 3.4e-104
Identity = 244/382 (63.87%), Postives = 268/382 (70.16%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK------KSK 60
           MSRCFPYPPPGY RKVA TEAALIESIKL SERR+S+ + KKEKSKHKKEK      +SK
Sbjct: 1   MSRCFPYPPPGYVRKVASTEAALIESIKLQSERRQSKNDSKKEKSKHKKEKSKDKKERSK 60

Query: 61  DKKHKSKEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWP 120
           DKKHKSKE KEKSS SRDL+D+K+    KEAK+L K  KVEAEQLE+SGLTEEHGQPVWP
Sbjct: 61  DKKHKSKERKEKSSHSRDLNDQKQKECLKEAKELLKGTKVEAEQLERSGLTEEHGQPVWP 120

Query: 121 QSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRD 180
           QSPGYLSDG+QINHKRKRDASL  N+G KPGKIIRIKL  S     S GS+QTCSTSGRD
Sbjct: 121 QSPGYLSDGTQINHKRKRDASLQSNEGCKPGKIIRIKLSLSQQEDSSAGSEQTCSTSGRD 180

Query: 181 SSLDQKRDENRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVS 240
            S+DQKRDEN RG  IQQ    T A TA +V DP S                        
Sbjct: 181 ISVDQKRDENSRG-SIQQNTGFTYAGTAVAVNDPSS------------------------ 240

Query: 241 SSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS 300
                    SKPKI+                      +VKDI + +GNV  VSLPPRT+S
Sbjct: 241 ---------SKPKIQ----------------------SVKDISS-KGNV--VSLPPRTRS 300

Query: 301 PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSC-RSLS 360
           PAESAYEALFEKWV PPLQLEQQ +DE+WLFG  ++Q GQ+  TNNKAFS VPSC RS S
Sbjct: 301 PAESAYEALFEKWVAPPLQLEQQTDDEDWLFGRTRKQDGQS--TNNKAFSSVPSCGRSSS 321

Query: 361 LWPRGQYLPDADVYSLPYTIPF 372
           LWPRGQYL DADVYSLPYTIPF
Sbjct: 361 LWPRGQYLADADVYSLPYTIPF 321

BLAST of Tan0007643 vs. ExPASy TrEMBL
Match: A0A6J1KC15 (uncharacterized protein LOC111492505 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111492505 PE=4 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 6.1e-107
Identity = 247/381 (64.83%), Postives = 278/381 (72.97%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKS
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKE-KSKDRKHKS 60

Query: 61  ---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSP 120
              KE KEKSSRSRDL+D+K     KEAKD  +  KVEAEQLEKSGLTEEHGQPVWP SP
Sbjct: 61  NERKESKEKSSRSRDLNDQKHKVCVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSP 120

Query: 121 GYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDS 180
           GYLSDG+QINHKRKRD SL P++G KPGK+IRIKL SSLS+QE  S G + TCS SGRD 
Sbjct: 121 GYLSDGTQINHKRKRDDSLQPDEGCKPGKVIRIKLASSLSQQENSSAGCELTCSVSGRDI 180

Query: 181 SLDQKRDENRRGPPIQQKPCLTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVS 240
           S DQK DEN     +++  C   ++TA +VKD   SKP IKDP  HAVKD  +       
Sbjct: 181 SRDQKSDEN--SSVVRRSTCFANSETARAVKDCTSSKPKIKDPPPHAVKDRTS------- 240

Query: 241 SSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS 300
                                     SKPKIK PS HAVK+I +  GNV S+   PRT+S
Sbjct: 241 --------------------------SKPKIKDPSPHAVKEISS-LGNVMSL---PRTRS 300

Query: 301 PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSL 360
           P ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS +PSCR+ SL
Sbjct: 301 PVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQD-GRSTKT-NEAFSSIPSCRNSSL 339

Query: 361 WPRGQYLPDADVYSLPYTIPF 372
           WPRGQYL  ADVYSLPYTIP+
Sbjct: 361 WPRGQYLAVADVYSLPYTIPY 339

BLAST of Tan0007643 vs. ExPASy TrEMBL
Match: A0A6J1CT76 (chromatin assembly factor 1 subunit A-like OS=Momordica charantia OX=3673 GN=LOC111013996 PE=4 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 5.1e-106
Identity = 247/380 (65.00%), Postives = 271/380 (71.32%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK--KSKDKKH 60
           MSRCFPYPPPGYA KVARTEAALIESIKL SER++S+ +RKKEKSKH+KE+  KSK+KK 
Sbjct: 1   MSRCFPYPPPGYAGKVARTEAALIESIKLQSERQQSKHDRKKEKSKHRKERSEKSKEKKQ 60

Query: 61  KSKEHKEKSSRSRDLSDRKRKE----AKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPG 120
           + KE KEKSS S DL+D+K+KE    A+D  K  KVEAEQLEKSGLTEEHGQPVWPQSPG
Sbjct: 61  RRKERKEKSSCSCDLNDQKQKECAKQAEDRLKGTKVEAEQLEKSGLTEEHGQPVWPQSPG 120

Query: 121 YLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSS 180
           YLSDG+QINHKRKRDA L PN+ SKPGKIIRIKL SSLS QE  S  +QQTCSTSGR   
Sbjct: 121 YLSDGTQINHKRKRDAKLQPNEDSKPGKIIRIKLASSLSNQEDSSADTQQTCSTSGRYDC 180

Query: 181 LDQKRDENRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSS 240
           +DQKRDEN  GP  QQKPC T ++T  +V++   KP IKD S                  
Sbjct: 181 VDQKRDENSCGPN-QQKPCFTNSNTVVAVEEAPPKPRIKDHSR----------------- 240

Query: 241 HAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA 300
                                           S HAVKDI   QGNV  V  P RT+SPA
Sbjct: 241 --------------------------------SVHAVKDI-RPQGNV--VPFPTRTRSPA 300

Query: 301 ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTK-TNNKAFSHVPSCRSLSLW 360
           ES YEALFEKW+PPPLQLEQQM+DEEWLFGTRKQ  GQTTK T NKAFS VPSCRS SLW
Sbjct: 301 ESEYEALFEKWIPPPLQLEQQMDDEEWLFGTRKQD-GQTTKATTNKAFSPVPSCRSSSLW 326

Query: 361 PRGQYLPDADVYSLPYTIPF 372
           PRGQYLPDADVYSLPYTIPF
Sbjct: 361 PRGQYLPDADVYSLPYTIPF 326

BLAST of Tan0007643 vs. ExPASy TrEMBL
Match: A0A6J1KAB9 (uncharacterized protein LOC111492505 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111492505 PE=4 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 2.4e-103
Identity = 244/381 (64.04%), Postives = 275/381 (72.18%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKS
Sbjct: 1   MSRCFPYPPPGYVRKVARTEAALIESIKLQSERRQHKTDSKKEKSKHKKE-KSKDRKHKS 60

Query: 61  ---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSP 120
              KE KEKSSRSRDL+D+K     KEAKD  +  KVEAEQLEKSGLTEEHGQPVWP SP
Sbjct: 61  NERKESKEKSSRSRDLNDQKHKVCVKEAKDRLEGTKVEAEQLEKSGLTEEHGQPVWPHSP 120

Query: 121 GYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDS 180
           GYLSDG+QINHKRKRD SL P++    GK+IRIKL SSLS+QE  S G + TCS SGRD 
Sbjct: 121 GYLSDGTQINHKRKRDDSLQPDE----GKVIRIKLASSLSQQENSSAGCELTCSVSGRDI 180

Query: 181 SLDQKRDENRRGPPIQQKPCLTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVS 240
           S DQK DEN     +++  C   ++TA +VKD   SKP IKDP  HAVKD  +       
Sbjct: 181 SRDQKSDEN--SSVVRRSTCFANSETARAVKDCTSSKPKIKDPPPHAVKDRTS------- 240

Query: 241 SSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS 300
                                     SKPKIK PS HAVK+I +  GNV S+   PRT+S
Sbjct: 241 --------------------------SKPKIKDPSPHAVKEISS-LGNVMSL---PRTRS 300

Query: 301 PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSL 360
           P ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS +PSCR+ SL
Sbjct: 301 PVESAYEALFEKWVPPPLQLEQQMDDEEWLFPTEKQD-GRSTKT-NEAFSSIPSCRNSSL 335

Query: 361 WPRGQYLPDADVYSLPYTIPF 372
           WPRGQYL  ADVYSLPYTIP+
Sbjct: 361 WPRGQYLAVADVYSLPYTIPY 335

BLAST of Tan0007643 vs. ExPASy TrEMBL
Match: A0A6J1KTG3 (uncharacterized protein LOC111496256 OS=Cucurbita maxima OX=3661 GN=LOC111496256 PE=4 SV=1)

HSP 1 Score: 372.9 bits (956), Expect = 1.6e-99
Identity = 232/375 (61.87%), Postives = 270/375 (72.00%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSRCFPYPPPGYARKVARTEAALIESIKL+SERRE  TERKKEKSKHKKE KSKDKKHKS
Sbjct: 1   MSRCFPYPPPGYARKVARTEAALIESIKLLSERREKGTERKKEKSKHKKE-KSKDKKHKS 60

Query: 61  KEHKEKSSRS-RDLSDRKRKEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPGYLSDG 120
           KEH++KSSRS RD +D+K+KE KDL +  KVEAEQLEKSGLTEEHGQPVWPQSPGYLSDG
Sbjct: 61  KEHRDKSSRSRRDSTDQKQKEVKDLLQGTKVEAEQLEKSGLTEEHGQPVWPQSPGYLSDG 120

Query: 121 SQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKR 180
           +Q NHKRKR ASL PN+  KPGK+IRIKL SSLS+QE  S GS+QTCST+GR +SL Q R
Sbjct: 121 TQSNHKRKRGASLQPNEDCKPGKVIRIKLASSLSQQEDSSAGSEQTCSTTGRRNSLHQTR 180

Query: 181 DENRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKD 240
           DEN    PI QK   T  D    V++    P+++  S  ++         K   +   + 
Sbjct: 181 DENSSRVPIVQKTSFTSLDQK-RVENSSRVPIVQKTSLTSLDQ-------KRDENSRSRP 240

Query: 241 PISKPKIEV-PSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAESAY 300
            + K  + +  +  AV+DPIS+P IK    HAV    TH           R + P+ESAY
Sbjct: 241 IVQKTSLTIADTAVAVQDPISEPNIKDLPLHAVDIGSTH-----------RKRKPSESAY 300

Query: 301 EALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQY 360
           E LF+KWVPP LQL QQ  DEEWLFGT+KQ   + TKT N+AFSH P CR  SLWPRGQ+
Sbjct: 301 EDLFDKWVPPTLQLGQQTVDEEWLFGTKKQD--ERTKT-NQAFSHAPICRRSSLWPRGQF 352

Query: 361 LPDADVYSLPYTIPF 372
           +P+ADVY LPYTIPF
Sbjct: 361 VPEADVYLLPYTIPF 352

BLAST of Tan0007643 vs. ExPASy TrEMBL
Match: A0A6J1GHA3 (uncharacterized protein LOC111454195 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454195 PE=4 SV=1)

HSP 1 Score: 367.1 bits (941), Expect = 8.8e-98
Identity = 235/382 (61.52%), Postives = 269/382 (70.42%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSRCFPYPPPGYARKVARTEAALIE IKL+SERRE  TERKKEKSKHKKE KSKDKKHKS
Sbjct: 1   MSRCFPYPPPGYARKVARTEAALIEPIKLLSERREKGTERKKEKSKHKKE-KSKDKKHKS 60

Query: 61  KEHKEKSSRS--RDLSDRKRKEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPGYLSD 120
           KEH++KSSRS  R  +D+K+KE KDL K  KVEAEQLEKSGLTEEHGQPVWPQSPGYLSD
Sbjct: 61  KEHRDKSSRSSRRYSNDQKQKEVKDLLKGTKVEAEQLEKSGLTEEHGQPVWPQSPGYLSD 120

Query: 121 GSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRD-SSLDQ 180
           G+Q NHKRKR AS+ PN+  KPGK+IRIKL SSLS+QE  S GS+QTCST+GR  + L Q
Sbjct: 121 GTQSNHKRKRGASIQPNEDCKPGKVIRIKLASSLSQQEDSSAGSEQTCSTTGRRCNPLHQ 180

Query: 181 KRDENRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAV 240
            RDEN    PI QK  LT  D              +D S   V  P+ K  +        
Sbjct: 181 TRDENSSRVPIVQKTSLTSLDQK------------RDESCRRV-PPVQKTSLTSLDQKRD 240

Query: 241 KDP-----ISKPKIEVPSLH-AVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTK 300
           ++      + K  + +P    AVKDPISKP IK    HAV DIGTH           R +
Sbjct: 241 ENSGRGLIVQKTSLTMPDTPVAVKDPISKPNIKDLPLHAV-DIGTH-----------RKR 300

Query: 301 SPAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLS 360
            P++SAYE LF+KWVPP LQL QQ ++EEWLFG +KQ   + TKT N+AFSH P CRS S
Sbjct: 301 KPSDSAYEDLFDKWVPPTLQLGQQTDNEEWLFGPKKQD--ERTKT-NQAFSHAPICRSSS 353

Query: 361 LWPRGQYLPDADVYSLPYTIPF 372
           LWPRGQ++P+ADVY LPYTIPF
Sbjct: 361 LWPRGQFVPEADVYMLPYTIPF 353

BLAST of Tan0007643 vs. TAIR 10
Match: AT1G20100.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75860.1); Has 471 Blast hits to 438 proteins in 92 species: Archae - 0; Bacteria - 14; Metazoa - 217; Fungi - 43; Plants - 91; Viruses - 1; Other Eukaryotes - 105 (source: NCBI BLink). )

HSP 1 Score: 71.6 bits (174), Expect = 1.5e-12
Identity = 111/380 (29.21%), Postives = 159/380 (41.84%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSR F  PPP YAR  A  +  L+E  K+     +S+   +KEK + KKEKK K K+ KS
Sbjct: 1   MSRYFTSPPPVYARNWANGQ-NLVEWTKIERPIVDSKKLHRKEKKEKKKEKKLK-KEKKS 60

Query: 61  KEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPGYLSDGS 120
            E K  ++                 K    E+EQLEKS LTEE  QP      GYLSDGS
Sbjct: 61  LEQKYSTT-----------------KTVSYESEQLEKSCLTEEFEQP----QVGYLSDGS 120

Query: 121 QINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDEN 180
           Q + KR+R+ S        P  +                                  +  
Sbjct: 121 QNSKKRRRETS--------PAVV----------------------------------ESQ 180

Query: 181 RRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPIS 240
            +  P+  KP          ++    KP  K+  +   +DP       V S+   + P  
Sbjct: 181 IKATPVAGKPL--------RIRIVFKKP--KEAEAVPQEDP-------VCSTSGTQRPSE 240

Query: 241 KP-KIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS---PAESAY 300
            P  + +PS       I    + VPS+          G VA +S   + K      ES Y
Sbjct: 241 LPSSVSLPS-------ICDHDVAVPST------SLESGKVAIISESKKRKKHKPSKESRY 285

Query: 301 EALFEKWVPPPLQLEQ-QMNDEEWLFGT-RKQQVG---QTTKTNNKAFSHVPSCRSLSLW 360
            +LF++ VPP + LE+   + ++WLFGT RK+ V     + KT+      + + R  S  
Sbjct: 301 NSLFDELVPPCISLEEDDSSSDDWLFGTSRKENVSSAKSSYKTDEDTIMSLQTSRDCSSL 285

Query: 361 PRGQYLPDADVYSLPYTIPF 372
           PR   L +  ++SLPYT+PF
Sbjct: 361 PRAMLLSEVGIFSLPYTVPF 285

BLAST of Tan0007643 vs. TAIR 10
Match: AT1G75860.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G20100.1); Has 258 Blast hits to 235 proteins in 58 species: Archae - 0; Bacteria - 4; Metazoa - 59; Fungi - 16; Plants - 90; Viruses - 0; Other Eukaryotes - 89 (source: NCBI BLink). )

HSP 1 Score: 56.2 bits (134), Expect = 6.4e-08
Identity = 101/379 (26.65%), Postives = 145/379 (38.26%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSR    PP  +AR     +  L+ES KL     +S+   + EK + K+++K K +  + 
Sbjct: 1   MSRVLTCPPLVFARNHVGVQ-NLVESTKLKRITLDSKKAHRIEKKEKKEKRKEKKETKRE 60

Query: 61  KEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPGYLSDGS 120
           K HK     S   +D   K      K+   E++ LEKSGLT+E  +P   +  GYLSDGS
Sbjct: 61  KSHK----HSIKATDNHHKLIFLPSKKVSDESDSLEKSGLTDELEEP--QKHLGYLSDGS 120

Query: 121 QINHKRKRDAS-----LLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQ 180
           Q + KR RD S      L       GK +RI++                        + +
Sbjct: 121 QNSKKRIRDDSPPAVESLIKAAPVAGKPLRIRM------------------------VFK 180

Query: 181 KRDENRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAV 240
           K  E     P +   C T    + S +D I+  +    +S   K+         S+S A 
Sbjct: 181 KPKEEVPTLPREAVVCSTTVAKSLSHQDVITSSISSSKTSELEKN-------LPSTSIAA 240

Query: 241 KDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAESA 300
            D   K K                                           + +S  E  
Sbjct: 241 IDETKKRK-------------------------------------------KHRSSKEDQ 297

Query: 301 YEALFEKWVPPPL---QLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWP 360
           Y ALF+ W PP +         N + WLFG + Q+V    K   K           S WP
Sbjct: 301 YNALFDGWTPPSMCIADASSNDNGDYWLFGNKTQEV-LKPKAAVKVDDDTMMRPGDSSWP 297

Query: 361 RGQYLPDADVYSLPYTIPF 372
           R Q+L +  +YSLPYT+PF
Sbjct: 361 RAQFLSEVGIYSLPYTVPF 297

BLAST of Tan0007643 vs. TAIR 10
Match: AT4G35940.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G17787.1). )

HSP 1 Score: 50.8 bits (120), Expect = 2.7e-06
Identity = 106/404 (26.24%), Postives = 167/404 (41.34%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSER---------RESRTERKKEKSKHKKEK 60
           MSRCFP+PPPGY     R EA ++ SIK V E+         R S  + KK+K + K++K
Sbjct: 1   MSRCFPFPPPGYVLNGIRDEAVIVSSIKGVEEKAKKEQRRKDRRSDKKDKKDKKERKEKK 60

Query: 61  KSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLP---KEAKVEAEQLEKSGLTEEHG--Q 120
           + K+KK K +E KE  S  R    R++++   +    K  + E   LEKS LT E    Q
Sbjct: 61  EKKEKKRKEREGKEVGSEKRSHKRRRKEDGAKVDLFHKLKESEVNCLEKSSLTVERELLQ 120

Query: 121 PVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSL------SRQESVGS 180
                S     + +++  K+K     L  + +      R++    L      + ++ V  
Sbjct: 121 STSQNSCDSTLNSNEMLPKQKEVQQPLDGRHNNNNNEKRVEKQQPLDGRHNNNNEKRVEK 180

Query: 181 QQTCSTSGRDSSLDQKRDE-----NRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDP-- 240
           QQ     GR ++ ++KR E     N R     +K         G   +   K + K    
Sbjct: 181 QQ--PLDGRHNNNNEKRIEKQQPLNGRHNNNNEKLMEKQQPLNGRHNNNNEKRIEKQQPL 240

Query: 241 -SSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDI 300
              H  K+   + +  +   H   D  S      P     KDPI + K       +    
Sbjct: 241 NGRHNNKEKQKEKQQPLDVRHNNND--SAEHASKPREEKRKDPIFRGKHGKEKISSSSTR 300

Query: 301 GTHQGNVASVSLPPRTKSPAESAYEALFEKWVPPPLQLEQQM---NDEEWLFGTRKQQVG 360
            T+Q   +  + PP         +  + E WVP  ++    +    DEE  +  +K    
Sbjct: 301 ETYQPPKSLCNCPP----SMVLQFLDVVENWVPNTIERRVDLINSEDEECWWSMKKPPSS 360

Query: 361 QT--TKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF 372
            T   K  N+  + +    +   WP  + LP+ADVY+LPYT+PF
Sbjct: 361 TTEICKQLNRE-NEIKQVGNTMGWPCARLLPEADVYALPYTVPF 395

BLAST of Tan0007643 vs. TAIR 10
Match: AT1G20100.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75860.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 48.1 bits (113), Expect = 1.7e-05
Identity = 66/175 (37.71%), Postives = 89/175 (50.86%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS 60
           MSR F  PPP YAR  A  +  L+E  K+     +S+   +KEK + KKEKK K K+ KS
Sbjct: 1   MSRYFTSPPPVYARNWANGQ-NLVEWTKIERPIVDSKKLHRKEKKEKKKEKKLK-KEKKS 60

Query: 61  KEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWPQSPGYLSDGS 120
            E K  ++                 K    E+EQLEKS LTEE  QP      GYLSDGS
Sbjct: 61  LEQKYSTT-----------------KTVSYESEQLEKSCLTEEFEQP----QVGYLSDGS 120

Query: 121 QINHKRKRDAS--LLPNQ---GSKPGKIIRIKLGSSLSRQESVGSQQ--TCSTSG 169
           Q + KR+R+ S  ++ +Q       GK +RI++     ++     Q+   CSTSG
Sbjct: 121 QNSKKRRRETSPAVVESQIKATPVAGKPLRIRIVFKKPKEAEAVPQEDPVCSTSG 152

BLAST of Tan0007643 vs. TAIR 10
Match: AT4G35940.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G17787.1); Has 45288 Blast hits to 24095 proteins in 1140 species: Archae - 93; Bacteria - 2895; Metazoa - 13424; Fungi - 2873; Plants - 1183; Viruses - 123; Other Eukaryotes - 24697 (source: NCBI BLink). )

HSP 1 Score: 47.0 bits (110), Expect = 3.9e-05
Identity = 115/462 (24.89%), Postives = 181/462 (39.18%), Query Frame = 0

Query: 1   MSRCFPYPPPGYARKVARTEAALIESIKLVSER---------RESRTERKKEKSKHKKEK 60
           MSRCFP+PPPGY     R EA ++ SIK V E+         R S  + KK+K + K++K
Sbjct: 1   MSRCFPFPPPGYVLNGIRDEAVIVSSIKGVEEKAKKEQRRKDRRSDKKDKKDKKERKEKK 60

Query: 61  KSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLP---KEAKVEAEQLEKSGLT------- 120
           + K+KK K +E KE  S  R    R++++   +    K  + E   LEKS LT       
Sbjct: 61  EKKEKKRKEREGKEVGSEKRSHKRRRKEDGAKVDLFHKLKESEVNCLEKSSLTVERELLQ 120

Query: 121 --------------------EEHGQP-------------VWPQSPGYLSDGSQINHKRKR 180
                               +E  QP             V  Q P    DG   N+  KR
Sbjct: 121 STSQNSCDSTLNSNEMLPKQKEVQQPLDGRHNNNNNEKRVEKQQP---LDGRHNNNNEKR 180

Query: 181 DASLLPNQG-SKPGKIIRIKLGSSLSRQESVGSQQTCS----TSGRDSSLDQKRDENR-- 240
                P  G        RI+    L+ + +  +++        +GR ++ ++KR E +  
Sbjct: 181 VEKQQPLDGRHNNNNEKRIEKQQPLNGRHNNNNEKLMEKQQPLNGRHNNNNEKRIEKQQP 240

Query: 241 -------------RGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKP--- 300
                        +  P+  +     +++   ++ PI +   KDP          KP   
Sbjct: 241 LNGRHNNKEKQKEKQQPLDVRHNNNDSESIIRIRLPIRRQ--KDPEVMMTNKDQEKPGPS 300

Query: 301 -KIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSL 360
             IK+ SS     P  +P  + P   +  +  SKP+ +       +  G H     S S 
Sbjct: 301 RGIKLDSSQL---PTREPVNQHPCSTSAAEHASKPREEKRKDPIFR--GKHGKEKISSSS 360

Query: 361 PPRTKSPAES----------AYEALFEKWVPPPLQLEQQM---NDEEWLFGTRKQQVGQT 372
              T  P +S           +  + E WVP  ++    +    DEE  +  +K     T
Sbjct: 361 TRETYQPPKSLCNCPPSMVLQFLDVVENWVPNTIERRVDLINSEDEECWWSMKKPPSSTT 420

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023545923.11.5e-10765.35uncharacterized protein LOC111805212 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022997629.11.3e-10664.83uncharacterized protein LOC111492505 isoform X1 [Cucurbita maxima][more]
XP_022144272.11.1e-10565.00chromatin assembly factor 1 subunit A-like [Momordica charantia][more]
KAG7029449.11.5e-10464.64hypothetical protein SDJN02_07788 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_038885448.13.4e-10463.87DNA ligase 1 isoform X1 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1KC156.1e-10764.83uncharacterized protein LOC111492505 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1CT765.1e-10665.00chromatin assembly factor 1 subunit A-like OS=Momordica charantia OX=3673 GN=LOC... [more]
A0A6J1KAB92.4e-10364.04uncharacterized protein LOC111492505 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1KTG31.6e-9961.87uncharacterized protein LOC111496256 OS=Cucurbita maxima OX=3661 GN=LOC111496256... [more]
A0A6J1GHA38.8e-9861.52uncharacterized protein LOC111454195 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
AT1G20100.11.5e-1229.21unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G75860.16.4e-0826.65unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G35940.22.7e-0626.24unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G20100.21.7e-0537.71unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G35940.13.9e-0524.89unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 150..170
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 40..64
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 65..100
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 28..189
NoneNo IPR availablePANTHERPTHR34660:SF7DNA LIGASEcoord: 1..371
NoneNo IPR availablePANTHERPTHR34660MYB-LIKE PROTEIN Xcoord: 1..371

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0007643.1Tan0007643.1mRNA