Tan0004231 (gene) Snake gourd v1

Overview
NameTan0004231
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionYqaJ domain-containing protein
LocationLG02: 91009375 .. 91011900 (-)
RNA-Seq ExpressionTan0004231
SyntenyTan0004231
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGCTCGCTGCGGTCTCCTTTTCTCCAGCTGGAGCGTCTCGAAGTTTTCTTCATGGAGGTTCCCCTTTCAATCGATTGCCGCGCGTCGCTTCATTTTCAACTCGTGAAGTTGATGCATTCAGCTCAACTTCTCTTTTGGGTACTTTATCGAACACCTTATCTGCTACTCGTTTCCTGCGATTATACGTGTTTTTATTCAATTTGGGGCTTAGACTTGGGTTCCCGTTTTAATATTCTGATGGATTTTCTTCTTCTACTTCACGAAGATAGGAAACAAATTTTTCTGCTTGGCATTGTTACTTTGTGCATCCTTGTGATAGTCATTTTATTTTCAATCTTCTGTTTTTGAAAGTTAATCTTGCTTTCAACAAATTATTCCTTCCAATTGCATGCCTAAAATCATTCCTAAAGTTCTATCCAAAGTTTTCTGGTGTTATAGAATCAGAAAGAAAATACTAAGGCTCCCGTTGATATACATTTAGTTTTTAGTTTTTAGTTTTTGAAATTTAAACCTAAAAATACTATTTCTACCAATGAGTTTACATGTCATCTTATCTACTTTGTACCTATATTTTCAAAAACCATACCAACTTTTGAAAACTAAAAAAAGTACCTTTCAAAAACTTGATTTTGTTTTTAGAATTTGGAAGTTTTTAGAATTTGGCTAAAAGTTCAAATGGTTACTTAAGGGAGATGATACTATTGTAGAGGAATTATGTGAAAATAAGCTAAACTTTCAAAAACTAAAAATAAAATGGTTATAAACGGGCCTAAGTATAGAGTTTATTTCTTGAAAAGAGAAAACTAAAAGCAAAAGCTATTTCTTTGTAAACTATTTCGTTCAACTGTACCAAGTCAAGATTAATTAAGAAAATTTTCGTTTGGTTGTTTTTGGGACCATTGATTGCTTAATGTTACTGAGGTTAATAGCTTGTTAATCCACATGGACCTCTGTAATCTTAATGTTAGATGCATACCCAAAAACAATTTTCTTAATTCAGCTGACTTAGCACGGTTGTGACTTGGAAGGTGAAATAACTAGAAGAGTGATCGATTTCCATTCCCTGGCTTCTTAAAAATAGGATACAGATTCCTTGAATTCATTCTGTCGAGGATGGATAAAGAATAATGTATATCTTGGATTTATCGTTTGTGAATTTGAAAATTTTAAAATCTGCACTATACTTGGAGAATTTCTTGTATTTGATAGAAAGTTGATTGCTTATAGCATCCAAAATTGAATTAAGGTCATTTCTTGGAGTGTTGTTCCTGATCAAGCTACTATAATAGAAGATTCCTTTCCTTGTTTTTAGATCATTTCCTTGCACACCAAGTCACTCGTCGCTTAAAAAAAAAGGTAGAAATTGATTCGTTTCTTTGGTAGGATAACAAGTTCATCATCAACACCACTACAACAATTGTTTGTATTTTCTTGAATGCTTAACCAGCAAAAACTGTCCATGCTTATTTTTCACTTCATCTATACATAAACATGCTACACATTTTGTACCTGAACTTTTAGCAGTCTGTGGGTTTTGCAGGACACTTCATCAAAGTAACTCTCCAATCGACATTGCCATTATGCCAACAATGAGCAACACCTCCATTGCTAGAATCTTCTGTAGCCACCCTAGATCAAATGCAAGGCTGTTCTCAAAACGAAAACATGGGAGTGGTTCAAGAACCCTTTCAACATGCACCTCACCATCTAGCTCCATAACCAACCCCCTGGTCATCCGTTTACCCTCAGCCTTGATTTTGGCTTCCCAGGTCACCTCTTCAGACGCCCCTCAACGTTCAGAAGAATGGTTTGCGCTACGGAGGGACAGGCTGACTACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGCCGCTTCGAGCTATGGCAAGAGAAAGTGTTTCCTTCAGAGATTCAAAAACCAGAAGCACGACAGCAGTATGCCATGGAGTGGGGTGTGCTCAACGAAGCAAATGCCATCGATCGGTATAAAAGCATCACAGGCCGAGATGTAAGCTTGTTAGGATTTGCAACTCACTCGGAGCAGCAATTCGACTGGCTAGGTGCCTCCCCCGACGGCCTATTGGGATGCTTTCAAGGAGGTGGGATCCTTGAAGTAAAATGTCCATACAACAAGGGAAAGCCTGAGAAGGGACTACCCTGGTCGACTATGCCTTTCTATTACATGCCACAGGTACAGGGCCAATTGGAGATAATGGACAGAGAGTGGGCGGATTTGTATTGCTGGACACCAAATGGAAGCACAATATTTCGCGTTTGTAGGGAACGTGGTTATTGGGAATTGATACGTGAAATGTTAAAGGAATTTTGGTGGGAAAATGTTGTTCCTGCAAAGGAGGCTCTATCATTGGGAAAAGAGGAAGAGGTCAAGTCATATAAGCCAACATCCACACACAAACAGACTGGACTAGCAATTGCTAAGAGCATCAAGTTAGCAAGCGAGGCCAAATTGTTGTGTAGGGAAATTGCTGGGCATGTTGAATTCTACCGATGA

mRNA sequence

ATGAAGCTCGCTGCGGTCTCCTTTTCTCCAGCTGGAGCGTCTCGAAGTTTTCTTCATGGAGGTTCCCCTTTCAATCGATTGCCGCGCGTCGCTTCATTTTCAACTCGTGAAGTTGATGCATTCAGCTCAACTTCTCTTTTGGTCTGTGGGTTTTGCAGGACACTTCATCAAAGTAACTCTCCAATCGACATTGCCATTATGCCAACAATGAGCAACACCTCCATTGCTAGAATCTTCTGTAGCCACCCTAGATCAAATGCAAGGCTGTTCTCAAAACGAAAACATGGGAGTGGTTCAAGAACCCTTTCAACATGCACCTCACCATCTAGCTCCATAACCAACCCCCTGGTCATCCGTTTACCCTCAGCCTTGATTTTGGCTTCCCAGGTCACCTCTTCAGACGCCCCTCAACGTTCAGAAGAATGGTTTGCGCTACGGAGGGACAGGCTGACTACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGCCGCTTCGAGCTATGGCAAGAGAAAGTGTTTCCTTCAGAGATTCAAAAACCAGAAGCACGACAGCAGTATGCCATGGAGTGGGGTGTGCTCAACGAAGCAAATGCCATCGATCGGTATAAAAGCATCACAGGCCGAGATGTAAGCTTGTTAGGATTTGCAACTCACTCGGAGCAGCAATTCGACTGGCTAGGTGCCTCCCCCGACGGCCTATTGGGATGCTTTCAAGGAGGTGGGATCCTTGAAGTAAAATGTCCATACAACAAGGGAAAGCCTGAGAAGGGACTACCCTGGTCGACTATGCCTTTCTATTACATGCCACAGGTACAGGGCCAATTGGAGATAATGGACAGAGAGTGGGCGGATTTGTATTGCTGGACACCAAATGGAAGCACAATATTTCGCGTTTGTAGGGAACGTGGTTATTGGGAATTGATACGTGAAATGTTAAAGGAATTTTGGTGGGAAAATGTTGTTCCTGCAAAGGAGGCTCTATCATTGGGAAAAGAGGAAGAGGTCAAGTCATATAAGCCAACATCCACACACAAACAGACTGGACTAGCAATTGCTAAGAGCATCAAGTTAGCAAGCGAGGCCAAATTGTTGTGTAGGGAAATTGCTGGGCATGTTGAATTCTACCGATGA

Coding sequence (CDS)

ATGAAGCTCGCTGCGGTCTCCTTTTCTCCAGCTGGAGCGTCTCGAAGTTTTCTTCATGGAGGTTCCCCTTTCAATCGATTGCCGCGCGTCGCTTCATTTTCAACTCGTGAAGTTGATGCATTCAGCTCAACTTCTCTTTTGGTCTGTGGGTTTTGCAGGACACTTCATCAAAGTAACTCTCCAATCGACATTGCCATTATGCCAACAATGAGCAACACCTCCATTGCTAGAATCTTCTGTAGCCACCCTAGATCAAATGCAAGGCTGTTCTCAAAACGAAAACATGGGAGTGGTTCAAGAACCCTTTCAACATGCACCTCACCATCTAGCTCCATAACCAACCCCCTGGTCATCCGTTTACCCTCAGCCTTGATTTTGGCTTCCCAGGTCACCTCTTCAGACGCCCCTCAACGTTCAGAAGAATGGTTTGCGCTACGGAGGGACAGGCTGACTACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGCCGCTTCGAGCTATGGCAAGAGAAAGTGTTTCCTTCAGAGATTCAAAAACCAGAAGCACGACAGCAGTATGCCATGGAGTGGGGTGTGCTCAACGAAGCAAATGCCATCGATCGGTATAAAAGCATCACAGGCCGAGATGTAAGCTTGTTAGGATTTGCAACTCACTCGGAGCAGCAATTCGACTGGCTAGGTGCCTCCCCCGACGGCCTATTGGGATGCTTTCAAGGAGGTGGGATCCTTGAAGTAAAATGTCCATACAACAAGGGAAAGCCTGAGAAGGGACTACCCTGGTCGACTATGCCTTTCTATTACATGCCACAGGTACAGGGCCAATTGGAGATAATGGACAGAGAGTGGGCGGATTTGTATTGCTGGACACCAAATGGAAGCACAATATTTCGCGTTTGTAGGGAACGTGGTTATTGGGAATTGATACGTGAAATGTTAAAGGAATTTTGGTGGGAAAATGTTGTTCCTGCAAAGGAGGCTCTATCATTGGGAAAAGAGGAAGAGGTCAAGTCATATAAGCCAACATCCACACACAAACAGACTGGACTAGCAATTGCTAAGAGCATCAAGTTAGCAAGCGAGGCCAAATTGTTGTGTAGGGAAATTGCTGGGCATGTTGAATTCTACCGATGA

Protein sequence

MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
Homology
BLAST of Tan0004231 vs. NCBI nr
Match: XP_038884042.1 (uncharacterized protein LOC120074988 isoform X1 [Benincasa hispida])

HSP 1 Score: 696.0 bits (1795), Expect = 1.7e-196
Identity = 339/379 (89.45%), Postives = 354/379 (93.40%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MKLAAVSFS AGAS+  LHGGS FNR PRVASF+ R+VDAFSSTSLLVCG CRTLH SNS
Sbjct: 1   MKLAAVSFSRAGASQRLLHGGSSFNRFPRVASFAARQVDAFSSTSLLVCGLCRTLHHSNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            ++ AIM TM+NTSI+RI C H R NARL  +RKHGSGSRT STC SPSSSITNPLVIRL
Sbjct: 61  SVETAIMSTMNNTSISRICCRHSRMNARLL-RRKHGSGSRTFSTCASPSSSITNPLVIRL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PSALILASQVT SDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELW EKVFP EI
Sbjct: 121 PSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPPEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           QKPEA QQ AMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF
Sbjct: 181 QKPEAPQQNAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQ+EIMDREWADLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQMEIMDREWADLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYW+LIREML+EFWWENVVPA+EAL LG+EE+VKSYKPTSTHKQTGLAIAKSIKL
Sbjct: 301 VCRERGYWDLIREMLREFWWENVVPAREALLLGREEQVKSYKPTSTHKQTGLAIAKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           ASEAKLLCREIAGH+EFYR
Sbjct: 361 ASEAKLLCREIAGHIEFYR 378

BLAST of Tan0004231 vs. NCBI nr
Match: XP_023537251.1 (uncharacterized protein LOC111798383 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 684.9 bits (1766), Expect = 4.0e-193
Identity = 333/379 (87.86%), Postives = 350/379 (92.35%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MKLAAVSFS +GASRSFLH  S FNRLPRVASFS R+VDAFSSTSL VCGFCRT HQSNS
Sbjct: 1   MKLAAVSFSQSGASRSFLHADSSFNRLPRVASFSARQVDAFSSTSLSVCGFCRTPHQSNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            ID A++ TMSNTSIARI C HPRSNARLFSKRK  +GSRT ST  SP  S+TNPL+IRL
Sbjct: 61  SIDTALLSTMSNTSIARICCRHPRSNARLFSKRKQWNGSRTFSTSISPPKSVTNPLLIRL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSEI
Sbjct: 121 PSALIVASQVTPSDAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRFELWHEKVFPSEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           +KPEARQQYAMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCF
Sbjct: 181 KKPEARQQYAMEWGVLNEENAIHRYKSITGRDVSFLGFATHSEQQLDWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYWELI EML+EFWWENVVPA+EA S G+E+EV+SYKPTSTHK TG+AIAKSIKL
Sbjct: 301 VCRERGYWELIHEMLREFWWENVVPAREASSSGREKEVESYKPTSTHKLTGVAIAKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           AS+AKLLCREIAGHVEFYR
Sbjct: 361 ASDAKLLCREIAGHVEFYR 379

BLAST of Tan0004231 vs. NCBI nr
Match: KAG7020121.1 (hypothetical protein SDJN02_16803 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 680.6 bits (1755), Expect = 7.6e-192
Identity = 332/379 (87.60%), Postives = 350/379 (92.35%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MKLAAVSFS +GASRSFLH  S FNRLP VASFS R+VDAFSSTSL VCGFCRT HQSNS
Sbjct: 1   MKLAAVSFSQSGASRSFLHADSSFNRLPCVASFSARQVDAFSSTSLSVCGFCRTPHQSNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            I+ A++ TMSNTSIARI C  PRSNARLFSKRK  +GSRT ST  SP  S+TNPL+IRL
Sbjct: 61  SINTALLSTMSNTSIARICCRLPRSNARLFSKRKQWNGSRTFSTSISPPKSVTNPLLIRL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSEI
Sbjct: 121 PSALIVASQVTPSDAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRFELWHEKVFPSEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           +KPEARQQYAMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCF
Sbjct: 181 KKPEARQQYAMEWGVLNEENAIHRYKSITGRDVSFLGFATHSEQQLDWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYWELI EML+EFWWENVVPA+EALS G+E+EV+SYKPTSTHKQTG+AIAKSIKL
Sbjct: 301 VCRERGYWELIHEMLREFWWENVVPAREALSSGREKEVESYKPTSTHKQTGVAIAKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           AS+AKLLCREIAGHVEFYR
Sbjct: 361 ASDAKLLCREIAGHVEFYR 379

BLAST of Tan0004231 vs. NCBI nr
Match: XP_023001988.1 (uncharacterized protein LOC111496007 isoform X1 [Cucurbita maxima])

HSP 1 Score: 680.2 bits (1754), Expect = 9.9e-192
Identity = 331/379 (87.34%), Postives = 351/379 (92.61%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MKLAAVSFS + ASRSFLH  S FNRLPRVASFS  +VDAFSSTSL VC FCRT HQSNS
Sbjct: 1   MKLAAVSFSQSVASRSFLHADSSFNRLPRVASFSPPQVDAFSSTSLSVCWFCRTPHQSNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            ID A++ TMSNTSIARI C+HPRSNARLFSKRK  +GSRT ST  SP  S+TNPL+IRL
Sbjct: 61  SIDTALLSTMSNTSIARICCTHPRSNARLFSKRKQWNGSRTFSTSISPPKSVTNPLLIRL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKG+RRFELW EKVFPSEI
Sbjct: 121 PSALIVASQVTPSDAPQRSEEWFALRRDKLTTSTFSTALGFWKGSRRFELWHEKVFPSEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           +KPEARQQYAMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ +WLGASPDGLLGCF
Sbjct: 181 KKPEARQQYAMEWGVLNEENAIHRYKSITGRDVSFLGFATHSEQQLEWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYWELI EML+EFWWENVVPA+EALSLG+EEEV+SYKPTSTHKQTG+AI+KSIKL
Sbjct: 301 VCRERGYWELIHEMLREFWWENVVPAREALSLGREEEVESYKPTSTHKQTGVAISKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           AS+AKLLCREIAGHVEFYR
Sbjct: 361 ASDAKLLCREIAGHVEFYR 379

BLAST of Tan0004231 vs. NCBI nr
Match: XP_022951164.1 (uncharacterized protein LOC111454092 isoform X1 [Cucurbita moschata])

HSP 1 Score: 678.3 bits (1749), Expect = 3.8e-191
Identity = 331/379 (87.34%), Postives = 349/379 (92.08%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MKLAAVSFS +GASRSFLH  S FNRLP VASFS R+VDAFSSTS  VCGFCRT HQSNS
Sbjct: 1   MKLAAVSFSQSGASRSFLHADSSFNRLPCVASFSARQVDAFSSTSRSVCGFCRTPHQSNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            I+ A++ TMSNTSIARI C  PRSNARLFSKRK  +GSRT ST  SP  S+TNPL+IRL
Sbjct: 61  SINTALLSTMSNTSIARICCRLPRSNARLFSKRKQWNGSRTFSTSISPPKSVTNPLLIRL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSEI
Sbjct: 121 PSALIVASQVTPSDAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRFELWHEKVFPSEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           +KPEARQQYAMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCF
Sbjct: 181 KKPEARQQYAMEWGVLNEENAIHRYKSITGRDVSFLGFATHSEQQLDWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYWELI EML+EFWWENVVPA+EALS G+E+EV+SYKPTSTHKQTG+AIAKSIKL
Sbjct: 301 VCRERGYWELIHEMLREFWWENVVPAREALSSGREKEVESYKPTSTHKQTGVAIAKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           AS+AKLLCREIAGHVEFYR
Sbjct: 361 ASDAKLLCREIAGHVEFYR 379

BLAST of Tan0004231 vs. ExPASy TrEMBL
Match: A0A6J1KS64 (uncharacterized protein LOC111496007 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111496007 PE=4 SV=1)

HSP 1 Score: 680.2 bits (1754), Expect = 4.8e-192
Identity = 331/379 (87.34%), Postives = 351/379 (92.61%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MKLAAVSFS + ASRSFLH  S FNRLPRVASFS  +VDAFSSTSL VC FCRT HQSNS
Sbjct: 1   MKLAAVSFSQSVASRSFLHADSSFNRLPRVASFSPPQVDAFSSTSLSVCWFCRTPHQSNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            ID A++ TMSNTSIARI C+HPRSNARLFSKRK  +GSRT ST  SP  S+TNPL+IRL
Sbjct: 61  SIDTALLSTMSNTSIARICCTHPRSNARLFSKRKQWNGSRTFSTSISPPKSVTNPLLIRL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKG+RRFELW EKVFPSEI
Sbjct: 121 PSALIVASQVTPSDAPQRSEEWFALRRDKLTTSTFSTALGFWKGSRRFELWHEKVFPSEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           +KPEARQQYAMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ +WLGASPDGLLGCF
Sbjct: 181 KKPEARQQYAMEWGVLNEENAIHRYKSITGRDVSFLGFATHSEQQLEWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYWELI EML+EFWWENVVPA+EALSLG+EEEV+SYKPTSTHKQTG+AI+KSIKL
Sbjct: 301 VCRERGYWELIHEMLREFWWENVVPAREALSLGREEEVESYKPTSTHKQTGVAISKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           AS+AKLLCREIAGHVEFYR
Sbjct: 361 ASDAKLLCREIAGHVEFYR 379

BLAST of Tan0004231 vs. ExPASy TrEMBL
Match: A0A6J1GHX8 (uncharacterized protein LOC111454092 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454092 PE=4 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 1.8e-191
Identity = 331/379 (87.34%), Postives = 349/379 (92.08%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MKLAAVSFS +GASRSFLH  S FNRLP VASFS R+VDAFSSTS  VCGFCRT HQSNS
Sbjct: 1   MKLAAVSFSQSGASRSFLHADSSFNRLPCVASFSARQVDAFSSTSRSVCGFCRTPHQSNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            I+ A++ TMSNTSIARI C  PRSNARLFSKRK  +GSRT ST  SP  S+TNPL+IRL
Sbjct: 61  SINTALLSTMSNTSIARICCRLPRSNARLFSKRKQWNGSRTFSTSISPPKSVTNPLLIRL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSEI
Sbjct: 121 PSALIVASQVTPSDAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRFELWHEKVFPSEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           +KPEARQQYAMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCF
Sbjct: 181 KKPEARQQYAMEWGVLNEENAIHRYKSITGRDVSFLGFATHSEQQLDWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYWELI EML+EFWWENVVPA+EALS G+E+EV+SYKPTSTHKQTG+AIAKSIKL
Sbjct: 301 VCRERGYWELIHEMLREFWWENVVPAREALSSGREKEVESYKPTSTHKQTGVAIAKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           AS+AKLLCREIAGHVEFYR
Sbjct: 361 ASDAKLLCREIAGHVEFYR 379

BLAST of Tan0004231 vs. ExPASy TrEMBL
Match: A0A6J1BPX6 (uncharacterized protein LOC111004694 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111004694 PE=4 SV=1)

HSP 1 Score: 674.9 bits (1740), Expect = 2.0e-190
Identity = 333/379 (87.86%), Postives = 344/379 (90.77%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MKLAAVSFS +GASR FLHGG  FNRLPRVAS S   VDAF STSLLVCGFCRTLHQ NS
Sbjct: 1   MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            I+   M TMS TSI+RI C HP  NARL SKRKHGSGSRT STCTS SSS TNPLV R 
Sbjct: 61  SIN-TTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRF 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PSALILASQVT SDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELW EKVFP EI
Sbjct: 121 PSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           QK EA Q+ AMEWGVLNEA AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF
Sbjct: 181 QKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYWEL+REML+EFWWENVVPA+EALSLG+EEEVKSYKPTSTHKQTGLAIAKSIKL
Sbjct: 301 VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           ASEAKL+ REIAGHVEFYR
Sbjct: 361 ASEAKLMFREIAGHVEFYR 378

BLAST of Tan0004231 vs. ExPASy TrEMBL
Match: A0A1S3B9S9 (uncharacterized protein LOC103487752 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103487752 PE=4 SV=1)

HSP 1 Score: 636.0 bits (1639), Expect = 1.0e-178
Identity = 316/379 (83.38%), Postives = 335/379 (88.39%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MK AAVSFS +GASRS  HGGS FN+LP VASFS R+    +S SLLVCG CRTL QSNS
Sbjct: 1   MKFAAVSFSQSGASRSLFHGGSSFNQLPPVASFSARKF-PLNSDSLLVCGLCRTLCQSNS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            ++IAIM TM+N SIARI C   R NA+L+ KR  G  SR+ STC +PSS  TNP VI L
Sbjct: 61  -VEIAIMSTMNNISIARICCRDSRKNAKLYLKRNRGIASRSFSTCATPSSYTTNPPVIWL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PS LILASQV  S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSE 
Sbjct: 121 PSPLILASQVNQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRFELWHEKVFPSET 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           QK +A QQ AMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF
Sbjct: 181 QKTDAPQQNAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQ+EIM REW+DLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQMEIMGREWSDLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYW+LIRE+LKEFWWENVVPAKEALSLG+EE+ KSYKPTSTHKQTGLAIAKSIKL
Sbjct: 301 VCRERGYWDLIREILKEFWWENVVPAKEALSLGREEQAKSYKPTSTHKQTGLAIAKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           ASEAKLLCREIAGHVEFYR
Sbjct: 361 ASEAKLLCREIAGHVEFYR 377

BLAST of Tan0004231 vs. ExPASy TrEMBL
Match: A0A0A0LNG4 (YqaJ domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G350400 PE=4 SV=1)

HSP 1 Score: 634.4 bits (1635), Expect = 3.0e-178
Identity = 311/379 (82.06%), Postives = 329/379 (86.81%), Query Frame = 0

Query: 1   MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNS 60
           MK AAVSFS +GASRS LHGGS FN+L  VAS S R+  +F+S SLLVCG CRTL QS+S
Sbjct: 1   MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSSS 60

Query: 61  PIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRL 120
            ++ AIM TM+N SIARI C H R NARL+ KR H   SR  STC SPSSS  NPLVI L
Sbjct: 61  LVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWL 120

Query: 121 PSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEI 180
           PS L+LASQ   S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRR ELW EKVFPSEI
Sbjct: 121 PSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEI 180

Query: 181 QKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF 240
           QK EA QQ AMEWGVLNE NAIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CF
Sbjct: 181 QKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECF 240

Query: 241 QGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR 300
           QGGGILEVKCPYNKGKPEKGLPWST+PFYYMPQVQGQ+EIM REWADLYCWTPNGSTIFR
Sbjct: 241 QGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR 300

Query: 301 VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKL 360
           VCRERGYW+LIRE+L+EFWWENVVPAKEAL LG EE+ KSYKPTSTHKQTGLAIAKSIKL
Sbjct: 301 VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKL 360

Query: 361 ASEAKLLCREIAGHVEFYR 380
           ASEAKL CREIAGHVEFYR
Sbjct: 361 ASEAKLFCREIAGHVEFYR 379

BLAST of Tan0004231 vs. TAIR 10
Match: AT1G67660.3 (Restriction endonuclease, type II-like superfamily protein )

HSP 1 Score: 385.2 bits (988), Expect = 6.1e-107
Identity = 190/336 (56.55%), Postives = 246/336 (73.21%), Query Frame = 0

Query: 44  TSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLS 103
           T + VC  CR L  +   ++  I+  M   SI+      P+S+  + S+++    S  LS
Sbjct: 10  TFVTVC-VCRVLKPNKVALNSMILSAMRTCSISGFHTHLPKSSGSVSSRKRF--SSTALS 69

Query: 104 TCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWK 163
             T   S   +P      S++I++S ++ SD PQ+SEEWFALR+D+LTTSTFSTALGFWK
Sbjct: 70  LITQTISPFAHP-----RSSVIVSSLLSPSDIPQKSEEWFALRKDKLTTSTFSTALGFWK 129

Query: 164 GNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSE 223
           GNRR ELW EKV+ S+ +  E   ++AM WGV  E++AI+RYK I G +V  +GFA HS 
Sbjct: 130 GNRRAELWHEKVYDSDARVVEESARFAMNWGVQMESSAIERYKRIMGCEVGTMGFAIHSN 189

Query: 224 QQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDR 283
           ++F WLGASPDG+L CF   GILEVKCPYNKGK E  LPW  +P+YYMPQ+QGQ+EIMDR
Sbjct: 190 EEFHWLGASPDGILDCF---GILEVKCPYNKGKTETVLPWKKVPYYYMPQLQGQMEIMDR 249

Query: 284 EWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYK 343
           EW +LYCWT NGST+FRV R+R YW +I ++L+EFWWE+V+PA+EAL LGKE EEVK Y+
Sbjct: 250 EWVNLYCWTRNGSTVFRVMRDRSYWRIIHDVLREFWWESVIPAREALLLGKEDEEVKKYE 309

Query: 344 PTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFY 379
           PTSTHK+T LAIAKS+ LA+E+KL+CREIA HVEF+
Sbjct: 310 PTSTHKRTKLAIAKSLNLAAESKLVCREIADHVEFF 334

BLAST of Tan0004231 vs. TAIR 10
Match: AT1G67660.1 (Restriction endonuclease, type II-like superfamily protein )

HSP 1 Score: 384.8 bits (987), Expect = 8.0e-107
Identity = 187/328 (57.01%), Postives = 242/328 (73.78%), Query Frame = 0

Query: 52  CRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSS 111
           CR L  +   ++  I+  M   SI+      P+S+  + S+++    S  LS  T   S 
Sbjct: 38  CRVLKPNKVALNSMILSAMRTCSISGFHTHLPKSSGSVSSRKRF--SSTALSLITQTISP 97

Query: 112 ITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELW 171
             +P      S++I++S ++ SD PQ+SEEWFALR+D+LTTSTFSTALGFWKGNRR ELW
Sbjct: 98  FAHP-----RSSVIVSSLLSPSDIPQKSEEWFALRKDKLTTSTFSTALGFWKGNRRAELW 157

Query: 172 QEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGA 231
            EKV+ S+ +  E   ++AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGA
Sbjct: 158 HEKVYDSDARVVEESARFAMNWGVQMESSAIERYKRIMGCEVGTMGFAIHSNEEFHWLGA 217

Query: 232 SPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCW 291
           SPDG+L CF   GILEVKCPYNKGK E  LPW  +P+YYMPQ+QGQ+EIMDREW +LYCW
Sbjct: 218 SPDGILDCF---GILEVKCPYNKGKTETVLPWKKVPYYYMPQLQGQMEIMDREWVNLYCW 277

Query: 292 TPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYKPTSTHKQT 351
           T NGST+FRV R+R YW +I ++L+EFWWE+V+PA+EAL LGKE EEVK Y+PTSTHK+T
Sbjct: 278 TRNGSTVFRVMRDRSYWRIIHDVLREFWWESVIPAREALLLGKEDEEVKKYEPTSTHKRT 337

Query: 352 GLAIAKSIKLASEAKLLCREIAGHVEFY 379
            LAIAKS+ LA+E+KL+CREIA HVEF+
Sbjct: 338 KLAIAKSLNLAAESKLVCREIADHVEFF 355

BLAST of Tan0004231 vs. TAIR 10
Match: AT1G67660.2 (Restriction endonuclease, type II-like superfamily protein )

HSP 1 Score: 379.0 bits (972), Expect = 4.4e-105
Identity = 184/314 (58.60%), Postives = 236/314 (75.16%), Query Frame = 0

Query: 66  IMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRLPSALI 125
           I+  M   SI+      P+S+  + S+++    S  LS  T   S   +P      S++I
Sbjct: 2   ILSAMRTCSISGFHTHLPKSSGSVSSRKRF--SSTALSLITQTISPFAHP-----RSSVI 61

Query: 126 LASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEA 185
           ++S ++ SD PQ+SEEWFALR+D+LTTSTFSTALGFWKGNRR ELW EKV+ S+ +  E 
Sbjct: 62  VSSLLSPSDIPQKSEEWFALRKDKLTTSTFSTALGFWKGNRRAELWHEKVYDSDARVVEE 121

Query: 186 RQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGI 245
             ++AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GI
Sbjct: 122 SARFAMNWGVQMESSAIERYKRIMGCEVGTMGFAIHSNEEFHWLGASPDGILDCF---GI 181

Query: 246 LEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRER 305
           LEVKCPYNKGK E  LPW  +P+YYMPQ+QGQ+EIMDREW +LYCWT NGST+FRV R+R
Sbjct: 182 LEVKCPYNKGKTETVLPWKKVPYYYMPQLQGQMEIMDREWVNLYCWTRNGSTVFRVMRDR 241

Query: 306 GYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYKPTSTHKQTGLAIAKSIKLASEA 365
            YW +I ++L+EFWWE+V+PA+EAL LGKE EEVK Y+PTSTHK+T LAIAKS+ LA+E+
Sbjct: 242 SYWRIIHDVLREFWWESVIPAREALLLGKEDEEVKKYEPTSTHKRTKLAIAKSLNLAAES 301

Query: 366 KLLCREIAGHVEFY 379
           KL+CREIA HVEF+
Sbjct: 302 KLVCREIADHVEFF 305

BLAST of Tan0004231 vs. TAIR 10
Match: AT1G13810.1 (Restriction endonuclease, type II-like superfamily protein )

HSP 1 Score: 170.2 bits (430), Expect = 3.1e-42
Identity = 88/247 (35.63%), Postives = 144/247 (58.30%), Query Frame = 0

Query: 140 EEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEA 199
           + W  LR++RLT S F+ A+GF    RR  LW EK+  +   KP A  + A  W + NE 
Sbjct: 60  KNWEDLRKNRLTASNFARAIGFSPDGRR-NLWLEKIGAA---KPFAGNR-ATFWDIENEV 119

Query: 200 NAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGLLGCFQGG----GILEVKCPY 259
            A++RY  +TG ++ +  F  +      + +WLGASPDG++   + G    G+LEVKCP+
Sbjct: 120 EALERYNELTGNEILIPEFVVYKNGESPEENWLGASPDGVINVVKDGVTSCGVLEVKCPF 179

Query: 260 NKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIR 319
           +     K  PW  +P+  +PQ+QG +EI+D +W DLYCWT NGS++FRV R+  +WE ++
Sbjct: 180 DNRDNSKVYPWKKVPYNCVPQLQGLMEIVDTDWLDLYCWTRNGSSLFRVWRDTAFWEDMK 239

Query: 320 EMLKEFWWENVVPAKEALS----LGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLC 376
             L +FW  +V+PA+E  +       + +++ +KP   H+     +  + ++++ A  L 
Sbjct: 240 PALFDFWQNHVLPAREIYNNFDIKDPQVKLREFKPKHWHEDCKKIMRGAERISANANRLF 299

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_038884042.11.7e-19689.45uncharacterized protein LOC120074988 isoform X1 [Benincasa hispida][more]
XP_023537251.14.0e-19387.86uncharacterized protein LOC111798383 isoform X1 [Cucurbita pepo subsp. pepo][more]
KAG7020121.17.6e-19287.60hypothetical protein SDJN02_16803 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_023001988.19.9e-19287.34uncharacterized protein LOC111496007 isoform X1 [Cucurbita maxima][more]
XP_022951164.13.8e-19187.34uncharacterized protein LOC111454092 isoform X1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1KS644.8e-19287.34uncharacterized protein LOC111496007 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1GHX81.8e-19187.34uncharacterized protein LOC111454092 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1BPX62.0e-19087.86uncharacterized protein LOC111004694 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A1S3B9S91.0e-17883.38uncharacterized protein LOC103487752 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A0A0LNG43.0e-17882.06YqaJ domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G350400 PE=4 S... [more]
Match NameE-valueIdentityDescription
AT1G67660.36.1e-10756.55Restriction endonuclease, type II-like superfamily protein [more]
AT1G67660.18.0e-10757.01Restriction endonuclease, type II-like superfamily protein [more]
AT1G67660.24.4e-10558.60Restriction endonuclease, type II-like superfamily protein [more]
AT1G13810.13.1e-4235.63Restriction endonuclease, type II-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR017482Putative phage-type endonucleaseTIGRFAMTIGR03033TIGR03033coord: 136..293
e-value: 1.3E-21
score: 75.2
IPR019080YqaJ viral recombinasePFAMPF09588YqaJcoord: 141..283
e-value: 2.3E-19
score: 70.1
IPR011604Exonuclease, phage-type/RecB, C-terminalGENE3D3.90.320.10coord: 122..329
e-value: 1.1E-49
score: 170.7
NoneNo IPR availablePANTHERPTHR46609:SF6RESTRICTION ENDONUCLEASE, TYPE II-LIKE SUPERFAMILY PROTEINcoord: 71..378
NoneNo IPR availablePANTHERPTHR46609EXONUCLEASE, PHAGE-TYPE/RECB, C-TERMINAL DOMAIN-CONTAINING PROTEINcoord: 71..378
IPR011335Restriction endonuclease type II-likeSUPERFAMILY52980Restriction endonuclease-likecoord: 133..334

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0004231.1Tan0004231.1mRNA