Tan0004244 (gene) Snake gourd v1

Overview
NameTan0004244
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG03: 6046154 .. 6051122 (-)
RNA-Seq ExpressionTan0004244
SyntenyTan0004244
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGCTACCACATGAGAGGAAGTAATTCGGGTCTGCAGCTTCTTGGGCGGTCTGTGCCCTTCCCTTCTCTTTGGTTTGCTTCAAATTTTCCTTCAACGGCAATGACAAATCACTTGGCCTTTCAGCTTTCCATTTCCTCAACCAAAACCTTCATCATCCATGGCTTTGCCGCCACTCATAAAGAGAAAGCACTTCCATCCATCTTTTCTGCTTCACCCTTCAAACCCTCACCGATGAATTTCAAATCCAACAACCCAACAACAGTTACAATCACAACCCCAATGCAATTCAATGCAAGTGAGCCTCTTTTCTTCTCGTCTACAAATTTCTTCTTTACCATTTTGTTTTCTGTTCTGTTTTGACGGCTTCTGAGTTTACATTCGTTTGTTTATATCTTTAAAACCCTAAGAAATGAGTCCAAAGGCACACACACGAAGTCTTTTTCAAACGGTAAACCATCGAAATAAATGGCTAGGAATTAAGTCGTGTGTGTATGTGTGTGCTGCTTCCTGGGTTTTGGGAGAATTCCAAGGATTCTGATAGAAATGATATTAGGATAGAATTAAGAATATTATTAAGGTATTAAGGGTATATTAACAATTAGACAGGGAAGTTAATGGTGTTTTGTTATAGATAGAATGAGTATGAAAGGATGAATGTATGGTATGCATTGGTTTAGTTGACTAAGGTGTGAATGAGATTACTGAAGGGGTGGTAGGTTCTAAGTACCTGTTCTTCTATCCCTTATATTTTAGTATCATTCTTGTTGTTTGGATCTATCAGATTCATTGCCATTTGAGAGCAAAAAATTGCTTTACAGGTGCACAAGCAAATGATCTAGTTACAACTGGAATGGAAGAGCAAGCAGAGATGGAAGTTGCAGAGGGATATACCATTTCTCAATTTTGTGATAAAATAATTGACATTTTCTTGAATCAGAAGCCCAAGACTAAGGAATGGAGGAAGTTTTTGGTATTTAGAGAGGAGTGGAAAAAGTATAGGGAGAGCTTCTACAGTCACTGTCAAAGGCTGGCGGATTGGCAGAGTGATCCAAATATGAAAGAGAAGTTAATTTCACTTAGGAGAAAGGTCAAAACGGTATGTCATTGTCCTCTTCTTCTTCCCCTCCATTTGTTTTTTCAACTTTCCTGGTATGATTTTTGTTCACAAAAACCAAATCTTGTTGCATTTGTTTTCGGTTGTAAGAGTTTGAAAACAGAATTTAGAGGAGAATTTAAGCTGTAAATGTTCACTGCTTGTTCATTATTTGGTGAGATTTTCCAATGACCAGAGTGTAATTTGTTTAAACATTTAGAACGGCAACTTTTGTTTTTTGCTTTTTCATTTTTTGATCTCAAGTGAGAAAGATACAGATTGATGATGAAATGGAGATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCAACTGACATTAATGCAGTAGTTGCAAAGCGGCGCAAAGAGTTCACAGAGGAATTTTTCAAGTTCCTTACTTTGATTTCAGAAACCCATGATAGTTTGGAAGATCGTGATGGTAATAGTTTGAAAACTTTCAATTCCTTATCTTAGATTTTTCATTTTTTTATCATGAATGAGAAATATTCAGCTTAGATTTAATCTAAGATTTAGTTTATATTTATATTGTCCTTATCTTTTAACCATCTTTAGAAGTATTTAGATGAATTTAATCTTTGTGTAGCCAGGATTATATATTTGCTCTCTTTAAATCTAGAAGCCCTATCATGCATTTTTCACCTAAGTAGTGAATAATTAAAGTTGTCAATCTTGCCAATAAGAGCATAGCTCAACTGGCATTAAGTAAGACTCATGACCAAGAGGTCATAGGTTCGAATCCTCCCACCCCAAATATTGATGTACTCAAAAAAAGTTGTCAATCTTCGTATGTTTTTTCTTAAATTTTGTTCTGGGATTATTATTTGACAAATTTGTATCATGAATTCACACTGTGACATACATTTCATTCTTTATTCTTATTCACCTTTATGACTGATGACTGAATAATGACATTTATAATCTATAGTAGATTTCTCAAGAATTGACTAAAACCCAGAGTTGGGACAACATTTTAACTAGGAGATAGAGTTACTCTATTCTGAAAGAATAAAGAATTAATAAACAAAGACTTTATCATTAGTTCTCTCCTAAATTATTTTTCTTATTTGAGCAGCGGTGGCTCGGCTGGCGGCCAGATGTCTGTCTGCAGTGAGTGCTTACGATCGAACATTAGAAAATGTGGAGACATTAGATGCTGCACAAGTCAAATTTGATGATATACTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGATTCACCAACCATGAAGAATGAGGTAATGTTTAGATTTAAAACTTGTCATATTTATTTCATTTTAGTACCCAAACTTCAAGTAGTCTGTTTGGTCCAACTTTCAAGTGTTCTGTTTTTTCCTCTAGACTTTGCATAAAACATTATTTTCATTCTTACCTTTAATTTATAGTTAATTTATTTAAGAAGAATTTAACTAAATTTAAAAATCCGATGTCTAATTATTACGTGTACTTGTCTTTATGGACATATATTGATTAGGTAAAATGAAAAAAAAAAAAAAGGAAAAATGGGAATACGTCAGGTTTGTGTTAAAAAATTTAAAAAAACAAAAACAAAAACAAATAATAAGGGTCCCGTTTGATAACCATTTTGTTTTTGGATTTTAATTTTTGAAAGTTAAACTTATGTTCACCTAATTTCTCTACAATAAAGATTATCTCCCTTAAGTAAACATTTGAACTCTTAGCCAAATTCTAAAAACAAAGTCAAGTTTTTGAAAGCTACTTTTTTTAGTTTTCAAAAGTTGGCTTGGTTTTTGAAAACATAGGTACAAAGTAGACAAAAAGACATATAAACTCATTGGTATAAGTAGTATTTTTAGGTTTAAATTTCAAAAACTAAAAACCAAAAACAAAATGGTTATCAAACGGGGGCAAAGGTTTTATGGAAAGTTCAAAAACCAACAGAATGTTTGGAAAGAGGACAAAAATAGAATTCACATAAGAGTCGAGGACTGATTTAGAATCTAAACTTAGTGTATATGTAATGTATCATATGACACTTAACGTTTGCTAATTCTATTTCTAAACCATTATATAATAATGCATTATCAACCTCCTTTCCCAACCATTTCCTTGGTTTTATTTTCACATTAGCCATTTCCATGGTTTGTCTTGTCTCTGCTTCCCCGGTACTGTGTCTTTGGCATGCTTTTTTATTTTTATTTTTTTTTAGTGAAGCATTTATTTGAAATATATTAGTTAAATGCTTCATTTACAAACACTTTTACAATTTCTCCAGCCACTTATGATTCTCGAAAATTTTAAGAACTTTTAAAAGTACTTTTGGTGGTTAAAAGCAGTTTTTACCCGTCATCCAAACACTACCAATTTAAAAAAAGAAAAAAGAGAGAAAAATAGTCATTAATAAGCACTTTAAAGACATGCCAAACACACTAATACACTAAAAAGGGATTGTTGTTTCTATCCTCAAACACTTTTAAGATGCAACTATTATGTCACCTTCTCTGAGTTCTGTCTGTGTGTTTCTCTCTGAGGATGTAGGTGAAAGAAATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCTCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTAAATATCATAGATCCTGAAGAACGGTTTTCTGCTTTAGCAACAGCCTTTGCACCTGGTGATGGAAGTGAACCCAAAGATCGGAATGCTATGTACACGTGAGTAAGCCTCATATCATATTCTAATGTACATTCCTTTTTTTTTTTAATGGCGATTACGCATTTCTCCCCTAGGTTTCATTAAAGTTAAAATATCATTTTGGTCCCTCTAATTTGGGACTTGTTCTATTGTAGTCCTTGCGCTTTCAAGTGTATAAATTTAGTCCCCATGCTTTTATTAAATCTTAAATTTTTTTCTCTTCTGCTAGTTTATATATAATAAATAACATTTCTTAAAAAAAATTAGTCTCTATTAACTTTCTTTCTATAAATTTTGGAAATATATTCACATGATGTATTTTCTTGAATAGAACCTATTATTATTTTTTTACCAAGTTGGATAAAAAACAACTTCAAACTAATAATAATGACTAAGTAAAAACTATTTTTAGACATTTGGAAGTACTTGGACTAAAACAGAAGGAATCCCAAAGTGTAGGACCAGAATAGTATTTTATATCTATTTTGGTCTTTGAACTTTTAAATTTAGTCCACTCTAGTCCACGACATTTGTTTCAATTCCCAAACTTTGAGAAACTAACTATTTTGATAATCGTTGCTATTTTGTCGTCAATTATTTATCATTTAAATGACTTGGCTCTAATATGTGTATCAAGAACATGTTTAGGTGGGTACACATTGAAGGCATATAGGGGAATTTATGGATTACAAAGAGCATGAGAATATTTGGAGAAGGAATAATTTTCTTATAAATCTGTTTTTTTTTTTTAATGATTGAGCAGAACTCCAAAAGAGCTGCATAAGTGGATAAAGATGATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGAATCAGCCTGTTGTTATACAAAGGCTATTCATTCTCAAGGATACTATTGAAACTGAGTATTTGTATCAAAATGAGCTTCAGAATTCTGAAACAAAACTAAACCATGTCTCTGAAGATGCAGTTTCCATATAAGTTTGTAAATGCTGTCATTTTTTGTAATCTAGCCAACACTAGGATCGTTAGTTTTGGCATTGACTCGAGAATTATAATGATATATGATTATAAACTATTAAAAGATGAATATTCG

mRNA sequence

GGCTACCACATGAGAGGAAGTAATTCGGGTCTGCAGCTTCTTGGGCGGTCTGTGCCCTTCCCTTCTCTTTGGTTTGCTTCAAATTTTCCTTCAACGGCAATGACAAATCACTTGGCCTTTCAGCTTTCCATTTCCTCAACCAAAACCTTCATCATCCATGGCTTTGCCGCCACTCATAAAGAGAAAGCACTTCCATCCATCTTTTCTGCTTCACCCTTCAAACCCTCACCGATGAATTTCAAATCCAACAACCCAACAACAGTTACAATCACAACCCCAATGCAATTCAATGCAAGTGCACAAGCAAATGATCTAGTTACAACTGGAATGGAAGAGCAAGCAGAGATGGAAGTTGCAGAGGGATATACCATTTCTCAATTTTGTGATAAAATAATTGACATTTTCTTGAATCAGAAGCCCAAGACTAAGGAATGGAGGAAGTTTTTGGTATTTAGAGAGGAGTGGAAAAAGTATAGGGAGAGCTTCTACAGTCACTGTCAAAGGCTGGCGGATTGGCAGAGTGATCCAAATATGAAAGAGAAGTTAATTTCACTTAGGAGAAAGGTCAAAACGATTGATGATGAAATGGAGATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCAACTGACATTAATGCAGTAGTTGCAAAGCGGCGCAAAGAGTTCACAGAGGAATTTTTCAAGTTCCTTACTTTGATTTCAGAAACCCATGATAGTTTGGAAGATCGTGATGCGGTGGCTCGGCTGGCGGCCAGATGTCTGTCTGCAGTGAGTGCTTACGATCGAACATTAGAAAATGTGGAGACATTAGATGCTGCACAAGTCAAATTTGATGATATACTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGATTCACCAACCATGAAGAATGAGGTGAAAGAAATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCTCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTAAATATCATAGATCCTGAAGAACGGTTTTCTGCTTTAGCAACAGCCTTTGCACCTGGTGATGGAAGTGAACCCAAAGATCGGAATGCTATGTACACAACTCCAAAAGAGCTGCATAAGTGGATAAAGATGATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGAATCAGCCTGTTGTTATACAAAGGCTATTCATTCTCAAGGATACTATTGAAACTGAGTATTTGTATCAAAATGAGCTTCAGAATTCTGAAACAAAACTAAACCATGTCTCTGAAGATGCAGTTTCCATATAAGTTTGTAAATGCTGTCATTTTTTGTAATCTAGCCAACACTAGGATCGTTAGTTTTGGCATTGACTCGAGAATTATAATGATATATGATTATAAACTATTAAAAGATGAATATTCG

Coding sequence (CDS)

ATGAGAGGAAGTAATTCGGGTCTGCAGCTTCTTGGGCGGTCTGTGCCCTTCCCTTCTCTTTGGTTTGCTTCAAATTTTCCTTCAACGGCAATGACAAATCACTTGGCCTTTCAGCTTTCCATTTCCTCAACCAAAACCTTCATCATCCATGGCTTTGCCGCCACTCATAAAGAGAAAGCACTTCCATCCATCTTTTCTGCTTCACCCTTCAAACCCTCACCGATGAATTTCAAATCCAACAACCCAACAACAGTTACAATCACAACCCCAATGCAATTCAATGCAAGTGCACAAGCAAATGATCTAGTTACAACTGGAATGGAAGAGCAAGCAGAGATGGAAGTTGCAGAGGGATATACCATTTCTCAATTTTGTGATAAAATAATTGACATTTTCTTGAATCAGAAGCCCAAGACTAAGGAATGGAGGAAGTTTTTGGTATTTAGAGAGGAGTGGAAAAAGTATAGGGAGAGCTTCTACAGTCACTGTCAAAGGCTGGCGGATTGGCAGAGTGATCCAAATATGAAAGAGAAGTTAATTTCACTTAGGAGAAAGGTCAAAACGATTGATGATGAAATGGAGATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCAACTGACATTAATGCAGTAGTTGCAAAGCGGCGCAAAGAGTTCACAGAGGAATTTTTCAAGTTCCTTACTTTGATTTCAGAAACCCATGATAGTTTGGAAGATCGTGATGCGGTGGCTCGGCTGGCGGCCAGATGTCTGTCTGCAGTGAGTGCTTACGATCGAACATTAGAAAATGTGGAGACATTAGATGCTGCACAAGTCAAATTTGATGATATACTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGATTCACCAACCATGAAGAATGAGGTGAAAGAAATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCTCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTAAATATCATAGATCCTGAAGAACGGTTTTCTGCTTTAGCAACAGCCTTTGCACCTGGTGATGGAAGTGAACCCAAAGATCGGAATGCTATGTACACAACTCCAAAAGAGCTGCATAAGTGGATAAAGATGATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGAATCAGCCTGTTGTTATACAAAGGCTATTCATTCTCAAGGATACTATTGAAACTGAGTATTTGTATCAAAATGAGCTTCAGAATTCTGAAACAAAACTAAACCATGTCTCTGAAGATGCAGTTTCCATATAA

Protein sequence

MRGSNSGLQLLGRSVPFPSLWFASNFPSTAMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITTPMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVFREEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPTDINAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVETLDAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYTTPKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNSETKLNHVSEDAVSI
Homology
BLAST of Tan0004244 vs. ExPASy Swiss-Prot
Match: Q84WN0 (Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 PE=2 SV=2)

HSP 1 Score: 490.7 bits (1262), Expect = 1.8e-137
Identity = 250/374 (66.84%), Postives = 309/374 (82.62%), Query Frame = 0

Query: 78  KSNNPTTVTITT-PMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQK 137
           K    +T+T  T  + +N +  A   V + +E+  E+EVAEGYT++QFCDKIID+FLN+K
Sbjct: 41  KIRKSSTITFATDTVTYNGTTSAE--VKSSVEDPMEVEVAEGYTMAQFCDKIIDLFLNEK 100

Query: 138 PKTKEWRKFLVFREEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIH 197
           PK K+W+ +LV R+EW KY  +FY  C+  AD ++DP +K+KL+SL  KVK ID EME H
Sbjct: 101 PKVKQWKTYLVLRDEWNKYSVNFYKRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKH 160

Query: 198 SELLKELQDSPTDINAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAV 257
           ++LLKE+Q++PTDINA+ AKRR++FT EFF+++TL+SET D LEDRDAVARLA RCLSAV
Sbjct: 161 NDLLKEIQENPTDINAIAAKRRRDFTGEFFRYVTLLSETLDGLEDRDAVARLATRCLSAV 220

Query: 258 SAYDRTLENVETLDAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASA 317
           SAYD TLE+VETLD AQ KF+DILNSPS+D ACEKI SLAKAKELDSSLILLINSA+A+A
Sbjct: 221 SAYDNTLESVETLDTAQAKFEDILNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAA 280

Query: 318 KDSPTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGD 377
           K+S T+ NE K+IMY LYKATKS+LRS+ PKEIKLLK+LLNI DPEERFSALATAF+PGD
Sbjct: 281 KESQTVTNEAKDIMYHLYKATKSSLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGD 340

Query: 378 GSEPKDRNAMYTTPKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIE 437
             E KD  A+YTTPKELHKWIK+MLD+YHLN+E+TDI+EA+ M+QP+VIQRLFILKDTIE
Sbjct: 341 DHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIE 400

Query: 438 TEYLYQNELQNSET 451
            EYL +  +   ET
Sbjct: 401 DEYLDKKTIVADET 412

BLAST of Tan0004244 vs. NCBI nr
Match: XP_038883875.1 (uncharacterized protein At4g37920 isoform X2 [Benincasa hispida])

HSP 1 Score: 718.4 bits (1853), Expect = 4.0e-203
Identity = 375/433 (86.61%), Postives = 397/433 (91.69%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITT 89
           A TNHL FQLSISSTK+FI   F+AT   K LPSI+SAS FKPSP  +KS+NPT VTITT
Sbjct: 2   AFTNHLLFQLSISSTKSFIFPSFSAT--LKPLPSIYSASLFKPSPEIYKSDNPTPVTITT 61

Query: 90  PMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVFR 149
           PMQF ASA  ND+ TT  EE+AEMEVAEGYTISQFCDKIIDIF+N+KPKTKEWRKFLVFR
Sbjct: 62  PMQFKASALVNDVATTEKEEEAEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFR 121

Query: 150 EEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPTD 209
           EEWKKYRESFYSHCQR ADW+SDP MKEKLISLRRKVK IDDEMEIH ELLKELQDSPTD
Sbjct: 122 EEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHGELLKELQDSPTD 181

Query: 210 INAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVETL 269
           INA+VAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCL+AVSAYDRTLENVETL
Sbjct: 182 INAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETL 241

Query: 270 DAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKEI 329
           D+AQ KFDDIL SPSLDVACEKIASLAKAKELDSSLILLINSAWA+AK+S TMKNEVKEI
Sbjct: 242 DSAQAKFDDILTSPSLDVACEKIASLAKAKELDSSLILLINSAWAAAKESTTMKNEVKEI 301

Query: 330 MYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYTT 389
           MY LYKATKS+LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KD  A+YTT
Sbjct: 302 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQKDPKALYTT 361

Query: 390 PKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNSE 449
           PKELHKWIK+MLDSYHLNQEDTDIREAR+M QPVVIQRLFILKDTIETEYL QNE QN +
Sbjct: 362 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPVVIQRLFILKDTIETEYLEQNEFQNPQ 421

Query: 450 TKLNHVSEDAVSI 463
           +  NHVSEDAVSI
Sbjct: 422 STPNHVSEDAVSI 432

BLAST of Tan0004244 vs. NCBI nr
Match: KAG6603165.1 (hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 714.5 bits (1843), Expect = 5.8e-202
Identity = 371/433 (85.68%), Postives = 399/433 (92.15%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITT 89
           A+TN LAFQLSISSTKTFI   F+A   +K LPSI SA+PFK SP N KS+N  T T+ T
Sbjct: 2   AITNQLAFQLSISSTKTFIFRRFSAA--QKPLPSISSATPFKSSPKNSKSDNRATATVPT 61

Query: 90  PMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVFR 149
           PMQFNASA+AND+ TT MEEQAEMEVAEGYTISQFCDKIIDIF+N+KPKTKEWRK LVFR
Sbjct: 62  PMQFNASARANDVATTEMEEQAEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVFR 121

Query: 150 EEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPTD 209
           EEWKKYRESFYSHCQR ADW+SDP MKEKL+SL R+VK IDDEMEIHSELLKELQDSPTD
Sbjct: 122 EEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPTD 181

Query: 210 INAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVETL 269
           INA+VAKRR+EFTE+FFKFLTL+SETHDSLED DAVARLAARCLSAVSAYDRTLE+VETL
Sbjct: 182 INAIVAKRRQEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVETL 241

Query: 270 DAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKEI 329
           D+AQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAK+S TMKNEVKEI
Sbjct: 242 DSAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 301

Query: 330 MYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYTT 389
           MY LYKATKS LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEPKD NA+YTT
Sbjct: 302 MYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEPKDPNAIYTT 361

Query: 390 PKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNSE 449
           PKELHKWIK+MLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYL QNE QN++
Sbjct: 362 PKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNESQNAQ 421

Query: 450 TKLNHVSEDAVSI 463
           +K NHVS +AVSI
Sbjct: 422 SKPNHVSANAVSI 432

BLAST of Tan0004244 vs. NCBI nr
Match: XP_022933100.1 (uncharacterized protein At4g37920 [Cucurbita moschata])

HSP 1 Score: 714.5 bits (1843), Expect = 5.8e-202
Identity = 371/433 (85.68%), Postives = 398/433 (91.92%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITT 89
           A+TN LAFQLSISSTKTFI   F+A   +K LPSI SA+PFK SP N KS+N  T T+ T
Sbjct: 2   AITNQLAFQLSISSTKTFIFRRFSAA--QKPLPSISSATPFKSSPKNSKSDNRATATVPT 61

Query: 90  PMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVFR 149
           PMQFNASA+ ND+ TT MEEQAEMEVAEGYTISQFCDKIIDIF+N+KPKTKEWRK LVFR
Sbjct: 62  PMQFNASARTNDVATTEMEEQAEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVFR 121

Query: 150 EEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPTD 209
           EEWKKYRESFYSHCQR ADW+SDP MKEKL+SL R+VK IDDEMEIHSELLKELQDSPTD
Sbjct: 122 EEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPTD 181

Query: 210 INAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVETL 269
           INA+VAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAARCLSAVSAYDRTLE+VETL
Sbjct: 182 INAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVETL 241

Query: 270 DAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKEI 329
           D+AQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAK+S TMKNEVKEI
Sbjct: 242 DSAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 301

Query: 330 MYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYTT 389
           MY LYKATKS LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEPKD NA+YTT
Sbjct: 302 MYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEPKDPNAIYTT 361

Query: 390 PKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNSE 449
           PKELHKWIK+MLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYL QNE QN++
Sbjct: 362 PKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNESQNAQ 421

Query: 450 TKLNHVSEDAVSI 463
           +K NHVS +AVSI
Sbjct: 422 SKPNHVSTNAVSI 432

BLAST of Tan0004244 vs. NCBI nr
Match: XP_022967802.1 (uncharacterized protein At4g37920 [Cucurbita maxima])

HSP 1 Score: 713.0 bits (1839), Expect = 1.7e-201
Identity = 369/433 (85.22%), Postives = 398/433 (91.92%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITT 89
           A+TN LAFQLSISST+TFI   F+A   +  LPSI SA PFKP+P N KS+N  T T+ T
Sbjct: 2   AITNQLAFQLSISSTRTFIFRRFSAA--QNPLPSISSAIPFKPAPKNSKSDNRATATVPT 61

Query: 90  PMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVFR 149
           PMQFNASA+AND+ TT MEEQ EMEVAEGYTISQFCDKIIDIF+N+KPKTKEWRK LVFR
Sbjct: 62  PMQFNASARANDVATTEMEEQTEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVFR 121

Query: 150 EEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPTD 209
           EEWKKYRESFYSHCQR ADW+SDP MKEKL+SL R+VK IDDEMEIHSELLKELQDSPTD
Sbjct: 122 EEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPTD 181

Query: 210 INAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVETL 269
           INA+VAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAARCLSAVSAYDRTLE+VETL
Sbjct: 182 INAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVETL 241

Query: 270 DAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKEI 329
           D+AQVKFDDILNSP+LDVACEKIASLAKAKELDSSLILLINSAWASAK+S TMKNEVKEI
Sbjct: 242 DSAQVKFDDILNSPTLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 301

Query: 330 MYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYTT 389
           MY+LYKATKS LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEPKD NA+YTT
Sbjct: 302 MYRLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEPKDPNAIYTT 361

Query: 390 PKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNSE 449
           PKELHKWIK+MLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYL QNELQN +
Sbjct: 362 PKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNELQNPQ 421

Query: 450 TKLNHVSEDAVSI 463
           +K NHVS +AVSI
Sbjct: 422 SKPNHVSANAVSI 432

BLAST of Tan0004244 vs. NCBI nr
Match: XP_023544083.1 (uncharacterized protein At4g37920 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 708.4 bits (1827), Expect = 4.1e-200
Identity = 371/434 (85.48%), Postives = 396/434 (91.24%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSI-FSASPFKPSPMNFKSNNPTTVTIT 89
           A+TN L FQLSISSTKTFI   F+A   +K LP I  SA PFKPSP N KS+N  T T+ 
Sbjct: 2   AITNQLTFQLSISSTKTFIFRRFSAA--QKPLPPISSSAIPFKPSPKNSKSDNRATATVP 61

Query: 90  TPMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVF 149
           TPMQFNASA+AND+ T  MEEQAEMEVAEGYTISQFCDKIIDIF+N+KPKTKEWRK LVF
Sbjct: 62  TPMQFNASARANDVATMEMEEQAEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVF 121

Query: 150 REEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPT 209
           REEWKKYRESFYSHCQR ADW+SDP MKEKL+SL R+VK IDDEMEIHSELLKELQDSPT
Sbjct: 122 REEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPT 181

Query: 210 DINAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVET 269
           DINA+VAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAARCLSAVSAYDRTLE+VET
Sbjct: 182 DINAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVET 241

Query: 270 LDAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKE 329
           LD+AQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAK+S TMKNEVKE
Sbjct: 242 LDSAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKE 301

Query: 330 IMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYT 389
           IMY LYKATKS LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE KD NA+YT
Sbjct: 302 IMYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEAKDPNAIYT 361

Query: 390 TPKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNS 449
           TPKELHKWIK+MLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYL QNELQN+
Sbjct: 362 TPKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNELQNA 421

Query: 450 ETKLNHVSEDAVSI 463
           +TK NHVS +AVSI
Sbjct: 422 QTKPNHVSANAVSI 433

BLAST of Tan0004244 vs. ExPASy TrEMBL
Match: A0A6J1F3Z5 (uncharacterized protein At4g37920 OS=Cucurbita moschata OX=3662 GN=LOC111439867 PE=4 SV=1)

HSP 1 Score: 714.5 bits (1843), Expect = 2.8e-202
Identity = 371/433 (85.68%), Postives = 398/433 (91.92%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITT 89
           A+TN LAFQLSISSTKTFI   F+A   +K LPSI SA+PFK SP N KS+N  T T+ T
Sbjct: 2   AITNQLAFQLSISSTKTFIFRRFSAA--QKPLPSISSATPFKSSPKNSKSDNRATATVPT 61

Query: 90  PMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVFR 149
           PMQFNASA+ ND+ TT MEEQAEMEVAEGYTISQFCDKIIDIF+N+KPKTKEWRK LVFR
Sbjct: 62  PMQFNASARTNDVATTEMEEQAEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVFR 121

Query: 150 EEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPTD 209
           EEWKKYRESFYSHCQR ADW+SDP MKEKL+SL R+VK IDDEMEIHSELLKELQDSPTD
Sbjct: 122 EEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPTD 181

Query: 210 INAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVETL 269
           INA+VAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAARCLSAVSAYDRTLE+VETL
Sbjct: 182 INAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVETL 241

Query: 270 DAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKEI 329
           D+AQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAK+S TMKNEVKEI
Sbjct: 242 DSAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 301

Query: 330 MYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYTT 389
           MY LYKATKS LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEPKD NA+YTT
Sbjct: 302 MYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEPKDPNAIYTT 361

Query: 390 PKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNSE 449
           PKELHKWIK+MLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYL QNE QN++
Sbjct: 362 PKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNESQNAQ 421

Query: 450 TKLNHVSEDAVSI 463
           +K NHVS +AVSI
Sbjct: 422 SKPNHVSTNAVSI 432

BLAST of Tan0004244 vs. ExPASy TrEMBL
Match: A0A6J1HRT8 (uncharacterized protein At4g37920 OS=Cucurbita maxima OX=3661 GN=LOC111467208 PE=4 SV=1)

HSP 1 Score: 713.0 bits (1839), Expect = 8.1e-202
Identity = 369/433 (85.22%), Postives = 398/433 (91.92%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITT 89
           A+TN LAFQLSISST+TFI   F+A   +  LPSI SA PFKP+P N KS+N  T T+ T
Sbjct: 2   AITNQLAFQLSISSTRTFIFRRFSAA--QNPLPSISSAIPFKPAPKNSKSDNRATATVPT 61

Query: 90  PMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVFR 149
           PMQFNASA+AND+ TT MEEQ EMEVAEGYTISQFCDKIIDIF+N+KPKTKEWRK LVFR
Sbjct: 62  PMQFNASARANDVATTEMEEQTEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVFR 121

Query: 150 EEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPTD 209
           EEWKKYRESFYSHCQR ADW+SDP MKEKL+SL R+VK IDDEMEIHSELLKELQDSPTD
Sbjct: 122 EEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPTD 181

Query: 210 INAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVETL 269
           INA+VAKRRKEFTE+FFKFLTL+SETHDSLED DAVARLAARCLSAVSAYDRTLE+VETL
Sbjct: 182 INAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVETL 241

Query: 270 DAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKEI 329
           D+AQVKFDDILNSP+LDVACEKIASLAKAKELDSSLILLINSAWASAK+S TMKNEVKEI
Sbjct: 242 DSAQVKFDDILNSPTLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 301

Query: 330 MYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYTT 389
           MY+LYKATKS LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSEPKD NA+YTT
Sbjct: 302 MYRLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEPKDPNAIYTT 361

Query: 390 PKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNSE 449
           PKELHKWIK+MLDSYHLNQEDTDIREAR M QP+VIQRLFILKDTIETEYL QNELQN +
Sbjct: 362 PKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNELQNPQ 421

Query: 450 TKLNHVSEDAVSI 463
           +K NHVS +AVSI
Sbjct: 422 SKPNHVSANAVSI 432

BLAST of Tan0004244 vs. ExPASy TrEMBL
Match: A0A1S3B4W5 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486045 PE=4 SV=1)

HSP 1 Score: 698.4 bits (1801), Expect = 2.1e-197
Identity = 365/436 (83.72%), Postives = 397/436 (91.06%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITT 89
           A TNHL FQ  ISSTK+FI   F+ T   K LPSI+SASPFKPSP   KS+N TTVTIT 
Sbjct: 2   AFTNHLPFQFYISSTKSFIFPNFSTT--LKPLPSIYSASPFKPSPKFSKSDNRTTVTITA 61

Query: 90  PMQ-FNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVF 149
           P+Q FNASA+ ND+ T+  EEQAEMEVA+GY++SQFCDKIIDIF+N+KPKTKEWRKFLVF
Sbjct: 62  PLQIFNASARVNDVATSEKEEQAEMEVAKGYSLSQFCDKIIDIFMNEKPKTKEWRKFLVF 121

Query: 150 REEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPT 209
           REEWKKYRESFYSHCQR ADW+SDP MKEKLISLRRKVK IDDEMEIHSELLKELQDSPT
Sbjct: 122 REEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPT 181

Query: 210 DINAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVET 269
           DINA+VA RRKEFT+EFFKFLTLISETHDSLEDRDAVARLAARCL+AVSAYDRTLENVET
Sbjct: 182 DINAIVANRRKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVET 241

Query: 270 LDAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKE 329
           LD+AQ KFD+ILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAK+S TMKNEVKE
Sbjct: 242 LDSAQAKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKE 301

Query: 330 IMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYT 389
           IMY LYKATKS+LRSMAPKEIKLLKHLLNI+DPEERFSALATAF+PGDGSE KD NA+YT
Sbjct: 302 IMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFSPGDGSEQKDPNALYT 361

Query: 390 TPKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNS 449
           TPKELHKWIK+MLDSYHLNQEDTDIREAR+M QP+VIQRLFILKDTIETEYL QN+ QN 
Sbjct: 362 TPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNP 421

Query: 450 ETK--LNHVSEDAVSI 463
           +++   NH SEDA+SI
Sbjct: 422 QSRPNHNHGSEDAISI 435

BLAST of Tan0004244 vs. ExPASy TrEMBL
Match: A0A0A0L3X1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G515540 PE=4 SV=1)

HSP 1 Score: 689.5 bits (1778), Expect = 9.6e-195
Identity = 360/436 (82.57%), Postives = 393/436 (90.14%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKPSPMNFKSNNPTTVTITT 89
           A TNHL FQ  +SSTK FI   F+ T     LPSI+SASPFKPSP   KS+N T+VTIT 
Sbjct: 2   AFTNHLPFQFYVSSTKPFIFPSFSTT--LNPLPSIYSASPFKPSPKISKSDNRTSVTITA 61

Query: 90  PMQ-FNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKEWRKFLVF 149
           P+Q FNASA+ ND+ T+  EEQ EMEVA+GY++SQFCDKIIDIFLN+KPKTKEWRKFLVF
Sbjct: 62  PLQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVF 121

Query: 150 REEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLKELQDSPT 209
           REEWKKYRESFYSHCQR ADW+ DP MKEKLISLRRKVK IDDEMEIHSELLKELQDSPT
Sbjct: 122 REEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPT 181

Query: 210 DINAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDRTLENVET 269
           DINA+VAKR KEFT+EFFKFLTLISETHDSLEDRDAVARLAARCL+AVSAY+RTLENVET
Sbjct: 182 DINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVET 241

Query: 270 LDAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPTMKNEVKE 329
           LD+AQVKFD+ILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAK+S TMKNEVKE
Sbjct: 242 LDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKE 301

Query: 330 IMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPKDRNAMYT 389
           IMY LYKATKS+LRSMAPKEIKLLKHLLNI+DPEERFSALAT F+PGDGSE KD NA+YT
Sbjct: 302 IMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYT 361

Query: 390 TPKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLYQNELQNS 449
           TPKELHKWIK+MLDSYHLNQEDTDIREAR+M QP+VIQRLFILKDTIETEYL QN+ QN 
Sbjct: 362 TPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNP 421

Query: 450 ETK--LNHVSEDAVSI 463
           +++   NH SEDA+SI
Sbjct: 422 QSRPSHNHGSEDAISI 435

BLAST of Tan0004244 vs. ExPASy TrEMBL
Match: A0A6J1DBT6 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018874 PE=4 SV=1)

HSP 1 Score: 636.7 bits (1641), Expect = 7.4e-179
Identity = 339/423 (80.14%), Postives = 366/423 (86.52%), Query Frame = 0

Query: 30  AMTNHLAFQLSISSTKTFIIHGFAATHKEKALPSIFSASPFKP------SPMNFKSNNPT 89
           AM N+L F LS SS KT I          KALP     +P  P      SP   KSN+PT
Sbjct: 2   AMANYLPFHLSSSSPKTSIF--------PKALPE----APRNPLISSALSPKKSKSNHPT 61

Query: 90  TVTITTPMQFNASAQ--ANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQKPKTKE 149
           T++IT+P +  A+A   AND+ T  ME Q+EMEVAEGYTISQFCDKIIDIFLN+KPKTKE
Sbjct: 62  TISITSPTKLKATASLGANDVATAEMEAQSEMEVAEGYTISQFCDKIIDIFLNEKPKTKE 121

Query: 150 WRKFLVFREEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIHSELLK 209
           WRK LVFREEWKKYRESFYSHCQR  DW+SDP+MKE+LISLRRKVK IDDEMEIHSEL K
Sbjct: 122 WRKLLVFREEWKKYRESFYSHCQRRVDWESDPSMKERLISLRRKVKRIDDEMEIHSELFK 181

Query: 210 ELQDSPTDINAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAVSAYDR 269
           ELQDSPTDINA+VAKRRK+FTEEFF FLTLISETHDSLEDRDAVARLAARCLSAVSAYDR
Sbjct: 182 ELQDSPTDINAIVAKRRKDFTEEFFXFLTLISETHDSLEDRDAVARLAARCLSAVSAYDR 241

Query: 270 TLENVETLDAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKDSPT 329
           TLE V+TLD AQ KFDDILNSPSLDVACEKI SLAKAKELDSSLILLINSAWASAK+S T
Sbjct: 242 TLEYVDTLDCAQAKFDDILNSPSLDVACEKIESLAKAKELDSSLILLINSAWASAKESTT 301

Query: 330 MKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEPK 389
           MKNEVKEIMY+LY+ATKS+LRSMAPKEIKLLKHLLNI+DPEERFSALATAFAPGDGSE +
Sbjct: 302 MKNEVKEIMYRLYRATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEAR 361

Query: 390 DRNAMYTTPKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIETEYLY 445
           D NAMYTTPKELHKWIK+MLDSYHLNQEDT++REAR+MNQPVVIQRLFILKDTIETEYL 
Sbjct: 362 DPNAMYTTPKELHKWIKIMLDSYHLNQEDTEMREARNMNQPVVIQRLFILKDTIETEYLE 412

BLAST of Tan0004244 vs. TAIR 10
Match: AT4G37920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane, chloroplast, chloroplast envelope; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36320.1); Has 123 Blast hits to 120 proteins in 40 species: Archae - 2; Bacteria - 11; Metazoa - 8; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 490.7 bits (1262), Expect = 1.3e-138
Identity = 250/374 (66.84%), Postives = 309/374 (82.62%), Query Frame = 0

Query: 78  KSNNPTTVTITT-PMQFNASAQANDLVTTGMEEQAEMEVAEGYTISQFCDKIIDIFLNQK 137
           K    +T+T  T  + +N +  A   V + +E+  E+EVAEGYT++QFCDKIID+FLN+K
Sbjct: 41  KIRKSSTITFATDTVTYNGTTSAE--VKSSVEDPMEVEVAEGYTMAQFCDKIIDLFLNEK 100

Query: 138 PKTKEWRKFLVFREEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVKTIDDEMEIH 197
           PK K+W+ +LV R+EW KY  +FY  C+  AD ++DP +K+KL+SL  KVK ID EME H
Sbjct: 101 PKVKQWKTYLVLRDEWNKYSVNFYKRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKH 160

Query: 198 SELLKELQDSPTDINAVVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLSAV 257
           ++LLKE+Q++PTDINA+ AKRR++FT EFF+++TL+SET D LEDRDAVARLA RCLSAV
Sbjct: 161 NDLLKEIQENPTDINAIAAKRRRDFTGEFFRYVTLLSETLDGLEDRDAVARLATRCLSAV 220

Query: 258 SAYDRTLENVETLDAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASA 317
           SAYD TLE+VETLD AQ KF+DILNSPS+D ACEKI SLAKAKELDSSLILLINSA+A+A
Sbjct: 221 SAYDNTLESVETLDTAQAKFEDILNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAA 280

Query: 318 KDSPTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGD 377
           K+S T+ NE K+IMY LYKATKS+LRS+ PKEIKLLK+LLNI DPEERFSALATAF+PGD
Sbjct: 281 KESQTVTNEAKDIMYHLYKATKSSLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGD 340

Query: 378 GSEPKDRNAMYTTPKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVIQRLFILKDTIE 437
             E KD  A+YTTPKELHKWIK+MLD+YHLN+E+TDI+EA+ M+QP+VIQRLFILKDTIE
Sbjct: 341 DHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIE 400

Query: 438 TEYLYQNELQNSET 451
            EYL +  +   ET
Sbjct: 401 DEYLDKKTIVADET 412

BLAST of Tan0004244 vs. TAIR 10
Match: AT1G36320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37920.1); Has 93 Blast hits to 90 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 287.3 bits (734), Expect = 2.1e-77
Identity = 151/375 (40.27%), Postives = 244/375 (65.07%), Query Frame = 0

Query: 71  KPSPMNFKSNNPTTVTITTPMQFNASAQANDLVTTGMEEQ---AEMEVAEGYTISQFCDK 130
           K S +N KS          P +F  SA  +D      EE+   +E  V +   + + CDK
Sbjct: 48  KGSLLNLKSER--------PQRFVISAVVDDKSVVAKEEKKDGSEEVVVDNQRMIKVCDK 107

Query: 131 IIDIFLNQKPKTKEWRKFLVFREEWKKYRESFYSHCQRLADWQSDPNMKEKLISLRRKVK 190
           +I++F+  KP   +WR+ L F +EW   R  FY  CQ  AD + +P MK K+  L RK+K
Sbjct: 108 LIEVFMVDKPTPSDWRRLLAFSKEWDSIRPHFYKRCQERADSEDNPEMKHKVHRLARKLK 167

Query: 191 TIDDEMEIHSELLKELQDS-PTDINAVVAKRRKEFTEEFFKFLTLISET-HDSLEDRDAV 250
            +D++++ H+ELL  ++ + P +I  +VA+RRK+FT EFF+ L  ++E+ +D+ ++++A+
Sbjct: 168 EVDEDIQRHNELLNVIKRTPPAEIGELVARRRKDFTNEFFEHLHTVAESYYDNPDEQNAL 227

Query: 251 ARLAARCLSAVSAYDRTLENVETLDAAQVKFDDILNSPSLDVACEKIASLAKAKELDSSL 310
           A L    ++AV AYD + E+++ L+AA++K  DI+NSPSLD AC KI SLA+  +LDS+L
Sbjct: 228 ASLGKLSIAAVQAYDTSTESIDALNAAEMKLQDIINSPSLDAACRKIDSLAEKNQLDSAL 287

Query: 311 ILLINSAWASAKDSPTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERF 370
           +L+I  AW++AK+S  MK EVK+I+Y LY   + NL+ + PKE+++LK+LL+I DP+E+ 
Sbjct: 288 VLMITKAWSAAKESNMMKEEVKDILYHLYVTARGNLQRLMPKEVRILKYLLSIEDPQEQI 347

Query: 371 SALATAFAPGDGSEPKDRNAMYTTPKELHKWIKMMLDSYHLNQEDTDIREARHMNQPVVI 430
           SAL  AF PGD  E  D + +YTTP+ L   +K +L++YH ++E + ++EA+ +  P +I
Sbjct: 348 SALQDAFTPGDELEGTDVDYLYTTPEHLQSLMKTVLEAYHFSREGSLVKEAKDLMHPELI 407

Query: 431 QRLFILKDTIETEYL 441
            ++  LK  +E +Y+
Sbjct: 408 AKIEQLKKLVEKKYM 414

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q84WN01.8e-13766.84Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 P... [more]
Match NameE-valueIdentityDescription
XP_038883875.14.0e-20386.61uncharacterized protein At4g37920 isoform X2 [Benincasa hispida][more]
KAG6603165.15.8e-20285.68hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022933100.15.8e-20285.68uncharacterized protein At4g37920 [Cucurbita moschata][more]
XP_022967802.11.7e-20185.22uncharacterized protein At4g37920 [Cucurbita maxima][more]
XP_023544083.14.1e-20085.48uncharacterized protein At4g37920 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1F3Z52.8e-20285.68uncharacterized protein At4g37920 OS=Cucurbita moschata OX=3662 GN=LOC111439867 ... [more]
A0A6J1HRT88.1e-20285.22uncharacterized protein At4g37920 OS=Cucurbita maxima OX=3661 GN=LOC111467208 PE... [more]
A0A1S3B4W52.1e-19783.72uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3... [more]
A0A0A0L3X19.6e-19582.57Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G515540 PE=4 SV=1[more]
A0A6J1DBT67.4e-17980.14uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Momordica charant... [more]
Match NameE-valueIdentityDescription
AT4G37920.11.3e-13866.84unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G36320.12.1e-7740.27unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 186..206
NoneNo IPR availablePANTHERPTHR31755:SF2ENDORIBONUCLEASE E-LIKE PROTEINcoord: 54..454
IPR040320Uncharacterized protein At4g37920-likePANTHERPTHR31755FOLATE RECEPTOR-LIKEcoord: 54..454

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0004244.1Tan0004244.1mRNA