Spg008116 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg008116
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Location: scaffold1: 53232822 .. 53238312 (+)
RNA-Seq Expression: Spg008116
Synteny: Spg008116
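
The Location field can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention, which this page does not state explicitly). `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(text):
    """Parse a string like 'scaffold1: 53232822 .. 53238312 (+)'.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumes 1-based, inclusive coordinates, so both endpoints are counted.
    return end - start + 1


seqid, start, end, strand = parse_location("scaffold1: 53232822 .. 53238312 (+)")
print(seqid, strand, span_length(start, end))  # scaffold1 + 5491
```

If the coordinates were instead 0-based and half-open, the `+ 1` would have to be dropped.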
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, polypeptide, three_prime_UTR
ATGCAAAATTGCGTGTCCGACACGTGTTCGGAGCGTGTCCGGACGAATCCGTGTCCGTGCTTCCTAGCCAACCAACCAATCTCATGGGAGAAGTTCAAAGACCTGCTTTATGATTACTTCCCTGAGACGGTTATGACAAGGAAATTGAATTCCTGCACTTGACCCAAGGGAATATGTCCGTAGTGTAGTACAAGAGGAAGTTCACCGCACCCTCGTGCTTTGCCCCTGATCTGGTCAGCACACCAGAATGGAAAATCAAGAGGTTCATTAAAGGCCTCCATGATGAGATCTGCAGTTCCGTAGTCCTAAAAGGGTCCACGACTTTTGCCGATGCGCTCAAAGACGCATTAATCATGGATAAGAATGGGGCGAAGAAGGTACAACCTTGATGGGGAGAAGCTTCGTCGTTGGAAGTCAAGAGGAAACCTCCTCCTATGTTTGCATGGCAGCCATCCAAGGCCCTTCGTTAGTAACCACAGAGGCCGGCTCCTGCCCTCCTTCTGTGCACTACATGTAAAAGGCACAACTCTGGACAATGCTGGACAGTCGAGTAGGTCTGTTTCAATTGTGAAAAAGGACACTATGTAAGGCAGTGCCTGACTAAGAGTGAAACAAATGCAGAGAAGCCAGCCTCAAAAGTCCTACCAGTGCAAGCTCAAGGTGGAAATCAGAGGGCACGCGTCTTTGCCCTAACTAAAGAAGAAGCAAATGATGGGGATGTCGTGGTTACAAGTACGCTCACTCTAAACTTCGGCTTTTGCTTGATTGTTCACCTTATTTGTGCCACTAGGTTAGGCTTAAAATCTGGGGCATTACAATATTTATCTTATCCTAATCTTTTAACCAACTTCAGAAGTATTTAGATGAACTTTATCTTTTAGTTTAGCATGGATTCTATATTTGTTTGTTTGTATGTTTTTTTTCCTGGAAGTCTTATCCTGCATTTTTCACCACTGAATGGATAGTAAAATTTAGGGAAAATTGCACAAACTACTCCTGTACTATGGGGTATGTTGCCATGTTACCTATGGATTTTTAATTTGATATATTAGCCCCTAACTTTGTCATGTATTGCAATCACACCCATGAATTTTTTATTTGATCAATCTGCCTTTGTTGCAATTAGTCCTTTTTGTTAGTTGCTGTTAGATTAATGTTTAATTGGACTTCATATGTGTGACTCATACATGAGTAAGGGTGGGTAAATTTGTCATTGAGGACCTCACGAGGGTGGCCCCTCCTTTTGGGCTTACGATTTGGGTAGTGCCATGATCGATTAGTTTGGAGATGAAAGTTTTAGTTATGATGACGGCAATCTCTTTGGTCTTCTGACATTTCGTCTTTCCCATAAAAAAAGAATAAAGAAAAGAAAAATCACACAACCTAAATAACACCCACTAAATTGAAACTTCCAACGTTTGCTTGTCTAAAAACTTCAATTTTTTTCTTCTTTTTTTTTCCTTTTCTTTCCAAATCTTCCAATAACAAGTGCTCTTCCCCATTCATGTGTCAGCCACACATGTGAACCCAATTAAATAGTAATTTAATGACAACTAACATGAGGGACTGATTACAACACTTAACAAAGTACATGATGTTGATCGATCAAATTAAAATTTCACGAGTGTGAGTGCAATATTTAACAAAGTACATAGGGCTGGTTGATTAAATTTAAAAATTCATGGGTGCGATGGTAACAAATCCCATAATTTAGGAGTTTGTGAAAATTTCCCTAAACTTTGTCATTTTCATACTTGTTTCTAGATAAATTTGTATCATGTATTCACAATATATATTCACCTTAACGATTTATGATTGGATAATGACATTTATAATCTGTGGTAGATTTATCAAGAGTTAACTAAAACCTAGAGTTGTAACAACATTTTAGCTAGGAGTGAGTTTCTCTAACCTGAAAGAAAAAAAAAGTTCCAAACCAAATAAATAATCACAGACTTTATCAATAGCTTTTTCTTAAATTATTTTTGCTATTTTGGGCA
GCGGTGGCTCGGCTGGCGGCTAGATGTCTGTCTGCAGTTAGTGCTTATGATAGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATGATATACTGAATTCTCCCTCGTTGGAAGTGGCTTGTGAAAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAGAACGAGGTAATGTTTTAATTTAAATCTTATTTTACTGCTTGAACTGTCACATTTATTTCATTTTAGTGCCCAAGGTTTCAAATAGTCCGTCTAGTCCTCCAACTTTCAAGTGTCATGTTTTTGTCTCTAGATTTTTGCATAAAACATCCTTTTGATCCTTACCTATAATTTGTCATTAATTTAACAAAAATCTAATTACATTTAAAAGTTTGATACCTAATTGATAACAATGAATTAAACAGAAGGTTGGAAAGTGTAGGGAAGAAAATAAAATTTTTACATGAGAGTTAAGGACTGATATAGAATTTGAACTTAATGTATATGTATGACCCTTACATTTAATCAAATCTACTTCTAAACCATTATGTTATAATGCATTGCCAATCTCCCTGCCAACAATATTAATTGCTGGTTGTACTTTGAAGTTCAATGAAAGTAACACCCCAGAACCCCAGTTTTGCCTAAGTTGGAAACTTACTGTCATTTTCAATTACTCATTTCTATGGACTGGTCTGTTATCTGTGCTTCCCACATTTTAAAGTGAAACACTTATTTTATAAGCACATTTGGAAAAAGAATATGTAAATGCTTCCTATAAAAAAAACTTTTAGAATTTCTTCAATCACTTTTGATTATTGCAAATTTTAAAAAGTGCTTTTAATAGGTAAAAACATTTTTTACCCCTTTTTAAAGTCTGAACCAAATACTGCCGATTTTCTTTTTTAAATAAATTTAGTTATGAATAAGCACTTTAAAAGGTATGCTAACCACACTAAAAGGAATACTGTTTCCACCTTCTCATTGTAATTGATATATTGAAAATGGATTGCAAAATTGATTGAAAGTCTATGAACTGATGATCAATCCATATGAAGAAGGATAGTATGCCCTTAAAAAAACATTTCATCTGTATAGAGTTCAGCAGACTAAGAGATTCTTTTCCAATTTTTTTTCAACTTTTTGATGCAATTATTATGTCATCTTCTCTGTCACTCTCTGGATGTAGGTGAAAGAGATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTAAACATAATAGATCCTGAGGAACGATTTTCTGCTTTAGCAACGGCCTTCGCCCCAGGTGATGGAAGTGAAGCCAAAGATCCGAATGCTATGTACACGTGAGTAAGCCTCATATCATATTCAAATGTACATTTTTTTTTCCATGATGATCACATATTTCTCCGATAGGTTTCATTAAGGTTAAAATACCATTTTGGTCCCTATACTTTGAGACTTCTATTTTAGTTCTAATGTCCAAAGTTAGTCTCCATACTTCCAATAAATCTTAAAATTAGTCACTATTCTACTGATAGTTTATGGTTAACATTTCCAAAAAAAATTTAGACTCTATTAACCTTTCCTCTATAAATTTTGGAAATATCTTCACAGTATGTATTTTGGTAAAATATATTTTTGGTCTCTGAGATTTTTAACTTAATTTCTATATCGTCCCAAGGTTTTTAAAACTAACATTTTTGGTCCTTGAGATTTAAAGTTTGTTTCTATTTGGTCACTAAGGTTTCAAAATTGACATGTTGAGTTCCTAAAGTTTGAATTTGATTTATAATTGGTCCCTAAGGTTTGAAATTGGTTTCTAGTAAAGTTACGTTAATTGACTTAATGGAATGATGATGTGGTAATTAACATTCGCTAACTAGACTTTGACGATATGTATTTTTAAAAT
GAGTTGAATTTTTTTATATATTTTTTATTTTTTATATAGACTAGTTAGAAACCAATTTCAAATTTCAGGGACCAAATTAGAAATCAACTTCAAACTTTAGGGACCAATAAGAAATCAAATACAAGCCTTAAGGACCAAACATATCACTTTTGAAATCTCAAGGACCAAATGGAAACTAAATTCAAATCTAAGGACCAGAAGTGTGAGTTTTGAAATCTTTGGGATTGAATAGAAATTAAATTCAAATCTAAGGTATCAAAATTGTACTTTATCCAAATTATTATTATTATTTTACTAACTTGGATAACATTAACTTTAAACTAACAATAATGACTACTCTTAAGATTTATTGAAAGTATAGGGACTACATTTGGACATTTGAAAGAATAGAGACTAAAATGAAATGAACTCCAAAGTATAGGGACCAAAATAATATTTCAACCTTTCATATCATTAGAGTTTACCTCAAAATCCATCTACTATTTAGTTTATATCTATTTTGGTCTTTGAACTTTTAAATTTAGTTCATTTTAGTCCTTAACTTTTGTTTTTATTCCTGAACTTTTAGAAAGTAACCATTTTGATAATGGTCGTTATTTTGGCACCAATTATTTATTCGATGGATGATGGGGCTCTAAATATGTGTATCAACATCCTGATGCGAGTACACATTGAAGCCATATTAGGCTAGTTGAGGTGAGTTATGGATAGGGAACACTCAATACATGTCCAAATCAACTCTCAATTCTCATCTTGGCTGACCTATATGGCTTCAATATAGGGCTCCTGTCAGCATGTTGATATAAATTCAGAATCACATTATCCATTTTTTTAAATAATTGACAACAAAATAACAAGAACGAGCAAAATGACAACATAGGAACAAAATAACGTCTATGAAATACTTGACTATAGTATAACAATGATGATCAAAAGGATTGTTTGGTTAAAGTTTAGGACACTCTGGATGAGATTTAAATGATTGTTTTGTTAAAGTTCTGAAACCAGAATGAGATTTAAACGACGAAGGTCGAGTAACAAATTACGAGGACGAGAATATGATCATATTTGTGCAAGAAATCATGTTATAAATGTTTTTGTAATTGGGCAGAACTCCAAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGATACTATTGAAACTGAGTATTTGGAACAAAATGAGTTTCAGAATTCTCAAACAAATCCAAATCATGTCTCTGAAGATGCAGTTTCCATATAGTTGTAAATGTCTCTCCTTTTTGTAATCTAGCCAACACTAGGATCATTAGTTTTTGTATCGAACTCGAAAATTGCAATAGTTACAAAATCTGGTAACTCGTGTAACAATATATAACCCTTTTTTTTTGTATTTATAATTTAATCTGTACTTTTGC

mRNA sequence

ATGCAAAATTGCGTGTCCGACACGTGTTCGGAGCGTGTCCGGACGAATCCGTGTCCCACACCAGAATGGAAAATCAAGAGGTTCATTAAAGGCCTCCATGATGAGATCTGCAGTTCCGTAGTCCTAAAAGGGTCCACGACTTTTGCCGATGCGCTCAAAGACGCATTAATCATGGATAAGAATGGGGCGAAGAAGTGCCTGACTAAGAGTGAAACAAATGCAGAGAAGCCAGCCTCAAAAGTCCTACCAGTGCAAGCTCAAGGTGGAAATCAGAGGGCACGCGTCTTTGCCCTAACTAAAGAAGAAGCAAATGATGGGGATGTCGTGGTTACAACGGTGGCTCGGCTGGCGGCTAGATGTCTGTCTGCAGTTAGTGCTTATGATAGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATGATATACTGAATTCTCCCTCGTTGGAAGTGGCTTGTGAAAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAGAACGAGGTGAAAGAGATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTAAACATAATAGATCCTGAGGAACGATTTTCTGCTTTAGCAACGGCCTTCGCCCCAGGTGATGGAAGTGAAGCCAAAGATCCGAATGCTATGTACACAACTCCAAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGATACTATTGAAACTGAGTATTTGGAACAAAATGAGTTTCAGAATTCTCAAACAAATCCAAATCATGTCTCTGAAGATGCAGTTTCCATATAG

Coding sequence (CDS)

ATGCAAAATTGCGTGTCCGACACGTGTTCGGAGCGTGTCCGGACGAATCCGTGTCCCACACCAGAATGGAAAATCAAGAGGTTCATTAAAGGCCTCCATGATGAGATCTGCAGTTCCGTAGTCCTAAAAGGGTCCACGACTTTTGCCGATGCGCTCAAAGACGCATTAATCATGGATAAGAATGGGGCGAAGAAGTGCCTGACTAAGAGTGAAACAAATGCAGAGAAGCCAGCCTCAAAAGTCCTACCAGTGCAAGCTCAAGGTGGAAATCAGAGGGCACGCGTCTTTGCCCTAACTAAAGAAGAAGCAAATGATGGGGATGTCGTGGTTACAACGGTGGCTCGGCTGGCGGCTAGATGTCTGTCTGCAGTTAGTGCTTATGATAGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATGATATACTGAATTCTCCCTCGTTGGAAGTGGCTTGTGAAAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAGAACGAGGTGAAAGAGATAATGTATCAATTATACAAAGCCACAAAAAGCAATCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTAAACATAATAGATCCTGAGGAACGATTTTCTGCTTTAGCAACGGCCTTCGCCCCAGGTGATGGAAGTGAAGCCAAAGATCCGAATGCTATGTACACAACTCCAAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTGAACCAGGAAGATACAGACATCAGAGAAGCAAGGCATATGACTCAGCCTGTTGTTATACAAAGGCTATTCATCCTCAAGGATACTATTGAAACTGAGTATTTGGAACAAAATGAGTTTCAGAATTCTCAAACAAATCCAAATCATGTCTCTGAAGATGCAGTTTCCATATAG

Protein sequence

MQNCVSDTCSERVRTNPCPTPEWKIKRFIKGLHDEICSSVVLKGSTTFADALKDALIMDKNGAKKCLTKSETNAEKPASKVLPVQAQGGNQRARVFALTKEEANDGDVVVTTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVVIQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI
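
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: it assumes the standard nuclear code applies to this gene, and `translate` is a hypothetical helper rather than the pipeline that produced this record. It translates the first six and last four codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0.

    Unknown codons become 'X'; a trailing partial codon is ignored.
    With to_stop=True, translation ends at the first stop codon.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCAAAATTGCGTGTCC"))  # first six codons of the CDS -> MQNCVS
print(translate("GTTTCCATATAG"))        # last four codons, ending in TAG -> VSI
```

The two outputs match the start (MQNCVS) and end (VSI) of the protein sequence listed above.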
Homology
BLAST of Spg008116 vs. NCBI nr
Match: XP_023544083.1 (uncharacterized protein At4g37920 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 393.7 bits (1010), Expect = 1.6e-105
Identity = 204/218 (93.58%), Positives = 211/218 (96.79%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSS
Sbjct: 216 VARLAARCLSAVSAYDRTLEHVETLDSAQVKFDDILNSPSLDVACEKIASLAKAKELDSS 275

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWASAKESTTMKNEVKEIMY LYKATKS LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 276 LILLINSAWASAKESTTMKNEVKEIMYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEER 335

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAFAPGDGSEAKDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+V
Sbjct: 336 FSALATAFAPGDGSEAKDPNAIYTTPKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIV 395

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI 331
           IQRLFILKDTIETEYLEQNE QN+QT PNHVS +AVSI
Sbjct: 396 IQRLFILKDTIETEYLEQNELQNAQTKPNHVSANAVSI 433
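
The percentages in the HSP header follow directly from the counts over the aligned length. The sketch below recomputes them and also counts identical positions in the last alignment segment above. It is an illustration only: `percent` and `count_identities` are hypothetical helpers, and the similarity ("positives") count is taken from the report rather than recomputed, since that would need the scoring matrix used by the search.

```python
def percent(matches, aligned_length, ndigits=2):
    """Return matches as a percentage of the aligned length."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, ndigits)


def count_identities(query, subject):
    """Count columns where two equal-length aligned strings agree (gaps excluded)."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Counts reported for the first hit (XP_023544083.1).
print(percent(204, 218))  # identity  -> 93.58
print(percent(211, 218))  # positives -> 96.79

# Last segment of the same alignment, copied from the report above.
query_seg = "IQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI"
sbjct_seg = "IQRLFILKDTIETEYLEQNELQNAQTKPNHVSANAVSI"
print(count_identities(query_seg, sbjct_seg), "of", len(query_seg))
```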

BLAST of Spg008116 vs. NCBI nr
Match: XP_038883874.1 (uncharacterized protein At4g37920 isoform X1 [Benincasa hispida])

HSP 1 Score: 393.3 bits (1009), Expect = 2.1e-105
Identity = 203/218 (93.12%), Positives = 211/218 (96.79%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSS
Sbjct: 226 VARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILTSPSLDVACEKIASLAKAKELDSS 285

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 286 LILLINSAWAAAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEER 345

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVV
Sbjct: 346 FSALATAFAPGDGSEQKDPKALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPVV 405

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI 331
           IQRLFILKDTIETEYLEQNEFQN Q+ PNHVSEDAVSI
Sbjct: 406 IQRLFILKDTIETEYLEQNEFQNPQSTPNHVSEDAVSI 443

BLAST of Spg008116 vs. NCBI nr
Match: XP_038883875.1 (uncharacterized protein At4g37920 isoform X2 [Benincasa hispida])

HSP 1 Score: 393.3 bits (1009), Expect = 2.1e-105
Identity = 203/218 (93.12%), Positives = 211/218 (96.79%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSS
Sbjct: 215 VARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILTSPSLDVACEKIASLAKAKELDSS 274

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 275 LILLINSAWAAAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEER 334

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVV
Sbjct: 335 FSALATAFAPGDGSEQKDPKALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPVV 394

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI 331
           IQRLFILKDTIETEYLEQNEFQN Q+ PNHVSEDAVSI
Sbjct: 395 IQRLFILKDTIETEYLEQNEFQNPQSTPNHVSEDAVSI 432

BLAST of Spg008116 vs. NCBI nr
Match: XP_038883876.1 (uncharacterized protein At4g37920 isoform X3 [Benincasa hispida])

HSP 1 Score: 393.3 bits (1009), Expect = 2.1e-105
Identity = 203/218 (93.12%), Positives = 211/218 (96.79%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCL+AVSAYDRTLENVETLDSAQ KFDDIL SPSL+VACEKIASLAKAKELDSS
Sbjct: 133 VARLAARCLAAVSAYDRTLENVETLDSAQAKFDDILTSPSLDVACEKIASLAKAKELDSS 192

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWA+AKESTTMKNEVKEIMY LYKATKS+LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 193 LILLINSAWAAAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEER 252

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAFAPGDGSE KDP A+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQPVV
Sbjct: 253 FSALATAFAPGDGSEQKDPKALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPVV 312

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI 331
           IQRLFILKDTIETEYLEQNEFQN Q+ PNHVSEDAVSI
Sbjct: 313 IQRLFILKDTIETEYLEQNEFQNPQSTPNHVSEDAVSI 350

BLAST of Spg008116 vs. NCBI nr
Match: KAG6603165.1 (hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 389.4 bits (999), Expect = 3.0e-104
Identity = 202/218 (92.66%), Positives = 210/218 (96.33%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSS
Sbjct: 215 VARLAARCLSAVSAYDRTLEHVETLDSAQVKFDDILNSPSLDVACEKIASLAKAKELDSS 274

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWASAKESTTMKNEVKEIMY LYKATKS LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 275 LILLINSAWASAKESTTMKNEVKEIMYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEER 334

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+V
Sbjct: 335 FSALATAFAPGDGSEPKDPNAIYTTPKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIV 394

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI 331
           IQRLFILKDTIETEYLEQNE QN+Q+ PNHVS +AVSI
Sbjct: 395 IQRLFILKDTIETEYLEQNESQNAQSKPNHVSANAVSI 432

BLAST of Spg008116 vs. ExPASy Swiss-Prot
Match: Q84WN0 (Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 PE=2 SV=2)

HSP 1 Score: 313.9 bits (803), Expect = 2.1e-84
Identity = 166/225 (73.78%), Positives = 190/225 (84.44%), Query Frame = 0

Query: 102 EANDGDVVVTTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIA 161
           E  DG      VARLA RCLSAVSAYD TLE+VETLD+AQ KF+DILNSPS++ ACEKI 
Sbjct: 196 ETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDSACEKIR 255

Query: 162 SLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLK 221
           SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMY LYKATKS+LRS+ PKEIKLLK
Sbjct: 256 SLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKSSLRSITPKEIKLLK 315

Query: 222 HLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDI 281
           +LLNI DPEERFSALATAF+PGD  EAKDP A+YTTPKELHKWIKIMLD+YHLN+E+TDI
Sbjct: 316 YLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDI 375

Query: 282 REARHMTQPVVIQRLFILKDTIETEYLEQNEFQNSQTNPNHVSED 327
           +EA+ M+QP+VIQRLFILKDTIE EYL++      +T P    ED
Sbjct: 376 KEAKQMSQPIVIQRLFILKDTIEDEYLDKKTIVADET-PKKEEED 419

BLAST of Spg008116 vs. ExPASy TrEMBL
Match: A0A6J1F3Z5 (uncharacterized protein At4g37920 OS=Cucurbita moschata OX=3662 GN=LOC111439867 PE=4 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 1.5e-104
Identity = 202/218 (92.66%), Positives = 210/218 (96.33%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSPSL+VACEKIASLAKAKELDSS
Sbjct: 215 VARLAARCLSAVSAYDRTLEHVETLDSAQVKFDDILNSPSLDVACEKIASLAKAKELDSS 274

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWASAKESTTMKNEVKEIMY LYKATKS LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 275 LILLINSAWASAKESTTMKNEVKEIMYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEER 334

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+V
Sbjct: 335 FSALATAFAPGDGSEPKDPNAIYTTPKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIV 394

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI 331
           IQRLFILKDTIETEYLEQNE QN+Q+ PNHVS +AVSI
Sbjct: 395 IQRLFILKDTIETEYLEQNESQNAQSKPNHVSTNAVSI 432

BLAST of Spg008116 vs. ExPASy TrEMBL
Match: A0A6J1HRT8 (uncharacterized protein At4g37920 OS=Cucurbita maxima OX=3661 GN=LOC111467208 PE=4 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 2.5e-104
Identity = 201/218 (92.20%), Positives = 210/218 (96.33%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCLSAVSAYDRTLE+VETLDSAQVKFDDILNSP+L+VACEKIASLAKAKELDSS
Sbjct: 215 VARLAARCLSAVSAYDRTLEHVETLDSAQVKFDDILNSPTLDVACEKIASLAKAKELDSS 274

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWASAKESTTMKNEVKEIMY+LYKATKS LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 275 LILLINSAWASAKESTTMKNEVKEIMYRLYKATKSGLRSMAPKEIKLLKHLLNIVDPEER 334

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAFAPGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR M QP+V
Sbjct: 335 FSALATAFAPGDGSEPKDPNAIYTTPKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIV 394

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNPNHVSEDAVSI 331
           IQRLFILKDTIETEYLEQNE QN Q+ PNHVS +AVSI
Sbjct: 395 IQRLFILKDTIETEYLEQNELQNPQSKPNHVSANAVSI 432

BLAST of Spg008116 vs. ExPASy TrEMBL
Match: A0A1S3B4W5 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486045 PE=4 SV=1)

HSP 1 Score: 386.3 bits (991), Expect = 1.2e-103
Identity = 200/220 (90.91%), Positives = 212/220 (96.36%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCL+AVSAYDRTLENVETLDSAQ KFD+ILNSPSL+VACEKIASLAKAKELDSS
Sbjct: 216 VARLAARCLAAVSAYDRTLENVETLDSAQAKFDNILNSPSLDVACEKIASLAKAKELDSS 275

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWASAKESTTMKNEVKEIMY LYKATKS+LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 276 LILLINSAWASAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEER 335

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAF+PGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQP+V
Sbjct: 336 FSALATAFSPGDGSEQKDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIV 395

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNP--NHVSEDAVSI 331
           IQRLFILKDTIETEYLEQN+FQN Q+ P  NH SEDA+SI
Sbjct: 396 IQRLFILKDTIETEYLEQNQFQNPQSRPNHNHGSEDAISI 435

BLAST of Spg008116 vs. ExPASy TrEMBL
Match: A0A0A0L3X1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G515540 PE=4 SV=1)

HSP 1 Score: 384.4 bits (986), Expect = 4.7e-103
Identity = 199/220 (90.45%), Positives = 212/220 (96.36%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCL+AVSAY+RTLENVETLDSAQVKFD+ILNSPSL+VACEKIASLAKAKELDSS
Sbjct: 216 VARLAARCLAAVSAYNRTLENVETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSS 275

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWASAKESTTMKNEVKEIMY LYKATKS+LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 276 LILLINSAWASAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEER 335

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALAT F+PGDGSE KDPNA+YTTPKELHKWIKIMLDSYHLNQEDTDIREAR+MTQP+V
Sbjct: 336 FSALATTFSPGDGSEQKDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIV 395

Query: 293 IQRLFILKDTIETEYLEQNEFQNSQTNP--NHVSEDAVSI 331
           IQRLFILKDTIETEYLEQN+FQN Q+ P  NH SEDA+SI
Sbjct: 396 IQRLFILKDTIETEYLEQNQFQNPQSRPSHNHGSEDAISI 435

BLAST of Spg008116 vs. ExPASy TrEMBL
Match: A0A6J1DBT6 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018874 PE=4 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 9.5e-96
Identity = 185/201 (92.04%), Positives = 195/201 (97.01%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           VARLAARCLSAVSAYDRTLE V+TLD AQ KFDDILNSPSL+VACEKI SLAKAKELDSS
Sbjct: 213 VARLAARCLSAVSAYDRTLEYVDTLDCAQAKFDDILNSPSLDVACEKIESLAKAKELDSS 272

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           LILLINSAWASAKESTTMKNEVKEIMY+LY+ATKS+LRSMAPKEIKLLKHLLNI+DPEER
Sbjct: 273 LILLINSAWASAKESTTMKNEVKEIMYRLYRATKSSLRSMAPKEIKLLKHLLNIVDPEER 332

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
           FSALATAFAPGDGSEA+DPNAMYTTPKELHKWIKIMLDSYHLNQEDT++REAR+M QPVV
Sbjct: 333 FSALATAFAPGDGSEARDPNAMYTTPKELHKWIKIMLDSYHLNQEDTEMREARNMNQPVV 392

Query: 293 IQRLFILKDTIETEYLEQNEF 314
           IQRLFILKDTIETEYLEQ EF
Sbjct: 393 IQRLFILKDTIETEYLEQTEF 413

BLAST of Spg008116 vs. TAIR 10
Match: AT4G37920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane, chloroplast, chloroplast envelope; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36320.1); Has 123 Blast hits to 120 proteins in 40 species: Archae - 2; Bacteria - 11; Metazoa - 8; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 313.9 bits (803), Expect = 1.5e-85
Identity = 166/225 (73.78%), Positives = 190/225 (84.44%), Query Frame = 0

Query: 102 EANDGDVVVTTVARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIA 161
           E  DG      VARLA RCLSAVSAYD TLE+VETLD+AQ KF+DILNSPS++ ACEKI 
Sbjct: 196 ETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDSACEKIR 255

Query: 162 SLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLK 221
           SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMY LYKATKS+LRS+ PKEIKLLK
Sbjct: 256 SLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKSSLRSITPKEIKLLK 315

Query: 222 HLLNIIDPEERFSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDI 281
           +LLNI DPEERFSALATAF+PGD  EAKDP A+YTTPKELHKWIKIMLD+YHLN+E+TDI
Sbjct: 316 YLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLNKEETDI 375

Query: 282 REARHMTQPVVIQRLFILKDTIETEYLEQNEFQNSQTNPNHVSED 327
           +EA+ M+QP+VIQRLFILKDTIE EYL++      +T P    ED
Sbjct: 376 KEAKQMSQPIVIQRLFILKDTIEDEYLDKKTIVADET-PKKEEED 419

BLAST of Spg008116 vs. TAIR 10
Match: AT1G36320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37920.1); Has 93 Blast hits to 90 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 182.2 bits (461), Expect = 6.9e-46
Identity = 87/196 (44.39%), Positives = 139/196 (70.92%), Query Frame = 0

Query: 113 VARLAARCLSAVSAYDRTLENVETLDSAQVKFDDILNSPSLEVACEKIASLAKAKELDSS 172
           +A L    ++AV AYD + E+++ L++A++K  DI+NSPSL+ AC KI SLA+  +LDS+
Sbjct: 219 LASLGKLSIAAVQAYDTSTESIDALNAAEMKLQDIINSPSLDAACRKIDSLAEKNQLDSA 278

Query: 173 LILLINSAWASAKESTTMKNEVKEIMYQLYKATKSNLRSMAPKEIKLLKHLLNIIDPEER 232
           L+L+I  AW++AKES  MK EVK+I+Y LY   + NL+ + PKE+++LK+LL+I DP+E+
Sbjct: 279 LVLMITKAWSAAKESNMMKEEVKDILYHLYVTARGNLQRLMPKEVRILKYLLSIEDPQEQ 338

Query: 233 FSALATAFAPGDGSEAKDPNAMYTTPKELHKWIKIMLDSYHLNQEDTDIREARHMTQPVV 292
            SAL  AF PGD  E  D + +YTTP+ L   +K +L++YH ++E + ++EA+ +  P +
Sbjct: 339 ISALQDAFTPGDELEGTDVDYLYTTPEHLQSLMKTVLEAYHFSREGSLVKEAKDLMHPEL 398

Query: 293 IQRLFILKDTIETEYL 309
           I ++  LK  +E +Y+
Sbjct: 399 IAKIEQLKKLVEKKYM 414

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_023544083.1  1.6e-105  93.58         uncharacterized protein At4g37920 [Cucurbita pepo subsp. pepo]
XP_038883874.1  2.1e-105  93.12         uncharacterized protein At4g37920 isoform X1 [Benincasa hispida]
XP_038883875.1  2.1e-105  93.12         uncharacterized protein At4g37920 isoform X2 [Benincasa hispida]
XP_038883876.1  2.1e-105  93.12         uncharacterized protein At4g37920 isoform X3 [Benincasa hispida]
KAG6603165.1    3.0e-104  92.66         hypothetical protein SDJN03_03774, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q84WN0          2.1e-84   73.78         Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 P...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1F3Z5      1.5e-104  92.66         uncharacterized protein At4g37920 OS=Cucurbita moschata OX=3662 GN=LOC111439867 ...
A0A6J1HRT8      2.5e-104  92.20         uncharacterized protein At4g37920 OS=Cucurbita maxima OX=3661 GN=LOC111467208 PE...
A0A1S3B4W5      1.2e-103  90.91         uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3...
A0A0A0L3X1      4.7e-103  90.45         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G515540 PE=4 SV=1
A0A6J1DBT6      9.5e-96   92.04         uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Momordica charant...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT4G37920.1     1.5e-85   73.78         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G36320.1     6.9e-46   44.39         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                         Source   Source Term    Source Description               Alignment
None       No IPR available                        PANTHER  PTHR31755:SF2  ENDORIBONUCLEASE E-LIKE PROTEIN  coord: 112..322
IPR040320  Uncharacterized protein At4g37920-like  PANTHER  PTHR31755      FOLATE RECEPTOR-LIKE             coord: 112..322

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Spg008116.1   Spg008116.1  mRNA