CSPI01G25250 (gene) Wild cucumber (PI 183967)

Name: CSPI01G25250
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr1 : 20709717 .. 20714731 (-)
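The span of the feature can be derived from the Location field. The sketch below assumes the coordinates are 1-based and inclusive, which is common for records of this kind but is not stated on the page; the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(loc: str):
    """Parse a string like 'Chr1 : 20709717 .. 20714731 (-)'.

    Returns (chromosome, start, end, strand). Hypothetical helper,
    written only for the location format shown in this record.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr1 : 20709717 .. 20714731 (-)")
span = end - start + 1  # assumption: 1-based, inclusive coordinates
print(chrom, strand, span)  # Chr1 - 5015
```

Under that assumption the gene spans 5015 bp on the minus strand of Chr1.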
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGAAAAAATGGGGTGATAAGAAGCGACATTCTCTTGAGTTTCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAAACAGATCAAATTCGGTTTAGAGGGCGCAAAGATCAGCACCTTGTCAGAAAATATGAGGGGTTTGGGGAAGTCCTCAAGAAGATAGGAAATGCATAGTATAGGGTGTTGTTGCCTACATGGATGAAAATTCACCCAGTAATTCATGTGAGCAACTTAAAACCCTACCATCAAGACCCCGACGACAAGCAGCACGACATATTGCTCGACCATGTATCAACCTGAAGCAGAAAGAAGATAAAGAAGTTGAAGAGATCCTTGCAGATCGAGTAAGGAAGGGTAGAGGCCCACAAGGAGAATCCACGAAATCCTGGTTAGATGGAAAAACCTCCCCGTGGAAGAAACGGGTTGGGAACGTGTTGAAGACATTAGAGCGTGGAAGCAGAAGATCGAAGAGTTCTAGCTTCGTCAGTCGATAGGGACGTCAGCTGTTTAAGTGGGGGAGAATGTTATGAGCATGCTTGTCCAAGGCGTTGTTTGTCTATGACCACATGCACAAACATCCACTCCCCACCTTATGTTTATACTTTTGTACTTAGTTTAGCTTATTTTCTTTAATTTTTGTCGCTTACTTGTAAGTGACGTTTCAAACTTTTCATATGTAAGCCTTTCGTATTTGAAAATGTTAAAAATGTACTATAAGGGAAACTCACTAGTCAATTCCCCGACTAAGCCTTGTTACTTTGGAAACTGAACTTTCTTACAATTGTCAGTCTTCAATGCTTTCTTTAATAATGTTTTCAATGCTTTTCTTTCTCTCACGTTTTATAAAACCATTTTCTCTAAAAACATGAATGGAGGCTACATCATATTGCAATGTTGCCGTGACTTTTGGATTTAGTACGGCTAACTTATTCATTGAGATAGCACGTTTTATAAAATATCTTCCTCCCAAAATTCTTTTTGTCTTCTGCTAAAACCCACAACTCATCACCGTCACGACTCCGACTCCAACTGTCAACACTTCGCAGAACCTGCCGGAAACCCAAGAAAACTACACTTTTCAATCAAAACCAAGGATATCTGAATCCTCACTTCCTTCATCACAATGATAATACAAGTCTGGTGTTAGCAACGGAGCAATTGACAGAAGAGAATTATGTTTCTTGGACTCAAGCGATGACCATTGGTCTTTCAGTGAAGAACAAGATTGGGTTCGTCGACGAGACTATTACCAAACGAACCGGTGATCTCCTTCCGGCTTGGATTAGAAATAACATCGTTATTTCTTGGATCCTAAACTCAGTCTCTAAACCTGTCTCAGCAAGTATTTTGTTTTCAGGTTCAACCAGAGCAATATGGATTGACCTCAAAGAAAGATTTCAGAAGAAGAACGTGCCAAGGATTTTTCAGTTGAAGCGATCCCTTGCAACCTTGGAACAAAACCAAGATACCATTGGTGTGTATTATACTAAATTCAATACTCTCATTGATGAACTGAACACATATAGACCAGGGTGCACTTGCGGAGCTTGCAGTTGTGACATCGTGCGAGAAATGACAGATTTCCTCCAAATGTAATATCTCATGGACTTATGGGATTGAATGAGAACATCTCTCAAGCCCGGGCTCAACTTCTCCTCATGGATCCTCTTCCATCAACAAGCCGAGCTTTCTCTCTTCTTCTTCAAGAGGAACAACAAAGATCAATTGGATCTTTTTCTTCTACAGCACCAACGATGGCCTTTGCAGTATCTTCTAACTCATCCAAGAATGAATCCACCAATCGACAAAGGAGAGAAAAGACTATATGCACCCACTGCAACATTTCTGAACACATGATAGATTAGTGTTACAAGCTCCATGGATACCCTCCTGGATATAAGACCAAGCAACAACAGCAGTGAACCAATAATGTTGTTAATGCAGTAGCAACTCAGAACAATGAAAGTCGTTCTCAAGGTACCACACAGAGTAATCAGATATTG
AATAATACCAGCACTGCAGAAGCTTTGATCCAGTGTCAAAACCTCCTCAACCAGCTTCAGTCTCAAATGAATGCTTCCAACCAACCAACTACCTCACATAAAGCAGGTACTTCTTATTCATTTCCCCTGTGGATAATTGATCCTGGAGCATCCACTCACATTTCTTGTTGCAAGTCCCATTTTGCATCCATTCAACCATGCTCAACATCCATCCGTTTACCTAATAAACAAGTTTTTGAAGGTAAAAGTGCTAGCACTATCAAATTATCTGAATCTATAGAGCTGAAGAATGTGTTATATATTCCTGAATTTTCATTCAACCTCAGTGAGCGCATTAACAAGAGATCTACTCGTCGATGTTAGTTTCTCTACTAATGGTTGTGTAACTCAGGACAAGTTCACTTTGAAGAAGATTGGCAATGCTGAACTTTTATATGGTCTATATGTCTTCAAATTGGGAAACACTCTTGATCTGCAGTCTACCATATGTGCTGTAAGCCATGATAATACTTTTTTGTGGCATCAAAGGCTTGGTCACCCTTCTGTTGATGTTTTGAAATCTTTGCAAGACGTGTTGCAATTGAAGTCTTTTAATTTTCATTCTTGTACTACTTGTCCCTTAGCAAAGCAAAGAAGACTTTCTTTTTCTTCAAATAATCGAGTGTCTCCAAATCCATGTACATACATGTATAAACTTTCCTCATTGTTCATAAATAAAGTGAGTGAAATATTTCTTCTATTCTCTGTTAATATCTGGCTTCTCATCCTTTCTTAATTTTTCTTGCTGTATACAGTTGCAGAATTGGTCTGGATGTCCACCTAATATGCATGATATAAGTGGAAATGCACACTTCATAAATAAATGTGATAATGGAATTGTCATTCATCGTAATAGGGATCCTGAAAGTGGTCCTATTGATCTCGTACAGGTACAATTGCAGATATATTATAAAAACAATATGTTTCTTTCTTTTCTGTGTACATTAGTTTTCTTTGTATCCTTGCTACTTGTAGGTATGTGTACGAAAAGTGAGAAATAAGGTTGCAGGAACAATTGGGGAAGCTTATTTGGCATATAATAGGTATGTGAAGATCGTTTTCCAAAGTTTTTCAGGAATTCACATCTTGGTTCTGACTTCTTGTGGTGATGAATTGTCCTATATGCTTTCTTGAAATTAAGGTTGTTGAACAATGAACATTATATATTCCTTAATGATAAGAGTTTTATGAAAAAACATTAGGTAGCAACATTTTCCCAGTGTTTTGAACAATCTTTGAAAGCAAACCGTTTCTTTTATTTTGGAGAACTACTCTGGTTATCCGTTATGGTTTTGCACTGTGTACTTTGAAGTTTGGCCAATTTCTTGTTGAAGGATTTGTGGATGAGATTCATTGCTTAAGCCTTGGTGTTTAAAATATTCTCTTGAAGTTTTGGTCAATCATCATTCTAGTTAACAAGGACTTCATACCGATACACATGGCACACTGAAATCAACAAACATAAAATCATTCAGAAAAAGTACCTCGCACTAGCCCTCTCTCCTTGTTGGTGCATTATGTGCAAGATGTCAAGAAATGGGTAGAGTTTTTCCTCACGTTCTCGTTTGACTGTATTGGCCCACAATCTGAAAAGGACAATCCTTTAATTTAATTGGAGCGTAGCTCCCCTAGCCATTCTTTCAAAAAAGTTCGGGAGCATCCTTGGCTTCACATCAATTAAGGATTCTTTTGGTTACCATGGATCAAAATTGTTTGATGAATTAAGGAACAACCCACCCTTCAGAAATTGTTTGATGAGATATGTTGCACATACTAAAACAGTTTCTTAACCAAAATCTGAGGGGTCTTTGCCTTGGGAAATTTTGCCAATAAATGCGTAGTCTGTCGGTTACAGTGATTCTTTGTGGACAGTGGCCTTAAAGCGTTGCTGCGAATAGTAATCTGTTAGTCCTTAAGGGTTGGATAGTCGAGTTTACAACTTACACCAGTCAAATCTCTTG
GATTTCTATCTTCAAAACTAGTCTTCCTTCTAAGATTGATTGTTGTGTTTTAAAGTTAGACTTTGAGACGACTCGTGGACAAAAGCCACCCCTTATGCCATCAACAAAACATGTGATTTCTTATAGAACTCTATAATTCATGTAGTCACTAGTCAGAATGCTAAGCACTCATGAGAGTGATTAAGCAACAGGGAATAACCCTCTAACTTTTAGGAATGCAACTGGATCCCCCTTTTTTATTGAATTATTGAGTTCGGTTGCAACTTTAGCCAATCATTTGCTGCAAGGGTTGAATGACTGAATGCTGATCCCTCGATAGTCAAAATAACTGACAATGATGCCATCTGGAGGCTTTCAGCTACCTAATAGGTTCTTGTAATACGTATCTGTGTGGTTTCTGTTTGCGGAAACTCAACCTTGATGGAGTTCAAAGAACAAAACATAAATTCTAACTCTGCCTAGATGTTTGGTTTGTGAGAAAGGGGGAAAACACAGACCACTTTTGCTTTCATTATCAGCCCAATGATTGCAAGGTACATGATTGACAATCATTACCTGTGGGCCTTTTGTGCTATTCTAGTGTTTTCTGGAGTGAGTGAGTCATCGAGTGGCATGCAAATTGCAATACGCTCCTTCTATTCTCTGTTTATCAGCATGTCACTGTATCAATGTACTTTTATTTGAAACCCTTTGTTCTCTGTATATCCGCTATTAAAAGGTCAATAATGGCTGTTATGCTCATCAGGGTAACCGGAGAATTCTTCGATGCTGCTGGGGATATGAAACTTAAGAAACCATCATCTTGAGAGGAGGTATGGTGTTGAAGGCCTTCACAAGTATTGTTTAAGGTCATTCATCCACTGCAATACTGTGAATTGTCGATAAATGTCGACTGCTAATTCATTCGTACAATTCACATTTATTTAGCAATGTCATTTTTGTTTTGGGTGATCCATAGCAAGAAGAAATTATTAAGTTGATGGATGTGTATAATTATTAAGTTGATGGATGTGTA

mRNA sequence

ATGAAAAAATGGGGTGATAAGAAGCGACATTCTCTTGAGTTTCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAAACAGATCAAATTCGGTTTAGAGGGCGCAAAGATCAGCACCTTGTCAGAAAATATGAGGGGTTTGGGGAAGGTGTTGTTGCCTACATGGATGAAAATTCACCCAGTAATTCATGTGAGCAACTTAAAACCCTACCATCAAGACCCCGACGACAAGCAGCACGACATATTGCTCGACCATGTATCAACCTGAAGCAGAAAGAAGATAAAGAAGTTGAAGAGATCCTTGCAGATCGAAGCGTGGAAGCAGAAGATCGAAGAGTTCTAGCTTCGTCAGTCGATAGGGACGTCAGCTGTTTAAGTGGGGGAGAATGTTATGAGCATGCTTGTCCAAGGCGTTGTTTAACCTGCCGGAAACCCAAGAAAACTACACTTTTCAATCAAAACCAAGGATATCTGAATCCTCACTTCCTTCATCACAATGATAATACAAGTCTGGTGTTAGCAACGGAGCAATTGACAGAAGAGAATTATGTTTCTTGGACTCAAGCGATGACCATTGGTCTTTCAGTGAAGAACAAGATTGGGTTCGTCGACGAGACTATTACCAAACGAACCGGTGATCTCCTTCCGGCTTGGATTAGAAATAACATCGTTATTTCTTGGATCCTAAACTCAGTCTCTAAACCTGTCTCAGCAAGTATTTTGTTTTCAGGTTCAACCAGAGCAATATGGATTGACCTCAAAGAAAGATTTCAGAAGAAGAACGTGCCAAGGATTTTTCAGTTGAAGCGATCCCTTGCAACCTTGGAACAAAACCAAGATACCATTGGTGTAAATGACAGATTTCCTCCAAATGTAATATCTCATGGACTTATGGGATTGAATGAGAACATCTCTCAAGCCCGGGCTCAACTTCTCCTCATGGATCCTCTTCCATCAACAAGCCGAGCTTTCTCTCTTCTTCTTCAAGAGGAACAACAAAGATCAATTGGATCTTTTTCTTCTACAGCACCAACGATGGCCTTTGCAATTAGTGTTACAAGCTCCATGGATACCCTCCTGGATATAAGACCAAGCAACAACAGCATAGCAACTCAGAACAATGAAAGTCGTTCTCAAGGTACCACACAGAGTAATCAGATATTGAATAATACCAGCACTGCAGAAGCTTTGATCCAGTGTCAAAACCTCCTCAACCAGCTTCAGTCTCAAATGAATGCTTCCAACCAACCAACTACCTCACATAAAGCAGGTACTTCTTATTCATTTCCCCTGTGGATAATTGATCCTGGAGCATCCACTCACATTTCTTGTTGCAAGTCCCATTTTGCATCCATTCAACCATGCTCAACATCCATCCGTTTACCTAATAAACAAGTTTTTGAAGTGAGCGCATTAACAAGAGATCTACTCGTCGATGTTAGTTTCTCTACTAATGGTTGTGTAACTCAGGACAAGTTCACTTTGAAGAAGATTGGCAATGCTGAACTTTTATATGGTCTATATGTCTTCAAATTGGGAAACACTCTTGATCTGCAGTCTACCATATGTGCTTTGCAGAATTGGTCTGGATGTCCACCTAATATGCATGATATAAGTGGAAATGCACACTTCATAAATAAATGTGATAATGGAATTGTCATTCATCGTAATAGGGATCCTGAAAGTGGTCCTATTGATCTCGTACAGGTATGTGTACGAAAAGTGAGAAATAAGGTTGCAGGAACAATTGGGGAAGCTTATTTGGCATATAATAGGGTAACCGGAGAATTCTTCGATGCTGCTGGGGATATGAAACTTAAGAAACCATCATCTTGA

Coding sequence (CDS)

ATGAAAAAATGGGGTGATAAGAAGCGACATTCTCTTGAGTTTCGAGCAGGAGATCAAGTCCTCATCAAGCTGAAAACAGATCAAATTCGGTTTAGAGGGCGCAAAGATCAGCACCTTGTCAGAAAATATGAGGGGTTTGGGGAAGGTGTTGTTGCCTACATGGATGAAAATTCACCCAGTAATTCATGTGAGCAACTTAAAACCCTACCATCAAGACCCCGACGACAAGCAGCACGACATATTGCTCGACCATGTATCAACCTGAAGCAGAAAGAAGATAAAGAAGTTGAAGAGATCCTTGCAGATCGAAGCGTGGAAGCAGAAGATCGAAGAGTTCTAGCTTCGTCAGTCGATAGGGACGTCAGCTGTTTAAGTGGGGGAGAATGTTATGAGCATGCTTGTCCAAGGCGTTGTTTAACCTGCCGGAAACCCAAGAAAACTACACTTTTCAATCAAAACCAAGGATATCTGAATCCTCACTTCCTTCATCACAATGATAATACAAGTCTGGTGTTAGCAACGGAGCAATTGACAGAAGAGAATTATGTTTCTTGGACTCAAGCGATGACCATTGGTCTTTCAGTGAAGAACAAGATTGGGTTCGTCGACGAGACTATTACCAAACGAACCGGTGATCTCCTTCCGGCTTGGATTAGAAATAACATCGTTATTTCTTGGATCCTAAACTCAGTCTCTAAACCTGTCTCAGCAAGTATTTTGTTTTCAGGTTCAACCAGAGCAATATGGATTGACCTCAAAGAAAGATTTCAGAAGAAGAACGTGCCAAGGATTTTTCAGTTGAAGCGATCCCTTGCAACCTTGGAACAAAACCAAGATACCATTGGTGTAAATGACAGATTTCCTCCAAATGTAATATCTCATGGACTTATGGGATTGAATGAGAACATCTCTCAAGCCCGGGCTCAACTTCTCCTCATGGATCCTCTTCCATCAACAAGCCGAGCTTTCTCTCTTCTTCTTCAAGAGGAACAACAAAGATCAATTGGATCTTTTTCTTCTACAGCACCAACGATGGCCTTTGCAATTAGTGTTACAAGCTCCATGGATACCCTCCTGGATATAAGACCAAGCAACAACAGCATAGCAACTCAGAACAATGAAAGTCGTTCTCAAGGTACCACACAGAGTAATCAGATATTGAATAATACCAGCACTGCAGAAGCTTTGATCCAGTGTCAAAACCTCCTCAACCAGCTTCAGTCTCAAATGAATGCTTCCAACCAACCAACTACCTCACATAAAGCAGGTACTTCTTATTCATTTCCCCTGTGGATAATTGATCCTGGAGCATCCACTCACATTTCTTGTTGCAAGTCCCATTTTGCATCCATTCAACCATGCTCAACATCCATCCGTTTACCTAATAAACAAGTTTTTGAAGTGAGCGCATTAACAAGAGATCTACTCGTCGATGTTAGTTTCTCTACTAATGGTTGTGTAACTCAGGACAAGTTCACTTTGAAGAAGATTGGCAATGCTGAACTTTTATATGGTCTATATGTCTTCAAATTGGGAAACACTCTTGATCTGCAGTCTACCATATGTGCTTTGCAGAATTGGTCTGGATGTCCACCTAATATGCATGATATAAGTGGAAATGCACACTTCATAAATAAATGTGATAATGGAATTGTCATTCATCGTAATAGGGATCCTGAAAGTGGTCCTATTGATCTCGTACAGGTATGTGTACGAAAAGTGAGAAATAAGGTTGCAGGAACAATTGGGGAAGCTTATTTGGCATATAATAGGGTAACCGGAGAATTCTTCGATGCTGCTGGGGATATGAAACTTAAGAAACCATCATCTTGA
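The BLAST alignments for this feature appear to number the query in amino-acid positions (Query Frame = 1), so translating the CDS can help when relating hits back to the nucleotide sequence. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard nuclear table applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS shown above.
prefix = "ATGAAAAAATGGGGTGATAAGAAGCGACAT"
print(translate(prefix))  # MKKWGDKKRH
```

Passing the full CDS string to `translate` would give the complete predicted protein, whose positions can then be compared with the Query coordinates in the alignments.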
BLAST of CSPI01G25250 vs. Swiss-Prot
Match: TWIH_ARATH (Twinkle homolog protein, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=At1g30680 PE=1 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 2.1e-26
Identity = 52/76 (68.42%), Positives = 65/76 (85.53%), Query Frame = 1

Query: 524 LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG 583
           LQ+W G  PN++DISG+AHFINKCDNGI++HRNRD  +GP+DLVQ+ VRKVRNKVAG IG
Sbjct: 619 LQHWDGGAPNLYDISGSAHFINKCDNGIIVHRNRDENAGPLDLVQIGVRKVRNKVAGQIG 678

Query: 584 EAYLAYNRVTGEFFDA 600
           +AYL Y+R TG + D+
Sbjct: 679 DAYLCYDRTTGSYSDS 694
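
The percentages reported for each HSP are the number of matching alignment columns divided by the alignment length. A small sketch of that arithmetic, using the counts from the Swiss-Prot hit above; the function name `pct` is hypothetical.

```python
def pct(matches: int, aligned: int) -> str:
    """Format matches/aligned as a percentage with two decimals."""
    return f"{100 * matches / aligned:.2f}%"

# Counts taken from the TWIH_ARATH HSP: 52 identical and 65 positive
# columns over an alignment length of 76.
print(pct(52, 76))  # 68.42%
print(pct(65, 76))  # 85.53%
```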

BLAST of CSPI01G25250 vs. TrEMBL
Match: A0A0A0LVZ4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G541900 PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 9.2e-45
Identity = 87/90 (96.67%), Positives = 88/90 (97.78%), Query Frame = 1

Query: 521 ICALQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAG 580
           +  LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAG
Sbjct: 4   VLKLQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAG 63

Query: 581 TIGEAYLAYNRVTGEFFDAAGDMKLKKPSS 611
           TIGEAYLAYNRVTGEFFDAAGDMKLKKPSS
Sbjct: 64  TIGEAYLAYNRVTGEFFDAAGDMKLKKPSS 93

BLAST of CSPI01G25250 vs. TrEMBL
Match: A0A0A0LYM8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G541890 PE=4 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 2.4e-40
Identity = 81/87 (93.10%), Positives = 84/87 (96.55%), Query Frame = 1

Query: 524 LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG 583
           LQNWSG PPNM+DISG+AHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG
Sbjct: 631 LQNWSGSPPNMYDISGSAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG 690

Query: 584 EAYLAYNRVTGEFFDAAGDMKLKKPSS 611
           EAYL YNRVTGEF DAAGD+KLKKPSS
Sbjct: 691 EAYLEYNRVTGEFLDAAGDVKLKKPSS 717

BLAST of CSPI01G25250 vs. TrEMBL
Match: A0A151U482_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_006768 PE=4 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 6.0e-36
Identity = 95/220 (43.18%), Positives = 121/220 (55.00%), Query Frame = 1

Query: 153 NQGYLNPHFLHHNDNTSLVLATEQLTEENYVSWTQAMTIGLSVKNKIGFVDETITK--RT 212
           +Q   NP FLHH+D   LVL ++ L  +NY +W++AM + L VKNK+ F+D T+ K   T
Sbjct: 9   SQDVSNPLFLHHSDGPGLVLTSQPLDHKNYTTWSRAMQVALFVKNKLAFIDGTLPKPAST 68

Query: 213 GDLLPAWIR-NNIVISWILNSVSKPVSASILFSGSTRAIWIDLKERFQKKNVPRIFQLKR 272
                AW   NN+VISW+ NSVSK +  SILF+ + + IW DLK RF KKN  RIFQL+R
Sbjct: 69  DSTFVAWNHANNVVISWLYNSVSKDIITSILFASTAQEIWHDLKTRFSKKNGSRIFQLRR 128

Query: 273 SLATLEQNQDTI---------------GVNDRF--------------PPNVISHGLMGLN 332
            L +L Q  D I               G    F                  +   LMGLN
Sbjct: 129 QLMSLHQGMDDISTYYTKLKSIWEELSGYKPTFQCTCGGLQQLQSFTESEYVMSFLMGLN 188

Query: 333 ENISQARAQLLLMDPLPSTSRAFSLLLQEEQQRSIGSFSS 341
           ++ISQ R Q+LL DPLPS    FSL+LQ+E QR I   SS
Sbjct: 189 DSISQIRGQILLSDPLPSIGNVFSLVLQDEAQREIAVTSS 228

BLAST of CSPI01G25250 vs. TrEMBL
Match: A0A151RRY1_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_033170 PE=4 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 5.1e-35
Identity = 100/250 (40.00%), Positives = 135/250 (54.00%), Query Frame = 1

Query: 158 NPHFLHHNDNTSLVLATEQLTEENYVSWTQAMTIGLSVKNKIGFVDETITKRTGD--LLP 217
           NP FLHH+D   LVL ++ L  +NY +W+ AM +  SVKNKI FVD ++ K   +    P
Sbjct: 14  NPLFLHHSDGPGLVLTSQPLDNKNYTTWSHAMLVAFSVKNKIPFVDGSLPKLAANHPTYP 73

Query: 218 AWIR-NNIVISWILNSVSKPVSASILFSGSTRAIWIDLKERFQKKNVPRIFQLKRSLATL 277
           AWIR NN+VISW+ NSVSK +  SILF+ + + IW DLK +F +KN P IFQL+R L +L
Sbjct: 74  AWIRGNNLVISWLYNSVSKDIITSILFANTAKEIWDDLKTKFSRKNGPHIFQLRRQLMSL 133

Query: 278 EQNQDTI---------------GVNDRFP--------------PNVISHGLMGLNENISQ 337
           +Q  D +               G    FP                 +   LMGLN++ SQ
Sbjct: 134 QQGIDYVSTYYTKLKSIWEELSGYKPSFPCTCGGLQHLQDYNASEYVMSFLMGLNDSFSQ 193

Query: 338 ARAQLLLMDPLPSTSRAFSLLLQEEQQRSIGSFSSTAPTMAFAISVTSSMDTLLDIRPSN 376
            R Q+LL  PLP     FSL+LQEE Q  IG+  +  P++ F      SM  L  +  SN
Sbjct: 194 IRGQILLSYPLPPIGNVFSLILQEETQIEIGTNITHTPSVNF-----DSMAFL--VNSSN 253

BLAST of CSPI01G25250 vs. TrEMBL
Match: A5BJ98_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_009790 PE=4 SV=1)

HSP 1 Score: 157.1 bits (396), Expect = 6.6e-35
Identity = 90/246 (36.59%), Positives = 143/246 (58.13%), Query Frame = 1

Query: 158 NPHFLHHNDNTSLVLATEQLTEENYVSWTQAMTIGLSVKNKIGFVDETITKRTGDLLP-- 217
           NP+FLHH+D+  +VL ++ L  +NY +W +AMTI L+ K+K+GF+D T T  +    P  
Sbjct: 16  NPYFLHHSDHPGMVLVSKPLNGDNYSTWCRAMTISLNAKSKLGFIDGTTTMSSATDKPDE 75

Query: 218 --AWIR-NNIVISWILNSVSKPVSASILFSGSTRAIWIDLKERFQKKNVPRIFQLKRSLA 277
             +W + N++++SWILNS+S+ ++ S++FS + + +W DL++RF + N PRIFQ++R +A
Sbjct: 76  HASWKKCNDMILSWILNSLSQDLADSVIFSTTAQEVWEDLRDRFSQSNAPRIFQIERDIA 135

Query: 278 TLEQNQDTIGVN-----------DRFPPNVISHG-----------LMGLNENISQARAQL 337
            L Q+Q T+                +   V S G           LMGLNE+ +  R Q+
Sbjct: 136 CLTQDQMTVAAYYTRLKKLWDELGSYNDTVCSCGADHKRRRLMQFLMGLNESYNAIRGQI 195

Query: 338 LLMDPLPSTSRAFSLLLQEEQQRSIGSFSSTAPTMAFAISVTSSMDTLLDIRPSNNSIAT 377
           LLM+PLP  +RA+S ++QEE+QRS+G+   T    A  +     M   L +R    S + 
Sbjct: 196 LLMNPLPDVARAYSSIVQEEKQRSLGATRETTENSAMVVQRAEPM--ALAVRHGQGSSSR 255

BLAST of CSPI01G25250 vs. TAIR10
Match: AT1G30680.1 (AT1G30680.1 toprim domain-containing protein)

HSP 1 Score: 122.1 bits (305), Expect = 1.2e-27
Identity = 52/76 (68.42%), Positives = 65/76 (85.53%), Query Frame = 1

Query: 524 LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG 583
           LQ+W G  PN++DISG+AHFINKCDNGI++HRNRD  +GP+DLVQ+ VRKVRNKVAG IG
Sbjct: 619 LQHWDGGAPNLYDISGSAHFINKCDNGIIVHRNRDENAGPLDLVQIGVRKVRNKVAGQIG 678

Query: 584 EAYLAYNRVTGEFFDA 600
           +AYL Y+R TG + D+
Sbjct: 679 DAYLCYDRTTGSYSDS 694

BLAST of CSPI01G25250 vs. TAIR10
Match: AT1G21280.1 (AT1G21280.1 Retrotransposon gag protein (InterPro:IPR005162))

HSP 1 Score: 67.8 bits (164), Expect = 2.7e-11
Identity = 40/129 (31.01%), Positives = 69/129 (53.49%), Query Frame = 1

Query: 156 YLNPHFLHHNDNTSLVLATEQLTEENYVSWTQAMTIGLSVKNKIGFVDETITKRT--GDL 215
           Y  P  +HH  + S+   ++   E+NYV+W       L V  K GF+D T+ K      L
Sbjct: 18  YYLPPDIHHPSDFSIQKLSKD--EDNYVAWKIRFRSFLRVTKKFGFIDGTLPKPDPFSPL 77

Query: 216 LPAWIR-NNIVISWILNSVSKPVSASILFSGSTRAIWIDLKERFQKKNVPRIFQLKRSLA 275
              W + N +V+ W++NS++  +  S++++ +   +W DL+  F      +I+QL+R LA
Sbjct: 78  YQPWEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMWEDLRRVFVPCVDLKIYQLRRRLA 137

Query: 276 TLEQNQDTI 282
           TL Q  D++
Sbjct: 138 TLRQGGDSV 144

BLAST of CSPI01G25250 vs. NCBI nr
Match: gi|778668587|ref|XP_011649121.1| (PREDICTED: uncharacterized protein LOC105434586 [Cucumis sativus])

HSP 1 Score: 230.3 bits (586), Expect = 8.8e-57
Identity = 112/138 (81.16%), Positives = 124/138 (89.86%), Query Frame = 1

Query: 148 TLFNQNQGYLNPHFLHHNDNTSLVLATEQLTEENYVSWTQAMTIGLSVKNKIGFVDETIT 207
           T F+QNQGYLNP+FLHHNDNT+LVL TEQLTEENYVSW++AMTIGLSVKNKIGFVD TI 
Sbjct: 30  TSFDQNQGYLNPYFLHHNDNTNLVLVTEQLTEENYVSWSRAMTIGLSVKNKIGFVDGTIA 89

Query: 208 KRTGDLLPAWIR-NNIVISWILNSVSKPVSASILFSGSTRAIWIDLKERFQKKNVPRIFQ 267
           + TGDLLP WIR NNIVISWILNSVSKP+SA+ILFS   R IW++LKERFQKKN PRIFQ
Sbjct: 90  RPTGDLLPVWIRNNNIVISWILNSVSKPISANILFSDLARTIWVELKERFQKKNAPRIFQ 149

Query: 268 LKRSLATLEQNQDTIGVN 285
           LKRSLATL QNQD+IG +
Sbjct: 150 LKRSLATLSQNQDSIGTS 167

BLAST of CSPI01G25250 vs. NCBI nr
Match: gi|700210852|gb|KGN65948.1| (hypothetical protein Csa_1G541900 [Cucumis sativus])

HSP 1 Score: 189.9 bits (481), Expect = 1.3e-44
Identity = 87/90 (96.67%), Positives = 88/90 (97.78%), Query Frame = 1

Query: 521 ICALQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAG 580
           +  LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAG
Sbjct: 4   VLKLQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAG 63

Query: 581 TIGEAYLAYNRVTGEFFDAAGDMKLKKPSS 611
           TIGEAYLAYNRVTGEFFDAAGDMKLKKPSS
Sbjct: 64  TIGEAYLAYNRVTGEFFDAAGDMKLKKPSS 93

BLAST of CSPI01G25250 vs. NCBI nr
Match: gi|778662022|ref|XP_011659237.1| (PREDICTED: twinkle homolog protein, chloroplastic/mitochondrial-like isoform X1 [Cucumis sativus])

HSP 1 Score: 189.1 bits (479), Expect = 2.3e-44
Identity = 87/87 (100.00%), Positives = 87/87 (100.00%), Query Frame = 1

Query: 524 LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG 583
           LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG
Sbjct: 628 LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG 687

Query: 584 EAYLAYNRVTGEFFDAAGDMKLKKPSS 611
           EAYLAYNRVTGEFFDAAGDMKLKKPSS
Sbjct: 688 EAYLAYNRVTGEFFDAAGDMKLKKPSS 714

BLAST of CSPI01G25250 vs. NCBI nr
Match: gi|659121592|ref|XP_008460736.1| (PREDICTED: uncharacterized protein LOC103499498 [Cucumis melo])

HSP 1 Score: 185.3 bits (469), Expect = 3.3e-43
Identity = 108/219 (49.32%), Positives = 134/219 (61.19%), Query Frame = 1

Query: 189 MTIGLSVKNKIGFVDETITKRTGDLLPAWIR-NNIVISWILNSVSKPVSASILFSGSTRA 248
           M IGLSVKNK+GF+D T+ +   DLLP+WIR NNIVISWILNSVSKP+S SILF+ S R+
Sbjct: 1   MIIGLSVKNKLGFIDGTLPRPNDDLLPSWIRNNNIVISWILNSVSKPISTSILFADSARS 60

Query: 249 IWIDLKERFQKKNVPRIFQLKRSLATLEQNQDTIG------------------------- 308
           IW+DLKERFQ+KN PRIF LKRSLA L  NQ+++                          
Sbjct: 61  IWLDLKERFQRKNAPRIFHLKRSLAILSHNQESVSMYFTKFKTLIDELNSYRPACTCGSC 120

Query: 309 -------VNDRFPPNVISHGLMGLNENISQARAQLLLMDPLPSTSRAFSLLLQEEQQRSI 368
                  V +      +   LMGLN++ +Q R QLLLM+P+PS SRAFSLLLQEEQQR+I
Sbjct: 121 RCGENQEVANFLQTEYLMDFLMGLNDSYAQTRTQLLLMEPIPSISRAFSLLLQEEQQRAI 180

Query: 369 GSFSSTAPTMAFAISVTSSMDTLLDIRPSNNSIATQNNE 375
            SFS    T   A++ +S+        P NNS   Q  +
Sbjct: 181 SSFSPAINTPTIALAASSN-------NPKNNSAHKQRKD 212

BLAST of CSPI01G25250 vs. NCBI nr
Match: gi|778662016|ref|XP_011659233.1| (PREDICTED: twinkle homolog protein, chloroplastic/mitochondrial [Cucumis sativus])

HSP 1 Score: 175.3 bits (443), Expect = 3.4e-40
Identity = 81/87 (93.10%), Positives = 84/87 (96.55%), Query Frame = 1

Query: 524 LQNWSGCPPNMHDISGNAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG 583
           LQNWSG PPNM+DISG+AHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG
Sbjct: 631 LQNWSGSPPNMYDISGSAHFINKCDNGIVIHRNRDPESGPIDLVQVCVRKVRNKVAGTIG 690

Query: 584 EAYLAYNRVTGEFFDAAGDMKLKKPSS 611
           EAYL YNRVTGEF DAAGD+KLKKPSS
Sbjct: 691 EAYLEYNRVTGEFLDAAGDVKLKKPSS 717

The following BLAST results are available for this feature:
Database: Swiss-Prot
Match Name E-value Identity Description
TWIH_ARATH 2.1e-26 68.42 Twinkle homolog protein, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=...
Database: TrEMBL
Match Name E-value Identity Description
A0A0A0LVZ4_CUCSA 9.2e-45 96.67 Uncharacterized protein OS=Cucumis sativus GN=Csa_1G541900 PE=4 SV=1
A0A0A0LYM8_CUCSA 2.4e-40 93.10 Uncharacterized protein OS=Cucumis sativus GN=Csa_1G541890 PE=4 SV=1
A0A151U482_CAJCA 6.0e-36 43.18 Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
A0A151RRY1_CAJCA 5.1e-35 40.00 Uncharacterized protein OS=Cajanus cajan GN=KK1_033170 PE=4 SV=1
A5BJ98_VITVI 6.6e-35 36.59 Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_009790 PE=4 SV=1
Database: TAIR10
Match Name E-value Identity Description
AT1G30680.1 1.2e-27 68.42 toprim domain-containing protein
AT1G21280.1 2.7e-11 31.01 Retrotransposon gag protein (InterPro:IPR005162)
Database: NCBI nr
Match Name E-value Identity Description
gi|778668587|ref|XP_011649121.1| 8.8e-57 81.16 PREDICTED: uncharacterized protein LOC105434586 [Cucumis sativus]
gi|700210852|gb|KGN65948.1| 1.3e-44 96.67 hypothetical protein Csa_1G541900 [Cucumis sativus]
gi|778662022|ref|XP_011659237.1| 2.3e-44 100.00 PREDICTED: twinkle homolog protein, chloroplastic/mitochondrial-like isoform X1 ...
gi|659121592|ref|XP_008460736.1| 3.3e-43 49.32 PREDICTED: uncharacterized protein LOC103499498 [Cucumis melo]
gi|778662016|ref|XP_011659233.1| 3.4e-40 93.10 PREDICTED: twinkle homolog protein, chloroplastic/mitochondrial [Cucumis sativus...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term Definition
IPR027032 Twinkle-like protein
IPR027417 P-loop_NTPase
Vocabulary: Molecular Function
Term Definition
GO:0003697 single-stranded DNA binding
GO:0043139 5'-3' DNA helicase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032508 DNA duplex unwinding
biological_process GO:0006260 DNA replication
biological_process GO:0008150 biological_process
biological_process GO:0015074 DNA integration
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005657 replication fork
cellular_component GO:0005575 cellular_component
molecular_function GO:0043139 5'-3' DNA helicase activity
molecular_function GO:0005524 ATP binding
molecular_function GO:0003697 single-stranded DNA binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name Unique Name Type
CSPI01G25250.1 CSPI01G25250.1 mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR027032 | Twinkle-like protein | PANTHER | PTHR12873 | T7-LIKE MITOCHONDRIAL DNA HELICASE | coord: 525..598; score: 4.1
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | G3DSA:3.40.50.300 | | coord: 526..600; score: 2.
None | No IPR available | PANTHER | PTHR12873:SF0 | TWINKLE PROTEIN, MITOCHONDRIAL | coord: 525..598; score: 4.1