Cucsa.119520.1 (mRNA) Cucumber (Gy14) v1

NameCucsa.119520.1
TypemRNA
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionGlutamate formiminotransferase 1
Locationscaffold00998 : 1444234 .. 1445718 (-)
Sequence length1307
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCACTCTCCTTCCTTCAAGTGTAATCGAGAATACAAATTCTTATAATTCAAAACAACATGAAACAAGAAATTAAAATATAGCTATAGTATGAGCCACGGGGTTGAATATAAAAATGAATAATAAAAACAAAAATGTTCTTGTTTTTGACCAGACGCAAACCCTGAATAAGACTGCTACCTGATCAGCCTCCATTTTGCTCACTTTAAAGTTTTTAGTTATCGGCCTCACGGGTCTCTCTTCTCCCAATTCCCCCATATATCTCCCCTTCTAAAAACCCTTACCGATCACAAATCGAAGAAACCAACCCAAGCATATACACCCACTTAGCAATGGCTTTCCATCTCACCGCCAAGGTAAAAATGAGTTCTCGTACCCCAATTTCGTCTCAATTGCTTTATGCTAATTTACGAATTGGGAATTGAACACCTTAATTGTTAAATAATATTCCAGGACAAGAAAAGAAGCTTGGATCAAAAAGTCCTTTTATGCTGCAAATACTACGTTTCTGAATCGCGCAATCGTTCTGTACTAGAGGCCATCGAGGGAGCTGCAAGAGAAGACCCAGATTCTGTTATTGTAAACAAATTCGAAGACGGAGCTTACAACAGAACAAGGTACACCATCGTCTCTTACGTCGTTCACGACACCACAGGCAACGCCATTTACAGCCCATTGCTCCAAACCGTACTGTCTATGACCCAAGTTGCTTTCTCTCATATTAATCTCGAGTCTCATTCCGGTACTCACCCTCGGCTTGGAGTCGTAGATGACATCGTTTTTCATCCCCTGGCTCGAGCCTCCCTCCACGAAGCCGCTTGGCTAGCTAAGGCAGTCGCTAAGGATATCGCTGCCATGTTTCAAGGTCTTTTAATGAATTACATACAATATTGGGATTCCATTGGTGTTGGTGATTTATAAGTATTGGGATTGTGATTTTGTGCAGTGCCTGTATTTCTTTACTCTGCGGCTCACCCAAGTGGGAAAGCGCCGGACGATTTGAGGCGTGAGCTTGGGTATTTCCGGCCAAATTACAAGGGGAATCAATGGGCTGGGTGGTCGATGCCGGAAACTTTGCCGGAGAATCCTGATGAAGGTCCAAATACAGTATCTCGAGAGCGAGGAATCACGATGATTGGAGCGCGTCCGTGGACGGCAATGTATAATATTCCAATTCTGTCGACGGACGTGTCAGCAACACGGAGAATAGCGAGGATGGTGAGTGGGAGAGGAGGTGGATTGCCGACGGTGCAAACGATAGGGCTTCTTCACGATGATGAGACCACGGAGATAGCTTGTGTTCTGTTGGAGCCGAATCAGGTTGGAGCAGATCGAGTTCAGAGACATGTGGAGATTGTTGCGGCCCAATTCGGGTTAGAAGTTGAGAATGGATATTTTACTGATTACTCACCAGAGATGATTGTTGAGAAATATTTGAATTTGATTTCTGGCACCCAAAGTCTATCGGGAAATCGTTTGAACTAA

mRNA sequence

ATGCACTCTCCTTCCTTCAAGTGTAATCGAGAATACAAATTCTTATAATTCAAAACAACATGAAACAAGAAATTAAAATATAGCTATAGTATGAGCCACGGGGTTGAATATAAAAATGAATAATAAAAACAAAAATGTTCTTGTTTTTGACCAGACGCAAACCCTGAATAAGACTGCTACCTGATCAGCCTCCATTTTGCTCACTTTAAAGTTTTTAGTTATCGGCCTCACGGGTCTCTCTTCTCCCAATTCCCCCATATATCTCCCCTTCTAAAAACCCTTACCGATCACAAATCGAAGAAACCAACCCAAGCATATACACCCACTTAGCAATGGCTTTCCATCTCACCGCCAAGGACAAGAAAAGAAGCTTGGATCAAAAAGTCCTTTTATGCTGCAAATACTACGTTTCTGAATCGCGCAATCGTTCTGTACTAGAGGCCATCGAGGGAGCTGCAAGAGAAGACCCAGATTCTGTTATTGTAAACAAATTCGAAGACGGAGCTTACAACAGAACAAGGTACACCATCGTCTCTTACGTCGTTCACGACACCACAGGCAACGCCATTTACAGCCCATTGCTCCAAACCGTACTGTCTATGACCCAAGTTGCTTTCTCTCATATTAATCTCGAGTCTCATTCCGGTACTCACCCTCGGCTTGGAGTCGTAGATGACATCGTTTTTCATCCCCTGGCTCGAGCCTCCCTCCACGAAGCCGCTTGGCTAGCTAAGGCAGTCGCTAAGGATATCGCTGCCATGTTTCAAGTGCCTGTATTTCTTTACTCTGCGGCTCACCCAAGTGGGAAAGCGCCGGACGATTTGAGGCGTGAGCTTGGGTATTTCCGGCCAAATTACAAGGGGAATCAATGGGCTGGGTGGTCGATGCCGGAAACTTTGCCGGAGAATCCTGATGAAGGTCCAAATACAGTATCTCGAGAGCGAGGAATCACGATGATTGGAGCGCGTCCGTGGACGGCAATGTATAATATTCCAATTCTGTCGACGGACGTGTCAGCAACACGGAGAATAGCGAGGATGGTGAGTGGGAGAGGAGGTGGATTGCCGACGGTGCAAACGATAGGGCTTCTTCACGATGATGAGACCACGGAGATAGCTTGTGTTCTGTTGGAGCCGAATCAGGTTGGAGCAGATCGAGTTCAGAGACATGTGGAGATTGTTGCGGCCCAATTCGGGTTAGAAGTTGAGAATGGATATTTTACTGATTACTCACCAGAGATGATTGTTGAGAAATATTTGAATTTGATTTCTGGCACCCAAAGTCTATCGGGAAATCGTTTGAACTAA

Coding sequence (CDS)

ATGGCTTTCCATCTCACCGCCAAGGACAAGAAAAGAAGCTTGGATCAAAAAGTCCTTTTATGCTGCAAATACTACGTTTCTGAATCGCGCAATCGTTCTGTACTAGAGGCCATCGAGGGAGCTGCAAGAGAAGACCCAGATTCTGTTATTGTAAACAAATTCGAAGACGGAGCTTACAACAGAACAAGGTACACCATCGTCTCTTACGTCGTTCACGACACCACAGGCAACGCCATTTACAGCCCATTGCTCCAAACCGTACTGTCTATGACCCAAGTTGCTTTCTCTCATATTAATCTCGAGTCTCATTCCGGTACTCACCCTCGGCTTGGAGTCGTAGATGACATCGTTTTTCATCCCCTGGCTCGAGCCTCCCTCCACGAAGCCGCTTGGCTAGCTAAGGCAGTCGCTAAGGATATCGCTGCCATGTTTCAAGTGCCTGTATTTCTTTACTCTGCGGCTCACCCAAGTGGGAAAGCGCCGGACGATTTGAGGCGTGAGCTTGGGTATTTCCGGCCAAATTACAAGGGGAATCAATGGGCTGGGTGGTCGATGCCGGAAACTTTGCCGGAGAATCCTGATGAAGGTCCAAATACAGTATCTCGAGAGCGAGGAATCACGATGATTGGAGCGCGTCCGTGGACGGCAATGTATAATATTCCAATTCTGTCGACGGACGTGTCAGCAACACGGAGAATAGCGAGGATGGTGAGTGGGAGAGGAGGTGGATTGCCGACGGTGCAAACGATAGGGCTTCTTCACGATGATGAGACCACGGAGATAGCTTGTGTTCTGTTGGAGCCGAATCAGGTTGGAGCAGATCGAGTTCAGAGACATGTGGAGATTGTTGCGGCCCAATTCGGGTTAGAAGTTGAGAATGGATATTTTACTGATTACTCACCAGAGATGATTGTTGAGAAATATTTGAATTTGATTTCTGGCACCCAAAGTCTATCGGGAAATCGTTTGAACTAA

Protein sequence

MAFHLTAKDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVSYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHEAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPETLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEKYLNLISGTQSLSGNRLN*
BLAST of Cucsa.119520.1 vs. Swiss-Prot
Match: GLFT_STRP1 (Glutamate formimidoyltransferase OS=Streptococcus pyogenes serotype M1 GN=M5005_Spy1772 PE=1 SV=1)

HSP 1 Score: 95.5 bits (236), Expect = 1.1e-18
Identity = 72/281 (25.62%), Postives = 134/281 (47.69%), Query Frame = 1

Query: 17  KVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVSYVVHDTTG 76
           K++ C   + SE +N++V++ +   A+  P   +++   D ++NR+ +T+V     D + 
Sbjct: 3   KIVECIPNF-SEGQNQAVIDGLVATAKSIPGVTLLDYSSDASHNRSVFTLVG---DDQS- 62

Query: 77  NAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHEAAWLAKAV 136
                 + +    + + A  +I++  H G HPR+G  D   F P+   +  E   ++K V
Sbjct: 63  ------IQEAAFQLVKYASENIDMTKHHGEHPRMGATDVCPFVPIKDITTQECVEISKQV 122

Query: 137 AKDIAAMFQVPVFLY--SAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPETLPEN-- 196
           A+ I     +P+FLY  SA  P        R+ L   R   KG Q+ G  MPE L E   
Sbjct: 123 AERINRELGIPIFLYEDSATRPE-------RQNLAKVR---KG-QFEG--MPEKLLEEDW 182

Query: 197 -PDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQTIG 256
            PD G   +    G+T +GAR     +N+ + + ++    +IA+++ G GGG    + IG
Sbjct: 183 APDYGDRKIHPTAGVTAVGARMPLVAFNVNLDTDNIDIAHKIAKIIRGSGGGYKYCKAIG 242

Query: 257 -LLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEV 292
            +L D    +++  ++   +    R    ++  A ++G+ V
Sbjct: 243 VMLEDRHIAQVSMNMVNFEKCSLYRTFETIKFEARRYGVNV 259

BLAST of Cucsa.119520.1 vs. Swiss-Prot
Match: GLFT_PICTO (Glutamate formimidoyltransferase OS=Picrophilus torridus (strain ATCC 700027 / DSM 9790 / JCM 10055 / NBRC 100828) GN=PTO1242 PE=1 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 1.7e-14
Identity = 67/225 (29.78%), Postives = 109/225 (48.44%), Query Frame = 1

Query: 27  SESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVSYVVHDTTGNAIYSPLLQT 86
           SE R+ S +E I  + +      I++   D  +NR+  T    +            +++ 
Sbjct: 16  SEGRDISKIEKIIDSIKNIEGVKILDLNVDPQHNRSVITFTCGIER----------IIEA 75

Query: 87  VLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHEAAWLAKAVAKDIAAMFQV 146
            +SM + A S I++E HSG HPR G  D     P+  AS+ +    ++ + + + +   +
Sbjct: 76  GISMIKTAASLIDMEKHSGLHPRFGATDVFPIIPIT-ASMDDCIIASRNLGRLVGSELNI 135

Query: 147 PVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWS-MPETLPENPDEGPNTVSRERG 206
           PV++YS    S   P+  RR L   R   K  Q+     + +T    PD GP+++    G
Sbjct: 136 PVYMYSE---SAMVPE--RRNLENIRN--KNVQYEELKELIKTDKYRPDFGPDSLG-SAG 195

Query: 207 ITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQTI 251
             +IGARP    YNI I + D+   RRIA  + GR GGL T++T+
Sbjct: 196 AVIIGARPALIAYNIYISTDDIKIGRRIASALRGRDGGLNTLKTL 221

BLAST of Cucsa.119520.1 vs. TrEMBL
Match: A0A0A0KAE4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G124100 PE=4 SV=1)

HSP 1 Score: 653.7 bits (1685), Expect = 1.2e-184
Identity = 324/324 (100.00%), Postives = 324/324 (100.00%), Query Frame = 1

Query: 1   MAFHLTAKDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYN 60
           MAFHLTAKDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYN
Sbjct: 1   MAFHLTAKDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYN 60

Query: 61  RTRYTIVSYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHP 120
           RTRYTIVSYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHP
Sbjct: 61  RTRYTIVSYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHP 120

Query: 121 LARASLHEAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQW 180
           LARASLHEAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQW
Sbjct: 121 LARASLHEAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQW 180

Query: 181 AGWSMPETLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGR 240
           AGWSMPETLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGR
Sbjct: 181 AGWSMPETLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGR 240

Query: 241 GGGLPTVQTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYS 300
           GGGLPTVQTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYS
Sbjct: 241 GGGLPTVQTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYS 300

Query: 301 PEMIVEKYLNLISGTQSLSGNRLN 325
           PEMIVEKYLNLISGTQSLSGNRLN
Sbjct: 301 PEMIVEKYLNLISGTQSLSGNRLN 324

BLAST of Cucsa.119520.1 vs. TrEMBL
Match: W9R857_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022176 PE=4 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 2.6e-131
Identity = 226/305 (74.10%), Postives = 263/305 (86.23%), Query Frame = 1

Query: 9   DKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVS 68
           +KK++ DQ +LLCCK ++SES NRSVL+AIE AAR DP+SVIV KF+D AYNR RYTIVS
Sbjct: 2   EKKKAADQSMLLCCKLFISESHNRSVLDAIERAARHDPESVIVTKFDDRAYNRARYTIVS 61

Query: 69  YVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHE 128
           YVVHD TG A+YSPL QTV++M + AF  INLE+HSG HPRLGVVDDIVFHPLA ASL E
Sbjct: 62  YVVHDCTGGAVYSPLQQTVVAMAEAAFDAINLETHSGAHPRLGVVDDIVFHPLAHASLDE 121

Query: 129 AAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPET 188
           AAWLAKAVA DI   FQVPV+LY+AAHP+GKA D +RRELGY+RPN+ GNQWAGW+MPE 
Sbjct: 122 AAWLAKAVALDIGNRFQVPVYLYAAAHPTGKALDTIRRELGYYRPNFMGNQWAGWTMPEV 181

Query: 189 LPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQ 248
           LPE P+EGP +VSR RGITMIGA PW A+YN+P+LSTDVSA +RIARMVS RGGGLPTVQ
Sbjct: 182 LPEKPNEGPTSVSRARGITMIGACPWVALYNVPLLSTDVSAAKRIARMVSARGGGLPTVQ 241

Query: 249 TIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEKY 308
           T+GL+H ++ TEIAC+LLEPNQ+GADRVQ  VE++AAQ GL+VE GYFTDYSPEMIVEKY
Sbjct: 242 TLGLVHGEDATEIACMLLEPNQIGADRVQNRVEVLAAQEGLDVEKGYFTDYSPEMIVEKY 301

Query: 309 LNLIS 314
           + L S
Sbjct: 302 MKLTS 306

BLAST of Cucsa.119520.1 vs. TrEMBL
Match: A0A059DFH5_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A01786 PE=4 SV=1)

HSP 1 Score: 473.8 bits (1218), Expect = 1.7e-130
Identity = 225/309 (72.82%), Postives = 262/309 (84.79%), Query Frame = 1

Query: 8   KDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIV 67
           K+ K+     +LLCCK ++SESRNR+VL++IE AA  DP+++IVNKFED AYNR RYT+V
Sbjct: 17  KENKKISSTSMLLCCKLFISESRNRAVLDSIERAAHLDPETIIVNKFEDRAYNRVRYTLV 76

Query: 68  SYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLH 127
           S+VVHD TG A+YSPL QTV++M + AF  INLE HSG HPRLGVVDDIVFHPLARASL 
Sbjct: 77  SHVVHDCTGQAVYSPLQQTVMAMAEAAFEAINLELHSGAHPRLGVVDDIVFHPLARASLD 136

Query: 128 EAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPE 187
           EAAWLAKAVA DI    QVPVFLY+AAHP+GKA D +RRELGYFRPN+ GNQWAGW+MPE
Sbjct: 137 EAAWLAKAVAADIGLKLQVPVFLYAAAHPTGKALDTIRRELGYFRPNFMGNQWAGWTMPE 196

Query: 188 TLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTV 247
            LPE PDEGP  VSR RGITMIGARPW A+YN+PILSTDVSATRRIARMVS RGGGLPTV
Sbjct: 197 VLPERPDEGPTHVSRTRGITMIGARPWVALYNVPILSTDVSATRRIARMVSARGGGLPTV 256

Query: 248 QTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEK 307
           QT+GL+H ++ TEIAC+LLEPNQ+GADRVQ  VE +AAQ GL+VE GYFTD+SPEMI+EK
Sbjct: 257 QTLGLVHGEDATEIACMLLEPNQIGADRVQNRVETLAAQEGLDVERGYFTDFSPEMIIEK 316

Query: 308 YLNLISGTQ 317
           Y+ L+S  +
Sbjct: 317 YVKLVSSRE 325

BLAST of Cucsa.119520.1 vs. TrEMBL
Match: M5WXD0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008822mg PE=4 SV=1)

HSP 1 Score: 473.8 bits (1218), Expect = 1.7e-130
Identity = 224/304 (73.68%), Postives = 265/304 (87.17%), Query Frame = 1

Query: 10  KKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVSY 69
           KK+++DQ +LLCCK Y+SESRN + L+AIE AAR DP+SVIVNKFED AYNR RYTIVSY
Sbjct: 11  KKKTIDQSMLLCCKLYISESRNHAALDAIERAARLDPESVIVNKFEDRAYNRVRYTIVSY 70

Query: 70  VVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHEA 129
           V+HD+TG+AIYSPL QTV++M + AF  INLE HSG HPRLGVVDDIVFHPLARASL EA
Sbjct: 71  VMHDSTGSAIYSPLQQTVMAMAEAAFGAINLEQHSGAHPRLGVVDDIVFHPLARASLDEA 130

Query: 130 AWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPETL 189
           AWLAKAVA DI   FQVPV+LY+AAHP+GKA D +RRELGY+RPN+ G+QWAGW+MPE L
Sbjct: 131 AWLAKAVAVDIGNRFQVPVYLYAAAHPTGKALDTIRRELGYYRPNFMGSQWAGWTMPEIL 190

Query: 190 PENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQT 249
            E PDEGP ++   RGI+MIGARPW A+YNIPILSTDV+ATRRIARMVS RGGGLPTVQT
Sbjct: 191 HEKPDEGPTSICPARGISMIGARPWVALYNIPILSTDVAATRRIARMVSARGGGLPTVQT 250

Query: 250 IGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEKYL 309
           +GL+H +++TEIAC+LLEPNQ+G DRVQ HVE++AAQ GL+VE GYFTD+SP+MI+EKY+
Sbjct: 251 LGLVHGEDSTEIACMLLEPNQIGGDRVQNHVEMLAAQEGLDVEKGYFTDHSPDMIIEKYM 310

Query: 310 NLIS 314
            L S
Sbjct: 311 KLTS 314

BLAST of Cucsa.119520.1 vs. TrEMBL
Match: V7C5W8_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_003G044600g PE=4 SV=1)

HSP 1 Score: 464.5 bits (1194), Expect = 1.0e-127
Identity = 217/317 (68.45%), Postives = 268/317 (84.54%), Query Frame = 1

Query: 1   MAFHLTAKDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYN 60
           M F+ TAKD+K+ +DQ +LLCCK++VSESR  + LEAIE AAR +P++VIVNKF D AYN
Sbjct: 1   MDFNCTAKDQKKGVDQSILLCCKFFVSESRRMATLEAIECAARSNPETVIVNKFHDRAYN 60

Query: 61  RTRYTIVSYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHP 120
           R R+T+VSYV+HD TG+ +YSPL QTV++M + AF+ INLE H G HPRLG VDDIVFHP
Sbjct: 61  RARFTLVSYVLHDCTGSPVYSPLHQTVIAMAEAAFNTINLEFHDGAHPRLGAVDDIVFHP 120

Query: 121 LARASLHEAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQW 180
           L RASL EAAWLAKAV+ DI   F VPVFLY+AAHP+GK  D +RRELGY+RPN++G+QW
Sbjct: 121 LGRASLDEAAWLAKAVSADIGNRFSVPVFLYAAAHPTGKELDTIRRELGYYRPNFRGSQW 180

Query: 181 AGWSMPETLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGR 240
           AGW+MP+TLP++PDEGPN VSR +GI+MIGARPW A+YN+PIL TDVSA RRIAR VSGR
Sbjct: 181 AGWAMPDTLPQSPDEGPNVVSRAKGISMIGARPWVALYNVPILCTDVSAARRIARKVSGR 240

Query: 241 GGGLPTVQTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYS 300
           GGGLPTVQT+GL+H +++TEIAC+LLE N +GADRVQ  VE++AAQ GL VE GYFTD+S
Sbjct: 241 GGGLPTVQTLGLVHGEDSTEIACMLLESNVIGADRVQHRVEMLAAQEGLGVEKGYFTDFS 300

Query: 301 PEMIVEKYLNLISGTQS 318
           PEMIV++Y+ LI+  +S
Sbjct: 301 PEMIVDQYMKLITANKS 317

BLAST of Cucsa.119520.1 vs. TAIR10
Match: AT2G20830.2 (AT2G20830.2 transferases;folic acid binding)

HSP 1 Score: 285.0 bits (728), Expect = 5.7e-77
Identity = 139/300 (46.33%), Postives = 199/300 (66.33%), Query Frame = 1

Query: 16  QKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVSYVVHDTT 75
           +++L CCK Y+SE+RN++ LEAIE A +  P + IVNKFED AY R  YT+VS     + 
Sbjct: 137 REMLGCCKVYISEARNKTALEAIERALKPFPPAAIVNKFEDAAYGRVGYTVVS-----SL 196

Query: 76  GNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHEAAWLAKA 135
            N   S L   V +M + A   INLE H G+HPRLGVVD I FHPL++ S+ + + +A +
Sbjct: 197 ANGSSSSLKNAVFAMVKTALDTINLELHCGSHPRLGVVDHICFHPLSQTSIEQVSSVANS 256

Query: 136 VAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPETLPENPDE 195
           +A DI ++ +VP +LY AA       D +RR+LGYF+ N +G++WAG    E +P  PD 
Sbjct: 257 LAMDIGSILRVPTYLYGAAEKEQCTLDSIRRKLGYFKANREGHEWAGGFDLEMVPLKPDA 316

Query: 196 GPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQTIGLLHD 255
           GP  VS+ +G+  +GA  W + YN+P++S D+ A RRIAR  S RGGGL +VQT+ L+H 
Sbjct: 317 GPQEVSKAKGVVAVGACGWVSNYNVPVMSNDLKAVRRIARKTSERGGGLASVQTMALVHG 376

Query: 256 DETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEKYLNLISGT 315
           +   E+AC LL P+QVG D VQ  +E +  + GL V  GY+TDY+P+ IVE+Y++L++ +
Sbjct: 377 EGVIEVACNLLNPSQVGGDEVQGLIERLGREEGLLVGKGYYTDYTPDQIVERYMDLLNNS 431

BLAST of Cucsa.119520.1 vs. NCBI nr
Match: gi|449444392|ref|XP_004139959.1| (PREDICTED: formimidoyltransferase-cyclodeaminase-like [Cucumis sativus])

HSP 1 Score: 653.7 bits (1685), Expect = 1.7e-184
Identity = 324/324 (100.00%), Postives = 324/324 (100.00%), Query Frame = 1

Query: 1   MAFHLTAKDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYN 60
           MAFHLTAKDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYN
Sbjct: 1   MAFHLTAKDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYN 60

Query: 61  RTRYTIVSYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHP 120
           RTRYTIVSYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHP
Sbjct: 61  RTRYTIVSYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHP 120

Query: 121 LARASLHEAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQW 180
           LARASLHEAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQW
Sbjct: 121 LARASLHEAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQW 180

Query: 181 AGWSMPETLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGR 240
           AGWSMPETLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGR
Sbjct: 181 AGWSMPETLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGR 240

Query: 241 GGGLPTVQTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYS 300
           GGGLPTVQTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYS
Sbjct: 241 GGGLPTVQTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYS 300

Query: 301 PEMIVEKYLNLISGTQSLSGNRLN 325
           PEMIVEKYLNLISGTQSLSGNRLN
Sbjct: 301 PEMIVEKYLNLISGTQSLSGNRLN 324

BLAST of Cucsa.119520.1 vs. NCBI nr
Match: gi|703096352|ref|XP_010095820.1| (hypothetical protein L484_022176 [Morus notabilis])

HSP 1 Score: 476.5 bits (1225), Expect = 3.8e-131
Identity = 226/305 (74.10%), Postives = 263/305 (86.23%), Query Frame = 1

Query: 9   DKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVS 68
           +KK++ DQ +LLCCK ++SES NRSVL+AIE AAR DP+SVIV KF+D AYNR RYTIVS
Sbjct: 2   EKKKAADQSMLLCCKLFISESHNRSVLDAIERAARHDPESVIVTKFDDRAYNRARYTIVS 61

Query: 69  YVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHE 128
           YVVHD TG A+YSPL QTV++M + AF  INLE+HSG HPRLGVVDDIVFHPLA ASL E
Sbjct: 62  YVVHDCTGGAVYSPLQQTVVAMAEAAFDAINLETHSGAHPRLGVVDDIVFHPLAHASLDE 121

Query: 129 AAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPET 188
           AAWLAKAVA DI   FQVPV+LY+AAHP+GKA D +RRELGY+RPN+ GNQWAGW+MPE 
Sbjct: 122 AAWLAKAVALDIGNRFQVPVYLYAAAHPTGKALDTIRRELGYYRPNFMGNQWAGWTMPEV 181

Query: 189 LPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQ 248
           LPE P+EGP +VSR RGITMIGA PW A+YN+P+LSTDVSA +RIARMVS RGGGLPTVQ
Sbjct: 182 LPEKPNEGPTSVSRARGITMIGACPWVALYNVPLLSTDVSAAKRIARMVSARGGGLPTVQ 241

Query: 249 TIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEKY 308
           T+GL+H ++ TEIAC+LLEPNQ+GADRVQ  VE++AAQ GL+VE GYFTDYSPEMIVEKY
Sbjct: 242 TLGLVHGEDATEIACMLLEPNQIGADRVQNRVEVLAAQEGLDVEKGYFTDYSPEMIVEKY 301

Query: 309 LNLIS 314
           + L S
Sbjct: 302 MKLTS 306

BLAST of Cucsa.119520.1 vs. NCBI nr
Match: gi|702244641|ref|XP_010051646.1| (PREDICTED: formimidoyltransferase-cyclodeaminase-like [Eucalyptus grandis])

HSP 1 Score: 473.8 bits (1218), Expect = 2.4e-130
Identity = 225/309 (72.82%), Postives = 262/309 (84.79%), Query Frame = 1

Query: 8   KDKKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIV 67
           K+ K+     +LLCCK ++SESRNR+VL++IE AA  DP+++IVNKFED AYNR RYT+V
Sbjct: 17  KENKKISSTSMLLCCKLFISESRNRAVLDSIERAAHLDPETIIVNKFEDRAYNRVRYTLV 76

Query: 68  SYVVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLH 127
           S+VVHD TG A+YSPL QTV++M + AF  INLE HSG HPRLGVVDDIVFHPLARASL 
Sbjct: 77  SHVVHDCTGQAVYSPLQQTVMAMAEAAFEAINLELHSGAHPRLGVVDDIVFHPLARASLD 136

Query: 128 EAAWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPE 187
           EAAWLAKAVA DI    QVPVFLY+AAHP+GKA D +RRELGYFRPN+ GNQWAGW+MPE
Sbjct: 137 EAAWLAKAVAADIGLKLQVPVFLYAAAHPTGKALDTIRRELGYFRPNFMGNQWAGWTMPE 196

Query: 188 TLPENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTV 247
            LPE PDEGP  VSR RGITMIGARPW A+YN+PILSTDVSATRRIARMVS RGGGLPTV
Sbjct: 197 VLPERPDEGPTHVSRTRGITMIGARPWVALYNVPILSTDVSATRRIARMVSARGGGLPTV 256

Query: 248 QTIGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEK 307
           QT+GL+H ++ TEIAC+LLEPNQ+GADRVQ  VE +AAQ GL+VE GYFTD+SPEMI+EK
Sbjct: 257 QTLGLVHGEDATEIACMLLEPNQIGADRVQNRVETLAAQEGLDVERGYFTDFSPEMIIEK 316

Query: 308 YLNLISGTQ 317
           Y+ L+S  +
Sbjct: 317 YVKLVSSRE 325

BLAST of Cucsa.119520.1 vs. NCBI nr
Match: gi|595967184|ref|XP_007217258.1| (hypothetical protein PRUPE_ppa008822mg [Prunus persica])

HSP 1 Score: 473.8 bits (1218), Expect = 2.4e-130
Identity = 224/304 (73.68%), Postives = 265/304 (87.17%), Query Frame = 1

Query: 10  KKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVSY 69
           KK+++DQ +LLCCK Y+SESRN + L+AIE AAR DP+SVIVNKFED AYNR RYTIVSY
Sbjct: 11  KKKTIDQSMLLCCKLYISESRNHAALDAIERAARLDPESVIVNKFEDRAYNRVRYTIVSY 70

Query: 70  VVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHEA 129
           V+HD+TG+AIYSPL QTV++M + AF  INLE HSG HPRLGVVDDIVFHPLARASL EA
Sbjct: 71  VMHDSTGSAIYSPLQQTVMAMAEAAFGAINLEQHSGAHPRLGVVDDIVFHPLARASLDEA 130

Query: 130 AWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPETL 189
           AWLAKAVA DI   FQVPV+LY+AAHP+GKA D +RRELGY+RPN+ G+QWAGW+MPE L
Sbjct: 131 AWLAKAVAVDIGNRFQVPVYLYAAAHPTGKALDTIRRELGYYRPNFMGSQWAGWTMPEIL 190

Query: 190 PENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQT 249
            E PDEGP ++   RGI+MIGARPW A+YNIPILSTDV+ATRRIARMVS RGGGLPTVQT
Sbjct: 191 HEKPDEGPTSICPARGISMIGARPWVALYNIPILSTDVAATRRIARMVSARGGGLPTVQT 250

Query: 250 IGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEKYL 309
           +GL+H +++TEIAC+LLEPNQ+G DRVQ HVE++AAQ GL+VE GYFTD+SP+MI+EKY+
Sbjct: 251 LGLVHGEDSTEIACMLLEPNQIGGDRVQNHVEMLAAQEGLDVEKGYFTDHSPDMIIEKYM 310

Query: 310 NLIS 314
            L S
Sbjct: 311 KLTS 314

BLAST of Cucsa.119520.1 vs. NCBI nr
Match: gi|645248294|ref|XP_008230229.1| (PREDICTED: formimidoyltransferase-cyclodeaminase-like [Prunus mume])

HSP 1 Score: 473.8 bits (1218), Expect = 2.4e-130
Identity = 224/304 (73.68%), Postives = 265/304 (87.17%), Query Frame = 1

Query: 10  KKRSLDQKVLLCCKYYVSESRNRSVLEAIEGAAREDPDSVIVNKFEDGAYNRTRYTIVSY 69
           KK+++DQ +LLCCK Y+SESRN + L+AIE AAR DP+SVIVNKFED AYNR RYTIVSY
Sbjct: 37  KKKTIDQSMLLCCKLYISESRNHAALDAIERAARLDPESVIVNKFEDRAYNRVRYTIVSY 96

Query: 70  VVHDTTGNAIYSPLLQTVLSMTQVAFSHINLESHSGTHPRLGVVDDIVFHPLARASLHEA 129
           V+HD+TG+AIYSPL QTV++M + AF  INLE HSG HPRLGVVDDIVFHPLARASL EA
Sbjct: 97  VMHDSTGSAIYSPLQQTVMAMAEAAFGAINLEQHSGAHPRLGVVDDIVFHPLARASLDEA 156

Query: 130 AWLAKAVAKDIAAMFQVPVFLYSAAHPSGKAPDDLRRELGYFRPNYKGNQWAGWSMPETL 189
           AWLAKAVA DI   FQVPV+LY+AAHP+GKA D +RRELGY+RPN+ G+QWAGW+MPE L
Sbjct: 157 AWLAKAVAVDIGNRFQVPVYLYAAAHPTGKALDTIRRELGYYRPNFMGSQWAGWTMPEIL 216

Query: 190 PENPDEGPNTVSRERGITMIGARPWTAMYNIPILSTDVSATRRIARMVSGRGGGLPTVQT 249
            E PDEGP ++   RGI+MIGARPW A+YNIPILSTDV+ATRRIARMVS RGGGLPTVQT
Sbjct: 217 HEKPDEGPTSICPARGISMIGARPWVALYNIPILSTDVAATRRIARMVSARGGGLPTVQT 276

Query: 250 IGLLHDDETTEIACVLLEPNQVGADRVQRHVEIVAAQFGLEVENGYFTDYSPEMIVEKYL 309
           +GL+H +++TEIAC+LLEPNQ+G DRVQ HVE++AAQ GL+VE GYFTD+SP+MI+EKY+
Sbjct: 277 LGLVHGEDSTEIACMLLEPNQIGGDRVQNHVEMLAAQEGLDVEKGYFTDHSPDMIIEKYM 336

Query: 310 NLIS 314
            L S
Sbjct: 337 KLTS 340

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GLFT_STRP11.1e-1825.62Glutamate formimidoyltransferase OS=Streptococcus pyogenes serotype M1 GN=M5005_... [more]
GLFT_PICTO1.7e-1429.78Glutamate formimidoyltransferase OS=Picrophilus torridus (strain ATCC 700027 / D... [more]
Match NameE-valueIdentityDescription
A0A0A0KAE4_CUCSA1.2e-184100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_6G124100 PE=4 SV=1[more]
W9R857_9ROSA2.6e-13174.10Uncharacterized protein OS=Morus notabilis GN=L484_022176 PE=4 SV=1[more]
A0A059DFH5_EUCGR1.7e-13072.82Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A01786 PE=4 SV=1[more]
M5WXD0_PRUPE1.7e-13073.68Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008822mg PE=4 SV=1[more]
V7C5W8_PHAVU1.0e-12768.45Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_003G044600g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G20830.25.7e-7746.33 transferases;folic acid binding[more]
Match NameE-valueIdentityDescription
gi|449444392|ref|XP_004139959.1|1.7e-184100.00PREDICTED: formimidoyltransferase-cyclodeaminase-like [Cucumis sativus][more]
gi|703096352|ref|XP_010095820.1|3.8e-13174.10hypothetical protein L484_022176 [Morus notabilis][more]
gi|702244641|ref|XP_010051646.1|2.4e-13072.82PREDICTED: formimidoyltransferase-cyclodeaminase-like [Eucalyptus grandis][more]
gi|595967184|ref|XP_007217258.1|2.4e-13073.68hypothetical protein PRUPE_ppa008822mg [Prunus persica][more]
gi|645248294|ref|XP_008230229.1|2.4e-13073.68PREDICTED: formimidoyltransferase-cyclodeaminase-like [Prunus mume][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR012886Formiminotransferase_N
IPR013802Formiminotransferase_C
IPR022384FormiminoTrfase_cat_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005542folic acid binding
GO:0016740transferase activity
Vocabulary: Biological Process
TermDefinition
GO:0008152metabolic process
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005542 folic acid binding
molecular_function GO:0016740 transferase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cucsa.119520Cucsa.119520gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cucsa.119520.1Cucsa.119520.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cucsa.119520.1.CDS.3Cucsa.119520.1.CDS.3CDS
Cucsa.119520.1.CDS.2Cucsa.119520.1.CDS.2CDS
Cucsa.119520.1.CDS.1Cucsa.119520.1.CDS.1CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cucsa.119520.1.five_prime_UTR.1Cucsa.119520.1.five_prime_UTR.1five_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR012886Formiminotransferase, N-terminal subdomainGENE3DG3DSA:3.30.990.10coord: 19..214
score: 2.3
IPR012886Formiminotransferase, N-terminal subdomainPFAMPF07837FTCD_Ncoord: 19..212
score: 2.5
IPR012886Formiminotransferase, N-terminal subdomainSMARTSM01222FTCD_N_2coord: 18..213
score: 9.6
IPR013802Formiminotransferase, C-terminal subdomainGENE3DG3DSA:3.30.70.670coord: 218..293
score: 3.
IPR013802Formiminotransferase, C-terminal subdomainSMARTSM01221FTCD_2coord: 218..308
score: 0.
IPR022384Formiminotransferase subdomainunknownSSF55116Formiminotransferase domain of formiminotransferase-cyclodeaminase.coord: 19..212
score: 7.06
NoneNo IPR availablePANTHERPTHR12234FORMIMINOTRANSFERASE-CYCLODEAMINASEcoord: 12..317
score: 2.1E
NoneNo IPR availablePANTHERPTHR12234:SF3SUBFAMILY NOT NAMEDcoord: 12..317
score: 2.1E