CmoCh13G003030 (gene) Cucurbita moschata (Rifu)

NameCmoCh13G003030
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionRNA-directed DNA polymerase homolog
LocationCmo_Chr13 : 3264614 .. 3268722 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTCGAGTTCAAAGAGTCCATGAGGAAGCGTTTTGTTCCACAATATTTTCAAGGGACATGGCGCAAAAGCTTCAAGCGTTGAAACAAGGACGCAAATCTGTGGAGGATTATTACAAGGAGATGGATACATTGATGGATCGACTTGAACTCGATGAGGACATGGAGGCTCTCATGGCGCGGTTTCTTAATGGGTTAAACACAGAGATTGCAGACAAGACTGATTTACAGCCTTATTCTAATATTGAGGAGTTGTTGCACATTGCAATTAAGATCGAGAGGCAAATCCAACAAAGGTCTCAACGGTATTCTTCTAAAACTTTTCCCAATTCTACTTCTACATGGAAAAAGGATAGTAAGAACATTGATTATAAGCATAGAAATCAAGAGATTAATGAGAAGCCTCAAGCTAAATTTGAGAAAGGGGAGAGTTCTAGAACAGGGAAAGAAAAAGTAGAAAAGTCTAATGTTCGAAATAGGGATTTAAAGTGTTGGAGATGTCAAGGGGTAGGACACTATAGTAGAGATTGCCCAAATGCAAGAATTATGACCATCAAGGAGGGAGAAATTGTTACGGATGACGAGGCACATGACGACATAAATGAGGAAACTGATGAGAGTGAGGAGTTTAGTGAAGAGGACCCTACACATATATCTTTGGTTACTCGACAAGCTCTAAACACCCACATTAAGGAGGACGGCCTAGACCAAAGGGAGAACTTGTTTCAAACTCGGTGTCTTGTTCAATCTGTACCTTGTAGTGTTGTCATTGATAGCGGTAGTTGCACCAATGTTGTGAGTTCCATTCTGGTCAAAAGACTTAATTTAAAGACACAACCACATCCAAGACCCTACAAGCTTCAATGGTTGAATGATTGTGGGGAAGTACGGATAACTCAACAAACTCTTGTTTCTTTTACAATTGGAAAATATGTTGATGATGTTTTATGTGATGTTGTATCCATGCATGTTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAAATGTATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGTTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGAGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTACTTTTGCAAGAGTTTGAACATTTATTTTCCGAGGAGATGCCTAGTAGTTTGCCACCACTTAAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACTGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGTATGTACGTGAAAGTTTGAGTCCTTGTTCTGTTCCAGTTATTCTTGTACCTAAGAAAGATGGTTCTTGGTGTATGTGTGTTGATTGTAGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGGAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAATCATGTCTTACGAGAATACTTAGGTAAGTTTGTGGTTGTTTATTTTGATGACATCCTTGTTTACTCTAAATCTTTAGATGATCATATTACCCATGTACGCAATGTTTTGACTAGTTTAAGAAACGAATGTTTGTATGTAAATTTAAAGAAATGTAGCTTTTGCATGGAAAAAGTTAACTTTCTTGGGTTTGTAGTTTCATCTAATGGTGTTGAGGTTGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACTCCGAAAAATGTGAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACTCTTGAATGAACTTGTTAAGAAAAATGTCTCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCGTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTTTTTTTGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTAGGAACTAAGCTAGTATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGATTGTTTGCCATTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCTATTGACTTGTTACCCATACCGTCAAAAGAGTTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTGTTCATAAATTGCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCCACCCGAATTAATAAAGGACGTAAGATTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGCATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTGCAACTTTTAGTGTTGTTGATTTGAGCCCTTTTGATGTAGGTGATGGCTTGGATTCGAGGACGAATCCTTCTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCCAAGAAGCTACAACAAACTATAAATAAGCATATATAA

mRNA sequence

ATGGGTCGAGTTCAAAGAGTCCATGAGGAAGCGTTTTGTTCCACAATATTTTCAAGGGACATGGCGCAAAAGCTTCAAGCGTTGAAACAAGGACGCAAATCTGTGGAGGATTATTACAAGGAGATGGATACATTGATGGATCGACTTGAACTCGATGAGGACATGGAGGCTCTCATGGCGCGGTTTCTTAATGGGTTAAACACAGAGATTGCAGACAAGACTGATTTACAGCCTTATTCTAATATTGAGGAGTTGTTGCACATTGCAATTAAGATCGAGAGGCAAATCCAACAAAGGTCTCAACGGTATTCTTCTAAAACTTTTCCCAATTCTACTTCTACATGGAAAAAGGATAGTAAGAACATTGATTATAAGCATAGAAATCAAGAGATTAATGAGAAGCCTCAAGCTAAATTTGAGAAAGGGGAGAGTTCTAGAACAGGGAAAGAAAAAGTAGAAAAGTCTAATGTTCGAAATAGGGATTTAAAGTGTTGGAGATGTCAAGGGGTAGGACACTATAGTAGAGATTGCCCAAATGCAAGAATTATGACCATCAAGGAGGGAGAAATTGTTACGGATGACGAGGCACATGACGACATAAATGAGGAAACTGATGAGAGTGAGGAGTTTAGTGAAGAGGACCCTACACATATATCTTTGGTTACTCGACAAGCTCTAAACACCCACATTAAGGAGGACGGCCTAGACCAAAGGGAGAACTTGTTTCAAACTCGGTGTCTTGTTCAATCTGTACCTTGTAGTGTTGTCATTGATAGCGGTAGTTGCACCAATGTTGTGAGTTCCATTCTGGTCAAAAGACTTAATTTAAAGACACAACCACATCCAAGACCCTACAAGCTTCAATGGTTGAATGATTGTGGGGAAGTACGGATAACTCAACAAACTCTTGTTTCTTTTACAATTGGAAAATATGTTGATGATGTTTTATGTGATGTTGTATCCATGCATGTTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAAATGTATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGTTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGAGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTACTTTTGCAAGAGTTTGAACATTTATTTTCCGAGGAGATGCCTAGTAGTTTGCCACCACTTAAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACTGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGGAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAATCATGTCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACTCCGAAAAATGTGAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACTCTTGAATGAACTTGTTAAGAAAAATGTCTCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCGTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTTTTTTTGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTAGGAACTAAGCTAGTATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGATTGTTTGCCATTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCTATTGACTTGTTACCCATACCGTCAAAAGAGTTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTGTTCATAAATTGCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCCACCCGAATTAATAAAGGACGTAAGATTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGCATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTGCAACTTTTAGTGTTGTTGATTTGAGCCCTTTTGATGTAGGTGATGGCTTGGATTCGAGGACGAATCCTTCTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCCAAGAAGCTACAACAAACTATAAATAAGCATATATAA

Coding sequence (CDS)

ATGGGTCGAGTTCAAAGAGTCCATGAGGAAGCGTTTTGTTCCACAATATTTTCAAGGGACATGGCGCAAAAGCTTCAAGCGTTGAAACAAGGACGCAAATCTGTGGAGGATTATTACAAGGAGATGGATACATTGATGGATCGACTTGAACTCGATGAGGACATGGAGGCTCTCATGGCGCGGTTTCTTAATGGGTTAAACACAGAGATTGCAGACAAGACTGATTTACAGCCTTATTCTAATATTGAGGAGTTGTTGCACATTGCAATTAAGATCGAGAGGCAAATCCAACAAAGGTCTCAACGGTATTCTTCTAAAACTTTTCCCAATTCTACTTCTACATGGAAAAAGGATAGTAAGAACATTGATTATAAGCATAGAAATCAAGAGATTAATGAGAAGCCTCAAGCTAAATTTGAGAAAGGGGAGAGTTCTAGAACAGGGAAAGAAAAAGTAGAAAAGTCTAATGTTCGAAATAGGGATTTAAAGTGTTGGAGATGTCAAGGGGTAGGACACTATAGTAGAGATTGCCCAAATGCAAGAATTATGACCATCAAGGAGGGAGAAATTGTTACGGATGACGAGGCACATGACGACATAAATGAGGAAACTGATGAGAGTGAGGAGTTTAGTGAAGAGGACCCTACACATATATCTTTGGTTACTCGACAAGCTCTAAACACCCACATTAAGGAGGACGGCCTAGACCAAAGGGAGAACTTGTTTCAAACTCGGTGTCTTGTTCAATCTGTACCTTGTAGTGTTGTCATTGATAGCGGTAGTTGCACCAATGTTGTGAGTTCCATTCTGGTCAAAAGACTTAATTTAAAGACACAACCACATCCAAGACCCTACAAGCTTCAATGGTTGAATGATTGTGGGGAAGTACGGATAACTCAACAAACTCTTGTTTCTTTTACAATTGGAAAATATGTTGATGATGTTTTATGTGATGTTGTATCCATGCATGTTGGAGATTTACTACTGGGGAGGCCATGGCAATTTGATCGTCGGGTAATGTATGATGGGTATGCAAATCGATACTCCTTTACTCACAACGGTAGAAAAACTACTCTTATCCCATTGTCTCCAAAAAATGTATTTATTGATCATTGCAAACTTGAAAAGAAAAGGCAAGAGGTTGATGCAAAAGCAGAGATTGAAAAAGAATCAAGTGAAAAAAAGAGCTTGAGAGAAAAGCAAGAGAGTAACACTCAGCCTAGAGAAAAAAAAGAGAGAAAAGCCAAATCAGTAAGCTTGTATGTTAGATCAAGTGAGGCTAGGAATGTTTTGCTCTCTAACCAGACTATTCTTGTACTTATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTGTTGTACTTTTGCAAGAGTTTGAACATTTATTTTCCGAGGAGATGCCTAGTAGTTTGCCACCACTTAAGGGATTGAACACAAGATTGACTTCATTCCTGGCGCGCCCATTCCAAACTGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATACAAAGGCAAGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGATTTAAAATCGGGTTATCATCAAATTCGCATGCATATTGGGGATGAGTGGGAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAACTAATGCACCTAGTACATTCATGAGACTAATGAATCATGTCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGATGAGGAGAAAGTGAAGGCTATAAAAGATTGGCCTACTCCGAAAAATGTGAGTGAGGTAAGAAGTTTTCATGGTCTTGCAAGTTTCTACCGTAGGTTCATTAAAAATTTTAGTACAATTGCTTCACTCTTGAATGAACTTGTTAAGAAAAATGTCTCCTTTATATGGGAAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCGTGCATTACCTAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCTTTAATGTTCTTTAGTGAGAAGTTGACTGGTGCATCTTTGAGGTATCCAACTTATGACAAAGAGCTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGATCATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACGACATGCTAAGTGGTTAGAATTTATTGAAACATTCCCGTATGTCATAAAATATAAACAAGGAAAGGAGAACATTGTAGCAGATGCTTTATCACGAAGGTATGTCCTCCTCAATACTTTGAATGCTAGGTTGTTGGGTTTTGAACACATAAAGGATTTGTATCAACATGACATGTTCTTTGCTCCTTTTGTTGAATCTTGTGAAAAAGGACTCATTGTGGATAATTACTTGTTGTTAGATGGATTTTTGTTCCGAAAAGGCAAACTTTGCATACCATCTTGTTCCATCCGTGAGCTACTTGTGAGGGAAGCTCATGGAGGTGGTTTAATGGCACACCATGGAGTTTCTAAAACTTATGATATGCTCTCTGAACATTTTTTTTTGCCTAAAATGAGACATGATGTTCATAAAGTTTGTGGTCGTTGCATAGCATGTAAACAAGCTAAGTCTAGGCTTCAACCACATGGTTTATACTCCCCATTACCAGTTCCTAATGGTCCATGGATTGATATATCAATGGATTTTGTTTTAGGTTTACCTAGGACTAGGAAAGGTTATGATAGCATTTTTGTTGTGGTTGATCGATTTAGTAAAATGGCTCATTTTATTCCTTGTCACAAAACTGATGATGCAAAACATATTGCAGACCTGTTCTTTAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGCGTGTTTTATGGGGTAAGTTAGGAACTAAGCTAGTATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAACAGAACCATGACTGCTATGCTTAGGGCTATTATTGATAAGAATCTTAAGACTTGGGAGGATTGTTTGCCATTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGTTTATGGCTTTAATCCTTTAACCCCTATTGACTTGTTACCCATACCGTCAAAAGAGTTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTGTTCATAAATTGCACAAGCAAGTGAAAGAACAAATTGAGAAACAAAATTCCAAGGTTGCCACCCGAATTAATAAAGGACGTAAGATTGTCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGAAAAGAAAGATTTCCTACTCAAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCGCATCAACGACAATGCTTATAAAATTGATTTACCAGGTAAGTACGGTGTTAGTGCAACTTTTAGTGTTGTTGATTTGAGCCCTTTTGATGTAGGTGATGGCTTGGATTCGAGGACGAATCCTTCTCAAGAGGGGGAGAATGATATGAACCACGACCAAGGAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCCAAGAAGCTACAACAAACTATAAATAAGCATATATAA
BLAST of CmoCh13G003030 vs. Swiss-Prot
Match: YI31B_YEAST (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-I PE=3 SV=2)

HSP 1 Score: 360.5 bits (924), Expect = 7.4e-98
Identity = 224/629 (35.61%), Postives = 334/629 (53.10%), Query Frame = 1

Query: 614  EEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKKNVSFIWEKDQE 673
            + K  AI+D+PTPK V + + F G+ ++YRRFI N S IA  +   +       W + Q+
Sbjct: 832  QHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQ--WTEKQD 891

Query: 674  LAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVL--MQNQRPLM----FFSEKL 733
             A   LK  L ++P+    N ++ + +  DAS  GIGAVL  + N+  L+    +FS+ L
Sbjct: 892  KAIEKLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSL 951

Query: 734  TGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLE 793
              A   YP  + EL  +++AL  +++ L  K F + TDH SL  L+ +N+  RR  +WL+
Sbjct: 952  ESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLD 1011

Query: 794  FIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFE-----------------HI 853
             + T+ + ++Y  G +N+VADA+SR    +    +R +  E                 H+
Sbjct: 1012 DLATYDFTLEYLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHM 1071

Query: 854  KDLYQHDMF------FAPFVESCEKG-LIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 913
            K+L QH++       F  + +  E       NY L D  ++ + +L +P    +  ++R 
Sbjct: 1072 KELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP-IKQQNAVMRL 1131

Query: 914  AHGGGLMA-HHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKS-RLQPHGLYSPL 973
             H   L   H GV+ T   +S  ++ PK++H + +    C+ C+  KS R + HGL  PL
Sbjct: 1132 YHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPL 1191

Query: 974  PVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFRE 1033
            P+  G W+DISMDFV GLP T    + I VVVDRFSK AHFI   KT DA  + DL FR 
Sbjct: 1192 PIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRY 1251

Query: 1034 VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAML 1093
            +   HG P++I SDRDV+  +  ++ L  +LG K   S+  HPQTDGQ+E   +T+  +L
Sbjct: 1252 IFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLL 1311

Query: 1094 RAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNF 1153
            RA +  N++ W   LP IEF YN     T   +PFEI  G+ P TP     I S + VN 
Sbjct: 1312 RAYVSTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTP----AIKSDDEVNA 1371

Query: 1154 DANAKVEFVHKLHK---QVKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPT 1208
             +   VE    L     Q KEQ+E    ++ T  N+ RK ++   GD V VH R   F  
Sbjct: 1372 RSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVH-RDAYFKK 1431

BLAST of CmoCh13G003030 vs. Swiss-Prot
Match: YG31B_YEAST (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 360.5 bits (924), Expect = 7.4e-98
Identity = 224/629 (35.61%), Postives = 335/629 (53.26%), Query Frame = 1

Query: 614  EEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKKNVSFIWEKDQE 673
            + K  AI+D+PTPK V + + F G+ ++YRRFI N S IA  +   +       W + Q+
Sbjct: 806  QHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQ--WTEKQD 865

Query: 674  LAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVL--MQNQRPLM----FFSEKL 733
             A + LK+ L ++P+    N ++ + +  DAS  GIGAVL  + N+  L+    +FS+ L
Sbjct: 866  KAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSL 925

Query: 734  TGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLE 793
              A   YP  + EL  +++AL  +++ L  K F + TDH SL  L+ +N+  RR  +WL+
Sbjct: 926  ESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLD 985

Query: 794  FIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFE-----------------HI 853
             + T+ + ++Y  G +N+VADA+SR    +    +R +  E                 H+
Sbjct: 986  DLATYDFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHM 1045

Query: 854  KDLYQHDMF------FAPFVESCEKG-LIVDNYLLLDGFLFRKGKLCIPSCSIRELLVRE 913
            K+L QH++       F  + +  E       NY L D  ++ + +L +P    +  ++R 
Sbjct: 1046 KELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP-IKQQNAVMRL 1105

Query: 914  AHGGGLMA-HHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKS-RLQPHGLYSPL 973
             H   L   H GV+ T   +S  ++ PK++H + +    C+ C+  KS R + HGL  PL
Sbjct: 1106 YHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPL 1165

Query: 974  PVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFRE 1033
            P+  G W+DISMDFV GLP T    + I VVVDRFSK AHFI   KT DA  + DL FR 
Sbjct: 1166 PIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRY 1225

Query: 1034 VVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAML 1093
            +   HG P++I SDRDV+  +  ++ L  +LG K   S+  HPQTDGQ+E   +T+  +L
Sbjct: 1226 IFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLL 1285

Query: 1094 RAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNF 1153
            RA    N++ W   LP IEF YN     T   +PFEI  G+ P TP     I S + VN 
Sbjct: 1286 RAYASTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTP----AIKSDDEVNA 1345

Query: 1154 DANAKVEFVHKLHK---QVKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPT 1208
             +   VE    L     Q KEQ+E    ++ T  N+ RK ++   GD V VH R   F  
Sbjct: 1346 RSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIGDHVLVH-RDAYFKK 1405

BLAST of CmoCh13G003030 vs. Swiss-Prot
Match: TF27_SCHPO (Transposon Tf2-7 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-7 PE=3 SV=1)

HSP 1 Score: 333.2 bits (853), Expect = 1.3e-89
Identity = 211/655 (32.21%), Postives = 334/655 (50.99%), Query Frame = 1

Query: 603  YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKK 662
            Y +S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +   LN L+KK
Sbjct: 613  YHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKK 672

Query: 663  NVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQNQR---- 722
            +V + W   Q  A   +K+ L S P+    +F     +E DAS V +GAVL Q       
Sbjct: 673  DVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKY 732

Query: 723  -PLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKHLRVQ 782
             P+ ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L   R+ 
Sbjct: 733  YPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG-RIT 792

Query: 783  NKL---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------RYVLL 842
            N+    N+R A+W  F++ F + I Y+ G  N +ADALSR                   +
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFV 852

Query: 843  NTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIPS-CS 902
            N ++        +   Y +D      + + +K  + +N  L DG L   K ++ +P+   
Sbjct: 853  NQISITDDFKNQVVTEYTNDTKLLNLLNNEDKR-VEENIQLKDGLLINSKDQILLPNDTQ 912

Query: 903  IRELLVREAHGGGLMAHHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKSRL-QP 962
            +   ++++ H  G + H G+    +++   F    +R  + +    C  C+  KSR  +P
Sbjct: 913  LTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKP 972

Query: 963  HGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHI 1022
            +G   P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  A+  
Sbjct: 973  YGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQT 1032

Query: 1023 ADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVN 1082
            A +F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N
Sbjct: 1033 ARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTN 1092

Query: 1083 RTMTAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPIDLLPI 1142
            +T+  +LR +   +  TW D +  ++ +YN  +HS T+ TPFEIV+ ++P L+P++L   
Sbjct: 1093 QTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSF 1152

Query: 1143 PSKEFVNFDANAKVEFVHKLHKQVKEQIEKQNSKVATRIN-KGRKIVIFKPGDWVWVHFR 1202
              K   N     +V       + VKE +   N K+    + K ++I  F+PGD V V   
Sbjct: 1153 SDKTDENSQETIQV------FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1212

Query: 1203 KERFPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPG--KYGVSATFSVVDLSPF 1226
            K  F   + +KL P   GPF VL++   N Y++DLP   K+  S+TF V  L  +
Sbjct: 1213 KTGF-LHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257

BLAST of CmoCh13G003030 vs. Swiss-Prot
Match: TF22_SCHPO (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 333.2 bits (853), Expect = 1.3e-89
Identity = 211/655 (32.21%), Postives = 334/655 (50.99%), Query Frame = 1

Query: 603  YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKK 662
            Y +S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +   LN L+KK
Sbjct: 613  YHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKK 672

Query: 663  NVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQNQR---- 722
            +V + W   Q  A   +K+ L S P+    +F     +E DAS V +GAVL Q       
Sbjct: 673  DVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKY 732

Query: 723  -PLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKHLRVQ 782
             P+ ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L   R+ 
Sbjct: 733  YPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG-RIT 792

Query: 783  NKL---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------RYVLL 842
            N+    N+R A+W  F++ F + I Y+ G  N +ADALSR                   +
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFV 852

Query: 843  NTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIPS-CS 902
            N ++        +   Y +D      + + +K  + +N  L DG L   K ++ +P+   
Sbjct: 853  NQISITDDFKNQVVTEYTNDTKLLNLLNNEDKR-VEENIQLKDGLLINSKDQILLPNDTQ 912

Query: 903  IRELLVREAHGGGLMAHHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKSRL-QP 962
            +   ++++ H  G + H G+    +++   F    +R  + +    C  C+  KSR  +P
Sbjct: 913  LTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKP 972

Query: 963  HGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHI 1022
            +G   P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  A+  
Sbjct: 973  YGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQT 1032

Query: 1023 ADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVN 1082
            A +F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N
Sbjct: 1033 ARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTN 1092

Query: 1083 RTMTAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPIDLLPI 1142
            +T+  +LR +   +  TW D +  ++ +YN  +HS T+ TPFEIV+ ++P L+P++L   
Sbjct: 1093 QTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSF 1152

Query: 1143 PSKEFVNFDANAKVEFVHKLHKQVKEQIEKQNSKVATRIN-KGRKIVIFKPGDWVWVHFR 1202
              K   N     +V       + VKE +   N K+    + K ++I  F+PGD V V   
Sbjct: 1153 SDKTDENSQETIQV------FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1212

Query: 1203 KERFPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPG--KYGVSATFSVVDLSPF 1226
            K  F   + +KL P   GPF VL++   N Y++DLP   K+  S+TF V  L  +
Sbjct: 1213 KTGF-LHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257

BLAST of CmoCh13G003030 vs. Swiss-Prot
Match: TF25_SCHPO (Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=Tf2-5 PE=3 SV=1)

HSP 1 Score: 333.2 bits (853), Expect = 1.3e-89
Identity = 211/655 (32.21%), Postives = 334/655 (50.99%), Query Frame = 1

Query: 603  YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKK 662
            Y +S  G    +E +  +  W  PKN  E+R F G  ++ R+FI   S +   LN L+KK
Sbjct: 613  YHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKK 672

Query: 663  NVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQNQR---- 722
            +V + W   Q  A   +K+ L S P+    +F     +E DAS V +GAVL Q       
Sbjct: 673  DVRWKWTPTQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKY 732

Query: 723  -PLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWP--KEFIIHTDHESLKHLRVQ 782
             P+ ++S K++ A L Y   DKE+ A++++L+ W+HYL    + F I TDH +L   R+ 
Sbjct: 733  YPVGYYSAKMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIG-RIT 792

Query: 783  NKL---NRRHAKWLEFIETFPYVIKYKQGKENIVADALSR---------------RYVLL 842
            N+    N+R A+W  F++ F + I Y+ G  N +ADALSR                   +
Sbjct: 793  NESEPENKRLARWQLFLQDFNFEINYRPGSANHIADALSRIVDETEPIPKDSEDNSINFV 852

Query: 843  NTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFR-KGKLCIPS-CS 902
            N ++        +   Y +D      + + +K  + +N  L DG L   K ++ +P+   
Sbjct: 853  NQISITDDFKNQVVTEYTNDTKLLNLLNNEDKR-VEENIQLKDGLLINSKDQILLPNDTQ 912

Query: 903  IRELLVREAHGGGLMAHHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKSRL-QP 962
            +   ++++ H  G + H G+    +++   F    +R  + +    C  C+  KSR  +P
Sbjct: 913  LTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQINKSRNHKP 972

Query: 963  HGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHI 1022
            +G   P+P    PW  +SMDF+  LP +  GY+++FVVVDRFSKMA  +PC K+  A+  
Sbjct: 973  YGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTKSITAEQT 1032

Query: 1023 ADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVN 1082
            A +F + V+   G PK I++D D  F S  W+    K    + +S    PQTDGQTE  N
Sbjct: 1033 ARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTN 1092

Query: 1083 RTMTAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP-LTPIDLLPI 1142
            +T+  +LR +   +  TW D +  ++ +YN  +HS T+ TPFEIV+ ++P L+P++L   
Sbjct: 1093 QTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALSPLELPSF 1152

Query: 1143 PSKEFVNFDANAKVEFVHKLHKQVKEQIEKQNSKVATRIN-KGRKIVIFKPGDWVWVHFR 1202
              K   N     +V       + VKE +   N K+    + K ++I  F+PGD V V   
Sbjct: 1153 SDKTDENSQETIQV------FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRT 1212

Query: 1203 KERFPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPG--KYGVSATFSVVDLSPF 1226
            K  F   + +KL P   GPF VL++   N Y++DLP   K+  S+TF V  L  +
Sbjct: 1213 KTGF-LHKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEKY 1257

BLAST of CmoCh13G003030 vs. TrEMBL
Match: A0A151UF56_CAJCA (Transposon Ty3-I Gag-Pol polyprotein OS=Cajanus cajan GN=KK1_049062 PE=4 SV=1)

HSP 1 Score: 963.4 bits (2489), Expect = 2.8e-277
Identity = 451/646 (69.81%), Postives = 541/646 (83.75%), Query Frame = 1

Query: 603  YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKK 662
            ++VSS GV+VDEEK++AI++WPTPKNVSEVRSFHGLASFYRRF+K+FST+A+ LNE+VKK
Sbjct: 150  FVVSSKGVQVDEEKIRAIQEWPTPKNVSEVRSFHGLASFYRRFVKDFSTLAAPLNEIVKK 209

Query: 663  NVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQNQRPLMF 722
            +V F W + QE AF+ LK KL++AP+ ALPNF  +FEIECDAS VGIGAVLMQ   P+ +
Sbjct: 210  HVGFKWGEKQEKAFSELKHKLTNAPILALPNFAKSFEIECDASNVGIGAVLMQEGHPIAY 269

Query: 723  FSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRH 782
            FSEKL GA+L YPTYDKELYALVRAL+TWQHYL PKEF+IH+DHESLK+L+ Q KLN+RH
Sbjct: 270  FSEKLNGAALNYPTYDKELYALVRALRTWQHYLLPKEFVIHSDHESLKYLKGQGKLNKRH 329

Query: 783  AKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAP 842
            AKW+EF+E FPYVIK+K+GK N+VADALSRR+ LL+ L  +L G E +KD+Y HD+ FA 
Sbjct: 330  AKWVEFLEQFPYVIKHKKGKGNVVADALSRRHNLLSMLETKLFGLESLKDMYMHDVDFAE 389

Query: 843  FVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDML 902
               +CEK    + Y   +GFLF+  +LC+P CSIRELLV E+H GGLM H GV KT ++L
Sbjct: 390  NFAACEK-FSENGYYRHNGFLFKANRLCVPKCSIRELLVSESHEGGLMGHFGVQKTLEIL 449

Query: 903  SEHFFLPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRT 962
             EHF+ P M+HDVHK C  CI CK+AKS+++PHGLY+PLPVP+ PWIDISMDFVLGLPRT
Sbjct: 450  QEHFYWPHMKHDVHKFCDHCIVCKKAKSKVKPHGLYTPLPVPDFPWIDISMDFVLGLPRT 509

Query: 963  RKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLS 1022
            + G DSIFVVVDRFSKMAHFIPC K +DA H+ADLFFREVVRLHG+P+SIVSDRD KFLS
Sbjct: 510  KNGKDSIFVVVDRFSKMAHFIPCKKVNDACHVADLFFREVVRLHGLPRSIVSDRDTKFLS 569

Query: 1023 HFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFA 1082
            HFWR LWGKLGTKL++STTCHPQTDGQTEVVNRT+  +LR ++ KN+K WE+ LP +EFA
Sbjct: 570  HFWRTLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLGTLLRTVLKKNIKFWEEHLPHVEFA 629

Query: 1083 YNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPS-KEFVNFDANAKVEFVHKLHKQVKEQI 1142
            YNR VHSTTKC+PFEIVYGFNPLTP+DLLP+P+  EF + DA AK E+V KLH+QVK QI
Sbjct: 630  YNRAVHSTTKCSPFEIVYGFNPLTPLDLLPMPNISEFKHKDAQAKAEYVKKLHEQVKAQI 689

Query: 1143 EKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDN 1202
            EK+      + NKGRK VIF+PGDWVWVH RKERFP QRKSKL PRGDGPFQVLE+INDN
Sbjct: 690  EKKIESYVKQANKGRKKVIFEPGDWVWVHMRKERFPEQRKSKLQPRGDGPFQVLEKINDN 749

Query: 1203 AYKIDLPGKYGVSATFSVVDLSPFDVGDG-LDSRTNPSQEGENDMN 1247
            AYKIDLPG+YGVS++F+V DL+ FD GD  +  R N +QEGEND++
Sbjct: 750  AYKIDLPGEYGVSSSFNVADLTHFDAGDEFIALRKNVAQEGENDVD 794

BLAST of CmoCh13G003030 vs. TrEMBL
Match: A5BSP7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_001042 PE=4 SV=1)

HSP 1 Score: 948.7 bits (2451), Expect = 7.1e-273
Identity = 485/868 (55.88%), Postives = 589/868 (67.86%), Query Frame = 1

Query: 465  QEFEHLFSEEMPSSLPPLKGLNTRLT-----------SFLARPFQTDQLIGLIQRRLKR- 524
            QE+E +F  ++PS LPP++G+  ++            ++ + P +T +L   ++  L + 
Sbjct: 472  QEYEDVFPNDVPSGLPPIRGIEXQIDFVSXATIPNRXAYRSNPEETKELQRQVEXLLTKG 531

Query: 525  --------------------------YKGKAINKITIKYRHPIPRLDDMLDELHGCSLFT 584
                                         + IN IT+KYRHPIPRLDDMLDELHG  +FT
Sbjct: 532  HVRESLSPCVVXVXLVXKKDGTWRMCVDXRXINNITVKYRHPIPRLDDMLDELHGSCVFT 591

Query: 585  KIDLKSGYHQIRMHIGDEWETAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLVS 644
            KIDLKSGYHQIRM  GDEW+TAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH LR ++  
Sbjct: 592  KIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHALRSFISR 651

Query: 645  SNGVEVDEEKVKAIKDWPTPKNVSE-VRSFHGLA-------------------SFYRRFI 704
               V  D+  V +       KN+ E +   H +                    SFYR F+
Sbjct: 652  FVVVYFDDILVYS-------KNLDEYINHLHCVLALCFLVMLLVXKELRWMRKSFYRXFV 711

Query: 705  KNFSTIASLLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASG 764
            K+FST+A  L E+VKK V F W    + AF  +KE L                       
Sbjct: 712  KDFSTLAVPLTEIVKKFVGFKWGSXXDRAFIXIKEMLC---------------------- 771

Query: 765  VGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDH 824
             GI A+LMQ +RP+ +FSEKL G +L YPTYDKELYALVRAL+TWQHYLWPKEF+IHTDH
Sbjct: 772  -GIRAILMQEKRPITYFSEKLNGTTLNYPTYDKELYALVRALETWQHYLWPKEFVIHTDH 831

Query: 825  ESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLG 884
            ESLKHL+ Q KLNRRHAKW+EFIETF YVIKYKQGKENIVADALSRRY L++TLNA+LLG
Sbjct: 832  ESLKHLKGQGKLNRRHAKWVEFIETFLYVIKYKQGKENIVADALSRRYALVSTLNAKLLG 891

Query: 885  FEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHG 944
            FE++K+LY +D  FA    +CEK                + +LC+P+ S+RELLVREAHG
Sbjct: 892  FEYVKELYANDDDFASVYGACEKTTF-------------ENRLCVPNSSMRELLVREAHG 951

Query: 945  GGLMAHHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNG 1004
            GGLM H GV KT D+L EHFF PKM+ DV + C RCI C+QAKSR+ PHGLY+PLPVP+ 
Sbjct: 952  GGLMGHFGVRKTLDVLHEHFFWPKMKCDVERACARCITCRQAKSRVLPHGLYTPLPVPSA 1011

Query: 1005 PWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLH 1064
            PW+DISMDFVLGLPR+R G  SIFVVVDRFSKM +FI CHK DDA HIA+LFFRE+VRLH
Sbjct: 1012 PWVDISMDFVLGLPRSRNGRXSIFVVVDRFSKMTYFISCHKIDDATHIANLFFREIVRLH 1071

Query: 1065 GIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIID 1124
            G+P+SIVSDRDVKFLS+FW+VLW KLGTKL++STTCHPQTDGQTEVVNRT++        
Sbjct: 1072 GVPRSIVSDRDVKFLSYFWKVLWRKLGTKLLFSTTCHPQTDGQTEVVNRTLSTF------ 1131

Query: 1125 KNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAK 1184
                                VHSTT  +PFEIVYGFNPLT +DLLP+   E  + D   K
Sbjct: 1132 --------------------VHSTTNFSPFEIVYGFNPLTSLDLLPLXVNEMXSLDGEKK 1191

Query: 1185 VEFVHKLHKQVKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLP 1244
            VE V KLH+ V++ IEK+N +  ++ NKG   V+F+PGDWVWVH RKERFPT+R SKL P
Sbjct: 1192 VEMVKKLHESVRKHIEKKNEQYVSKANKGXXQVLFEPGDWVWVHMRKERFPTRRXSKLHP 1251

Query: 1245 RGDGPFQVLERINDNAYKIDLPGKYGVSATFSVVDLSPFDVGDGLDSRTNPSQEGENDMN 1272
            RGDGPFQVLERINDNAYK+D+PG+Y +SATF+V DLSPFDVGD  DS TNP +E  ND N
Sbjct: 1252 RGDGPFQVLERINDNAYKLDIPGEYNISATFNVSDLSPFDVGD--DSXTNPFEEKGNDEN 1268

BLAST of CmoCh13G003030 vs. TrEMBL
Match: Q9LQH2_ARATH (F15O4.13 OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 933.7 bits (2412), Expect = 2.4e-268
Identity = 436/649 (67.18%), Postives = 531/649 (81.82%), Query Frame = 1

Query: 597  NHVLREYLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLL 656
            N V   ++VS++GV+VDEEKVKAI++WP+PK+V EVRSFHGLA FYRRF+K+FST+A+ L
Sbjct: 1105 NLVFLGFVVSTDGVKVDEEKVKAIREWPSPKSVGEVRSFHGLAGFYRRFVKDFSTLAAPL 1164

Query: 657  NELVKKNVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQN 716
             E++KKNV F WE+ QE AF  LKEKL+ AP+ +LP+F  TFEIECDASGVGIG VLMQ+
Sbjct: 1165 TEVIKKNVGFKWEQAQEDAFQALKEKLTHAPVLSLPDFLKTFEIECDASGVGIGVVLMQD 1224

Query: 717  QRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQN 776
            ++P+ +FSEKL GA+L YPTYDKELYALVRALQT QHYLWPKEF+IHTDHESLKHL+ Q 
Sbjct: 1225 KKPIAYFSEKLGGATLNYPTYDKELYALVRALQTGQHYLWPKEFVIHTDHESLKHLKGQQ 1284

Query: 777  KLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQH 836
            KLN+RHA+W+EFIETFPYVIKYK+GK+N+VADALSRRYVLL++L+A+LLGFEHIK LY +
Sbjct: 1285 KLNKRHARWVEFIETFPYVIKYKKGKDNVVADALSRRYVLLSSLDAKLLGFEHIKSLYAN 1344

Query: 837  DMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVS 896
            D  F     SCEK      Y   DGFLF   +LCIP+ S+REL +REAHGGGLM H GVS
Sbjct: 1345 DSDFEKIYSSCEK-FAFGKYYRHDGFLFYDNRLCIPNSSLRELFIREAHGGGLMGHFGVS 1404

Query: 897  KTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFV 956
            KT  ++ +HF  P M+ DV ++C RC  CKQAK++ QPHGLY+PLP+P+ PW DISMDFV
Sbjct: 1405 KTIKVMQDHFHWPHMKRDVERICERCPTCKQAKAKSQPHGLYTPLPIPSHPWNDISMDFV 1464

Query: 957  LGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDR 1016
            +GLPRTR G DSIFVVVDRFSKMAHFIPCHKTDDA HIA+LFFREVVRLHG+PK+IVSDR
Sbjct: 1465 VGLPRTRTGKDSIFVVVDRFSKMAHFIPCHKTDDAIHIANLFFREVVRLHGMPKTIVSDR 1524

Query: 1017 DVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCL 1076
            D KFLS+FW+ LW KLGTKL++STTCHPQTDGQTEVVNRT++ +LRA+I KNLKTWEDCL
Sbjct: 1525 DTKFLSYFWKTLWSKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRALIKKNLKTWEDCL 1584

Query: 1077 PFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAKVEFVHKLHKQ 1136
            P +EFAYN  +HS +K +PF+IVYGFNP TP+DL+P+P  E V+ D   K E V ++H+Q
Sbjct: 1585 PHVEFAYNHSMHSASKFSPFQIVYGFNPTTPLDLMPLPLSERVSLDGKKKAELVQQIHEQ 1644

Query: 1137 VKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLE 1196
             K+ IE++  + A   NK RK VIF  GD VW+H RKERFP +RKSKL+ R DGPF+VL+
Sbjct: 1645 AKKNIEEKTKQYAKHANKSRKEVIFNEGDLVWIHLRKERFPKERKSKLMSRIDGPFKVLK 1704

Query: 1197 RINDNAYKIDLPGKYGVSATFSVVDLSPFDVGDGLDSRTNPSQEGENDM 1246
            RIN+NAY +DL GKY VS +F+V DL PF + D  D R+NP Q GE+D+
Sbjct: 1705 RINNNAYSLDLQGKYNVSNSFNVADLFPF-IADNTDLRSNPFQLGEDDV 1751

BLAST of CmoCh13G003030 vs. TrEMBL
Match: Q8L7J3_MAIZE (Gag-pol polyprotein OS=Zea mays PE=4 SV=1)

HSP 1 Score: 932.6 bits (2409), Expect = 5.2e-268
Identity = 440/676 (65.09%), Postives = 533/676 (78.85%), Query Frame = 1

Query: 603  YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKK 662
            Y+V+  G+EVD+ KV+AI  WP PK +++VRSF GLA FYRRF+K+FSTIA+ LNEL KK
Sbjct: 922  YVVTPQGIEVDQAKVEAIHGWPMPKTITQVRSFLGLAGFYRRFVKDFSTIAAPLNELTKK 981

Query: 663  NVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQNQRPLMF 722
             V F W K QE AFN LK+KL+ APL  LP+F  TFE+ECDASG+G+G VL+Q  +P+ +
Sbjct: 982  GVHFSWGKVQEHAFNVLKDKLTHAPLLQLPDFNKTFELECDASGIGLGGVLLQEGKPVAY 1041

Query: 723  FSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRH 782
            FSEKL+G+ L Y TYDKELYALVR L+TWQHYLWPKEF+IH+DHESLKH+R Q KLNRRH
Sbjct: 1042 FSEKLSGSVLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIRSQGKLNRRH 1101

Query: 783  AKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAP 842
            AKW+EFIE+FPYVIK+K+GKENI+ADALSRRY LLN L+ ++ G E IKD Y HD  F  
Sbjct: 1102 AKWVEFIESFPYVIKHKKGKENIIADALSRRYTLLNQLDYKIFGLETIKDQYVHDADFKD 1161

Query: 843  FVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDML 902
             +  C+ G   + Y++ DGF+FR  KLCIP+ S+R LL++EAHGGGLM H G  KT D+L
Sbjct: 1162 VLLHCKDGKGWNKYIVSDGFVFRANKLCIPASSVRLLLLQEAHGGGLMGHFGAKKTEDIL 1221

Query: 903  SEHFFLPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRT 962
            + HFF PKMR DV ++  RC  C++AKSRL PHGLY PLPVP+ PW DISMDFVLGLPRT
Sbjct: 1222 AGHFFWPKMRRDVVRLVARCTTCQKAKSRLNPHGLYLPLPVPSAPWEDISMDFVLGLPRT 1281

Query: 963  RKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLS 1022
            RKG DS+FVVVDRFSKMAHFIPCHKTDDA HIADLFFRE+VRLHG+P +IVSDRD KFLS
Sbjct: 1282 RKGRDSVFVVVDRFSKMAHFIPCHKTDDATHIADLFFREIVRLHGVPNTIVSDRDAKFLS 1341

Query: 1023 HFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFA 1082
            HFWR LW KLGTKL++STTCHPQTDGQTEVVNRT++ MLRA++ KN+K WEDCLP IEFA
Sbjct: 1342 HFWRTLWAKLGTKLLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNIKMWEDCLPHIEFA 1401

Query: 1083 YNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAKVEFVHKLHKQVKEQIE 1142
            YNR +HSTTK  PF+IVYG  P  PIDL+P+PS E +NFDA  + E + KLH+  KE IE
Sbjct: 1402 YNRSLHSTTKMCPFQIVYGLLPRAPIDLMPLPSSEKLNFDATRRAELMLKLHETTKENIE 1461

Query: 1143 KQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDNA 1202
            + N++     +KGRK + F+PGD VW+H RKERFP  RKSKLLPR DGPF+VLE+INDNA
Sbjct: 1462 RMNARYKFASDKGRKEINFEPGDLVWLHLRKERFPELRKSKLLPRADGPFKVLEKINDNA 1521

Query: 1203 YKIDLPGKYGVSATFSVVDLSPFDVGD--GLDSRTNPSQEGENDMN-HDQGISIP----- 1262
            Y++DLP  +GVS TF++ DL P+ +G+   L+SRT   QEGEND + H    SIP     
Sbjct: 1522 YRLDLPADFGVSPTFNIADLKPY-LGEEVELESRTTQMQEGENDEDIHTTDASIPIQVPI 1581

Query: 1263 QGPITRTRAKKLQQTI 1271
             GPITR RA++L   +
Sbjct: 1582 SGPITRARARQLNHQV 1596

BLAST of CmoCh13G003030 vs. TrEMBL
Match: A4K7M3_ORYSJ (Putative polyprotein OS=Oryza sativa subsp. japonica PE=4 SV=1)

HSP 1 Score: 929.1 bits (2400), Expect = 5.8e-267
Identity = 437/686 (63.70%), Postives = 538/686 (78.43%), Query Frame = 1

Query: 603  YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKK 662
            ++VS  G++VDE KVKAIKDWPTP+NVS+V+SF GLA FYRRF++ FSTIA+ LNEL KK
Sbjct: 922  FVVSGLGIQVDESKVKAIKDWPTPENVSQVKSFRGLAGFYRRFVRGFSTIAAPLNELTKK 981

Query: 663  NVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQNQRPLMF 722
             V+F W + QE AF  LK++LS  PL  LP+F  TFE+ECDASG+GIG VLMQN +P+ +
Sbjct: 982  GVAFQWGEPQEKAFQELKKRLSEGPLLVLPDFTKTFEVECDASGIGIGGVLMQNGQPVAY 1041

Query: 723  FSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRH 782
            FSEKL GA L Y  YDKELYALVRAL+TWQHYLWPKEF+IH+DHE+LK+L+ Q KLNRRH
Sbjct: 1042 FSEKLGGAQLNYSVYDKELYALVRALETWQHYLWPKEFVIHSDHEALKYLKGQAKLNRRH 1101

Query: 783  AKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAP 842
            AKW+EFIETFPYV+KYK+GKENIVADALSR+ VLLN L  ++ G E IK+LY  D+ F+ 
Sbjct: 1102 AKWVEFIETFPYVVKYKKGKENIVADALSRKNVLLNQLEVKVTGIESIKELYSADLDFSE 1161

Query: 843  FVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDML 902
                C  G   + Y + DGFLFR  KLC+P CS+R LL++E H GGLM H G  KTYDML
Sbjct: 1162 PYAKCTAGKGWEKYHIHDGFLFRANKLCVPHCSVRLLLLQETHAGGLMGHFGWRKTYDML 1221

Query: 903  SEHFFLPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRT 962
            ++HF+ PKMR DV ++  RC+ C +AKS+L PHGLY+PLPVP+ PW DISMDFVLGLPRT
Sbjct: 1222 ADHFYWPKMRRDVQRLVQRCVTCHKAKSKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRT 1281

Query: 963  RKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLS 1022
            ++G DSIFVVVDRFSKMAHFIPCHK+DDA HIA LFF E+VRLHG+PK+IVSDRD KFLS
Sbjct: 1282 KRGRDSIFVVVDRFSKMAHFIPCHKSDDASHIASLFFSEIVRLHGMPKTIVSDRDTKFLS 1341

Query: 1023 HFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFA 1082
            +FW+ LW KLGT+L++STTCHPQTDGQTEVVNRT++ +LRA+I KNLK WE+CLP +EFA
Sbjct: 1342 YFWKTLWAKLGTRLLFSTTCHPQTDGQTEVVNRTLSMLLRALIKKNLKEWEECLPHVEFA 1401

Query: 1083 YNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAKVEFVHKLHKQVKEQIE 1142
            YNR VHSTT   PFE+VYGF PL+PIDLLP+P +E  + +A+ +  +V K+H++ KE IE
Sbjct: 1402 YNRAVHSTTNMCPFEVVYGFKPLSPIDLLPLPLQERSDMEASKRATYVKKIHEKTKEAIE 1461

Query: 1143 KQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDNA 1202
            K++   A   NK RK V F+PGD VWVH RK+RFP +RKSKL+PRGDGPF+VL +INDNA
Sbjct: 1462 KRSKYYAAWANKNRKKVTFEPGDLVWVHLRKDRFPQKRKSKLMPRGDGPFRVLSKINDNA 1521

Query: 1203 YKIDLPGKYGVSATFSVVDLSP-FDVGDGLDSRTNPSQEGEND-----MNHDQGISIP-- 1262
            YKI+LP  YGVS+TF+V DL+P F + D   SR+ P QEGE+D     ++       P  
Sbjct: 1522 YKIELPEDYGVSSTFNVADLTPFFGLEDSESSRSTPFQEGEDDEDIPTVHATSSTKQPSS 1581

Query: 1263 ------QGPITRTRAKKLQQTINKHI 1275
                  QGP+TR+RAKKLQ  +N  +
Sbjct: 1582 NTKDTIQGPLTRSRAKKLQVQVNSFL 1607

BLAST of CmoCh13G003030 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 85.1 bits (209), Expect = 3.4e-16
Identity = 41/96 (42.71%), Postives = 60/96 (62.50%), Query Frame = 1

Query: 603 YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKK 662
           +++S  GV  D  K++A+  WP PKN +E+R F GL  +YRRF+KN+  I   L EL+KK
Sbjct: 37  HIISGEGVSADPAKLEAMVGWPEPKNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKK 96

Query: 663 NVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTF 699
           N S  W +   LAF  LK  +++ P+ ALP+ +  F
Sbjct: 97  N-SLKWTEMAALAFKALKGAVTTLPVLALPDLKLPF 131

BLAST of CmoCh13G003030 vs. NCBI nr
Match: gi|823145097|ref|XP_012472412.1| (PREDICTED: uncharacterized protein LOC105789586 [Gossypium raimondii])

HSP 1 Score: 1289.6 bits (3336), Expect = 0.0e+00
Identity = 651/1230 (52.93%), Postives = 866/1230 (70.41%), Query Frame = 1

Query: 4    VQRVHEEAFCSTIFSRDMAQKLQALKQGRKSVEDYYKEMDTLMDRLELDEDMEALMARFL 63
            ++ V  + F  + + R++ Q+LQ L QG +SVEDYYK+M+  M R +++ED EA MARFL
Sbjct: 303  MKAVMRKRFVPSYYHRELYQRLQNLTQGNRSVEDYYKDMEIAMIRADVEEDREATMARFL 362

Query: 64   NGLNTEIADKTDLQPYSNIEELLHIAIKIERQIQQRSQRYSSKTFPN-STSTWKKDSKNI 123
             GLN +IA+  + Q Y  + +++H+AIK+E+Q++++     ++T+P  ST+ W + +   
Sbjct: 363  AGLNRDIANIVEFQHYVEVMDMVHMAIKVEKQLKRKGP---TQTYPTTSTNKWAQGTSKA 422

Query: 124  DYKHRNQEINEKPQAKFEKGESSRTGKEKVEKSNVRNRDLKCWRCQGVGHYSRDCPNARI 183
              + +   +  KP        S+   K K E  +  +RD+KC++CQG GH +  CPN R+
Sbjct: 423  PNRPKKPFVAAKPNQV-----SADASKNKNEAVSNHSRDIKCFKCQGRGHIASQCPNRRV 482

Query: 184  MTIKE-GEIVTDDEAHDDINEETDESEEFSEEDPTHISLVTRQALNTHIKEDGLDQRENL 243
            M ++  GEI ++DE  ++     +E EE        + LV +++LN  + ++   QR+N+
Sbjct: 483  MVVRSNGEIESEDEQEEEPEIPMEEGEELELPVEGEL-LVVKRSLNIQVAKEE-QQRDNI 542

Query: 244  FQTRCLVQSVPCSVVIDSGSCTNVVSSILVKRLNLKTQPHPRPYKLQWLNDCGEVRITQQ 303
            F TRC VQ   CS++ID GSCTNV SS+LV++L L T  HP PYKLQWLND GE+++T+Q
Sbjct: 543  FHTRCHVQGKVCSLIIDGGSCTNVASSLLVEKLGLATTKHPTPYKLQWLNDGGELKVTKQ 602

Query: 304  TLVSFTIGKYVDDVLCDVVSMHVGDLLLGRPWQFDRRVMYDGYANRYSFTHNGRKTTLIP 363
              V+F+IGKY D+V+CDVV MH G LLLGRPWQFDRRV++DGY NRYSF H GR  TL P
Sbjct: 603  ARVAFSIGKYQDEVVCDVVPMHAGHLLLGRPWQFDRRVVHDGYTNRYSFKHLGRNVTLAP 662

Query: 364  LSPKNVFIDHCKLEKKRQEVDAKAEIEKESSEKKSLREKQESNTQPREKKERKAKS--VS 423
            L+PK V  D  K+++  +    K + +K   +KK   ++ E  T+  ++KE++ ++   S
Sbjct: 663  LTPKQVHEDQLKMKQSIEREKEKEKNKKSEKKKKEKNDESEIKTRVTKEKEQECENEKTS 722

Query: 424  LYVRSSEARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVLLQEFEHLFSEEMPSSL 483
            ++ R  E R ++L+ Q I V M K   + TN L  +LP+  V LLQEF  +F EE+P+ L
Sbjct: 723  VFARKREIRKLMLARQPIFVPMYKECLFETNELENTLPTPIVSLLQEFGDIFPEEVPNGL 782

Query: 484  PPLKGLNTRLT-----------SFLARPFQTDQLI---------GLIQRRLK-------- 543
            PP++G+  ++            ++ + P +T +L          G I+  L         
Sbjct: 783  PPIRGIEHQIDFVPGAAIPNRPAYRSNPEETKELEKQVAELMEKGYIRESLSPCAVPVLL 842

Query: 544  --RYKG--------KAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHI 603
              +  G        +AINKITIKYRHPIPRLD+MLDEL G  LF+KIDLKSGYHQIRM  
Sbjct: 843  VPKKDGSWRMCVDYRAINKITIKYRHPIPRLDNMLDELSGAQLFSKIDLKSGYHQIRMRE 902

Query: 604  GDEWETAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLVSSNGVEVDEEKV--KA 663
            GDEW+TAFKTKYGLYEWLVMPFGLTNAPSTFMRLMN+VLR ++     V  D+  V  K+
Sbjct: 903  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNYVLRSFIGRFCVVYFDDILVYSKS 962

Query: 664  IKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKKNVSFIWEKDQELAFNTL 723
            ++D         ++    +    R   K F      L  ++KKN  F+W  +QE +FN L
Sbjct: 963  LED--------HIQHLRAVLEVLR---KEFCCCP--LTGIIKKNSPFVWTDEQENSFNKL 1022

Query: 724  KEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDK 783
            KE L++APL +LP+F  TFEIECDASG+GIGA LMQ+ RP+ +FSEKL GA+L YPTYDK
Sbjct: 1023 KECLTNAPLLSLPDFNKTFEIECDASGIGIGAALMQDGRPIAYFSEKLNGATLNYPTYDK 1082

Query: 784  ELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYK 843
            ELYALVRAL+TWQHYLWPKEF+IH+DHE+LK+L+ Q KLN+RHAKW+E++E+F YVIKYK
Sbjct: 1083 ELYALVRALETWQHYLWPKEFVIHSDHEALKNLKGQTKLNKRHAKWVEYLESFLYVIKYK 1142

Query: 844  QGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLL 903
            +GKEN+VADALSRRY L+N ++++LLGFE +KDLY+ D  F    ESC  G   + Y   
Sbjct: 1143 KGKENVVADALSRRYALVNLMDSKLLGFEFLKDLYKSDADFGEIYESCSHGA-GEKYFQH 1202

Query: 904  DGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFLPKMRHDVHKVC 963
            +G+LFR+GKLC+P  S+R +LV EAH GGLM H G++KT  +L EHF+ PKM+ DV + C
Sbjct: 1203 EGYLFREGKLCVPQSSVRNVLVEEAHSGGLMGHFGIAKTLAILHEHFYWPKMKRDVIRKC 1262

Query: 964  GRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKM 1023
             RCI CK+AKSR++PHGLY+PLP+P+ PW+DISMDFVLGLPRT++G DSIFVVVDRFSKM
Sbjct: 1263 DRCITCKKAKSRIKPHGLYTPLPIPDAPWVDISMDFVLGLPRTKRGRDSIFVVVDRFSKM 1322

Query: 1024 AHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYS 1083
            AHFIPC+KTDDA ++A+LFF+EVVRLHGIP++IVSDRD KFLSHFWR LWGKLGTKL++ 
Sbjct: 1323 AHFIPCNKTDDATNVANLFFKEVVRLHGIPRTIVSDRDTKFLSHFWRTLWGKLGTKLLFL 1382

Query: 1084 TTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIV 1143
            TTCHPQTDGQTEVVNR ++ +LRAI+ KNLK WEDCLP +EFAYNR VHS TK +PFE+V
Sbjct: 1383 TTCHPQTDGQTEVVNRVLSTLLRAILKKNLKMWEDCLPHVEFAYNRTVHSATKFSPFEVV 1442

Query: 1144 YGFNPLTPIDLLPIPSKEFVNFDANAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKIV 1190
            YGFNP+TP+DL+ +PS E V+ D   K EFV +LHK VK+ IE++  +      KGRK V
Sbjct: 1443 YGFNPITPLDLISMPSNELVHVDGKKKAEFVKQLHKNVKDNIERRTEQYVRGAKKGRKRV 1502

BLAST of CmoCh13G003030 vs. NCBI nr
Match: gi|923748122|ref|XP_013673383.1| (PREDICTED: uncharacterized protein LOC106377669 [Brassica napus])

HSP 1 Score: 975.7 bits (2521), Expect = 7.8e-281
Identity = 470/826 (56.90%), Postives = 596/826 (72.15%), Query Frame = 1

Query: 426  EARNVLLSNQTILVLMCKGSCYFTNMLNPSLPSDFVVLLQEFEHLFSEEMPSSLPPLKGL 485
            + +  +  +  +L+++ K  CY   +  P +P     L+  ++ +F +E+P+ LPP++G+
Sbjct: 130  DMQKAMTQSGQVLLMIFKEGCY-AGLEAPEVPDVVQDLMGRYKDVFPDEIPAGLPPVRGI 189

Query: 486  NTRL-----------TSFLARPFQTDQLIGLIQRRLKR---------------------- 545
              ++            ++   P +  +L   +Q  + +                      
Sbjct: 190  EHQIDLVPGAPLPNRAAYRVNPEEAKELERQVQDLMDKGYIRESLSPCAVPVLLVPKKDG 249

Query: 546  -----YKGKAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWET 605
                    +AIN ITIKYR+PIPRLDDMLDEL G  +F+KIDL+SGYHQ+RM  GDEW+T
Sbjct: 250  TWRMCVDCRAINNITIKYRYPIPRLDDMLDELSGSVVFSKIDLRSGYHQVRMKEGDEWKT 309

Query: 606  AFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYL---VSSNGVEVDEEKVKAIKDWP 665
            AFKTK GLYEWLVMPFGLTNAPSTFMRLMN VLR Y+   VSS G++VDEEK+KAI+DWP
Sbjct: 310  AFKTKQGLYEWLVMPFGLTNAPSTFMRLMNEVLRPYIGFVVSSQGLKVDEEKIKAIQDWP 369

Query: 666  TPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKKNVSFIWEKDQELAFNTLKEKLS 725
            TP  +   RSFHGLASFYRRF+K+FSTIA+ +  ++KKNV+F W   QE +FN LK  L+
Sbjct: 370  TPTTIGHTRSFHGLASFYRRFVKDFSTIAAPMTSVIKKNVTFAWGPAQEESFNRLKYSLT 429

Query: 726  SAPLRALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYAL 785
             AP+  LPNF+  FEIECDASG GIGAVL Q  RP+ FFSEKL+GA+L YPTYDKELYAL
Sbjct: 430  HAPVLTLPNFDKPFEIECDASGTGIGAVLTQGGRPVAFFSEKLSGAALNYPTYDKELYAL 489

Query: 786  VRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKEN 845
            VR+L+TW+HYL  KEFIIHTDHE+LKHLR Q  L +RHA+WLEF+ETFPYVIKYK+GK+N
Sbjct: 490  VRSLETWRHYLLSKEFIIHTDHETLKHLRGQTTLKKRHARWLEFVETFPYVIKYKKGKDN 549

Query: 846  IVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLF 905
            IVADALSRR+ L+ T+ A+++GFE +K  Y+ D  F+    +  KG +   Y   DGFLF
Sbjct: 550  IVADALSRRHTLITTMEAKIMGFEQLKMSYETDPDFSELYSNTAKGAMGPFYQQ-DGFLF 609

Query: 906  RKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIA 965
            ++ +LCIP  S+R+LL REAHGGGLM H G  KT  +LS+HF+ P+MR  V  +C +C  
Sbjct: 610  KEKRLCIPHGSMRDLLTREAHGGGLMGHFGRDKTLSVLSDHFYWPRMRRYVESLCAKCTV 669

Query: 966  CKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIP 1025
            C + KSR  P+GL+ PLP+P  PW+D+SMDFVLGLP+     DSIFVVVDRFSKMAHFIP
Sbjct: 670  CLKTKSRSHPYGLHMPLPIPIAPWVDLSMDFVLGLPKIEHK-DSIFVVVDRFSKMAHFIP 729

Query: 1026 CHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHP 1085
            C KT+DA   ADLFF+EVVRLHG+P++IVSDRD KFLSHFWR LW KLGTKL++STTCHP
Sbjct: 730  CSKTNDASQTADLFFKEVVRLHGVPRTIVSDRDRKFLSHFWRTLWKKLGTKLLFSTTCHP 789

Query: 1086 QTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNP 1145
            QTDGQTEVVNRT++ +LRA + KNLK W   LPFIEFAYN   HS    +PFEIVYGF P
Sbjct: 790  QTDGQTEVVNRTLSQLLRATVGKNLKNWLSLLPFIEFAYNHARHSINNLSPFEIVYGFRP 849

Query: 1146 LTPIDLLPIPSKEFVNFDANAKVEFVHKLHKQVKEQIEKQNSKVATRINKGRKIVIFKPG 1205
             TP+DL  +P    V+   + K EFV  +H+QVKE +EK+  +   ++NKG   VIF+PG
Sbjct: 850  ETPMDLAALPPAMQVSQSGDKKAEFVKNMHRQVKETLEKKAERNRIKLNKGSTEVIFQPG 909

Query: 1206 DWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDNAYKIDLPGK 1211
            DWVW+H R ERFP +RKSKL PRGDGPF+VL+RINDNAY+++LP +
Sbjct: 910  DWVWLHMRPERFPEERKSKLAPRGDGPFRVLDRINDNAYRLELPDR 952

BLAST of CmoCh13G003030 vs. NCBI nr
Match: gi|1012366710|gb|KYP77891.1| (Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan])

HSP 1 Score: 963.4 bits (2489), Expect = 4.0e-277
Identity = 451/646 (69.81%), Postives = 541/646 (83.75%), Query Frame = 1

Query: 603  YLVSSNGVEVDEEKVKAIKDWPTPKNVSEVRSFHGLASFYRRFIKNFSTIASLLNELVKK 662
            ++VSS GV+VDEEK++AI++WPTPKNVSEVRSFHGLASFYRRF+K+FST+A+ LNE+VKK
Sbjct: 150  FVVSSKGVQVDEEKIRAIQEWPTPKNVSEVRSFHGLASFYRRFVKDFSTLAAPLNEIVKK 209

Query: 663  NVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASGVGIGAVLMQNQRPLMF 722
            +V F W + QE AF+ LK KL++AP+ ALPNF  +FEIECDAS VGIGAVLMQ   P+ +
Sbjct: 210  HVGFKWGEKQEKAFSELKHKLTNAPILALPNFAKSFEIECDASNVGIGAVLMQEGHPIAY 269

Query: 723  FSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRRH 782
            FSEKL GA+L YPTYDKELYALVRAL+TWQHYL PKEF+IH+DHESLK+L+ Q KLN+RH
Sbjct: 270  FSEKLNGAALNYPTYDKELYALVRALRTWQHYLLPKEFVIHSDHESLKYLKGQGKLNKRH 329

Query: 783  AKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLGFEHIKDLYQHDMFFAP 842
            AKW+EF+E FPYVIK+K+GK N+VADALSRR+ LL+ L  +L G E +KD+Y HD+ FA 
Sbjct: 330  AKWVEFLEQFPYVIKHKKGKGNVVADALSRRHNLLSMLETKLFGLESLKDMYMHDVDFAE 389

Query: 843  FVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHGGGLMAHHGVSKTYDML 902
               +CEK    + Y   +GFLF+  +LC+P CSIRELLV E+H GGLM H GV KT ++L
Sbjct: 390  NFAACEK-FSENGYYRHNGFLFKANRLCVPKCSIRELLVSESHEGGLMGHFGVQKTLEIL 449

Query: 903  SEHFFLPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNGPWIDISMDFVLGLPRT 962
             EHF+ P M+HDVHK C  CI CK+AKS+++PHGLY+PLPVP+ PWIDISMDFVLGLPRT
Sbjct: 450  QEHFYWPHMKHDVHKFCDHCIVCKKAKSKVKPHGLYTPLPVPDFPWIDISMDFVLGLPRT 509

Query: 963  RKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLHGIPKSIVSDRDVKFLS 1022
            + G DSIFVVVDRFSKMAHFIPC K +DA H+ADLFFREVVRLHG+P+SIVSDRD KFLS
Sbjct: 510  KNGKDSIFVVVDRFSKMAHFIPCKKVNDACHVADLFFREVVRLHGLPRSIVSDRDTKFLS 569

Query: 1023 HFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIIDKNLKTWEDCLPFIEFA 1082
            HFWR LWGKLGTKL++STTCHPQTDGQTEVVNRT+  +LR ++ KN+K WE+ LP +EFA
Sbjct: 570  HFWRTLWGKLGTKLLFSTTCHPQTDGQTEVVNRTLGTLLRTVLKKNIKFWEEHLPHVEFA 629

Query: 1083 YNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPS-KEFVNFDANAKVEFVHKLHKQVKEQI 1142
            YNR VHSTTKC+PFEIVYGFNPLTP+DLLP+P+  EF + DA AK E+V KLH+QVK QI
Sbjct: 630  YNRAVHSTTKCSPFEIVYGFNPLTPLDLLPMPNISEFKHKDAQAKAEYVKKLHEQVKAQI 689

Query: 1143 EKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLPRGDGPFQVLERINDN 1202
            EK+      + NKGRK VIF+PGDWVWVH RKERFP QRKSKL PRGDGPFQVLE+INDN
Sbjct: 690  EKKIESYVKQANKGRKKVIFEPGDWVWVHMRKERFPEQRKSKLQPRGDGPFQVLEKINDN 749

Query: 1203 AYKIDLPGKYGVSATFSVVDLSPFDVGDG-LDSRTNPSQEGENDMN 1247
            AYKIDLPG+YGVS++F+V DL+ FD GD  +  R N +QEGEND++
Sbjct: 750  AYKIDLPGEYGVSSSFNVADLTHFDAGDEFIALRKNVAQEGENDVD 794

BLAST of CmoCh13G003030 vs. NCBI nr
Match: gi|727521082|ref|XP_010436772.1| (PREDICTED: uncharacterized protein LOC104720585 [Camelina sativa])

HSP 1 Score: 954.1 bits (2465), Expect = 2.4e-274
Identity = 514/1093 (47.03%), Postives = 689/1093 (63.04%), Query Frame = 1

Query: 17   FSRDMAQKLQALKQGRKSVEDYYKEMDTLMDRLELDEDMEALMARFLNGLNTEIADKTDL 76
            + RD+ ++ + L QG ++V++Y++E + LM+ LEL+E  E+L+A+F++GL   IA K ++
Sbjct: 187  YQRDLQKRFRKLSQGTRTVDEYFEEFEKLMNSLELEESSESLIAQFIDGLQERIARKVEI 246

Query: 77   QPYSNIEELLHIAIKIERQIQQRSQRYSSKTFPNSTSTWK-KDSKNIDYKHRNQEINEKP 136
              Y+++ ELLH A+++E+QIQ++    S +    + + W   +++++D   + + +    
Sbjct: 247  SNYNSLHELLHKAVQVEQQIQRKM---SLRNRTRNNTPWNASNNRSMD---KGKIVENDS 306

Query: 137  QAKFEKGESSRTGKEKVEK----SNVRNRDLKCWRCQGVGHYSRDCPNARIMTIKEGEIV 196
            + K +  E+ +T K K  K    S  R RD+ C++CQG GH +R+CPN R+M +      
Sbjct: 307  RFKNKSNEAPKTSKPKPGKFPNTSQSRTRDITCFKCQGRGHMARECPNQRVMIVTPS--- 366

Query: 197  TDDEAHDDINEETDESEEFSEEDPTHISLVTRQALNTHIKEDGLDQRENLFQTRCLVQSV 256
             D E+ D  +E   + E   E   T   LV ++ L+  +      QREN+F TRC +++ 
Sbjct: 367  GDYESQDKQDEYQTDPENDVEYPDTRELLVIQRILSVLVNPKEKVQRENIFHTRCKIKNK 426

Query: 257  PCSVVIDSGSCTNVVSSILVKRLNLKTQPHPRPYKLQWLNDCGEVRITQQTLVSFTIGKY 316
             C+++ID GSCTNV S  +V RL L+   HPRPYKL+ LN+  E+ I +Q +VSF++GKY
Sbjct: 427  VCNLIIDGGSCTNVSSKYMVDRLGLQKTKHPRPYKLRLLNNDTELNIAEQVIVSFSVGKY 486

Query: 317  VDDVLCDVVSMHVGDLLLGRPWQFDRRVMYDGYANRYSFTHNGRKTTLIPLSPKNVFIDH 376
             D V+CDVV M  G LLLGRPWQFDR   + G  N Y+FTHN  K  L PLSP  V    
Sbjct: 487  QDQVICDVVPMRAGHLLLGRPWQFDRATTHVGRTNHYTFTHNDCKFNLAPLSPSEVH--- 546

Query: 377  CKLEKKRQEVDAKAEIEKESSEKKSLREKQESNTQPREKKERKAKSVSLYVRSSEARNVL 436
                          E++K  +++  +R            K  +AK   L +   E     
Sbjct: 547  --------------ELQKHMNKEVEVRTSNLYLRSIEVCKTMRAKGTVLLMMFKE----- 606

Query: 437  LSNQTILVLMCKGSCYFTNMLNPSLPSDFVVLLQEFEHLFSEEMPSSLPPLKGLNTRL-- 496
                          C  T      LP++   +L +++ +F EE+P  LPP+ G+  ++  
Sbjct: 607  --------------CLSTGTSELELPAEVQAVLGQYKDVFPEEIPPGLPPICGIEHQIDL 666

Query: 497  ---------TSFLARPFQTDQLIGLIQRRLKR---------------------------Y 556
                      ++   P ++ +L   ++  + +                            
Sbjct: 667  VPGSALPNKPAYRMNPEESKELEKQVRELMDKGYIRESLSPCAVPVLLVPKKDGTWRMCV 726

Query: 557  KGKAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIRMHIGDEWETAFKTKY 616
              +AIN ITIKYR PIPRLDDMLDEL G  +F+KIDL+SGY+Q+RM  GDEW+TA KTK 
Sbjct: 727  DCRAINNITIKYRDPIPRLDDMLDELSGAIVFSKIDLRSGYNQVRMREGDEWKTALKTKQ 786

Query: 617  GLYEWLVMPFGLTNAPSTFMRLMNHVLREYLVSSNGVEVDEEKVKAIKDWPTPKNVSEVR 676
            GLYEWLVMPFGLTNAPSTFMRLMN VLR ++VS  G++VDEEK+KAI++WPTP ++    
Sbjct: 787  GLYEWLVMPFGLTNAPSTFMRLMNQVLRSFIVSKQGLQVDEEKIKAIREWPTPTSIG--- 846

Query: 677  SFHGLASFYRRFIKNFSTIASLLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLRALPN 736
              HG A       K+F+ +   L +                           AP+ AL +
Sbjct: 847  --HGEAQ-----EKSFNILKERLTQ---------------------------APVLALSD 906

Query: 737  FESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQH 796
            FE  FE+ECDASG+GIGAVL Q +RP+ FFSEKL+G +L YPTYDKELYAL RAL+TWQH
Sbjct: 907  FEVMFEVECDASGLGIGAVLHQMKRPVAFFSEKLSGPTLNYPTYDKELYALGRALETWQH 966

Query: 797  YLWPKEFIIHTDHESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRR 856
            YL  KEFIIHTDHE+LKHL+ Q  L RRHAKWLEFIETFPYVIKYK+GKEN+VADALSRR
Sbjct: 967  YLLSKEFIIHTDHETLKHLKGQTSLKRRHAKWLEFIETFPYVIKYKKGKENVVADALSRR 1026

Query: 857  YVLLNTLNARLLGFEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPS 916
            + L+ T+ A+++GFEHIK+LY+ D       +   KG   + Y L DGFLFR  +LCIP 
Sbjct: 1027 HALIATMEAKVMGFEHIKELYKDDPELGECYKEYGKGAYQEFY-LQDGFLFRDKRLCIPQ 1086

Query: 917  CSIRELLVREAHGGGLMAHHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKSRLQ 976
             S+REL++ EAHGGGLM H GV KT  ++ EHFF P ++  V + C RCI C +AKSRL 
Sbjct: 1087 GSMRELILTEAHGGGLMGHFGVDKTLAVVMEHFFWPHLKKHVERFCARCIVCHKAKSRLH 1146

Query: 977  PHGLYSPLPVPNGPWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKH 1036
            PHGLY PLP+PN PW+DISMDFVLGLP+  K  DSIFVVVD FSKMAHFIPC KT+DA  
Sbjct: 1147 PHGLYLPLPIPNAPWVDISMDFVLGLPKI-KHKDSIFVVVDWFSKMAHFIPCDKTNDATQ 1195

Query: 1037 IADLFFREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVV 1067
             A+LFF+EVVRLHGIP++IVSDRD KFLSHFW+ LW KLGTKL++STTCHPQTDGQTEVV
Sbjct: 1207 TANLFFKEVVRLHGIPRTIVSDRDTKFLSHFWKTLWRKLGTKLLFSTTCHPQTDGQTEVV 1195

BLAST of CmoCh13G003030 vs. NCBI nr
Match: gi|147800031|emb|CAN74973.1| (hypothetical protein VITISV_001042 [Vitis vinifera])

HSP 1 Score: 948.7 bits (2451), Expect = 1.0e-272
Identity = 485/868 (55.88%), Postives = 589/868 (67.86%), Query Frame = 1

Query: 465  QEFEHLFSEEMPSSLPPLKGLNTRLT-----------SFLARPFQTDQLIGLIQRRLKR- 524
            QE+E +F  ++PS LPP++G+  ++            ++ + P +T +L   ++  L + 
Sbjct: 472  QEYEDVFPNDVPSGLPPIRGIEXQIDFVSXATIPNRXAYRSNPEETKELQRQVEXLLTKG 531

Query: 525  --------------------------YKGKAINKITIKYRHPIPRLDDMLDELHGCSLFT 584
                                         + IN IT+KYRHPIPRLDDMLDELHG  +FT
Sbjct: 532  HVRESLSPCVVXVXLVXKKDGTWRMCVDXRXINNITVKYRHPIPRLDDMLDELHGSCVFT 591

Query: 585  KIDLKSGYHQIRMHIGDEWETAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHVLREYLVS 644
            KIDLKSGYHQIRM  GDEW+TAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH LR ++  
Sbjct: 592  KIDLKSGYHQIRMKEGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHALRSFISR 651

Query: 645  SNGVEVDEEKVKAIKDWPTPKNVSE-VRSFHGLA-------------------SFYRRFI 704
               V  D+  V +       KN+ E +   H +                    SFYR F+
Sbjct: 652  FVVVYFDDILVYS-------KNLDEYINHLHCVLALCFLVMLLVXKELRWMRKSFYRXFV 711

Query: 705  KNFSTIASLLNELVKKNVSFIWEKDQELAFNTLKEKLSSAPLRALPNFESTFEIECDASG 764
            K+FST+A  L E+VKK V F W    + AF  +KE L                       
Sbjct: 712  KDFSTLAVPLTEIVKKFVGFKWGSXXDRAFIXIKEMLC---------------------- 771

Query: 765  VGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKELYALVRALQTWQHYLWPKEFIIHTDH 824
             GI A+LMQ +RP+ +FSEKL G +L YPTYDKELYALVRAL+TWQHYLWPKEF+IHTDH
Sbjct: 772  -GIRAILMQEKRPITYFSEKLNGTTLNYPTYDKELYALVRALETWQHYLWPKEFVIHTDH 831

Query: 825  ESLKHLRVQNKLNRRHAKWLEFIETFPYVIKYKQGKENIVADALSRRYVLLNTLNARLLG 884
            ESLKHL+ Q KLNRRHAKW+EFIETF YVIKYKQGKENIVADALSRRY L++TLNA+LLG
Sbjct: 832  ESLKHLKGQGKLNRRHAKWVEFIETFLYVIKYKQGKENIVADALSRRYALVSTLNAKLLG 891

Query: 885  FEHIKDLYQHDMFFAPFVESCEKGLIVDNYLLLDGFLFRKGKLCIPSCSIRELLVREAHG 944
            FE++K+LY +D  FA    +CEK                + +LC+P+ S+RELLVREAHG
Sbjct: 892  FEYVKELYANDDDFASVYGACEKTTF-------------ENRLCVPNSSMRELLVREAHG 951

Query: 945  GGLMAHHGVSKTYDMLSEHFFLPKMRHDVHKVCGRCIACKQAKSRLQPHGLYSPLPVPNG 1004
            GGLM H GV KT D+L EHFF PKM+ DV + C RCI C+QAKSR+ PHGLY+PLPVP+ 
Sbjct: 952  GGLMGHFGVRKTLDVLHEHFFWPKMKCDVERACARCITCRQAKSRVLPHGLYTPLPVPSA 1011

Query: 1005 PWIDISMDFVLGLPRTRKGYDSIFVVVDRFSKMAHFIPCHKTDDAKHIADLFFREVVRLH 1064
            PW+DISMDFVLGLPR+R G  SIFVVVDRFSKM +FI CHK DDA HIA+LFFRE+VRLH
Sbjct: 1012 PWVDISMDFVLGLPRSRNGRXSIFVVVDRFSKMTYFISCHKIDDATHIANLFFREIVRLH 1071

Query: 1065 GIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVNRTMTAMLRAIID 1124
            G+P+SIVSDRDVKFLS+FW+VLW KLGTKL++STTCHPQTDGQTEVVNRT++        
Sbjct: 1072 GVPRSIVSDRDVKFLSYFWKVLWRKLGTKLLFSTTCHPQTDGQTEVVNRTLSTF------ 1131

Query: 1125 KNLKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPIPSKEFVNFDANAK 1184
                                VHSTT  +PFEIVYGFNPLT +DLLP+   E  + D   K
Sbjct: 1132 --------------------VHSTTNFSPFEIVYGFNPLTSLDLLPLXVNEMXSLDGEKK 1191

Query: 1185 VEFVHKLHKQVKEQIEKQNSKVATRINKGRKIVIFKPGDWVWVHFRKERFPTQRKSKLLP 1244
            VE V KLH+ V++ IEK+N +  ++ NKG   V+F+PGDWVWVH RKERFPT+R SKL P
Sbjct: 1192 VEMVKKLHESVRKHIEKKNEQYVSKANKGXXQVLFEPGDWVWVHMRKERFPTRRXSKLHP 1251

Query: 1245 RGDGPFQVLERINDNAYKIDLPGKYGVSATFSVVDLSPFDVGDGLDSRTNPSQEGENDMN 1272
            RGDGPFQVLERINDNAYK+D+PG+Y +SATF+V DLSPFDVGD  DS TNP +E  ND N
Sbjct: 1252 RGDGPFQVLERINDNAYKLDIPGEYNISATFNVSDLSPFDVGD--DSXTNPFEEKGNDEN 1268

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
YI31B_YEAST7.4e-9835.61Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YG31B_YEAST7.4e-9835.61Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
TF27_SCHPO1.3e-8932.21Transposon Tf2-7 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
TF22_SCHPO1.3e-8932.21Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
TF25_SCHPO1.3e-8932.21Transposon Tf2-5 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
Match NameE-valueIdentityDescription
A0A151UF56_CAJCA2.8e-27769.81Transposon Ty3-I Gag-Pol polyprotein OS=Cajanus cajan GN=KK1_049062 PE=4 SV=1[more]
A5BSP7_VITVI7.1e-27355.88Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_001042 PE=4 SV=1[more]
Q9LQH2_ARATH2.4e-26867.18F15O4.13 OS=Arabidopsis thaliana PE=4 SV=1[more]
Q8L7J3_MAIZE5.2e-26865.09Gag-pol polyprotein OS=Zea mays PE=4 SV=1[more]
A4K7M3_ORYSJ5.8e-26763.70Putative polyprotein OS=Oryza sativa subsp. japonica PE=4 SV=1[more]
Match NameE-valueIdentityDescription
ATMG00860.13.4e-1642.71ATMG00860.1 DNA/RNA polymerases superfamily protein[more]
Match NameE-valueIdentityDescription
gi|823145097|ref|XP_012472412.1|0.0e+0052.93PREDICTED: uncharacterized protein LOC105789586 [Gossypium raimondii][more]
gi|923748122|ref|XP_013673383.1|7.8e-28156.90PREDICTED: uncharacterized protein LOC106377669 [Brassica napus][more]
gi|1012366710|gb|KYP77891.1|4.0e-27769.81Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan][more]
gi|727521082|ref|XP_010436772.1|2.4e-27447.03PREDICTED: uncharacterized protein LOC104720585 [Camelina sativa][more]
gi|147800031|emb|CAN74973.1|1.0e-27255.88hypothetical protein VITISV_001042 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000477RT_dom
IPR001584Integrase_cat-core
IPR001878Znf_CCHC
IPR005162Retrotrans_gag_dom
IPR012337RNaseH-like_sf
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Biological Process
TermDefinition
GO:0015074DNA integration
Vocabulary: Molecular Function
TermDefinition
GO:0003676nucleic acid binding
GO:0008270zinc ion binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0006310 DNA recombination
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0003964 RNA-directed DNA polymerase activity
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh13G003030.1CmoCh13G003030.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 513..604
score: 8.5
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 951..1060
score: 2.9
IPR001584Integrase, catalytic corePROFILEPS50994INTEGRASEcoord: 943..1103
score: 18
IPR001878Zinc finger, CCHC-typeGENE3DG3DSA:4.10.60.10coord: 159..179
score: 1.
IPR001878Zinc finger, CCHC-typePFAMPF00098zf-CCHCcoord: 163..179
score: 1.
IPR001878Zinc finger, CCHC-typeSMARTSM00343c2hcfinal6coord: 163..179
score: 9.
IPR001878Zinc finger, CCHC-typePROFILEPS50158ZF_CCHCcoord: 163..179
score: 9
IPR001878Zinc finger, CCHC-typeunknownSSF57756Retrovirus zinc finger-like domainscoord: 153..180
score: 1.3
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 16..68
score: 7.3
IPR012337Ribonuclease H-like domainGENE3DG3DSA:3.30.420.10coord: 952..1104
score: 3.5
IPR012337Ribonuclease H-like domainunknownSSF53098Ribonuclease H-likecoord: 940..1101
score: 4.68
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 240..333
score: 1.6
NoneNo IPR availableunknownCoilCoilcoord: 380..400
scor
NoneNo IPR availableGENE3DG3DSA:3.10.10.10coord: 516..596
score: 1.8
NoneNo IPR availablePANTHERPTHR24559FAMILY NOT NAMEDcoord: 516..1255
score: 2.0E
NoneNo IPR availablePANTHERPTHR24559:SF174SUBFAMILY NOT NAMEDcoord: 516..1255
score: 2.0E
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 516..799
score: 5.67E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None