2006-11-26 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textrepo.R: Updated TextRepository (constructor), append_elem,
append_meta and remove_meta.

* R/textdoccol.R: Removed modify_metadata method.

* R/textrepo.R: Removed modify_metadata method.

* R/textdoccol.R (remove_meta): Supports removal of document
collection metadata and document (= in data frame) metadata.

2006-11-23 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textdoccol.R (append_doc): Bug fix for handling empty metadata.

* data/crude.rda: Rebuilt.

* data/acq.rda: Rebuilt.

* inst/doc/textmin.Rnw: Updated vignette to reflect code changes.

* R/textdoccol.R ([): Bug fix for subsetting a document
collection's data frame.

2006-11-22 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textdoccol.R: Bug fixes in s_filter. Added full query support
to s_filter.

* R/textdoccol.R: Local text documents' metadata can now be copied
to a document collection's data frame with prescind_meta.

2006-11-21 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Text documents' slot metadata is now accessible in s_filter.

* R/: Rewrote s_filter function (still has some restrictions).

2006-11-20 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Various fixes in handling metadata.

* R/: Added update mechanism for text document collections.

2006-11-19 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Merging of document collections now creates a binary tree
for reconstructing merged document collections.

* R/: Redesign of metadata for document collections.

2006-11-07 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Messages now use \code{ngettext}.

2006-11-03 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Added functions for modifying and removing metadata.

2006-11-01 Ingo Feinerer <h0125130@wu-wien.ac.at>

* man/: Updated some documentation.

* R/: Corrected some connection issues.

* inst/doc: Worked on the vignette.

2006-10-31 Ingo Feinerer <h0125130@wu-wien.ac.at>

* inst/: Added texts and started vignette.

* R/: Final changes based upon David's comments.

2006-10-29 Ingo Feinerer <h0125130@wu-wien.ac.at>

* NAMESPACE: Corrected exports (generic methods need exportMethods
directives!).

2006-10-26 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Modified the TextDocCol constructor and various parsers. It
is now modular and supports various file formats via plugins (see
the new "Source" class).

2006-10-24 Ingo Feinerer <h0125130@wu-wien.ac.at>

* man/: Revised documentation after previous code changes.

2006-10-23 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Remaining changes as discussed with David.

2006-10-22 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Some changes as suggested by David. The rest will follow
within the next few days.

2006-09-26 Ingo Feinerer <h0125130@wu-wien.ac.at>

* man/: Finished documentation.

2006-09-25 Ingo Feinerer <h0125130@wu-wien.ac.at>

* man/: Wrote some documentation.

2006-09-24 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Further syntactic sugar in the form of additional assignment and
accessor methods.

2006-09-13 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Syntactic sugar in the form of "length", "show" and "summary"
operators.

2006-08-24 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Various updates, mainly to default operators ("[" or "c")
and dissimilarities.

2006-08-12 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Added similarity functions.

* data/: Added English stopwords.

2006-08-07 Ingo Feinerer <h0125130@wu-wien.ac.at>

* data/: Examples compiled for new features.

* R/: Changes due to new structure.

* NAMESPACE: Corrected namespace to reflect new structure.

* R/termdocmatrix.R: Adapted for new naming scheme.

2006-08-06 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textdoccol.R: Adapted code for new class structure. Wrote
several transform and filter functions operating on text document
collections (alias text document databases).

* R/aobjects.R: Adapted class structure with inheritance,
repositories and additional metadata. Loading files on demand is
now possible.

2006-07-13 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Some cosmetic cleanups.

* inst/: Removed vignette on clustering. That and much more is now
described in the JSS paper on text mining. Based upon that
article, a more elaborate vignette will be incorporated in the future.

2006-07-01 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/: Updated generic S4 methods to comply with signature changes
in newer versions of R (> 2.3).

2006-03-12 Ingo Feinerer <h0125130@wu-wien.ac.at>

* ext/R/importRIS.R: Automatic RIS import is now possible.

2006-02-14 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textdoccol.R: Added RIS HTML input format.

2006-01-21 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textdoccol.R: Removed bug that caused invalid text document
collections when handling many input files.

2006-01-11 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textdoccol.R: Restructured and extended file import
mechanism.

* inst/doc/clustering.Rnw: Adapted vignette for use with
ReutNews.rda.

* man/ReutNews.Rd: Documentation for ReutNews.rda.

* data/ReutNews.rda: A tiny Reuters-21578 example data set.

2005-12-22 Ingo Feinerer <h0125130@wu-wien.ac.at>

* inst/doc/clustering.Rnw: Wrote a small vignette to present the
clustering facilities of this package.

2005-12-15 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/aobjects.R: Changed package document structure to avoid class
dependency problems.

2005-12-06 Ingo Feinerer <h0125130@wu-wien.ac.at>

* Wrote a script for the ModLewis Split for the Reuters-21578 XML
data set.

* Finished documentation and reordered directory structure. Now "R
CMD check textmin" works without errors.

2005-12-04 Ingo Feinerer <h0125130@wu-wien.ac.at>

* src/: Various splits can now be easily created for the
Reuters-21578 data set.

2005-12-03 Ingo Feinerer <h0125130@wu-wien.ac.at>

* Updated documentation.

2005-11-30 Ingo Feinerer <h0125130@wu-wien.ac.at>

* Wrote R documentation for some classes and methods.

2005-11-19 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textdoccol.R: Constructor of textdoccol allows import of CSV
files. See the questionnaire data/Umfrage.csv for such an example.
We are now able to import files in Reuters-21578 XML format.

* Changed class interfaces in various files. Weighting of the text
matrix is now possible.

2005-11-08 Ingo Feinerer <h0125130@wu-wien.ac.at>

* R/textdoccol.R: One can build term-document matrices if