SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC trunk/tm/ChangeLog revision 713, Wed Mar 14 13:44:11 2007 UTC
# Line 1  Line 1 
1    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * R/stopwords.R: Introduced a function returning the stopwords for
4            a given language (English, German and French at the moment)
5    
6            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
7            otherwise falls back to Snowball package.
8    
9    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
10    
11            * man/dissimilarity-methods.Rd: Make clear that any method offered
12            by "dists" from package "cba" can be used.
13    
14    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
15    
16            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
17            to Kurt's latex suggestion. Removed points and underscores in
18            variable names for consistent naming.
19    
20            * DESCRIPTION: Update to version 0.1-2.
21    
22            * man/TextRepository.Rd: Fixed bug in documentation.
23    
24    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
25    
26            * DESCRIPTION: Update to version 0.1-1.
27    
28    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
29    
30            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
31            wordStem.
32    
33    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
34    
35            * R/: Changes due to Kurt's review.
36    
37    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
38    
39            * R/: Implemented improvements based upon comments by David
40            Meyer.
41    
42    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
43    
44            * inst/doc/: Rewrote vignette.
45    
46            * man/: Improved documentation.
47    
48    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
49    
50            * man/: Updated documentation.
51    
52            * DESCRIPTION: Changed package name to "tm". Updated version to
53            0.1 for first CRAN release.
54    
55            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
56            list archive example.
57    
58            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
59            archive example.
60    
61            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
62            from (several mails per box) mbox format to (single mail per file)
63            eml format.
64    
65    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
66    
67            * data/crude.rda: Rebuilt.
68    
69            * data/acq.rda: Rebuilt.
70    
71            * R/reader.R: Factored out reader and parser methods from
72            textdoccol.R.
73    
74            * R/source.R: Factored out Source methods from aobjects.R and
75            textdoccol.R.
76            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
77            feeds.
78    
79            * R/textdoccol.R (DirSource): Added support for recursive
80            traversal of directories.
81    
82    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
83    
84            * R/textdoccol.R ([[): Loads the document corpus automatically
85            into memory upon access.
86            (tm_transform, tm_filter): Removed several checks whether the
87            document is already loaded ([[ ensures this now).
88            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
89            mailing list archive.
90    
91    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
92    
93            * R/aobjects.R (TextDocument): Is now a virtual class.
94            (Source): Is now a virtual class.
95    
96    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
97    
98            * R/textdoccol.R (c): Support for an arbitrary number of document
99            collections.
100    
101    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
102    
103            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
104            append_meta and remove_meta.
105    
106            * R/textdoccol.R: Removed modify_metadata method.
107    
108            * R/textrepo.R: Removed modify_metadata method.
109    
110            * R/textdoccol.R (remove_meta): Supports removal of document
111            collection metadata and document (= in data frame) metadata.
112    
113    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
114    
115            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
116    
117            * data/crude.rda: Rebuilt.
118    
119            * data/acq.rda: Rebuilt.
120    
121            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
122    
123            * R/textdoccol.R ([): Bug fix for subsetting a document
124            collection's data frame.
125    
126    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
127    
128            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
129            to s_filter.
130    
131            * R/textdoccol.R: Local text documents' metadata can now be copied
132            to a document collection's data frame with prescind_meta.
133    
134    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
135    
136            * R/: Text documents' slot metadata is now accessible in s_filter.
137    
138            * R/: Rewrote s_filter function (has still some restrictions).
139    
140    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
141    
142            * R/: Various fixes in handling metadata.
143    
144            * R/: Added update mechanism for text document collections.
145    
146    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
147    
148            * R/: Merging of document collections now creates a binary tree
149            for reconstructing merged document collections.
150    
151            * R/: Redesign of metadata for document collections.
152    
153    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * R/: Messages now use \code{ngettext}.
156    
157    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
158    
159            * R/: Added functions for modifying and removing metadata.
160    
161    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
162    
163            * man/: Updated some documentation.
164    
165            * R/: Corrected some connection issues.
166    
167            * inst/doc: Worked on the vignette.
168    
169    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
170    
171            * inst/: Added texts and started vignette.
172    
173            * R/: Final changes based upon David's comments.
174    
175    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
176    
177            * NAMESPACE: Corrected exports (generic methods need exportMethods
178            directives!).
179    
180    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
181    
182            * R/: Modified the TextDocCol constructur and various parsers. It
183            is now modular and supports various file formats via plugins (see
184            the new "Source" class).
185    
186    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
187    
188            * man/: Revised documentation after previous code changes.
189    
190    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
191    
192            * R/: Remaining changes as discussed with David.
193    
194    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
195    
196            * R/: Some changes as suggested by David. The rest will follow
197            within the next days.
198    
199    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
200    
201            * man/: Finished documentation.
202    
203    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
204    
205            * man/: Wrote some documentation.
206    
207    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
208    
209            * R/: Further syntactic sugar in form of additional assignment and
210            accessor methods.
211    
212    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * R/: Syntactic sugar in form of "length", "show" and "summary"
215            operators.
216    
217    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
218    
219            * R/: Diverse updates. Mainly on default operators ("[" or "c")
220            and dissimilarities.
221    
222    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * R/: Added similarity functions.
225    
226            * data/: Added english stopwords.
227    
228    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * data/: Examples compiled for new features
231    
232            * R/: Changes due to new structure.
233    
234            * NAMESPACE: Corrected namespace to reflect new structure.
235    
236            * R/termdocmatrix.R: Adapted for new naming scheme.
237    
238    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
239    
240            * R/textdoccol.R: Adapted code for new class structure. Wrote
241            several transform and filter functions operating on text document
242            collections (alias text document databases).
243    
244            * R/aobjects.R: Adapted class structure with inheritance,
245            repositories and additional meta data. Loading files on demand is
246            now possible.
247    
248    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
249    
250            * R/: Some cosmetic cleanups.
251    
252            * inst/: Removed vignette on clustering. That and much more is now
253            described in the JSS paper on text mining. Based upon that
254            article an elaborated vignette will be incorporated in the future.
255    
256    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
257    
258            * R/: Updated generic S4 methods to comply with signature changes
259            in newer versions of R (> 2.3)
260    
261    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * ext/R/importRIS.R: Automatic RIS import is now possible.
264    
265    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/textdoccol.R: Added RIS HTML input format.
268    
269    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
270    
271            * R/textdoccol.R: Removed bug that caused invalid text document
272            collections when handling many input files.
273    
274    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
275    
276            * R/textdoccol.R: Restructured and extended file import
277            mechanism.
278    
279            * inst/doc/clustering.Rnw: Adapted vignette for use with
280            ReutNews.rda
281    
282            * man/ReutNews.Rd: Documentation for ReutNews.rda
283    
284            * data/ReutNews.rda: A tiny Reuters21578 example data set.
285    
286    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
287    
288            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
289            clustering facilities of this package.
290    
291    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
292    
293            * R/aobjects.R: Changed package document structure to avoid class
294            dependency problems.
295    
296    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
297    
298            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
299            data set.
300    
301            * Finished documentation and reordered directory structure. Now "R
302            CMD check textmin" works without errors.
303    
304    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * src/: Various splits can now be easily created for the
307            Reuters21578 data set.
308    
309    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * Updated documentation
312    
313    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
314    
315            * Wrote R documentation for some classes and methods.
316    
317    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
320            files. See the questionnaire data/Umfrage.csv for such an example.
321            We are now able to import files in Reuters-21578 XML format.
322    
323            * Changed class interfaces in various files. Weighting of the text
324            matrix is now possible.
325    
326    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
327    
328            * R/textdoccol.R: One can build term-document matrices if
329            nessecary (with buildTDM(...)) and fill the field tdm from a text
330            document collection with it.
331    
332            * R/textmatrix.R: Wrote S4 class for term-document matrices.
333    
334    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * R/textdoccol.R: We now can read in a whole XML file with several
337            news items.
338    
339  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
340    
341          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.713

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge