[tm] Diff of /trunk/tm/ChangeLog

trunk/R/trunk/ChangeLog revision 23, Sat Nov 19 18:25:41 2005 UTC
trunk/tm/ChangeLog revision 732, Wed Apr 11 18:11:54 2007 UTC
1    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * DESCRIPTION: Update to version 0.2, since a lot of new features
4            have been integrated.
5    
6            * inst/stopwords: Updated existing stopwords and added stopwords
7            for various other languages.
8    
9    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
10    
11            * man/: Updated documentation.
12    
13            * Work/testDb.R: Script to test database stuff.
14    
15            * R/: Fixed various database-related bugs. Seems to be rather
16            usable now, i.e., consider it alpha status for now.
17    
18    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
19    
20            * R/: Fixed some bugs related to database support.
21    
22    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
23    
24            * man/: Added a lot of examples to the manuals.
25    
26    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
27    
28            * man/: Updated parts of the documentation.
29    
30            * R/textdoccol.R (asPlain): Added conversion from newsgroup
31            documents to plain text documents.
32    
33    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
34    
35            * R/textdoccol.R: Finished experimental database support. Not yet
36            intensively tested.
37    
38            * R/source.R: Now each source has a default reader.
39    
40            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
41            class anymore.
42    
43            * R/plaintextdoc.R: Custom show method for plain text documents.
44    
45            * R/aobjects.R: Added a class for structured text documents.
46    
47            * R/reader.R: Replaced remaining \code{parser} occurrences with
48            \code{reader}.
49    
50            * R/textdoccol.R (summary): Indent tags.
51    
52            * R/textdoccol.R (removePunctuation): Transform method to remove
53            punctuation marks.
54    
55    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
56    
57            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
58            using prescindMeta().
59    
60    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
61    
62            * R/textdoccol.R: Improved database support.
63    
64    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
65    
66            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
67    
68            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
69            language code.
70    
71            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
72            into parserControl argument.
73    
74            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
75    
76    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
77    
78            * Work/tmDataSetup.R: The datasets acq and crude can now be
79            created on the fly.
80    
81            * R/stopwords.R: Introduced a function returning the stopwords for
82            a given language (English, German and French at the moment).
83    
84            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
85            otherwise falls back to Snowball package.
86    
87    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
88    
89            * man/dissimilarity-methods.Rd: Make clear that any method offered
90            by "dists" from package "cba" can be used.
91    
92    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
93    
94            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes bug according
95            to Kurt's LaTeX suggestion. Removed points and underscores in
96            variable names for consistent naming.
97    
98            * DESCRIPTION: Update to version 0.1-2.
99    
100            * man/TextRepository.Rd: Fixed bug in documentation.
101    
102    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
103    
104            * DESCRIPTION: Update to version 0.1-1.
105    
106    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
107    
108            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
109            wordStem.
110    
111    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
112    
113            * R/: Changes due to Kurt's review.
114    
115    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
116    
117            * R/: Implemented improvements based upon comments by David
118            Meyer.
119    
120    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
121    
122            * inst/doc/: Rewrote vignette.
123    
124            * man/: Improved documentation.
125    
126    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
127    
128            * man/: Updated documentation.
129    
130            * DESCRIPTION: Changed package name to "tm". Updated version to
131            0.1 for first CRAN release.
132    
133            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
134            list archive example.
135    
136            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
137            archive example.
138    
139            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
140            from (several mails per box) mbox format to (single mail per file)
141            eml format.
142    
143    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
144    
145            * data/crude.rda: Rebuilt.
146    
147            * data/acq.rda: Rebuilt.
148    
149            * R/reader.R: Factored out reader and parser methods from
150            textdoccol.R.
151    
152            * R/source.R: Factored out Source methods from aobjects.R and
153            textdoccol.R.
154            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
155            feeds.
156    
157            * R/textdoccol.R (DirSource): Added support for recursive
158            traversal of directories.
159    
160    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
161    
162            * R/textdoccol.R ([[): Loads the document corpus automatically
163            into memory upon access.
164            (tm_transform, tm_filter): Removed several checks whether the
165            document is already loaded ([[ ensures this now).
166            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
167            mailing list archive.
168    
169    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
170    
171            * R/aobjects.R (TextDocument): Is now a virtual class.
172            (Source): Is now a virtual class.
173    
174    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
175    
176            * R/textdoccol.R (c): Support for an arbitrary number of document
177            collections.
178    
179    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
182            append_meta and remove_meta.
183    
184            * R/textdoccol.R: Removed modify_metadata method.
185    
186            * R/textrepo.R: Removed modify_metadata method.
187    
188            * R/textdoccol.R (remove_meta): Supports removal of document
189            collection metadata and document (= in data frame) metadata.
190    
191    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
192    
193            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
194    
195            * data/crude.rda: Rebuilt.
196    
197            * data/acq.rda: Rebuilt.
198    
199            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
200    
201            * R/textdoccol.R ([): Bug fix for subsetting a document
202            collection's data frame.
203    
204    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
205    
206            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
207            to s_filter.
208    
209            * R/textdoccol.R: Local text documents' metadata can now be copied
210            to a document collection's data frame with prescind_meta.
211    
212    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * R/: Text documents' slot metadata is now accessible in s_filter.
215    
216            * R/: Rewrote s_filter function (still has some restrictions).
217    
218    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
219    
220            * R/: Various fixes in handling metadata.
221    
222            * R/: Added update mechanism for text document collections.
223    
224    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
225    
226            * R/: Merging of document collections now creates a binary tree
227            for reconstructing merged document collections.
228    
229            * R/: Redesign of metadata for document collections.
230    
231    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
232    
233            * R/: Messages now use \code{ngettext}.
234    
235    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
236    
237            * R/: Added functions for modifying and removing metadata.
238    
239    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
240    
241            * man/: Updated some documentation.
242    
243            * R/: Corrected some connection issues.
244    
245            * inst/doc: Worked on the vignette.
246    
247    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
248    
249            * inst/: Added texts and started vignette.
250    
251            * R/: Final changes based upon David's comments.
252    
253    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * NAMESPACE: Corrected exports (generic methods need exportMethods
256            directives!).
257    
258    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
259    
260            * R/: Modified the TextDocCol constructor and various parsers. It
261            is now modular and supports various file formats via plugins (see
262            the new "Source" class).
263    
264    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
265    
266            * man/: Revised documentation after previous code changes.
267    
268    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
269    
270            * R/: Remaining changes as discussed with David.
271    
272    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
273    
274            * R/: Some changes as suggested by David. The rest will follow
275            within the next few days.
276    
277    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
278    
279            * man/: Finished documentation.
280    
281    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
282    
283            * man/: Wrote some documentation.
284    
285    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
286    
287            * R/: Further syntactic sugar in the form of additional assignment
288            and accessor methods.
289    
290    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
291    
292            * R/: Syntactic sugar in the form of "length", "show" and
293            "summary" operators.
294    
295    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * R/: Various updates, mainly to default operators ("[" or "c")
298            and dissimilarities.
299    
300    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
301    
302            * R/: Added similarity functions.
303    
304            * data/: Added English stopwords.
305    
306    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
307    
308            * data/: Examples compiled for new features.
309    
310            * R/: Changes due to new structure.
311    
312            * NAMESPACE: Corrected namespace to reflect new structure.
313    
314            * R/termdocmatrix.R: Adapted for new naming scheme.
315    
316    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
317    
318            * R/textdoccol.R: Adapted code for new class structure. Wrote
319            several transform and filter functions operating on text document
320            collections (alias text document databases).
321    
322            * R/aobjects.R: Adapted class structure with inheritance,
323            repositories and additional meta data. Loading files on demand is
324            now possible.
325    
326    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
327    
328            * R/: Some cosmetic cleanups.
329    
330            * inst/: Removed vignette on clustering. That and much more is now
331            described in the JSS paper on text mining. Based upon that
332            article, a more elaborate vignette will be incorporated in the future.
333    
334    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * R/: Updated generic S4 methods to comply with signature changes
337            in newer versions of R (> 2.3).
338    
339    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
340    
341            * ext/R/importRIS.R: Automatic RIS import is now possible.
342    
343    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * R/textdoccol.R: Added RIS HTML input format.
346    
347    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/textdoccol.R: Removed bug that caused invalid text document
350            collections when handling many input files.
351    
352    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * R/textdoccol.R: Restructured and extended file import
355            mechanism.
356    
357            * inst/doc/clustering.Rnw: Adapted vignette for use with
358            ReutNews.rda
359    
360            * man/ReutNews.Rd: Documentation for ReutNews.rda
361    
362            * data/ReutNews.rda: A tiny Reuters21578 example data set.
363    
364    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
365    
366            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
367            clustering facilities of this package.
368    
369    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
370    
371            * R/aobjects.R: Changed package document structure to avoid class
372            dependency problems.
373    
374    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
375    
376            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
377            data set.
378    
379            * Finished documentation and reordered directory structure. Now "R
380            CMD check textmin" works without errors.
381    
382    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
383    
384            * src/: Various splits can now be easily created for the
385            Reuters21578 data set.
386    
387    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
388    
389            * Updated documentation
390    
391    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
392    
393            * Wrote R documentation for some classes and methods.
394    
395    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
396    
397            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
