SCM

SCM Repository

[tm] Diff of /trunk/tm/ChangeLog
ViewVC logotype

Diff of /trunk/tm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 41, Sun Mar 12 17:14:15 2006 UTC trunk/tm/ChangeLog revision 750, Fri May 11 16:46:15 2007 UTC
# Line 1  Line 1 
1    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * man/sFilter.Rd: Corrected documentation on statement format (use
4            '==' instead of '=').
5    
6    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
7    
8            * R/aobjects.R (StructuredTextDocument): Inherits from
9            TextDocument.
10    
11    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
12    
13            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
14            on sparse matrices as proposed by Martin Maechler.
15    
16    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
17    
18            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
19            \pkg{filehash} version makes them deprecated.
20    
21    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
22    
23            * R/termdocmatrix.R (textvector): Stemming is now performed before
24            erasing stopwords.
25            (weightMatrix): Adapted to handle sparse matrices.
26            (TermDocMatrix): Sparse matrix is now efficiently built by
27            direct stepwise insertion of row values into it.
28    
29    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
30    
31            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
32            due to ongoing problems. For our purposes the latter is as useful
33            as the replaced package.
34    
35    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
36    
37            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
38    
39            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
40    
41    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
42    
43            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
44            languages with available stopwords.
45    
46    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
47    
48            * inst/doc/tm.Rnw: Minor corrections in the vignette.
49    
50    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
51    
52            * DESCRIPTION: Update to version 0.2, since a lot of new features
53            have been integrated.
54    
55            * inst/stopwords: Updated existing stopwords and added stopwords
56            for various other languages.
57    
58    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
59    
60            * man/: Updated documentation.
61    
62            * Work/testDb.R: Script to test database stuff.
63    
64            * R/: Fixed various database related bugs. Seems to be rather
65            useable now, i.e., consider as alpha status for now.
66    
67    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
68    
69            * R/: Fixed some bugs related to database support.
70    
71    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
72    
73            * man/: Added a lot of examples to the manuals.
74    
75    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
76    
77            * man/: Updated parts of the documentation.
78    
79            * R/textdoccol.R (asPlain): Added conversion from newsgroup
80            documents to plain text documents.
81    
82    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
83    
84            * R/textdoccol.R: Finished experimental database support. Not yet
85            intensively tested.
86    
87            * R/source.R: Now each source has a default reader.
88    
89            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
90            class anymore.
91    
92            * R/plaintextdoc.R: Custom show method for plain text documents.
93    
94            * R/aobjects.R: Added a class for structured text documents.
95    
96            * R/reader.R: Replaced remaining \code{parser} occurrences with
97            \code{reader}.
98    
99            * R/textdoccol.R (summary): Indent tags.
100    
101            * R/textdoccol.R (removePunctuation): Transform method to remove
102            punctuation marks.
103    
104    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
105    
106            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
107            using prescindMeta().
108    
109    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
110    
111            * R/textdoccol.R: Improved database support.
112    
113    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
114    
115            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
116    
117            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
118            language code.
119    
120            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
121            into parserControl argument.
122    
123            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
124    
125    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
126    
127            * Work/tmDataSetup.R: The datasets acq and crude can now be
128            created on the fly.
129    
130            * R/stopwords.R: Introduced a function returning the stopwords for
131            a given language (English, German and French at the moment)
132    
133            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
134            otherwise falls back to Snowball package.
135    
136    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
137    
138            * man/dissimilarity-methods.Rd: Make clear that any method offered
139            by "dists" from package "cba" can be used.
140    
141    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
142    
143            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
144            to Kurt's latex suggestion. Removed points and underscores in
145            variable names for consistent naming.
146    
147            * DESCRIPTION: Update to version 0.1-2.
148    
149            * man/TextRepository.Rd: Fixed bug in documentation.
150    
151    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
152    
153            * DESCRIPTION: Update to version 0.1-1.
154    
155    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
156    
157            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
158            wordStem.
159    
160    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
161    
162            * R/: Changes due to Kurt's review.
163    
164    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
165    
166            * R/: Implemented improvements based upon comments by David
167            Meyer.
168    
169    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
170    
171            * inst/doc/: Rewrote vignette.
172    
173            * man/: Improved documentation.
174    
175    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
176    
177            * man/: Updated documentation.
178    
179            * DESCRIPTION: Changed package name to "tm". Updated version to
180            0.1 for first CRAN release.
181    
182            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
183            list archive example.
184    
185            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
186            archive example.
187    
188            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
189            from (several mails per box) mbox format to (single mail per file)
190            eml format.
191    
192    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
193    
194            * data/crude.rda: Rebuilt.
195    
196            * data/acq.rda: Rebuilt.
197    
198            * R/reader.R: Factored out reader and parser methods from
199            textdoccol.R.
200    
201            * R/source.R: Factored out Source methods from aobjects.R and
202            textdoccol.R.
203            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
204            feeds.
205    
206            * R/textdoccol.R (DirSource): Added support for recursive
207            traversal of directories.
208    
209    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/textdoccol.R ([[): Loads the document corpus automatically
212            into memory upon access.
213            (tm_transform, tm_filter): Removed several checks whether the
214            document is already loaded ([[ ensures this now).
215            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
216            mailing list archive.
217    
218    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
219    
220            * R/aobjects.R (TextDocument): Is now a virtual class.
221            (Source): Is now a virtual class.
222    
223    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
224    
225            * R/textdoccol.R (c): Support for an arbitrary number of document
226            collections.
227    
228    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
231            append_meta and remove_meta.
232    
233            * R/textdoccol.R: Removed modify_metadata method.
234    
235            * R/textrepo.R: Removed modify_metadata method.
236    
237            * R/textdoccol.R (remove_meta): Supports removal of document
238            collection metadata and document (= in data frame) metadata.
239    
240    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
241    
242            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
243    
244            * data/crude.rda: Rebuilt.
245    
246            * data/acq.rda: Rebuilt.
247    
248            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
249    
250            * R/textdoccol.R ([): Bug fix for subsetting a document
251            collection's data frame.
252    
253    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
256            to s_filter.
257    
258            * R/textdoccol.R: Local text documents' metadata can now be copied
259            to a document collection's data frame with prescind_meta.
260    
261    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * R/: Text documents' slot metadata is now accessible in s_filter.
264    
265            * R/: Rewrote s_filter function (has still some restrictions).
266    
267    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
268    
269            * R/: Various fixes in handling metadata.
270    
271            * R/: Added update mechanism for text document collections.
272    
273    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * R/: Merging of document collections now creates a binary tree
276            for reconstructing merged document collections.
277    
278            * R/: Redesign of metadata for document collections.
279    
280    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
281    
282            * R/: Messages now use \code{ngettext}.
283    
284    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/: Added functions for modifying and removing metadata.
287    
288    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * man/: Updated some documentation.
291    
292            * R/: Corrected some connection issues.
293    
294            * inst/doc: Worked on the vignette.
295    
296    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
297    
298            * inst/: Added texts and started vignette.
299    
300            * R/: Final changes based upon David's comments.
301    
302    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * NAMESPACE: Corrected exports (generic methods need exportMethods
305            directives!).
306    
307    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
308    
309            * R/: Modified the TextDocCol constructur and various parsers. It
310            is now modular and supports various file formats via plugins (see
311            the new "Source" class).
312    
313    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
314    
315            * man/: Revised documentation after previous code changes.
316    
317    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/: Remaining changes as discussed with David.
320    
321    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
322    
323            * R/: Some changes as suggested by David. The rest will follow
324            within the next days.
325    
326    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
327    
328            * man/: Finished documentation.
329    
330    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * man/: Wrote some documentation.
333    
334    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * R/: Further syntactic sugar in form of additional assignment and
337            accessor methods.
338    
339    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
340    
341            * R/: Syntactic sugar in form of "length", "show" and "summary"
342            operators.
343    
344    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
345    
346            * R/: Diverse updates. Mainly on default operators ("[" or "c")
347            and dissimilarities.
348    
349    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
350    
351            * R/: Added similarity functions.
352    
353            * data/: Added english stopwords.
354    
355    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * data/: Examples compiled for new features
358    
359            * R/: Changes due to new structure.
360    
361            * NAMESPACE: Corrected namespace to reflect new structure.
362    
363            * R/termdocmatrix.R: Adapted for new naming scheme.
364    
365    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * R/textdoccol.R: Adapted code for new class structure. Wrote
368            several transform and filter functions operating on text document
369            collections (alias text document databases).
370    
371            * R/aobjects.R: Adapted class structure with inheritance,
372            repositories and additional meta data. Loading files on demand is
373            now possible.
374    
375    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
376    
377            * R/: Some cosmetic cleanups.
378    
379            * inst/: Removed vignette on clustering. That and much more is now
380            described in the JSS paper on text mining. Based upon that
381            article an elaborated vignette will be incorporated in the future.
382    
383    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
384    
385            * R/: Updated generic S4 methods to comply with signature changes
386            in newer versions of R (> 2.3)
387    
388  2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
389    
390          * ext/R/importRIS.R: Automatic RIS import is now possible.          * ext/R/importRIS.R: Automatic RIS import is now possible.

Legend:
Removed from v.41  
changed lines
  Added in v.750

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge