Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 34, Thu Dec 22 15:18:10 2005 UTC
trunk/tm/ChangeLog revision 751, Tue May 15 18:01:43 2007 UTC
# Line 1  Line 1 
1    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
4    
5    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
6    
7            * man/sFilter.Rd: Corrected documentation on statement format (use
8            '==' instead of '=').
9    
10    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
11    
12            * R/aobjects.R (StructuredTextDocument): Inherits from
13            TextDocument.
14    
15    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
16    
17            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
18            on sparse matrices as proposed by Martin Maechler.
19    
20    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
21    
22            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
23            \pkg{filehash} version deprecates them.
24    
25    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
26    
27            * R/termdocmatrix.R (textvector): Stemming is now performed before
28            erasing stopwords.
29            (weightMatrix): Adapted to handle sparse matrices.
30            (TermDocMatrix): Sparse matrix is now efficiently built by
31            direct stepwise insertion of row values into it.
32    
33    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
34    
35            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
36            due to ongoing problems. For our purposes the latter is as useful
37            as the replaced package.
38    
39    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
40    
41            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
42    
43            * man/TermDocMatrix.Rd: Removed deprecated \code{language} argument.
44    
45    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
46    
47            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
48            languages with available stopwords.
49    
50    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
51    
52            * inst/doc/tm.Rnw: Minor corrections in the vignette.
53    
54    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
55    
56            * DESCRIPTION: Update to version 0.2, since a lot of new features
57            have been integrated.
58    
59            * inst/stopwords: Updated existing stopwords and added stopwords
60            for various other languages.
61    
62    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
63    
64            * man/: Updated documentation.
65    
66            * Work/testDb.R: Script to test database stuff.
67    
68            * R/: Fixed various database-related bugs. Seems to be rather
69            usable now, i.e., consider it alpha status for now.
70    
71    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
72    
73            * R/: Fixed some bugs related to database support.
74    
75    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
76    
77            * man/: Added a lot of examples to the manuals.
78    
79    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
80    
81            * man/: Updated parts of the documentation.
82    
83            * R/textdoccol.R (asPlain): Added conversion from newsgroup
84            documents to plain text documents.
85    
86    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
87    
88            * R/textdoccol.R: Finished experimental database support. Not yet
89            intensively tested.
90    
91            * R/source.R: Now each source has a default reader.
92    
93            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
94            class anymore.
95    
96            * R/plaintextdoc.R: Custom show method for plain text documents.
97    
98            * R/aobjects.R: Added a class for structured text documents.
99    
100            * R/reader.R: Replaced remaining \code{parser} occurrences with
101            \code{reader}.
102    
103            * R/textdoccol.R (summary): Indent tags.
104    
105            * R/textdoccol.R (removePunctuation): Transform method to remove
106            punctuation marks.
107    
108    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
109    
110            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
111            using prescindMeta().
112    
113    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
114    
115            * R/textdoccol.R: Improved database support.
116    
117    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
118    
119            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
120    
121            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
122            language code.
123    
124            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
125            into parserControl argument.
126    
127            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
128    
129    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
130    
131            * Work/tmDataSetup.R: The datasets acq and crude can now be
132            created on the fly.
133    
134            * R/stopwords.R: Introduced a function returning the stopwords for
135            a given language (English, German and French at the moment).
136    
137            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
138            otherwise falls back to Snowball package.
139    
140    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
141    
142            * man/dissimilarity-methods.Rd: Make clear that any method offered
143            by "dists" from package "cba" can be used.
144    
145    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
146    
147            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
148            to Kurt's LaTeX suggestion. Removed points and underscores in
149            variable names for consistent naming.
150    
151            * DESCRIPTION: Update to version 0.1-2.
152    
153            * man/TextRepository.Rd: Fixed bug in documentation.
154    
155    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
156    
157            * DESCRIPTION: Update to version 0.1-1.
158    
159    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
160    
161            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
162            wordStem.
163    
164    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
165    
166            * R/: Changes due to Kurt's review.
167    
168    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
169    
170            * R/: Implemented improvements based upon comments by David
171            Meyer.
172    
173    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
174    
175            * inst/doc/: Rewrote vignette.
176    
177            * man/: Improved documentation.
178    
179    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * man/: Updated documentation.
182    
183            * DESCRIPTION: Changed package name to "tm". Updated version to
184            0.1 for first CRAN release.
185    
186            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
187            list archive example.
188    
189            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
190            archive example.
191    
192            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
193            from (several mails per box) mbox format to (single mail per file)
194            eml format.
195    
196    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
197    
198            * data/crude.rda: Rebuilt.
199    
200            * data/acq.rda: Rebuilt.
201    
202            * R/reader.R: Factored out reader and parser methods from
203            textdoccol.R.
204    
205            * R/source.R: Factored out Source methods from aobjects.R and
206            textdoccol.R.
207            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
208            feeds.
209    
210            * R/textdoccol.R (DirSource): Added support for recursive
211            traversal of directories.
212    
213    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
214    
215            * R/textdoccol.R ([[): Loads the document corpus automatically
216            into memory upon access.
217            (tm_transform, tm_filter): Removed several checks whether the
218            document is already loaded ([[ ensures this now).
219            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
220            mailing list archive.
221    
222    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * R/aobjects.R (TextDocument): Is now a virtual class.
225            (Source): Is now a virtual class.
226    
227    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
228    
229            * R/textdoccol.R (c): Support for an arbitrary number of document
230            collections.
231    
232    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
233    
234            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
235            append_meta and remove_meta.
236    
237            * R/textdoccol.R: Removed modify_metadata method.
238    
239            * R/textrepo.R: Removed modify_metadata method.
240    
241            * R/textdoccol.R (remove_meta): Supports removal of document
242            collection metadata and document (= in data frame) metadata.
243    
244    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
247    
248            * data/crude.rda: Rebuilt.
249    
250            * data/acq.rda: Rebuilt.
251    
252            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
253    
254            * R/textdoccol.R ([): Bug fix for subsetting a document
255            collection's data frame.
256    
257    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
260            to s_filter.
261    
262            * R/textdoccol.R: Local text documents' metadata can now be copied
263            to a document collection's data frame with prescind_meta.
264    
265    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/: Text documents' slot metadata is now accessible in s_filter.
268    
269            * R/: Rewrote s_filter function (still has some restrictions).
270    
271    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/: Various fixes in handling metadata.
274    
275            * R/: Added update mechanism for text document collections.
276    
277    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
278    
279            * R/: Merging of document collections now creates a binary tree
280            for reconstructing merged document collections.
281    
282            * R/: Redesign of metadata for document collections.
283    
284    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/: Messages now use \code{ngettext}.
287    
288    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * R/: Added functions for modifying and removing metadata.
291    
292    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
293    
294            * man/: Updated some documentation.
295    
296            * R/: Corrected some connection issues.
297    
298            * inst/doc: Worked on the vignette.
299    
300    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
301    
302            * inst/: Added texts and started vignette.
303    
304            * R/: Final changes based upon David's comments.
305    
306    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
307    
308            * NAMESPACE: Corrected exports (generic methods need exportMethods
309            directives!).
310    
311    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
312    
313            * R/: Modified the TextDocCol constructor and various parsers. It
314            is now modular and supports various file formats via plugins (see
315            the new "Source" class).
316    
317    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * man/: Revised documentation after previous code changes.
320    
321    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
322    
323            * R/: Remaining changes as discussed with David.
324    
325    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
326    
327            * R/: Some changes as suggested by David. The rest will follow
328            within the next few days.
329    
330    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * man/: Finished documentation.
333    
334    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * man/: Wrote some documentation.
337    
338    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
339    
340            * R/: Further syntactic sugar in the form of additional assignment and
341            accessor methods.
342    
343    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * R/: Syntactic sugar in the form of "length", "show" and "summary"
346            operators.
347    
348    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
349    
350            * R/: Various updates, mainly to default operators ("[" or "c")
351            and dissimilarities.
352    
353    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
354    
355            * R/: Added similarity functions.
356    
357            * data/: Added English stopwords.
358    
359    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
360    
361            * data/: Examples compiled for new features.
362    
363            * R/: Changes due to new structure.
364    
365            * NAMESPACE: Corrected namespace to reflect new structure.
366    
367            * R/termdocmatrix.R: Adapted for new naming scheme.
368    
369    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
370    
371            * R/textdoccol.R: Adapted code for new class structure. Wrote
372            several transform and filter functions operating on text document
373            collections (alias text document databases).
374    
375            * R/aobjects.R: Adapted class structure with inheritance,
376            repositories and additional metadata. Loading files on demand is
377            now possible.
378    
379    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
380    
381            * R/: Some cosmetic cleanups.
382    
383            * inst/: Removed vignette on clustering. That and much more is now
384            described in the JSS paper on text mining. Based upon that
385            article an elaborate vignette will be incorporated in the future.
386    
387    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
388    
389            * R/: Updated generic S4 methods to comply with signature changes
390            in newer versions of R (> 2.3).
391    
392    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
393    
394            * ext/R/importRIS.R: Automatic RIS import is now possible.
395    
396    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
397    
398            * R/textdoccol.R: Added RIS HTML input format.
399    
400    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
401    
402            * R/textdoccol.R: Removed bug that caused invalid text document
403            collections when handling many input files.
404    
405    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
406    
407            * R/textdoccol.R: Restructured and extended file import
408            mechanism.
409    
410            * inst/doc/clustering.Rnw: Adapted vignette for use with
411            ReutNews.rda.
412    
413            * man/ReutNews.Rd: Documentation for ReutNews.rda.
414    
415            * data/ReutNews.rda: A tiny Reuters-21578 example data set.
416    
417    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
418    
419            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
