Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC
trunk/tm/ChangeLog revision 754, Tue May 22 18:11:22 2007 UTC
# Line 1  Line 1 
1    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
4    
5    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
6    
7            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
8            vectors. Thanks to Ariel Maguyon for his error report.
9            (removeSparseTerms): New function to remove columns from a
10            term-document matrix exceeding a sparse factor.
11    
12    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
13    
14            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
15    
16    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
17    
18            * man/sFilter.Rd: Corrected documentation on statement format (use
19            '==' instead of '=').
20    
21    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
22    
23            * R/aobjects.R (StructuredTextDocument): Inherits from
24            TextDocument.
25    
26    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
27    
28            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
29            on sparse matrices as proposed by Martin Maechler.
30    
31    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
32    
33            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
34            \pkg{filehash} version deprecates them.
35    
36    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
37    
38            * R/termdocmatrix.R (textvector): Stemming is now performed before
39            erasing stopwords.
40            (weightMatrix): Adapted to handle sparse matrices.
41            (TermDocMatrix): Sparse matrix is now efficiently built by
42            direct stepwise insertion of row values into it.
43    
44    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
45    
46            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
47            due to ongoing problems. For our purposes the latter is as useful
48            as the replaced package.
49    
50    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
51    
52            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
53    
54            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
55    
56    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
57    
58            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
59            languages with available stopwords.
60    
61    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
62    
63            * inst/doc/tm.Rnw: Minor corrections in the vignette.
64    
65    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
66    
67            * DESCRIPTION: Update to version 0.2, since a lot of new features
68            have been integrated.
69    
70            * inst/stopwords: Updated existing stopwords and added stopwords
71            for various other languages.
72    
73    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
74    
75            * man/: Updated documentation.
76    
77            * Work/testDb.R: Script to test database stuff.
78    
79            * R/: Fixed various database-related bugs. Seems to be rather
80            usable now, i.e., consider it alpha status for now.
81    
82    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
83    
84            * R/: Fixed some bugs related to database support.
85    
86    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
87    
88            * man/: Added a lot of examples to the manuals.
89    
90    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
91    
92            * man/: Updated parts of the documentation.
93    
94            * R/textdoccol.R (asPlain): Added conversion from newsgroup
95            documents to plain text documents.
96    
97    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
98    
99            * R/textdoccol.R: Finished experimental database support. Not yet
100            intensively tested.
101    
102            * R/source.R: Now each source has a default reader.
103    
104            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
105            class anymore.
106    
107            * R/plaintextdoc.R: Custom show method for plain text documents.
108    
109            * R/aobjects.R: Added a class for structured text documents.
110    
111            * R/reader.R: Replaced remaining \code{parser} occurrences with
112            \code{reader}.
113    
114            * R/textdoccol.R (summary): Indent tags.
115    
116            * R/textdoccol.R (removePunctuation): Transform method to remove
117            punctuation marks.
118    
119    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
120    
121            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
122            using prescindMeta().
123    
124    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
125    
126            * R/textdoccol.R: Improved database support.
127    
128    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
129    
130            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
131    
132            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
133            language code.
134    
135            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
136            into parserControl argument.
137    
138            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
139    
140    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
141    
142            * Work/tmDataSetup.R: The datasets acq and crude can now be
143            created on the fly.
144    
145            * R/stopwords.R: Introduced a function returning the stopwords for
146            a given language (English, German and French at the moment).
147    
148            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
149            otherwise falls back to Snowball package.
150    
151    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
152    
153            * man/dissimilarity-methods.Rd: Make clear that any method offered
154            by "dists" from package "cba" can be used.
155    
156    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
157    
158            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
159            to Kurt's LaTeX suggestion. Removed points and underscores in
160            variable names for consistent naming.
161    
162            * DESCRIPTION: Update to version 0.1-2.
163    
164            * man/TextRepository.Rd: Fixed bug in documentation.
165    
166    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
167    
168            * DESCRIPTION: Update to version 0.1-1.
169    
170    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
171    
172            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
173            wordStem.
174    
175    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
176    
177            * R/: Changes due to Kurt's review.
178    
179    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * R/: Implemented improvements based upon comments by David
182            Meyer.
183    
184    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
185    
186            * inst/doc/: Rewrote vignette.
187    
188            * man/: Improved documentation.
189    
190    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
191    
192            * man/: Updated documentation.
193    
194            * DESCRIPTION: Changed package name to "tm". Updated version to
195            0.1 for first CRAN release.
196    
197            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
198            list archive example.
199    
200            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
201            archive example.
202    
203            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
204            from (several mails per box) mbox format to (single mail per file)
205            eml format.
206    
207    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
208    
209            * data/crude.rda: Rebuilt.
210    
211            * data/acq.rda: Rebuilt.
212    
213            * R/reader.R: Factored out reader and parser methods from
214            textdoccol.R.
215    
216            * R/source.R: Factored out Source methods from aobjects.R and
217            textdoccol.R.
218            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
219            feeds.
220    
221            * R/textdoccol.R (DirSource): Added support for recursive
222            traversal of directories.
223    
224    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
225    
226            * R/textdoccol.R ([[): Loads the document corpus automatically
227            into memory upon access.
228            (tm_transform, tm_filter): Removed several checks whether the
229            document is already loaded ([[ ensures this now).
230            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
231            mailing list archive.
232    
233    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/aobjects.R (TextDocument): Is now a virtual class.
236            (Source): Is now a virtual class.
237    
238    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
239    
240            * R/textdoccol.R (c): Support for an arbitrary number of document
241            collections.
242    
243    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
244    
245            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
246            append_meta and remove_meta.
247    
248            * R/textdoccol.R: Removed modify_metadata method.
249    
250            * R/textrepo.R: Removed modify_metadata method.
251    
252            * R/textdoccol.R (remove_meta): Supports removal of document
253            collection metadata and document (= in data frame) metadata.
254    
255    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
256    
257            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
258    
259            * data/crude.rda: Rebuilt.
260    
261            * data/acq.rda: Rebuilt.
262    
263            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
264    
265            * R/textdoccol.R ([): Bug fix for subsetting a document
266            collection's data frame.
267    
268    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
269    
270            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
271            to s_filter.
272    
273            * R/textdoccol.R: Local text documents' metadata can now be copied
274            to a document collection's data frame with prescind_meta.
275    
276    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
277    
278            * R/: Text documents' slot metadata is now accessible in s_filter.
279    
280            * R/: Rewrote s_filter function (still has some restrictions).
281    
282    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
283    
284            * R/: Various fixes in handling metadata.
285    
286            * R/: Added update mechanism for text document collections.
287    
288    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * R/: Merging of document collections now creates a binary tree
291            for reconstructing merged document collections.
292    
293            * R/: Redesign of metadata for document collections.
294    
295    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * R/: Messages now use \code{ngettext}.
298    
299    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * R/: Added functions for modifying and removing metadata.
302    
303    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
304    
305            * man/: Updated some documentation.
306    
307            * R/: Corrected some connection issues.
308    
309            * inst/doc: Worked on the vignette.
310    
311    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
312    
313            * inst/: Added texts and started vignette.
314    
315            * R/: Final changes based upon David's comments.
316    
317    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * NAMESPACE: Corrected exports (generic methods need exportMethods
320            directives!).
321    
322    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * R/: Modified the TextDocCol constructor and various parsers. It
325            is now modular and supports various file formats via plugins (see
326            the new "Source" class).
327    
328    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
329    
330            * man/: Revised documentation after previous code changes.
331    
332    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
333    
334            * R/: Remaining changes as discussed with David.
335    
336    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
337    
338            * R/: Some changes as suggested by David. The rest will follow
339            within the next few days.
340    
341    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * man/: Finished documentation.
344    
345    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
346    
347            * man/: Wrote some documentation.
348    
349    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
350    
351            * R/: Further syntactic sugar in the form of additional assignment and
352            accessor methods.
353    
354    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
355    
356            * R/: Syntactic sugar in the form of "length", "show" and "summary"
357            operators.
358    
359    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
360    
361            * R/: Various updates, mainly to default operators ("[" or "c")
362            and dissimilarities.
363    
364    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
365    
366            * R/: Added similarity functions.
367    
368            * data/: Added English stopwords.
369    
370    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
371    
372            * data/: Examples compiled for new features.
373    
374            * R/: Changes due to new structure.
375    
376            * NAMESPACE: Corrected namespace to reflect new structure.
377    
378            * R/termdocmatrix.R: Adapted for new naming scheme.
379    
380    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
381    
382            * R/textdoccol.R: Adapted code for new class structure. Wrote
383            several transform and filter functions operating on text document
384            collections (alias text document databases).
385    
386            * R/aobjects.R: Adapted class structure with inheritance,
387            repositories and additional meta data. Loading files on demand is
388            now possible.
389    
390    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
391    
392            * R/: Some cosmetic cleanups.
393    
394            * inst/: Removed vignette on clustering. That and much more is now
395            described in the JSS paper on text mining. Based upon that
396            article a more elaborate vignette will be incorporated in the future.
397    
398    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
399    
400            * R/: Updated generic S4 methods to comply with signature changes
401            in newer versions of R (> 2.3).
402    
403    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
404    
405            * ext/R/importRIS.R: Automatic RIS import is now possible.
406    
407    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/textdoccol.R: Added RIS HTML input format.
410    
411    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
412    
413            * R/textdoccol.R: Removed bug that caused invalid text document
414            collections when handling many input files.
415    
416    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
417    
418            * R/textdoccol.R: Restructured and extended file import
