SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 28, Tue Dec 6 13:46:33 2005 UTC trunk/tm/ChangeLog revision 741, Sat Apr 21 18:35:16 2007 UTC
# Line 1  Line 1 
1    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
4            due to ongoing problems. For our purposes the latter is as useful
5            as the replaced package.
6    
7    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
8    
9            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
10    
11            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
12    
13    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
14    
15            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
16            languages with available stopwords.
17    
18    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
19    
20            * inst/doc/tm.Rnw: Minor corrections in the vignette.
21    
22    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
23    
24            * DESCRIPTION: Update to version 0.2, since a lot of new features
25            have been integrated.
26    
27            * inst/stopwords: Updated existing stopwords and added stopwords
28            for various other languages.
29    
30    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
31    
32            * man/: Updated documentation.
33    
34            * Work/testDb.R: Script to test database stuff.
35    
36            * R/: Fixed various database related bugs. Seems to be rather
37            useable now, i.e., consider as alpha status for now.
38    
39    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
40    
41            * R/: Fixed some bugs related to database support.
42    
43    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
44    
45            * man/: Added a lot of examples to the manuals.
46    
47    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
48    
49            * man/: Updated parts of the documentation.
50    
51            * R/textdoccol.R (asPlain): Added conversion from newsgroup
52            documents to plain text documents.
53    
54    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
55    
56            * R/textdoccol.R: Finished experimental database support. Not yet
57            intensively tested.
58    
59            * R/source.R: Now each source has a default reader.
60    
61            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
62            class anymore.
63    
64            * R/plaintextdoc.R: Custom show method for plain text documents.
65    
66            * R/aobjects.R: Added a class for structured text documents.
67    
68            * R/reader.R: Replaced remaining \code{parser} occurrences with
69            \code{reader}.
70    
71            * R/textdoccol.R (summary): Indent tags.
72    
73            * R/textdoccol.R (removePunctuation): Transform method to remove
74            punctuation marks.
75    
76    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
77    
78            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
79            using prescindMeta().
80    
81    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
82    
83            * R/textdoccol.R: Improved database support.
84    
85    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
86    
87            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
88    
89            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
90            language code.
91    
92            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
93            into parserControl argument.
94    
95            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
96    
97    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
98    
99            * Work/tmDataSetup.R: The datasets acq and crude can now be
100            created on the fly.
101    
102            * R/stopwords.R: Introduced a function returning the stopwords for
103            a given language (English, German and French at the moment)
104    
105            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
106            otherwise falls back to Snowball package.
107    
108    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
109    
110            * man/dissimilarity-methods.Rd: Make clear that any method offered
111            by "dists" from package "cba" can be used.
112    
113    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
114    
115            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
116            to Kurt's latex suggestion. Removed points and underscores in
117            variable names for consistent naming.
118    
119            * DESCRIPTION: Update to version 0.1-2.
120    
121            * man/TextRepository.Rd: Fixed bug in documentation.
122    
123    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
124    
125            * DESCRIPTION: Update to version 0.1-1.
126    
127    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
128    
129            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
130            wordStem.
131    
132    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
133    
134            * R/: Changes due to Kurt's review.
135    
136    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
137    
138            * R/: Implemented improvements based upon comments by David
139            Meyer.
140    
141    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
142    
143            * inst/doc/: Rewrote vignette.
144    
145            * man/: Improved documentation.
146    
147    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
148    
149            * man/: Updated documentation.
150    
151            * DESCRIPTION: Changed package name to "tm". Updated version to
152            0.1 for first CRAN release.
153    
154            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
155            list archive example.
156    
157            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
158            archive example.
159    
160            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
161            from (several mails per box) mbox format to (single mail per file)
162            eml format.
163    
164    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
165    
166            * data/crude.rda: Rebuilt.
167    
168            * data/acq.rda: Rebuilt.
169    
170            * R/reader.R: Factored out reader and parser methods from
171            textdoccol.R.
172    
173            * R/source.R: Factored out Source methods from aobjects.R and
174            textdoccol.R.
175            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
176            feeds.
177    
178            * R/textdoccol.R (DirSource): Added support for recursive
179            traversal of directories.
180    
181    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * R/textdoccol.R ([[): Loads the document corpus automatically
184            into memory upon access.
185            (tm_transform, tm_filter): Removed several checks whether the
186            document is already loaded ([[ ensures this now).
187            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
188            mailing list archive.
189    
190    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
191    
192            * R/aobjects.R (TextDocument): Is now a virtual class.
193            (Source): Is now a virtual class.
194    
195    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
196    
197            * R/textdoccol.R (c): Support for an arbitrary number of document
198            collections.
199    
200    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
201    
202            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
203            append_meta and remove_meta.
204    
205            * R/textdoccol.R: Removed modify_metadata method.
206    
207            * R/textrepo.R: Removed modify_metadata method.
208    
209            * R/textdoccol.R (remove_meta): Supports removal of document
210            collection metadata and document (= in data frame) metadata.
211    
212    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
215    
216            * data/crude.rda: Rebuilt.
217    
218            * data/acq.rda: Rebuilt.
219    
220            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
221    
222            * R/textdoccol.R ([): Bug fix for subsetting a document
223            collection's data frame.
224    
225    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
228            to s_filter.
229    
230            * R/textdoccol.R: Local text documents' metadata can now be copied
231            to a document collection's data frame with prescind_meta.
232    
233    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/: Text documents' slot metadata is now accessible in s_filter.
236    
237            * R/: Rewrote s_filter function (has still some restrictions).
238    
239    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
240    
241            * R/: Various fixes in handling metadata.
242    
243            * R/: Added update mechanism for text document collections.
244    
245    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
246    
247            * R/: Merging of document collections now creates a binary tree
248            for reconstructing merged document collections.
249    
250            * R/: Redesign of metadata for document collections.
251    
252    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
253    
254            * R/: Messages now use \code{ngettext}.
255    
256    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
257    
258            * R/: Added functions for modifying and removing metadata.
259    
260    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * man/: Updated some documentation.
263    
264            * R/: Corrected some connection issues.
265    
266            * inst/doc: Worked on the vignette.
267    
268    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
269    
270            * inst/: Added texts and started vignette.
271    
272            * R/: Final changes based upon David's comments.
273    
274    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
275    
276            * NAMESPACE: Corrected exports (generic methods need exportMethods
277            directives!).
278    
279    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
280    
281            * R/: Modified the TextDocCol constructur and various parsers. It
282            is now modular and supports various file formats via plugins (see
283            the new "Source" class).
284    
285    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
286    
287            * man/: Revised documentation after previous code changes.
288    
289    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/: Remaining changes as discussed with David.
292    
293    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/: Some changes as suggested by David. The rest will follow
296            within the next days.
297    
298    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300            * man/: Finished documentation.
301    
302    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * man/: Wrote some documentation.
305    
306    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
307    
308            * R/: Further syntactic sugar in form of additional assignment and
309            accessor methods.
310    
311    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
312    
313            * R/: Syntactic sugar in form of "length", "show" and "summary"
314            operators.
315    
316    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
317    
318            * R/: Diverse updates. Mainly on default operators ("[" or "c")
319            and dissimilarities.
320    
321    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
322    
323            * R/: Added similarity functions.
324    
325            * data/: Added english stopwords.
326    
327    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
328    
329            * data/: Examples compiled for new features
330    
331            * R/: Changes due to new structure.
332    
333            * NAMESPACE: Corrected namespace to reflect new structure.
334    
335            * R/termdocmatrix.R: Adapted for new naming scheme.
336    
337    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * R/textdoccol.R: Adapted code for new class structure. Wrote
340            several transform and filter functions operating on text document
341            collections (alias text document databases).
342    
343            * R/aobjects.R: Adapted class structure with inheritance,
344            repositories and additional meta data. Loading files on demand is
345            now possible.
346    
347    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/: Some cosmetic cleanups.
350    
351            * inst/: Removed vignette on clustering. That and much more is now
352            described in the JSS paper on text mining. Based upon that
353            article an elaborated vignette will be incorporated in the future.
354    
355    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * R/: Updated generic S4 methods to comply with signature changes
358            in newer versions of R (> 2.3)
359    
360    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * ext/R/importRIS.R: Automatic RIS import is now possible.
363    
364    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
365    
366            * R/textdoccol.R: Added RIS HTML input format.
367    
368    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
369    
370            * R/textdoccol.R: Removed bug that caused invalid text document
371            collections when handling many input files.
372    
373    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
374    
375            * R/textdoccol.R: Restructured and extended file import
376            mechanism.
377    
378            * inst/doc/clustering.Rnw: Adapted vignette for use with
379            ReutNews.rda
380    
381            * man/ReutNews.Rd: Documentation for ReutNews.rda
382    
383            * data/ReutNews.rda: A tiny Reuters21578 example data set.
384    
385    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
386    
387            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
388            clustering facilities of this package.
389    
390    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
391    
392            * R/aobjects.R: Changed package document structure to avoid class
393            dependency problems.
394    
395  2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
396    
397            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
398            data set.
399    
400          * Finished documentation and reordered directory structure. Now "R          * Finished documentation and reordered directory structure. Now "R
401          CMD check textmin" works without errors.          CMD check textmin" works without errors.
402    

Legend:
Removed from v.28  
changed lines
  Added in v.741

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge