[tm] Diff of /trunk/tm/ChangeLog

trunk/R/trunk/ChangeLog revision 39, Sat Jan 21 09:37:39 2006 UTC
trunk/tm/ChangeLog revision 736, Sat Apr 14 17:37:16 2007 UTC
# Line 1  Line 1 
1    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * inst/doc/tm.Rnw: Minor corrections in the vignette.
4    
5    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
6    
7            * DESCRIPTION: Update to version 0.2, since a lot of new features
8            have been integrated.
9    
10            * inst/stopwords: Updated existing stopwords and added stopwords
11            for various other languages.
12    
13    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
14    
15            * man/: Updated documentation.
16    
17            * Work/testDb.R: Script to test database stuff.
18    
19            * R/: Fixed various database-related bugs. Seems to be rather
20            usable now, i.e., consider it alpha status for now.
21    
22    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
23    
24            * R/: Fixed some bugs related to database support.
25    
26    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
27    
28            * man/: Added a lot of examples to the manuals.
29    
30    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
31    
32            * man/: Updated parts of the documentation.
33    
34            * R/textdoccol.R (asPlain): Added conversion from newsgroup
35            documents to plain text documents.
36    
37    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
38    
39            * R/textdoccol.R: Finished experimental database support. Not yet
40            intensively tested.
41    
42            * R/source.R: Now each source has a default reader.
43    
44            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
45            class anymore.
46    
47            * R/plaintextdoc.R: Custom show method for plain text documents.
48    
49            * R/aobjects.R: Added a class for structured text documents.
50    
51            * R/reader.R: Replaced remaining \code{parser} occurrences with
52            \code{reader}.
53    
54            * R/textdoccol.R (summary): Indent tags.
55    
56            * R/textdoccol.R (removePunctuation): Transform method to remove
57            punctuation marks.
58    
59    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
60    
61            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
62            using prescindMeta().
63    
64    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
65    
66            * R/textdoccol.R: Improved database support.
67    
68    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
69    
70            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
71    
72            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
73            language code.
74    
75            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
76            into parserControl argument.
77    
78            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
79    
80    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
81    
82            * Work/tmDataSetup.R: The datasets acq and crude can now be
83            created on the fly.
84    
85            * R/stopwords.R: Introduced a function returning the stopwords for
86            a given language (English, German and French at the moment).
87    
88            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
89            otherwise falls back to Snowball package.
90    
91    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
92    
93            * man/dissimilarity-methods.Rd: Make clear that any method offered
94            by "dists" from package "cba" can be used.
95    
96    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
97    
98            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
99            to Kurt's LaTeX suggestion. Removed points and underscores in
100            variable names for consistent naming.
101    
102            * DESCRIPTION: Update to version 0.1-2.
103    
104            * man/TextRepository.Rd: Fixed bug in documentation.
105    
106    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
107    
108            * DESCRIPTION: Update to version 0.1-1.
109    
110    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
111    
112            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
113            wordStem.
114    
115    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
116    
117            * R/: Changes due to Kurt's review.
118    
119    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
120    
121            * R/: Implemented improvements based upon comments by David
122            Meyer.
123    
124    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
125    
126            * inst/doc/: Rewrote vignette.
127    
128            * man/: Improved documentation.
129    
130    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
131    
132            * man/: Updated documentation.
133    
134            * DESCRIPTION: Changed package name to "tm". Updated version to
135            0.1 for first CRAN release.
136    
137            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
138            list archive example.
139    
140            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
141            archive example.
142    
143            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
144            from (several mails per box) mbox format to (single mail per file)
145            eml format.
146    
147    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
148    
149            * data/crude.rda: Rebuilt.
150    
151            * data/acq.rda: Rebuilt.
152    
153            * R/reader.R: Factored out reader and parser methods from
154            textdoccol.R.
155    
156            * R/source.R: Factored out Source methods from aobjects.R and
157            textdoccol.R.
158            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
159            feeds.
160    
161            * R/textdoccol.R (DirSource): Added support for recursive
162            traversal of directories.
163    
164    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
165    
166            * R/textdoccol.R ([[): Loads the document corpus automatically
167            into memory upon access.
168            (tm_transform, tm_filter): Removed several checks whether the
169            document is already loaded ([[ ensures this now).
170            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
171            mailing list archive.
172    
173    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
174    
175            * R/aobjects.R (TextDocument): Is now a virtual class.
176            (Source): Is now a virtual class.
177    
178    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
179    
180            * R/textdoccol.R (c): Support for an arbitrary number of document
181            collections.
182    
183    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
184    
185            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
186            append_meta and remove_meta.
187    
188            * R/textdoccol.R: Removed modify_metadata method.
189    
190            * R/textrepo.R: Removed modify_metadata method.
191    
192            * R/textdoccol.R (remove_meta): Supports removal of document
193            collection metadata and document (= in data frame) metadata.
194    
195    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
196    
197            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
198    
199            * data/crude.rda: Rebuilt.
200    
201            * data/acq.rda: Rebuilt.
202    
203            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
204    
205            * R/textdoccol.R ([): Bug fix for subsetting a document
206            collection's data frame.
207    
208    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
209    
210            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
211            to s_filter.
212    
213            * R/textdoccol.R: Local text documents' metadata can now be copied
214            to a document collection's data frame with prescind_meta.
215    
216    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
217    
218            * R/: Text documents' slot metadata is now accessible in s_filter.
219    
220            * R/: Rewrote s_filter function (still has some restrictions).
221    
222    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * R/: Various fixes in handling metadata.
225    
226            * R/: Added update mechanism for text document collections.
227    
228    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * R/: Merging of document collections now creates a binary tree
231            for reconstructing merged document collections.
232    
233            * R/: Redesign of metadata for document collections.
234    
235    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
236    
237            * R/: Messages now use \code{ngettext}.
238    
239    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
240    
241            * R/: Added functions for modifying and removing metadata.
242    
243    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
244    
245            * man/: Updated some documentation.
246    
247            * R/: Corrected some connection issues.
248    
249            * inst/doc: Worked on the vignette.
250    
251    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
252    
253            * inst/: Added texts and started vignette.
254    
255            * R/: Final changes based upon David's comments.
256    
257    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * NAMESPACE: Corrected exports (generic methods need exportMethods
260            directives!).
261    
262    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
263    
264            * R/: Modified the TextDocCol constructor and various parsers. It
265            is now modular and supports various file formats via plugins (see
266            the new "Source" class).
267    
268    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
269    
270            * man/: Revised documentation after previous code changes.
271    
272    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
273    
274            * R/: Remaining changes as discussed with David.
275    
276    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
277    
278            * R/: Some changes as suggested by David. The rest will follow
279            within the next few days.
280    
281    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
282    
283            * man/: Finished documentation.
284    
285    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
286    
287            * man/: Wrote some documentation.
288    
289    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/: Further syntactic sugar in form of additional assignment and
292            accessor methods.
293    
294    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
295    
296            * R/: Syntactic sugar in form of "length", "show" and "summary"
297            operators.
298    
299    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * R/: Diverse updates. Mainly on default operators ("[" or "c")
302            and dissimilarities.
303    
304    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * R/: Added similarity functions.
307    
308            * data/: Added English stopwords.
309    
310    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
311    
312            * data/: Examples compiled for new features.
313    
314            * R/: Changes due to new structure.
315    
316            * NAMESPACE: Corrected namespace to reflect new structure.
317    
318            * R/termdocmatrix.R: Adapted for new naming scheme.
319    
320    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
321    
322            * R/textdoccol.R: Adapted code for new class structure. Wrote
323            several transform and filter functions operating on text document
324            collections (alias text document databases).
325    
326            * R/aobjects.R: Adapted class structure with inheritance,
327            repositories and additional meta data. Loading files on demand is
328            now possible.
329    
330    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * R/: Some cosmetic cleanups.
333    
334            * inst/: Removed vignette on clustering. That and much more is now
335            described in the JSS paper on text mining. Based upon that
336            article an elaborate vignette will be incorporated in the future.
337    
338    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
339    
340            * R/: Updated generic S4 methods to comply with signature changes
341            in newer versions of R (> 2.3).
342    
343    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * ext/R/importRIS.R: Automatic RIS import is now possible.
346    
347    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/textdoccol.R: Added RIS HTML input format.
350    
351  2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
352    
353          * R/textdoccol.R: Removed bug that caused invalid text document

