SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC trunk/tm/ChangeLog revision 731, Wed Apr 11 14:01:40 2007 UTC
# Line 1  Line 1 
1    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * inst/stopwords: Updated stopwords.
4    
5    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
6    
7            * man/: Updated documentation.
8    
9            * Work/testDb.R: Script to test database stuff.
10    
11            * R/: Fixed various database related bugs. Seems to be rather
12            useable now, i.e., consider as alpha status for now.
13    
14    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
15    
16            * R/: Fixed some bugs related to database support.
17    
18    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
19    
20            * man/: Added a lot of examples to the manuals.
21    
22    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
23    
24            * man/: Updated parts of the documentation.
25    
26            * R/textdoccol.R (asPlain): Added conversion from newsgroup
27            documents to plain text documents.
28    
29    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
30    
31            * R/textdoccol.R: Finished experimental database support. Not yet
32            intensively tested.
33    
34            * R/source.R: Now each source has a default reader.
35    
36            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
37            class anymore.
38    
39            * R/plaintextdoc.R: Custom show method for plain text documents.
40    
41            * R/aobjects.R: Added a class for structured text documents.
42    
43            * R/reader.R: Replaced remaining \code{parser} occurrences with
44            \code{reader}.
45    
46            * R/textdoccol.R (summary): Indent tags.
47    
48            * R/textdoccol.R (removePunctuation): Transform method to remove
49            punctuation marks.
50    
51    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
52    
53            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
54            using prescindMeta().
55    
56    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
57    
58            * R/textdoccol.R: Improved database support.
59    
60    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
61    
62            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
63    
64            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
65            language code.
66    
67            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
68            into parserControl argument.
69    
70            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
71    
72    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
73    
74            * Work/tmDataSetup.R: The datasets acq and crude can now be
75            created on the fly.
76    
77            * R/stopwords.R: Introduced a function returning the stopwords for
78            a given language (English, German and French at the moment)
79    
80            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
81            otherwise falls back to Snowball package.
82    
83    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
84    
85            * man/dissimilarity-methods.Rd: Make clear that any method offered
86            by "dists" from package "cba" can be used.
87    
88    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
89    
90            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
91            to Kurt's latex suggestion. Removed points and underscores in
92            variable names for consistent naming.
93    
94            * DESCRIPTION: Update to version 0.1-2.
95    
96            * man/TextRepository.Rd: Fixed bug in documentation.
97    
98    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
99    
100            * DESCRIPTION: Update to version 0.1-1.
101    
102    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
103    
104            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
105            wordStem.
106    
107    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
108    
109            * R/: Changes due to Kurt's review.
110    
111    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
112    
113            * R/: Implemented improvements based upon comments by David
114            Meyer.
115    
116    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
117    
118            * inst/doc/: Rewrote vignette.
119    
120            * man/: Improved documentation.
121    
122    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
123    
124            * man/: Updated documentation.
125    
126            * DESCRIPTION: Changed package name to "tm". Updated version to
127            0.1 for first CRAN release.
128    
129            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
130            list archive example.
131    
132            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
133            archive example.
134    
135            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
136            from (several mails per box) mbox format to (single mail per file)
137            eml format.
138    
139    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
140    
141            * data/crude.rda: Rebuilt.
142    
143            * data/acq.rda: Rebuilt.
144    
145            * R/reader.R: Factored out reader and parser methods from
146            textdoccol.R.
147    
148            * R/source.R: Factored out Source methods from aobjects.R and
149            textdoccol.R.
150            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
151            feeds.
152    
153            * R/textdoccol.R (DirSource): Added support for recursive
154            traversal of directories.
155    
156    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
157    
158            * R/textdoccol.R ([[): Loads the document corpus automatically
159            into memory upon access.
160            (tm_transform, tm_filter): Removed several checks whether the
161            document is already loaded ([[ ensures this now).
162            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
163            mailing list archive.
164    
165    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
166    
167            * R/aobjects.R (TextDocument): Is now a virtual class.
168            (Source): Is now a virtual class.
169    
170    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
171    
172            * R/textdoccol.R (c): Support for an arbitrary number of document
173            collections.
174    
175    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
176    
177            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
178            append_meta and remove_meta.
179    
180            * R/textdoccol.R: Removed modify_metadata method.
181    
182            * R/textrepo.R: Removed modify_metadata method.
183    
184            * R/textdoccol.R (remove_meta): Supports removal of document
185            collection metadata and document (= in data frame) metadata.
186    
187    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
188    
189            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
190    
191            * data/crude.rda: Rebuilt.
192    
193            * data/acq.rda: Rebuilt.
194    
195            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
196    
197            * R/textdoccol.R ([): Bug fix for subsetting a document
198            collection's data frame.
199    
200    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
201    
202            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
203            to s_filter.
204    
205            * R/textdoccol.R: Local text documents' metadata can now be copied
206            to a document collection's data frame with prescind_meta.
207    
208    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
209    
210            * R/: Text documents' slot metadata is now accessible in s_filter.
211    
212            * R/: Rewrote s_filter function (has still some restrictions).
213    
214    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * R/: Various fixes in handling metadata.
217    
218            * R/: Added update mechanism for text document collections.
219    
220    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * R/: Merging of document collections now creates a binary tree
223            for reconstructing merged document collections.
224    
225            * R/: Redesign of metadata for document collections.
226    
227    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
228    
229            * R/: Messages now use \code{ngettext}.
230    
231    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
232    
233            * R/: Added functions for modifying and removing metadata.
234    
235    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
236    
237            * man/: Updated some documentation.
238    
239            * R/: Corrected some connection issues.
240    
241            * inst/doc: Worked on the vignette.
242    
243    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
244    
245            * inst/: Added texts and started vignette.
246    
247            * R/: Final changes based upon David's comments.
248    
249    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
250    
251            * NAMESPACE: Corrected exports (generic methods need exportMethods
252            directives!).
253    
254    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * R/: Modified the TextDocCol constructur and various parsers. It
257            is now modular and supports various file formats via plugins (see
258            the new "Source" class).
259    
260    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * man/: Revised documentation after previous code changes.
263    
264    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
265    
266            * R/: Remaining changes as discussed with David.
267    
268    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
269    
270            * R/: Some changes as suggested by David. The rest will follow
271            within the next days.
272    
273    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * man/: Finished documentation.
276    
277    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
278    
279            * man/: Wrote some documentation.
280    
281    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
282    
283            * R/: Further syntactic sugar in form of additional assignment and
284            accessor methods.
285    
286    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
287    
288            * R/: Syntactic sugar in form of "length", "show" and "summary"
289            operators.
290    
291    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
292    
293            * R/: Diverse updates. Mainly on default operators ("[" or "c")
294            and dissimilarities.
295    
296    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
297    
298            * R/: Added similarity functions.
299    
300            * data/: Added english stopwords.
301    
302    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * data/: Examples compiled for new features
305    
306            * R/: Changes due to new structure.
307    
308            * NAMESPACE: Corrected namespace to reflect new structure.
309    
310            * R/termdocmatrix.R: Adapted for new naming scheme.
311    
312    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
313    
314            * R/textdoccol.R: Adapted code for new class structure. Wrote
315            several transform and filter functions operating on text document
316            collections (alias text document databases).
317    
318            * R/aobjects.R: Adapted class structure with inheritance,
319            repositories and additional meta data. Loading files on demand is
320            now possible.
321    
322    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * R/: Some cosmetic cleanups.
325    
326            * inst/: Removed vignette on clustering. That and much more is now
327            described in the JSS paper on text mining. Based upon that
328            article an elaborated vignette will be incorporated in the future.
329    
330    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * R/: Updated generic S4 methods to comply with signature changes
333            in newer versions of R (> 2.3)
334    
335    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
336    
337            * ext/R/importRIS.R: Automatic RIS import is now possible.
338    
339    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
340    
341            * R/textdoccol.R: Added RIS HTML input format.
342    
343    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * R/textdoccol.R: Removed bug that caused invalid text document
346            collections when handling many input files.
347    
348    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
349    
350            * R/textdoccol.R: Restructured and extended file import
351            mechanism.
352    
353            * inst/doc/clustering.Rnw: Adapted vignette for use with
354            ReutNews.rda
355    
356            * man/ReutNews.Rd: Documentation for ReutNews.rda
357    
358            * data/ReutNews.rda: A tiny Reuters21578 example data set.
359    
360    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
363            clustering facilities of this package.
364    
365    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * R/aobjects.R: Changed package document structure to avoid class
368            dependency problems.
369    
370    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
371    
372            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
373            data set.
374    
375            * Finished documentation and reordered directory structure. Now "R
376            CMD check textmin" works without errors.
377    
378    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
379    
380            * src/: Various splits can now be easily created for the
381            Reuters21578 data set.
382    
383    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
384    
385            * Updated documentation
386    
387    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
388    
389            * Wrote R documentation for some classes and methods.
390    
391    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
392    
393            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
394            files. See the questionnaire data/Umfrage.csv for such an example.
395            We are now able to import files in Reuters-21578 XML format.
396    
397            * Changed class interfaces in various files. Weighting of the text
398            matrix is now possible.
399    
400    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
401    
402            * R/textdoccol.R: One can build term-document matrices if
403            nessecary (with buildTDM(...)) and fill the field tdm from a text
404            document collection with it.
405    
406            * R/textmatrix.R: Wrote S4 class for term-document matrices.
407    
408    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
409    
410            * R/textdoccol.R: We now can read in a whole XML file with several
411            news items.
412    
413  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
414    
415          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.731

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge