SCM

SCM Repository

[tm] Diff of /trunk/tm/ChangeLog
ViewVC logotype

Diff of /trunk/tm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 26, Sat Dec 3 15:20:17 2005 UTC trunk/tm/ChangeLog revision 721, Wed Mar 21 13:54:43 2007 UTC
# Line 1  Line 1 
1    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
4            using prescindMeta().
5    
6    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
7    
8            * R/textdoccol.R: Improved database support.
9    
10    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
11    
12            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
13    
14            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
15            language code.
16    
17            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
18            into parserControl argument.
19    
20            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
21    
22    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
23    
24            * Work/tmDataSetup.R: The datasets acq and crude can now be
25            created on the fly.
26    
27            * R/stopwords.R: Introduced a function returning the stopwords for
28            a given language (English, German and French at the moment)
29    
30            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
31            otherwise falls back to Snowball package.
32    
33    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
34    
35            * man/dissimilarity-methods.Rd: Make clear that any method offered
36            by "dists" from package "cba" can be used.
37    
38    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
39    
40            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
41            to Kurt's latex suggestion. Removed points and underscores in
42            variable names for consistent naming.
43    
44            * DESCRIPTION: Update to version 0.1-2.
45    
46            * man/TextRepository.Rd: Fixed bug in documentation.
47    
48    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
49    
50            * DESCRIPTION: Update to version 0.1-1.
51    
52    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
53    
54            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
55            wordStem.
56    
57    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
58    
59            * R/: Changes due to Kurt's review.
60    
61    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
62    
63            * R/: Implemented improvements based upon comments by David
64            Meyer.
65    
66    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
67    
68            * inst/doc/: Rewrote vignette.
69    
70            * man/: Improved documentation.
71    
72    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
73    
74            * man/: Updated documentation.
75    
76            * DESCRIPTION: Changed package name to "tm". Updated version to
77            0.1 for first CRAN release.
78    
79            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
80            list archive example.
81    
82            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
83            archive example.
84    
85            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
86            from (several mails per box) mbox format to (single mail per file)
87            eml format.
88    
89    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
90    
91            * data/crude.rda: Rebuilt.
92    
93            * data/acq.rda: Rebuilt.
94    
95            * R/reader.R: Factored out reader and parser methods from
96            textdoccol.R.
97    
98            * R/source.R: Factored out Source methods from aobjects.R and
99            textdoccol.R.
100            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
101            feeds.
102    
103            * R/textdoccol.R (DirSource): Added support for recursive
104            traversal of directories.
105    
106    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
107    
108            * R/textdoccol.R ([[): Loads the document corpus automatically
109            into memory upon access.
110            (tm_transform, tm_filter): Removed several checks whether the
111            document is already loaded ([[ ensures this now).
112            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
113            mailing list archive.
114    
115    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
116    
117            * R/aobjects.R (TextDocument): Is now a virtual class.
118            (Source): Is now a virtual class.
119    
120    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
121    
122            * R/textdoccol.R (c): Support for an arbitrary number of document
123            collections.
124    
125    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
126    
127            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
128            append_meta and remove_meta.
129    
130            * R/textdoccol.R: Removed modify_metadata method.
131    
132            * R/textrepo.R: Removed modify_metadata method.
133    
134            * R/textdoccol.R (remove_meta): Supports removal of document
135            collection metadata and document (= in data frame) metadata.
136    
137    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
138    
139            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
140    
141            * data/crude.rda: Rebuilt.
142    
143            * data/acq.rda: Rebuilt.
144    
145            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
146    
147            * R/textdoccol.R ([): Bug fix for subsetting a document
148            collection's data frame.
149    
150    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
151    
152            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
153            to s_filter.
154    
155            * R/textdoccol.R: Local text documents' metadata can now be copied
156            to a document collection's data frame with prescind_meta.
157    
158    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
159    
160            * R/: Text documents' slot metadata is now accessible in s_filter.
161    
162            * R/: Rewrote s_filter function (has still some restrictions).
163    
164    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
165    
166            * R/: Various fixes in handling metadata.
167    
168            * R/: Added update mechanism for text document collections.
169    
170    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
171    
172            * R/: Merging of document collections now creates a binary tree
173            for reconstructing merged document collections.
174    
175            * R/: Redesign of metadata for document collections.
176    
177    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
178    
179            * R/: Messages now use \code{ngettext}.
180    
181    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * R/: Added functions for modifying and removing metadata.
184    
185    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
186    
187            * man/: Updated some documentation.
188    
189            * R/: Corrected some connection issues.
190    
191            * inst/doc: Worked on the vignette.
192    
193    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
194    
195            * inst/: Added texts and started vignette.
196    
197            * R/: Final changes based upon David's comments.
198    
199    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
200    
201            * NAMESPACE: Corrected exports (generic methods need exportMethods
202            directives!).
203    
204    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
205    
206            * R/: Modified the TextDocCol constructur and various parsers. It
207            is now modular and supports various file formats via plugins (see
208            the new "Source" class).
209    
210    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * man/: Revised documentation after previous code changes.
213    
214    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * R/: Remaining changes as discussed with David.
217    
218    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
219    
220            * R/: Some changes as suggested by David. The rest will follow
221            within the next days.
222    
223    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
224    
225            * man/: Finished documentation.
226    
227    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
228    
229            * man/: Wrote some documentation.
230    
231    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
232    
233            * R/: Further syntactic sugar in form of additional assignment and
234            accessor methods.
235    
236    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
237    
238            * R/: Syntactic sugar in form of "length", "show" and "summary"
239            operators.
240    
241    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * R/: Diverse updates. Mainly on default operators ("[" or "c")
244            and dissimilarities.
245    
246    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
247    
248            * R/: Added similarity functions.
249    
250            * data/: Added english stopwords.
251    
252    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
253    
254            * data/: Examples compiled for new features
255    
256            * R/: Changes due to new structure.
257    
258            * NAMESPACE: Corrected namespace to reflect new structure.
259    
260            * R/termdocmatrix.R: Adapted for new naming scheme.
261    
262    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
263    
264            * R/textdoccol.R: Adapted code for new class structure. Wrote
265            several transform and filter functions operating on text document
266            collections (alias text document databases).
267    
268            * R/aobjects.R: Adapted class structure with inheritance,
269            repositories and additional meta data. Loading files on demand is
270            now possible.
271    
272    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
273    
274            * R/: Some cosmetic cleanups.
275    
276            * inst/: Removed vignette on clustering. That and much more is now
277            described in the JSS paper on text mining. Based upon that
278            article an elaborated vignette will be incorporated in the future.
279    
280    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
281    
282            * R/: Updated generic S4 methods to comply with signature changes
283            in newer versions of R (> 2.3)
284    
285    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
286    
287            * ext/R/importRIS.R: Automatic RIS import is now possible.
288    
289    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/textdoccol.R: Added RIS HTML input format.
292    
293    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/textdoccol.R: Removed bug that caused invalid text document
296            collections when handling many input files.
297    
298    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300            * R/textdoccol.R: Restructured and extended file import
301            mechanism.
302    
303            * inst/doc/clustering.Rnw: Adapted vignette for use with
304            ReutNews.rda
305    
306            * man/ReutNews.Rd: Documentation for ReutNews.rda
307    
308            * data/ReutNews.rda: A tiny Reuters21578 example data set.
309    
310    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
311    
312            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
313            clustering facilities of this package.
314    
315    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
316    
317            * R/aobjects.R: Changed package document structure to avoid class
318            dependency problems.
319    
320    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
321    
322            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
323            data set.
324    
325            * Finished documentation and reordered directory structure. Now "R
326            CMD check textmin" works without errors.
327    
328    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
329    
330            * src/: Various splits can now be easily created for the
331            Reuters21578 data set.
332    
333  2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335          * Updated documentation          * Updated documentation

Legend:
Removed from v.26  
changed lines
  Added in v.721

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge