[tm] Diff of /trunk/tm/ChangeLog

trunk/R/trunk/ChangeLog revision 28, Tue Dec 6 13:46:33 2005 UTC
trunk/tm/ChangeLog revision 719, Sun Mar 18 09:24:47 2007 UTC
# Line 1  Line 1 
1    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * R/textdoccol.R: Improved database support.
4    
5    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
6    
7            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
8    
9            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
10            language code.
11    
12            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
13            into parserControl argument.
14    
15            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
16    
17    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
18    
19            * Work/tmDataSetup.R: The datasets acq and crude can now be
20            created on the fly.
21    
22            * R/stopwords.R: Introduced a function returning the stopwords for
23            a given language (English, German and French at the moment).
24    
25            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
26            otherwise falls back to Snowball package.
27    
28    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
29    
30            * man/dissimilarity-methods.Rd: Make clear that any method offered
31            by "dists" from package "cba" can be used.
32    
33    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
34    
35            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
36            to Kurt's latex suggestion. Removed points and underscores in
37            variable names for consistent naming.
38    
39            * DESCRIPTION: Update to version 0.1-2.
40    
41            * man/TextRepository.Rd: Fixed bug in documentation.
42    
43    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
44    
45            * DESCRIPTION: Update to version 0.1-1.
46    
47    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
48    
49            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
50            wordStem.
51    
52    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
53    
54            * R/: Changes due to Kurt's review.
55    
56    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
57    
58            * R/: Implemented improvements based upon comments by David
59            Meyer.
60    
61    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
62    
63            * inst/doc/: Rewrote vignette.
64    
65            * man/: Improved documentation.
66    
67    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
68    
69            * man/: Updated documentation.
70    
71            * DESCRIPTION: Changed package name to "tm". Updated version to
72            0.1 for first CRAN release.
73    
74            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
75            list archive example.
76    
77            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
78            archive example.
79    
80            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
81            from (several mails per box) mbox format to (single mail per file)
82            eml format.
83    
84    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
85    
86            * data/crude.rda: Rebuilt.
87    
88            * data/acq.rda: Rebuilt.
89    
90            * R/reader.R: Factored out reader and parser methods from
91            textdoccol.R.
92    
93            * R/source.R: Factored out Source methods from aobjects.R and
94            textdoccol.R.
95            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
96            feeds.
97    
98            * R/textdoccol.R (DirSource): Added support for recursive
99            traversal of directories.
100    
101    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
102    
103            * R/textdoccol.R ([[): Loads the document corpus automatically
104            into memory upon access.
105            (tm_transform, tm_filter): Removed several checks whether the
106            document is already loaded ([[ ensures this now).
107            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
108            mailing list archive.
109    
110    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
111    
112            * R/aobjects.R (TextDocument): Is now a virtual class.
113            (Source): Is now a virtual class.
114    
115    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
116    
117            * R/textdoccol.R (c): Support for an arbitrary number of document
118            collections.
119    
120    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
121    
122            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
123            append_meta and remove_meta.
124    
125            * R/textdoccol.R: Removed modify_metadata method.
126    
127            * R/textrepo.R: Removed modify_metadata method.
128    
129            * R/textdoccol.R (remove_meta): Supports removal of document
130            collection metadata and document (= in data frame) metadata.
131    
132    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
133    
134            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
135    
136            * data/crude.rda: Rebuilt.
137    
138            * data/acq.rda: Rebuilt.
139    
140            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
141    
142            * R/textdoccol.R ([): Bug fix for subsetting a document
143            collection's data frame.
144    
145    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
146    
147            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
148            to s_filter.
149    
150            * R/textdoccol.R: Local text documents' metadata can now be copied
151            to a document collection's data frame with prescind_meta.
152    
153    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * R/: Text documents' slot metadata is now accessible in s_filter.
156    
157            * R/: Rewrote s_filter function (still has some restrictions).
158    
159    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
160    
161            * R/: Various fixes in handling metadata.
162    
163            * R/: Added update mechanism for text document collections.
164    
165    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
166    
167            * R/: Merging of document collections now creates a binary tree
168            for reconstructing merged document collections.
169    
170            * R/: Redesign of metadata for document collections.
171    
172    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * R/: Messages now use \code{ngettext}.
175    
176    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
177    
178            * R/: Added functions for modifying and removing metadata.
179    
180    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
181    
182            * man/: Updated some documentation.
183    
184            * R/: Corrected some connection issues.
185    
186            * inst/doc: Worked on the vignette.
187    
188    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
189    
190            * inst/: Added texts and started vignette.
191    
192            * R/: Final changes based upon David's comments.
193    
194    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
195    
196            * NAMESPACE: Corrected exports (generic methods need exportMethods
197            directives!).
198    
199    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
200    
201            * R/: Modified the TextDocCol constructor and various parsers. It
202            is now modular and supports various file formats via plugins (see
203            the new "Source" class).
204    
205    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
206    
207            * man/: Revised documentation after previous code changes.
208    
209    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/: Remaining changes as discussed with David.
212    
213    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
214    
215            * R/: Some changes as suggested by David. The rest will follow
216            within the next few days.
217    
218    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
219    
220            * man/: Finished documentation.
221    
222    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * man/: Wrote some documentation.
225    
226    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
227    
228            * R/: Further syntactic sugar in form of additional assignment and
229            accessor methods.
230    
231    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
232    
233            * R/: Syntactic sugar in form of "length", "show" and "summary"
234            operators.
235    
236    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
237    
238            * R/: Various updates, mainly to default operators ("[" or "c")
239            and dissimilarities.
240    
241    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * R/: Added similarity functions.
244    
245            * data/: Added English stopwords.
246    
247    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
248    
249            * data/: Examples compiled for new features.
250    
251            * R/: Changes due to new structure.
252    
253            * NAMESPACE: Corrected namespace to reflect new structure.
254    
255            * R/termdocmatrix.R: Adapted for new naming scheme.
256    
257    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * R/textdoccol.R: Adapted code for new class structure. Wrote
260            several transform and filter functions operating on text document
261            collections (alias text document databases).
262    
263            * R/aobjects.R: Adapted class structure with inheritance,
264            repositories and additional meta data. Loading files on demand is
265            now possible.
266    
267    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
268    
269            * R/: Some cosmetic cleanups.
270    
271            * inst/: Removed vignette on clustering. That and much more is now
272            described in the JSS paper on text mining. Based upon that
273            article an elaborated vignette will be incorporated in the future.
274    
275    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * R/: Updated generic S4 methods to comply with signature changes
278            in newer versions of R (> 2.3).
279    
280    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
281    
282            * ext/R/importRIS.R: Automatic RIS import is now possible.
283    
284    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/textdoccol.R: Added RIS HTML input format.
287    
288    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * R/textdoccol.R: Removed bug that caused invalid text document
291            collections when handling many input files.
292    
293    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/textdoccol.R: Restructured and extended file import
296            mechanism.
297    
298            * inst/doc/clustering.Rnw: Adapted vignette for use with
299            ReutNews.rda
300    
301            * man/ReutNews.Rd: Documentation for ReutNews.rda
302    
303            * data/ReutNews.rda: A tiny Reuters21578 example data set.
304    
305    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
306    
307            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
308            clustering facilities of this package.
309    
310    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
311    
312            * R/aobjects.R: Changed package document structure to avoid class
313            dependency problems.
314    
315    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
316    
317            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
318            data set.
319    
320            * Finished documentation and reordered directory structure. Now "R
321            CMD check textmin" works without errors.
322    
