SCM

SCM Repository

[tm] Diff of /trunk/tm/ChangeLog
ViewVC logotype

Diff of /trunk/tm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 22, Sat Nov 19 16:58:34 2005 UTC trunk/tm/ChangeLog revision 717, Fri Mar 16 11:13:04 2007 UTC
# Line 1  Line 1 
1    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
4            language code.
5    
6            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
7            into parserControl argument.
8    
9            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
10    
11    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
12    
13            * Work/tmDataSetup.R: The datasets acq and crude can now be
14            created on the fly.
15    
16            * R/stopwords.R: Introduced a function returning the stopwords for
17            a given language (English, German and French at the moment)
18    
19            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
20            otherwise falls back to Snowball package.
21    
22    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
23    
24            * man/dissimilarity-methods.Rd: Make clear that any method offered
25            by "dists" from package "cba" can be used.
26    
27    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
28    
29            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
30            to Kurt's latex suggestion. Removed points and underscores in
31            variable names for consistent naming.
32    
33            * DESCRIPTION: Update to version 0.1-2.
34    
35            * man/TextRepository.Rd: Fixed bug in documentation.
36    
37    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
38    
39            * DESCRIPTION: Update to version 0.1-1.
40    
41    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
42    
43            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
44            wordStem.
45    
46    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
47    
48            * R/: Changes due to Kurt's review.
49    
50    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
51    
52            * R/: Implemented improvements based upon comments by David
53            Meyer.
54    
55    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
56    
57            * inst/doc/: Rewrote vignette.
58    
59            * man/: Improved documentation.
60    
61    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
62    
63            * man/: Updated documentation.
64    
65            * DESCRIPTION: Changed package name to "tm". Updated version to
66            0.1 for first CRAN release.
67    
68            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
69            list archive example.
70    
71            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
72            archive example.
73    
74            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
75            from (several mails per box) mbox format to (single mail per file)
76            eml format.
77    
78    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
79    
80            * data/crude.rda: Rebuilt.
81    
82            * data/acq.rda: Rebuilt.
83    
84            * R/reader.R: Factored out reader and parser methods from
85            textdoccol.R.
86    
87            * R/source.R: Factored out Source methods from aobjects.R and
88            textdoccol.R.
89            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
90            feeds.
91    
92            * R/textdoccol.R (DirSource): Added support for recursive
93            traversal of directories.
94    
95    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
96    
97            * R/textdoccol.R ([[): Loads the document corpus automatically
98            into memory upon access.
99            (tm_transform, tm_filter): Removed several checks whether the
100            document is already loaded ([[ ensures this now).
101            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
102            mailing list archive.
103    
104    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
105    
106            * R/aobjects.R (TextDocument): Is now a virtual class.
107            (Source): Is now a virtual class.
108    
109    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
110    
111            * R/textdoccol.R (c): Support for an arbitrary number of document
112            collections.
113    
114    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
115    
116            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
117            append_meta and remove_meta.
118    
119            * R/textdoccol.R: Removed modify_metadata method.
120    
121            * R/textrepo.R: Removed modify_metadata method.
122    
123            * R/textdoccol.R (remove_meta): Supports removal of document
124            collection metadata and document (= in data frame) metadata.
125    
126    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
127    
128            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
129    
130            * data/crude.rda: Rebuilt.
131    
132            * data/acq.rda: Rebuilt.
133    
134            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
135    
136            * R/textdoccol.R ([): Bug fix for subsetting a document
137            collection's data frame.
138    
139    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
140    
141            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
142            to s_filter.
143    
144            * R/textdoccol.R: Local text documents' metadata can now be copied
145            to a document collection's data frame with prescind_meta.
146    
147    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
148    
149            * R/: Text documents' slot metadata is now accessible in s_filter.
150    
151            * R/: Rewrote s_filter function (has still some restrictions).
152    
153    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * R/: Various fixes in handling metadata.
156    
157            * R/: Added update mechanism for text document collections.
158    
159    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
160    
161            * R/: Merging of document collections now creates a binary tree
162            for reconstructing merged document collections.
163    
164            * R/: Redesign of metadata for document collections.
165    
166    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
167    
168            * R/: Messages now use \code{ngettext}.
169    
170    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
171    
172            * R/: Added functions for modifying and removing metadata.
173    
174    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
175    
176            * man/: Updated some documentation.
177    
178            * R/: Corrected some connection issues.
179    
180            * inst/doc: Worked on the vignette.
181    
182    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
183    
184            * inst/: Added texts and started vignette.
185    
186            * R/: Final changes based upon David's comments.
187    
188    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
189    
190            * NAMESPACE: Corrected exports (generic methods need exportMethods
191            directives!).
192    
193    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
194    
195            * R/: Modified the TextDocCol constructur and various parsers. It
196            is now modular and supports various file formats via plugins (see
197            the new "Source" class).
198    
199    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
200    
201            * man/: Revised documentation after previous code changes.
202    
203    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
204    
205            * R/: Remaining changes as discussed with David.
206    
207    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
208    
209            * R/: Some changes as suggested by David. The rest will follow
210            within the next days.
211    
212    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * man/: Finished documentation.
215    
216    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
217    
218            * man/: Wrote some documentation.
219    
220    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * R/: Further syntactic sugar in form of additional assignment and
223            accessor methods.
224    
225    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/: Syntactic sugar in form of "length", "show" and "summary"
228            operators.
229    
230    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
231    
232            * R/: Diverse updates. Mainly on default operators ("[" or "c")
233            and dissimilarities.
234    
235    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
236    
237            * R/: Added similarity functions.
238    
239            * data/: Added english stopwords.
240    
241    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * data/: Examples compiled for new features
244    
245            * R/: Changes due to new structure.
246    
247            * NAMESPACE: Corrected namespace to reflect new structure.
248    
249            * R/termdocmatrix.R: Adapted for new naming scheme.
250    
251    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
252    
253            * R/textdoccol.R: Adapted code for new class structure. Wrote
254            several transform and filter functions operating on text document
255            collections (alias text document databases).
256    
257            * R/aobjects.R: Adapted class structure with inheritance,
258            repositories and additional meta data. Loading files on demand is
259            now possible.
260    
261    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * R/: Some cosmetic cleanups.
264    
265            * inst/: Removed vignette on clustering. That and much more is now
266            described in the JSS paper on text mining. Based upon that
267            article an elaborated vignette will be incorporated in the future.
268    
269    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
270    
271            * R/: Updated generic S4 methods to comply with signature changes
272            in newer versions of R (> 2.3)
273    
274    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
275    
276            * ext/R/importRIS.R: Automatic RIS import is now possible.
277    
278    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
279    
280            * R/textdoccol.R: Added RIS HTML input format.
281    
282    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
283    
284            * R/textdoccol.R: Removed bug that caused invalid text document
285            collections when handling many input files.
286    
287    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
288    
289            * R/textdoccol.R: Restructured and extended file import
290            mechanism.
291    
292            * inst/doc/clustering.Rnw: Adapted vignette for use with
293            ReutNews.rda
294    
295            * man/ReutNews.Rd: Documentation for ReutNews.rda
296    
297            * data/ReutNews.rda: A tiny Reuters21578 example data set.
298    
299    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
302            clustering facilities of this package.
303    
304    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * R/aobjects.R: Changed package document structure to avoid class
307            dependency problems.
308    
309    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
312            data set.
313    
314            * Finished documentation and reordered directory structure. Now "R
315            CMD check textmin" works without errors.
316    
317    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * src/: Various splits can now be easily created for the
320            Reuters21578 data set.
321    
322    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * Updated documentation
325    
326    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
327    
328            * Wrote R documentation for some classes and methods.
329    
330  2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332          * R/textdoccol.R: Constructor of textdoccol allows import of CSV          * R/textdoccol.R: Constructor of textdoccol allows import of CSV
333          files. See the questionnaire data/Umfrage.csv for such an example.          files. See the questionnaire data/Umfrage.csv for such an example.
334            We are now able to import files in Reuters-21578 XML format.
335    
336          * Changed class interfaces in various files. Weighting of the text          * Changed class interfaces in various files. Weighting of the text
337          matrix is now possible.          matrix is now possible.

Legend:
Removed from v.22  
changed lines
  Added in v.717

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge