SCM

SCM Repository

[tm] Diff of /trunk/tm/ChangeLog
ViewVC logotype

Diff of /trunk/tm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 20, Tue Nov 8 16:40:52 2005 UTC trunk/tm/ChangeLog revision 725, Fri Apr 6 01:10:28 2007 UTC
# Line 1  Line 1 
1    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * man/: Updated parts of the documentation.
4    
5            * R/textdoccol.R (asPlain): Added conversion from newsgroup
6            documents to plain text documents.
7    
8    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
9    
10            * R/textdoccol.R: Finished experimental database support. Not yet
11            intensively tested.
12    
13            * R/source.R: Now each source has a default reader.
14    
15            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
16            class anymore.
17    
18            * R/plaintextdoc.R: Custom show method for plain text documents.
19    
20            * R/aobjects.R: Added a class for structured text documents.
21    
22            * R/reader.R: Replaced remaining \code{parser} occurrences with
23            \code{reader}.
24    
25            * R/textdoccol.R (summary): Indent tags.
26    
27            * R/textdoccol.R (removePunctuation): Transform method to remove
28            punctuation marks.
29    
30    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
31    
32            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
33            using prescindMeta().
34    
35    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
36    
37            * R/textdoccol.R: Improved database support.
38    
39    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
40    
41            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
42    
43            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
44            language code.
45    
46            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
47            into parserControl argument.
48    
49            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
50    
51    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
52    
53            * Work/tmDataSetup.R: The datasets acq and crude can now be
54            created on the fly.
55    
56            * R/stopwords.R: Introduced a function returning the stopwords for
57            a given language (English, German and French at the moment)
58    
59            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
60            otherwise falls back to Snowball package.
61    
62    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
63    
64            * man/dissimilarity-methods.Rd: Make clear that any method offered
65            by "dists" from package "cba" can be used.
66    
67    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
68    
69            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
70            to Kurt's latex suggestion. Removed points and underscores in
71            variable names for consistent naming.
72    
73            * DESCRIPTION: Update to version 0.1-2.
74    
75            * man/TextRepository.Rd: Fixed bug in documentation.
76    
77    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
78    
79            * DESCRIPTION: Update to version 0.1-1.
80    
81    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
82    
83            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
84            wordStem.
85    
86    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
87    
88            * R/: Changes due to Kurt's review.
89    
90    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
91    
92            * R/: Implemented improvements based upon comments by David
93            Meyer.
94    
95    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
96    
97            * inst/doc/: Rewrote vignette.
98    
99            * man/: Improved documentation.
100    
101    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
102    
103            * man/: Updated documentation.
104    
105            * DESCRIPTION: Changed package name to "tm". Updated version to
106            0.1 for first CRAN release.
107    
108            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
109            list archive example.
110    
111            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
112            archive example.
113    
114            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
115            from (several mails per box) mbox format to (single mail per file)
116            eml format.
117    
118    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
119    
120            * data/crude.rda: Rebuilt.
121    
122            * data/acq.rda: Rebuilt.
123    
124            * R/reader.R: Factored out reader and parser methods from
125            textdoccol.R.
126    
127            * R/source.R: Factored out Source methods from aobjects.R and
128            textdoccol.R.
129            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
130            feeds.
131    
132            * R/textdoccol.R (DirSource): Added support for recursive
133            traversal of directories.
134    
135    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
136    
137            * R/textdoccol.R ([[): Loads the document corpus automatically
138            into memory upon access.
139            (tm_transform, tm_filter): Removed several checks whether the
140            document is already loaded ([[ ensures this now).
141            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
142            mailing list archive.
143    
144    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
145    
146            * R/aobjects.R (TextDocument): Is now a virtual class.
147            (Source): Is now a virtual class.
148    
149    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
150    
151            * R/textdoccol.R (c): Support for an arbitrary number of document
152            collections.
153    
154    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
155    
156            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
157            append_meta and remove_meta.
158    
159            * R/textdoccol.R: Removed modify_metadata method.
160    
161            * R/textrepo.R: Removed modify_metadata method.
162    
163            * R/textdoccol.R (remove_meta): Supports removal of document
164            collection metadata and document (= in data frame) metadata.
165    
166    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
167    
168            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
169    
170            * data/crude.rda: Rebuilt.
171    
172            * data/acq.rda: Rebuilt.
173    
174            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
175    
176            * R/textdoccol.R ([): Bug fix for subsetting a document
177            collection's data frame.
178    
179    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
182            to s_filter.
183    
184            * R/textdoccol.R: Local text documents' metadata can now be copied
185            to a document collection's data frame with prescind_meta.
186    
187    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
188    
189            * R/: Text documents' slot metadata is now accessible in s_filter.
190    
191            * R/: Rewrote s_filter function (has still some restrictions).
192    
193    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
194    
195            * R/: Various fixes in handling metadata.
196    
197            * R/: Added update mechanism for text document collections.
198    
199    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
200    
201            * R/: Merging of document collections now creates a binary tree
202            for reconstructing merged document collections.
203    
204            * R/: Redesign of metadata for document collections.
205    
206    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
207    
208            * R/: Messages now use \code{ngettext}.
209    
210    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * R/: Added functions for modifying and removing metadata.
213    
214    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * man/: Updated some documentation.
217    
218            * R/: Corrected some connection issues.
219    
220            * inst/doc: Worked on the vignette.
221    
222    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * inst/: Added texts and started vignette.
225    
226            * R/: Final changes based upon David's comments.
227    
228    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * NAMESPACE: Corrected exports (generic methods need exportMethods
231            directives!).
232    
233    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/: Modified the TextDocCol constructur and various parsers. It
236            is now modular and supports various file formats via plugins (see
237            the new "Source" class).
238    
239    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
240    
241            * man/: Revised documentation after previous code changes.
242    
243    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
244    
245            * R/: Remaining changes as discussed with David.
246    
247    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
248    
249            * R/: Some changes as suggested by David. The rest will follow
250            within the next days.
251    
252    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
253    
254            * man/: Finished documentation.
255    
256    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
257    
258            * man/: Wrote some documentation.
259    
260    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * R/: Further syntactic sugar in form of additional assignment and
263            accessor methods.
264    
265    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/: Syntactic sugar in form of "length", "show" and "summary"
268            operators.
269    
270    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
271    
272            * R/: Diverse updates. Mainly on default operators ("[" or "c")
273            and dissimilarities.
274    
275    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * R/: Added similarity functions.
278    
279            * data/: Added english stopwords.
280    
281    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
282    
283            * data/: Examples compiled for new features
284    
285            * R/: Changes due to new structure.
286    
287            * NAMESPACE: Corrected namespace to reflect new structure.
288    
289            * R/termdocmatrix.R: Adapted for new naming scheme.
290    
291    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
292    
293            * R/textdoccol.R: Adapted code for new class structure. Wrote
294            several transform and filter functions operating on text document
295            collections (alias text document databases).
296    
297            * R/aobjects.R: Adapted class structure with inheritance,
298            repositories and additional meta data. Loading files on demand is
299            now possible.
300    
301    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
302    
303            * R/: Some cosmetic cleanups.
304    
305            * inst/: Removed vignette on clustering. That and much more is now
306            described in the JSS paper on text mining. Based upon that
307            article an elaborated vignette will be incorporated in the future.
308    
309    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * R/: Updated generic S4 methods to comply with signature changes
312            in newer versions of R (> 2.3)
313    
314    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
315    
316            * ext/R/importRIS.R: Automatic RIS import is now possible.
317    
318    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
319    
320            * R/textdoccol.R: Added RIS HTML input format.
321    
322    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * R/textdoccol.R: Removed bug that caused invalid text document
325            collections when handling many input files.
326    
327    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
328    
329            * R/textdoccol.R: Restructured and extended file import
330            mechanism.
331    
332            * inst/doc/clustering.Rnw: Adapted vignette for use with
333            ReutNews.rda
334    
335            * man/ReutNews.Rd: Documentation for ReutNews.rda
336    
337            * data/ReutNews.rda: A tiny Reuters21578 example data set.
338    
339    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
340    
341            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
342            clustering facilities of this package.
343    
344    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
345    
346            * R/aobjects.R: Changed package document structure to avoid class
347            dependency problems.
348    
349    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
350    
351            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
352            data set.
353    
354            * Finished documentation and reordered directory structure. Now "R
355            CMD check textmin" works without errors.
356    
357    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
358    
359            * src/: Various splits can now be easily created for the
360            Reuters21578 data set.
361    
362    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
363    
364            * Updated documentation
365    
366    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
367    
368            * Wrote R documentation for some classes and methods.
369    
370    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
371    
372            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
373            files. See the questionnaire data/Umfrage.csv for such an example.
374            We are now able to import files in Reuters-21578 XML format.
375    
376            * Changed class interfaces in various files. Weighting of the text
377            matrix is now possible.
378    
379  2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
380    
381          * R/textdoccol.R: One can build term-document matrices if          * R/textdoccol.R: One can build term-document matrices if

Legend:
Removed from v.20  
changed lines
  Added in v.725

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge