SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC trunk/tm/ChangeLog revision 723, Sun Apr 1 16:12:26 2007 UTC
# Line 1  Line 1 
1    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * R/source.R: Now each source has a default reader.
4    
5            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
6            class anymore.
7    
8            * R/plaintextdoc.R: Custom show method for plain text documents.
9    
10            * R/aobjects.R: Added a class for structured text documents.
11    
12            * R/reader.R: Replaced remaining \code{parser} occurrences with
13            \code{reader}.
14    
15            * R/textdoccol.R (summary): Indent tags.
16    
17            * R/textdocco.R (removePunctuation): Transform method to remove
18            punctuation marks.
19    
20    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
21    
22            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
23            using prescindMeta().
24    
25    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
26    
27            * R/textdoccol.R: Improved database support.
28    
29    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
30    
31            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
32    
33            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
34            language code.
35    
36            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
37            into parserControl argument.
38    
39            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
40    
41    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
42    
43            * Work/tmDataSetup.R: The datasets acq and crude can now be
44            created on the fly.
45    
46            * R/stopwords.R: Introduced a function returning the stopwords for
47            a given language (English, German and French at the moment)
48    
49            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
50            otherwise falls back to Snowball package.
51    
52    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
53    
54            * man/dissimilarity-methods.Rd: Make clear that any method offered
55            by "dists" from package "cba" can be used.
56    
57    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
58    
59            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
60            to Kurt's latex suggestion. Removed points and underscores in
61            variable names for consistent naming.
62    
63            * DESCRIPTION: Update to version 0.1-2.
64    
65            * man/TextRepository.Rd: Fixed bug in documentation.
66    
67    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
68    
69            * DESCRIPTION: Update to version 0.1-1.
70    
71    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
72    
73            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
74            wordStem.
75    
76    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
77    
78            * R/: Changes due to Kurt's review.
79    
80    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
81    
82            * R/: Implemented improvements based upon comments by David
83            Meyer.
84    
85    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
86    
87            * inst/doc/: Rewrote vignette.
88    
89            * man/: Improved documentation.
90    
91    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
92    
93            * man/: Updated documentation.
94    
95            * DESCRIPTION: Changed package name to "tm". Updated version to
96            0.1 for first CRAN release.
97    
98            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
99            list archive example.
100    
101            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
102            archive example.
103    
104            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
105            from (several mails per box) mbox format to (single mail per file)
106            eml format.
107    
108    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
109    
110            * data/crude.rda: Rebuilt.
111    
112            * data/acq.rda: Rebuilt.
113    
114            * R/reader.R: Factored out reader and parser methods from
115            textdoccol.R.
116    
117            * R/source.R: Factored out Source methods from aobjects.R and
118            textdoccol.R.
119            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
120            feeds.
121    
122            * R/textdoccol.R (DirSource): Added support for recursive
123            traversal of directories.
124    
125    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
126    
127            * R/textdoccol.R ([[): Loads the document corpus automatically
128            into memory upon access.
129            (tm_transform, tm_filter): Removed several checks whether the
130            document is already loaded ([[ ensures this now).
131            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
132            mailing list archive.
133    
134    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
135    
136            * R/aobjects.R (TextDocument): Is now a virtual class.
137            (Source): Is now a virtual class.
138    
139    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
140    
141            * R/textdoccol.R (c): Support for an arbitrary number of document
142            collections.
143    
144    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
145    
146            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
147            append_meta and remove_meta.
148    
149            * R/textdoccol.R: Removed modify_metadata method.
150    
151            * R/textrepo.R: Removed modify_metadata method.
152    
153            * R/textdoccol.R (remove_meta): Supports removal of document
154            collection metadata and document (= in data frame) metadata.
155    
156    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
157    
158            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
159    
160            * data/crude.rda: Rebuilt.
161    
162            * data/acq.rda: Rebuilt.
163    
164            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
165    
166            * R/textdoccol.R ([): Bug fix for subsetting a document
167            collection's data frame.
168    
169    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
170    
171            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
172            to s_filter.
173    
174            * R/textdoccol.R: Local text documents' metadata can now be copied
175            to a document collection's data frame with prescind_meta.
176    
177    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
178    
179            * R/: Text documents' slot metadata is now accessible in s_filter.
180    
181            * R/: Rewrote s_filter function (has still some restrictions).
182    
183    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
184    
185            * R/: Various fixes in handling metadata.
186    
187            * R/: Added update mechanism for text document collections.
188    
189    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
190    
191            * R/: Merging of document collections now creates a binary tree
192            for reconstructing merged document collections.
193    
194            * R/: Redesign of metadata for document collections.
195    
196    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
197    
198            * R/: Messages now use \code{ngettext}.
199    
200    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
201    
202            * R/: Added functions for modifying and removing metadata.
203    
204    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
205    
206            * man/: Updated some documentation.
207    
208            * R/: Corrected some connection issues.
209    
210            * inst/doc: Worked on the vignette.
211    
212    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * inst/: Added texts and started vignette.
215    
216            * R/: Final changes based upon David's comments.
217    
218    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
219    
220            * NAMESPACE: Corrected exports (generic methods need exportMethods
221            directives!).
222    
223    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
224    
225            * R/: Modified the TextDocCol constructur and various parsers. It
226            is now modular and supports various file formats via plugins (see
227            the new "Source" class).
228    
229    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
230    
231            * man/: Revised documentation after previous code changes.
232    
233    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/: Remaining changes as discussed with David.
236    
237    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
238    
239            * R/: Some changes as suggested by David. The rest will follow
240            within the next days.
241    
242    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
243    
244            * man/: Finished documentation.
245    
246    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
247    
248            * man/: Wrote some documentation.
249    
250    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
251    
252            * R/: Further syntactic sugar in form of additional assignment and
253            accessor methods.
254    
255    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
256    
257            * R/: Syntactic sugar in form of "length", "show" and "summary"
258            operators.
259    
260    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * R/: Diverse updates. Mainly on default operators ("[" or "c")
263            and dissimilarities.
264    
265    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/: Added similarity functions.
268    
269            * data/: Added english stopwords.
270    
271    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * data/: Examples compiled for new features
274    
275            * R/: Changes due to new structure.
276    
277            * NAMESPACE: Corrected namespace to reflect new structure.
278    
279            * R/termdocmatrix.R: Adapted for new naming scheme.
280    
281    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
282    
283            * R/textdoccol.R: Adapted code for new class structure. Wrote
284            several transform and filter functions operating on text document
285            collections (alias text document databases).
286    
287            * R/aobjects.R: Adapted class structure with inheritance,
288            repositories and additional meta data. Loading files on demand is
289            now possible.
290    
291    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
292    
293            * R/: Some cosmetic cleanups.
294    
295            * inst/: Removed vignette on clustering. That and much more is now
296            described in the JSS paper on text mining. Based upon that
297            article an elaborated vignette will be incorporated in the future.
298    
299    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * R/: Updated generic S4 methods to comply with signature changes
302            in newer versions of R (> 2.3)
303    
304    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * ext/R/importRIS.R: Automatic RIS import is now possible.
307    
308    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
309    
310            * R/textdoccol.R: Added RIS HTML input format.
311    
312    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
313    
314            * R/textdoccol.R: Removed bug that caused invalid text document
315            collections when handling many input files.
316    
317    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/textdoccol.R: Restructured and extended file import
320            mechanism.
321    
322            * inst/doc/clustering.Rnw: Adapted vignette for use with
323            ReutNews.rda
324    
325            * man/ReutNews.Rd: Documentation for ReutNews.rda
326    
327            * data/ReutNews.rda: A tiny Reuters21578 example data set.
328    
329    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
330    
331            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
332            clustering facilities of this package.
333    
334    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * R/aobjects.R: Changed package document structure to avoid class
337            dependency problems.
338    
339    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
340    
341            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
342            data set.
343    
344            * Finished documentation and reordered directory structure. Now "R
345            CMD check textmin" works without errors.
346    
347    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * src/: Various splits can now be easily created for the
350            Reuters21578 data set.
351    
352    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * Updated documentation
355    
356    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * Wrote R documentation for some classes and methods.
359    
360    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
363            files. See the questionnaire data/Umfrage.csv for such an example.
364            We are now able to import files in Reuters-21578 XML format.
365    
366            * Changed class interfaces in various files. Weighting of the text
367            matrix is now possible.
368    
369    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
370    
371            * R/textdoccol.R: One can build term-document matrices if
372            nessecary (with buildTDM(...)) and fill the field tdm from a text
373            document collection with it.
374    
375            * R/textmatrix.R: Wrote S4 class for term-document matrices.
376    
377    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
378    
379            * R/textdoccol.R: We now can read in a whole XML file with several
380            news items.
381    
382  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
383    
384          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.723

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge