[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC
trunk/tm/ChangeLog revision 718, Fri Mar 16 12:55:16 2007 UTC
# Line 1  Line 1 
1    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
4    
5            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
6            language code.
7    
8            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
9            into parserControl argument.
10    
11            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
12    
13    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
14    
15            * Work/tmDataSetup.R: The datasets acq and crude can now be
16            created on the fly.
17    
18            * R/stopwords.R: Introduced a function returning the stopwords for
19            a given language (English, German and French at the moment)
20    
21            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
22            otherwise falls back to Snowball package.
23    
24    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
25    
26            * man/dissimilarity-methods.Rd: Make clear that any method offered
27            by "dists" from package "cba" can be used.
28    
29    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
30    
31            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
32            to Kurt's latex suggestion. Removed points and underscores in
33            variable names for consistent naming.
34    
35            * DESCRIPTION: Update to version 0.1-2.
36    
37            * man/TextRepository.Rd: Fixed bug in documentation.
38    
39    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
40    
41            * DESCRIPTION: Update to version 0.1-1.
42    
43    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
44    
45            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
46            wordStem.
47    
48    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
49    
50            * R/: Changes due to Kurt's review.
51    
52    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
53    
54            * R/: Implemented improvements based upon comments by David
55            Meyer.
56    
57    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
58    
59            * inst/doc/: Rewrote vignette.
60    
61            * man/: Improved documentation.
62    
63    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
64    
65            * man/: Updated documentation.
66    
67            * DESCRIPTION: Changed package name to "tm". Updated version to
68            0.1 for first CRAN release.
69    
70            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
71            list archive example.
72    
73            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
74            archive example.
75    
76            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
77            from (several mails per box) mbox format to (single mail per file)
78            eml format.
79    
80    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
81    
82            * data/crude.rda: Rebuilt.
83    
84            * data/acq.rda: Rebuilt.
85    
86            * R/reader.R: Factored out reader and parser methods from
87            textdoccol.R.
88    
89            * R/source.R: Factored out Source methods from aobjects.R and
90            textdoccol.R.
91            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
92            feeds.
93    
94            * R/textdoccol.R (DirSource): Added support for recursive
95            traversal of directories.
96    
97    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
98    
99            * R/textdoccol.R ([[): Loads the document corpus automatically
100            into memory upon access.
101            (tm_transform, tm_filter): Removed several checks whether the
102            document is already loaded ([[ ensures this now).
103            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
104            mailing list archive.
105    
106    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
107    
108            * R/aobjects.R (TextDocument): Is now a virtual class.
109            (Source): Is now a virtual class.
110    
111    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
112    
113            * R/textdoccol.R (c): Support for an arbitrary number of document
114            collections.
115    
116    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
117    
118            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
119            append_meta and remove_meta.
120    
121            * R/textdoccol.R: Removed modify_metadata method.
122    
123            * R/textrepo.R: Removed modify_metadata method.
124    
125            * R/textdoccol.R (remove_meta): Supports removal of document
126            collection metadata and document (= in data frame) metadata.
127    
128    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
129    
130            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
131    
132            * data/crude.rda: Rebuilt.
133    
134            * data/acq.rda: Rebuilt.
135    
136            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
137    
138            * R/textdoccol.R ([): Bug fix for subsetting a document
139            collection's data frame.
140    
141    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
142    
143            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
144            to s_filter.
145    
146            * R/textdoccol.R: Local text documents' metadata can now be copied
147            to a document collection's data frame with prescind_meta.
148    
149    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
150    
151            * R/: Text documents' slot metadata is now accessible in s_filter.
152    
153            * R/: Rewrote s_filter function (still has some restrictions).
154    
155    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
156    
157            * R/: Various fixes in handling metadata.
158    
159            * R/: Added update mechanism for text document collections.
160    
161    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
162    
163            * R/: Merging of document collections now creates a binary tree
164            for reconstructing merged document collections.
165    
166            * R/: Redesign of metadata for document collections.
167    
168    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
169    
170            * R/: Messages now use \code{ngettext}.
171    
172    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * R/: Added functions for modifying and removing metadata.
175    
176    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
177    
178            * man/: Updated some documentation.
179    
180            * R/: Corrected some connection issues.
181    
182            * inst/doc: Worked on the vignette.
183    
184    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
185    
186            * inst/: Added texts and started vignette.
187    
188            * R/: Final changes based upon David's comments.
189    
190    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
191    
192            * NAMESPACE: Corrected exports (generic methods need exportMethods
193            directives!).
194    
195    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
196    
197            * R/: Modified the TextDocCol constructor and various parsers. It
198            is now modular and supports various file formats via plugins (see
199            the new "Source" class).
200    
201    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
202    
203            * man/: Revised documentation after previous code changes.
204    
205    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
206    
207            * R/: Remaining changes as discussed with David.
208    
209    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/: Some changes as suggested by David. The rest will follow
212            within the next days.
213    
214    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * man/: Finished documentation.
217    
218    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
219    
220            * man/: Wrote some documentation.
221    
222    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * R/: Further syntactic sugar in form of additional assignment and
225            accessor methods.
226    
227    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
228    
229            * R/: Syntactic sugar in form of "length", "show" and "summary"
230            operators.
231    
232    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
233    
234            * R/: Diverse updates. Mainly on default operators ("[" or "c")
235            and dissimilarities.
236    
237    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
238    
239            * R/: Added similarity functions.
240    
241            * data/: Added English stopwords.
242    
243    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
244    
245            * data/: Examples compiled for new features
246    
247            * R/: Changes due to new structure.
248    
249            * NAMESPACE: Corrected namespace to reflect new structure.
250    
251            * R/termdocmatrix.R: Adapted for new naming scheme.
252    
253    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * R/textdoccol.R: Adapted code for new class structure. Wrote
256            several transform and filter functions operating on text document
257            collections (alias text document databases).
258    
259            * R/aobjects.R: Adapted class structure with inheritance,
260            repositories and additional meta data. Loading files on demand is
261            now possible.
262    
263    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
264    
265            * R/: Some cosmetic cleanups.
266    
267            * inst/: Removed vignette on clustering. That and much more is now
268            described in the JSS paper on text mining. Based upon that
269            article an elaborated vignette will be incorporated in the future.
270    
271    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/: Updated generic S4 methods to comply with signature changes
274            in newer versions of R (> 2.3)
275    
276    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
277    
278            * ext/R/importRIS.R: Automatic RIS import is now possible.
279    
280    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
281    
282            * R/textdoccol.R: Added RIS HTML input format.
283    
284    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/textdoccol.R: Removed bug that caused invalid text document
287            collections when handling many input files.
288    
289    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/textdoccol.R: Restructured and extended file import
292            mechanism.
293    
294            * inst/doc/clustering.Rnw: Adapted vignette for use with
295            ReutNews.rda
296    
297            * man/ReutNews.Rd: Documentation for ReutNews.rda
298    
299            * data/ReutNews.rda: A tiny Reuters21578 example data set.
300    
301    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
302    
303            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
304            clustering facilities of this package.
305    
306    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
307    
308            * R/aobjects.R: Changed package document structure to avoid class
309            dependency problems.
310    
311    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
312    
313            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
314            data set.
315    
316            * Finished documentation and reordered directory structure. Now "R
317            CMD check textmin" works without errors.
318    
319    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
320    
321            * src/: Various splits can now be easily created for the
322            Reuters21578 data set.
323    
324    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
325    
326            * Updated documentation
327    
328    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
329    
330            * Wrote R documentation for some classes and methods.
331    
332    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
333    
334            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
335            files. See the questionnaire data/Umfrage.csv for such an example.
336            We are now able to import files in Reuters-21578 XML format.
337    
338            * Changed class interfaces in various files. Weighting of the text
339            matrix is now possible.
340    
341    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * R/textdoccol.R: One can build term-document matrices if
344            necessary (with buildTDM(...)) and fill the field tdm from a text
345            document collection with it.
346    
347            * R/textmatrix.R: Wrote S4 class for term-document matrices.
348    
349    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
350    
351            * R/textdoccol.R: We now can read in a whole XML file with several
352            news items.
353    
354  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
355    
356          * R/textdoccol.R: Set up an S4 class for a collection of text
