Diff of /trunk/tm/ChangeLog


From: trunk/R/trunk/ChangeLog revision 27, Sun Dec 4 15:30:18 2005 UTC
To:   trunk/tm/ChangeLog revision 722, Sun Apr 1 15:53:58 2007 UTC
1    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
4            class anymore.
5    
6            * R/plaintextdoc.R: Custom show method for plain text documents.
7    
8            * R/aobjects.R: Added a class for structured text documents.
9    
10            * R/reader.R: Replaced remaining \code{parser} occurrences with
11            \code{reader}.
12    
13            * R/textdoccol.R (summary): Indent tags.
14    
15            * R/textdocco.R (removePunctuation): Transform method to remove
16            punctuation marks.
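To illustrate what a punctuation-removing transform does, here is a minimal sketch in Python. It is not the package's R implementation: the name `remove_punctuation` is hypothetical, and restricting the character set to ASCII punctuation is an assumption.

```python
import string

# Translation table that deletes ASCII punctuation (assumption: ASCII only).
_PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def remove_punctuation(text: str) -> str:
    """Return text with every ASCII punctuation character deleted."""
    return text.translate(_PUNCT_TABLE)
```

Note that deleting characters outright joins hyphenated words; a transform that replaces punctuation with a space instead would be an equally reasonable design.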
17    
18    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
19    
20            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
21            using prescindMeta().
22    
23    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
24    
25            * R/textdoccol.R: Improved database support.
26    
27    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
28    
29            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
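The idea behind a sparse term-document matrix can be sketched as follows in Python. This is only an illustration, not the data structure used by TermDocMatrix: the function names, whitespace tokenisation and lower-casing are all assumptions.

```python
from collections import Counter


def term_doc_matrix(docs):
    """Map each document index to a Counter of its non-zero term counts."""
    # Only terms that actually occur are stored; absent terms are implicit zeros.
    return {i: Counter(doc.lower().split()) for i, doc in enumerate(docs)}


def count(matrix, doc_index, term):
    """Look up one cell; a Counter reads as 0 for terms it does not hold."""
    return matrix[doc_index][term]
```

Storing only the non-zero cells keeps memory proportional to the number of distinct terms per document rather than to the full vocabulary.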
30    
31            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
32            language code.
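A minimal Python sketch of extracting the language part from such a code. The accepted input shapes (for example "en_US" or "de-AT") and the name `resolve_iso_language` are assumptions; the ChangeLog does not say which formats resolveISOcode handles.

```python
import re


def resolve_iso_language(code: str) -> str:
    """Return the language part of a code such as 'en_US' or 'de-AT'."""
    # Split at the first underscore or hyphen; the first piece is the language.
    language = re.split(r"[_-]", code.strip(), maxsplit=1)[0]
    if not language.isalpha():
        raise ValueError(f"not a language code: {code!r}")
    return language.lower()
```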
33    
34            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
35            into parserControl argument.
36    
37            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
38    
39    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
40    
41            * Work/tmDataSetup.R: The datasets acq and crude can now be
42            created on the fly.
43    
44            * R/stopwords.R: Introduced a function returning the stopwords for
45            a given language (English, German and French at the moment)
46    
47            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
48            otherwise falls back to Snowball package.
49    
50    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
51    
52            * man/dissimilarity-methods.Rd: Make clear that any method offered
53            by "dists" from package "cba" can be used.
54    
55    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
56    
57            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
58            to Kurt's latex suggestion. Removed points and underscores in
59            variable names for consistent naming.
60    
61            * DESCRIPTION: Update to version 0.1-2.
62    
63            * man/TextRepository.Rd: Fixed bug in documentation.
64    
65    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
66    
67            * DESCRIPTION: Update to version 0.1-1.
68    
69    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
70    
71            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
72            wordStem.
73    
74    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
75    
76            * R/: Changes due to Kurt's review.
77    
78    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
79    
80            * R/: Implemented improvements based upon comments by David
81            Meyer.
82    
83    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
84    
85            * inst/doc/: Rewrote vignette.
86    
87            * man/: Improved documentation.
88    
89    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
90    
91            * man/: Updated documentation.
92    
93            * DESCRIPTION: Changed package name to "tm". Updated version to
94            0.1 for first CRAN release.
95    
96            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
97            list archive example.
98    
99            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
100            archive example.
101    
102            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
103            from (several mails per box) mbox format to (single mail per file)
104            eml format.
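The mbox-to-eml conversion described above can be sketched with Python's standard `mailbox` module. This is an illustration of the technique, not the code in R/preprocess.R; the function name and the numbered output file names are assumptions.

```python
import mailbox
from pathlib import Path


def convert_mbox_to_eml(mbox_path, out_dir):
    """Write each message in an mbox file to its own numbered .eml file."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    # create=False: fail instead of silently creating a missing mailbox.
    box = mailbox.mbox(str(mbox_path), create=False)
    written = []
    try:
        for i, message in enumerate(box, start=1):
            target = out / f"{i}.eml"
            # as_bytes() serialises headers and body without the mbox "From " line.
            target.write_bytes(message.as_bytes())
            written.append(target)
    finally:
        box.close()
    return written
```

The function returns the list of files it wrote, in mailbox order.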
105    
106    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
107    
108            * data/crude.rda: Rebuilt.
109    
110            * data/acq.rda: Rebuilt.
111    
112            * R/reader.R: Factored out reader and parser methods from
113            textdoccol.R.
114    
115            * R/source.R: Factored out Source methods from aobjects.R and
116            textdoccol.R.
117            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
118            feeds.
119    
120            * R/textdoccol.R (DirSource): Added support for recursive
121            traversal of directories.
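Recursive directory traversal for a directory-backed source can be sketched in Python as below. The function name `list_files` and its `recursive` flag are hypothetical and do not reflect DirSource's actual arguments.

```python
from pathlib import Path


def list_files(directory, recursive=False):
    """Return sorted file paths under directory, optionally descending into subdirectories."""
    root = Path(directory)
    # rglob("*") walks the whole tree; glob("*") only lists the top level.
    candidates = root.rglob("*") if recursive else root.glob("*")
    return sorted(p for p in candidates if p.is_file())
```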
122    
123    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
124    
125            * R/textdoccol.R ([[): Loads the document corpus automatically
126            into memory upon access.
127            (tm_transform, tm_filter): Removed several checks whether the
128            document is already loaded ([[ ensures this now).
129            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
130            mailing list archive.
131    
132    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
133    
134            * R/aobjects.R (TextDocument): Is now a virtual class.
135            (Source): Is now a virtual class.
136    
137    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
138    
139            * R/textdoccol.R (c): Support for an arbitrary number of document
140            collections.
141    
142    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
143    
144            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
145            append_meta and remove_meta.
146    
147            * R/textdoccol.R: Removed modify_metadata method.
148    
149            * R/textrepo.R: Removed modify_metadata method.
150    
151            * R/textdoccol.R (remove_meta): Supports removal of document
152            collection metadata and document (= in data frame) metadata.
153    
154    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
155    
156            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
157    
158            * data/crude.rda: Rebuilt.
159    
160            * data/acq.rda: Rebuilt.
161    
162            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
163    
164            * R/textdoccol.R ([): Bug fix for subsetting a document
165            collection's data frame.
166    
167    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
168    
169            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
170            to s_filter.
171    
172            * R/textdoccol.R: Local text documents' metadata can now be copied
173            to a document collection's data frame with prescind_meta.
174    
175    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
176    
177            * R/: Text documents' slot metadata is now accessible in s_filter.
178    
179            * R/: Rewrote s_filter function (still has some restrictions).
180    
181    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * R/: Various fixes in handling metadata.
184    
185            * R/: Added update mechanism for text document collections.
186    
187    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
188    
189            * R/: Merging of document collections now creates a binary tree
190            for reconstructing merged document collections.
191    
192            * R/: Redesign of metadata for document collections.
193    
194    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
195    
196            * R/: Messages now use \code{ngettext}.
197    
198    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
199    
200            * R/: Added functions for modifying and removing metadata.
201    
202    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
203    
204            * man/: Updated some documentation.
205    
206            * R/: Corrected some connection issues.
207    
208            * inst/doc: Worked on the vignette.
209    
210    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * inst/: Added texts and started vignette.
213    
214            * R/: Final changes based upon David's comments.
215    
216    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
217    
218            * NAMESPACE: Corrected exports (generic methods need exportMethods
219            directives!).
220    
221    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
222    
223            * R/: Modified the TextDocCol constructor and various parsers. It
224            is now modular and supports various file formats via plugins (see
225            the new "Source" class).
226    
227    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
228    
229            * man/: Revised documentation after previous code changes.
230    
231    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
232    
233            * R/: Remaining changes as discussed with David.
234    
235    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
236    
237            * R/: Some changes as suggested by David. The rest will follow
238            within the next few days.
239    
240    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
241    
242            * man/: Finished documentation.
243    
244    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * man/: Wrote some documentation.
247    
248    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
249    
250            * R/: Further syntactic sugar in form of additional assignment and
251            accessor methods.
252    
253    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * R/: Syntactic sugar in form of "length", "show" and "summary"
256            operators.
257    
258    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
259    
260            * R/: Various updates, mainly to default operators ("[" or "c")
261            and dissimilarities.
262    
263    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
264    
265            * R/: Added similarity functions.
266    
267            * data/: Added English stopwords.
268    
269    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
270    
271            * data/: Examples compiled for new features
272    
273            * R/: Changes due to new structure.
274    
275            * NAMESPACE: Corrected namespace to reflect new structure.
276    
277            * R/termdocmatrix.R: Adapted for new naming scheme.
278    
279    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
280    
281            * R/textdoccol.R: Adapted code for new class structure. Wrote
282            several transform and filter functions operating on text document
283            collections (alias text document databases).
284    
285            * R/aobjects.R: Adapted class structure with inheritance,
286            repositories and additional meta data. Loading files on demand is
287            now possible.
288    
289    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/: Some cosmetic cleanups.
292    
293            * inst/: Removed vignette on clustering. That and much more is now
294            described in the JSS paper on text mining. Based upon that
295            article an elaborated vignette will be incorporated in the future.
296    
297    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
298    
299            * R/: Updated generic S4 methods to comply with signature changes
300            in newer versions of R (> 2.3)
301    
302    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * ext/R/importRIS.R: Automatic RIS import is now possible.
305    
306    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
307    
308            * R/textdoccol.R: Added RIS HTML input format.
309    
310    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
311    
312            * R/textdoccol.R: Removed bug that caused invalid text document
313            collections when handling many input files.
314    
315    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
316    
317            * R/textdoccol.R: Restructured and extended file import
318            mechanism.
319    
320            * inst/doc/clustering.Rnw: Adapted vignette for use with
321            ReutNews.rda
322    
323            * man/ReutNews.Rd: Documentation for ReutNews.rda
324    
325            * data/ReutNews.rda: A tiny Reuters21578 example data set.
326    
327    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
328    
329            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
330            clustering facilities of this package.
331    
332    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
333    
334            * R/aobjects.R: Changed package document structure to avoid class
335            dependency problems.
336    
337    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
340            data set.
341    
342            * Finished documentation and reordered directory structure. Now "R
343            CMD check textmin" works without errors.
344    
345  2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
346    
347          * src/: Various splits can now be easily created for the
