SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC trunk/tm/ChangeLog revision 810, Mon Jan 21 17:14:06 2008 UTC
# Line 1  Line 1 
1    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
2    
3            * R/: Better support for encodings.
4    
5    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
6    
7            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
8            selection when no reader argument is given.
9    
10    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
11    
12            * R/source.R (CSVSource): Now uses read.csv instead of scan
13            internally.
14    
15    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
16    
17            * R/reader.R (getReaders): Returns available reader functions.
18    
19            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
20            as default.
21    
22    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
23    
24            * R/stopwords.R (stopwords): Shortened code, removed codetools
25            variable warnings.
26    
27            * man/: Documentation for showMeta, added an example for tmMap.
28    
29            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
30            some minor typos fixed.
31    
32    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
33    
34            * R/aobjects.R (showMeta): Added method for pretty printing a
35            text document's meta data.
36    
37    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
38    
39            * R/textdoccol.R (TextDocCol): Better handling of empty
40            arguments.
41    
42            * NAMESPACE: Exported readDOC.
43    
44            * man/completeStems.Rd: Added an example.
45    
46    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
47    
48            * R/stopwords.R (stopwords): Look up .dat files at every
49            call. Allows users to modify stopword .dat files interactively.
50    
51    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
52    
53            * R/termdocmatrix.R (termFreq): Correct processing of empty
54            documents.
55    
56    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
57    
58            * man/: Updated documentation.
59    
60    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
61    
62            * R/complete.R (completeStems): Completes (heuristically) word
63            stems.
64    
65            * R/termdocmatrix.R (TermDocMatrix2): New modular
66            constructor.
67    
68            * NAMESPACE: Exported termFreq.
69    
70    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
71    
72            * R/reader.R (readDOC): Added MS Word reader (using antiword).
73    
74    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
75    
76            * R/weight.R: Weighting functions for TermDocMatrix.
77    
78    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
79    
80            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
81            functions for accessing dimension, column, and row names.
82    
83            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
84    
85    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
86    
87            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
88    
89    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
90    
91            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
92    
93    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
94    
95            * R/reader.R (readPDF): Removed manual checks for pdftotext and
96            pdfinfo. The system call gives a warning anyway.
97    
98    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
99    
100            * R/textdoccol.R (asPlain): Conversion from
101            StructuredTextDocuments to PlainTextDocuments.
102    
103    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
104    
105            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
106            for accessing term-document matrices.
107    
108            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
109            are installed.
110    
111    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
112    
113            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
114            Christian Buchta.
115    
116    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
117    
118            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
119    
120    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
121    
122            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
123    
124            * R/reader.R (readPDF): Added PDF reader.
125    
126    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
127    
128            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
129    
130            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
131    
132            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
133    
134            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
135    
136    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
137    
138            * R/distmeasure.R (dissimilarity): Replaced dists call from
139            package cba by new dist call from package proxy.
140    
141    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
142    
143            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
144    
145    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
146    
147            * R/termdocmatrix.R: require() uses the quietly option to suppress
148            loading messages.
149    
150    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
151    
152            * R/dictionary.R: Added dictionary support.
153    
154    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
155    
156            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
157            documents. This simplifies some functions, e.g., asPlain.
158    
159    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
160    
161            * inst/doc/tm.Rnw: Fixed some typos in vignette.
162    
163    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
164    
165            * R/textdoccol.R (replaceWords): Added method to replace a set of
166            words by a single word. Useful for synonyms.
167    
168    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
169    
170            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
171    
172    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
175            vectors. Thanks to Ariel Maguyon for his error report.
176            (removeSparseTerms): New function to remove columns from a
177            term-document matrix exceeding a sparse factor.
178    
179    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
182    
183    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
184    
185            * man/sFilter.Rd: Corrected documentation on statement format (use
186            '==' instead of '=').
187    
188    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
189    
190            * R/aobjects.R (StructuredTextDocument): Inherits from
191            TextDocument.
192    
193    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
194    
195            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
196            on sparse matrices as proposed by Martin Maechler.
197    
198    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
199    
200            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
201            \pkg{filehash} version makes them deprecated.
202    
203    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
204    
205            * R/termdocmatrix.R (textvector): Stemming is now performed before
206            erasing stopwords.
207            (weightMatrix): Adapted to handle sparse matrices.
208            (TermDocMatrix): Sparse matrix is now efficiently built by
209            direct stepwise insertion of row values into it.
210    
211    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
212    
213            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
214            due to ongoing problems. For our purposes the latter is as useful
215            as the replaced package.
216    
217    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
218    
219            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
220    
221            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
222    
223    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
224    
225            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
226            languages with available stopwords.
227    
228    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * inst/doc/tm.Rnw: Minor corrections in the vignette.
231    
232    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
233    
234            * DESCRIPTION: Update to version 0.2, since a lot of new features
235            have been integrated.
236    
237            * inst/stopwords: Updated existing stopwords and added stopwords
238            for various other languages.
239    
240    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
241    
242            * man/: Updated documentation.
243    
244            * Work/testDb.R: Script to test database stuff.
245    
246            * R/: Fixed various database related bugs. Seems to be rather
247            useable now, i.e., consider as alpha status for now.
248    
249    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
250    
251            * R/: Fixed some bugs related to database support.
252    
253    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * man/: Added a lot of examples to the manuals.
256    
257    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * man/: Updated parts of the documentation.
260    
261            * R/textdoccol.R (asPlain): Added conversion from newsgroup
262            documents to plain text documents.
263    
264    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
265    
266            * R/textdoccol.R: Finished experimental database support. Not yet
267            intensively tested.
268    
269            * R/source.R: Now each source has a default reader.
270    
271            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
272            class anymore.
273    
274            * R/plaintextdoc.R: Custom show method for plain text documents.
275    
276            * R/aobjects.R: Added a class for structured text documents.
277    
278            * R/reader.R: Replaced remaining \code{parser} occurrences with
279            \code{reader}.
280    
281            * R/textdoccol.R (summary): Indent tags.
282    
283            * R/textdoccol.R (removePunctuation): Transform method to remove
284            punctuation marks.
285    
286    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
287    
288            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
289            using prescindMeta().
290    
291    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
292    
293            * R/textdoccol.R: Improved database support.
294    
295    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
298    
299            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
300            language code.
301    
302            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
303            into parserControl argument.
304    
305            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
306    
307    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
308    
309            * Work/tmDataSetup.R: The datasets acq and crude can now be
310            created on the fly.
311    
312            * R/stopwords.R: Introduced a function returning the stopwords for
313            a given language (English, German and French at the moment)
314    
315            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
316            otherwise falls back to Snowball package.
317    
318    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
319    
320            * man/dissimilarity-methods.Rd: Make clear that any method offered
321            by "dists" from package "cba" can be used.
322    
323    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
324    
325            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
326            to Kurt's latex suggestion. Removed points and underscores in
327            variable names for consistent naming.
328    
329            * DESCRIPTION: Update to version 0.1-2.
330    
331            * man/TextRepository.Rd: Fixed bug in documentation.
332    
333    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335            * DESCRIPTION: Update to version 0.1-1.
336    
337    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
340            wordStem.
341    
342    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
343    
344            * R/: Changes due to Kurt's review.
345    
346    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
347    
348            * R/: Implemented improvements based upon comments by David
349            Meyer.
350    
351    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
352    
353            * inst/doc/: Rewrote vignette.
354    
355            * man/: Improved documentation.
356    
357    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
358    
359            * man/: Updated documentation.
360    
361            * DESCRIPTION: Changed package name to "tm". Updated version to
362            0.1 for first CRAN release.
363    
364            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
365            list archive example.
366    
367            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
368            archive example.
369    
370            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
371            from (several mails per box) mbox format to (single mail per file)
372            eml format.
373    
374    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
375    
376            * data/crude.rda: Rebuilt.
377    
378            * data/acq.rda: Rebuilt.
379    
380            * R/reader.R: Factored out reader and parser methods from
381            textdoccol.R.
382    
383            * R/source.R: Factored out Source methods from aobjects.R and
384            textdoccol.R.
385            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
386            feeds.
387    
388            * R/textdoccol.R (DirSource): Added support for recursive
389            traversal of directories.
390    
391    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
392    
393            * R/textdoccol.R ([[): Loads the document corpus automatically
394            into memory upon access.
395            (tm_transform, tm_filter): Removed several checks whether the
396            document is already loaded ([[ ensures this now).
397            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
398            mailing list archive.
399    
400    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
401    
402            * R/aobjects.R (TextDocument): Is now a virtual class.
403            (Source): Is now a virtual class.
404    
405    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
406    
407            * R/textdoccol.R (c): Support for an arbitrary number of document
408            collections.
409    
410    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
411    
412            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
413            append_meta and remove_meta.
414    
415            * R/textdoccol.R: Removed modify_metadata method.
416    
417            * R/textrepo.R: Removed modify_metadata method.
418    
419            * R/textdoccol.R (remove_meta): Supports removal of document
420            collection metadata and document (= in data frame) metadata.
421    
422    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
425    
426            * data/crude.rda: Rebuilt.
427    
428            * data/acq.rda: Rebuilt.
429    
430            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
431    
432            * R/textdoccol.R ([): Bug fix for subsetting a document
433            collection's data frame.
434    
435    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
436    
437            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
438            to s_filter.
439    
440            * R/textdoccol.R: Local text documents' metadata can now be copied
441            to a document collection's data frame with prescind_meta.
442    
443    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
444    
445            * R/: Text documents' slot metadata is now accessible in s_filter.
446    
447            * R/: Rewrote s_filter function (has still some restrictions).
448    
449    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
450    
451            * R/: Various fixes in handling metadata.
452    
453            * R/: Added update mechanism for text document collections.
454    
455    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
456    
457            * R/: Merging of document collections now creates a binary tree
458            for reconstructing merged document collections.
459    
460            * R/: Redesign of metadata for document collections.
461    
462    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
463    
464            * R/: Messages now use \code{ngettext}.
465    
466    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
467    
468            * R/: Added functions for modifying and removing metadata.
469    
470    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
471    
472            * man/: Updated some documentation.
473    
474            * R/: Corrected some connection issues.
475    
476            * inst/doc: Worked on the vignette.
477    
478    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
479    
480            * inst/: Added texts and started vignette.
481    
482            * R/: Final changes based upon David's comments.
483    
484    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
485    
486            * NAMESPACE: Corrected exports (generic methods need exportMethods
487            directives!).
488    
489    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
490    
491            * R/: Modified the TextDocCol constructur and various parsers. It
492            is now modular and supports various file formats via plugins (see
493            the new "Source" class).
494    
495    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
496    
497            * man/: Revised documentation after previous code changes.
498    
499    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
500    
501            * R/: Remaining changes as discussed with David.
502    
503    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
504    
505            * R/: Some changes as suggested by David. The rest will follow
506            within the next days.
507    
508    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
509    
510            * man/: Finished documentation.
511    
512    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
513    
514            * man/: Wrote some documentation.
515    
516    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
517    
518            * R/: Further syntactic sugar in form of additional assignment and
519            accessor methods.
520    
521    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
522    
523            * R/: Syntactic sugar in form of "length", "show" and "summary"
524            operators.
525    
526    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
527    
528            * R/: Diverse updates. Mainly on default operators ("[" or "c")
529            and dissimilarities.
530    
531    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
532    
533            * R/: Added similarity functions.
534    
535            * data/: Added english stopwords.
536    
537    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
538    
539            * data/: Examples compiled for new features
540    
541            * R/: Changes due to new structure.
542    
543            * NAMESPACE: Corrected namespace to reflect new structure.
544    
545            * R/termdocmatrix.R: Adapted for new naming scheme.
546    
547    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * R/textdoccol.R: Adapted code for new class structure. Wrote
550            several transform and filter functions operating on text document
551            collections (alias text document databases).
552    
553            * R/aobjects.R: Adapted class structure with inheritance,
554            repositories and additional meta data. Loading files on demand is
555            now possible.
556    
557    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
558    
559            * R/: Some cosmetic cleanups.
560    
561            * inst/: Removed vignette on clustering. That and much more is now
562            described in the JSS paper on text mining. Based upon that
563            article an elaborated vignette will be incorporated in the future.
564    
565    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
566    
567            * R/: Updated generic S4 methods to comply with signature changes
568            in newer versions of R (> 2.3)
569    
570    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
571    
572            * ext/R/importRIS.R: Automatic RIS import is now possible.
573    
574    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
575    
576            * R/textdoccol.R: Added RIS HTML input format.
577    
578    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * R/textdoccol.R: Removed bug that caused invalid text document
581            collections when handling many input files.
582    
583    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
584    
585            * R/textdoccol.R: Restructured and extended file import
586            mechanism.
587    
588            * inst/doc/clustering.Rnw: Adapted vignette for use with
589            ReutNews.rda
590    
591            * man/ReutNews.Rd: Documentation for ReutNews.rda
592    
593            * data/ReutNews.rda: A tiny Reuters21578 example data set.
594    
595    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
596    
597            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
598            clustering facilities of this package.
599    
600    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
601    
602            * R/aobjects.R: Changed package document structure to avoid class
603            dependency problems.
604    
605    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
606    
607            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
608            data set.
609    
610            *  Finished documentation and reordered directory structure. Now "R
611            CMD check textmin" works without errors.
612    
613    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
614    
615            * src/: Various splits can now be easily created for the
616            Reuters21578 data set.
617    
618    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
619    
620            *  Updated documentation
621    
622    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
623    
624            *  Wrote R documentation for some classes and methods.
625    
626    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
627    
628            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
629            files. See the questionnaire data/Umfrage.csv for such an example.
630            We are now able to import files in Reuters-21578 XML format.
631    
632            *  Changed class interfaces in various files. Weighting of the text
633            matrix is now possible.
634    
635    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
636    
637            * R/textdoccol.R: One can build term-document matrices if
638            nessecary (with buildTDM(...)) and fill the field tdm from a text
639            document collection with it.
640    
641            * R/textmatrix.R: Wrote S4 class for term-document matrices.
642    
643    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
644    
645            * R/textdoccol.R: We now can read in a whole XML file with several
646            news items.
647    
648  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
649    
650          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.810

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge