SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC pkg/ChangeLog revision 1239, Fri Aug 9 10:11:21 2013 UTC
# Line 1  Line 1 
1    2013-08-09  Ingo Feinerer <feinerer@logic.at>
2    
3            * DESCRIPTION (License): Changed to GPL-3.
4    
5    2013-07-25  Ingo Feinerer <feinerer@logic.at>
6    
7            * R/complete.R (stemCompletion): Report NA instead of error when no
8            completion can be found by the prevalent heuristic. Suggested by Hugh
9            Devlin.
10    
11    2013-07-10  Ingo Feinerer <feinerer@logic.at>
12    
13            * R/reader.R (readPDF): Use tm:::pdfinfo() (which needs the pdfinfo
14            command line tool) instead of tools:::pdf_info().
15    
16    2013-04-11  Ingo Feinerer <feinerer@logic.at>
17    
18            * R/transform.R (removeWords): Use PCRE UCP to use Unicode properties
19            to determine character types.
20    
21    2012-12-14  Ingo Feinerer <feinerer@logic.at>
22    
23            * R/matrix.R (TermDocumentMatrix): Ensure dimnames of type character
24            when generating a simple_triplet_matrix. Reported by Arho Suominen.
25    
26    2012-12-10  Ingo Feinerer <feinerer@logic.at>
27    
28            * man/tm_reduce.Rd: Document right to left folding order. Adapt
29            example as well. Suggested by Mark Rosenstein.
30    
31    2012-12-04  Ingo Feinerer <feinerer@logic.at>
32    
33            * R/filter.R (sFilter): Avoid attach() and simplify.
34    
35    2012-11-02  Ingo Feinerer <feinerer@logic.at>
36    
37            * R/doc.R (.TextDocument): Use casts to ensure data types and to avoid
38            removal of attributes.
39    
40    2012-10-03 Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/weight.R (weightTfIdf, weightSMART): Gracefully handle empty
43            columns and rows (avoids blow-up due to NaN values). Suggested by Jaap
44            Frölich.
45    
46    2012-07-27 Ingo Feinerer  <feinerer@logic.at>
47    
48            * R/transform.R (removeWords): Allow longer stopword lists.
49    
50    2012-01-31  Ingo Feinerer  <feinerer@logic.at>
51    
52            * R/reader.R (readXML): Readers can now set the document language
53            themselves.
54    
55    2012-01-14  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/source.R (XMLSource, getElem.XMLSource): Simplifications as
58            proposed by Milan Bouchet-Valat.
59    
60    2012-01-11  Ingo Feinerer  <feinerer@logic.at>
61    
62            * R/matrix.R (termFreq): Fix processing of user provided
63            stopwords. Reported by Bettina Grün.
64    
65    2011-12-23  Ingo Feinerer  <feinerer@logic.at>
66    
67            * R/matrix.R (termFreq): Fix invalid handling of
68            control$wordLengths[1]. Reported by Steven C. Bagley.
69    
70    2011-12-17  Ingo Feinerer  <feinerer@logic.at>
71    
72            * DESCRIPTION (Version): Prepare for CRAN Christmas release.
73    
74    2011-12-12  Ingo Feinerer  <feinerer@logic.at>
75    
76            * R/utils.R (map_IETF_Snowball): Map empty input to "porter".
77    
78    2011-12-07  Ingo Feinerer  <feinerer@logic.at>
79    
80            * R/transform.R (removePunctuation): Add option to preserve
81            intra-word dashes.
82    
83    2011-12-06  Ingo Feinerer  <feinerer@logic.at>
84    
85            * R/matrix.R (termFreq): Allow reordering of control option
86            processing.
87    
88    2011-11-17  Ingo Feinerer  <feinerer@logic.at>
89    
90            * R/reader.R (readPDF): Use tools:::pdf_info() instead of external
91            pdfinfo tool.
92    
93            * inst/stopwords/SMART.dat: Add SMART information retrieval system
94            stopwords (which are also used by the MC toolkit).
95    
96            * R/matrix (termFreq): Allow local option \code{bounds$local} to
97            restrict how often a term may appear in each document (generalizes
98            \code{minDocFreq}). Similarly the local option \code{wordLenghts}
99            for word length bounds (generalizes \code{minWordLength}).
100    
101            * R/matrix.R (TermDocumentMatrix.VCorpus): New global option
102            \code{bounds$global} for restricting how often a term is allowed
103            to appear in different documents.
104    
105            * R/matrix.R (TermDocumentMatrix.VCorpus): Distinguish between
106            local options delegated internally to termFreq() and global
107            options which are processed by the term-document matrix
108            constructor itself.
109    
110    2011-11-15  Ingo Feinerer  <feinerer@logic.at>
111    
112            * man/getTokenizers.Rd: Document getTokenizers().
113    
114            * man/tokenizer.Rd: Document MC_tokenizer() and scan_tokenizer().
115    
116    2011-11-04  Ingo Feinerer  <feinerer@logic.at>
117    
118            * man/matrix.Rd: Document as.TermDocumentMatrix.term_frequency.
119    
120            * man/combine.Rd: Document c.term_frequency().
121    
122    2011-10-11  Ingo Feinerer  <feinerer@logic.at>
123    
124            * R/meta.R (`meta<-.Corpus`): Assume that the replacement value
125            can be accessed via '[' and not '[['.
126    
127    2011-08-24  Ingo Feinerer  <feinerer@logic.at>
128    
129            * R/stopwords.R (stopwords): Raise an error if no stopwords are
130            available for requested language. Suggested by Derek M Jones.
131    
132    2011-05-27  Ingo Feinerer  <feinerer@logic.at>
133    
134            * R/weight.R (weightSMART): Implement Cosine and pivoted unique
135            normalization.
136    
137    2011-02-17  Ingo Feinerer  <feinerer@logic.at>
138    
139            * R/transform.R (stemDocument.PlainTextDocument): Use language
140            argument.
141    
142    2011-02-04  Ingo Feinerer  <feinerer@logic.at>
143    
144            * R/source.R: Store strings and connections instead of unevaluated
145            calls.
146    
147    2010-11-26  Ingo Feinerer  <feinerer@logic.at>
148    
149            * R/corpus.R (Corpus): Allow init and exit hooks for readers.
150    
151    2010-10-22  Ingo Feinerer  <feinerer@logic.at>
152    
153            * R/matrix.R (.TermDocumentMatrix): Make Weighting an attribute
154            (instead of a list element).
155    
156    2010-10-16  Ingo Feinerer  <feinerer@logic.at>
157    
158            * R/corpus.R (`[[.VCorpus`, `[[.PCorpus'): Access individual
159            documents by names (fallback to IDs if names are not set).
160    
161    2010-08-25  Ingo Feinerer  <feinerer@logic.at>
162    
163            * R/corpus.R (c.Corpus): When concatenating corpora, the argument
164            \code{recursive} now determines whether existing corpus meta data
165            is used.
166    
167    2010-08-06  Ingo Feinerer  <feinerer@logic.at>
168    
169            * R/transform.R: Removed convert_UTF_8(). Use enc2utf8() instead.
170    
171    2010-06-17  Ingo Feinerer  <feinerer@logic.at>
172    
173            * R/matrix.R (TermDocumentMatrix): If a dictionary is given do not
174            remove terms not occurring in the corpus anymore.
175    
176    2010-06-02  Ingo Feinerer  <feinerer@logic.at>
177    
178            * R/plot.R (Zipf_plot, Heaps_plot): Plotting functions for Zipf's
179            and Heaps' law.
180    
181    2010-05-18  Ingo Feinerer  <feinerer@logic.at>
182    
183            * R/corpus.R (Corpus, PCorpus): Use element names as IDs if
184            provided by a source.
185    
186    2010-04-09  Ingo Feinerer  <feinerer@logic.at>
187    
188            * R/source.R (.Source): Provide document names.
189    
190    2010-04-07  Ingo Feinerer  <feinerer@logic.at>
191    
192            * R/meta.R (`content_or_meta`): Utility function.
193    
194    2010-03-19  Ingo Feinerer  <feinerer@logic.at>
195    
196            * R/reader.R (readReut21578XML, readReut21578XMLasPlain): Extract
197            TOPICS, LEWISSPLIT, CGISPLIT, and OLDID meta tags.
198    
199    2010-03-03  Ingo Feinerer  <feinerer@logic.at>
200    
201            * R/weight.R (weightTfIdf): Added normalization option.
202    
203            * man/tm_tag_score.Rd: Add General Inquirer example for sentiment
204            analysis.
205    
206    2010-02-25  Ingo Feinerer  <feinerer@logic.at>
207    
208            * R/score.R (tm_tag_score): Compute a score from the number of
209            tags matching in a document.
210    
211    2010-02-18  Ingo Feinerer  <feinerer@logic.at>
212    
213            * R/complete.R (stemCompletion): New completion heuristics.
214    
215    2010-02-17  Ingo Feinerer  <feinerer@logic.at>
216    
217            * R/plot.R (plot.TermDocumentMatrix): Memory improvements.
218    
219    2010-02-06  Ingo Feinerer  <feinerer@logic.at>
220    
221            * DESCRIPTION (Depends): Depend on R (>= 2.10.0) to ensure that
222            setOldClass(c(..., "list")) works.
223    
224    2010-01-22  Ingo Feinerer  <feinerer@logic.at>
225    
226            * R/transform.R (stemDocument.character): In case input is a
227            simple character just delegate to the default Snowball stemmer.
228    
229    2010-01-15  Ingo Feinerer  <feinerer@logic.at>
230    
231            * R/reader.R (readReut21578XML, readRCV1): Extract more meta
232            data.
233    
234    2010-01-12  Ingo Feinerer  <feinerer@logic.at>
235    
236            * R/doc.R (`Content<-`): Be careful with names attribute.
237    
238    2010-01-07  Stefan Theussl  <stefan.theussl@wu.ac.at>
239    
240            * R/source.R (DirSource): Improved implementation especially when
241            handling many (> 1M) files.
242    
243    2009-12-22  Ingo Feinerer  <feinerer@logic.at>
244    
245            * R/source.R (getElem.URISource): Use encoding argument.
246    
247    2009-12-11  Ingo Feinerer  <feinerer@logic.at>
248    
249            * R/doc.R (setOldClass): Register S3 document classes to be
250            recognized by S4 methods.
251    
252    2009-11-25  Ingo Feinerer  <feinerer@logic.at>
253    
254            * R/matrix.R (termFreq): Add option to remove punctuation
255            characters.
256    
257    2009-11-19  Ingo Feinerer  <feinerer@logic.at>
258    
259            * R/matrix.R (c.TermDocumentMatrix): Added combine method for
260            merging multiple term-document matrices.
261    
262    2009-11-17  Ingo Feinerer  <feinerer@logic.at>
263    
264            * R/corpus.R (setOldClass): Register S3 corpus classes to be
265            recognized by S4 methods.
266    
267            * man/plot.Rd: Use \dontrun{} in \examples{} section in the hope
268            that CRAN Mac OS X builds do not fail any longer.
269    
270    2009-11-15  Ingo Feinerer  <feinerer@logic.at>
271    
272            * R/matrix.R (tokenize): Use scan(..., what = "character") instead
273            of RWeka:AlphabeticTokenizer() as default.
274    
275    2009-11-14  Ingo Feinerer  <feinerer@logic.at>
276    
277            * R/transform.R (removeWords.PlainTextDocument): Fix bug which
278            caused words at the beginning or the end of a line not to be removed. Do
279            not delete whitespace anymore.
280    
281    2009-11-12  Ingo Feinerer  <feinerer@logic.at>
282    
283            * R/source.R (DirSource): Default to working directory if no path
284            is specified.
285    
286    2009-11-11  Ingo Feinerer  <feinerer@logic.at>
287    
288            * R/source.R (DirSource): Stop on empty directories.
289    
290    2009-11-07  Ingo Feinerer  <feinerer@logic.at>
291    
292            * R/matrix.R (TermDocumentMatrix): Avoid prefixes originating from
293            named documents.
294    
295    2009-10-21  Ingo Feinerer  <feinerer@logic.at>
296    
297            * R/transform.R (removeWords): Improve regular expressions.
298    
299    2009-10-19  Ingo Feinerer  <feinerer@logic.at>
300    
301            * R/meta.R (DublinCore): Allow lower case tags.
302    
303    2009-10-09  Ingo Feinerer  <feinerer@logic.at>
304    
305            * R/source.R (GmaneSource, ReutersSource): Use xmlChildren(x)
306            instead of x$children.
307    
308    2009-09-15  Ingo Feinerer  <feinerer@logic.at>
309    
310            * R/preprocess.R (preprocessReut21578XML): Fix generated file names.
311    
312    2009-09-06  Ingo Feinerer  <feinerer@logic.at>
313    
314            * R/: Use S3 instead of S4 class system.
315    
316    2009-08-11  Ingo Feinerer  <feinerer@logic.at>
317    
318            * R/reader.R (readMail): Moved to tm.plugin.mail package.
319    
320    2009-07-04  Ingo Feinerer  <feinerer@logic.at>
321    
322            * R/reader.R (readNewsgroup): Rename to readMail as newsgroup
323            postings are basically e-mails with some extra headers.
324    
325    2009-07-03  Ingo Feinerer  <feinerer@logic.at>
326    
327            * R/transform.R: Move convertMboxEml, removeCitation,
328            removeMultipart, and removeSignature to the tm.plugin.mail package
329            since they are mainly utility functions (for handling e-mails) and
330            not very framework specific.
331    
332    2009-06-28  Ingo Feinerer  <feinerer@logic.at>
333    
334            * man/: Fix documentation.
335    
336    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
337    
338            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
339            plain text document instead of an XML document for texts of the
340            Reuters-21578 dataset.
341    
342            * R/sparse.R: Removed since the slam package is now available on
343            CRAN.
344    
345            * DESCRIPTION (Depends): Add slam package.
346    
347    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
348    
349            * R/transform.R (stemDoc): Fix character(0) handling.
350    
351    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
352    
353            * R/doc.R (show): Pretty print.
354    
355    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
356    
357            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
358            gracefully.
359    
360    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
361    
362            * R/corpus.R: Make corpus virtual. Implement corpus with standard
363            and permanent storage semantics.
364    
365            * DESCRIPTION: New major release. A *lot* of improvements.
366    
367    2009-05-04   Ingo Feinerer <feinerer@logic.at>
368    
369            * NAMESPACE: Export some simple_triplet_matrix functions.
370    
371    2009-04-28   Ingo Feinerer <feinerer@logic.at>
372    
373            * R/weight.R: Adapt tf-idf to new matrix format.
374    
375    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
376    
377            * R/matrix.R: Create two distinct classes for term-document and
378            document-term matrices.
379    
380    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
381    
382            * R/termdocmatrix.R: No longer use Matrix package. This reduces
383            package start-up time significantly.
384    
385    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
386    
387            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
388    
389    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
390    
391            * R/transform.R (tmReduce): Combine multiple maps into one
392            transformation.
393    
394    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
395    
396            * R/weight.R: Remove weightLogical since it does not return a
397            dgCMatrix.
398    
399            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
400            or TermDocumentMatrix instead.
401    
402    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
403    
404            * inst/doc/extensions.Rnw: Finished vignette.
405    
406    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
407    
408            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
409            DocumentTermMatrix representations.
410    
411    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
412    
413            * R/reader.R (readXML): New reader for arbitrary XML files.
414    
415    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
416    
417            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
418            (XMLSource): New XMLSource class for arbitrary XML files.
419            (Source): New slot Vectorized.
420    
421    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
422    
423            * R/reader.R (readTabular): Experimental reader for tabular data
424            structures which can be customized via user-defined mappings.
425    
426            * R/reader.R: Always use UTC time zone.
427    
428            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
429    
430    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
431    
432            * R/reader.R (readDOC): Options can be passed over to antiword.
433    
434            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
435            pdftotext.
436    
437    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
438    
439            * R/source.R (DirSource): Add pattern and ignore.case arguments
440            which are internally passed over to list.files().
441    
442    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
443    
444            * inst/doc/tm.Rnw: Suppress pointless loading message.
445    
446    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
447    
448            * DESCRIPTION: Speed up package loading (via moving packages not
449            strictly necessary for normal operation to Suggests instead of
450            Depends).
451    
452    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
453    
454            * R/reader.R (readNewsgroup): The date format is now configurable.
455    
456    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
457    
458            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
459    
460    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
461    
462            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
463    
464    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
465    
466            * R/source.R (DataframeSource): New source class for data frames.
467    
468            * R/source.R: Fixed non-standard call evaluation.
469    
470    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
471    
472            * R/source.R (URISource): New source class for a single document.
473    
474    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
475    
476            * R/source.R: Refactoring.
477    
478    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
479    
480            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
481            Rmpi installations more gracefully.
482    
483    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
484    
485            * R/source.R (Source): Add Length slot.
486    
487    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
488    
489            * R/AAA.R: Unify duplicated .onLoad function.
490    
491    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
492    
493            * DESCRIPTION (Suggests): Added Rmpi.
494    
495    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
496    
497            * R/source.R (getElem): Fix 'no visible binding' warning.
498    
499            * man/WeightFunction.Rd: Fix signature.
500    
501    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
502    
503            * R/weight.R: Introduce name abbreviations for weighting functions.
504    
505    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
506    
507            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
508    
509            * R/cluster.R: Provide convenience functions for using a MPI
510            cluster.
511    
512            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
513            available.
514    
515            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
516            available.
517    
518    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
519    
520            * R/textdoccol.R (lapply): Removed debug print out.
521    
522    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
523    
524            * R/reader.R (readRCV1): Improved meta data extraction from
525            Reuters Corpus Volume 1 documents.
526    
527    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
528    
529            * R/transform.R: Ensure that all mappings preserve multiline
530            structures.
531    
532    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
533    
534            * R/filter.R: Every filter has now an attribute indicating whether
535            it sould be applied to document level (doclevel).
536    
537            * R/textdoccol.R (tmFilter): Set searchFullText as new default
538            filter.
539    
540    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
541    
542            * R/transform.R (replacePatterns): Replaced removeWords by
543            replacePatterns. Suggested by Christian Buchta.
544    
545            * R/textdoccol.R (inspect): Improved formatting.
546    
547    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * inst/CITATION: Updated JSS article information.
550    
551            * R/textdoccol.R (setAs): Added coerce method from list to
552            corpus.
553    
554            * R/meta.R (meta): Improved meta data handling.
555    
556    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
557    
558            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
559            Christian Buchta.
560    
561            * inst/CITATION: Added template to include JSS article reference.
562    
563    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
564    
565            * R/textdoccol.R (tmMap): Introduced lazy mapping.
566    
567            * R/source.R: Added VectorSource.
568    
569    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
570    
571            * man/: Language codes should be in ISO 639-1 format.
572    
573            * R/textdoccol.R (asPlain): Preserve local meta data.
574    
575    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
576    
577            * R/textdoccol.R (writeCorpus): Function for writing a corpus
578            containing plain text documents to disk.
579    
580    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
581    
582            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
583            always set correctly.
584    
585            * R/textdoccol.R: Set load = TRUE as default for load on demand
586            since in most cases this is the wanted behaviour.
587    
588    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
589    
590            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
591    
592            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
593    
594    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
595    
596            * R/meta.R (meta): New function for consistent access to meta data
597            of document collections, repositories, and texts.
598    
599    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
600    
601            * R/: Better support for encodings.
602    
603    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
604    
605            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
606            selection when no reader argument is given.
607    
608    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
609    
610            * R/source.R (CSVSource): Now uses read.csv instead of scan
611            internally.
612    
613    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
614    
615            * R/reader.R (getReaders): Returns available reader functions.
616    
617            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
618            as default.
619    
620    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
621    
622            * R/stopwords.R (stopwords): Shortened code, removed codetools
623            variable warnings.
624    
625            * man/: Documentation for showMeta, added an example for tmMap.
626    
627            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
628            some minor typos fixed.
629    
630    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
631    
632            * R/aobjects.R (showMeta): Added method for pretty printing a
633            text document's meta data.
634    
635    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
636    
637            * R/textdoccol.R (TextDocCol): Better handling of empty
638            arguments.
639    
640            * NAMESPACE: Exported readDOC.
641    
642            * man/completeStems.Rd: Added an example.
643    
644    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
645    
646            * R/stopwords.R (stopwords): Look up .dat files at every
647            call. Allows users to modify stopword .dat files interactively.
648    
649    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
650    
651            * R/termdocmatrix.R (termFreq): Correct processing of empty
652            documents.
653    
654    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * man/: Updated documentation.
657    
658    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
659    
660            * R/complete.R (completeStems): Completes (heuristically) word
661            stems.
662    
663            * R/termdocmatrix.R (TermDocMatrix2): New modular
664            constructor.
665    
666            * NAMESPACE: Exported termFreq.
667    
668    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
669    
670            * R/reader.R (readDOC): Added MS Word reader (using antiword).
671    
672    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
673    
674            * R/weight.R: Weighting functions for TermDocMatrix.
675    
676    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
677    
678            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
679            functions for accessing dimension, column, and row names.
680    
681            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
682    
683    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
684    
685            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
686    
687    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
688    
689            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
690    
691    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
692    
693            * R/reader.R (readPDF): Removed manual checks for pdftotext and
694            pdfinfo. The system call gives a warning anyway.
695    
696    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
697    
698            * R/textdoccol.R (asPlain): Conversion from
699            StructuredTextDocuments to PlainTextDocuments.
700    
701    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
702    
703            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
704            for accessing term-document matrices.
705    
706            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
707            are installed.
708    
709    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
712            Christian Buchta.
713    
714    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
715    
716            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
717    
718    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
719    
720            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
721    
722            * R/reader.R (readPDF): Added PDF reader.
723    
724    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
725    
726            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
727    
728            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
729    
730            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
731    
732            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
733    
734    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
735    
736            * R/distmeasure.R (dissimilarity): Replaced dists call from
737            package cba by new dist call from package proxy.
738    
739    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
740    
741            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
742    
743    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
744    
745            * R/termdocmatrix.R: require() uses the quietly option to suppress
746            loading messages.
747    
748    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
749    
750            * R/dictionary.R: Added dictionary support.
751    
752    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
753    
754            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
755            documents. This simplifies some functions, e.g., asPlain.
756    
757    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
758    
759            * inst/doc/tm.Rnw: Fixed some typos in vignette.
760    
761    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
762    
763            * R/textdoccol.R (replaceWords): Added method to replace a set of
764            words by a single word. Useful for synonyms.
765    
766    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
767    
768            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
769    
770    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
771    
772            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
773            vectors. Thanks to Ariel Maguyon for his error report.
774            (removeSparseTerms): New function to remove columns from a
775            term-document matrix exceeding a sparse factor.
776    
777    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
778    
779            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
780    
781    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
782    
783            * man/sFilter.Rd: Corrected documentation on statement format (use
784            '==' instead of '=').
785    
786    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
787    
788            * R/aobjects.R (StructuredTextDocument): Inherits from
789            TextDocument.
790    
791    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
792    
793            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
794            on sparse matrices as proposed by Martin Maechler.
795    
796    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
797    
798            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
799            \pkg{filehash} version makes them deprecated.
800    
801    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
802    
803            * R/termdocmatrix.R (textvector): Stemming is now performed before
804            erasing stopwords.
805            (weightMatrix): Adapted to handle sparse matrices.
806            (TermDocMatrix): Sparse matrix is now efficiently built by
807            direct stepwise insertion of row values into it.
808    
809    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
810    
811            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
812            due to ongoing problems. For our purposes the latter is as useful
813            as the replaced package.
814    
815    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
816    
817            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
818    
819            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
820    
821    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
822    
823            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
824            languages with available stopwords.
825    
826    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
827    
828            * inst/doc/tm.Rnw: Minor corrections in the vignette.
829    
830    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
831    
832            * DESCRIPTION: Update to version 0.2, since a lot of new features
833            have been integrated.
834    
835            * inst/stopwords: Updated existing stopwords and added stopwords
836            for various other languages.
837    
838    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
839    
840            * man/: Updated documentation.
841    
842            * Work/testDb.R: Script to test database stuff.
843    
844            * R/: Fixed various database related bugs. Seems to be rather
845            useable now, i.e., consider as alpha status for now.
846    
847    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
848    
849            * R/: Fixed some bugs related to database support.
850    
851    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
852    
853            * man/: Added a lot of examples to the manuals.
854    
855    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
856    
857            * man/: Updated parts of the documentation.
858    
859            * R/textdoccol.R (asPlain): Added conversion from newsgroup
860            documents to plain text documents.
861    
862    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
863    
864            * R/textdoccol.R: Finished experimental database support. Not yet
865            intensively tested.
866    
867            * R/source.R: Now each source has a default reader.
868    
869            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
870            class anymore.
871    
872            * R/plaintextdoc.R: Custom show method for plain text documents.
873    
874            * R/aobjects.R: Added a class for structured text documents.
875    
876            * R/reader.R: Replaced remaining \code{parser} occurrences with
877            \code{reader}.
878    
879            * R/textdoccol.R (summary): Indent tags.
880    
881            * R/textdoccol.R (removePunctuation): Transform method to remove
882            punctuation marks.
883    
884    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
885    
886            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
887            using prescindMeta().
888    
889    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
890    
891            * R/textdoccol.R: Improved database support.
892    
893    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
894    
895            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
896    
897            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
898            language code.
899    
900            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
901            into parserControl argument.
902    
903            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
904    
905    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
906    
907            * Work/tmDataSetup.R: The datasets acq and crude can now be
908            created on the fly.
909    
910            * R/stopwords.R: Introduced a function returning the stopwords for
911            a given language (English, German and French at the moment)
912    
913            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
914            otherwise falls back to Snowball package.
915    
916    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
917    
918            * man/dissimilarity-methods.Rd: Make clear that any method offered
919            by "dists" from package "cba" can be used.
920    
921    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
922    
923            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
924            to Kurt's latex suggestion. Removed points and underscores in
925            variable names for consistent naming.
926    
927            * DESCRIPTION: Update to version 0.1-2.
928    
929            * man/TextRepository.Rd: Fixed bug in documentation.
930    
931    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
932    
933            * DESCRIPTION: Update to version 0.1-1.
934    
935    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
936    
937            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
938            wordStem.
939    
940    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
941    
942            * R/: Changes due to Kurt's review.
943    
944    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
945    
946            * R/: Implemented improvements based upon comments by David
947            Meyer.
948    
949    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
950    
951            * inst/doc/: Rewrote vignette.
952    
953            * man/: Improved documentation.
954    
955    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
956    
957            * man/: Updated documentation.
958    
959            * DESCRIPTION: Changed package name to "tm". Updated version to
960            0.1 for first CRAN release.
961    
962            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
963            list archive example.
964    
965            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
966            archive example.
967    
968            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
969            from (several mails per box) mbox format to (single mail per file)
970            eml format.
971    
972    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
973    
974            * data/crude.rda: Rebuilt.
975    
976            * data/acq.rda: Rebuilt.
977    
978            * R/reader.R: Factored out reader and parser methods from
979            textdoccol.R.
980    
981            * R/source.R: Factored out Source methods from aobjects.R and
982            textdoccol.R.
983            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
984            feeds.
985    
986            * R/textdoccol.R (DirSource): Added support for recursive
987            traversal of directories.
988    
989    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
990    
991            * R/textdoccol.R ([[): Loads the document corpus automatically
992            into memory upon access.
993            (tm_transform, tm_filter): Removed several checks whether the
994            document is already loaded ([[ ensures this now).
995            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
996            mailing list archive.
997    
998    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
999    
1000            * R/aobjects.R (TextDocument): Is now a virtual class.
1001            (Source): Is now a virtual class.
1002    
1003    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1004    
1005            * R/textdoccol.R (c): Support for an arbitrary number of document
1006            collections.
1007    
1008    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1009    
1010            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
1011            append_meta and remove_meta.
1012    
1013            * R/textdoccol.R: Removed modify_metadata method.
1014    
1015            * R/textrepo.R: Removed modify_metadata method.
1016    
1017            * R/textdoccol.R (remove_meta): Supports removal of document
1018            collection metadata and document (= in data frame) metadata.
1019    
1020    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1021    
1022            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
1023    
1024            * data/crude.rda: Rebuilt.
1025    
1026            * data/acq.rda: Rebuilt.
1027    
1028            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
1029    
1030            * R/textdoccol.R ([): Bug fix for subsetting a document
1031            collection's data frame.
1032    
1033    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1034    
1035            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
1036            to s_filter.
1037    
1038            * R/textdoccol.R: Local text documents' metadata can now be copied
1039            to a document collection's data frame with prescind_meta.
1040    
1041    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1042    
1043            * R/: Text documents' slot metadata is now accessible in s_filter.
1044    
1045            * R/: Rewrote s_filter function (has still some restrictions).
1046    
1047    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1048    
1049            * R/: Various fixes in handling metadata.
1050    
1051            * R/: Added update mechanism for text document collections.
1052    
1053    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1054    
1055            * R/: Merging of document collections now creates a binary tree
1056            for reconstructing merged document collections.
1057    
1058            * R/: Redesign of metadata for document collections.
1059    
1060    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1061    
1062            * R/: Messages now use \code{ngettext}.
1063    
1064    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1065    
1066            * R/: Added functions for modifying and removing metadata.
1067    
1068    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1069    
1070            * man/: Updated some documentation.
1071    
1072            * R/: Corrected some connection issues.
1073    
1074            * inst/doc: Worked on the vignette.
1075    
1076    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1077    
1078            * inst/: Added texts and started vignette.
1079    
1080            * R/: Final changes based upon David's comments.
1081    
1082    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1083    
1084            * NAMESPACE: Corrected exports (generic methods need exportMethods
1085            directives!).
1086    
1087    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1088    
1089            * R/: Modified the TextDocCol constructur and various parsers. It
1090            is now modular and supports various file formats via plugins (see
1091            the new "Source" class).
1092    
1093    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1094    
1095            * man/: Revised documentation after previous code changes.
1096    
1097    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1098    
1099            * R/: Remaining changes as discussed with David.
1100    
1101    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1102    
1103            * R/: Some changes as suggested by David. The rest will follow
1104            within the next days.
1105    
1106    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1107    
1108            * man/: Finished documentation.
1109    
1110    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1111    
1112            * man/: Wrote some documentation.
1113    
1114    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1115    
1116            * R/: Further syntactic sugar in form of additional assignment and
1117            accessor methods.
1118    
1119    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1120    
1121            * R/: Syntactic sugar in form of "length", "show" and "summary"
1122            operators.
1123    
1124    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1125    
1126            * R/: Diverse updates. Mainly on default operators ("[" or "c")
1127            and dissimilarities.
1128    
1129    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1130    
1131            * R/: Added similarity functions.
1132    
1133            * data/: Added english stopwords.
1134    
1135    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1136    
1137            * data/: Examples compiled for new features
1138    
1139            * R/: Changes due to new structure.
1140    
1141            * NAMESPACE: Corrected namespace to reflect new structure.
1142    
1143            * R/termdocmatrix.R: Adapted for new naming scheme.
1144    
1145    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1146    
1147            * R/textdoccol.R: Adapted code for new class structure. Wrote
1148            several transform and filter functions operating on text document
1149            collections (alias text document databases).
1150    
1151            * R/aobjects.R: Adapted class structure with inheritance,
1152            repositories and additional meta data. Loading files on demand is
1153            now possible.
1154    
1155    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1156    
1157            * R/: Some cosmetic cleanups.
1158    
1159            * inst/: Removed vignette on clustering. That and much more is now
1160            described in the JSS paper on text mining. Based upon that
1161            article an elaborated vignette will be incorporated in the future.
1162    
1163    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1164    
1165            * R/: Updated generic S4 methods to comply with signature changes
1166            in newer versions of R (> 2.3)
1167    
1168    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1169    
1170            * ext/R/importRIS.R: Automatic RIS import is now possible.
1171    
1172    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1173    
1174            * R/textdoccol.R: Added RIS HTML input format.
1175    
1176    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1177    
1178            * R/textdoccol.R: Removed bug that caused invalid text document
1179            collections when handling many input files.
1180    
1181    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1182    
1183            * R/textdoccol.R: Restructured and extended file import
1184            mechanism.
1185    
1186            * inst/doc/clustering.Rnw: Adapted vignette for use with
1187            ReutNews.rda
1188    
1189            * man/ReutNews.Rd: Documentation for ReutNews.rda
1190    
1191            * data/ReutNews.rda: A tiny Reuters21578 example data set.
1192    
1193    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1194    
1195            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
1196            clustering facilities of this package.
1197    
1198    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1199    
1200            * R/aobjects.R: Changed package document structure to avoid class
1201            dependency problems.
1202    
1203    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1204    
1205            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
1206            data set.
1207    
1208            *  Finished documentation and reordered directory structure. Now "R
1209            CMD check textmin" works without errors.
1210    
1211    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1212    
1213            * src/: Various splits can now be easily created for the
1214            Reuters21578 data set.
1215    
1216    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1217    
1218            *  Updated documentation
1219    
1220    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1221    
1222            *  Wrote R documentation for some classes and methods.
1223    
1224    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1225    
1226            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
1227            files. See the questionnaire data/Umfrage.csv for such an example.
1228            We are now able to import files in Reuters-21578 XML format.
1229    
1230            *  Changed class interfaces in various files. Weighting of the text
1231            matrix is now possible.
1232    
1233    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1234    
1235            * R/textdoccol.R: One can build term-document matrices if
1236            nessecary (with buildTDM(...)) and fill the field tdm from a text
1237            document collection with it.
1238    
1239            * R/textmatrix.R: Wrote S4 class for term-document matrices.
1240    
1241    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1242    
1243            * R/textdoccol.R: We now can read in a whole XML file with several
1244            news items.
1245    
1246  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1247    
1248          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.1239

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge