SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC pkg/ChangeLog revision 1231, Wed Jul 10 06:51:26 2013 UTC
# Line 1  Line 1 
1    2013-07-10  Ingo Feinerer <feinerer@logic.at>
2    
3            * R/reader.R (readPDF): Use tm:::pdfinfo() (which needs the pdfinfo
4            command line tool) instead of tools:::pdf_info().
5    
6    2013-04-11  Ingo Feinerer <feinerer@logic.at>
7    
8            * R/transform.R (removeWords): Use PCRE UCP to use Unicode properties
9            to determine character types.
10    
11    2012-12-14  Ingo Feinerer <feinerer@logic.at>
12    
13            * R/matrix.R (TermDocumentMatrix): Ensure dimnames of type character
14            when generating a simple_triplet_matrix. Reported by Arho Suominen.
15    
16    2012-12-10  Ingo Feinerer <feinerer@logic.at>
17    
18            * man/tm_reduce.Rd: Document right to left folding order. Adapt
19            example as well. Suggested by Mark Rosenstein.
20    
21    2012-12-04  Ingo Feinerer <feinerer@logic.at>
22    
23            * R/filter.R (sFilter): Avoid attach() and simplify.
24    
25    2012-11-02  Ingo Feinerer <feinerer@logic.at>
26    
27            * R/doc.R (.TextDocument): Use casts to ensure data types and to avoid
28            removal of attributes.
29    
30    2012-10-03 Ingo Feinerer  <feinerer@logic.at>
31    
32            * R/weight.R (weightTfIdf, weightSMART): Gracefully handle empty
33            columns and rows (avoids blow-up due to NaN values). Suggested by Jaap
34            Frölich.
35    
36    2012-07-27 Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/transform.R (removeWords): Allow longer stopword lists.
39    
40    2012-01-31  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/reader.R (readXML): Readers can now set the document language
43            themselves.
44    
45    2012-01-14  Ingo Feinerer  <feinerer@logic.at>
46    
47            * R/source.R (XMLSource, getElem.XMLSource): Simplifications as
48            proposed by Milan Bouchet-Valat.
49    
50    2012-01-11  Ingo Feinerer  <feinerer@logic.at>
51    
52            * R/matrix.R (termFreq): Fix processing of user provided
53            stopwords. Reported by Bettina Grün.
54    
55    2011-12-23  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/matrix.R (termFreq): Fix invalid handling of
58            control$wordLengths[1]. Reported by Steven C. Bagley.
59    
60    2011-12-17  Ingo Feinerer  <feinerer@logic.at>
61    
62            * DESCRIPTION (Version): Prepare for CRAN Christmas release.
63    
64    2011-12-12  Ingo Feinerer  <feinerer@logic.at>
65    
66            * R/utils.R (map_IETF_Snowball): Map empty input to "porter".
67    
68    2011-12-07  Ingo Feinerer  <feinerer@logic.at>
69    
70            * R/transform.R (removePunctuation): Add option to preserve
71            intra-word dashes.
72    
73    2011-12-06  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/matrix.R (termFreq): Allow reordering of control option
76            processing.
77    
78    2011-11-17  Ingo Feinerer  <feinerer@logic.at>
79    
80            * R/reader.R (readPDF): Use tools:::pdf_info() instead of external
81            pdfinfo tool.
82    
83            * inst/stopwords/SMART.dat: Add SMART information retrieval system
84            stopwords (which are also used by the MC toolkit).
85    
86            * R/matrix (termFreq): Allow local option \code{bounds$local} to
87            restrict how often a term may appear in each document (generalizes
88            \code{minDocFreq}). Similarly the local option \code{wordLenghts}
89            for word length bounds (generalizes \code{minWordLength}).
90    
91            * R/matrix.R (TermDocumentMatrix.VCorpus): New global option
92            \code{bounds$global} for restricting how often a term is allowed
93            to appear in different documents.
94    
95            * R/matrix.R (TermDocumentMatrix.VCorpus): Distinguish between
96            local options delegated internally to termFreq() and global
97            options which are processed by the term-document matrix
98            constructor itself.
99    
100    2011-11-15  Ingo Feinerer  <feinerer@logic.at>
101    
102            * man/getTokenizers.Rd: Document getTokenizers().
103    
104            * man/tokenizer.Rd: Document MC_tokenizer() and scan_tokenizer().
105    
106    2011-11-04  Ingo Feinerer  <feinerer@logic.at>
107    
108            * man/matrix.Rd: Document as.TermDocumentMatrix.term_frequency.
109    
110            * man/combine.Rd: Document c.term_frequency().
111    
112    2011-10-11  Ingo Feinerer  <feinerer@logic.at>
113    
114            * R/meta.R (`meta<-.Corpus`): Assume that the replacement value
115            can be accessed via '[' and not '[['.
116    
117    2011-08-24  Ingo Feinerer  <feinerer@logic.at>
118    
119            * R/stopwords.R (stopwords): Raise an error if no stopwords are
120            available for requested language. Suggested by Derek M Jones.
121    
122    2011-05-27  Ingo Feinerer  <feinerer@logic.at>
123    
124            * R/weight.R (weightSMART): Implement Cosine and pivoted unique
125            normalization.
126    
127    2011-02-17  Ingo Feinerer  <feinerer@logic.at>
128    
129            * R/transform.R (stemDocument.PlainTextDocument): Use language
130            argument.
131    
132    2011-02-04  Ingo Feinerer  <feinerer@logic.at>
133    
134            * R/source.R: Store strings and connections instead of unevaluated
135            calls.
136    
137    2010-11-26  Ingo Feinerer  <feinerer@logic.at>
138    
139            * R/corpus.R (Corpus): Allow init and exit hooks for readers.
140    
141    2010-10-22  Ingo Feinerer  <feinerer@logic.at>
142    
143            * R/matrix.R (.TermDocumentMatrix): Make Weighting an attribute
144            (instead of a list element).
145    
146    2010-10-16  Ingo Feinerer  <feinerer@logic.at>
147    
148            * R/corpus.R (`[[.VCorpus`, `[[.PCorpus'): Access individual
149            documents by names (fallback to IDs if names are not set).
150    
151    2010-08-25  Ingo Feinerer  <feinerer@logic.at>
152    
153            * R/corpus.R (c.Corpus): When concatenating corpora, the argument
154            \code{recursive} now determines whether existing corpus meta data
155            is used.
156    
157    2010-08-06  Ingo Feinerer  <feinerer@logic.at>
158    
159            * R/transform.R: Removed convert_UTF_8(). Use enc2utf8() instead.
160    
161    2010-06-17  Ingo Feinerer  <feinerer@logic.at>
162    
163            * R/matrix.R (TermDocumentMatrix): If a dictionary is given do not
164            remove terms not occurring in the corpus anymore.
165    
166    2010-06-02  Ingo Feinerer  <feinerer@logic.at>
167    
168            * R/plot.R (Zipf_plot, Heaps_plot): Plotting functions for Zipf's
169            and Heaps' law.
170    
171    2010-05-18  Ingo Feinerer  <feinerer@logic.at>
172    
173            * R/corpus.R (Corpus, PCorpus): Use element names as IDs if
174            provided by a source.
175    
176    2010-04-09  Ingo Feinerer  <feinerer@logic.at>
177    
178            * R/source.R (.Source): Provide document names.
179    
180    2010-04-07  Ingo Feinerer  <feinerer@logic.at>
181    
182            * R/meta.R (`content_or_meta`): Utility function.
183    
184    2010-03-19  Ingo Feinerer  <feinerer@logic.at>
185    
186            * R/reader.R (readReut21578XML, readReut21578XMLasPlain): Extract
187            TOPICS, LEWISSPLIT, CGISPLIT, and OLDID meta tags.
188    
189    2010-03-03  Ingo Feinerer  <feinerer@logic.at>
190    
191            * R/weight.R (weightTfIdf): Added normalization option.
192    
193            * man/tm_tag_score.Rd: Add General Inquirer example for sentiment
194            analysis.
195    
196    2010-02-25  Ingo Feinerer  <feinerer@logic.at>
197    
198            * R/score.R (tm_tag_score): Compute a score from the number of
199            tags matching in a document.
200    
201    2010-02-18  Ingo Feinerer  <feinerer@logic.at>
202    
203            * R/complete.R (stemCompletion): New completion heuristics.
204    
205    2010-02-17  Ingo Feinerer  <feinerer@logic.at>
206    
207            * R/plot.R (plot.TermDocumentMatrix): Memory improvements.
208    
209    2010-02-06  Ingo Feinerer  <feinerer@logic.at>
210    
211            * DESCRIPTION (Depends): Depend on R (>= 2.10.0) to ensure that
212            setOldClass(c(..., "list")) works.
213    
214    2010-01-22  Ingo Feinerer  <feinerer@logic.at>
215    
216            * R/transform.R (stemDocument.character): In case input is a
217            simple character just delegate to the default Snowball stemmer.
218    
219    2010-01-15  Ingo Feinerer  <feinerer@logic.at>
220    
221            * R/reader.R (readReut21578XML, readRCV1): Extract more meta
222            data.
223    
224    2010-01-12  Ingo Feinerer  <feinerer@logic.at>
225    
226            * R/doc.R (`Content<-`): Be careful with names attribute.
227    
228    2010-01-07  Stefan Theussl  <stefan.theussl@wu.ac.at>
229    
230            * R/source.R (DirSource): Improved implementation especially when
231            handling many (> 1M) files.
232    
233    2009-12-22  Ingo Feinerer  <feinerer@logic.at>
234    
235            * R/source.R (getElem.URISource): Use encoding argument.
236    
237    2009-12-11  Ingo Feinerer  <feinerer@logic.at>
238    
239            * R/doc.R (setOldClass): Register S3 document classes to be
240            recognized by S4 methods.
241    
242    2009-11-25  Ingo Feinerer  <feinerer@logic.at>
243    
244            * R/matrix.R (termFreq): Add option to remove punctuation
245            characters.
246    
247    2009-11-19  Ingo Feinerer  <feinerer@logic.at>
248    
249            * R/matrix.R (c.TermDocumentMatrix): Added combine method for
250            merging multiple term-document matrices.
251    
252    2009-11-17  Ingo Feinerer  <feinerer@logic.at>
253    
254            * R/corpus.R (setOldClass): Register S3 corpus classes to be
255            recognized by S4 methods.
256    
257            * man/plot.Rd: Use \dontrun{} in \examples{} section in the hope
258            that CRAN Mac OS X builds do not fail any longer.
259    
260    2009-11-15  Ingo Feinerer  <feinerer@logic.at>
261    
262            * R/matrix.R (tokenize): Use scan(..., what = "character") instead
263            of RWeka:AlphabeticTokenizer() as default.
264    
265    2009-11-14  Ingo Feinerer  <feinerer@logic.at>
266    
267            * R/transform.R (removeWords.PlainTextDocument): Fix bug which
268            caused words at the beginning or the end of a line not to be removed. Do
269            not delete whitespace anymore.
270    
271    2009-11-12  Ingo Feinerer  <feinerer@logic.at>
272    
273            * R/source.R (DirSource): Default to working directory if no path
274            is specified.
275    
276    2009-11-11  Ingo Feinerer  <feinerer@logic.at>
277    
278            * R/source.R (DirSource): Stop on empty directories.
279    
280    2009-11-07  Ingo Feinerer  <feinerer@logic.at>
281    
282            * R/matrix.R (TermDocumentMatrix): Avoid prefixes originating from
283            named documents.
284    
285    2009-10-21  Ingo Feinerer  <feinerer@logic.at>
286    
287            * R/transform.R (removeWords): Improve regular expressions.
288    
289    2009-10-19  Ingo Feinerer  <feinerer@logic.at>
290    
291            * R/meta.R (DublinCore): Allow lower case tags.
292    
293    2009-10-09  Ingo Feinerer  <feinerer@logic.at>
294    
295            * R/source.R (GmaneSource, ReutersSource): Use xmlChildren(x)
296            instead of x$children.
297    
298    2009-09-15  Ingo Feinerer  <feinerer@logic.at>
299    
300            * R/preprocess.R (preprocessReut21578XML): Fix generated file names.
301    
302    2009-09-06  Ingo Feinerer  <feinerer@logic.at>
303    
304            * R/: Use S3 instead of S4 class system.
305    
306    2009-08-11  Ingo Feinerer  <feinerer@logic.at>
307    
308            * R/reader.R (readMail): Moved to tm.plugin.mail package.
309    
310    2009-07-04  Ingo Feinerer  <feinerer@logic.at>
311    
312            * R/reader.R (readNewsgroup): Rename to readMail as newsgroup
313            postings are basically e-mails with some extra headers.
314    
315    2009-07-03  Ingo Feinerer  <feinerer@logic.at>
316    
317            * R/transform.R: Move convertMboxEml, removeCitation,
318            removeMultipart, and removeSignature to the tm.plugin.mail package
319            since they are mainly utility functions (for handling e-mails) and
320            not very framework specific.
321    
322    2009-06-28  Ingo Feinerer  <feinerer@logic.at>
323    
324            * man/: Fix documentation.
325    
326    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
327    
328            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
329            plain text document instead of an XML document for texts of the
330            Reuters-21578 dataset.
331    
332            * R/sparse.R: Removed since the slam package is now available on
333            CRAN.
334    
335            * DESCRIPTION (Depends): Add slam package.
336    
337    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
338    
339            * R/transform.R (stemDoc): Fix character(0) handling.
340    
341    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
342    
343            * R/doc.R (show): Pretty print.
344    
345    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
346    
347            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
348            gracefully.
349    
350    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
351    
352            * R/corpus.R: Make corpus virtual. Implement corpus with standard
353            and permanent storage semantics.
354    
355            * DESCRIPTION: New major release. A *lot* of improvements.
356    
357    2009-05-04   Ingo Feinerer <feinerer@logic.at>
358    
359            * NAMESPACE: Export some simple_triplet_matrix functions.
360    
361    2009-04-28   Ingo Feinerer <feinerer@logic.at>
362    
363            * R/weight.R: Adapt tf-idf to new matrix format.
364    
365    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
366    
367            * R/matrix.R: Create two distinct classes for term-document and
368            document-term matrices.
369    
370    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
371    
372            * R/termdocmatrix.R: No longer use Matrix package. This reduces
373            package start-up time significantly.
374    
375    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
376    
377            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
378    
379    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
380    
381            * R/transform.R (tmReduce): Combine multiple maps into one
382            transformation.
383    
384    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
385    
386            * R/weight.R: Remove weightLogical since it does not return a
387            dgCMatrix.
388    
389            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
390            or TermDocumentMatrix instead.
391    
392    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
393    
394            * inst/doc/extensions.Rnw: Finished vignette.
395    
396    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
397    
398            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
399            DocumentTermMatrix representations.
400    
401    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
402    
403            * R/reader.R (readXML): New reader for arbitrary XML files.
404    
405    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
406    
407            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
408            (XMLSource): New XMLSource class for arbitrary XML files.
409            (Source): New slot Vectorized.
410    
411    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
412    
413            * R/reader.R (readTabular): Experimental reader for tabular data
414            structures which can be customized via user-defined mappings.
415    
416            * R/reader.R: Always use UTC time zone.
417    
418            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
419    
420    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
421    
422            * R/reader.R (readDOC): Options can be passed over to antiword.
423    
424            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
425            pdftotext.
426    
427    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
428    
429            * R/source.R (DirSource): Add pattern and ignore.case arguments
430            which are internally passed over to list.files().
431    
432    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
433    
434            * inst/doc/tm.Rnw: Suppress pointless loading message.
435    
436    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
437    
438            * DESCRIPTION: Speed up package loading (via moving packages not
439            strictly necessary for normal operation to Suggests instead of
440            Depends).
441    
442    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
443    
444            * R/reader.R (readNewsgroup): The date format is now configurable.
445    
446    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
447    
448            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
449    
450    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
451    
452            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
453    
454    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
455    
456            * R/source.R (DataframeSource): New source class for data frames.
457    
458            * R/source.R: Fixed non-standard call evaluation.
459    
460    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
461    
462            * R/source.R (URISource): New source class for a single document.
463    
464    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
465    
466            * R/source.R: Refactoring.
467    
468    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
469    
470            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
471            Rmpi installations more gracefully.
472    
473    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
474    
475            * R/source.R (Source): Add Length slot.
476    
477    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
478    
479            * R/AAA.R: Unify duplicated .onLoad function.
480    
481    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
482    
483            * DESCRIPTION (Suggests): Added Rmpi.
484    
485    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
486    
487            * R/source.R (getElem): Fix 'no visible binding' warning.
488    
489            * man/WeightFunction.Rd: Fix signature.
490    
491    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
492    
493            * R/weight.R: Introduce name abbreviations for weighting functions.
494    
495    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
496    
497            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
498    
499            * R/cluster.R: Provide convenience functions for using a MPI
500            cluster.
501    
502            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
503            available.
504    
505            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
506            available.
507    
508    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
509    
510            * R/textdoccol.R (lapply): Removed debug print out.
511    
512    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
513    
514            * R/reader.R (readRCV1): Improved meta data extraction from
515            Reuters Corpus Volume 1 documents.
516    
517    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
518    
519            * R/transform.R: Ensure that all mappings preserve multiline
520            structures.
521    
522    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
523    
524            * R/filter.R: Every filter has now an attribute indicating whether
525            it sould be applied to document level (doclevel).
526    
527            * R/textdoccol.R (tmFilter): Set searchFullText as new default
528            filter.
529    
530    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * R/transform.R (replacePatterns): Replaced removeWords by
533            replacePatterns. Suggested by Christian Buchta.
534    
535            * R/textdoccol.R (inspect): Improved formatting.
536    
537    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
538    
539            * inst/CITATION: Updated JSS article information.
540    
541            * R/textdoccol.R (setAs): Added coerce method from list to
542            corpus.
543    
544            * R/meta.R (meta): Improved meta data handling.
545    
546    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
547    
548            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
549            Christian Buchta.
550    
551            * inst/CITATION: Added template to include JSS article reference.
552    
553    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
554    
555            * R/textdoccol.R (tmMap): Introduced lazy mapping.
556    
557            * R/source.R: Added VectorSource.
558    
559    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
560    
561            * man/: Language codes should be in ISO 639-1 format.
562    
563            * R/textdoccol.R (asPlain): Preserve local meta data.
564    
565    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
566    
567            * R/textdoccol.R (writeCorpus): Function for writing a corpus
568            containing plain text documents to disk.
569    
570    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
571    
572            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
573            always set correctly.
574    
575            * R/textdoccol.R: Set load = TRUE as default for load on demand
576            since in most cases this is the wanted behaviour.
577    
578    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
581    
582            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
583    
584    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
585    
586            * R/meta.R (meta): New function for consistent access to meta data
587            of document collections, repositories, and texts.
588    
589    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
590    
591            * R/: Better support for encodings.
592    
593    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
594    
595            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
596            selection when no reader argument is given.
597    
598    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
599    
600            * R/source.R (CSVSource): Now uses read.csv instead of scan
601            internally.
602    
603    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
604    
605            * R/reader.R (getReaders): Returns available reader functions.
606    
607            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
608            as default.
609    
610    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
611    
612            * R/stopwords.R (stopwords): Shortened code, removed codetools
613            variable warnings.
614    
615            * man/: Documentation for showMeta, added an example for tmMap.
616    
617            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
618            some minor typos fixed.
619    
620    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
621    
622            * R/aobjects.R (showMeta): Added method for pretty printing a
623            text document's meta data.
624    
625    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
626    
627            * R/textdoccol.R (TextDocCol): Better handling of empty
628            arguments.
629    
630            * NAMESPACE: Exported readDOC.
631    
632            * man/completeStems.Rd: Added an example.
633    
634    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
635    
636            * R/stopwords.R (stopwords): Look up .dat files at every
637            call. Allows users to modify stopword .dat files interactively.
638    
639    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
640    
641            * R/termdocmatrix.R (termFreq): Correct processing of empty
642            documents.
643    
644    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
645    
646            * man/: Updated documentation.
647    
648    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
649    
650            * R/complete.R (completeStems): Completes (heuristically) word
651            stems.
652    
653            * R/termdocmatrix.R (TermDocMatrix2): New modular
654            constructor.
655    
656            * NAMESPACE: Exported termFreq.
657    
658    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
659    
660            * R/reader.R (readDOC): Added MS Word reader (using antiword).
661    
662    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
663    
664            * R/weight.R: Weighting functions for TermDocMatrix.
665    
666    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
667    
668            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
669            functions for accessing dimension, column, and row names.
670    
671            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
672    
673    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
674    
675            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
676    
677    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
678    
679            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
680    
681    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
682    
683            * R/reader.R (readPDF): Removed manual checks for pdftotext and
684            pdfinfo. The system call gives a warning anyway.
685    
686    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
687    
688            * R/textdoccol.R (asPlain): Conversion from
689            StructuredTextDocuments to PlainTextDocuments.
690    
691    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
692    
693            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
694            for accessing term-document matrices.
695    
696            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
697            are installed.
698    
699    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
700    
701            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
702            Christian Buchta.
703    
704    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
705    
706            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
707    
708    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
709    
710            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
711    
712            * R/reader.R (readPDF): Added PDF reader.
713    
714    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
715    
716            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
717    
718            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
719    
720            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
721    
722            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
723    
724    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
725    
726            * R/distmeasure.R (dissimilarity): Replaced dists call from
727            package cba by new dist call from package proxy.
728    
729    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
730    
731            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
732    
733    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
734    
735            * R/termdocmatrix.R: require() uses the quietly option to suppress
736            loading messages.
737    
738    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
739    
740            * R/dictionary.R: Added dictionary support.
741    
742    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
743    
744            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
745            documents. This simplifies some functions, e.g., asPlain.
746    
747    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
748    
749            * inst/doc/tm.Rnw: Fixed some typos in vignette.
750    
751    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * R/textdoccol.R (replaceWords): Added method to replace a set of
754            words by a single word. Useful for synonyms.
755    
756    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
757    
758            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
759    
760    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
761    
762            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
763            vectors. Thanks to Ariel Maguyon for his error report.
764            (removeSparseTerms): New function to remove columns from a
765            term-document matrix exceeding a sparse factor.
766    
767    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
768    
769            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
770    
771    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
772    
773            * man/sFilter.Rd: Corrected documentation on statement format (use
774            '==' instead of '=').
775    
776    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
777    
778            * R/aobjects.R (StructuredTextDocument): Inherits from
779            TextDocument.
780    
781    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
782    
783            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
784            on sparse matrices as proposed by Martin Maechler.
785    
786    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
787    
788            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
789            \pkg{filehash} version makes them deprecated.
790    
791    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
792    
793            * R/termdocmatrix.R (textvector): Stemming is now performed before
794            erasing stopwords.
795            (weightMatrix): Adapted to handle sparse matrices.
796            (TermDocMatrix): Sparse matrix is now efficiently built by
797            direct stepwise insertion of row values into it.
798    
799    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
800    
801            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
802            due to ongoing problems. For our purposes the latter is as useful
803            as the replaced package.
804    
805    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
806    
807            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
808    
809            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
810    
811    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
812    
813            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
814            languages with available stopwords.
815    
816    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
817    
818            * inst/doc/tm.Rnw: Minor corrections in the vignette.
819    
820    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
821    
822            * DESCRIPTION: Update to version 0.2, since a lot of new features
823            have been integrated.
824    
825            * inst/stopwords: Updated existing stopwords and added stopwords
826            for various other languages.
827    
828    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
829    
830            * man/: Updated documentation.
831    
832            * Work/testDb.R: Script to test database stuff.
833    
834            * R/: Fixed various database related bugs. Seems to be rather
835            useable now, i.e., consider as alpha status for now.
836    
837    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
838    
839            * R/: Fixed some bugs related to database support.
840    
841    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
842    
843            * man/: Added a lot of examples to the manuals.
844    
845    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
846    
847            * man/: Updated parts of the documentation.
848    
849            * R/textdoccol.R (asPlain): Added conversion from newsgroup
850            documents to plain text documents.
851    
852    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
853    
854            * R/textdoccol.R: Finished experimental database support. Not yet
855            intensively tested.
856    
857            * R/source.R: Now each source has a default reader.
858    
859            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
860            class anymore.
861    
862            * R/plaintextdoc.R: Custom show method for plain text documents.
863    
864            * R/aobjects.R: Added a class for structured text documents.
865    
866            * R/reader.R: Replaced remaining \code{parser} occurrences with
867            \code{reader}.
868    
869            * R/textdoccol.R (summary): Indent tags.
870    
871            * R/textdoccol.R (removePunctuation): Transform method to remove
872            punctuation marks.
873    
874    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
875    
876            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
877            using prescindMeta().
878    
879    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
880    
881            * R/textdoccol.R: Improved database support.
882    
883    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
884    
885            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
886    
887            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
888            language code.
889    
890            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
891            into parserControl argument.
892    
893            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
894    
895    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
896    
897            * Work/tmDataSetup.R: The datasets acq and crude can now be
898            created on the fly.
899    
900            * R/stopwords.R: Introduced a function returning the stopwords for
901            a given language (English, German and French at the moment)
902    
903            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
904            otherwise falls back to Snowball package.
905    
906    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
907    
908            * man/dissimilarity-methods.Rd: Make clear that any method offered
909            by "dists" from package "cba" can be used.
910    
911    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
912    
913            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
914            to Kurt's latex suggestion. Removed points and underscores in
915            variable names for consistent naming.
916    
917            * DESCRIPTION: Update to version 0.1-2.
918    
919            * man/TextRepository.Rd: Fixed bug in documentation.
920    
921    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
922    
923            * DESCRIPTION: Update to version 0.1-1.
924    
925    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
926    
927            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
928            wordStem.
929    
930    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
931    
932            * R/: Changes due to Kurt's review.
933    
934    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
935    
936            * R/: Implemented improvements based upon comments by David
937            Meyer.
938    
939    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
940    
941            * inst/doc/: Rewrote vignette.
942    
943            * man/: Improved documentation.
944    
945    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
946    
947            * man/: Updated documentation.
948    
949            * DESCRIPTION: Changed package name to "tm". Updated version to
950            0.1 for first CRAN release.
951    
952            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
953            list archive example.
954    
955            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
956            archive example.
957    
958            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
959            from (several mails per box) mbox format to (single mail per file)
960            eml format.
961    
962    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
963    
964            * data/crude.rda: Rebuilt.
965    
966            * data/acq.rda: Rebuilt.
967    
968            * R/reader.R: Factored out reader and parser methods from
969            textdoccol.R.
970    
971            * R/source.R: Factored out Source methods from aobjects.R and
972            textdoccol.R.
973            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
974            feeds.
975    
976            * R/textdoccol.R (DirSource): Added support for recursive
977            traversal of directories.
978    
979    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
980    
981            * R/textdoccol.R ([[): Loads the document corpus automatically
982            into memory upon access.
983            (tm_transform, tm_filter): Removed several checks whether the
984            document is already loaded ([[ ensures this now).
985            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
986            mailing list archive.
987    
988    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
989    
990            * R/aobjects.R (TextDocument): Is now a virtual class.
991            (Source): Is now a virtual class.
992    
993    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
994    
995            * R/textdoccol.R (c): Support for an arbitrary number of document
996            collections.
997    
998    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
999    
1000            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
1001            append_meta and remove_meta.
1002    
1003            * R/textdoccol.R: Removed modify_metadata method.
1004    
1005            * R/textrepo.R: Removed modify_metadata method.
1006    
1007            * R/textdoccol.R (remove_meta): Supports removal of document
1008            collection metadata and document (= in data frame) metadata.
1009    
1010    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1011    
1012            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
1013    
1014            * data/crude.rda: Rebuilt.
1015    
1016            * data/acq.rda: Rebuilt.
1017    
1018            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
1019    
1020            * R/textdoccol.R ([): Bug fix for subsetting a document
1021            collection's data frame.
1022    
1023    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1024    
1025            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
1026            to s_filter.
1027    
1028            * R/textdoccol.R: Local text documents' metadata can now be copied
1029            to a document collection's data frame with prescind_meta.
1030    
1031    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1032    
1033            * R/: Text documents' slot metadata is now accessible in s_filter.
1034    
1035            * R/: Rewrote s_filter function (has still some restrictions).
1036    
1037    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1038    
1039            * R/: Various fixes in handling metadata.
1040    
1041            * R/: Added update mechanism for text document collections.
1042    
1043    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1044    
1045            * R/: Merging of document collections now creates a binary tree
1046            for reconstructing merged document collections.
1047    
1048            * R/: Redesign of metadata for document collections.
1049    
1050    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1051    
1052            * R/: Messages now use \code{ngettext}.
1053    
1054    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1055    
1056            * R/: Added functions for modifying and removing metadata.
1057    
1058    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1059    
1060            * man/: Updated some documentation.
1061    
1062            * R/: Corrected some connection issues.
1063    
1064            * inst/doc: Worked on the vignette.
1065    
1066    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1067    
1068            * inst/: Added texts and started vignette.
1069    
1070            * R/: Final changes based upon David's comments.
1071    
1072    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1073    
1074            * NAMESPACE: Corrected exports (generic methods need exportMethods
1075            directives!).
1076    
1077    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1078    
1079            * R/: Modified the TextDocCol constructur and various parsers. It
1080            is now modular and supports various file formats via plugins (see
1081            the new "Source" class).
1082    
1083    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1084    
1085            * man/: Revised documentation after previous code changes.
1086    
1087    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1088    
1089            * R/: Remaining changes as discussed with David.
1090    
1091    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1092    
1093            * R/: Some changes as suggested by David. The rest will follow
1094            within the next days.
1095    
1096    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1097    
1098            * man/: Finished documentation.
1099    
1100    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1101    
1102            * man/: Wrote some documentation.
1103    
1104    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1105    
1106            * R/: Further syntactic sugar in form of additional assignment and
1107            accessor methods.
1108    
1109    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1110    
1111            * R/: Syntactic sugar in form of "length", "show" and "summary"
1112            operators.
1113    
1114    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1115    
1116            * R/: Diverse updates. Mainly on default operators ("[" or "c")
1117            and dissimilarities.
1118    
1119    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1120    
1121            * R/: Added similarity functions.
1122    
1123            * data/: Added english stopwords.
1124    
1125    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1126    
1127            * data/: Examples compiled for new features
1128    
1129            * R/: Changes due to new structure.
1130    
1131            * NAMESPACE: Corrected namespace to reflect new structure.
1132    
1133            * R/termdocmatrix.R: Adapted for new naming scheme.
1134    
1135    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1136    
1137            * R/textdoccol.R: Adapted code for new class structure. Wrote
1138            several transform and filter functions operating on text document
1139            collections (alias text document databases).
1140    
1141            * R/aobjects.R: Adapted class structure with inheritance,
1142            repositories and additional meta data. Loading files on demand is
1143            now possible.
1144    
1145    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1146    
1147            * R/: Some cosmetic cleanups.
1148    
1149            * inst/: Removed vignette on clustering. That and much more is now
1150            described in the JSS paper on text mining. Based upon that
1151            article an elaborated vignette will be incorporated in the future.
1152    
1153    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1154    
1155            * R/: Updated generic S4 methods to comply with signature changes
1156            in newer versions of R (> 2.3)
1157    
1158    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1159    
1160            * ext/R/importRIS.R: Automatic RIS import is now possible.
1161    
1162    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1163    
1164            * R/textdoccol.R: Added RIS HTML input format.
1165    
1166    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1167    
1168            * R/textdoccol.R: Removed bug that caused invalid text document
1169            collections when handling many input files.
1170    
1171    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1172    
1173            * R/textdoccol.R: Restructured and extended file import
1174            mechanism.
1175    
1176            * inst/doc/clustering.Rnw: Adapted vignette for use with
1177            ReutNews.rda
1178    
1179            * man/ReutNews.Rd: Documentation for ReutNews.rda
1180    
1181            * data/ReutNews.rda: A tiny Reuters21578 example data set.
1182    
1183    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1184    
1185            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
1186            clustering facilities of this package.
1187    
1188    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1189    
1190            * R/aobjects.R: Changed package document structure to avoid class
1191            dependency problems.
1192    
1193    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1194    
1195            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
1196            data set.
1197    
1198            *  Finished documentation and reordered directory structure. Now "R
1199            CMD check textmin" works without errors.
1200    
1201    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1202    
1203            * src/: Various splits can now be easily created for the
1204            Reuters21578 data set.
1205    
1206    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1207    
1208            *  Updated documentation
1209    
1210    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1211    
1212            *  Wrote R documentation for some classes and methods.
1213    
1214    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1215    
1216            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
1217            files. See the questionnaire data/Umfrage.csv for such an example.
1218            We are now able to import files in Reuters-21578 XML format.
1219    
1220            *  Changed class interfaces in various files. Weighting of the text
1221            matrix is now possible.
1222    
1223    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1224    
1225            * R/textdoccol.R: One can build term-document matrices if
1226            nessecary (with buildTDM(...)) and fill the field tdm from a text
1227            document collection with it.
1228    
1229            * R/textmatrix.R: Wrote S4 class for term-document matrices.
1230    
1231    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1232    
1233            * R/textdoccol.R: We now can read in a whole XML file with several
1234            news items.
1235    
1236  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1237    
1238          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.1231

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge