[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 28, Tue Dec 6 13:46:33 2005 UTC
pkg/ChangeLog revision 1445, Sun Oct 9 09:30:58 2016 UTC
1    2014-04-20  Ingo Feinerer <feinerer@logic.at>
2    
3            * ChangeLog: Not maintained as a separate file anymore. Please consult
4            the tm Subversion log messages (available at
5            https://r-forge.r-project.org/scm/viewvc.php/pkg/?root=tm) instead.
6    
7    2014-02-25  Ingo Feinerer <feinerer@logic.at>
8    
9            * NAMESPACE: Export pGetElem.URISource.
10    
11    2014-02-23  Ingo Feinerer <feinerer@logic.at>
12    
13            * R/complete.R (stemCompletion.PlainTextDocument): Avoid spurious
14            duplicate results. Reported by Seong-Hyeon Kim.
15    
16    2014-01-28  Ingo Feinerer <feinerer@logic.at>
17    
18            * R/utils.R (map_IETF_Snowball): Process three letter codes.
19    
20    2014-01-07  Ingo Feinerer <feinerer@logic.at>
21    
22            * DESCRIPTION (Version): Prepare for CRAN New Year release.
23    
24    2014-01-05  Ingo Feinerer <feinerer@logic.at>
25    
26            * R/matrix.R (findAssocs): Allow multiple and non-existing terms.
27            Suggested by Christian Buchta.
28    
29            * R/source.R (is.Source): New check for valid source.
30    
31    2013-12-28  Ingo Feinerer <feinerer@logic.at>
32    
33            * R/matrix.R (findAssocs): Make corlimit inclusive.
34    
35    2013-09-27  Ingo Feinerer <feinerer@logic.at>
36    
37            * R/source.R: Allow multiple URIs for URISource.
38    
39    2013-09-19  Ingo Feinerer <feinerer@logic.at>
40    
41            * R/source.R (Source): New Source constructor.
42    
43    2013-08-26  Ingo Feinerer <feinerer@logic.at>
44    
45            * R/source.R (DirSource): Report non-existent or non-readable files.
46            Suggested by Ajinkya Kale and Milan Bouchet-Valat.
47    
48    2013-08-19  Ingo Feinerer <feinerer@logic.at>
49    
50            * R/corpus.R (setOldClass): Do not register VCorpus as S4 class
51            anymore.
52    
53            * R/doc.R (setOldClass): Do not register PlainTextDocument as S4 class
54            anymore.
55    
56    2013-08-09  Ingo Feinerer <feinerer@logic.at>
57    
58            * DESCRIPTION (License): Changed to GPL-3.
59    
60    2013-07-25  Ingo Feinerer <feinerer@logic.at>
61    
62            * R/complete.R (stemCompletion): Report NA instead of error when no
63            completion can be found by the prevalent heuristic. Suggested by Hugh
64            Devlin.
65    
66    2013-07-10  Ingo Feinerer <feinerer@logic.at>
67    
68            * R/reader.R (readPDF): Use tm:::pdfinfo() (which needs the pdfinfo
69            command line tool) instead of tools:::pdf_info().
70    
71    2013-04-11  Ingo Feinerer <feinerer@logic.at>
72    
73            * R/transform.R (removeWords): Use PCRE UCP to use Unicode properties
74            to determine character types.
75    
76    2012-12-14  Ingo Feinerer <feinerer@logic.at>
77    
78            * R/matrix.R (TermDocumentMatrix): Ensure dimnames of type character
79            when generating a simple_triplet_matrix. Reported by Arho Suominen.
80    
81    2012-12-10  Ingo Feinerer <feinerer@logic.at>
82    
83            * man/tm_reduce.Rd: Document right to left folding order. Adapt
84            example as well. Suggested by Mark Rosenstein.
85    
86    2012-12-04  Ingo Feinerer <feinerer@logic.at>
87    
88            * R/filter.R (sFilter): Avoid attach() and simplify.
89    
90    2012-11-02  Ingo Feinerer <feinerer@logic.at>
91    
92            * R/doc.R (.TextDocument): Use casts to ensure data types and to avoid
93            removal of attributes.
94    
95    2012-10-03  Ingo Feinerer  <feinerer@logic.at>
96    
97            * R/weight.R (weightTfIdf, weightSMART): Gracefully handle empty
98            columns and rows (avoids blow-up due to NaN values). Suggested by Jaap
99            Frölich.
100    
101    2012-07-27  Ingo Feinerer  <feinerer@logic.at>
102    
103            * R/transform.R (removeWords): Allow longer stopword lists.
104    
105    2012-01-31  Ingo Feinerer  <feinerer@logic.at>
106    
107            * R/reader.R (readXML): Readers can now set the document language
108            themselves.
109    
110    2012-01-14  Ingo Feinerer  <feinerer@logic.at>
111    
112            * R/source.R (XMLSource, getElem.XMLSource): Simplifications as
113            proposed by Milan Bouchet-Valat.
114    
115    2012-01-11  Ingo Feinerer  <feinerer@logic.at>
116    
117            * R/matrix.R (termFreq): Fix processing of user provided
118            stopwords. Reported by Bettina Grün.
119    
120    2011-12-23  Ingo Feinerer  <feinerer@logic.at>
121    
122            * R/matrix.R (termFreq): Fix invalid handling of
123            control$wordLengths[1]. Reported by Steven C. Bagley.
124    
125    2011-12-17  Ingo Feinerer  <feinerer@logic.at>
126    
127            * DESCRIPTION (Version): Prepare for CRAN Christmas release.
128    
129    2011-12-12  Ingo Feinerer  <feinerer@logic.at>
130    
131            * R/utils.R (map_IETF_Snowball): Map empty input to "porter".
132    
133    2011-12-07  Ingo Feinerer  <feinerer@logic.at>
134    
135            * R/transform.R (removePunctuation): Add option to preserve
136            intra-word dashes.
137    
138    2011-12-06  Ingo Feinerer  <feinerer@logic.at>
139    
140            * R/matrix.R (termFreq): Allow reordering of control option
141            processing.
142    
143    2011-11-17  Ingo Feinerer  <feinerer@logic.at>
144    
145            * R/reader.R (readPDF): Use tools:::pdf_info() instead of external
146            pdfinfo tool.
147    
148            * inst/stopwords/SMART.dat: Add SMART information retrieval system
149            stopwords (which are also used by the MC toolkit).
150    
151            * R/matrix.R (termFreq): Allow local option \code{bounds$local} to
152            restrict how often a term may appear in each document (generalizes
153            \code{minDocFreq}). Similarly the local option \code{wordLengths}
154            for word length bounds (generalizes \code{minWordLength}).
155    
156            * R/matrix.R (TermDocumentMatrix.VCorpus): New global option
157            \code{bounds$global} for restricting how often a term is allowed
158            to appear in different documents.
159    
160            * R/matrix.R (TermDocumentMatrix.VCorpus): Distinguish between
161            local options delegated internally to termFreq() and global
162            options which are processed by the term-document matrix
163            constructor itself.
164    
165    2011-11-15  Ingo Feinerer  <feinerer@logic.at>
166    
167            * man/getTokenizers.Rd: Document getTokenizers().
168    
169            * man/tokenizer.Rd: Document MC_tokenizer() and scan_tokenizer().
170    
171    2011-11-04  Ingo Feinerer  <feinerer@logic.at>
172    
173            * man/matrix.Rd: Document as.TermDocumentMatrix.term_frequency.
174    
175            * man/combine.Rd: Document c.term_frequency().
176    
177    2011-10-11  Ingo Feinerer  <feinerer@logic.at>
178    
179            * R/meta.R (`meta<-.Corpus`): Assume that the replacement value
180            can be accessed via '[' and not '[['.
181    
182    2011-08-24  Ingo Feinerer  <feinerer@logic.at>
183    
184            * R/stopwords.R (stopwords): Raise an error if no stopwords are
185            available for the requested language. Suggested by Derek M Jones.
186    
187    2011-05-27  Ingo Feinerer  <feinerer@logic.at>
188    
189            * R/weight.R (weightSMART): Implement Cosine and pivoted unique
190            normalization.
191    
192    2011-02-17  Ingo Feinerer  <feinerer@logic.at>
193    
194            * R/transform.R (stemDocument.PlainTextDocument): Use language
195            argument.
196    
197    2011-02-04  Ingo Feinerer  <feinerer@logic.at>
198    
199            * R/source.R: Store strings and connections instead of unevaluated
200            calls.
201    
202    2010-11-26  Ingo Feinerer  <feinerer@logic.at>
203    
204            * R/corpus.R (Corpus): Allow init and exit hooks for readers.
205    
206    2010-10-22  Ingo Feinerer  <feinerer@logic.at>
207    
208            * R/matrix.R (.TermDocumentMatrix): Make Weighting an attribute
209            (instead of a list element).
210    
211    2010-10-16  Ingo Feinerer  <feinerer@logic.at>
212    
213            * R/corpus.R (`[[.VCorpus`, `[[.PCorpus`): Access individual
214            documents by names (fallback to IDs if names are not set).
215    
216    2010-08-25  Ingo Feinerer  <feinerer@logic.at>
217    
218            * R/corpus.R (c.Corpus): When concatenating corpora, the argument
219            \code{recursive} now determines whether existing corpus metadata
220            is used.
221    
222    2010-08-06  Ingo Feinerer  <feinerer@logic.at>
223    
224            * R/transform.R: Removed convert_UTF_8(). Use enc2utf8() instead.
225    
226    2010-06-17  Ingo Feinerer  <feinerer@logic.at>
227    
228            * R/matrix.R (TermDocumentMatrix): If a dictionary is given do not
229            remove terms not occurring in the corpus anymore.
230    
231    2010-06-02  Ingo Feinerer  <feinerer@logic.at>
232    
233            * R/plot.R (Zipf_plot, Heaps_plot): Plotting functions for Zipf's
234            and Heaps' law.
235    
236    2010-05-18  Ingo Feinerer  <feinerer@logic.at>
237    
238            * R/corpus.R (Corpus, PCorpus): Use element names as IDs if
239            provided by a source.
240    
241    2010-04-09  Ingo Feinerer  <feinerer@logic.at>
242    
243            * R/source.R (.Source): Provide document names.
244    
245    2010-04-07  Ingo Feinerer  <feinerer@logic.at>
246    
247            * R/meta.R (`content_or_meta`): Utility function.
248    
249    2010-03-19  Ingo Feinerer  <feinerer@logic.at>
250    
251            * R/reader.R (readReut21578XML, readReut21578XMLasPlain): Extract
252            TOPICS, LEWISSPLIT, CGISPLIT, and OLDID meta tags.
253    
254    2010-03-03  Ingo Feinerer  <feinerer@logic.at>
255    
256            * R/weight.R (weightTfIdf): Added normalization option.
257    
258            * man/tm_tag_score.Rd: Add General Inquirer example for sentiment
259            analysis.
260    
261    2010-02-25  Ingo Feinerer  <feinerer@logic.at>
262    
263            * R/score.R (tm_tag_score): Compute a score from the number of
264            tags matching in a document.
265    
266    2010-02-18  Ingo Feinerer  <feinerer@logic.at>
267    
268            * R/complete.R (stemCompletion): New completion heuristics.
269    
270    2010-02-17  Ingo Feinerer  <feinerer@logic.at>
271    
272            * R/plot.R (plot.TermDocumentMatrix): Memory improvements.
273    
274    2010-02-06  Ingo Feinerer  <feinerer@logic.at>
275    
276            * DESCRIPTION (Depends): Depend on R (>= 2.10.0) to ensure that
277            setOldClass(c(..., "list")) works.
278    
279    2010-01-22  Ingo Feinerer  <feinerer@logic.at>
280    
281            * R/transform.R (stemDocument.character): In case input is a
282            simple character just delegate to the default Snowball stemmer.
283    
284    2010-01-15  Ingo Feinerer  <feinerer@logic.at>
285    
286            * R/reader.R (readReut21578XML, readRCV1): Extract more meta
287            data.
288    
289    2010-01-12  Ingo Feinerer  <feinerer@logic.at>
290    
291            * R/doc.R (`Content<-`): Be careful with names attribute.
292    
293    2010-01-07  Stefan Theussl  <stefan.theussl@wu.ac.at>
294    
295            * R/source.R (DirSource): Improved implementation especially when
296            handling many (> 1M) files.
297    
298    2009-12-22  Ingo Feinerer  <feinerer@logic.at>
299    
300            * R/source.R (getElem.URISource): Use encoding argument.
301    
302    2009-12-11  Ingo Feinerer  <feinerer@logic.at>
303    
304            * R/doc.R (setOldClass): Register S3 document classes to be
305            recognized by S4 methods.
306    
307    2009-11-25  Ingo Feinerer  <feinerer@logic.at>
308    
309            * R/matrix.R (termFreq): Add option to remove punctuation
310            characters.
311    
312    2009-11-19  Ingo Feinerer  <feinerer@logic.at>
313    
314            * R/matrix.R (c.TermDocumentMatrix): Added combine method for
315            merging multiple term-document matrices.
316    
317    2009-11-17  Ingo Feinerer  <feinerer@logic.at>
318    
319            * R/corpus.R (setOldClass): Register S3 corpus classes to be
320            recognized by S4 methods.
321    
322            * man/plot.Rd: Use \dontrun{} in \examples{} section in the hope
323            that CRAN Mac OS X builds do not fail any longer.
324    
325    2009-11-15  Ingo Feinerer  <feinerer@logic.at>
326    
327            * R/matrix.R (tokenize): Use scan(..., what = "character") instead
328            of RWeka::AlphabeticTokenizer() as default.
329    
330    2009-11-14  Ingo Feinerer  <feinerer@logic.at>
331    
332            * R/transform.R (removeWords.PlainTextDocument): Fix bug which
333            caused words at the beginning or the end of a line not to be removed. Do
334            not delete whitespace anymore.
335    
336    2009-11-12  Ingo Feinerer  <feinerer@logic.at>
337    
338            * R/source.R (DirSource): Default to working directory if no path
339            is specified.
340    
341    2009-11-11  Ingo Feinerer  <feinerer@logic.at>
342    
343            * R/source.R (DirSource): Stop on empty directories.
344    
345    2009-11-07  Ingo Feinerer  <feinerer@logic.at>
346    
347            * R/matrix.R (TermDocumentMatrix): Avoid prefixes originating from
348            named documents.
349    
350    2009-10-21  Ingo Feinerer  <feinerer@logic.at>
351    
352            * R/transform.R (removeWords): Improve regular expressions.
353    
354    2009-10-19  Ingo Feinerer  <feinerer@logic.at>
355    
356            * R/meta.R (DublinCore): Allow lower case tags.
357    
358    2009-10-09  Ingo Feinerer  <feinerer@logic.at>
359    
360            * R/source.R (GmaneSource, ReutersSource): Use xmlChildren(x)
361            instead of x$children.
362    
363    2009-09-15  Ingo Feinerer  <feinerer@logic.at>
364    
365            * R/preprocess.R (preprocessReut21578XML): Fix generated file names.
366    
367    2009-09-06  Ingo Feinerer  <feinerer@logic.at>
368    
369            * R/: Use S3 instead of S4 class system.
370    
371    2009-08-11  Ingo Feinerer  <feinerer@logic.at>
372    
373            * R/reader.R (readMail): Moved to tm.plugin.mail package.
374    
375    2009-07-04  Ingo Feinerer  <feinerer@logic.at>
376    
377            * R/reader.R (readNewsgroup): Rename to readMail as newsgroup
378            postings are basically e-mails with some extra headers.
379    
380    2009-07-03  Ingo Feinerer  <feinerer@logic.at>
381    
382            * R/transform.R: Move convertMboxEml, removeCitation,
383            removeMultipart, and removeSignature to the tm.plugin.mail package
384            since they are mainly utility functions (for handling e-mails) and
385            not very framework specific.
386    
387    2009-06-28  Ingo Feinerer  <feinerer@logic.at>
388    
389            * man/: Fix documentation.
390    
391    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
392    
393            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
394            plain text document instead of an XML document for texts of the
395            Reuters-21578 dataset.
396    
397            * R/sparse.R: Removed since the slam package is now available on
398            CRAN.
399    
400            * DESCRIPTION (Depends): Add slam package.
401    
402    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
403    
404            * R/transform.R (stemDoc): Fix character(0) handling.
405    
406    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
407    
408            * R/doc.R (show): Pretty print.
409    
410    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
411    
412            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
413            gracefully.
414    
415    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
416    
417            * R/corpus.R: Make corpus virtual. Implement corpus with standard
418            and permanent storage semantics.
419    
420            * DESCRIPTION: New major release. A *lot* of improvements.
421    
422    2009-05-04  Ingo Feinerer  <feinerer@logic.at>
423    
424            * NAMESPACE: Export some simple_triplet_matrix functions.
425    
426    2009-04-28  Ingo Feinerer  <feinerer@logic.at>
427    
428            * R/weight.R: Adapt tf-idf to new matrix format.
429    
430    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
431    
432            * R/matrix.R: Create two distinct classes for term-document and
433            document-term matrices.
434    
435    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
436    
437            * R/termdocmatrix.R: No longer use Matrix package. This reduces
438            package start-up time significantly.
439    
440    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
441    
442            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
443    
444    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
445    
446            * R/transform.R (tmReduce): Combine multiple maps into one
447            transformation.
448    
449    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
450    
451            * R/weight.R: Remove weightLogical since it does not return a
452            dgCMatrix.
453    
454            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
455            or TermDocumentMatrix instead.
456    
457    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
458    
459            * inst/doc/extensions.Rnw: Finished vignette.
460    
461    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
462    
463            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
464            DocumentTermMatrix representations.
465    
466    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
467    
468            * R/reader.R (readXML): New reader for arbitrary XML files.
469    
470    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
471    
472            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
473            (XMLSource): New XMLSource class for arbitrary XML files.
474            (Source): New slot Vectorized.
475    
476    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
477    
478            * R/reader.R (readTabular): Experimental reader for tabular data
479            structures which can be customized via user-defined mappings.
480    
481            * R/reader.R: Always use UTC time zone.
482    
483            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
484    
485    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
486    
487            * R/reader.R (readDOC): Options can be passed over to antiword.
488    
489            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
490            pdftotext.
491    
492    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
493    
494            * R/source.R (DirSource): Add pattern and ignore.case arguments
495            which are internally passed over to list.files().
496    
497    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
498    
499            * inst/doc/tm.Rnw: Suppress pointless loading message.
500    
501    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
502    
503            * DESCRIPTION: Speed up package loading (via moving packages not
504            strictly necessary for normal operation to Suggests instead of
505            Depends).
506    
507    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
508    
509            * R/reader.R (readNewsgroup): The date format is now configurable.
510    
511    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
512    
513            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
514    
515    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
516    
517            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
518    
519    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
520    
521            * R/source.R (DataframeSource): New source class for data frames.
522    
523            * R/source.R: Fixed non-standard call evaluation.
524    
525    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
526    
527            * R/source.R (URISource): New source class for a single document.
528    
529    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
530    
531            * R/source.R: Refactoring.
532    
533    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
534    
535            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
536            Rmpi installations more gracefully.
537    
538    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
539    
540            * R/source.R (Source): Add Length slot.
541    
542    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
543    
544            * R/AAA.R: Unify duplicated .onLoad function.
545    
546    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
547    
548            * DESCRIPTION (Suggests): Added Rmpi.
549    
550    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
551    
552            * R/source.R (getElem): Fix 'no visible binding' warning.
553    
554            * man/WeightFunction.Rd: Fix signature.
555    
556    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
557    
558            * R/weight.R: Introduce name abbreviations for weighting functions.
559    
560    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
561    
562            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
563    
564            * R/cluster.R: Provide convenience functions for using a MPI
565            cluster.
566    
567            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
568            available.
569    
570            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
571            available.
572    
573    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
574    
575            * R/textdoccol.R (lapply): Removed debug print out.
576    
577    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
578    
579            * R/reader.R (readRCV1): Improved metadata extraction from
580            Reuters Corpus Volume 1 documents.
581    
582    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
583    
584            * R/transform.R: Ensure that all mappings preserve multiline
585            structures.
586    
587    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
588    
589            * R/filter.R: Every filter now has an attribute indicating whether
590            it should be applied at document level (doclevel).
591    
592            * R/textdoccol.R (tmFilter): Set searchFullText as new default
593            filter.
594    
595    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
596    
597            * R/transform.R (replacePatterns): Replaced removeWords by
598            replacePatterns. Suggested by Christian Buchta.
599    
600            * R/textdoccol.R (inspect): Improved formatting.
601    
602    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
603    
604            * inst/CITATION: Updated JSS article information.
605    
606            * R/textdoccol.R (setAs): Added coerce method from list to
607            corpus.
608    
609            * R/meta.R (meta): Improved metadata handling.
610    
611    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
612    
613            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
614            Christian Buchta.
615    
616            * inst/CITATION: Added template to include JSS article reference.
617    
618    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
619    
620            * R/textdoccol.R (tmMap): Introduced lazy mapping.
621    
622            * R/source.R: Added VectorSource.
623    
624    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
625    
626            * man/: Language codes should be in ISO 639-1 format.
627    
628            * R/textdoccol.R (asPlain): Preserve local metadata.
629    
630    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
631    
632            * R/textdoccol.R (writeCorpus): Function for writing a corpus
633            containing plain text documents to disk.
634    
635    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
636    
637            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
638            always set correctly.
639    
640            * R/textdoccol.R: Set load = TRUE as default for load on demand
641            since in most cases this is the desired behaviour.
642    
643    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
644    
645            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
646    
647            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
648    
649    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
650    
651            * R/meta.R (meta): New function for consistent access to metadata
652            of document collections, repositories, and texts.
653    
654    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * R/: Better support for encodings.
657    
658    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
659    
660            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
661            selection when no reader argument is given.
662    
663    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
664    
665            * R/source.R (CSVSource): Now uses read.csv instead of scan
666            internally.
667    
668    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
669    
670            * R/reader.R (getReaders): Returns available reader functions.
671    
672            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
673            as default.
674    
675    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
676    
677            * R/stopwords.R (stopwords): Shortened code, removed codetools
678            variable warnings.
679    
680            * man/: Documentation for showMeta, added an example for tmMap.
681    
682            * inst/doc/tm.Rnw: Updated vignette, comments on MS Word reader,
683            some minor typos fixed.
684    
685    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
686    
687            * R/aobjects.R (showMeta): Added method for pretty printing a
688            text document's metadata.
689    
690    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
691    
692            * R/textdoccol.R (TextDocCol): Better handling of empty
693            arguments.
694    
695            * NAMESPACE: Exported readDOC.
696    
697            * man/completeStems.Rd: Added an example.
698    
699    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
700    
701            * R/stopwords.R (stopwords): Look up .dat files at every
702            call. Allows users to modify stopword .dat files interactively.
703    
704    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
705    
706            * R/termdocmatrix.R (termFreq): Correct processing of empty
707            documents.
708    
709    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * man/: Updated documentation.
712    
713    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * R/complete.R (completeStems): Completes (heuristically) word
716            stems.
717    
718            * R/termdocmatrix.R (TermDocMatrix2): New modular
719            constructor.
720    
721            * NAMESPACE: Exported termFreq.
722    
723    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
724    
725            * R/reader.R (readDOC): Added MS Word reader (using antiword).
726    
727    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
728    
729            * R/weight.R: Weighting functions for TermDocMatrix.
730    
731    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
732    
733            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
734            functions for accessing dimension, column, and row names.
735    
736            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
737    
738    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
739    
740            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
741    
742    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
743    
744            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
745    
746    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
747    
748            * R/reader.R (readPDF): Removed manual checks for pdftotext and
749            pdfinfo. The system call gives a warning anyway.
750    
751    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * R/textdoccol.R (asPlain): Conversion from
754            StructuredTextDocuments to PlainTextDocuments.
755    
756    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
757    
758            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
759            for accessing term-document matrices.
760    
761            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
762            are installed.
763    
764    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
765    
766            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
767            Christian Buchta.
768    
769    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
770    
771            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
772    
773    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
774    
775            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
776    
777            * R/reader.R (readPDF): Added PDF reader.
778    
779    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
780    
781            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
782    
783            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
784    
785            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
786    
787            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
788    
789    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
790    
791            * R/distmeasure.R (dissimilarity): Replaced dists call from
792            package cba by new dist call from package proxy.
793    
794    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
795    
796            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
797    
798    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
799    
800            * R/termdocmatrix.R: require() uses the quietly option to suppress
801            loading messages.
802    
803    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
804    
805            * R/dictionary.R: Added dictionary support.
806    
807    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
808    
809            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
810            documents. This simplifies some functions, e.g., asPlain.
811    
812    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
813    
814            * inst/doc/tm.Rnw: Fixed some typos in vignette.
815    
816    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
817    
818            * R/textdoccol.R (replaceWords): Added method to replace a set of
819            words by a single word. Useful for synonyms.
820    
821    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
822    
823            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
824    
825    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
826    
827            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
828            vectors. Thanks to Ariel Maguyon for his error report.
829            (removeSparseTerms): New function to remove columns from a
830            term-document matrix exceeding a sparse factor.
831    
832    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
833    
834            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
835    
836    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
837    
838            * man/sFilter.Rd: Corrected documentation on statement format (use
839            '==' instead of '=').
840    
841    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
842    
843            * R/aobjects.R (StructuredTextDocument): Inherits from
844            TextDocument.
845    
846    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
847    
848            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
849            on sparse matrices as proposed by Martin Maechler.
850    
851    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
852    
853            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
854            \pkg{filehash} version deprecates them.
855    
856    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
857    
858            * R/termdocmatrix.R (textvector): Stemming is now performed before
859            erasing stopwords.
860            (weightMatrix): Adapted to handle sparse matrices.
861            (TermDocMatrix): Sparse matrix is now efficiently built by
862            direct stepwise insertion of row values into it.
863    
864    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
865    
866            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
867            due to ongoing problems. For our purposes the latter is as useful
868            as the replaced package.
869    
870    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
871    
872            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
873    
874            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
875    
876    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
877    
878            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
879            languages with available stopwords.
880    
881    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
882    
883            * inst/doc/tm.Rnw: Minor corrections in the vignette.
884    
885    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
886    
887            * DESCRIPTION: Update to version 0.2, since a lot of new features
888            have been integrated.
889    
890            * inst/stopwords: Updated existing stopwords and added stopwords
891            for various other languages.
892    
893    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
894    
895            * man/: Updated documentation.
896    
897            * Work/testDb.R: Script to test database stuff.
898    
899            * R/: Fixed various database-related bugs. Database support seems
900            rather usable now, but consider it alpha status for now.
901    
902    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
903    
904            * R/: Fixed some bugs related to database support.
905    
906    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
907    
908            * man/: Added a lot of examples to the manuals.
909    
910    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
911    
912            * man/: Updated parts of the documentation.
913    
914            * R/textdoccol.R (asPlain): Added conversion from newsgroup
915            documents to plain text documents.
916    
917    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
918    
919            * R/textdoccol.R: Finished experimental database support. Not yet
920            intensively tested.
921    
922            * R/source.R: Now each source has a default reader.
923    
924            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
925            class anymore.
926    
927            * R/plaintextdoc.R: Custom show method for plain text documents.
928    
929            * R/aobjects.R: Added a class for structured text documents.
930    
931            * R/reader.R: Replaced remaining \code{parser} occurrences with
932            \code{reader}.
933    
934            * R/textdoccol.R (summary): Indent tags.
935    
936            * R/textdoccol.R (removePunctuation): Transform method to remove
937            punctuation marks.
938    
939    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
940    
941            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
942            using prescindMeta().
943    
944    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
945    
946            * R/textdoccol.R: Improved database support.
947    
948    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
949    
950            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
951    
952            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
953            language code.
954    
955            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
956            into parserControl argument.
957    
958            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
959    
960    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
961    
962            * Work/tmDataSetup.R: The datasets acq and crude can now be
963            created on the fly.
964    
965            * R/stopwords.R: Introduced a function returning the stopwords for
966            a given language (English, German and French at the moment).
967    
968            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
969            otherwise falls back to Snowball package.
970    
971    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
972    
973            * man/dissimilarity-methods.Rd: Make clear that any method offered
974            by "dists" from package "cba" can be used.
975    
976    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
977    
978            * inst/doc/tm.Rnw: Fixed the quotes-appearing-as-boxes bug according
979            to Kurt's LaTeX suggestion. Removed points and underscores in
980            variable names for consistent naming.
981    
982            * DESCRIPTION: Update to version 0.1-2.
983    
984            * man/TextRepository.Rd: Fixed bug in documentation.
985    
986    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
987    
988            * DESCRIPTION: Update to version 0.1-1.
989    
990    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
991    
992            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
993            wordStem.
994    
995    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
996    
997            * R/: Changes due to Kurt's review.
998    
999    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1000    
1001            * R/: Implemented improvements based upon comments by David
1002            Meyer.
1003    
1004    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1005    
1006            * inst/doc/: Rewrote vignette.
1007    
1008            * man/: Improved documentation.
1009    
1010    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1011    
1012            * man/: Updated documentation.
1013    
1014            * DESCRIPTION: Changed package name to "tm". Updated version to
1015            0.1 for first CRAN release.
1016    
1017            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
1018            list archive example.
1019    
1020            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
1021            archive example.
1022    
1023            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
1024            from (several mails per box) mbox format to (single mail per file)
1025            eml format.
1026    
1027    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1028    
1029            * data/crude.rda: Rebuilt.
1030    
1031            * data/acq.rda: Rebuilt.
1032    
1033            * R/reader.R: Factored out reader and parser methods from
1034            textdoccol.R.
1035    
1036            * R/source.R: Factored out Source methods from aobjects.R and
1037            textdoccol.R.
1038            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
1039            feeds.
1040    
1041            * R/textdoccol.R (DirSource): Added support for recursive
1042            traversal of directories.
1043    
1044    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1045    
1046            * R/textdoccol.R ([[): Loads the document corpus automatically
1047            into memory upon access.
1048            (tm_transform, tm_filter): Removed several checks whether the
1049            document is already loaded ([[ ensures this now).
1050            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
1051            mailing list archive.
1052    
1053    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1054    
1055            * R/aobjects.R (TextDocument): Is now a virtual class.
1056            (Source): Is now a virtual class.
1057    
1058    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1059    
1060            * R/textdoccol.R (c): Support for an arbitrary number of document
1061            collections.
1062    
1063    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1064    
1065            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
1066            append_meta and remove_meta.
1067    
1068            * R/textdoccol.R: Removed modify_metadata method.
1069    
1070            * R/textrepo.R: Removed modify_metadata method.
1071    
1072            * R/textdoccol.R (remove_meta): Supports removal of document
1073            collection metadata and document (= in data frame) metadata.
1074    
1075    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1076    
1077            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
1078    
1079            * data/crude.rda: Rebuilt.
1080    
1081            * data/acq.rda: Rebuilt.
1082    
1083            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
1084    
1085            * R/textdoccol.R ([): Bug fix for subsetting a document
1086            collection's data frame.
1087    
1088    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1089    
1090            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
1091            to s_filter.
1092    
1093            * R/textdoccol.R: Local text documents' metadata can now be copied
1094            to a document collection's data frame with prescind_meta.
1095    
1096    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1097    
1098            * R/: Text documents' slot metadata is now accessible in s_filter.
1099    
1100            * R/: Rewrote s_filter function (still has some restrictions).
1101    
1102    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1103    
1104            * R/: Various fixes in handling metadata.
1105    
1106            * R/: Added update mechanism for text document collections.
1107    
1108    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1109    
1110            * R/: Merging of document collections now creates a binary tree
1111            for reconstructing merged document collections.
1112    
1113            * R/: Redesign of metadata for document collections.
1114    
1115    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1116    
1117            * R/: Messages now use \code{ngettext}.
1118    
1119    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1120    
1121            * R/: Added functions for modifying and removing metadata.
1122    
1123    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1124    
1125            * man/: Updated some documentation.
1126    
1127            * R/: Corrected some connection issues.
1128    
1129            * inst/doc: Worked on the vignette.
1130    
1131    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1132    
1133            * inst/: Added texts and started vignette.
1134    
1135            * R/: Final changes based upon David's comments.
1136    
1137    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1138    
1139            * NAMESPACE: Corrected exports (generic methods need exportMethods
1140            directives!).
1141    
1142    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1143    
1144            * R/: Modified the TextDocCol constructor and various parsers. It
1145            is now modular and supports various file formats via plugins (see
1146            the new "Source" class).
1147    
1148    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1149    
1150            * man/: Revised documentation after previous code changes.
1151    
1152    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1153    
1154            * R/: Remaining changes as discussed with David.
1155    
1156    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1157    
1158            * R/: Some changes as suggested by David. The rest will follow
1159            within the next few days.
1160    
1161    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1162    
1163            * man/: Finished documentation.
1164    
1165    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1166    
1167            * man/: Wrote some documentation.
1168    
1169    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1170    
1171            * R/: Further syntactic sugar in form of additional assignment and
1172            accessor methods.
1173    
1174    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1175    
1176            * R/: Syntactic sugar in form of "length", "show" and "summary"
1177            operators.
1178    
1179    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1180    
1181            * R/: Various updates, mainly to default operators ("[" or "c")
1182            and dissimilarities.
1183    
1184    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1185    
1186            * R/: Added similarity functions.
1187    
1188            * data/: Added English stopwords.
1189    
1190    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1191    
1192            * data/: Examples compiled for new features.
1193    
1194            * R/: Changes due to new structure.
1195    
1196            * NAMESPACE: Corrected namespace to reflect new structure.
1197    
1198            * R/termdocmatrix.R: Adapted for new naming scheme.
1199    
1200    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1201    
1202            * R/textdoccol.R: Adapted code for new class structure. Wrote
1203            several transform and filter functions operating on text document
1204            collections (alias text document databases).
1205    
1206            * R/aobjects.R: Adapted class structure with inheritance,
1207            repositories and additional metadata. Loading files on demand is
1208            now possible.
1209    
1210    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1211    
1212            * R/: Some cosmetic cleanups.
1213    
1214            * inst/: Removed vignette on clustering. That and much more is now
1215            described in the JSS paper on text mining. Based upon that
1216            article, an elaborate vignette will be incorporated in the future.
1217    
1218    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1219    
1220            * R/: Updated generic S4 methods to comply with signature changes
1221            in newer versions of R (> 2.3).
1222    
1223    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1224    
1225            * ext/R/importRIS.R: Automatic RIS import is now possible.
1226    
1227    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1228    
1229            * R/textdoccol.R: Added RIS HTML input format.
1230    
1231    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1232    
1233            * R/textdoccol.R: Removed bug that caused invalid text document
1234            collections when handling many input files.
1235    
1236    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1237    
1238            * R/textdoccol.R: Restructured and extended file import
1239            mechanism.
1240    
1241            * inst/doc/clustering.Rnw: Adapted vignette for use with
1242            ReutNews.rda
1243    
1244            * man/ReutNews.Rd: Documentation for ReutNews.rda
1245    
1246            * data/ReutNews.rda: A tiny Reuters21578 example data set.
1247    
1248    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1249    
1250            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
1251            clustering facilities of this package.
1252    
1253    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1254    
1255            * R/aobjects.R: Changed package document structure to avoid class
1256            dependency problems.
1257    
1258    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
1259    
1260            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
1261            data set.
1262    
1263            * Finished documentation and reordered directory structure. Now "R
1264            CMD check textmin" works without errors.
1265    
