[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC
pkg/ChangeLog revision 1010, Fri Oct 9 12:48:37 2009 UTC
# Line 1  Line 1 
1    2009-10-09  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/source.R (GmaneSource, ReutersSource): Use xmlChildren(x)
4            instead of x$children.
5    
6    2009-09-15  Ingo Feinerer  <feinerer@logic.at>
7    
8            * R/preprocess.R (preprocessReut21578XML): Fix generated file names.
9    
10    2009-09-06  Ingo Feinerer  <feinerer@logic.at>
11    
12            * R/: Use S3 instead of S4 class system.
13    
14    2009-08-11  Ingo Feinerer  <feinerer@logic.at>
15    
16            * R/reader.R (readMail): Moved to tm.plugin.mail package.
17    
18    2009-07-04  Ingo Feinerer  <feinerer@logic.at>
19    
20            * R/reader.R (readNewsgroup): Rename to readMail as newsgroup
21            postings are basically e-mails with some extra headers.
22    
23    2009-07-03  Ingo Feinerer  <feinerer@logic.at>
24    
25            * R/transform.R: Move convertMboxEml, removeCitation,
26            removeMultipart, and removeSignature to the tm.plugin.mail package
27            since they are mainly utility functions (for handling e-mails) and
28            not very framework specific.
29    
30    2009-06-28  Ingo Feinerer  <feinerer@logic.at>
31    
32            * man/: Fix documentation.
33    
34    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
35    
36            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
37            plain text document instead of an XML document for texts of the
38            Reuters-21578 dataset.
39    
40            * R/sparse.R: Removed since the slam package is now available on
41            CRAN.
42    
43            * DESCRIPTION (Depends): Add slam package.
44    
45    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
46    
47            * R/transform.R (stemDoc): Fix character(0) handling.
48    
49    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
50    
51            * R/doc.R (show): Pretty print.
52    
53    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
54    
55            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
56            gracefully.
57    
58    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
59    
60            * R/corpus.R: Make corpus virtual. Implement corpus with standard
61            and permanent storage semantics.
62    
63            * DESCRIPTION: New major release. A *lot* of improvements.
64    
65    2009-05-04  Ingo Feinerer  <feinerer@logic.at>
66    
67            * NAMESPACE: Export some simple_triplet_matrix functions.
68    
69    2009-04-28  Ingo Feinerer  <feinerer@logic.at>
70    
71            * R/weight.R: Adapt tf-idf to new matrix format.
72    
73    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/matrix.R: Create two distinct classes for term-document and
76            document-term matrices.
77    
78    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
79    
80            * R/termdocmatrix.R: No longer use Matrix package. This reduces
81            package start-up time significantly.
82    
83    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
84    
85            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
86    
87    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
88    
89            * R/transform.R (tmReduce): Combine multiple maps into one
90            transformation.
91    
92    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
93    
94            * R/weight.R: Remove weightLogical since it does not return a
95            dgCMatrix.
96    
97            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
98            or TermDocumentMatrix instead.
99    
100    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
101    
102            * inst/doc/extensions.Rnw: Finished vignette.
103    
104    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
105    
106            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
107            DocumentTermMatrix representations.
108    
109    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
110    
111            * R/reader.R (readXML): New reader for arbitrary XML files.
112    
113    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
114    
115            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
116            (XMLSource): New XMLSource class for arbitrary XML files.
117            (Source): New slot Vectorized.
118    
119    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
120    
121            * R/reader.R (readTabular): Experimental reader for tabular data
122            structures which can be customized via user-defined mappings.
123    
124            * R/reader.R: Always use UTC time zone.
125    
126            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
127    
128    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
129    
130            * R/reader.R (readDOC): Options can be passed over to antiword.
131    
132            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
133            pdftotext.
134    
135    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
136    
137            * R/source.R (DirSource): Add pattern and ignore.case arguments
138            which are internally passed over to list.files().
139    
140    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
141    
142            * inst/doc/tm.Rnw: Suppress pointless loading message.
143    
144    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
145    
146            * DESCRIPTION: Speed up package loading (via moving packages not
147            strictly necessary for normal operation to Suggests instead of
148            Depends).
149    
150    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
151    
152            * R/reader.R (readNewsgroup): The date format is now configurable.
153    
154    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
155    
156            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
157    
158    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
159    
160            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
161    
162    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
163    
164            * R/source.R (DataframeSource): New source class for data frames.
165    
166            * R/source.R: Fixed non-standard call evaluation.
167    
168    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
169    
170            * R/source.R (URISource): New source class for a single document.
171    
172    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
173    
174            * R/source.R: Refactoring.
175    
176    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
177    
178            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
179            Rmpi installations more gracefully.
180    
181    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
182    
183            * R/source.R (Source): Add Length slot.
184    
185    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
186    
187            * R/AAA.R: Unify duplicated .onLoad function.
188    
189    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
190    
191            * DESCRIPTION (Suggests): Added Rmpi.
192    
193    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
194    
195            * R/source.R (getElem): Fix 'no visible binding' warning.
196    
197            * man/WeightFunction.Rd: Fix signature.
198    
199    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
200    
201            * R/weight.R: Introduce name abbreviations for weighting functions.
202    
203    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
204    
205            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
206    
207            * R/cluster.R: Provide convenience functions for using a MPI
208            cluster.
209    
210            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
211            available.
212    
213            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
214            available.
215    
216    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
217    
218            * R/textdoccol.R (lapply): Removed debug print out.
219    
220    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * R/reader.R (readRCV1): Improved meta data extraction from
223            Reuters Corpus Volume 1 documents.
224    
225    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/transform.R: Ensure that all mappings preserve multiline
228            structures.
229    
230    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
231    
232            * R/filter.R: Every filter now has an attribute indicating whether
233            it should be applied at document level (doclevel).
234    
235            * R/textdoccol.R (tmFilter): Set searchFullText as new default
236            filter.
237    
238    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
239    
240            * R/transform.R (replacePatterns): Replaced removeWords by
241            replacePatterns. Suggested by Christian Buchta.
242    
243            * R/textdoccol.R (inspect): Improved formatting.
244    
245    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
246    
247            * inst/CITATION: Updated JSS article information.
248    
249            * R/textdoccol.R (setAs): Added coerce method from list to
250            corpus.
251    
252            * R/meta.R (meta): Improved meta data handling.
253    
254    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
257            Christian Buchta.
258    
259            * inst/CITATION: Added template to include JSS article reference.
260    
261    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * R/textdoccol.R (tmMap): Introduced lazy mapping.
264    
265            * R/source.R: Added VectorSource.
266    
267    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
268    
269            * man/: Language codes should be in ISO 639-1 format.
270    
271            * R/textdoccol.R (asPlain): Preserve local meta data.
272    
273    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * R/textdoccol.R (writeCorpus): Function for writing a corpus
276            containing plain text documents to disk.
277    
278    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
279    
280            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
281            always set correctly.
282    
283            * R/textdoccol.R: Set load = TRUE as default for load on demand
284            since in most cases this is the wanted behaviour.
285    
286    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
287    
288            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
289    
290            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
291    
292    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
293    
294            * R/meta.R (meta): New function for consistent access to meta data
295            of document collections, repositories, and texts.
296    
297    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
298    
299            * R/: Better support for encodings.
300    
301    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
302    
303            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
304            selection when no reader argument is given.
305    
306    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
307    
308            * R/source.R (CSVSource): Now uses read.csv instead of scan
309            internally.
310    
311    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
312    
313            * R/reader.R (getReaders): Returns available reader functions.
314    
315            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
316            as default.
317    
318    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
319    
320            * R/stopwords.R (stopwords): Shortened code, removed codetools
321            variable warnings.
322    
323            * man/: Documentation for showMeta, added an example for tmMap.
324    
325            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
326            some minor typos fixed.
327    
328    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
329    
330            * R/aobjects.R (showMeta): Added method for pretty printing a
331            text document's meta data.
332    
333    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335            * R/textdoccol.R (TextDocCol): Better handling of empty
336            arguments.
337    
338            * NAMESPACE: Exported readDOC.
339    
340            * man/completeStems.Rd: Added an example.
341    
342    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
343    
344            * R/stopwords.R (stopwords): Look up .dat files at every
345            call. Allows users to modify stopword .dat files interactively.
346    
347    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/termdocmatrix.R (termFreq): Correct processing of empty
350            documents.
351    
352    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * man/: Updated documentation.
355    
356    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * R/complete.R (completeStems): Completes (heuristically) word
359            stems.
360    
361            * R/termdocmatrix.R (TermDocMatrix2): New modular
362            constructor.
363    
364            * NAMESPACE: Exported termFreq.
365    
366    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
367    
368            * R/reader.R (readDOC): Added MS Word reader (using antiword).
369    
370    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
371    
372            * R/weight.R: Weighting functions for TermDocMatrix.
373    
374    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
375    
376            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
377            functions for accessing dimension, column, and row names.
378    
379            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
380    
381    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
382    
383            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
384    
385    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
386    
387            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
388    
389    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
390    
391            * R/reader.R (readPDF): Removed manual checks for pdftotext and
392            pdfinfo. The system call gives a warning anyway.
393    
394    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
395    
396            * R/textdoccol.R (asPlain): Conversion from
397            StructuredTextDocuments to PlainTextDocuments.
398    
399    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
400    
401            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
402            for accessing term-document matrices.
403    
404            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
405            are installed.
406    
407    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
410            Christian Buchta.
411    
412    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
415    
416    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
417    
418            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
419    
420            * R/reader.R (readPDF): Added PDF reader.
421    
422    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
425    
426            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
427    
428            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
429    
430            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
431    
432    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
433    
434            * R/distmeasure.R (dissimilarity): Replaced dists call from
435            package cba by new dist call from package proxy.
436    
437    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
440    
441    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
442    
443            * R/termdocmatrix.R: require() uses the quietly option to suppress
444            loading messages.
445    
446    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
447    
448            * R/dictionary.R: Added dictionary support.
449    
450    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
451    
452            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
453            documents. This simplifies some functions, e.g., asPlain.
454    
455    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
456    
457            * inst/doc/tm.Rnw: Fixed some typos in vignette.
458    
459    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
460    
461            * R/textdoccol.R (replaceWords): Added method to replace a set of
462            words by a single word. Useful for synonyms.
463    
464    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
465    
466            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
467    
468    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
469    
470            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
471            vectors. Thanks to Ariel Maguyon for his error report.
472            (removeSparseTerms): New function to remove columns from a
473            term-document matrix exceeding a sparse factor.
474    
475    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
476    
477            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
478    
479    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
480    
481            * man/sFilter.Rd: Corrected documentation on statement format (use
482            '==' instead of '=').
483    
484    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
485    
486            * R/aobjects.R (StructuredTextDocument): Inherits from
487            TextDocument.
488    
489    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
490    
491            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
492            on sparse matrices as proposed by Martin Maechler.
493    
494    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
495    
496            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
497            \pkg{filehash} version deprecates them.
498    
499    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
500    
501            * R/termdocmatrix.R (textvector): Stemming is now performed before
502            erasing stopwords.
503            (weightMatrix): Adapted to handle sparse matrices.
504            (TermDocMatrix): Sparse matrix is now efficiently built by
505            direct stepwise insertion of row values into it.
506    
507    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
508    
509            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
510            due to ongoing problems. For our purposes the latter is as useful
511            as the replaced package.
512    
513    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
514    
515            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
516    
517            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
518    
519    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
520    
521            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
522            languages with available stopwords.
523    
524    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
525    
526            * inst/doc/tm.Rnw: Minor corrections in the vignette.
527    
528    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
529    
530            * DESCRIPTION: Update to version 0.2, since a lot of new features
531            have been integrated.
532    
533            * inst/stopwords: Updated existing stopwords and added stopwords
534            for various other languages.
535    
536    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
537    
538            * man/: Updated documentation.
539    
540            * Work/testDb.R: Script to test database stuff.
541    
542            * R/: Fixed various database related bugs. Seems to be rather
543            useable now, i.e., consider as alpha status for now.
544    
545    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
546    
547            * R/: Fixed some bugs related to database support.
548    
549    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
550    
551            * man/: Added a lot of examples to the manuals.
552    
553    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
554    
555            * man/: Updated parts of the documentation.
556    
557            * R/textdoccol.R (asPlain): Added conversion from newsgroup
558            documents to plain text documents.
559    
560    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
561    
562            * R/textdoccol.R: Finished experimental database support. Not yet
563            intensively tested.
564    
565            * R/source.R: Now each source has a default reader.
566    
567            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
568            class anymore.
569    
570            * R/plaintextdoc.R: Custom show method for plain text documents.
571    
572            * R/aobjects.R: Added a class for structured text documents.
573    
574            * R/reader.R: Replaced remaining \code{parser} occurrences with
575            \code{reader}.
576    
577            * R/textdoccol.R (summary): Indent tags.
578    
579            * R/textdoccol.R (removePunctuation): Transform method to remove
580            punctuation marks.
581    
582    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
583    
584            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
585            using prescindMeta().
586    
587    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
588    
589            * R/textdoccol.R: Improved database support.
590    
591    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
592    
593            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
594    
595            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
596            language code.
597    
598            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
599            into parserControl argument.
600    
601            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
602    
603    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
604    
605            * Work/tmDataSetup.R: The datasets acq and crude can now be
606            created on the fly.
607    
608            * R/stopwords.R: Introduced a function returning the stopwords for
609            a given language (English, German and French at the moment).
610    
611            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
612            otherwise falls back to Snowball package.
613    
614    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
615    
616            * man/dissimilarity-methods.Rd: Make clear that any method offered
617            by "dists" from package "cba" can be used.
618    
619    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
620    
621            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
622            to Kurt's LaTeX suggestion. Removed points and underscores in
623            variable names for consistent naming.
624    
625            * DESCRIPTION: Update to version 0.1-2.
626    
627            * man/TextRepository.Rd: Fixed bug in documentation.
628    
629    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
630    
631            * DESCRIPTION: Update to version 0.1-1.
632    
633    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
634    
635            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
636            wordStem.
637    
638    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
639    
640            * R/: Changes due to Kurt's review.
641    
642    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
643    
644            * R/: Implemented improvements based upon comments by David
645            Meyer.
646    
647    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
648    
649            * inst/doc/: Rewrote vignette.
650    
651            * man/: Improved documentation.
652    
653    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
654    
655            * man/: Updated documentation.
656    
657            * DESCRIPTION: Changed package name to "tm". Updated version to
658            0.1 for first CRAN release.
659    
660            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
661            list archive example.
662    
663            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
664            archive example.
665    
666            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
667            from (several mails per box) mbox format to (single mail per file)
668            eml format.
669    
670    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
671    
672            * data/crude.rda: Rebuilt.
673    
674            * data/acq.rda: Rebuilt.
675    
676            * R/reader.R: Factored out reader and parser methods from
677            textdoccol.R.
678    
679            * R/source.R: Factored out Source methods from aobjects.R and
680            textdoccol.R.
681            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
682            feeds.
683    
684            * R/textdoccol.R (DirSource): Added support for recursive
685            traversal of directories.
686    
687    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
688    
689            * R/textdoccol.R ([[): Loads the document corpus automatically
690            into memory upon access.
691            (tm_transform, tm_filter): Removed several checks whether the
692            document is already loaded ([[ ensures this now).
693            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
694            mailing list archive.
695    
696    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
697    
698            * R/aobjects.R (TextDocument): Is now a virtual class.
699            (Source): Is now a virtual class.
700    
701    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
702    
703            * R/textdoccol.R (c): Support for an arbitrary number of document
704            collections.
705    
706    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
707    
708            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
709            append_meta and remove_meta.
710    
711            * R/textdoccol.R: Removed modify_metadata method.
712    
713            * R/textrepo.R: Removed modify_metadata method.
714    
715            * R/textdoccol.R (remove_meta): Supports removal of document
716            collection metadata and document (= in data frame) metadata.
717    
718    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
719    
720            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
721    
722            * data/crude.rda: Rebuilt.
723    
724            * data/acq.rda: Rebuilt.
725    
726            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
727    
728            * R/textdoccol.R ([): Bug fix for subsetting a document
729            collection's data frame.
730    
731    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
732    
733            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
734            to s_filter.
735    
736            * R/textdoccol.R: Local text documents' metadata can now be copied
737            to a document collection's data frame with prescind_meta.
738    
739    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
740    
741            * R/: Text documents' slot metadata is now accessible in s_filter.
742    
743            * R/: Rewrote s_filter function (has still some restrictions).
744    
745    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
746    
747            * R/: Various fixes in handling metadata.
748    
749            * R/: Added update mechanism for text document collections.
750    
751    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * R/: Merging of document collections now creates a binary tree
754            for reconstructing merged document collections.
755    
756            * R/: Redesign of metadata for document collections.
757    
758    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
759    
760            * R/: Messages now use \code{ngettext}.
761    
762    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
763    
764            * R/: Added functions for modifying and removing metadata.
765    
766    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
767    
768            * man/: Updated some documentation.
769    
770            * R/: Corrected some connection issues.
771    
772            * inst/doc: Worked on the vignette.
773    
774    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
775    
776            * inst/: Added texts and started vignette.
777    
778            * R/: Final changes based upon David's comments.
779    
780    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
781    
782            * NAMESPACE: Corrected exports (generic methods need exportMethods
783            directives!).
784    
785    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
786    
787            * R/: Modified the TextDocCol constructor and various parsers. It
788            is now modular and supports various file formats via plugins (see
789            the new "Source" class).
790    
791    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
792    
793            * man/: Revised documentation after previous code changes.
794    
795    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
796    
797            * R/: Remaining changes as discussed with David.
798    
799    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
800    
801            * R/: Some changes as suggested by David. The rest will follow
802            within the next few days.
803    
804    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
805    
806            * man/: Finished documentation.
807    
808    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
809    
810            * man/: Wrote some documentation.
811    
812    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
813    
814            * R/: Further syntactic sugar in the form of additional assignment and
815            accessor methods.
816    
817    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
818    
819            * R/: Syntactic sugar in the form of "length", "show" and "summary"
820            operators.
821    
822    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
823    
824            * R/: Diverse updates. Mainly on default operators ("[" or "c")
825            and dissimilarities.
826    
827    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
828    
829            * R/: Added similarity functions.
830    
831            * data/: Added English stopwords.
832    
833    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
834    
835            * data/: Examples compiled for new features.
836    
837            * R/: Changes due to new structure.
838    
839            * NAMESPACE: Corrected namespace to reflect new structure.
840    
841            * R/termdocmatrix.R: Adapted for new naming scheme.
842    
843    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
844    
845            * R/textdoccol.R: Adapted code for new class structure. Wrote
846            several transform and filter functions operating on text document
847            collections (alias text document databases).
848    
849            * R/aobjects.R: Adapted class structure with inheritance,
850            repositories and additional meta data. Loading files on demand is
851            now possible.
852    
853    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
854    
855            * R/: Some cosmetic cleanups.
856    
857            * inst/: Removed vignette on clustering. That and much more is now
858            described in the JSS paper on text mining. Based upon that
859            article an elaborated vignette will be incorporated in the future.
860    
861    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
862    
863            * R/: Updated generic S4 methods to comply with signature changes
864            in newer versions of R (> 2.3).
865    
866    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
867    
868            * ext/R/importRIS.R: Automatic RIS import is now possible.
869    
870    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
871    
872            * R/textdoccol.R: Added RIS HTML input format.
873    
874    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
875    
876            * R/textdoccol.R: Removed bug that caused invalid text document
877            collections when handling many input files.
878    
879    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
880    
881            * R/textdoccol.R: Restructured and extended file import
