SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC pkg/ChangeLog revision 982, Tue Aug 11 07:48:04 2009 UTC
# Line 1  Line 1 
1    2009-08-11  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/reader.R (readMail): Moved to tm.plugin.mail package.
4    
5    2009-07-04  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/reader.R (readNewsgroup): Rename to readMail as newsgroup
8            postings are basically e-mails with some extra headers.
9    
10    2009-07-03  Ingo Feinerer  <feinerer@logic.at>
11    
12            * R/transform.R: Move convertMboxEml, removeCitation,
13            removeMultipart, and removeSignature to the tm.plugin.mail package
14            since they are mainly utility functions (for handling e-mails) and
15            not very framework specific.
16    
17    2009-06-28  Ingo Feinerer  <feinerer@logic.at>
18    
19            * man/: Fix documentation.
20    
21    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
22    
23            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
24            plain text document instead of an XML document for texts of the
25            Reuters-21578 dataset.
26    
27            * R/sparse.R: Removed since the slam package is now available on
28            CRAN.
29    
30            * DESCRIPTION (Depends): Add slam package.
31    
32    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
33    
34            * R/transform.R (stemDoc): Fix character(0) handling.
35    
36    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/doc.R (show): Pretty print.
39    
40    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
43            gracefully.
44    
45    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
46    
47            * R/corpus.R: Make corpus virtual. Implement corpus with standard
48            and permanent storage semantics.
49    
50            * DESCRIPTION: New major release. A *lot* of improvements.
51    
52    2009-05-04   Ingo Feinerer <feinerer@logic.at>
53    
54            * NAMESPACE: Export some simple_triplet_matrix functions.
55    
56    2009-04-28   Ingo Feinerer <feinerer@logic.at>
57    
58            * R/weight.R: Adapt tf-idf to new matrix format.
59    
60    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
61    
62            * R/matrix.R: Create two distinct classes for term-document and
63            document-term matrices.
64    
65    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
66    
67            * R/termdocmatrix.R: No longer use Matrix package. This reduces
68            package start-up time significantly.
69    
70    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
71    
72            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
73    
74    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
75    
76            * R/transform.R (tmReduce): Combine multiple maps into one
77            transformation.
78    
79    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
80    
81            * R/weight.R: Remove weightLogical since it does not return a
82            dgCMatrix.
83    
84            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
85            or TermDocumentMatrix instead.
86    
87    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
88    
89            * inst/doc/extensions.Rnw: Finished vignette.
90    
91    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
92    
93            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
94            DocumentTermMatrix representations.
95    
96    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
97    
98            * R/reader.R (readXML): New reader for arbitrary XML files.
99    
100    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
101    
102            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
103            (XMLSource): New XMLSource class for arbitrary XML files.
104            (Source): New slot Vectorized.
105    
106    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
107    
108            * R/reader.R (readTabular): Experimental reader for tabular data
109            structures which can be customized via user-defined mappings.
110    
111            * R/reader.R: Always use UTC time zone.
112    
113            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
114    
115    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
116    
117            * R/reader.R (readDOC): Options can be passed over to antiword.
118    
119            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
120            pdftotext.
121    
122    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
123    
124            * R/source.R (DirSource): Add pattern and ignore.case arguments
125            which are internally passed over to list.files().
126    
127    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
128    
129            * inst/doc/tm.Rnw: Suppress pointless loading message.
130    
131    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
132    
133            * DESCRIPTION: Speed up package loading (via moving packages not
134            strictly necessary for normal operation to Suggests instead of
135            Depends).
136    
137    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
138    
139            * R/reader.R (readNewsgroup): The date format is now configurable.
140    
141    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
142    
143            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
144    
145    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
146    
147            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
148    
149    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
150    
151            * R/source.R (DataframeSource): New source class for data frames.
152    
153            * R/source.R: Fixed non-standard call evaluation.
154    
155    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
156    
157            * R/source.R (URISource): New source class for a single document.
158    
159    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
160    
161            * R/source.R: Refactoring.
162    
163    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
164    
165            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
166            Rmpi installations more gracefully.
167    
168    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
169    
170            * R/source.R (Source): Add Length slot.
171    
172    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
173    
174            * R/AAA.R: Unify duplicated .onLoad function.
175    
176    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
177    
178            * DESCRIPTION (Suggests): Added Rmpi.
179    
180    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
181    
182            * R/source.R (getElem): Fix 'no visible binding' warning.
183    
184            * man/WeightFunction.Rd: Fix signature.
185    
186    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
187    
188            * R/weight.R: Introduce name abbreviations for weighting functions.
189    
190    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
191    
192            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
193    
194            * R/cluster.R: Provide convenience functions for using a MPI
195            cluster.
196    
197            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
198            available.
199    
200            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
201            available.
202    
203    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
204    
205            * R/textdoccol.R (lapply): Removed debug print out.
206    
207    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
208    
209            * R/reader.R (readRCV1): Improved meta data extraction from
210            Reuters Corpus Volume 1 documents.
211    
212    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * R/transform.R: Ensure that all mappings preserve multiline
215            structures.
216    
217    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
218    
219            * R/filter.R: Every filter has now an attribute indicating whether
220            it sould be applied to document level (doclevel).
221    
222            * R/textdoccol.R (tmFilter): Set searchFullText as new default
223            filter.
224    
225    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/transform.R (replacePatterns): Replaced removeWords by
228            replacePatterns. Suggested by Christian Buchta.
229    
230            * R/textdoccol.R (inspect): Improved formatting.
231    
232    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
233    
234            * inst/CITATION: Updated JSS article information.
235    
236            * R/textdoccol.R (setAs): Added coerce method from list to
237            corpus.
238    
239            * R/meta.R (meta): Improved meta data handling.
240    
241    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
242    
243            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
244            Christian Buchta.
245    
246            * inst/CITATION: Added template to include JSS article reference.
247    
248    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
249    
250            * R/textdoccol.R (tmMap): Introduced lazy mapping.
251    
252            * R/source.R: Added VectorSource.
253    
254    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * man/: Language codes should be in ISO 639-1 format.
257    
258            * R/textdoccol.R (asPlain): Preserve local meta data.
259    
260    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
261    
262            * R/textdoccol.R (writeCorpus): Function for writing a corpus
263            containing plain text documents to disk.
264    
265    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
266    
267            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
268            always set correctly.
269    
270            * R/textdoccol.R: Set load = TRUE as default for load on demand
271            since in most cases this is the wanted behaviour.
272    
273    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
276    
277            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
278    
279    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
280    
281            * R/meta.R (meta): New function for consistent access to meta data
282            of document collections, repositories, and texts.
283    
284    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/: Better support for encodings.
287    
288    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
291            selection when no reader argument is given.
292    
293    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
294    
295            * R/source.R (CSVSource): Now uses read.csv instead of scan
296            internally.
297    
298    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300            * R/reader.R (getReaders): Returns available reader functions.
301    
302            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
303            as default.
304    
305    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
306    
307            * R/stopwords.R (stopwords): Shortened code, removed codetools
308            variable warnings.
309    
310            * man/: Documentation for showMeta, added an example for tmMap.
311    
312            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
313            some minor typos fixed.
314    
315    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
316    
317            * R/aobjects.R (showMeta): Added method for pretty printing a
318            text document's meta data.
319    
320    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
321    
322            * R/textdoccol.R (TextDocCol): Better handling of empty
323            arguments.
324    
325            * NAMESPACE: Exported readDOC.
326    
327            * man/completeStems.Rd: Added an example.
328    
329    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
330    
331            * R/stopwords.R (stopwords): Look up .dat files at every
332            call. Allows users to modify stopword .dat files interactively.
333    
334    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * R/termdocmatrix.R (termFreq): Correct processing of empty
337            documents.
338    
339    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
340    
341            * man/: Updated documentation.
342    
343    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * R/complete.R (completeStems): Completes (heuristically) word
346            stems.
347    
348            * R/termdocmatrix.R (TermDocMatrix2): New modular
349            constructor.
350    
351            * NAMESPACE: Exported termFreq.
352    
353    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
354    
355            * R/reader.R (readDOC): Added MS Word reader (using antiword).
356    
357    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
358    
359            * R/weight.R: Weighting functions for TermDocMatrix.
360    
361    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
362    
363            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
364            functions for accessing dimension, column, and row names.
365    
366            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
367    
368    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
369    
370            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
371    
372    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
373    
374            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
375    
376    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
377    
378            * R/reader.R (readPDF): Removed manual checks for pdftotext and
379            pdfinfo. The system call gives a warning anyway.
380    
381    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
382    
383            * R/textdoccol.R (asPlain): Conversion from
384            StructuredTextDocuments to PlainTextDocuments.
385    
386    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
387    
388            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
389            for accessing term-document matrices.
390    
391            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
392            are installed.
393    
394    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
395    
396            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
397            Christian Buchta.
398    
399    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
400    
401            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
402    
403    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
404    
405            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
406    
407            * R/reader.R (readPDF): Added PDF reader.
408    
409    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
410    
411            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
412    
413            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
414    
415            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
416    
417            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
418    
419    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
420    
421            * R/distmeasure.R (dissimilarity): Replaced dists call from
422            package cba by new dist call from package proxy.
423    
424    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
425    
426            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
427    
428    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
429    
430            * R/termdocmatrix.R: require() uses the quietly option to suppress
431            loading messages.
432    
433    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
434    
435            * R/dictionary.R: Added dictionary support.
436    
437    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
440            documents. This simplifies some functions, e.g., asPlain.
441    
442    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
443    
444            * inst/doc/tm.Rnw: Fixed some typos in vignette.
445    
446    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
447    
448            * R/textdoccol.R (replaceWords): Added method to replace a set of
449            words by a single word. Useful for synonyms.
450    
451    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
452    
453            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
454    
455    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
456    
457            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
458            vectors. Thanks to Ariel Maguyon for his error report.
459            (removeSparseTerms): New function to remove columns from a
460            term-document matrix exceeding a sparse factor.
461    
462    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
463    
464            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
465    
466    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
467    
468            * man/sFilter.Rd: Corrected documentation on statement format (use
469            '==' instead of '=').
470    
471    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
472    
473            * R/aobjects.R (StructuredTextDocument): Inherits from
474            TextDocument.
475    
476    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
477    
478            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
479            on sparse matrices as proposed by Martin Maechler.
480    
481    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
482    
483            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
484            \pkg{filehash} version makes them deprecated.
485    
486    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
487    
488            * R/termdocmatrix.R (textvector): Stemming is now performed before
489            erasing stopwords.
490            (weightMatrix): Adapted to handle sparse matrices.
491            (TermDocMatrix): Sparse matrix is now efficiently built by
492            direct stepwise insertion of row values into it.
493    
494    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
495    
496            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
497            due to ongoing problems. For our purposes the latter is as useful
498            as the replaced package.
499    
500    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
501    
502            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
503    
504            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
505    
506    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
507    
508            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
509            languages with available stopwords.
510    
511    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
512    
513            * inst/doc/tm.Rnw: Minor corrections in the vignette.
514    
515    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
516    
517            * DESCRIPTION: Update to version 0.2, since a lot of new features
518            have been integrated.
519    
520            * inst/stopwords: Updated existing stopwords and added stopwords
521            for various other languages.
522    
523    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
524    
525            * man/: Updated documentation.
526    
527            * Work/testDb.R: Script to test database stuff.
528    
529            * R/: Fixed various database related bugs. Seems to be rather
530            useable now, i.e., consider as alpha status for now.
531    
532    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
533    
534            * R/: Fixed some bugs related to database support.
535    
536    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
537    
538            * man/: Added a lot of examples to the manuals.
539    
540    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
541    
542            * man/: Updated parts of the documentation.
543    
544            * R/textdoccol.R (asPlain): Added conversion from newsgroup
545            documents to plain text documents.
546    
547    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * R/textdoccol.R: Finished experimental database support. Not yet
550            intensively tested.
551    
552            * R/source.R: Now each source has a default reader.
553    
554            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
555            class anymore.
556    
557            * R/plaintextdoc.R: Custom show method for plain text documents.
558    
559            * R/aobjects.R: Added a class for structured text documents.
560    
561            * R/reader.R: Replaced remaining \code{parser} occurrences with
562            \code{reader}.
563    
564            * R/textdoccol.R (summary): Indent tags.
565    
566            * R/textdoccol.R (removePunctuation): Transform method to remove
567            punctuation marks.
568    
569    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
570    
571            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
572            using prescindMeta().
573    
574    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
575    
576            * R/textdoccol.R: Improved database support.
577    
578    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
579    
580            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
581    
582            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
583            language code.
584    
585            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
586            into parserControl argument.
587    
588            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
589    
590    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
591    
592            * Work/tmDataSetup.R: The datasets acq and crude can now be
593            created on the fly.
594    
595            * R/stopwords.R: Introduced a function returning the stopwords for
596            a given language (English, German and French at the moment)
597    
598            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
599            otherwise falls back to Snowball package.
600    
601    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
602    
603            * man/dissimilarity-methods.Rd: Make clear that any method offered
604            by "dists" from package "cba" can be used.
605    
606    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
607    
608            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
609            to Kurt's latex suggestion. Removed points and underscores in
610            variable names for consistent naming.
611    
612            * DESCRIPTION: Update to version 0.1-2.
613    
614            * man/TextRepository.Rd: Fixed bug in documentation.
615    
616    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
617    
618            * DESCRIPTION: Update to version 0.1-1.
619    
620    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
621    
622            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
623            wordStem.
624    
625    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
626    
627            * R/: Changes due to Kurt's review.
628    
629    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
630    
631            * R/: Implemented improvements based upon comments by David
632            Meyer.
633    
634    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
635    
636            * inst/doc/: Rewrote vignette.
637    
638            * man/: Improved documentation.
639    
640    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
641    
642            * man/: Updated documentation.
643    
644            * DESCRIPTION: Changed package name to "tm". Updated version to
645            0.1 for first CRAN release.
646    
647            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
648            list archive example.
649    
650            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
651            archive example.
652    
653            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
654            from (several mails per box) mbox format to (single mail per file)
655            eml format.
656    
657    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
658    
659            * data/crude.rda: Rebuilt.
660    
661            * data/acq.rda: Rebuilt.
662    
663            * R/reader.R: Factored out reader and parser methods from
664            textdoccol.R.
665    
666            * R/source.R: Factored out Source methods from aobjects.R and
667            textdoccol.R.
668            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
669            feeds.
670    
671            * R/textdoccol.R (DirSource): Added support for recursive
672            traversal of directories.
673    
674    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
675    
676            * R/textdoccol.R ([[): Loads the document corpus automatically
677            into memory upon access.
678            (tm_transform, tm_filter): Removed several checks whether the
679            document is already loaded ([[ ensures this now).
680            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
681            mailing list archive.
682    
683    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
684    
685            * R/aobjects.R (TextDocument): Is now a virtual class.
686            (Source): Is now a virtual class.
687    
688    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
689    
690            * R/textdoccol.R (c): Support for an arbitrary number of document
691            collections.
692    
693    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
694    
695            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
696            append_meta and remove_meta.
697    
698            * R/textdoccol.R: Removed modify_metadata method.
699    
700            * R/textrepo.R: Removed modify_metadata method.
701    
702            * R/textdoccol.R (remove_meta): Supports removal of document
703            collection metadata and document (= in data frame) metadata.
704    
705    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
706    
707            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
708    
709            * data/crude.rda: Rebuilt.
710    
711            * data/acq.rda: Rebuilt.
712    
713            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
714    
715            * R/textdoccol.R ([): Bug fix for subsetting a document
716            collection's data frame.
717    
718    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
719    
720            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
721            to s_filter.
722    
723            * R/textdoccol.R: Local text documents' metadata can now be copied
724            to a document collection's data frame with prescind_meta.
725    
726    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
727    
728            * R/: Text documents' slot metadata is now accessible in s_filter.
729    
730            * R/: Rewrote s_filter function (has still some restrictions).
731    
732    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
733    
734            * R/: Various fixes in handling metadata.
735    
736            * R/: Added update mechanism for text document collections.
737    
738    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
739    
740            * R/: Merging of document collections now creates a binary tree
741            for reconstructing merged document collections.
742    
743            * R/: Redesign of metadata for document collections.
744    
745    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
746    
747            * R/: Messages now use \code{ngettext}.
748    
749    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
750    
751            * R/: Added functions for modifying and removing metadata.
752    
753    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
754    
755            * man/: Updated some documentation.
756    
757            * R/: Corrected some connection issues.
758    
759            * inst/doc: Worked on the vignette.
760    
761    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
762    
763            * inst/: Added texts and started vignette.
764    
765            * R/: Final changes based upon David's comments.
766    
767    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
768    
769            * NAMESPACE: Corrected exports (generic methods need exportMethods
770            directives!).
771    
772    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
773    
774            * R/: Modified the TextDocCol constructur and various parsers. It
775            is now modular and supports various file formats via plugins (see
776            the new "Source" class).
777    
778    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
779    
780            * man/: Revised documentation after previous code changes.
781    
782    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
783    
784            * R/: Remaining changes as discussed with David.
785    
786    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
787    
788            * R/: Some changes as suggested by David. The rest will follow
789            within the next days.
790    
791    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
792    
793            * man/: Finished documentation.
794    
795    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
796    
797            * man/: Wrote some documentation.
798    
799    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
800    
801            * R/: Further syntactic sugar in form of additional assignment and
802            accessor methods.
803    
804    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
805    
806            * R/: Syntactic sugar in form of "length", "show" and "summary"
807            operators.
808    
809    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
810    
811            * R/: Diverse updates. Mainly on default operators ("[" or "c")
812            and dissimilarities.
813    
814    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
815    
816            * R/: Added similarity functions.
817    
818            * data/: Added english stopwords.
819    
820    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
821    
822            * data/: Examples compiled for new features
823    
824            * R/: Changes due to new structure.
825    
826            * NAMESPACE: Corrected namespace to reflect new structure.
827    
828            * R/termdocmatrix.R: Adapted for new naming scheme.
829    
830    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
831    
832            * R/textdoccol.R: Adapted code for new class structure. Wrote
833            several transform and filter functions operating on text document
834            collections (alias text document databases).
835    
836            * R/aobjects.R: Adapted class structure with inheritance,
837            repositories and additional meta data. Loading files on demand is
838            now possible.
839    
840    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
841    
842            * R/: Some cosmetic cleanups.
843    
844            * inst/: Removed vignette on clustering. That and much more is now
845            described in the JSS paper on text mining. Based upon that
846            article an elaborated vignette will be incorporated in the future.
847    
848    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
849    
850            * R/: Updated generic S4 methods to comply with signature changes
851            in newer versions of R (> 2.3)
852    
853    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
854    
855            * ext/R/importRIS.R: Automatic RIS import is now possible.
856    
857    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
858    
859            * R/textdoccol.R: Added RIS HTML input format.
860    
861    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
862    
863            * R/textdoccol.R: Removed bug that caused invalid text document
864            collections when handling many input files.
865    
866  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
867    
868          * R/textdoccol.R: Restructured and extended file import          * R/textdoccol.R: Restructured and extended file import

Legend:
Removed from v.37  
changed lines
  Added in v.982

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge