Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 28, Tue Dec 6 13:46:33 2005 UTC
pkg/ChangeLog revision 960, Fri Jun 26 17:43:45 2009 UTC
1    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
4            plain text document instead of an XML document for texts of the
5            Reuters-21578 dataset.
6    
7            * R/sparse.R: Removed since the slam package is now available on
8            CRAN.
9    
10            * DESCRIPTION (Depends): Add slam package.
11    
12    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
13    
14            * R/transform.R (stemDoc): Fix character(0) handling.
15    
16    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
17    
18            * R/doc.R (show): Pretty print.
19    
20    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
21    
22            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
23            gracefully.
24    
25    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
26    
27            * R/corpus.R: Make corpus virtual. Implement corpus with standard
28            and permanent storage semantics.
29    
30            * DESCRIPTION: New major release. A *lot* of improvements.
31    
32    2009-05-04  Ingo Feinerer  <feinerer@logic.at>
33    
34            * NAMESPACE: Export some simple_triplet_matrix functions.
35    
36    2009-04-28  Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/weight.R: Adapt tf-idf to new matrix format.
39    
40    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/matrix.R: Create two distinct classes for term-document and
43            document-term matrices.
44    
45    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
46    
47            * R/termdocmatrix.R: No longer use Matrix package. This reduces
48            package start-up time significantly.
49    
50    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
51    
52            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
53    
54    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
55    
56            * R/transform.R (tmReduce): Combine multiple maps into one
57            transformation.
58    
59    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
60    
61            * R/weight.R: Remove weightLogical since it does not return a
62            dgCMatrix.
63    
64            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
65            or TermDocumentMatrix instead.
66    
67    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
68    
69            * inst/doc/extensions.Rnw: Finished vignette.
70    
71    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
72    
73            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
74            DocumentTermMatrix representations.
75    
76    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
77    
78            * R/reader.R (readXML): New reader for arbitrary XML files.
79    
80    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
81    
82            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
83            (XMLSource): New XMLSource class for arbitrary XML files.
84            (Source): New slot Vectorized.
85    
86    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
87    
88            * R/reader.R (readTabular): Experimental reader for tabular data
89            structures which can be customized via user-defined mappings.
90    
91            * R/reader.R: Always use UTC time zone.
92    
93            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
94    
95    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
96    
97            * R/reader.R (readDOC): Options can be passed over to antiword.
98    
99            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
100            pdftotext.
101    
102    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
103    
104            * R/source.R (DirSource): Add pattern and ignore.case arguments
105            which are internally passed over to list.files().
106    
107    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
108    
109            * inst/doc/tm.Rnw: Suppress pointless loading message.
110    
111    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
112    
113            * DESCRIPTION: Speed up package loading (via moving packages not
114            strictly necessary for normal operation to Suggests instead of
115            Depends).
116    
117    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
118    
119            * R/reader.R (readNewsgroup): The date format is now configurable.
120    
121    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
122    
123            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
124    
125    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
126    
127            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
128    
129    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
130    
131            * R/source.R (DataframeSource): New source class for data frames.
132    
133            * R/source.R: Fixed non-standard call evaluation.
134    
135    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
136    
137            * R/source.R (URISource): New source class for a single document.
138    
139    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
140    
141            * R/source.R: Refactoring.
142    
143    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
144    
145            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
146            Rmpi installations more gracefully.
147    
148    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
149    
150            * R/source.R (Source): Add Length slot.
151    
152    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
153    
154            * R/AAA.R: Unify duplicated .onLoad function.
155    
156    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
157    
158            * DESCRIPTION (Suggests): Added Rmpi.
159    
160    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
161    
162            * R/source.R (getElem): Fix 'no visible binding' warning.
163    
164            * man/WeightFunction.Rd: Fix signature.
165    
166    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
167    
168            * R/weight.R: Introduce name abbreviations for weighting functions.
169    
170    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
171    
172            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
173    
174            * R/cluster.R: Provide convenience functions for using an MPI
175            cluster.
176    
177            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
178            available.
179    
180            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
181            available.
182    
183    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
184    
185            * R/textdoccol.R (lapply): Removed debug print out.
186    
187    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
188    
189            * R/reader.R (readRCV1): Improved meta data extraction from
190            Reuters Corpus Volume 1 documents.
191    
192    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
193    
194            * R/transform.R: Ensure that all mappings preserve multiline
195            structures.
196    
197    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
198    
199            * R/filter.R: Every filter now has an attribute indicating whether
200            it should be applied at document level (doclevel).
201    
202            * R/textdoccol.R (tmFilter): Set searchFullText as new default
203            filter.
204    
205    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
206    
207            * R/transform.R (replacePatterns): Replaced removeWords by
208            replacePatterns. Suggested by Christian Buchta.
209    
210            * R/textdoccol.R (inspect): Improved formatting.
211    
212    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * inst/CITATION: Updated JSS article information.
215    
216            * R/textdoccol.R (setAs): Added coerce method from list to
217            corpus.
218    
219            * R/meta.R (meta): Improved meta data handling.
220    
221    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
222    
223            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
224            Christian Buchta.
225    
226            * inst/CITATION: Added template to include JSS article reference.
227    
228    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
229    
230            * R/textdoccol.R (tmMap): Introduced lazy mapping.
231    
232            * R/source.R: Added VectorSource.
233    
234    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
235    
236            * man/: Language codes should be in ISO 639-1 format.
237    
238            * R/textdoccol.R (asPlain): Preserve local meta data.
239    
240    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
241    
242            * R/textdoccol.R (writeCorpus): Function for writing a corpus
243            containing plain text documents to disk.
244    
245    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
246    
247            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
248            always set correctly.
249    
250            * R/textdoccol.R: Set load = TRUE as default for load on demand
251            since in most cases this is the wanted behaviour.
252    
253    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
256    
257            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
258    
259    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
260    
261            * R/meta.R (meta): New function for consistent access to meta data
262            of document collections, repositories, and texts.
263    
264    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
265    
266            * R/: Better support for encodings.
267    
268    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
269    
270            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
271            selection when no reader argument is given.
272    
273    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * R/source.R (CSVSource): Now uses read.csv instead of scan
276            internally.
277    
278    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
279    
280            * R/reader.R (getReaders): Returns available reader functions.
281    
282            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
283            as default.
284    
285    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
286    
287            * R/stopwords.R (stopwords): Shortened code, removed codetools
288            variable warnings.
289    
290            * man/: Documentation for showMeta, added an example for tmMap.
291    
292            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
293            some minor typos fixed.
294    
295    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * R/aobjects.R (showMeta): Added method for pretty printing a
298            text document's meta data.
299    
300    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
301    
302            * R/textdoccol.R (TextDocCol): Better handling of empty
303            arguments.
304    
305            * NAMESPACE: Exported readDOC.
306    
307            * man/completeStems.Rd: Added an example.
308    
309    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * R/stopwords.R (stopwords): Look up .dat files at every
312            call. Allows users to modify stopword .dat files interactively.
313    
314    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
315    
316            * R/termdocmatrix.R (termFreq): Correct processing of empty
317            documents.
318    
319    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
320    
321            * man/: Updated documentation.
322    
323    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
324    
325            * R/complete.R (completeStems): Completes (heuristically) word
326            stems.
327    
328            * R/termdocmatrix.R (TermDocMatrix2): New modular
329            constructor.
330    
331            * NAMESPACE: Exported termFreq.
332    
333    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335            * R/reader.R (readDOC): Added MS Word reader (using antiword).
336    
337    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * R/weight.R: Weighting functions for TermDocMatrix.
340    
341    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
344            functions for accessing dimension, column, and row names.
345    
346            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
347    
348    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
349    
350            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
351    
352    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
355    
356    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * R/reader.R (readPDF): Removed manual checks for pdftotext and
359            pdfinfo. The system call gives a warning anyway.
360    
361    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
362    
363            * R/textdoccol.R (asPlain): Conversion from
364            StructuredTextDocuments to PlainTextDocuments.
365    
366    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
367    
368            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
369            for accessing term-document matrices.
370    
371            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
372            are installed.
373    
374    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
375    
376            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
377            Christian Buchta.
378    
379    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
380    
381            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
382    
383    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
384    
385            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
386    
387            * R/reader.R (readPDF): Added PDF reader.
388    
389    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
390    
391            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
392    
393            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
394    
395            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
396    
397            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
398    
399    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
400    
401            * R/distmeasure.R (dissimilarity): Replaced dists call from
402            package cba by new dist call from package proxy.
403    
404    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
405    
406            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
407    
408    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
409    
410            * R/termdocmatrix.R: require() uses the quietly option to suppress
411            loading messages.
412    
413    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
414    
415            * R/dictionary.R: Added dictionary support.
416    
417    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
418    
419            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
420            documents. This simplifies some functions, e.g., asPlain.
421    
422    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * inst/doc/tm.Rnw: Fixed some typos in vignette.
425    
426    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
427    
428            * R/textdoccol.R (replaceWords): Added method to replace a set of
429            words by a single word. Useful for synonyms.
430    
431    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
432    
433            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
434    
435    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
436    
437            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
438            vectors. Thanks to Ariel Maguyon for his error report.
439            (removeSparseTerms): New function to remove columns from a
440            term-document matrix exceeding a sparse factor.
441    
442    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
443    
444            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
445    
446    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
447    
448            * man/sFilter.Rd: Corrected documentation on statement format (use
449            '==' instead of '=').
450    
451    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
452    
453            * R/aobjects.R (StructuredTextDocument): Inherits from
454            TextDocument.
455    
456    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
457    
458            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
459            on sparse matrices as proposed by Martin Maechler.
460    
461    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
462    
463            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
464            \pkg{filehash} version deprecates them.
465    
466    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
467    
468            * R/termdocmatrix.R (textvector): Stemming is now performed before
469            erasing stopwords.
470            (weightMatrix): Adapted to handle sparse matrices.
471            (TermDocMatrix): Sparse matrix is now efficiently built by
472            direct stepwise insertion of row values into it.
473    
474    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
475    
476            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
477            due to ongoing problems. For our purposes the latter is as useful
478            as the replaced package.
479    
480    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
481    
482            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
483    
484            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
485    
486    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
487    
488            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
489            languages with available stopwords.
490    
491    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
492    
493            * inst/doc/tm.Rnw: Minor corrections in the vignette.
494    
495    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
496    
497            * DESCRIPTION: Update to version 0.2, since a lot of new features
498            have been integrated.
499    
500            * inst/stopwords: Updated existing stopwords and added stopwords
501            for various other languages.
502    
503    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
504    
505            * man/: Updated documentation.
506    
507            * Work/testDb.R: Script to test database stuff.
508    
509            * R/: Fixed various database-related bugs. Seems to be rather
510            usable now, i.e., consider it alpha status for now.
511    
512    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
513    
514            * R/: Fixed some bugs related to database support.
515    
516    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
517    
518            * man/: Added a lot of examples to the manuals.
519    
520    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
521    
522            * man/: Updated parts of the documentation.
523    
524            * R/textdoccol.R (asPlain): Added conversion from newsgroup
525            documents to plain text documents.
526    
527    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
528    
529            * R/textdoccol.R: Finished experimental database support. Not yet
530            intensively tested.
531    
532            * R/source.R: Now each source has a default reader.
533    
534            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
535            class anymore.
536    
537            * R/plaintextdoc.R: Custom show method for plain text documents.
538    
539            * R/aobjects.R: Added a class for structured text documents.
540    
541            * R/reader.R: Replaced remaining \code{parser} occurrences with
542            \code{reader}.
543    
544            * R/textdoccol.R (summary): Indent tags.
545    
546            * R/textdoccol.R (removePunctuation): Transform method to remove
547            punctuation marks.
548    
549    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
550    
551            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
552            using prescindMeta().
553    
554    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
555    
556            * R/textdoccol.R: Improved database support.
557    
558    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
559    
560            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
561    
562            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
563            language code.
564    
565            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
566            into parserControl argument.
567    
568            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
569    
570    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
571    
572            * Work/tmDataSetup.R: The datasets acq and crude can now be
573            created on the fly.
574    
575            * R/stopwords.R: Introduced a function returning the stopwords for
576            a given language (English, German and French at the moment).
577    
578            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
579            otherwise falls back to the Snowball package.
580    
581    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
582    
583            * man/dissimilarity-methods.Rd: Make clear that any method offered
584            by "dists" from package "cba" can be used.
585    
586    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
587    
588            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
589            to Kurt's LaTeX suggestion. Removed points and underscores in
590            variable names for consistent naming.
591    
592            * DESCRIPTION: Update to version 0.1-2.
593    
594            * man/TextRepository.Rd: Fixed bug in documentation.
595    
596    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
597    
598            * DESCRIPTION: Update to version 0.1-1.
599    
600    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
601    
602            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
603            wordStem.
604    
605    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
606    
607            * R/: Changes due to Kurt's review.
608    
609    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
610    
611            * R/: Implemented improvements based upon comments by David
612            Meyer.
613    
614    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
615    
616            * inst/doc/: Rewrote vignette.
617    
618            * man/: Improved documentation.
619    
620    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
621    
622            * man/: Updated documentation.
623    
624            * DESCRIPTION: Changed package name to "tm". Updated version to
625            0.1 for first CRAN release.
626    
627            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
628            list archive example.
629    
630            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
631            archive example.
632    
633            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
634            from (several mails per box) mbox format to (single mail per file)
635            eml format.
636    
637    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
638    
639            * data/crude.rda: Rebuilt.
640    
641            * data/acq.rda: Rebuilt.
642    
643            * R/reader.R: Factored out reader and parser methods from
644            textdoccol.R.
645    
646            * R/source.R: Factored out Source methods from aobjects.R and
647            textdoccol.R.
648            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
649            feeds.
650    
651            * R/textdoccol.R (DirSource): Added support for recursive
652            traversal of directories.
653    
654    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * R/textdoccol.R ([[): Loads the document corpus automatically
657            into memory upon access.
658            (tm_transform, tm_filter): Removed several checks whether the
659            document is already loaded ([[ ensures this now).
660            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
661            mailing list archive.
662    
663    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
664    
665            * R/aobjects.R (TextDocument): Is now a virtual class.
666            (Source): Is now a virtual class.
667    
668    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
669    
670            * R/textdoccol.R (c): Support for an arbitrary number of document
671            collections.
672    
673    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
674    
675            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
676            append_meta and remove_meta.
677    
678            * R/textdoccol.R: Removed modify_metadata method.
679    
680            * R/textrepo.R: Removed modify_metadata method.
681    
682            * R/textdoccol.R (remove_meta): Supports removal of document
683            collection metadata and document (= in data frame) metadata.
684    
685    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
686    
687            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
688    
689            * data/crude.rda: Rebuilt.
690    
691            * data/acq.rda: Rebuilt.
692    
693            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
694    
695            * R/textdoccol.R ([): Bug fix for subsetting a document
696            collection's data frame.
697    
698    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
699    
700            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
701            to s_filter.
702    
703            * R/textdoccol.R: Local text documents' metadata can now be copied
704            to a document collection's data frame with prescind_meta.
705    
706    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
707    
708            * R/: Text documents' slot metadata is now accessible in s_filter.
709    
710            * R/: Rewrote s_filter function (still has some restrictions).
711    
712    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
713    
714            * R/: Various fixes in handling metadata.
715    
716            * R/: Added update mechanism for text document collections.
717    
718    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
719    
720            * R/: Merging of document collections now creates a binary tree
721            for reconstructing merged document collections.
722    
723            * R/: Redesign of metadata for document collections.
724    
725    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
726    
727            * R/: Messages now use \code{ngettext}.
728    
729    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
730    
731            * R/: Added functions for modifying and removing metadata.
732    
733    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
734    
735            * man/: Updated some documentation.
736    
737            * R/: Corrected some connection issues.
738    
739            * inst/doc: Worked on the vignette.
740    
741    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
742    
743            * inst/: Added texts and started vignette.
744    
745            * R/: Final changes based upon David's comments.
746    
747    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
748    
749            * NAMESPACE: Corrected exports (generic methods need exportMethods
750            directives!).
751    
752    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
753    
754            * R/: Modified the TextDocCol constructor and various parsers. It
755            is now modular and supports various file formats via plugins (see
756            the new "Source" class).
757    
758    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
759    
760            * man/: Revised documentation after previous code changes.
761    
762    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
763    
764            * R/: Remaining changes as discussed with David.
765    
766    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
767    
768            * R/: Some changes as suggested by David. The rest will follow
769            within the next few days.
770    
771    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
772    
773            * man/: Finished documentation.
774    
775    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
776    
777            * man/: Wrote some documentation.
778    
779    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
780    
781            * R/: Further syntactic sugar in the form of additional assignment and
782            accessor methods.
783    
784    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
785    
786            * R/: Syntactic sugar in the form of "length", "show" and "summary"
787            operators.
788    
789    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
790    
791            * R/: Diverse updates. Mainly on default operators ("[" or "c")
792            and dissimilarities.
793    
794    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
795    
796            * R/: Added similarity functions.
797    
798            * data/: Added English stopwords.
799    
800    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
801    
802            * data/: Examples compiled for new features.
803    
804            * R/: Changes due to new structure.
805    
806            * NAMESPACE: Corrected namespace to reflect new structure.
807    
808            * R/termdocmatrix.R: Adapted for new naming scheme.
809    
810    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
811    
812            * R/textdoccol.R: Adapted code for new class structure. Wrote
813            several transform and filter functions operating on text document
814            collections (alias text document databases).
815    
816            * R/aobjects.R: Adapted class structure with inheritance,
817            repositories and additional meta data. Loading files on demand is
818            now possible.
819    
820    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
821    
822            * R/: Some cosmetic cleanups.
823    
824            * inst/: Removed vignette on clustering. That and much more is now
825            described in the JSS paper on text mining. Based upon that
826            article a more elaborate vignette will be incorporated in the future.
827    
828    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
829    
830            * R/: Updated generic S4 methods to comply with signature changes
831            in newer versions of R (> 2.3).
832    
833    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
834    
835            * ext/R/importRIS.R: Automatic RIS import is now possible.
836    
837    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
838    
839            * R/textdoccol.R: Added RIS HTML input format.
840    
841    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
842    
843            * R/textdoccol.R: Removed bug that caused invalid text document
844            collections when handling many input files.
845    
846    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
847    
848            * R/textdoccol.R: Restructured and extended file import
849            mechanism.
850    
851            * inst/doc/clustering.Rnw: Adapted vignette for use with
852            ReutNews.rda
853    
854            * man/ReutNews.Rd: Documentation for ReutNews.rda
855    
856            * data/ReutNews.rda: A tiny Reuters21578 example data set.
857    
858    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
859    
860            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
861            clustering facilities of this package.
862    
863    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
864    
865            * R/aobjects.R: Changed package document structure to avoid class
866            dependency problems.
867    
868    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
869    
870            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
871            data set.
872    
873            * Finished documentation and reordered directory structure. Now "R
874            CMD check textmin" works without errors.
875    
