SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC pkg/ChangeLog revision 962, Sun Jun 28 15:52:33 2009 UTC
# Line 1  Line 1 
1    2009-06-28  Ingo Feinerer  <feinerer@logic.at>
2    
3            * man/: Fix documentation.
4    
5    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
8            plain text document instead of an XML document for texts of the
9            Reuters-21578 dataset.
10    
11            * R/sparse.R: Removed since the slam package is now available on
12            CRAN.
13    
14            * DESCRIPTION (Depends): Add slam package.
15    
16    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
17    
18            * R/transform.R (stemDoc): Fix character(0) handling.
19    
20    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
21    
22            * R/doc.R (show): Pretty print.
23    
24    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
25    
26            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
27            gracefully.
28    
29    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
30    
31            * R/corpus.R: Make corpus virtual. Implement corpus with standard
32            and permanent storage semantics.
33    
34            * DESCRIPTION: New major release. A *lot* of improvements.
35    
36    2009-05-04   Ingo Feinerer <feinerer@logic.at>
37    
38            * NAMESPACE: Export some simple_triplet_matrix functions.
39    
40    2009-04-28   Ingo Feinerer <feinerer@logic.at>
41    
42            * R/weight.R: Adapt tf-idf to new matrix format.
43    
44    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
45    
46            * R/matrix.R: Create two distinct classes for term-document and
47            document-term matrices.
48    
49    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
50    
51            * R/termdocmatrix.R: No longer use Matrix package. This reduces
52            package start-up time significantly.
53    
54    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
55    
56            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
57    
58    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
59    
60            * R/transform.R (tmReduce): Combine multiple maps into one
61            transformation.
62    
63    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
64    
65            * R/weight.R: Remove weightLogical since it does not return a
66            dgCMatrix.
67    
68            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
69            or TermDocumentMatrix instead.
70    
71    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
72    
73            * inst/doc/extensions.Rnw: Finished vignette.
74    
75    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
76    
77            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
78            DocumentTermMatrix representations.
79    
80    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
81    
82            * R/reader.R (readXML): New reader for arbitrary XML files.
83    
84    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
85    
86            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
87            (XMLSource): New XMLSource class for arbitrary XML files.
88            (Source): New slot Vectorized.
89    
90    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
91    
92            * R/reader.R (readTabular): Experimental reader for tabular data
93            structures which can be customized via user-defined mappings.
94    
95            * R/reader.R: Always use UTC time zone.
96    
97            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
98    
99    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
100    
101            * R/reader.R (readDOC): Options can be passed over to antiword.
102    
103            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
104            pdftotext.
105    
106    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
107    
108            * R/source.R (DirSource): Add pattern and ignore.case arguments
109            which are internally passed over to list.files().
110    
111    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
112    
113            * inst/doc/tm.Rnw: Suppress pointless loading message.
114    
115    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
116    
117            * DESCRIPTION: Speed up package loading (via moving packages not
118            strictly necessary for normal operation to Suggests instead of
119            Depends).
120    
121    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
122    
123            * R/reader.R (readNewsgroup): The date format is now configurable.
124    
125    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
126    
127            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
128    
129    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
130    
131            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
132    
133    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
134    
135            * R/source.R (DataframeSource): New source class for data frames.
136    
137            * R/source.R: Fixed non-standard call evaluation.
138    
139    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
140    
141            * R/source.R (URISource): New source class for a single document.
142    
143    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
144    
145            * R/source.R: Refactoring.
146    
147    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
148    
149            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
150            Rmpi installations more gracefully.
151    
152    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
153    
154            * R/source.R (Source): Add Length slot.
155    
156    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
157    
158            * R/AAA.R: Unify duplicated .onLoad function.
159    
160    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
161    
162            * DESCRIPTION (Suggests): Added Rmpi.
163    
164    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
165    
166            * R/source.R (getElem): Fix 'no visible binding' warning.
167    
168            * man/WeightFunction.Rd: Fix signature.
169    
170    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
171    
172            * R/weight.R: Introduce name abbreviations for weighting functions.
173    
174    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
175    
176            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
177    
178            * R/cluster.R: Provide convenience functions for using a MPI
179            cluster.
180    
181            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
182            available.
183    
184            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
185            available.
186    
187    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
188    
189            * R/textdoccol.R (lapply): Removed debug print out.
190    
191    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
192    
193            * R/reader.R (readRCV1): Improved meta data extraction from
194            Reuters Corpus Volume 1 documents.
195    
196    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
197    
198            * R/transform.R: Ensure that all mappings preserve multiline
199            structures.
200    
201    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
202    
203            * R/filter.R: Every filter has now an attribute indicating whether
204            it sould be applied to document level (doclevel).
205    
206            * R/textdoccol.R (tmFilter): Set searchFullText as new default
207            filter.
208    
209    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/transform.R (replacePatterns): Replaced removeWords by
212            replacePatterns. Suggested by Christian Buchta.
213    
214            * R/textdoccol.R (inspect): Improved formatting.
215    
216    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
217    
218            * inst/CITATION: Updated JSS article information.
219    
220            * R/textdoccol.R (setAs): Added coerce method from list to
221            corpus.
222    
223            * R/meta.R (meta): Improved meta data handling.
224    
225    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
228            Christian Buchta.
229    
230            * inst/CITATION: Added template to include JSS article reference.
231    
232    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
233    
234            * R/textdoccol.R (tmMap): Introduced lazy mapping.
235    
236            * R/source.R: Added VectorSource.
237    
238    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
239    
240            * man/: Language codes should be in ISO 639-1 format.
241    
242            * R/textdoccol.R (asPlain): Preserve local meta data.
243    
244    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * R/textdoccol.R (writeCorpus): Function for writing a corpus
247            containing plain text documents to disk.
248    
249    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
250    
251            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
252            always set correctly.
253    
254            * R/textdoccol.R: Set load = TRUE as default for load on demand
255            since in most cases this is the wanted behaviour.
256    
257    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
260    
261            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
262    
263    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
264    
265            * R/meta.R (meta): New function for consistent access to meta data
266            of document collections, repositories, and texts.
267    
268    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
269    
270            * R/: Better support for encodings.
271    
272    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
273    
274            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
275            selection when no reader argument is given.
276    
277    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
278    
279            * R/source.R (CSVSource): Now uses read.csv instead of scan
280            internally.
281    
282    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
283    
284            * R/reader.R (getReaders): Returns available reader functions.
285    
286            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
287            as default.
288    
289    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/stopwords.R (stopwords): Shortened code, removed codetools
292            variable warnings.
293    
294            * man/: Documentation for showMeta, added an example for tmMap.
295    
296            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
297            some minor typos fixed.
298    
299    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * R/aobjects.R (showMeta): Added method for pretty printing a
302            text document's meta data.
303    
304    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * R/textdoccol.R (TextDocCol): Better handling of empty
307            arguments.
308    
309            * NAMESPACE: Exported readDOC.
310    
311            * man/completeStems.Rd: Added an example.
312    
313    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
314    
315            * R/stopwords.R (stopwords): Look up .dat files at every
316            call. Allows users to modify stopword .dat files interactively.
317    
318    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
319    
320            * R/termdocmatrix.R (termFreq): Correct processing of empty
321            documents.
322    
323    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
324    
325            * man/: Updated documentation.
326    
327    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
328    
329            * R/complete.R (completeStems): Completes (heuristically) word
330            stems.
331    
332            * R/termdocmatrix.R (TermDocMatrix2): New modular
333            constructor.
334    
335            * NAMESPACE: Exported termFreq.
336    
337    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * R/reader.R (readDOC): Added MS Word reader (using antiword).
340    
341    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * R/weight.R: Weighting functions for TermDocMatrix.
344    
345    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
346    
347            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
348            functions for accessing dimension, column, and row names.
349    
350            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
351    
352    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
355    
356    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
359    
360    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * R/reader.R (readPDF): Removed manual checks for pdftotext and
363            pdfinfo. The system call gives a warning anyway.
364    
365    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * R/textdoccol.R (asPlain): Conversion from
368            StructuredTextDocuments to PlainTextDocuments.
369    
370    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
371    
372            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
373            for accessing term-document matrices.
374    
375            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
376            are installed.
377    
378    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
379    
380            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
381            Christian Buchta.
382    
383    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
384    
385            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
386    
387    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
388    
389            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
390    
391            * R/reader.R (readPDF): Added PDF reader.
392    
393    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
394    
395            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
396    
397            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
398    
399            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
400    
401            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
402    
403    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
404    
405            * R/distmeasure.R (dissimilarity): Replaced dists call from
406            package cba by new dist call from package proxy.
407    
408    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
409    
410            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
411    
412    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * R/termdocmatrix.R: require() uses the quietly option to suppress
415            loading messages.
416    
417    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
418    
419            * R/dictionary.R: Added dictionary support.
420    
421    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
422    
423            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
424            documents. This simplifies some functions, e.g., asPlain.
425    
426    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
427    
428            * inst/doc/tm.Rnw: Fixed some typos in vignette.
429    
430    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
431    
432            * R/textdoccol.R (replaceWords): Added method to replace a set of
433            words by a single word. Useful for synonyms.
434    
435    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
436    
437            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
438    
439    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
440    
441            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
442            vectors. Thanks to Ariel Maguyon for his error report.
443            (removeSparseTerms): New function to remove columns from a
444            term-document matrix exceeding a sparse factor.
445    
446    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
447    
448            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
449    
450    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
451    
452            * man/sFilter.Rd: Corrected documentation on statement format (use
453            '==' instead of '=').
454    
455    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
456    
457            * R/aobjects.R (StructuredTextDocument): Inherits from
458            TextDocument.
459    
460    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
461    
462            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
463            on sparse matrices as proposed by Martin Maechler.
464    
465    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
466    
467            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
468            \pkg{filehash} version makes them deprecated.
469    
470    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
471    
472            * R/termdocmatrix.R (textvector): Stemming is now performed before
473            erasing stopwords.
474            (weightMatrix): Adapted to handle sparse matrices.
475            (TermDocMatrix): Sparse matrix is now efficiently built by
476            direct stepwise insertion of row values into it.
477    
478    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
479    
480            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
481            due to ongoing problems. For our purposes the latter is as useful
482            as the replaced package.
483    
484    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
485    
486            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
487    
488            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
489    
490    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
491    
492            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
493            languages with available stopwords.
494    
495    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
496    
497            * inst/doc/tm.Rnw: Minor corrections in the vignette.
498    
499    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
500    
501            * DESCRIPTION: Update to version 0.2, since a lot of new features
502            have been integrated.
503    
504            * inst/stopwords: Updated existing stopwords and added stopwords
505            for various other languages.
506    
507    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
508    
509            * man/: Updated documentation.
510    
511            * Work/testDb.R: Script to test database stuff.
512    
513            * R/: Fixed various database related bugs. Seems to be rather
514            useable now, i.e., consider as alpha status for now.
515    
516    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
517    
518            * R/: Fixed some bugs related to database support.
519    
520    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
521    
522            * man/: Added a lot of examples to the manuals.
523    
524    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
525    
526            * man/: Updated parts of the documentation.
527    
528            * R/textdoccol.R (asPlain): Added conversion from newsgroup
529            documents to plain text documents.
530    
531    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
532    
533            * R/textdoccol.R: Finished experimental database support. Not yet
534            intensively tested.
535    
536            * R/source.R: Now each source has a default reader.
537    
538            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
539            class anymore.
540    
541            * R/plaintextdoc.R: Custom show method for plain text documents.
542    
543            * R/aobjects.R: Added a class for structured text documents.
544    
545            * R/reader.R: Replaced remaining \code{parser} occurrences with
546            \code{reader}.
547    
548            * R/textdoccol.R (summary): Indent tags.
549    
550            * R/textdoccol.R (removePunctuation): Transform method to remove
551            punctuation marks.
552    
553    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
554    
555            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
556            using prescindMeta().
557    
558    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
559    
560            * R/textdoccol.R: Improved database support.
561    
562    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
563    
564            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
565    
566            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
567            language code.
568    
569            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
570            into parserControl argument.
571    
572            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
573    
574    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
575    
576            * Work/tmDataSetup.R: The datasets acq and crude can now be
577            created on the fly.
578    
579            * R/stopwords.R: Introduced a function returning the stopwords for
580            a given language (English, German and French at the moment)
581    
582            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
583            otherwise falls back to Snowball package.
584    
585    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
586    
587            * man/dissimilarity-methods.Rd: Make clear that any method offered
588            by "dists" from package "cba" can be used.
589    
590    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
591    
592            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
593            to Kurt's latex suggestion. Removed points and underscores in
594            variable names for consistent naming.
595    
596            * DESCRIPTION: Update to version 0.1-2.
597    
598            * man/TextRepository.Rd: Fixed bug in documentation.
599    
600    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
601    
602            * DESCRIPTION: Update to version 0.1-1.
603    
604    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
605    
606            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
607            wordStem.
608    
609    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
610    
611            * R/: Changes due to Kurt's review.
612    
613    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
614    
615            * R/: Implemented improvements based upon comments by David
616            Meyer.
617    
618    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
619    
620            * inst/doc/: Rewrote vignette.
621    
622            * man/: Improved documentation.
623    
624    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
625    
626            * man/: Updated documentation.
627    
628            * DESCRIPTION: Changed package name to "tm". Updated version to
629            0.1 for first CRAN release.
630    
631            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
632            list archive example.
633    
634            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
635            archive example.
636    
637            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
638            from (several mails per box) mbox format to (single mail per file)
639            eml format.
640    
641    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
642    
643            * data/crude.rda: Rebuilt.
644    
645            * data/acq.rda: Rebuilt.
646    
647            * R/reader.R: Factored out reader and parser methods from
648            textdoccol.R.
649    
650            * R/source.R: Factored out Source methods from aobjects.R and
651            textdoccol.R.
652            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
653            feeds.
654    
655            * R/textdoccol.R (DirSource): Added support for recursive
656            traversal of directories.
657    
658    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
659    
660            * R/textdoccol.R ([[): Loads the document corpus automatically
661            into memory upon access.
662            (tm_transform, tm_filter): Removed several checks whether the
663            document is already loaded ([[ ensures this now).
664            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
665            mailing list archive.
666    
667    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
668    
669            * R/aobjects.R (TextDocument): Is now a virtual class.
670            (Source): Is now a virtual class.
671    
672    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
673    
674            * R/textdoccol.R (c): Support for an arbitrary number of document
675            collections.
676    
677    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
678    
679            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
680            append_meta and remove_meta.
681    
682            * R/textdoccol.R: Removed modify_metadata method.
683    
684            * R/textrepo.R: Removed modify_metadata method.
685    
686            * R/textdoccol.R (remove_meta): Supports removal of document
687            collection metadata and document (= in data frame) metadata.
688    
689    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
690    
691            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
692    
693            * data/crude.rda: Rebuilt.
694    
695            * data/acq.rda: Rebuilt.
696    
697            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
698    
699            * R/textdoccol.R ([): Bug fix for subsetting a document
700            collection's data frame.
701    
702    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
703    
704            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
705            to s_filter.
706    
707            * R/textdoccol.R: Local text documents' metadata can now be copied
708            to a document collection's data frame with prescind_meta.
709    
710    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
711    
712            * R/: Text documents' slot metadata is now accessible in s_filter.
713    
714            * R/: Rewrote s_filter function (has still some restrictions).
715    
716    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
717    
718            * R/: Various fixes in handling metadata.
719    
720            * R/: Added update mechanism for text document collections.
721    
722    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
723    
724            * R/: Merging of document collections now creates a binary tree
725            for reconstructing merged document collections.
726    
727            * R/: Redesign of metadata for document collections.
728    
729    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
730    
731            * R/: Messages now use \code{ngettext}.
732    
733    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
734    
735            * R/: Added functions for modifying and removing metadata.
736    
737    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
738    
739            * man/: Updated some documentation.
740    
741            * R/: Corrected some connection issues.
742    
743            * inst/doc: Worked on the vignette.
744    
745    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
746    
747            * inst/: Added texts and started vignette.
748    
749            * R/: Final changes based upon David's comments.
750    
751    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * NAMESPACE: Corrected exports (generic methods need exportMethods
754            directives!).
755    
756    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
757    
758            * R/: Modified the TextDocCol constructur and various parsers. It
759            is now modular and supports various file formats via plugins (see
760            the new "Source" class).
761    
762    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
763    
764            * man/: Revised documentation after previous code changes.
765    
766    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
767    
768            * R/: Remaining changes as discussed with David.
769    
770    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
771    
772            * R/: Some changes as suggested by David. The rest will follow
773            within the next days.
774    
775    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
776    
777            * man/: Finished documentation.
778    
779    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
780    
781            * man/: Wrote some documentation.
782    
783    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
784    
785            * R/: Further syntactic sugar in form of additional assignment and
786            accessor methods.
787    
788    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
789    
790            * R/: Syntactic sugar in form of "length", "show" and "summary"
791            operators.
792    
793    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
794    
795            * R/: Diverse updates. Mainly on default operators ("[" or "c")
796            and dissimilarities.
797    
798    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
799    
800            * R/: Added similarity functions.
801    
802            * data/: Added english stopwords.
803    
804    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
805    
806            * data/: Examples compiled for new features
807    
808            * R/: Changes due to new structure.
809    
810            * NAMESPACE: Corrected namespace to reflect new structure.
811    
812            * R/termdocmatrix.R: Adapted for new naming scheme.
813    
814    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
815    
816            * R/textdoccol.R: Adapted code for new class structure. Wrote
817            several transform and filter functions operating on text document
818            collections (alias text document databases).
819    
820            * R/aobjects.R: Adapted class structure with inheritance,
821            repositories and additional meta data. Loading files on demand is
822            now possible.
823    
824    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
825    
826            * R/: Some cosmetic cleanups.
827    
828            * inst/: Removed vignette on clustering. That and much more is now
829            described in the JSS paper on text mining. Based upon that
830            article an elaborated vignette will be incorporated in the future.
831    
832    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
833    
834            * R/: Updated generic S4 methods to comply with signature changes
835            in newer versions of R (> 2.3)
836    
837    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
838    
839            * ext/R/importRIS.R: Automatic RIS import is now possible.
840    
841    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
842    
843            * R/textdoccol.R: Added RIS HTML input format.
844    
845    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
846    
847            * R/textdoccol.R: Removed bug that caused invalid text document
848            collections when handling many input files.
849    
850  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
851    
852          * R/textdoccol.R: Restructured and extended file import          * R/textdoccol.R: Restructured and extended file import

Legend:
Removed from v.37  
changed lines
  Added in v.962

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge