Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC
pkg/ChangeLog revision 972, Fri Jul 3 16:16:59 2009 UTC
1    2009-07-03  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/transform.R: Move removeCitation, removeMultipart, and
4            removeSignature to the tau package since they are mainly utility
5            functions (for handling e-mails) and not very framework specific.
6    
7    2009-06-28  Ingo Feinerer  <feinerer@logic.at>
8    
9            * man/: Fix documentation.
10    
11    2009-06-26  Ingo Feinerer  <feinerer@logic.at>
12    
13            * R/reader.R (readReut21578XMLasPlain): New reader which returns a
14            plain text document instead of an XML document for texts of the
15            Reuters-21578 dataset.
16    
17            * R/sparse.R: Removed since the slam package is now available on
18            CRAN.
19    
20            * DESCRIPTION (Depends): Add slam package.
21    
22    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
23    
24            * R/transform.R (stemDoc): Fix character(0) handling.
25    
26    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
27    
28            * R/doc.R (show): Pretty print.
29    
30    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
31    
32            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
33            gracefully.
34    
35    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
36    
37            * R/corpus.R: Make corpus virtual. Implement corpus with standard
38            and permanent storage semantics.
39    
40            * DESCRIPTION: New major release. A *lot* of improvements.
41    
42    2009-05-04  Ingo Feinerer  <feinerer@logic.at>
43    
44            * NAMESPACE: Export some simple_triplet_matrix functions.
45    
46    2009-04-28  Ingo Feinerer  <feinerer@logic.at>
47    
48            * R/weight.R: Adapt tf-idf to new matrix format.
49    
50    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
51    
52            * R/matrix.R: Create two distinct classes for term-document and
53            document-term matrices.
54    
55    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/termdocmatrix.R: No longer use Matrix package. This reduces
58            package start-up time significantly.
59    
60    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
61    
62            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
63    
64    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
65    
66            * R/transform.R (tmReduce): Combine multiple maps into one
67            transformation.
68    
69    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
70    
71            * R/weight.R: Remove weightLogical since it does not return a
72            dgCMatrix.
73    
74            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
75            or TermDocumentMatrix instead.
76    
77    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
78    
79            * inst/doc/extensions.Rnw: Finished vignette.
80    
81    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
82    
83            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
84            DocumentTermMatrix representations.
85    
86    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
87    
88            * R/reader.R (readXML): New reader for arbitrary XML files.
89    
90    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
91    
92            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
93            (XMLSource): New XMLSource class for arbitrary XML files.
94            (Source): New slot Vectorized.
95    
96    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
97    
98            * R/reader.R (readTabular): Experimental reader for tabular data
99            structures which can be customized via user-defined mappings.
100    
101            * R/reader.R: Always use UTC time zone.
102    
103            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
104    
105    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
106    
107            * R/reader.R (readDOC): Options can be passed over to antiword.
108    
109            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
110            pdftotext.
111    
112    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
113    
114            * R/source.R (DirSource): Add pattern and ignore.case arguments
115            which are internally passed over to list.files().
116    
117    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
118    
119            * inst/doc/tm.Rnw: Suppress pointless loading message.
120    
121    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
122    
123            * DESCRIPTION: Speed up package loading (via moving packages not
124            strictly necessary for normal operation to Suggests instead of
125            Depends).
126    
127    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
128    
129            * R/reader.R (readNewsgroup): The date format is now configurable.
130    
131    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
132    
133            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
134    
135    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
136    
137            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
138    
139    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
140    
141            * R/source.R (DataframeSource): New source class for data frames.
142    
143            * R/source.R: Fixed non-standard call evaluation.
144    
145    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
146    
147            * R/source.R (URISource): New source class for a single document.
148    
149    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
150    
151            * R/source.R: Refactoring.
152    
153    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
154    
155            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
156            Rmpi installations more gracefully.
157    
158    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
159    
160            * R/source.R (Source): Add Length slot.
161    
162    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
163    
164            * R/AAA.R: Unify duplicated .onLoad function.
165    
166    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
167    
168            * DESCRIPTION (Suggests): Added Rmpi.
169    
170    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
171    
172            * R/source.R (getElem): Fix 'no visible binding' warning.
173    
174            * man/WeightFunction.Rd: Fix signature.
175    
176    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
177    
178            * R/weight.R: Introduce name abbreviations for weighting functions.
179    
180    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
181    
182            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
183    
184            * R/cluster.R: Provide convenience functions for using a MPI
185            cluster.
186    
187            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
188            available.
189    
190            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
191            available.
192    
193    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
194    
195            * R/textdoccol.R (lapply): Removed debug print out.
196    
197    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
198    
199            * R/reader.R (readRCV1): Improved meta data extraction from
200            Reuters Corpus Volume 1 documents.
201    
202    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
203    
204            * R/transform.R: Ensure that all mappings preserve multiline
205            structures.
206    
207    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
208    
209            * R/filter.R: Every filter now has an attribute indicating whether
210            it should be applied at the document level (doclevel).
211    
212            * R/textdoccol.R (tmFilter): Set searchFullText as new default
213            filter.
214    
215    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
216    
217            * R/transform.R (replacePatterns): Replaced removeWords by
218            replacePatterns. Suggested by Christian Buchta.
219    
220            * R/textdoccol.R (inspect): Improved formatting.
221    
222    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
223    
224            * inst/CITATION: Updated JSS article information.
225    
226            * R/textdoccol.R (setAs): Added coerce method from list to
227            corpus.
228    
229            * R/meta.R (meta): Improved meta data handling.
230    
231    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
232    
233            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
234            Christian Buchta.
235    
236            * inst/CITATION: Added template to include JSS article reference.
237    
238    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
239    
240            * R/textdoccol.R (tmMap): Introduced lazy mapping.
241    
242            * R/source.R: Added VectorSource.
243    
244    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * man/: Language codes should be in ISO 639-1 format.
247    
248            * R/textdoccol.R (asPlain): Preserve local meta data.
249    
250    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
251    
252            * R/textdoccol.R (writeCorpus): Function for writing a corpus
253            containing plain text documents to disk.
254    
255    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
256    
257            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
258            always set correctly.
259    
260            * R/textdoccol.R: Set load = TRUE as default for load on demand
261            since in most cases this is the wanted behaviour.
262    
263    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
264    
265            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
266    
267            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
268    
269    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
270    
271            * R/meta.R (meta): New function for consistent access to meta data
272            of document collections, repositories, and texts.
273    
274    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
275    
276            * R/: Better support for encodings.
277    
278    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
279    
280            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
281            selection when no reader argument is given.
282    
283    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
284    
285            * R/source.R (CSVSource): Now uses read.csv instead of scan
286            internally.
287    
288    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
289    
290            * R/reader.R (getReaders): Returns available reader functions.
291    
292            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
293            as default.
294    
295    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * R/stopwords.R (stopwords): Shortened code, removed codetools
298            variable warnings.
299    
300            * man/: Documentation for showMeta, added an example for tmMap.
301    
302            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
303            some minor typos fixed.
304    
305    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
306    
307            * R/aobjects.R (showMeta): Added method for pretty printing a
308            text document's meta data.
309    
310    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
311    
312            * R/textdoccol.R (TextDocCol): Better handling of empty
313            arguments.
314    
315            * NAMESPACE: Exported readDOC.
316    
317            * man/completeStems.Rd: Added an example.
318    
319    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
320    
321            * R/stopwords.R (stopwords): Look up .dat files at every
322            call. Allows users to modify stopword .dat files interactively.
323    
324    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
325    
326            * R/termdocmatrix.R (termFreq): Correct processing of empty
327            documents.
328    
329    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
330    
331            * man/: Updated documentation.
332    
333    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335            * R/complete.R (completeStems): Completes (heuristically) word
336            stems.
337    
338            * R/termdocmatrix.R (TermDocMatrix2): New modular
339            constructor.
340    
341            * NAMESPACE: Exported termFreq.
342    
343    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * R/reader.R (readDOC): Added MS Word reader (using antiword).
346    
347    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/weight.R: Weighting functions for TermDocMatrix.
350    
351    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
352    
353            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
354            functions for accessing dimension, column, and row names.
355    
356            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
357    
358    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
359    
360            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
361    
362    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
363    
364            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
365    
366    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
367    
368            * R/reader.R (readPDF): Removed manual checks for pdftotext and
369            pdfinfo. The system call gives a warning anyway.
370    
371    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
372    
373            * R/textdoccol.R (asPlain): Conversion from
374            StructuredTextDocuments to PlainTextDocuments.
375    
376    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
377    
378            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
379            for accessing term-document matrices.
380    
381            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
382            are installed.
383    
384    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
385    
386            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
387            Christian Buchta.
388    
389    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
390    
391            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
392    
393    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
394    
395            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
396    
397            * R/reader.R (readPDF): Added PDF reader.
398    
399    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
400    
401            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
402    
403            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
404    
405            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
406    
407            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
408    
409    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
410    
411            * R/distmeasure.R (dissimilarity): Replaced dists call from
412            package cba by new dist call from package proxy.
413    
414    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
415    
416            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
417    
418    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
419    
420            * R/termdocmatrix.R: require() uses the quietly option to suppress
421            loading messages.
422    
423    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
424    
425            * R/dictionary.R: Added dictionary support.
426    
427    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
428    
429            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
430            documents. This simplifies some functions, e.g., asPlain.
431    
432    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
433    
434            * inst/doc/tm.Rnw: Fixed some typos in vignette.
435    
436    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
437    
438            * R/textdoccol.R (replaceWords): Added method to replace a set of
439            words by a single word. Useful for synonyms.
440    
441    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
442    
443            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
444    
445    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
446    
447            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
448            vectors. Thanks to Ariel Maguyon for his error report.
449            (removeSparseTerms): New function to remove columns from a
450            term-document matrix exceeding a sparse factor.
451    
452    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
453    
454            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
455    
456    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
457    
458            * man/sFilter.Rd: Corrected documentation on statement format (use
459            '==' instead of '=').
460    
461    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
462    
463            * R/aobjects.R (StructuredTextDocument): Inherits from
464            TextDocument.
465    
466    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
467    
468            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
469            on sparse matrices as proposed by Martin Maechler.
470    
471    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
472    
473            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
474            \pkg{filehash} version makes them deprecated.
475    
476    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
477    
478            * R/termdocmatrix.R (textvector): Stemming is now performed before
479            erasing stopwords.
480            (weightMatrix): Adapted to handle sparse matrices.
481            (TermDocMatrix): Sparse matrix is now efficiently built by
482            direct stepwise insertion of row values into it.
483    
484    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
485    
486            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
487            due to ongoing problems. For our purposes the latter is as useful
488            as the replaced package.
489    
490    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
491    
492            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
493    
494            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
495    
496    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
497    
498            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
499            languages with available stopwords.
500    
501    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
502    
503            * inst/doc/tm.Rnw: Minor corrections in the vignette.
504    
505    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
506    
507            * DESCRIPTION: Update to version 0.2, since a lot of new features
508            have been integrated.
509    
510            * inst/stopwords: Updated existing stopwords and added stopwords
511            for various other languages.
512    
513    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
514    
515            * man/: Updated documentation.
516    
517            * Work/testDb.R: Script to test database stuff.
518    
519            * R/: Fixed various database related bugs. Seems to be rather
520            useable now, i.e., consider as alpha status for now.
521    
522    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
523    
524            * R/: Fixed some bugs related to database support.
525    
526    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
527    
528            * man/: Added a lot of examples to the manuals.
529    
530    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * man/: Updated parts of the documentation.
533    
534            * R/textdoccol.R (asPlain): Added conversion from newsgroup
535            documents to plain text documents.
536    
537    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
538    
539            * R/textdoccol.R: Finished experimental database support. Not yet
540            intensively tested.
541    
542            * R/source.R: Now each source has a default reader.
543    
544            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
545            class anymore.
546    
547            * R/plaintextdoc.R: Custom show method for plain text documents.
548    
549            * R/aobjects.R: Added a class for structured text documents.
550    
551            * R/reader.R: Replaced remaining \code{parser} occurrences with
552            \code{reader}.
553    
554            * R/textdoccol.R (summary): Indent tags.
555    
556            * R/textdoccol.R (removePunctuation): Transform method to remove
557            punctuation marks.
558    
559    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
560    
561            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
562            using prescindMeta().
563    
564    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
565    
566            * R/textdoccol.R: Improved database support.
567    
568    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
569    
570            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
571    
572            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
573            language code.
574    
575            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
576            into parserControl argument.
577    
578            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
579    
580    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
581    
582            * Work/tmDataSetup.R: The datasets acq and crude can now be
583            created on the fly.
584    
585            * R/stopwords.R: Introduced a function returning the stopwords for
586            a given language (English, German and French at the moment).
587    
588            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
589            otherwise falls back to Snowball package.
590    
591    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
592    
593            * man/dissimilarity-methods.Rd: Make clear that any method offered
594            by "dists" from package "cba" can be used.
595    
596    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
597    
598            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
599            to Kurt's latex suggestion. Removed points and underscores in
600            variable names for consistent naming.
601    
602            * DESCRIPTION: Update to version 0.1-2.
603    
604            * man/TextRepository.Rd: Fixed bug in documentation.
605    
606    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
607    
608            * DESCRIPTION: Update to version 0.1-1.
609    
610    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
611    
612            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
613            wordStem.
614    
615    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
616    
617            * R/: Changes due to Kurt's review.
618    
619    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
620    
621            * R/: Implemented improvements based upon comments by David
622            Meyer.
623    
624    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
625    
626            * inst/doc/: Rewrote vignette.
627    
628            * man/: Improved documentation.
629    
630    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
631    
632            * man/: Updated documentation.
633    
634            * DESCRIPTION: Changed package name to "tm". Updated version to
635            0.1 for first CRAN release.
636    
637            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
638            list archive example.
639    
640            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
641            archive example.
642    
643            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
644            from (several mails per box) mbox format to (single mail per file)
645            eml format.
646    
647    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
648    
649            * data/crude.rda: Rebuilt.
650    
651            * data/acq.rda: Rebuilt.
652    
653            * R/reader.R: Factored out reader and parser methods from
654            textdoccol.R.
655    
656            * R/source.R: Factored out Source methods from aobjects.R and
657            textdoccol.R.
658            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
659            feeds.
660    
661            * R/textdoccol.R (DirSource): Added support for recursive
662            traversal of directories.
663    
664    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
665    
666            * R/textdoccol.R ([[): Loads the document corpus automatically
667            into memory upon access.
668            (tm_transform, tm_filter): Removed several checks whether the
669            document is already loaded ([[ ensures this now).
670            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
671            mailing list archive.
672    
673    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
674    
675            * R/aobjects.R (TextDocument): Is now a virtual class.
676            (Source): Is now a virtual class.
677    
678    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
679    
680            * R/textdoccol.R (c): Support for an arbitrary number of document
681            collections.
682    
683    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
684    
685            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
686            append_meta and remove_meta.
687    
688            * R/textdoccol.R: Removed modify_metadata method.
689    
690            * R/textrepo.R: Removed modify_metadata method.
691    
692            * R/textdoccol.R (remove_meta): Supports removal of document
693            collection metadata and document (= in data frame) metadata.
694    
695    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
696    
697            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
698    
699            * data/crude.rda: Rebuilt.
700    
701            * data/acq.rda: Rebuilt.
702    
703            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
704    
705            * R/textdoccol.R ([): Bug fix for subsetting a document
706            collection's data frame.
707    
708    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
709    
710            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
711            to s_filter.
712    
713            * R/textdoccol.R: Local text documents' metadata can now be copied
714            to a document collection's data frame with prescind_meta.
715    
716    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
717    
718            * R/: Text documents' slot metadata is now accessible in s_filter.
719    
720            * R/: Rewrote s_filter function (has still some restrictions).
721    
722    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
723    
724            * R/: Various fixes in handling metadata.
725    
726            * R/: Added update mechanism for text document collections.
727    
728    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
729    
730            * R/: Merging of document collections now creates a binary tree
731            for reconstructing merged document collections.
732    
733            * R/: Redesign of metadata for document collections.
734    
735    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
736    
737            * R/: Messages now use \code{ngettext}.
738    
739    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
740    
741            * R/: Added functions for modifying and removing metadata.
742    
743    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
744    
745            * man/: Updated some documentation.
746    
747            * R/: Corrected some connection issues.
748    
749            * inst/doc: Worked on the vignette.
750    
751    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * inst/: Added texts and started vignette.
754    
755            * R/: Final changes based upon David's comments.
756    
757    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
758    
759            * NAMESPACE: Corrected exports (generic methods need exportMethods
760            directives!).
761    
762    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
763    
764            * R/: Modified the TextDocCol constructor and various parsers. It
765            is now modular and supports various file formats via plugins (see
766            the new "Source" class).
767    
768    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
769    
770            * man/: Revised documentation after previous code changes.
771    
772    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
773    
774            * R/: Remaining changes as discussed with David.
775    
776    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
777    
778            * R/: Some changes as suggested by David. The rest will follow
779            within the next days.
780    
781    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
782    
783            * man/: Finished documentation.
784    
785    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
786    
787            * man/: Wrote some documentation.
788    
789    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
790    
791            * R/: Further syntactic sugar in form of additional assignment and
792            accessor methods.
793    
794    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
795    
796            * R/: Syntactic sugar in form of "length", "show" and "summary"
797            operators.
798    
799    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
800    
801            * R/: Diverse updates. Mainly on default operators ("[" or "c")
802            and dissimilarities.
803    
804    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
805    
806            * R/: Added similarity functions.
807    
808            * data/: Added english stopwords.
809    
810    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
811    
812            * data/: Examples compiled for new features.
813    
814            * R/: Changes due to new structure.
815    
816            * NAMESPACE: Corrected namespace to reflect new structure.
817    
818            * R/termdocmatrix.R: Adapted for new naming scheme.
819    
820    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
821    
822            * R/textdoccol.R: Adapted code for new class structure. Wrote
823            several transform and filter functions operating on text document
824            collections (alias text document databases).
825    
826            * R/aobjects.R: Adapted class structure with inheritance,
827            repositories and additional meta data. Loading files on demand is
828            now possible.
829    
830    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
831    
832            * R/: Some cosmetic cleanups.
833    
834            * inst/: Removed vignette on clustering. That and much more is now
835            described in the JSS paper on text mining. Based upon that
836            article an elaborated vignette will be incorporated in the future.
837    
838    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
839    
840            * R/: Updated generic S4 methods to comply with signature changes
841            in newer versions of R (> 2.3).
842    
843    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
844    
845            * ext/R/importRIS.R: Automatic RIS import is now possible.
846    
847    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
848    
849            * R/textdoccol.R: Added RIS HTML input format.
850    
851    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
852    
853            * R/textdoccol.R: Removed bug that caused invalid text document
854            collections when handling many input files.
855    
856    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
857    
858            * R/textdoccol.R: Restructured and extended file import
