SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 28, Tue Dec 6 13:46:33 2005 UTC pkg/ChangeLog revision 959, Wed Jun 17 18:22:35 2009 UTC
# Line 1  Line 1 
1    2009-06-17  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/transform.R (stemDoc): Fix character(0) handling.
4    
5    2009-06-12  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/doc.R (show): Pretty print.
8    
9    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
10    
11            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
12            gracefully.
13    
14    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
15    
16            * R/corpus.R: Make corpus virtual. Implement corpus with standard
17            and permanent storage semantics.
18    
19            * DESCRIPTION: New major release. A *lot* of improvements.
20    
21    2009-05-04   Ingo Feinerer <feinerer@logic.at>
22    
23            * NAMESPACE: Export some simple_triplet_matrix functions.
24    
25    2009-04-28   Ingo Feinerer <feinerer@logic.at>
26    
27            * R/weight.R: Adapt tf-idf to new matrix format.
28    
29    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
30    
31            * R/matrix.R: Create two distinct classes for term-document and
32            document-term matrices.
33    
34    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
35    
36            * R/termdocmatrix.R: No longer use Matrix package. This reduces
37            package start-up time significantly.
38    
39    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
40    
41            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
42    
43    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
44    
45            * R/transform.R (tmReduce): Combine multiple maps into one
46            transformation.
47    
48    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
49    
50            * R/weight.R: Remove weightLogical since it does not return a
51            dgCMatrix.
52    
53            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
54            or TermDocumentMatrix instead.
55    
56    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
57    
58            * inst/doc/extensions.Rnw: Finished vignette.
59    
60    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
61    
62            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
63            DocumentTermMatrix representations.
64    
65    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
66    
67            * R/reader.R (readXML): New reader for arbitrary XML files.
68    
69    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
70    
71            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
72            (XMLSource): New XMLSource class for arbitrary XML files.
73            (Source): New slot Vectorized.
74    
75    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
76    
77            * R/reader.R (readTabular): Experimental reader for tabular data
78            structures which can be customized via user-defined mappings.
79    
80            * R/reader.R: Always use UTC time zone.
81    
82            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
83    
84    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
85    
86            * R/reader.R (readDOC): Options can be passed over to antiword.
87    
88            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
89            pdftotext.
90    
91    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
92    
93            * R/source.R (DirSource): Add pattern and ignore.case arguments
94            which are internally passed over to list.files().
95    
96    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
97    
98            * inst/doc/tm.Rnw: Suppress pointless loading message.
99    
100    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
101    
102            * DESCRIPTION: Speed up package loading (via moving packages not
103            strictly necessary for normal operation to Suggests instead of
104            Depends).
105    
106    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
107    
108            * R/reader.R (readNewsgroup): The date format is now configurable.
109    
110    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
111    
112            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
113    
114    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
115    
116            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
117    
118    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
119    
120            * R/source.R (DataframeSource): New source class for data frames.
121    
122            * R/source.R: Fixed non-standard call evaluation.
123    
124    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
125    
126            * R/source.R (URISource): New source class for a single document.
127    
128    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
129    
130            * R/source.R: Refactoring.
131    
132    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
133    
134            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
135            Rmpi installations more gracefully.
136    
137    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
138    
139            * R/source.R (Source): Add Length slot.
140    
141    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
142    
143            * R/AAA.R: Unify duplicated .onLoad function.
144    
145    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
146    
147            * DESCRIPTION (Suggests): Added Rmpi.
148    
149    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
150    
151            * R/source.R (getElem): Fix 'no visible binding' warning.
152    
153            * man/WeightFunction.Rd: Fix signature.
154    
155    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
156    
157            * R/weight.R: Introduce name abbreviations for weighting functions.
158    
159    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
160    
161            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
162    
163            * R/cluster.R: Provide convenience functions for using a MPI
164            cluster.
165    
166            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
167            available.
168    
169            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
170            available.
171    
172    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
173    
174            * R/textdoccol.R (lapply): Removed debug print out.
175    
176    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
177    
178            * R/reader.R (readRCV1): Improved meta data extraction from
179            Reuters Corpus Volume 1 documents.
180    
181    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
182    
183            * R/transform.R: Ensure that all mappings preserve multiline
184            structures.
185    
186    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
187    
188            * R/filter.R: Every filter has now an attribute indicating whether
189            it sould be applied to document level (doclevel).
190    
191            * R/textdoccol.R (tmFilter): Set searchFullText as new default
192            filter.
193    
194    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
195    
196            * R/transform.R (replacePatterns): Replaced removeWords by
197            replacePatterns. Suggested by Christian Buchta.
198    
199            * R/textdoccol.R (inspect): Improved formatting.
200    
201    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
202    
203            * inst/CITATION: Updated JSS article information.
204    
205            * R/textdoccol.R (setAs): Added coerce method from list to
206            corpus.
207    
208            * R/meta.R (meta): Improved meta data handling.
209    
210    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
213            Christian Buchta.
214    
215            * inst/CITATION: Added template to include JSS article reference.
216    
217    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
218    
219            * R/textdoccol.R (tmMap): Introduced lazy mapping.
220    
221            * R/source.R: Added VectorSource.
222    
223    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
224    
225            * man/: Language codes should be in ISO 639-1 format.
226    
227            * R/textdoccol.R (asPlain): Preserve local meta data.
228    
229    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
230    
231            * R/textdoccol.R (writeCorpus): Function for writing a corpus
232            containing plain text documents to disk.
233    
234    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
235    
236            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
237            always set correctly.
238    
239            * R/textdoccol.R: Set load = TRUE as default for load on demand
240            since in most cases this is the wanted behaviour.
241    
242    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
243    
244            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
245    
246            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
247    
248    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
249    
250            * R/meta.R (meta): New function for consistent access to meta data
251            of document collections, repositories, and texts.
252    
253    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * R/: Better support for encodings.
256    
257    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
260            selection when no reader argument is given.
261    
262    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
263    
264            * R/source.R (CSVSource): Now uses read.csv instead of scan
265            internally.
266    
267    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
268    
269            * R/reader.R (getReaders): Returns available reader functions.
270    
271            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
272            as default.
273    
274    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
275    
276            * R/stopwords.R (stopwords): Shortened code, removed codetools
277            variable warnings.
278    
279            * man/: Documentation for showMeta, added an example for tmMap.
280    
281            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
282            some minor typos fixed.
283    
284    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/aobjects.R (showMeta): Added method for pretty printing a
287            text document's meta data.
288    
289    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
290    
291            * R/textdoccol.R (TextDocCol): Better handling of empty
292            arguments.
293    
294            * NAMESPACE: Exported readDOC.
295    
296            * man/completeStems.Rd: Added an example.
297    
298    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300            * R/stopwords.R (stopwords): Look up .dat files at every
301            call. Allows users to modify stopword .dat files interactively.
302    
303    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
304    
305            * R/termdocmatrix.R (termFreq): Correct processing of empty
306            documents.
307    
308    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
309    
310            * man/: Updated documentation.
311    
312    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
313    
314            * R/complete.R (completeStems): Completes (heuristically) word
315            stems.
316    
317            * R/termdocmatrix.R (TermDocMatrix2): New modular
318            constructor.
319    
320            * NAMESPACE: Exported termFreq.
321    
322    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * R/reader.R (readDOC): Added MS Word reader (using antiword).
325    
326    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
327    
328            * R/weight.R: Weighting functions for TermDocMatrix.
329    
330    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
331    
332            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
333            functions for accessing dimension, column, and row names.
334    
335            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
336    
337    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
340    
341    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
342    
343            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
344    
345    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
346    
347            * R/reader.R (readPDF): Removed manual checks for pdftotext and
348            pdfinfo. The system call gives a warning anyway.
349    
350    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
351    
352            * R/textdoccol.R (asPlain): Conversion from
353            StructuredTextDocuments to PlainTextDocuments.
354    
355    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
358            for accessing term-document matrices.
359    
360            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
361            are installed.
362    
363    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
364    
365            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
366            Christian Buchta.
367    
368    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
369    
370            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
371    
372    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
373    
374            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
375    
376            * R/reader.R (readPDF): Added PDF reader.
377    
378    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
379    
380            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
381    
382            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
383    
384            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
385    
386            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
387    
388    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
389    
390            * R/distmeasure.R (dissimilarity): Replaced dists call from
391            package cba by new dist call from package proxy.
392    
393    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
394    
395            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
396    
397    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
398    
399            * R/termdocmatrix.R: require() uses the quietly option to suppress
400            loading messages.
401    
402    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
403    
404            * R/dictionary.R: Added dictionary support.
405    
406    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
407    
408            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
409            documents. This simplifies some functions, e.g., asPlain.
410    
411    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
412    
413            * inst/doc/tm.Rnw: Fixed some typos in vignette.
414    
415    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
416    
417            * R/textdoccol.R (replaceWords): Added method to replace a set of
418            words by a single word. Useful for synonyms.
419    
420    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
421    
422            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
423    
424    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
425    
426            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
427            vectors. Thanks to Ariel Maguyon for his error report.
428            (removeSparseTerms): New function to remove columns from a
429            term-document matrix exceeding a sparse factor.
430    
431    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
432    
433            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
434    
435    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
436    
437            * man/sFilter.Rd: Corrected documentation on statement format (use
438            '==' instead of '=').
439    
440    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
441    
442            * R/aobjects.R (StructuredTextDocument): Inherits from
443            TextDocument.
444    
445    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
446    
447            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
448            on sparse matrices as proposed by Martin Maechler.
449    
450    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
451    
452            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
453            \pkg{filehash} version makes them deprecated.
454    
455    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
456    
457            * R/termdocmatrix.R (textvector): Stemming is now performed before
458            erasing stopwords.
459            (weightMatrix): Adapted to handle sparse matrices.
460            (TermDocMatrix): Sparse matrix is now efficiently built by
461            direct stepwise insertion of row values into it.
462    
463    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
464    
465            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
466            due to ongoing problems. For our purposes the latter is as useful
467            as the replaced package.
468    
469    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
470    
471            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
472    
473            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
474    
475    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
476    
477            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
478            languages with available stopwords.
479    
480    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
481    
482            * inst/doc/tm.Rnw: Minor corrections in the vignette.
483    
484    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
485    
486            * DESCRIPTION: Update to version 0.2, since a lot of new features
487            have been integrated.
488    
489            * inst/stopwords: Updated existing stopwords and added stopwords
490            for various other languages.
491    
492    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
493    
494            * man/: Updated documentation.
495    
496            * Work/testDb.R: Script to test database stuff.
497    
498            * R/: Fixed various database related bugs. Seems to be rather
499            useable now, i.e., consider as alpha status for now.
500    
501    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
502    
503            * R/: Fixed some bugs related to database support.
504    
505    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
506    
507            * man/: Added a lot of examples to the manuals.
508    
509    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
510    
511            * man/: Updated parts of the documentation.
512    
513            * R/textdoccol.R (asPlain): Added conversion from newsgroup
514            documents to plain text documents.
515    
516    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
517    
518            * R/textdoccol.R: Finished experimental database support. Not yet
519            intensively tested.
520    
521            * R/source.R: Now each source has a default reader.
522    
523            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
524            class anymore.
525    
526            * R/plaintextdoc.R: Custom show method for plain text documents.
527    
528            * R/aobjects.R: Added a class for structured text documents.
529    
530            * R/reader.R: Replaced remaining \code{parser} occurrences with
531            \code{reader}.
532    
533            * R/textdoccol.R (summary): Indent tags.
534    
535            * R/textdoccol.R (removePunctuation): Transform method to remove
536            punctuation marks.
537    
538    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
539    
540            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
541            using prescindMeta().
542    
543    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
544    
545            * R/textdoccol.R: Improved database support.
546    
547    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
550    
551            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
552            language code.
553    
554            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
555            into parserControl argument.
556    
557            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
558    
559    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
560    
561            * Work/tmDataSetup.R: The datasets acq and crude can now be
562            created on the fly.
563    
564            * R/stopwords.R: Introduced a function returning the stopwords for
565            a given language (English, German and French at the moment)
566    
567            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
568            otherwise falls back to Snowball package.
569    
570    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
571    
572            * man/dissimilarity-methods.Rd: Make clear that any method offered
573            by "dists" from package "cba" can be used.
574    
575    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
576    
577            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
578            to Kurt's latex suggestion. Removed points and underscores in
579            variable names for consistent naming.
580    
581            * DESCRIPTION: Update to version 0.1-2.
582    
583            * man/TextRepository.Rd: Fixed bug in documentation.
584    
585    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
586    
587            * DESCRIPTION: Update to version 0.1-1.
588    
589    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
590    
591            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
592            wordStem.
593    
594    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
595    
596            * R/: Changes due to Kurt's review.
597    
598    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
599    
600            * R/: Implemented improvements based upon comments by David
601            Meyer.
602    
603    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
604    
605            * inst/doc/: Rewrote vignette.
606    
607            * man/: Improved documentation.
608    
609    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
610    
611            * man/: Updated documentation.
612    
613            * DESCRIPTION: Changed package name to "tm". Updated version to
614            0.1 for first CRAN release.
615    
616            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
617            list archive example.
618    
619            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
620            archive example.
621    
622            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
623            from (several mails per box) mbox format to (single mail per file)
624            eml format.
625    
626    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
627    
628            * data/crude.rda: Rebuilt.
629    
630            * data/acq.rda: Rebuilt.
631    
632            * R/reader.R: Factored out reader and parser methods from
633            textdoccol.R.
634    
635            * R/source.R: Factored out Source methods from aobjects.R and
636            textdoccol.R.
637            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
638            feeds.
639    
640            * R/textdoccol.R (DirSource): Added support for recursive
641            traversal of directories.
642    
643    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
644    
645            * R/textdoccol.R ([[): Loads the document corpus automatically
646            into memory upon access.
647            (tm_transform, tm_filter): Removed several checks whether the
648            document is already loaded ([[ ensures this now).
649            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
650            mailing list archive.
651    
652    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
653    
654            * R/aobjects.R (TextDocument): Is now a virtual class.
655            (Source): Is now a virtual class.
656    
657    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
658    
659            * R/textdoccol.R (c): Support for an arbitrary number of document
660            collections.
661    
662    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
663    
664            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
665            append_meta and remove_meta.
666    
667            * R/textdoccol.R: Removed modify_metadata method.
668    
669            * R/textrepo.R: Removed modify_metadata method.
670    
671            * R/textdoccol.R (remove_meta): Supports removal of document
672            collection metadata and document (= in data frame) metadata.
673    
674    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
675    
676            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
677    
678            * data/crude.rda: Rebuilt.
679    
680            * data/acq.rda: Rebuilt.
681    
682            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
683    
684            * R/textdoccol.R ([): Bug fix for subsetting a document
685            collection's data frame.
686    
687    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
688    
689            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
690            to s_filter.
691    
692            * R/textdoccol.R: Local text documents' metadata can now be copied
693            to a document collection's data frame with prescind_meta.
694    
695    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
696    
697            * R/: Text documents' slot metadata is now accessible in s_filter.
698    
699            * R/: Rewrote s_filter function (has still some restrictions).
700    
701    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
702    
703            * R/: Various fixes in handling metadata.
704    
705            * R/: Added update mechanism for text document collections.
706    
707    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
708    
709            * R/: Merging of document collections now creates a binary tree
710            for reconstructing merged document collections.
711    
712            * R/: Redesign of metadata for document collections.
713    
714    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
715    
716            * R/: Messages now use \code{ngettext}.
717    
718    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
719    
720            * R/: Added functions for modifying and removing metadata.
721    
722    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
723    
724            * man/: Updated some documentation.
725    
726            * R/: Corrected some connection issues.
727    
728            * inst/doc: Worked on the vignette.
729    
730    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
731    
732            * inst/: Added texts and started vignette.
733    
734            * R/: Final changes based upon David's comments.
735    
736    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
737    
738            * NAMESPACE: Corrected exports (generic methods need exportMethods
739            directives!).
740    
741    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
742    
743            * R/: Modified the TextDocCol constructur and various parsers. It
744            is now modular and supports various file formats via plugins (see
745            the new "Source" class).
746    
747    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
748    
749            * man/: Revised documentation after previous code changes.
750    
751    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
752    
753            * R/: Remaining changes as discussed with David.
754    
755    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
756    
757            * R/: Some changes as suggested by David. The rest will follow
758            within the next days.
759    
760    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
761    
762            * man/: Finished documentation.
763    
764    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
765    
766            * man/: Wrote some documentation.
767    
768    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
769    
770            * R/: Further syntactic sugar in form of additional assignment and
771            accessor methods.
772    
773    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
774    
775            * R/: Syntactic sugar in form of "length", "show" and "summary"
776            operators.
777    
778    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
779    
780            * R/: Diverse updates. Mainly on default operators ("[" or "c")
781            and dissimilarities.
782    
783    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
784    
785            * R/: Added similarity functions.
786    
787            * data/: Added english stopwords.
788    
789    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
790    
791            * data/: Examples compiled for new features
792    
793            * R/: Changes due to new structure.
794    
795            * NAMESPACE: Corrected namespace to reflect new structure.
796    
797            * R/termdocmatrix.R: Adapted for new naming scheme.
798    
799    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
800    
801            * R/textdoccol.R: Adapted code for new class structure. Wrote
802            several transform and filter functions operating on text document
803            collections (alias text document databases).
804    
805            * R/aobjects.R: Adapted class structure with inheritance,
806            repositories and additional meta data. Loading files on demand is
807            now possible.
808    
809    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
810    
811            * R/: Some cosmetic cleanups.
812    
813            * inst/: Removed vignette on clustering. That and much more is now
814            described in the JSS paper on text mining. Based upon that
815            article an elaborated vignette will be incorporated in the future.
816    
817    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
818    
819            * R/: Updated generic S4 methods to comply with signature changes
820            in newer versions of R (> 2.3)
821    
822    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
823    
824            * ext/R/importRIS.R: Automatic RIS import is now possible.
825    
826    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
827    
828            * R/textdoccol.R: Added RIS HTML input format.
829    
830    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
831    
832            * R/textdoccol.R: Removed bug that caused invalid text document
833            collections when handling many input files.
834    
835    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
836    
837            * R/textdoccol.R: Restructured and extended file import
838            mechanism.
839    
840            * inst/doc/clustering.Rnw: Adapted vignette for use with
841            ReutNews.rda
842    
843            * man/ReutNews.Rd: Documentation for ReutNews.rda
844    
845            * data/ReutNews.rda: A tiny Reuters21578 example data set.
846    
847    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
848    
849            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
850            clustering facilities of this package.
851    
852    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
853    
854            * R/aobjects.R: Changed package document structure to avoid class
855            dependency problems.
856    
857  2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
858    
859            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
860            data set.
861    
862          * Finished documentation and reordered directory structure. Now "R          * Finished documentation and reordered directory structure. Now "R
863          CMD check textmin" works without errors.          CMD check textmin" works without errors.
864    

Legend:
Removed from v.28  
changed lines
  Added in v.959

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge