SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC pkg/ChangeLog revision 954, Wed May 27 18:33:32 2009 UTC
# Line 1  Line 1 
1    2009-05-27  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/matrix.R (print.TermDocumentMatrix): Handle empty matrices
4            gracefully.
5    
6    2009-05-13  Ingo Feinerer  <feinerer@logic.at>
7    
8            * R/corpus.R: Make corpus virtual. Implement corpus with standard
9            and permanent storage semantics.
10    
11            * DESCRIPTION: New major release. A *lot* of improvements.
12    
13    2009-05-04   Ingo Feinerer <feinerer@logic.at>
14    
15            * NAMESPACE: Export some simple_triplet_matrix functions.
16    
17    2009-04-28   Ingo Feinerer <feinerer@logic.at>
18    
19            * R/weight.R: Adapt tf-idf to new matrix format.
20    
21    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
22    
23            * R/matrix.R: Create two distinct classes for term-document and
24            document-term matrices.
25    
26    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
27    
28            * R/termdocmatrix.R: No longer use Matrix package. This reduces
29            package start-up time significantly.
30    
31    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
32    
33            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
34    
35    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
36    
37            * R/transform.R (tmReduce): Combine multiple maps into one
38            transformation.
39    
40    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/weight.R: Remove weightLogical since it does not return a
43            dgCMatrix.
44    
45            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
46            or TermDocumentMatrix instead.
47    
48    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
49    
50            * inst/doc/extensions.Rnw: Finished vignette.
51    
52    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
53    
54            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
55            DocumentTermMatrix representations.
56    
57    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
58    
59            * R/reader.R (readXML): New reader for arbitrary XML files.
60    
61    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
62    
63            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
64            (XMLSource): New XMLSource class for arbitrary XML files.
65            (Source): New slot Vectorized.
66    
67    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
68    
69            * R/reader.R (readTabular): Experimental reader for tabular data
70            structures which can be customized via user-defined mappings.
71    
72            * R/reader.R: Always use UTC time zone.
73    
74            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
75    
76    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
77    
78            * R/reader.R (readDOC): Options can be passed over to antiword.
79    
80            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
81            pdftotext.
82    
83    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
84    
85            * R/source.R (DirSource): Add pattern and ignore.case arguments
86            which are internally passed over to list.files().
87    
88    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
89    
90            * inst/doc/tm.Rnw: Suppress pointless loading message.
91    
92    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
93    
94            * DESCRIPTION: Speed up package loading (via moving packages not
95            strictly necessary for normal operation to Suggests instead of
96            Depends).
97    
98    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
99    
100            * R/reader.R (readNewsgroup): The date format is now configurable.
101    
102    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
103    
104            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
105    
106    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
107    
108            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
109    
110    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
111    
112            * R/source.R (DataframeSource): New source class for data frames.
113    
114            * R/source.R: Fixed non-standard call evaluation.
115    
116    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
117    
118            * R/source.R (URISource): New source class for a single document.
119    
120    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
121    
122            * R/source.R: Refactoring.
123    
124    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
125    
126            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
127            Rmpi installations more gracefully.
128    
129    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
130    
131            * R/source.R (Source): Add Length slot.
132    
133    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
134    
135            * R/AAA.R: Unify duplicated .onLoad function.
136    
137    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
138    
139            * DESCRIPTION (Suggests): Added Rmpi.
140    
141    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
142    
143            * R/source.R (getElem): Fix 'no visible binding' warning.
144    
145            * man/WeightFunction.Rd: Fix signature.
146    
147    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
148    
149            * R/weight.R: Introduce name abbreviations for weighting functions.
150    
151    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
152    
153            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
154    
155            * R/cluster.R: Provide convenience functions for using a MPI
156            cluster.
157    
158            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
159            available.
160    
161            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
162            available.
163    
164    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
165    
166            * R/textdoccol.R (lapply): Removed debug print out.
167    
168    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
169    
170            * R/reader.R (readRCV1): Improved meta data extraction from
171            Reuters Corpus Volume 1 documents.
172    
173    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
174    
175            * R/transform.R: Ensure that all mappings preserve multiline
176            structures.
177    
178    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
179    
180            * R/filter.R: Every filter has now an attribute indicating whether
181            it sould be applied to document level (doclevel).
182    
183            * R/textdoccol.R (tmFilter): Set searchFullText as new default
184            filter.
185    
186    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
187    
188            * R/transform.R (replacePatterns): Replaced removeWords by
189            replacePatterns. Suggested by Christian Buchta.
190    
191            * R/textdoccol.R (inspect): Improved formatting.
192    
193    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
194    
195            * inst/CITATION: Updated JSS article information.
196    
197            * R/textdoccol.R (setAs): Added coerce method from list to
198            corpus.
199    
200            * R/meta.R (meta): Improved meta data handling.
201    
202    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
203    
204            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
205            Christian Buchta.
206    
207            * inst/CITATION: Added template to include JSS article reference.
208    
209    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
210    
211            * R/textdoccol.R (tmMap): Introduced lazy mapping.
212    
213            * R/source.R: Added VectorSource.
214    
215    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
216    
217            * man/: Language codes should be in ISO 639-1 format.
218    
219            * R/textdoccol.R (asPlain): Preserve local meta data.
220    
221    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
222    
223            * R/textdoccol.R (writeCorpus): Function for writing a corpus
224            containing plain text documents to disk.
225    
226    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
227    
228            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
229            always set correctly.
230    
231            * R/textdoccol.R: Set load = TRUE as default for load on demand
232            since in most cases this is the wanted behaviour.
233    
234    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
235    
236            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
237    
238            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
239    
240    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
241    
242            * R/meta.R (meta): New function for consistent access to meta data
243            of document collections, repositories, and texts.
244    
245    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
246    
247            * R/: Better support for encodings.
248    
249    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
250    
251            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
252            selection when no reader argument is given.
253    
254    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * R/source.R (CSVSource): Now uses read.csv instead of scan
257            internally.
258    
259    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
260    
261            * R/reader.R (getReaders): Returns available reader functions.
262    
263            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
264            as default.
265    
266    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
267    
268            * R/stopwords.R (stopwords): Shortened code, removed codetools
269            variable warnings.
270    
271            * man/: Documentation for showMeta, added an example for tmMap.
272    
273            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
274            some minor typos fixed.
275    
276    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
277    
278            * R/aobjects.R (showMeta): Added method for pretty printing a
279            text document's meta data.
280    
281    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
282    
283            * R/textdoccol.R (TextDocCol): Better handling of empty
284            arguments.
285    
286            * NAMESPACE: Exported readDOC.
287    
288            * man/completeStems.Rd: Added an example.
289    
290    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
291    
292            * R/stopwords.R (stopwords): Look up .dat files at every
293            call. Allows users to modify stopword .dat files interactively.
294    
295    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * R/termdocmatrix.R (termFreq): Correct processing of empty
298            documents.
299    
300    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
301    
302            * man/: Updated documentation.
303    
304    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * R/complete.R (completeStems): Completes (heuristically) word
307            stems.
308    
309            * R/termdocmatrix.R (TermDocMatrix2): New modular
310            constructor.
311    
312            * NAMESPACE: Exported termFreq.
313    
314    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
315    
316            * R/reader.R (readDOC): Added MS Word reader (using antiword).
317    
318    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
319    
320            * R/weight.R: Weighting functions for TermDocMatrix.
321    
322    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
325            functions for accessing dimension, column, and row names.
326    
327            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
328    
329    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
330    
331            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
332    
333    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
336    
337    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * R/reader.R (readPDF): Removed manual checks for pdftotext and
340            pdfinfo. The system call gives a warning anyway.
341    
342    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
343    
344            * R/textdoccol.R (asPlain): Conversion from
345            StructuredTextDocuments to PlainTextDocuments.
346    
347    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
350            for accessing term-document matrices.
351    
352            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
353            are installed.
354    
355    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
358            Christian Buchta.
359    
360    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
363    
364    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
365    
366            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
367    
368            * R/reader.R (readPDF): Added PDF reader.
369    
370    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
371    
372            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
373    
374            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
375    
376            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
377    
378            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
379    
380    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
381    
382            * R/distmeasure.R (dissimilarity): Replaced dists call from
383            package cba by new dist call from package proxy.
384    
385    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
386    
387            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
388    
389    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
390    
391            * R/termdocmatrix.R: require() uses the quietly option to suppress
392            loading messages.
393    
394    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
395    
396            * R/dictionary.R: Added dictionary support.
397    
398    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
399    
400            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
401            documents. This simplifies some functions, e.g., asPlain.
402    
403    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
404    
405            * inst/doc/tm.Rnw: Fixed some typos in vignette.
406    
407    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/textdoccol.R (replaceWords): Added method to replace a set of
410            words by a single word. Useful for synonyms.
411    
412    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
415    
416    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
417    
418            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
419            vectors. Thanks to Ariel Maguyon for his error report.
420            (removeSparseTerms): New function to remove columns from a
421            term-document matrix exceeding a sparse factor.
422    
423    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
424    
425            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
426    
427    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
428    
429            * man/sFilter.Rd: Corrected documentation on statement format (use
430            '==' instead of '=').
431    
432    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
433    
434            * R/aobjects.R (StructuredTextDocument): Inherits from
435            TextDocument.
436    
437    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
440            on sparse matrices as proposed by Martin Maechler.
441    
442    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
443    
444            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
445            \pkg{filehash} version makes them deprecated.
446    
447    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
448    
449            * R/termdocmatrix.R (textvector): Stemming is now performed before
450            erasing stopwords.
451            (weightMatrix): Adapted to handle sparse matrices.
452            (TermDocMatrix): Sparse matrix is now efficiently built by
453            direct stepwise insertion of row values into it.
454    
455    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
456    
457            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
458            due to ongoing problems. For our purposes the latter is as useful
459            as the replaced package.
460    
461    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
462    
463            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
464    
465            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
466    
467    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
468    
469            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
470            languages with available stopwords.
471    
472    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
473    
474            * inst/doc/tm.Rnw: Minor corrections in the vignette.
475    
476    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
477    
478            * DESCRIPTION: Update to version 0.2, since a lot of new features
479            have been integrated.
480    
481            * inst/stopwords: Updated existing stopwords and added stopwords
482            for various other languages.
483    
484    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
485    
486            * man/: Updated documentation.
487    
488            * Work/testDb.R: Script to test database stuff.
489    
490            * R/: Fixed various database related bugs. Seems to be rather
491            useable now, i.e., consider as alpha status for now.
492    
493    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
494    
495            * R/: Fixed some bugs related to database support.
496    
497    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
498    
499            * man/: Added a lot of examples to the manuals.
500    
501    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
502    
503            * man/: Updated parts of the documentation.
504    
505            * R/textdoccol.R (asPlain): Added conversion from newsgroup
506            documents to plain text documents.
507    
508    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
509    
510            * R/textdoccol.R: Finished experimental database support. Not yet
511            intensively tested.
512    
513            * R/source.R: Now each source has a default reader.
514    
515            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
516            class anymore.
517    
518            * R/plaintextdoc.R: Custom show method for plain text documents.
519    
520            * R/aobjects.R: Added a class for structured text documents.
521    
522            * R/reader.R: Replaced remaining \code{parser} occurrences with
523            \code{reader}.
524    
525            * R/textdoccol.R (summary): Indent tags.
526    
527            * R/textdoccol.R (removePunctuation): Transform method to remove
528            punctuation marks.
529    
530    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
533            using prescindMeta().
534    
535    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
536    
537            * R/textdoccol.R: Improved database support.
538    
539    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
540    
541            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
542    
543            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
544            language code.
545    
546            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
547            into parserControl argument.
548    
549            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
550    
551    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
552    
553            * Work/tmDataSetup.R: The datasets acq and crude can now be
554            created on the fly.
555    
556            * R/stopwords.R: Introduced a function returning the stopwords for
557            a given language (English, German and French at the moment)
558    
559            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
560            otherwise falls back to Snowball package.
561    
562    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
563    
564            * man/dissimilarity-methods.Rd: Make clear that any method offered
565            by "dists" from package "cba" can be used.
566    
567    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
568    
569            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
570            to Kurt's latex suggestion. Removed points and underscores in
571            variable names for consistent naming.
572    
573            * DESCRIPTION: Update to version 0.1-2.
574    
575            * man/TextRepository.Rd: Fixed bug in documentation.
576    
577    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
578    
579            * DESCRIPTION: Update to version 0.1-1.
580    
581    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
582    
583            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
584            wordStem.
585    
586    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
587    
588            * R/: Changes due to Kurt's review.
589    
590    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
591    
592            * R/: Implemented improvements based upon comments by David
593            Meyer.
594    
595    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
596    
597            * inst/doc/: Rewrote vignette.
598    
599            * man/: Improved documentation.
600    
601    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
602    
603            * man/: Updated documentation.
604    
605            * DESCRIPTION: Changed package name to "tm". Updated version to
606            0.1 for first CRAN release.
607    
608            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
609            list archive example.
610    
611            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
612            archive example.
613    
614            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
615            from (several mails per box) mbox format to (single mail per file)
616            eml format.
617    
618    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
619    
620            * data/crude.rda: Rebuilt.
621    
622            * data/acq.rda: Rebuilt.
623    
624            * R/reader.R: Factored out reader and parser methods from
625            textdoccol.R.
626    
627            * R/source.R: Factored out Source methods from aobjects.R and
628            textdoccol.R.
629            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
630            feeds.
631    
632            * R/textdoccol.R (DirSource): Added support for recursive
633            traversal of directories.
634    
635    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
636    
637            * R/textdoccol.R ([[): Loads the document corpus automatically
638            into memory upon access.
639            (tm_transform, tm_filter): Removed several checks whether the
640            document is already loaded ([[ ensures this now).
641            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
642            mailing list archive.
643    
644    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
645    
646            * R/aobjects.R (TextDocument): Is now a virtual class.
647            (Source): Is now a virtual class.
648    
649    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
650    
651            * R/textdoccol.R (c): Support for an arbitrary number of document
652            collections.
653    
654    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
657            append_meta and remove_meta.
658    
659            * R/textdoccol.R: Removed modify_metadata method.
660    
661            * R/textrepo.R: Removed modify_metadata method.
662    
663            * R/textdoccol.R (remove_meta): Supports removal of document
664            collection metadata and document (= in data frame) metadata.
665    
666    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
667    
668            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
669    
670            * data/crude.rda: Rebuilt.
671    
672            * data/acq.rda: Rebuilt.
673    
674            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
675    
676            * R/textdoccol.R ([): Bug fix for subsetting a document
677            collection's data frame.
678    
679    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
680    
681            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
682            to s_filter.
683    
684            * R/textdoccol.R: Local text documents' metadata can now be copied
685            to a document collection's data frame with prescind_meta.
686    
687    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
688    
689            * R/: Text documents' slot metadata is now accessible in s_filter.
690    
691            * R/: Rewrote s_filter function (has still some restrictions).
692    
693    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
694    
695            * R/: Various fixes in handling metadata.
696    
697            * R/: Added update mechanism for text document collections.
698    
699    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
700    
701            * R/: Merging of document collections now creates a binary tree
702            for reconstructing merged document collections.
703    
704            * R/: Redesign of metadata for document collections.
705    
706    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
707    
708            * R/: Messages now use \code{ngettext}.
709    
710    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
711    
712            * R/: Added functions for modifying and removing metadata.
713    
714    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
715    
716            * man/: Updated some documentation.
717    
718            * R/: Corrected some connection issues.
719    
720            * inst/doc: Worked on the vignette.
721    
722    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
723    
724            * inst/: Added texts and started vignette.
725    
726            * R/: Final changes based upon David's comments.
727    
728    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
729    
730            * NAMESPACE: Corrected exports (generic methods need exportMethods
731            directives!).
732    
733    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
734    
735            * R/: Modified the TextDocCol constructur and various parsers. It
736            is now modular and supports various file formats via plugins (see
737            the new "Source" class).
738    
739    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
740    
741            * man/: Revised documentation after previous code changes.
742    
743    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
744    
745            * R/: Remaining changes as discussed with David.
746    
747    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
748    
749            * R/: Some changes as suggested by David. The rest will follow
750            within the next days.
751    
752    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
753    
754            * man/: Finished documentation.
755    
756    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
757    
758            * man/: Wrote some documentation.
759    
760    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
761    
762            * R/: Further syntactic sugar in form of additional assignment and
763            accessor methods.
764    
765    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
766    
767            * R/: Syntactic sugar in form of "length", "show" and "summary"
768            operators.
769    
770    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
771    
772            * R/: Diverse updates. Mainly on default operators ("[" or "c")
773            and dissimilarities.
774    
775    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
776    
777            * R/: Added similarity functions.
778    
779            * data/: Added english stopwords.
780    
781    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
782    
783            * data/: Examples compiled for new features
784    
785            * R/: Changes due to new structure.
786    
787            * NAMESPACE: Corrected namespace to reflect new structure.
788    
789            * R/termdocmatrix.R: Adapted for new naming scheme.
790    
791    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
792    
793            * R/textdoccol.R: Adapted code for new class structure. Wrote
794            several transform and filter functions operating on text document
795            collections (alias text document databases).
796    
797            * R/aobjects.R: Adapted class structure with inheritance,
798            repositories and additional meta data. Loading files on demand is
799            now possible.
800    
801    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
802    
803            * R/: Some cosmetic cleanups.
804    
805            * inst/: Removed vignette on clustering. That and much more is now
806            described in the JSS paper on text mining. Based upon that
807            article an elaborated vignette will be incorporated in the future.
808    
809    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
810    
811            * R/: Updated generic S4 methods to comply with signature changes
812            in newer versions of R (> 2.3)
813    
814    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
815    
816            * ext/R/importRIS.R: Automatic RIS import is now possible.
817    
818    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
819    
820            * R/textdoccol.R: Added RIS HTML input format.
821    
822    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
823    
824            * R/textdoccol.R: Removed bug that caused invalid text document
825            collections when handling many input files.
826    
827  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
828    
829          * R/textdoccol.R: Restructured and extended file import          * R/textdoccol.R: Restructured and extended file import

Legend:
Removed from v.37  
changed lines
  Added in v.954

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge