[tm] Diff of /pkg/ChangeLog

Old: trunk/R/trunk/ChangeLog, revision 28, Tue Dec 6 13:46:33 2005 UTC
New: pkg/ChangeLog, revision 941, Mon Apr 27 15:36:43 2009 UTC
1    2009-04-27  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/matrix.R: Create two distinct classes for term-document and
4            document-term matrices.
5    
6    2009-04-26  Ingo Feinerer  <feinerer@logic.at>
7    
8            * R/termdocmatrix.R: No longer use Matrix package. This reduces
9            package start-up time significantly.
10    
11    2009-04-11  Ingo Feinerer  <feinerer@logic.at>
12    
13            * inst/doc/tm.Rnw: Fix code/documentation mismatch.
14    
15    2009-04-04  Ingo Feinerer  <feinerer@logic.at>
16    
17            * R/transform.R (tmReduce): Combine multiple maps into one
18            transformation.
19    
20    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
21    
22            * R/weight.R: Remove weightLogical since it does not return a
23            dgCMatrix.
24    
25            * R/termdocmatrix.R: Removed TermDocMatrix. Use DocumentTermMatrix
26            or TermDocumentMatrix instead.
27    
28    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
29    
30            * inst/doc/extensions.Rnw: Finished vignette.
31    
32    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
33    
34            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
35            DocumentTermMatrix representations.
36    
37    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
38    
39            * R/reader.R (readXML): New reader for arbitrary XML files.
40    
41    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
42    
43            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
44            (XMLSource): New XMLSource class for arbitrary XML files.
45            (Source): New slot Vectorized.
46    
47    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
48    
49            * R/reader.R (readTabular): Experimental reader for tabular data
50            structures which can be customized via user-defined mappings.
51    
52            * R/reader.R: Always use UTC time zone.
53    
54            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
55    
56    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
57    
58            * R/reader.R (readDOC): Options can be passed over to antiword.
59    
60            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
61            pdftotext.
62    
63    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
64    
65            * R/source.R (DirSource): Add pattern and ignore.case arguments
66            which are internally passed over to list.files().
67    
68    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
69    
70            * inst/doc/tm.Rnw: Suppress pointless loading message.
71    
72    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
73    
74            * DESCRIPTION: Speed up package loading (via moving packages not
75            strictly necessary for normal operation to Suggests instead of
76            Depends).
77    
78    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
79    
80            * R/reader.R (readNewsgroup): The date format is now configurable.
81    
82    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
83    
84            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
85    
86    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
87    
88            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
89    
90    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
91    
92            * R/source.R (DataframeSource): New source class for data frames.
93    
94            * R/source.R: Fixed non-standard call evaluation.
95    
96    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
97    
98            * R/source.R (URISource): New source class for a single document.
99    
100    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
101    
102            * R/source.R: Refactoring.
103    
104    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
105    
106            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
107            Rmpi installations more gracefully.
108    
109    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
110    
111            * R/source.R (Source): Add Length slot.
112    
113    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
114    
115            * R/AAA.R: Unify duplicated .onLoad function.
116    
117    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
118    
119            * DESCRIPTION (Suggests): Added Rmpi.
120    
121    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
122    
123            * R/source.R (getElem): Fix 'no visible binding' warning.
124    
125            * man/WeightFunction.Rd: Fix signature.
126    
127    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
128    
129            * R/weight.R: Introduce name abbreviations for weighting functions.
130    
131    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
132    
133            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
134    
135            * R/cluster.R: Provide convenience functions for using an MPI
136            cluster.
137    
138            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
139            available.
140    
141            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
142            available.
143    
144    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
145    
146            * R/textdoccol.R (lapply): Removed debug print out.
147    
148    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
149    
150            * R/reader.R (readRCV1): Improved meta data extraction from
151            Reuters Corpus Volume 1 documents.
152    
153    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * R/transform.R: Ensure that all mappings preserve multiline
156            structures.
157    
158    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
159    
160            * R/filter.R: Every filter now has an attribute indicating whether
161            it should be applied at document level (doclevel).
162    
163            * R/textdoccol.R (tmFilter): Set searchFullText as new default
164            filter.
165    
166    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
167    
168            * R/transform.R (replacePatterns): Replaced removeWords by
169            replacePatterns. Suggested by Christian Buchta.
170    
171            * R/textdoccol.R (inspect): Improved formatting.
172    
173    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
174    
175            * inst/CITATION: Updated JSS article information.
176    
177            * R/textdoccol.R (setAs): Added coerce method from list to
178            corpus.
179    
180            * R/meta.R (meta): Improved meta data handling.
181    
182    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
183    
184            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
185            Christian Buchta.
186    
187            * inst/CITATION: Added template to include JSS article reference.
188    
189    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
190    
191            * R/textdoccol.R (tmMap): Introduced lazy mapping.
192    
193            * R/source.R: Added VectorSource.
194    
195    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
196    
197            * man/: Language codes should be in ISO 639-1 format.
198    
199            * R/textdoccol.R (asPlain): Preserve local meta data.
200    
201    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
202    
203            * R/textdoccol.R (writeCorpus): Function for writing a corpus
204            containing plain text documents to disk.
205    
206    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
207    
208            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
209            always set correctly.
210    
211            * R/textdoccol.R: Set load = TRUE as default for load on demand
212            since in most cases this is the wanted behaviour.
213    
214    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
215    
216            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
217    
218            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
219    
220    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * R/meta.R (meta): New function for consistent access to meta data
223            of document collections, repositories, and texts.
224    
225    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/: Better support for encodings.
228    
229    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
230    
231            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
232            selection when no reader argument is given.
233    
234    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
235    
236            * R/source.R (CSVSource): Now uses read.csv instead of scan
237            internally.
238    
239    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
240    
241            * R/reader.R (getReaders): Returns available reader functions.
242    
243            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
244            as default.
245    
246    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
247    
248            * R/stopwords.R (stopwords): Shortened code, removed codetools
249            variable warnings.
250    
251            * man/: Documentation for showMeta, added an example for tmMap.
252    
253            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
254            some minor typos fixed.
255    
256    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
257    
258            * R/aobjects.R (showMeta): Added method for pretty printing a
259            text document's meta data.
260    
261    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * R/textdoccol.R (TextDocCol): Better handling of empty
264            arguments.
265    
266            * NAMESPACE: Exported readDOC.
267    
268            * man/completeStems.Rd: Added an example.
269    
270    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
271    
272            * R/stopwords.R (stopwords): Look up .dat files at every
273            call. Allows users to modify stopword .dat files interactively.
274    
275    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * R/termdocmatrix.R (termFreq): Correct processing of empty
278            documents.
279    
280    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
281    
282            * man/: Updated documentation.
283    
284    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
285    
286            * R/complete.R (completeStems): Completes (heuristically) word
287            stems.
288    
289            * R/termdocmatrix.R (TermDocMatrix2): New modular
290            constructor.
291    
292            * NAMESPACE: Exported termFreq.
293    
294    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
295    
296            * R/reader.R (readDOC): Added MS Word reader (using antiword).
297    
298    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
299    
300            * R/weight.R: Weighting functions for TermDocMatrix.
301    
302    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
303    
304            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
305            functions for accessing dimension, column, and row names.
306    
307            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
308    
309    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
312    
313    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
314    
315            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
316    
317    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/reader.R (readPDF): Removed manual checks for pdftotext and
320            pdfinfo. The system call gives a warning anyway.
321    
322    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
323    
324            * R/textdoccol.R (asPlain): Conversion from
325            StructuredTextDocuments to PlainTextDocuments.
326    
327    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
328    
329            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
330            for accessing term-document matrices.
331    
332            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
333            are installed.
334    
335    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
336    
337            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
338            Christian Buchta.
339    
340    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
341    
342            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
343    
344    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
345    
346            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
347    
348            * R/reader.R (readPDF): Added PDF reader.
349    
350    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
351    
352            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
353    
354            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
355    
356            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
357    
358            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
359    
360    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * R/distmeasure.R (dissimilarity): Replaced dists call from
363            package cba by new dist call from package proxy.
364    
365    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
368    
369    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
370    
371            * R/termdocmatrix.R: require() uses the quietly option to suppress
372            loading messages.
373    
374    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
375    
376            * R/dictionary.R: Added dictionary support.
377    
378    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
379    
380            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
381            documents. This simplifies some functions, e.g., asPlain.
382    
383    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
384    
385            * inst/doc/tm.Rnw: Fixed some typos in vignette.
386    
387    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
388    
389            * R/textdoccol.R (replaceWords): Added method to replace a set of
390            words by a single word. Useful for synonyms.
391    
392    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
393    
394            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
395    
396    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
397    
398            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
399            vectors. Thanks to Ariel Maguyon for his error report.
400            (removeSparseTerms): New function to remove columns from a
401            term-document matrix exceeding a sparse factor.
402    
403    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
404    
405            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
406    
407    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * man/sFilter.Rd: Corrected documentation on statement format (use
410            '==' instead of '=').
411    
412    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * R/aobjects.R (StructuredTextDocument): Inherits from
415            TextDocument.
416    
417    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
418    
419            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
420            on sparse matrices as proposed by Martin Maechler.
421    
422    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
423    
424            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
425            \pkg{filehash} version deprecates them.
426    
427    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
428    
429            * R/termdocmatrix.R (textvector): Stemming is now performed before
430            erasing stopwords.
431            (weightMatrix): Adapted to handle sparse matrices.
432            (TermDocMatrix): Sparse matrix is now efficiently built by
433            direct stepwise insertion of row values into it.
434    
435    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
436    
437            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
438            due to ongoing problems. For our purposes the latter is as useful
439            as the replaced package.
440    
441    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
442    
443            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
444    
445            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
446    
447    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
448    
449            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
450            languages with available stopwords.
451    
452    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
453    
454            * inst/doc/tm.Rnw: Minor corrections in the vignette.
455    
456    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
457    
458            * DESCRIPTION: Update to version 0.2, since a lot of new features
459            have been integrated.
460    
461            * inst/stopwords: Updated existing stopwords and added stopwords
462            for various other languages.
463    
464    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
465    
466            * man/: Updated documentation.
467    
468            * Work/testDb.R: Script to test database stuff.
469    
470            * R/: Fixed various database related bugs. Seems to be rather
471            useable now, i.e., consider as alpha status for now.
472    
473    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
474    
475            * R/: Fixed some bugs related to database support.
476    
477    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
478    
479            * man/: Added a lot of examples to the manuals.
480    
481    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
482    
483            * man/: Updated parts of the documentation.
484    
485            * R/textdoccol.R (asPlain): Added conversion from newsgroup
486            documents to plain text documents.
487    
488    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
489    
490            * R/textdoccol.R: Finished experimental database support. Not yet
491            intensively tested.
492    
493            * R/source.R: Now each source has a default reader.
494    
495            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
496            class anymore.
497    
498            * R/plaintextdoc.R: Custom show method for plain text documents.
499    
500            * R/aobjects.R: Added a class for structured text documents.
501    
502            * R/reader.R: Replaced remaining \code{parser} occurrences with
503            \code{reader}.
504    
505            * R/textdoccol.R (summary): Indent tags.
506    
507            * R/textdoccol.R (removePunctuation): Transform method to remove
508            punctuation marks.
509    
510    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
511    
512            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
513            using prescindMeta().
514    
515    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
516    
517            * R/textdoccol.R: Improved database support.
518    
519    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
520    
521            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
522    
523            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
524            language code.
525    
526            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
527            into parserControl argument.
528    
529            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
530    
531    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
532    
533            * Work/tmDataSetup.R: The datasets acq and crude can now be
534            created on the fly.
535    
536            * R/stopwords.R: Introduced a function returning the stopwords for
537            a given language (English, German and French at the moment)
538    
539            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
540            otherwise falls back to Snowball package.
541    
542    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
543    
544            * man/dissimilarity-methods.Rd: Make clear that any method offered
545            by "dists" from package "cba" can be used.
546    
547    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
550            to Kurt's latex suggestion. Removed points and underscores in
551            variable names for consistent naming.
552    
553            * DESCRIPTION: Update to version 0.1-2.
554    
555            * man/TextRepository.Rd: Fixed bug in documentation.
556    
557    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
558    
559            * DESCRIPTION: Update to version 0.1-1.
560    
561    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
562    
563            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
564            wordStem.
565    
566    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
567    
568            * R/: Changes due to Kurt's review.
569    
570    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
571    
572            * R/: Implemented improvements based upon comments by David
573            Meyer.
574    
575    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
576    
577            * inst/doc/: Rewrote vignette.
578    
579            * man/: Improved documentation.
580    
581    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
582    
583            * man/: Updated documentation.
584    
585            * DESCRIPTION: Changed package name to "tm". Updated version to
586            0.1 for first CRAN release.
587    
588            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
589            list archive example.
590    
591            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
592            archive example.
593    
594            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
595            from (several mails per box) mbox format to (single mail per file)
596            eml format.
597    
598    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
599    
600            * data/crude.rda: Rebuilt.
601    
602            * data/acq.rda: Rebuilt.
603    
604            * R/reader.R: Factored out reader and parser methods from
605            textdoccol.R.
606    
607            * R/source.R: Factored out Source methods from aobjects.R and
608            textdoccol.R.
609            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
610            feeds.
611    
612            * R/textdoccol.R (DirSource): Added support for recursive
613            traversal of directories.
614    
615    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
616    
617            * R/textdoccol.R ([[): Loads the document corpus automatically
618            into memory upon access.
619            (tm_transform, tm_filter): Removed several checks whether the
620            document is already loaded ([[ ensures this now).
621            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
622            mailing list archive.
623    
624    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
625    
626            * R/aobjects.R (TextDocument): Is now a virtual class.
627            (Source): Is now a virtual class.
628    
629    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
630    
631            * R/textdoccol.R (c): Support for an arbitrary number of document
632            collections.
633    
634    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
635    
636            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
637            append_meta and remove_meta.
638    
639            * R/textdoccol.R: Removed modify_metadata method.
640    
641            * R/textrepo.R: Removed modify_metadata method.
642    
643            * R/textdoccol.R (remove_meta): Supports removal of document
644            collection metadata and document (= in data frame) metadata.
645    
646    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
647    
648            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
649    
650            * data/crude.rda: Rebuilt.
651    
652            * data/acq.rda: Rebuilt.
653    
654            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
655    
656            * R/textdoccol.R ([): Bug fix for subsetting a document
657            collection's data frame.
658    
659    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
660    
661            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
662            to s_filter.
663    
664            * R/textdoccol.R: Local text documents' metadata can now be copied
665            to a document collection's data frame with prescind_meta.
666    
667    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
668    
669            * R/: Text documents' slot metadata is now accessible in s_filter.
670    
671            * R/: Rewrote s_filter function (has still some restrictions).
672    
673    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
674    
675            * R/: Various fixes in handling metadata.
676    
677            * R/: Added update mechanism for text document collections.
678    
679    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
680    
681            * R/: Merging of document collections now creates a binary tree
682            for reconstructing merged document collections.
683    
684            * R/: Redesign of metadata for document collections.
685    
686    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
687    
688            * R/: Messages now use \code{ngettext}.
689    
690    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
691    
692            * R/: Added functions for modifying and removing metadata.
693    
694    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
695    
696            * man/: Updated some documentation.
697    
698            * R/: Corrected some connection issues.
699    
700            * inst/doc: Worked on the vignette.
701    
702    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
703    
704            * inst/: Added texts and started vignette.
705    
706            * R/: Final changes based upon David's comments.
707    
708    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
709    
710            * NAMESPACE: Corrected exports (generic methods need exportMethods
711            directives!).
712    
713    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * R/: Modified the TextDocCol constructor and various parsers. It
716            is now modular and supports various file formats via plugins (see
717            the new "Source" class).
718    
719    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
720    
721            * man/: Revised documentation after previous code changes.
722    
723    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
724    
725            * R/: Remaining changes as discussed with David.
726    
727    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
728    
729            * R/: Some changes as suggested by David. The rest will follow
730            within the next days.
731    
732    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
733    
734            * man/: Finished documentation.
735    
736    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
737    
738            * man/: Wrote some documentation.
739    
740    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
741    
742            * R/: Further syntactic sugar in form of additional assignment and
743            accessor methods.
744    
745    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
746    
747            * R/: Syntactic sugar in form of "length", "show" and "summary"
748            operators.
749    
750    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
751    
752            * R/: Diverse updates. Mainly on default operators ("[" or "c")
753            and dissimilarities.
754    
755    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
756    
757            * R/: Added similarity functions.
758    
759            * data/: Added english stopwords.
760    
761    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
762    
763            * data/: Examples compiled for new features
764    
765            * R/: Changes due to new structure.
766    
767            * NAMESPACE: Corrected namespace to reflect new structure.
768    
769            * R/termdocmatrix.R: Adapted for new naming scheme.
770    
771    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
772    
773            * R/textdoccol.R: Adapted code for new class structure. Wrote
774            several transform and filter functions operating on text document
775            collections (alias text document databases).
776    
777            * R/aobjects.R: Adapted class structure with inheritance,
778            repositories and additional meta data. Loading files on demand is
779            now possible.
780    
781    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
782    
783            * R/: Some cosmetic cleanups.
784    
785            * inst/: Removed vignette on clustering. That and much more is now
786            described in the JSS paper on text mining. Based upon that
787            article an elaborated vignette will be incorporated in the future.
788    
789    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
790    
791            * R/: Updated generic S4 methods to comply with signature changes
792            in newer versions of R (> 2.3)
793    
794    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
795    
796            * ext/R/importRIS.R: Automatic RIS import is now possible.
797    
798    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
799    
800            * R/textdoccol.R: Added RIS HTML input format.
801    
802    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
803    
804            * R/textdoccol.R: Removed bug that caused invalid text document
805            collections when handling many input files.
806    
807    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
808    
809            * R/textdoccol.R: Restructured and extended file import
810            mechanism.
811    
812            * inst/doc/clustering.Rnw: Adapted vignette for use with
813            ReutNews.rda
814    
815            * man/ReutNews.Rd: Documentation for ReutNews.rda
816    
817            * data/ReutNews.rda: A tiny Reuters21578 example data set.
818    
819    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
820    
821            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
822            clustering facilities of this package.
823    
824    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
825    
826            * R/aobjects.R: Changed package document structure to avoid class
827            dependency problems.
828    
829    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
830    
831            * Wrote a script for the ModLewis Split for the Reuters-21578 XML
832            data set.
833    
834            * Finished documentation and reordered directory structure. Now "R
835            CMD check textmin" works without errors.
836    
