[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 26, Sat Dec 3 15:20:17 2005 UTC
pkg/ChangeLog revision 911, Sun Mar 22 17:55:16 2009 UTC
# Line 1  Line 1 
1    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
4            (XMLSource): New XMLSource class for arbitrary XML files.
5            (Source): New slot Vectorized.
6    
7    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
8    
9            * R/reader.R (readCustom): Experimental reader which can be
10            customized via user-defined mappings.
11    
12            * R/reader.R: Always use UTC time zone.
13    
14            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
15    
16    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
17    
18            * R/reader.R (readDOC): Options can be passed over to antiword.
19    
20            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
21            pdftotext.
22    
23    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
24    
25            * R/source.R (DirSource): Add pattern and ignore.case arguments
26            which are internally passed over to list.files().
27    
28    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
29    
30            * inst/doc/tm.Rnw: Suppress pointless loading message.
31    
32    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
33    
34            * DESCRIPTION: Speed up package loading (via moving packages not
35            strictly necessary for normal operation to Suggests instead of
36            Depends).
37    
38    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
39    
40            * R/reader.R (readNewsgroup): The date format is now configurable.
41    
42    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
43    
44            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
45    
46    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
47    
48            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
49    
50    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
51    
52            * R/source.R (DataframeSource): New source class for data frames.
53    
54            * R/source.R: Fixed non-standard call evaluation.
55    
56    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
57    
58            * R/source.R (URISource): New source class for a single document.
59    
60    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
61    
62            * R/source.R: Refactoring.
63    
64    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
65    
66            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
67            Rmpi installations more gracefully.
68    
69    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
70    
71            * R/source.R (Source): Add Length slot.
72    
73    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/AAA.R: Unify duplicated .onLoad function.
76    
77    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
78    
79            * DESCRIPTION (Suggests): Added Rmpi.
80    
81    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
82    
83            * R/source.R (getElem): Fix 'no visible binding' warning.
84    
85            * man/WeightFunction.Rd: Fix signature.
86    
87    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
88    
89            * R/weight.R: Introduce name abbreviations for weighting functions.
90    
91    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
92    
93            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
94    
95            * R/cluster.R: Provide convenience functions for using an MPI
96            cluster.
97    
98            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
99            available.
100    
101            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
102            available.
103    
104    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
105    
106            * R/textdoccol.R (lapply): Removed debug print out.
107    
108    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
109    
110            * R/reader.R (readRCV1): Improved meta data extraction from
111            Reuters Corpus Volume 1 documents.
112    
113    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
114    
115            * R/transform.R: Ensure that all mappings preserve multiline
116            structures.
117    
118    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
119    
120            * R/filter.R: Every filter now has an attribute indicating whether
121            it should be applied at the document level (doclevel).
122    
123            * R/textdoccol.R (tmFilter): Set searchFullText as new default
124            filter.
125    
126    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
127    
128            * R/transform.R (replacePatterns): Replaced removeWords by
129            replacePatterns. Suggested by Christian Buchta.
130    
131            * R/textdoccol.R (inspect): Improved formatting.
132    
133    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
134    
135            * inst/CITATION: Updated JSS article information.
136    
137            * R/textdoccol.R (setAs): Added coerce method from list to
138            corpus.
139    
140            * R/meta.R (meta): Improved meta data handling.
141    
142    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
143    
144            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
145            Christian Buchta.
146    
147            * inst/CITATION: Added template to include JSS article reference.
148    
149    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
150    
151            * R/textdoccol.R (tmMap): Introduced lazy mapping.
152    
153            * R/source.R: Added VectorSource.
154    
155    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
156    
157            * man/: Language codes should be in ISO 639-1 format.
158    
159            * R/textdoccol.R (asPlain): Preserve local meta data.
160    
161    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
162    
163            * R/textdoccol.R (writeCorpus): Function for writing a corpus
164            containing plain text documents to disk.
165    
166    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
167    
168            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
169            always set correctly.
170    
171            * R/textdoccol.R: Set load = TRUE as default for load on demand
172            since in most cases this is the wanted behaviour.
173    
174    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
175    
176            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
177    
178            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
179    
180    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
181    
182            * R/meta.R (meta): New function for consistent access to meta data
183            of document collections, repositories, and texts.
184    
185    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
186    
187            * R/: Better support for encodings.
188    
189    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
190    
191            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
192            selection when no reader argument is given.
193    
194    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
195    
196            * R/source.R (CSVSource): Now uses read.csv instead of scan
197            internally.
198    
199    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
200    
201            * R/reader.R (getReaders): Returns available reader functions.
202    
203            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
204            as default.
205    
206    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
207    
208            * R/stopwords.R (stopwords): Shortened code, removed codetools
209            variable warnings.
210    
211            * man/: Documentation for showMeta, added an example for tmMap.
212    
213            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
214            some minor typos fixed.
215    
216    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
217    
218            * R/aobjects.R (showMeta): Added method for pretty printing a
219            text document's meta data.
220    
221    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
222    
223            * R/textdoccol.R (TextDocCol): Better handling of empty
224            arguments.
225    
226            * NAMESPACE: Exported readDOC.
227    
228            * man/completeStems.Rd: Added an example.
229    
230    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
231    
232            * R/stopwords.R (stopwords): Look up .dat files at every
233            call. Allows users to modify stopword .dat files interactively.
234    
235    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
236    
237            * R/termdocmatrix.R (termFreq): Correct processing of empty
238            documents.
239    
240    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
241    
242            * man/: Updated documentation.
243    
244    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * R/complete.R (completeStems): Completes (heuristically) word
247            stems.
248    
249            * R/termdocmatrix.R (TermDocMatrix2): New modular
250            constructor.
251    
252            * NAMESPACE: Exported termFreq.
253    
254    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
255    
256            * R/reader.R (readDOC): Added MS Word reader (using antiword).
257    
258    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
259    
260            * R/weight.R: Weighting functions for TermDocMatrix.
261    
262    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
263    
264            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
265            functions for accessing dimension, column, and row names.
266    
267            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
268    
269    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
270    
271            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
272    
273    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
276    
277    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
278    
279            * R/reader.R (readPDF): Removed manual checks for pdftotext and
280            pdfinfo. The system call gives a warning anyway.
281    
282    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
283    
284            * R/textdoccol.R (asPlain): Conversion from
285            StructuredTextDocuments to PlainTextDocuments.
286    
287    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
288    
289            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
290            for accessing term-document matrices.
291    
292            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
293            are installed.
294    
295    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
298            Christian Buchta.
299    
300    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
301    
302            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
303    
304    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
307    
308            * R/reader.R (readPDF): Added PDF reader.
309    
310    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
311    
312            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
313    
314            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
315    
316            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
317    
318            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
319    
320    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
321    
322            * R/distmeasure.R (dissimilarity): Replaced dists call from
323            package cba by new dist call from package proxy.
324    
325    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
326    
327            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
328    
329    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
330    
331            * R/termdocmatrix.R: require() uses the quietly option to suppress
332            loading messages.
333    
334    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * R/dictionary.R: Added dictionary support.
337    
338    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
339    
340            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
341            documents. This simplifies some functions, e.g., asPlain.
342    
343    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * inst/doc/tm.Rnw: Fixed some typos in vignette.
346    
347    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/textdoccol.R (replaceWords): Added method to replace a set of
350            words by a single word. Useful for synonyms.
351    
352    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
355    
356    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
359            vectors. Thanks to Ariel Maguyon for his error report.
360            (removeSparseTerms): New function to remove columns from a
361            term-document matrix exceeding a sparse factor.
362    
363    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
364    
365            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
366    
367    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
368    
369            * man/sFilter.Rd: Corrected documentation on statement format (use
370            '==' instead of '=').
371    
372    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
373    
374            * R/aobjects.R (StructuredTextDocument): Inherits from
375            TextDocument.
376    
377    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
378    
379            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
380            on sparse matrices as proposed by Martin Maechler.
381    
382    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
383    
384            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
385            \pkg{filehash} version makes them deprecated.
386    
387    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
388    
389            * R/termdocmatrix.R (textvector): Stemming is now performed before
390            erasing stopwords.
391            (weightMatrix): Adapted to handle sparse matrices.
392            (TermDocMatrix): Sparse matrix is now efficiently built by
393            direct stepwise insertion of row values into it.
394    
395    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
396    
397            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
398            due to ongoing problems. For our purposes the latter is as useful
399            as the replaced package.
400    
401    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
402    
403            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
404    
405            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
406    
407    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
408    
409            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
410            languages with available stopwords.
411    
412    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * inst/doc/tm.Rnw: Minor corrections in the vignette.
415    
416    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
417    
418            * DESCRIPTION: Update to version 0.2, since a lot of new features
419            have been integrated.
420    
421            * inst/stopwords: Updated existing stopwords and added stopwords
422            for various other languages.
423    
424    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
425    
426            * man/: Updated documentation.
427    
428            * Work/testDb.R: Script to test database stuff.
429    
430            * R/: Fixed various database related bugs. Seems to be rather
431            useable now, i.e., consider as alpha status for now.
432    
433    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
434    
435            * R/: Fixed some bugs related to database support.
436    
437    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * man/: Added a lot of examples to the manuals.
440    
441    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
442    
443            * man/: Updated parts of the documentation.
444    
445            * R/textdoccol.R (asPlain): Added conversion from newsgroup
446            documents to plain text documents.
447    
448    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
449    
450            * R/textdoccol.R: Finished experimental database support. Not yet
451            intensively tested.
452    
453            * R/source.R: Now each source has a default reader.
454    
455            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
456            class anymore.
457    
458            * R/plaintextdoc.R: Custom show method for plain text documents.
459    
460            * R/aobjects.R: Added a class for structured text documents.
461    
462            * R/reader.R: Replaced remaining \code{parser} occurrences with
463            \code{reader}.
464    
465            * R/textdoccol.R (summary): Indent tags.
466    
467            * R/textdoccol.R (removePunctuation): Transform method to remove
468            punctuation marks.
469    
470    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
471    
472            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
473            using prescindMeta().
474    
475    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
476    
477            * R/textdoccol.R: Improved database support.
478    
479    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
480    
481            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
482    
483            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
484            language code.
485    
486            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
487            into parserControl argument.
488    
489            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
490    
491    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
492    
493            * Work/tmDataSetup.R: The datasets acq and crude can now be
494            created on the fly.
495    
496            * R/stopwords.R: Introduced a function returning the stopwords for
497            a given language (English, German and French at the moment)
498    
499            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
500            otherwise falls back to Snowball package.
501    
502    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
503    
504            * man/dissimilarity-methods.Rd: Make clear that any method offered
505            by "dists" from package "cba" can be used.
506    
507    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
508    
509            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
510            to Kurt's latex suggestion. Removed points and underscores in
511            variable names for consistent naming.
512    
513            * DESCRIPTION: Update to version 0.1-2.
514    
515            * man/TextRepository.Rd: Fixed bug in documentation.
516    
517    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
518    
519            * DESCRIPTION: Update to version 0.1-1.
520    
521    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
522    
523            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
524            wordStem.
525    
526    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
527    
528            * R/: Changes due to Kurt's review.
529    
530    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * R/: Implemented improvements based upon comments by David
533            Meyer.
534    
535    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
536    
537            * inst/doc/: Rewrote vignette.
538    
539            * man/: Improved documentation.
540    
541    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
542    
543            * man/: Updated documentation.
544    
545            * DESCRIPTION: Changed package name to "tm". Updated version to
546            0.1 for first CRAN release.
547    
548            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
549            list archive example.
550    
551            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
552            archive example.
553    
554            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
555            from (several mails per box) mbox format to (single mail per file)
556            eml format.
557    
558    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
559    
560            * data/crude.rda: Rebuilt.
561    
562            * data/acq.rda: Rebuilt.
563    
564            * R/reader.R: Factored out reader and parser methods from
565            textdoccol.R.
566    
567            * R/source.R: Factored out Source methods from aobjects.R and
568            textdoccol.R.
569            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
570            feeds.
571    
572            * R/textdoccol.R (DirSource): Added support for recursive
573            traversal of directories.
574    
575    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
576    
577            * R/textdoccol.R ([[): Loads the document corpus automatically
578            into memory upon access.
579            (tm_transform, tm_filter): Removed several checks whether the
580            document is already loaded ([[ ensures this now).
581            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
582            mailing list archive.
583    
584    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
585    
586            * R/aobjects.R (TextDocument): Is now a virtual class.
587            (Source): Is now a virtual class.
588    
589    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
590    
591            * R/textdoccol.R (c): Support for an arbitrary number of document
592            collections.
593    
594    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
595    
596            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
597            append_meta and remove_meta.
598    
599            * R/textdoccol.R: Removed modify_metadata method.
600    
601            * R/textrepo.R: Removed modify_metadata method.
602    
603            * R/textdoccol.R (remove_meta): Supports removal of document
604            collection metadata and document (= in data frame) metadata.
605    
606    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
607    
608            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
609    
610            * data/crude.rda: Rebuilt.
611    
612            * data/acq.rda: Rebuilt.
613    
614            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
615    
616            * R/textdoccol.R ([): Bug fix for subsetting a document
617            collection's data frame.
618    
619    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
620    
621            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
622            to s_filter.
623    
624            * R/textdoccol.R: Local text documents' metadata can now be copied
625            to a document collection's data frame with prescind_meta.
626    
627    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
628    
629            * R/: Text documents' slot metadata is now accessible in s_filter.
630    
631            * R/: Rewrote s_filter function (has still some restrictions).
632    
633    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
634    
635            * R/: Various fixes in handling metadata.
636    
637            * R/: Added update mechanism for text document collections.
638    
639    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
640    
641            * R/: Merging of document collections now creates a binary tree
642            for reconstructing merged document collections.
643    
644            * R/: Redesign of metadata for document collections.
645    
646    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
647    
648            * R/: Messages now use \code{ngettext}.
649    
650    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
651    
652            * R/: Added functions for modifying and removing metadata.
653    
654    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * man/: Updated some documentation.
657    
658            * R/: Corrected some connection issues.
659    
660            * inst/doc: Worked on the vignette.
661    
662    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
663    
664            * inst/: Added texts and started vignette.
665    
666            * R/: Final changes based upon David's comments.
667    
668    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
669    
670            * NAMESPACE: Corrected exports (generic methods need exportMethods
671            directives!).
672    
673    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
674    
675            * R/: Modified the TextDocCol constructor and various parsers. It
676            is now modular and supports various file formats via plugins (see
677            the new "Source" class).
678    
679    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
680    
681            * man/: Revised documentation after previous code changes.
682    
683    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
684    
685            * R/: Remaining changes as discussed with David.
686    
687    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
688    
689            * R/: Some changes as suggested by David. The rest will follow
690            within the next days.
691    
692    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
693    
694            * man/: Finished documentation.
695    
696    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
697    
698            * man/: Wrote some documentation.
699    
700    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
701    
702            * R/: Further syntactic sugar in form of additional assignment and
703            accessor methods.
704    
705    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
706    
707            * R/: Syntactic sugar in form of "length", "show" and "summary"
708            operators.
709    
710    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
711    
712            * R/: Diverse updates. Mainly on default operators ("[" or "c")
713            and dissimilarities.
714    
715    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
716    
717            * R/: Added similarity functions.
718    
719            * data/: Added english stopwords.
720    
721    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
722    
723            * data/: Examples compiled for new features
724    
725            * R/: Changes due to new structure.
726    
727            * NAMESPACE: Corrected namespace to reflect new structure.
728    
729            * R/termdocmatrix.R: Adapted for new naming scheme.
730    
731    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
732    
733            * R/textdoccol.R: Adapted code for new class structure. Wrote
734            several transform and filter functions operating on text document
735            collections (alias text document databases).
736    
737            * R/aobjects.R: Adapted class structure with inheritance,
738            repositories and additional meta data. Loading files on demand is
739            now possible.
740    
741    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
742    
743            * R/: Some cosmetic cleanups.
744    
745            * inst/: Removed vignette on clustering. That and much more is now
746            described in the JSS paper on text mining. Based upon that
747            article an elaborated vignette will be incorporated in the future.
748    
749    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
750    
751            * R/: Updated generic S4 methods to comply with signature changes
752            in newer versions of R (> 2.3)
753    
754    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
755    
756            * ext/R/importRIS.R: Automatic RIS import is now possible.
757    
758    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
759    
760            * R/textdoccol.R: Added RIS HTML input format.
761    
762    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
763    
764            * R/textdoccol.R: Removed bug that caused invalid text document
765            collections when handling many input files.
766    
767    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
768    
769            * R/textdoccol.R: Restructured and extended file import
770            mechanism.
771    
772            * inst/doc/clustering.Rnw: Adapted vignette for use with
773            ReutNews.rda
774    
775            * man/ReutNews.Rd: Documentation for ReutNews.rda
776    
777            * data/ReutNews.rda: A tiny Reuters21578 example data set.
778    
779    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
780    
781            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
782            clustering facilities of this package.
783    
784    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
785    
786            * R/aobjects.R: Changed package document structure to avoid class
787            dependency problems.
788    
789    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
790    
791            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
792            data set.
793    
794            *  Finished documentation and reordered directory structure. Now "R
795            CMD check textmin" works without errors.
796    
797    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
798    
799            * src/: Various splits can now be easily created for the
800            Reuters21578 data set.
801    
802    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
803    
804            * Updated documentation
