SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC pkg/ChangeLog revision 918, Fri Mar 27 13:45:36 2009 UTC
# Line 1  Line 1 
1    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
4            DocumentTermMatrix representations.
5    
6    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
7    
8            * R/reader.R (readXML): New reader for arbitrary XML files.
9    
10    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
11    
12            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
13            (XMLSource): New XMLSource class for arbitrary XML files.
14            (Source): New slot Vectorized.
15    
16    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
17    
18            * R/reader.R (readCustom): Experimental reader which can be
19            customized via user-defined mappings.
20    
21            * R/reader.R: Always use UTC time zone.
22    
23            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
24    
25    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
26    
27            * R/reader.R (readDOC): Options can be passed over to antiword.
28    
29            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
30            pdftotext.
31    
32    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
33    
34            * R/source.R (DirSource): Add pattern and ignore.case arguments
35            which are internally passed over to list.files().
36    
37    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
38    
39            * inst/doc/tm.Rnw: Suppress pointless loading message.
40    
41    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
42    
43            * DESCRIPTION: Speed up package loading (via moving packages not
44            strictly necessary for normal operation to Suggests instead of
45            Depends).
46    
47    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
48    
49            * R/reader.R (readNewsgroup): The date format is now configurable.
50    
51    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
52    
53            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
54    
55    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
58    
59    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
60    
61            * R/source.R (DataframeSource): New source class for data frames.
62    
63            * R/source.R: Fixed non-standard call evaluation.
64    
65    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
66    
67            * R/source.R (URISource): New source class for a single document.
68    
69    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
70    
71            * R/source.R: Refactoring.
72    
73    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
76            Rmpi installations more gracefully.
77    
78    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
79    
80            * R/source.R (Source): Add Length slot.
81    
82    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
83    
84            * R/AAA.R: Unify duplicated .onLoad function.
85    
86    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
87    
88            * DESCRIPTION (Suggests): Added Rmpi.
89    
90    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
91    
92            * R/source.R (getElem): Fix 'no visible binding' warning.
93    
94            * man/WeightFunction.Rd: Fix signature.
95    
96    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
97    
98            * R/weight.R: Introduce name abbreviations for weighting functions.
99    
100    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
101    
102            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
103    
104            * R/cluster.R: Provide convenience functions for using a MPI
105            cluster.
106    
107            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
108            available.
109    
110            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
111            available.
112    
113    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
114    
115            * R/textdoccol.R (lapply): Removed debug print out.
116    
117    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
118    
119            * R/reader.R (readRCV1): Improved meta data extraction from
120            Reuters Corpus Volume 1 documents.
121    
122    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
123    
124            * R/transform.R: Ensure that all mappings preserve multiline
125            structures.
126    
127    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
128    
129            * R/filter.R: Every filter has now an attribute indicating whether
130            it sould be applied to document level (doclevel).
131    
132            * R/textdoccol.R (tmFilter): Set searchFullText as new default
133            filter.
134    
135    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
136    
137            * R/transform.R (replacePatterns): Replaced removeWords by
138            replacePatterns. Suggested by Christian Buchta.
139    
140            * R/textdoccol.R (inspect): Improved formatting.
141    
142    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
143    
144            * inst/CITATION: Updated JSS article information.
145    
146            * R/textdoccol.R (setAs): Added coerce method from list to
147            corpus.
148    
149            * R/meta.R (meta): Improved meta data handling.
150    
151    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
152    
153            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
154            Christian Buchta.
155    
156            * inst/CITATION: Added template to include JSS article reference.
157    
158    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
159    
160            * R/textdoccol.R (tmMap): Introduced lazy mapping.
161    
162            * R/source.R: Added VectorSource.
163    
164    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
165    
166            * man/: Language codes should be in ISO 639-1 format.
167    
168            * R/textdoccol.R (asPlain): Preserve local meta data.
169    
170    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
171    
172            * R/textdoccol.R (writeCorpus): Function for writing a corpus
173            containing plain text documents to disk.
174    
175    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
176    
177            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
178            always set correctly.
179    
180            * R/textdoccol.R: Set load = TRUE as default for load on demand
181            since in most cases this is the wanted behaviour.
182    
183    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
184    
185            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
186    
187            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
188    
189    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
190    
191            * R/meta.R (meta): New function for consistent access to meta data
192            of document collections, repositories, and texts.
193    
194    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
195    
196            * R/: Better support for encodings.
197    
198    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
199    
200            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
201            selection when no reader argument is given.
202    
203    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
204    
205            * R/source.R (CSVSource): Now uses read.csv instead of scan
206            internally.
207    
208    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
209    
210            * R/reader.R (getReaders): Returns available reader functions.
211    
212            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
213            as default.
214    
215    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
216    
217            * R/stopwords.R (stopwords): Shortened code, removed codetools
218            variable warnings.
219    
220            * man/: Documentation for showMeta, added an example for tmMap.
221    
222            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
223            some minor typos fixed.
224    
225    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/aobjects.R (showMeta): Added method for pretty printing a
228            text document's meta data.
229    
230    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
231    
232            * R/textdoccol.R (TextDocCol): Better handling of empty
233            arguments.
234    
235            * NAMESPACE: Exported readDOC.
236    
237            * man/completeStems.Rd: Added an example.
238    
239    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
240    
241            * R/stopwords.R (stopwords): Look up .dat files at every
242            call. Allows users to modify stopword .dat files interactively.
243    
244    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * R/termdocmatrix.R (termFreq): Correct processing of empty
247            documents.
248    
249    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
250    
251            * man/: Updated documentation.
252    
253    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * R/complete.R (completeStems): Completes (heuristically) word
256            stems.
257    
258            * R/termdocmatrix.R (TermDocMatrix2): New modular
259            constructor.
260    
261            * NAMESPACE: Exported termFreq.
262    
263    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
264    
265            * R/reader.R (readDOC): Added MS Word reader (using antiword).
266    
267    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
268    
269            * R/weight.R: Weighting functions for TermDocMatrix.
270    
271    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
274            functions for accessing dimension, column, and row names.
275    
276            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
277    
278    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
279    
280            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
281    
282    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
283    
284            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
285    
286    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
287    
288            * R/reader.R (readPDF): Removed manual checks for pdftotext and
289            pdfinfo. The system call gives a warning anyway.
290    
291    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
292    
293            * R/textdoccol.R (asPlain): Conversion from
294            StructuredTextDocuments to PlainTextDocuments.
295    
296    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
297    
298            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
299            for accessing term-document matrices.
300    
301            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
302            are installed.
303    
304    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
307            Christian Buchta.
308    
309    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
310    
311            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
312    
313    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
314    
315            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
316    
317            * R/reader.R (readPDF): Added PDF reader.
318    
319    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
320    
321            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
322    
323            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
324    
325            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
326    
327            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
328    
329    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
330    
331            * R/distmeasure.R (dissimilarity): Replaced dists call from
332            package cba by new dist call from package proxy.
333    
334    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
335    
336            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
337    
338    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
339    
340            * R/termdocmatrix.R: require() uses the quietly option to suppress
341            loading messages.
342    
343    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
344    
345            * R/dictionary.R: Added dictionary support.
346    
347    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
350            documents. This simplifies some functions, e.g., asPlain.
351    
352    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
353    
354            * inst/doc/tm.Rnw: Fixed some typos in vignette.
355    
356    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * R/textdoccol.R (replaceWords): Added method to replace a set of
359            words by a single word. Useful for synonyms.
360    
361    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
362    
363            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
364    
365    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
368            vectors. Thanks to Ariel Maguyon for his error report.
369            (removeSparseTerms): New function to remove columns from a
370            term-document matrix exceeding a sparse factor.
371    
372    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
373    
374            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
375    
376    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
377    
378            * man/sFilter.Rd: Corrected documentation on statement format (use
379            '==' instead of '=').
380    
381    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
382    
383            * R/aobjects.R (StructuredTextDocument): Inherits from
384            TextDocument.
385    
386    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
387    
388            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
389            on sparse matrices as proposed by Martin Maechler.
390    
391    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
392    
393            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
394            \pkg{filehash} version makes them deprecated.
395    
396    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
397    
398            * R/termdocmatrix.R (textvector): Stemming is now performed before
399            erasing stopwords.
400            (weightMatrix): Adapted to handle sparse matrices.
401            (TermDocMatrix): Sparse matrix is now efficiently built by
402            direct stepwise insertion of row values into it.
403    
404    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
405    
406            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
407            due to ongoing problems. For our purposes the latter is as useful
408            as the replaced package.
409    
410    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
411    
412            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
413    
414            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
415    
416    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
417    
418            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
419            languages with available stopwords.
420    
421    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
422    
423            * inst/doc/tm.Rnw: Minor corrections in the vignette.
424    
425    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
426    
427            * DESCRIPTION: Update to version 0.2, since a lot of new features
428            have been integrated.
429    
430            * inst/stopwords: Updated existing stopwords and added stopwords
431            for various other languages.
432    
433    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
434    
435            * man/: Updated documentation.
436    
437            * Work/testDb.R: Script to test database stuff.
438    
439            * R/: Fixed various database related bugs. Seems to be rather
440            useable now, i.e., consider as alpha status for now.
441    
442    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
443    
444            * R/: Fixed some bugs related to database support.
445    
446    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
447    
448            * man/: Added a lot of examples to the manuals.
449    
450    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
451    
452            * man/: Updated parts of the documentation.
453    
454            * R/textdoccol.R (asPlain): Added conversion from newsgroup
455            documents to plain text documents.
456    
457    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
458    
459            * R/textdoccol.R: Finished experimental database support. Not yet
460            intensively tested.
461    
462            * R/source.R: Now each source has a default reader.
463    
464            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
465            class anymore.
466    
467            * R/plaintextdoc.R: Custom show method for plain text documents.
468    
469            * R/aobjects.R: Added a class for structured text documents.
470    
471            * R/reader.R: Replaced remaining \code{parser} occurrences with
472            \code{reader}.
473    
474            * R/textdoccol.R (summary): Indent tags.
475    
476            * R/textdoccol.R (removePunctuation): Transform method to remove
477            punctuation marks.
478    
479    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
480    
481            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
482            using prescindMeta().
483    
484    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
485    
486            * R/textdoccol.R: Improved database support.
487    
488    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
489    
490            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
491    
492            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
493            language code.
494    
495            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
496            into parserControl argument.
497    
498            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
499    
500    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
501    
502            * Work/tmDataSetup.R: The datasets acq and crude can now be
503            created on the fly.
504    
505            * R/stopwords.R: Introduced a function returning the stopwords for
506            a given language (English, German and French at the moment)
507    
508            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
509            otherwise falls back to Snowball package.
510    
511    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
512    
513            * man/dissimilarity-methods.Rd: Make clear that any method offered
514            by "dists" from package "cba" can be used.
515    
516    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
517    
518            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
519            to Kurt's latex suggestion. Removed points and underscores in
520            variable names for consistent naming.
521    
522            * DESCRIPTION: Update to version 0.1-2.
523    
524            * man/TextRepository.Rd: Fixed bug in documentation.
525    
526    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
527    
528            * DESCRIPTION: Update to version 0.1-1.
529    
530    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
533            wordStem.
534    
535    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
536    
537            * R/: Changes due to Kurt's review.
538    
539    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
540    
541            * R/: Implemented improvements based upon comments by David
542            Meyer.
543    
544    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
545    
546            * inst/doc/: Rewrote vignette.
547    
548            * man/: Improved documentation.
549    
550    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
551    
552            * man/: Updated documentation.
553    
554            * DESCRIPTION: Changed package name to "tm". Updated version to
555            0.1 for first CRAN release.
556    
557            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
558            list archive example.
559    
560            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
561            archive example.
562    
563            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
564            from (several mails per box) mbox format to (single mail per file)
565            eml format.
566    
567    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
568    
569            * data/crude.rda: Rebuilt.
570    
571            * data/acq.rda: Rebuilt.
572    
573            * R/reader.R: Factored out reader and parser methods from
574            textdoccol.R.
575    
576            * R/source.R: Factored out Source methods from aobjects.R and
577            textdoccol.R.
578            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
579            feeds.
580    
581            * R/textdoccol.R (DirSource): Added support for recursive
582            traversal of directories.
583    
584    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
585    
586            * R/textdoccol.R ([[): Loads the document corpus automatically
587            into memory upon access.
588            (tm_transform, tm_filter): Removed several checks whether the
589            document is already loaded ([[ ensures this now).
590            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
591            mailing list archive.
592    
593    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
594    
595            * R/aobjects.R (TextDocument): Is now a virtual class.
596            (Source): Is now a virtual class.
597    
598    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
599    
600            * R/textdoccol.R (c): Support for an arbitrary number of document
601            collections.
602    
603    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
604    
605            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
606            append_meta and remove_meta.
607    
608            * R/textdoccol.R: Removed modify_metadata method.
609    
610            * R/textrepo.R: Removed modify_metadata method.
611    
612            * R/textdoccol.R (remove_meta): Supports removal of document
613            collection metadata and document (= in data frame) metadata.
614    
615    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
616    
617            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
618    
619            * data/crude.rda: Rebuilt.
620    
621            * data/acq.rda: Rebuilt.
622    
623            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
624    
625            * R/textdoccol.R ([): Bug fix for subsetting a document
626            collection's data frame.
627    
628    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
629    
630            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
631            to s_filter.
632    
633            * R/textdoccol.R: Local text documents' metadata can now be copied
634            to a document collection's data frame with prescind_meta.
635    
636    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
637    
638            * R/: Text documents' slot metadata is now accessible in s_filter.
639    
640            * R/: Rewrote s_filter function (has still some restrictions).
641    
642    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
643    
644            * R/: Various fixes in handling metadata.
645    
646            * R/: Added update mechanism for text document collections.
647    
648    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
649    
650            * R/: Merging of document collections now creates a binary tree
651            for reconstructing merged document collections.
652    
653            * R/: Redesign of metadata for document collections.
654    
655    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
656    
657            * R/: Messages now use \code{ngettext}.
658    
659    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
660    
661            * R/: Added functions for modifying and removing metadata.
662    
663    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
664    
665            * man/: Updated some documentation.
666    
667            * R/: Corrected some connection issues.
668    
669            * inst/doc: Worked on the vignette.
670    
671    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
672    
673            * inst/: Added texts and started vignette.
674    
675            * R/: Final changes based upon David's comments.
676    
677    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
678    
679            * NAMESPACE: Corrected exports (generic methods need exportMethods
680            directives!).
681    
682    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
683    
684            * R/: Modified the TextDocCol constructur and various parsers. It
685            is now modular and supports various file formats via plugins (see
686            the new "Source" class).
687    
688    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
689    
690            * man/: Revised documentation after previous code changes.
691    
692    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
693    
694            * R/: Remaining changes as discussed with David.
695    
696    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
697    
698            * R/: Some changes as suggested by David. The rest will follow
699            within the next days.
700    
701    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
702    
703            * man/: Finished documentation.
704    
705    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
706    
707            * man/: Wrote some documentation.
708    
709    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * R/: Further syntactic sugar in form of additional assignment and
712            accessor methods.
713    
714    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
715    
716            * R/: Syntactic sugar in form of "length", "show" and "summary"
717            operators.
718    
719    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
720    
721            * R/: Diverse updates. Mainly on default operators ("[" or "c")
722            and dissimilarities.
723    
724    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
725    
726            * R/: Added similarity functions.
727    
728            * data/: Added english stopwords.
729    
730    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
731    
732            * data/: Examples compiled for new features
733    
734            * R/: Changes due to new structure.
735    
736            * NAMESPACE: Corrected namespace to reflect new structure.
737    
738            * R/termdocmatrix.R: Adapted for new naming scheme.
739    
740    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
741    
742            * R/textdoccol.R: Adapted code for new class structure. Wrote
743            several transform and filter functions operating on text document
744            collections (alias text document databases).
745    
746            * R/aobjects.R: Adapted class structure with inheritance,
747            repositories and additional meta data. Loading files on demand is
748            now possible.
749    
750    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
751    
752            * R/: Some cosmetic cleanups.
753    
754            * inst/: Removed vignette on clustering. That and much more is now
755            described in the JSS paper on text mining. Based upon that
756            article an elaborated vignette will be incorporated in the future.
757    
758    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
759    
760            * R/: Updated generic S4 methods to comply with signature changes
761            in newer versions of R (> 2.3)
762    
763    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
764    
765            * ext/R/importRIS.R: Automatic RIS import is now possible.
766    
767    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
768    
769            * R/textdoccol.R: Added RIS HTML input format.
770    
771    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
772    
773            * R/textdoccol.R: Removed bug that caused invalid text document
774            collections when handling many input files.
775    
776    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
777    
778            * R/textdoccol.R: Restructured and extended file import
779            mechanism.
780    
781            * inst/doc/clustering.Rnw: Adapted vignette for use with
782            ReutNews.rda
783    
784            * man/ReutNews.Rd: Documentation for ReutNews.rda
785    
786            * data/ReutNews.rda: A tiny Reuters21578 example data set.
787    
788    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
789    
790            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
791            clustering facilities of this package.
792    
793    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
794    
795            * R/aobjects.R: Changed package document structure to avoid class
796            dependency problems.
797    
798    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
799    
800            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
801            data set.
802    
803            *  Finished documentation and reordered directory structure. Now "R
804            CMD check textmin" works without errors.
805    
806    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
807    
808            * src/: Various splits can now be easily created for the
809            Reuters21578 data set.
810    
811    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
812    
813            *  Updated documentation
814    
815    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
816    
817            *  Wrote R documentation for some classes and methods.
818    
819    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
820    
821            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
822            files. See the questionnaire data/Umfrage.csv for such an example.
823            We are now able to import files in Reuters-21578 XML format.
824    
825            *  Changed class interfaces in various files. Weighting of the text
826            matrix is now possible.
827    
828    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
829    
830            * R/textdoccol.R: One can build term-document matrices if
831            nessecary (with buildTDM(...)) and fill the field tdm from a text
832            document collection with it.
833    
834            * R/textmatrix.R: Wrote S4 class for term-document matrices.
835    
836    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
837    
838            * R/textdoccol.R: We now can read in a whole XML file with several
839            news items.
840    
841  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
842    
843          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.918

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge