SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC pkg/ChangeLog revision 919, Sat Mar 28 17:13:29 2009 UTC
# Line 1  Line 1 
1    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
2    
3            * inst/doc/extensions.Rnw: Finished vignette.
4    
5    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
8            DocumentTermMatrix representations.
9    
10    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
11    
12            * R/reader.R (readXML): New reader for arbitrary XML files.
13    
14    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
15    
16            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
17            (XMLSource): New XMLSource class for arbitrary XML files.
18            (Source): New slot Vectorized.
19    
20    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
21    
22            * R/reader.R (readCustom): Experimental reader which can be
23            customized via user-defined mappings.
24    
25            * R/reader.R: Always use UTC time zone.
26    
27            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
28    
29    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
30    
31            * R/reader.R (readDOC): Options can be passed over to antiword.
32    
33            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
34            pdftotext.
35    
36    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
37    
38            * R/source.R (DirSource): Add pattern and ignore.case arguments
39            which are internally passed over to list.files().
40    
41    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
42    
43            * inst/doc/tm.Rnw: Suppress pointless loading message.
44    
45    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
46    
47            * DESCRIPTION: Speed up package loading (via moving packages not
48            strictly necessary for normal operation to Suggests instead of
49            Depends).
50    
51    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
52    
53            * R/reader.R (readNewsgroup): The date format is now configurable.
54    
55    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
58    
59    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
60    
61            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
62    
63    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
64    
65            * R/source.R (DataframeSource): New source class for data frames.
66    
67            * R/source.R: Fixed non-standard call evaluation.
68    
69    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
70    
71            * R/source.R (URISource): New source class for a single document.
72    
73    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/source.R: Refactoring.
76    
77    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
78    
79            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
80            Rmpi installations more gracefully.
81    
82    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
83    
84            * R/source.R (Source): Add Length slot.
85    
86    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
87    
88            * R/AAA.R: Unify duplicated .onLoad function.
89    
90    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
91    
92            * DESCRIPTION (Suggests): Added Rmpi.
93    
94    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
95    
96            * R/source.R (getElem): Fix 'no visible binding' warning.
97    
98            * man/WeightFunction.Rd: Fix signature.
99    
100    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
101    
102            * R/weight.R: Introduce name abbreviations for weighting functions.
103    
104    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
105    
106            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
107    
108            * R/cluster.R: Provide convenience functions for using a MPI
109            cluster.
110    
111            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
112            available.
113    
114            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
115            available.
116    
117    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
118    
119            * R/textdoccol.R (lapply): Removed debug print out.
120    
121    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
122    
123            * R/reader.R (readRCV1): Improved meta data extraction from
124            Reuters Corpus Volume 1 documents.
125    
126    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
127    
128            * R/transform.R: Ensure that all mappings preserve multiline
129            structures.
130    
131    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
132    
133            * R/filter.R: Every filter has now an attribute indicating whether
134            it sould be applied to document level (doclevel).
135    
136            * R/textdoccol.R (tmFilter): Set searchFullText as new default
137            filter.
138    
139    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
140    
141            * R/transform.R (replacePatterns): Replaced removeWords by
142            replacePatterns. Suggested by Christian Buchta.
143    
144            * R/textdoccol.R (inspect): Improved formatting.
145    
146    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
147    
148            * inst/CITATION: Updated JSS article information.
149    
150            * R/textdoccol.R (setAs): Added coerce method from list to
151            corpus.
152    
153            * R/meta.R (meta): Improved meta data handling.
154    
155    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
156    
157            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
158            Christian Buchta.
159    
160            * inst/CITATION: Added template to include JSS article reference.
161    
162    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
163    
164            * R/textdoccol.R (tmMap): Introduced lazy mapping.
165    
166            * R/source.R: Added VectorSource.
167    
168    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
169    
170            * man/: Language codes should be in ISO 639-1 format.
171    
172            * R/textdoccol.R (asPlain): Preserve local meta data.
173    
174    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
175    
176            * R/textdoccol.R (writeCorpus): Function for writing a corpus
177            containing plain text documents to disk.
178    
179    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
180    
181            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
182            always set correctly.
183    
184            * R/textdoccol.R: Set load = TRUE as default for load on demand
185            since in most cases this is the wanted behaviour.
186    
187    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
188    
189            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
190    
191            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
192    
193    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
194    
195            * R/meta.R (meta): New function for consistent access to meta data
196            of document collections, repositories, and texts.
197    
198    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
199    
200            * R/: Better support for encodings.
201    
202    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
203    
204            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
205            selection when no reader argument is given.
206    
207    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
208    
209            * R/source.R (CSVSource): Now uses read.csv instead of scan
210            internally.
211    
212    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
213    
214            * R/reader.R (getReaders): Returns available reader functions.
215    
216            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
217            as default.
218    
219    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
220    
221            * R/stopwords.R (stopwords): Shortened code, removed codetools
222            variable warnings.
223    
224            * man/: Documentation for showMeta, added an example for tmMap.
225    
226            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
227            some minor typos fixed.
228    
229    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
230    
231            * R/aobjects.R (showMeta): Added method for pretty printing a
232            text document's meta data.
233    
234    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
235    
236            * R/textdoccol.R (TextDocCol): Better handling of empty
237            arguments.
238    
239            * NAMESPACE: Exported readDOC.
240    
241            * man/completeStems.Rd: Added an example.
242    
243    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
244    
245            * R/stopwords.R (stopwords): Look up .dat files at every
246            call. Allows users to modify stopword .dat files interactively.
247    
248    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
249    
250            * R/termdocmatrix.R (termFreq): Correct processing of empty
251            documents.
252    
253    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
254    
255            * man/: Updated documentation.
256    
257    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * R/complete.R (completeStems): Completes (heuristically) word
260            stems.
261    
262            * R/termdocmatrix.R (TermDocMatrix2): New modular
263            constructor.
264    
265            * NAMESPACE: Exported termFreq.
266    
267    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
268    
269            * R/reader.R (readDOC): Added MS Word reader (using antiword).
270    
271    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/weight.R: Weighting functions for TermDocMatrix.
274    
275    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
278            functions for accessing dimension, column, and row names.
279    
280            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
281    
282    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
283    
284            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
285    
286    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
287    
288            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
289    
290    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
291    
292            * R/reader.R (readPDF): Removed manual checks for pdftotext and
293            pdfinfo. The system call gives a warning anyway.
294    
295    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
296    
297            * R/textdoccol.R (asPlain): Conversion from
298            StructuredTextDocuments to PlainTextDocuments.
299    
300    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
301    
302            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
303            for accessing term-document matrices.
304    
305            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
306            are installed.
307    
308    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
309    
310            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
311            Christian Buchta.
312    
313    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
314    
315            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
316    
317    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
320    
321            * R/reader.R (readPDF): Added PDF reader.
322    
323    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
324    
325            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
326    
327            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
328    
329            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
330    
331            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
332    
333    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335            * R/distmeasure.R (dissimilarity): Replaced dists call from
336            package cba by new dist call from package proxy.
337    
338    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
339    
340            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
341    
342    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
343    
344            * R/termdocmatrix.R: require() uses the quietly option to suppress
345            loading messages.
346    
347    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * R/dictionary.R: Added dictionary support.
350    
351    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
352    
353            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
354            documents. This simplifies some functions, e.g., asPlain.
355    
356    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * inst/doc/tm.Rnw: Fixed some typos in vignette.
359    
360    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * R/textdoccol.R (replaceWords): Added method to replace a set of
363            words by a single word. Useful for synonyms.
364    
365    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
366    
367            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
368    
369    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
370    
371            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
372            vectors. Thanks to Ariel Maguyon for his error report.
373            (removeSparseTerms): New function to remove columns from a
374            term-document matrix exceeding a sparse factor.
375    
376    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
377    
378            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
379    
380    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
381    
382            * man/sFilter.Rd: Corrected documentation on statement format (use
383            '==' instead of '=').
384    
385    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
386    
387            * R/aobjects.R (StructuredTextDocument): Inherits from
388            TextDocument.
389    
390    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
391    
392            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
393            on sparse matrices as proposed by Martin Maechler.
394    
395    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
396    
397            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
398            \pkg{filehash} version makes them deprecated.
399    
400    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
401    
402            * R/termdocmatrix.R (textvector): Stemming is now performed before
403            erasing stopwords.
404            (weightMatrix): Adapted to handle sparse matrices.
405            (TermDocMatrix): Sparse matrix is now efficiently built by
406            direct stepwise insertion of row values into it.
407    
408    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
409    
410            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
411            due to ongoing problems. For our purposes the latter is as useful
412            as the replaced package.
413    
414    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
415    
416            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
417    
418            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
419    
420    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
421    
422            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
423            languages with available stopwords.
424    
425    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
426    
427            * inst/doc/tm.Rnw: Minor corrections in the vignette.
428    
429    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
430    
431            * DESCRIPTION: Update to version 0.2, since a lot of new features
432            have been integrated.
433    
434            * inst/stopwords: Updated existing stopwords and added stopwords
435            for various other languages.
436    
437    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * man/: Updated documentation.
440    
441            * Work/testDb.R: Script to test database stuff.
442    
443            * R/: Fixed various database related bugs. Seems to be rather
444            useable now, i.e., consider as alpha status for now.
445    
446    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
447    
448            * R/: Fixed some bugs related to database support.
449    
450    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
451    
452            * man/: Added a lot of examples to the manuals.
453    
454    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
455    
456            * man/: Updated parts of the documentation.
457    
458            * R/textdoccol.R (asPlain): Added conversion from newsgroup
459            documents to plain text documents.
460    
461    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
462    
463            * R/textdoccol.R: Finished experimental database support. Not yet
464            intensively tested.
465    
466            * R/source.R: Now each source has a default reader.
467    
468            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
469            class anymore.
470    
471            * R/plaintextdoc.R: Custom show method for plain text documents.
472    
473            * R/aobjects.R: Added a class for structured text documents.
474    
475            * R/reader.R: Replaced remaining \code{parser} occurrences with
476            \code{reader}.
477    
478            * R/textdoccol.R (summary): Indent tags.
479    
480            * R/textdoccol.R (removePunctuation): Transform method to remove
481            punctuation marks.
482    
483    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
484    
485            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
486            using prescindMeta().
487    
488    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
489    
490            * R/textdoccol.R: Improved database support.
491    
492    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
493    
494            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
495    
496            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
497            language code.
498    
499            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
500            into parserControl argument.
501    
502            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
503    
504    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
505    
506            * Work/tmDataSetup.R: The datasets acq and crude can now be
507            created on the fly.
508    
509            * R/stopwords.R: Introduced a function returning the stopwords for
510            a given language (English, German and French at the moment)
511    
512            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
513            otherwise falls back to Snowball package.
514    
515    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
516    
517            * man/dissimilarity-methods.Rd: Make clear that any method offered
518            by "dists" from package "cba" can be used.
519    
520    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
521    
522            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
523            to Kurt's latex suggestion. Removed points and underscores in
524            variable names for consistent naming.
525    
526            * DESCRIPTION: Update to version 0.1-2.
527    
528            * man/TextRepository.Rd: Fixed bug in documentation.
529    
530    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * DESCRIPTION: Update to version 0.1-1.
533    
534    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
535    
536            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
537            wordStem.
538    
539    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
540    
541            * R/: Changes due to Kurt's review.
542    
543    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
544    
545            * R/: Implemented improvements based upon comments by David
546            Meyer.
547    
548    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
549    
550            * inst/doc/: Rewrote vignette.
551    
552            * man/: Improved documentation.
553    
554    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
555    
556            * man/: Updated documentation.
557    
558            * DESCRIPTION: Changed package name to "tm". Updated version to
559            0.1 for first CRAN release.
560    
561            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
562            list archive example.
563    
564            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
565            archive example.
566    
567            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
568            from (several mails per box) mbox format to (single mail per file)
569            eml format.
570    
571    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
572    
573            * data/crude.rda: Rebuilt.
574    
575            * data/acq.rda: Rebuilt.
576    
577            * R/reader.R: Factored out reader and parser methods from
578            textdoccol.R.
579    
580            * R/source.R: Factored out Source methods from aobjects.R and
581            textdoccol.R.
582            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
583            feeds.
584    
585            * R/textdoccol.R (DirSource): Added support for recursive
586            traversal of directories.
587    
588    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
589    
590            * R/textdoccol.R ([[): Loads the document corpus automatically
591            into memory upon access.
592            (tm_transform, tm_filter): Removed several checks whether the
593            document is already loaded ([[ ensures this now).
594            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
595            mailing list archive.
596    
597    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
598    
599            * R/aobjects.R (TextDocument): Is now a virtual class.
600            (Source): Is now a virtual class.
601    
602    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
603    
604            * R/textdoccol.R (c): Support for an arbitrary number of document
605            collections.
606    
607    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
608    
609            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
610            append_meta and remove_meta.
611    
612            * R/textdoccol.R: Removed modify_metadata method.
613    
614            * R/textrepo.R: Removed modify_metadata method.
615    
616            * R/textdoccol.R (remove_meta): Supports removal of document
617            collection metadata and document (= in data frame) metadata.
618    
619    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
620    
621            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
622    
623            * data/crude.rda: Rebuilt.
624    
625            * data/acq.rda: Rebuilt.
626    
627            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
628    
629            * R/textdoccol.R ([): Bug fix for subsetting a document
630            collection's data frame.
631    
632    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
633    
634            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
635            to s_filter.
636    
637            * R/textdoccol.R: Local text documents' metadata can now be copied
638            to a document collection's data frame with prescind_meta.
639    
640    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
641    
642            * R/: Text documents' slot metadata is now accessible in s_filter.
643    
644            * R/: Rewrote s_filter function (has still some restrictions).
645    
646    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
647    
648            * R/: Various fixes in handling metadata.
649    
650            * R/: Added update mechanism for text document collections.
651    
652    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
653    
654            * R/: Merging of document collections now creates a binary tree
655            for reconstructing merged document collections.
656    
657            * R/: Redesign of metadata for document collections.
658    
659    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
660    
661            * R/: Messages now use \code{ngettext}.
662    
663    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
664    
665            * R/: Added functions for modifying and removing metadata.
666    
667    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
668    
669            * man/: Updated some documentation.
670    
671            * R/: Corrected some connection issues.
672    
673            * inst/doc: Worked on the vignette.
674    
675    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
676    
677            * inst/: Added texts and started vignette.
678    
679            * R/: Final changes based upon David's comments.
680    
681    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
682    
683            * NAMESPACE: Corrected exports (generic methods need exportMethods
684            directives!).
685    
686    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
687    
688            * R/: Modified the TextDocCol constructur and various parsers. It
689            is now modular and supports various file formats via plugins (see
690            the new "Source" class).
691    
692    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
693    
694            * man/: Revised documentation after previous code changes.
695    
696    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
697    
698            * R/: Remaining changes as discussed with David.
699    
700    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
701    
702            * R/: Some changes as suggested by David. The rest will follow
703            within the next days.
704    
705    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
706    
707            * man/: Finished documentation.
708    
709    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * man/: Wrote some documentation.
712    
713    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * R/: Further syntactic sugar in form of additional assignment and
716            accessor methods.
717    
718    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
719    
720            * R/: Syntactic sugar in form of "length", "show" and "summary"
721            operators.
722    
723    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
724    
725            * R/: Diverse updates. Mainly on default operators ("[" or "c")
726            and dissimilarities.
727    
728    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
729    
730            * R/: Added similarity functions.
731    
732            * data/: Added english stopwords.
733    
734    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
735    
736            * data/: Examples compiled for new features
737    
738            * R/: Changes due to new structure.
739    
740            * NAMESPACE: Corrected namespace to reflect new structure.
741    
742            * R/termdocmatrix.R: Adapted for new naming scheme.
743    
744    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
745    
746            * R/textdoccol.R: Adapted code for new class structure. Wrote
747            several transform and filter functions operating on text document
748            collections (alias text document databases).
749    
750            * R/aobjects.R: Adapted class structure with inheritance,
751            repositories and additional meta data. Loading files on demand is
752            now possible.
753    
754    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
755    
756            * R/: Some cosmetic cleanups.
757    
758            * inst/: Removed vignette on clustering. That and much more is now
759            described in the JSS paper on text mining. Based upon that
760            article an elaborated vignette will be incorporated in the future.
761    
762    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
763    
764            * R/: Updated generic S4 methods to comply with signature changes
765            in newer versions of R (> 2.3)
766    
767    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
768    
769            * ext/R/importRIS.R: Automatic RIS import is now possible.
770    
771    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
772    
773            * R/textdoccol.R: Added RIS HTML input format.
774    
775    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
776    
777            * R/textdoccol.R: Removed bug that caused invalid text document
778            collections when handling many input files.
779    
780  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
781    
782          * R/textdoccol.R: Restructured and extended file import          * R/textdoccol.R: Restructured and extended file import

Legend:
Removed from v.37  
changed lines
  Added in v.919

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge