[tm] Diff of /pkg/ChangeLog

trunk/R/trunk/ChangeLog revision 37, Wed Jan 11 17:49:17 2006 UTC
pkg/ChangeLog revision 912, Mon Mar 23 22:50:36 2009 UTC
1    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/reader.R (readXML): New reader for arbitrary XML files.
4    
5    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
6    
7            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
8            (XMLSource): New XMLSource class for arbitrary XML files.
9            (Source): New slot Vectorized.
10    
11    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
12    
13            * R/reader.R (readCustom): Experimental reader which can be
14            customized via user-defined mappings.
15    
16            * R/reader.R: Always use UTC time zone.
17    
18            * R/AAA.R (.onLoad): No longer try to start an MPI cluster.
19    
20    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
21    
22            * R/reader.R (readDOC): Options can be passed over to antiword.
23    
24            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
25            pdftotext.
26    
27    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
28    
29            * R/source.R (DirSource): Add pattern and ignore.case arguments
30            which are internally passed over to list.files().
31    
32    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
33    
34            * inst/doc/tm.Rnw: Suppress pointless loading message.
35    
36    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
37    
38            * DESCRIPTION: Speed up package loading (via moving packages not
39            strictly necessary for normal operation to Suggests instead of
40            Depends).
41    
42    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
43    
44            * R/reader.R (readNewsgroup): The date format is now configurable.
45    
46    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
47    
48            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
49    
50    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
51    
52            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
53    
54    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
55    
56            * R/source.R (DataframeSource): New source class for data frames.
57    
58            * R/source.R: Fixed non-standard call evaluation.
59    
60    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
61    
62            * R/source.R (URISource): New source class for a single document.
63    
64    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
65    
66            * R/source.R: Refactoring.
67    
68    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
69    
70            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
71            Rmpi installations more gracefully.
72    
73    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/source.R (Source): Add Length slot.
76    
77    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
78    
79            * R/AAA.R: Unify duplicated .onLoad function.
80    
81    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
82    
83            * DESCRIPTION (Suggests): Added Rmpi.
84    
85    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
86    
87            * R/source.R (getElem): Fix 'no visible binding' warning.
88    
89            * man/WeightFunction.Rd: Fix signature.
90    
91    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
92    
93            * R/weight.R: Introduce name abbreviations for weighting functions.
94    
95    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
96    
97            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
98    
99            * R/cluster.R: Provide convenience functions for using an MPI
100            cluster.
101    
102            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
103            available.
104    
105            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
106            available.
107    
108    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
109    
110            * R/textdoccol.R (lapply): Removed debug print out.
111    
112    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
113    
114            * R/reader.R (readRCV1): Improved meta data extraction from
115            Reuters Corpus Volume 1 documents.
116    
117    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
118    
119            * R/transform.R: Ensure that all mappings preserve multiline
120            structures.
121    
122    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
123    
124            * R/filter.R: Every filter now has an attribute indicating whether
125            it should be applied at document level (doclevel).
126    
127            * R/textdoccol.R (tmFilter): Set searchFullText as new default
128            filter.
129    
130    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
131    
132            * R/transform.R (replacePatterns): Replaced removeWords by
133            replacePatterns. Suggested by Christian Buchta.
134    
135            * R/textdoccol.R (inspect): Improved formatting.
136    
137    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
138    
139            * inst/CITATION: Updated JSS article information.
140    
141            * R/textdoccol.R (setAs): Added coerce method from list to
142            corpus.
143    
144            * R/meta.R (meta): Improved meta data handling.
145    
146    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
147    
148            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
149            Christian Buchta.
150    
151            * inst/CITATION: Added template to include JSS article reference.
152    
153    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
154    
155            * R/textdoccol.R (tmMap): Introduced lazy mapping.
156    
157            * R/source.R: Added VectorSource.
158    
159    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
160    
161            * man/: Language codes should be in ISO 639-1 format.
162    
163            * R/textdoccol.R (asPlain): Preserve local meta data.
164    
165    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
166    
167            * R/textdoccol.R (writeCorpus): Function for writing a corpus
168            containing plain text documents to disk.
169    
170    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
171    
172            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
173            always set correctly.
174    
175            * R/textdoccol.R: Set load = TRUE as default for load on demand
176            since in most cases this is the wanted behaviour.
177    
178    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
179    
180            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
181    
182            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
183    
184    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
185    
186            * R/meta.R (meta): New function for consistent access to meta data
187            of document collections, repositories, and texts.
188    
189    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
190    
191            * R/: Better support for encodings.
192    
193    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
194    
195            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
196            selection when no reader argument is given.
197    
198    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
199    
200            * R/source.R (CSVSource): Now uses read.csv instead of scan
201            internally.
202    
203    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
204    
205            * R/reader.R (getReaders): Returns available reader functions.
206    
207            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
208            as default.
209    
210    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
211    
212            * R/stopwords.R (stopwords): Shortened code, removed codetools
213            variable warnings.
214    
215            * man/: Documentation for showMeta, added an example for tmMap.
216    
217            * inst/doc/tm.Rnw: Updated vignette, comments on MS Word reader,
218            some minor typos fixed.
219    
220    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
221    
222            * R/aobjects.R (showMeta): Added method for pretty printing a
223            text document's meta data.
224    
225    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
226    
227            * R/textdoccol.R (TextDocCol): Better handling of empty
228            arguments.
229    
230            * NAMESPACE: Exported readDOC.
231    
232            * man/completeStems.Rd: Added an example.
233    
234    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
235    
236            * R/stopwords.R (stopwords): Look up .dat files at every
237            call. Allows users to modify stopword .dat files interactively.
238    
239    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
240    
241            * R/termdocmatrix.R (termFreq): Correct processing of empty
242            documents.
243    
244    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
245    
246            * man/: Updated documentation.
247    
248    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
249    
250            * R/complete.R (completeStems): Completes (heuristically) word
251            stems.
252    
253            * R/termdocmatrix.R (TermDocMatrix2): New modular
254            constructor.
255    
256            * NAMESPACE: Exported termFreq.
257    
258    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
259    
260            * R/reader.R (readDOC): Added MS Word reader (using antiword).
261    
262    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
263    
264            * R/weight.R: Weighting functions for TermDocMatrix.
265    
266    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
267    
268            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
269            functions for accessing dimension, column, and row names.
270    
271            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
272    
273    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
274    
275            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
276    
277    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
278    
279            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
280    
281    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
282    
283            * R/reader.R (readPDF): Removed manual checks for pdftotext and
284            pdfinfo. The system call gives a warning anyway.
285    
286    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
287    
288            * R/textdoccol.R (asPlain): Conversion from
289            StructuredTextDocuments to PlainTextDocuments.
290    
291    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
292    
293            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
294            for accessing term-document matrices.
295    
296            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
297            are installed.
298    
299    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
302            Christian Buchta.
303    
304    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
307    
308    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
309    
310            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
311    
312            * R/reader.R (readPDF): Added PDF reader.
313    
314    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
315    
316            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
317    
318            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
319    
320            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
321    
322            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
323    
324    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
325    
326            * R/distmeasure.R (dissimilarity): Replaced dists call from
327            package cba by new dist call from package proxy.
328    
329    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
330    
331            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
332    
333    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
334    
335            * R/termdocmatrix.R: require() uses the quietly option to suppress
336            loading messages.
337    
338    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
339    
340            * R/dictionary.R: Added dictionary support.
341    
342    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
343    
344            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
345            documents. This simplifies some functions, e.g., asPlain.
346    
347    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
348    
349            * inst/doc/tm.Rnw: Fixed some typos in vignette.
350    
351    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
352    
353            * R/textdoccol.R (replaceWords): Added method to replace a set of
354            words by a single word. Useful for synonyms.
355    
356    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
357    
358            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
359    
360    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
363            vectors. Thanks to Ariel Maguyon for his error report.
364            (removeSparseTerms): New function to remove columns from a
365            term-document matrix exceeding a sparse factor.
366    
367    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
368    
369            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
370    
371    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
372    
373            * man/sFilter.Rd: Corrected documentation on statement format (use
374            '==' instead of '=').
375    
376    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
377    
378            * R/aobjects.R (StructuredTextDocument): Inherits from
379            TextDocument.
380    
381    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
382    
383            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
384            on sparse matrices as proposed by Martin Maechler.
385    
386    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
387    
388            * R/textdoccol.R: Removed \code{dbDisconnect} calls since the latest
389            \pkg{filehash} version deprecates them.
390    
391    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
392    
393            * R/termdocmatrix.R (textvector): Stemming is now performed before
394            erasing stopwords.
395            (weightMatrix): Adapted to handle sparse matrices.
396            (TermDocMatrix): Sparse matrix is now efficiently built by
397            direct stepwise insertion of row values into it.
398    
399    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
400    
401            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
402            due to ongoing problems. For our purposes the latter is as useful
403            as the replaced package.
404    
405    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
406    
407            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
408    
409            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
410    
411    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
412    
413            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
414            languages with available stopwords.
415    
416    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
417    
418            * inst/doc/tm.Rnw: Minor corrections in the vignette.
419    
420    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
421    
422            * DESCRIPTION: Update to version 0.2, since a lot of new features
423            have been integrated.
424    
425            * inst/stopwords: Updated existing stopwords and added stopwords
426            for various other languages.
427    
428    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
429    
430            * man/: Updated documentation.
431    
432            * Work/testDb.R: Script to test database stuff.
433    
434            * R/: Fixed various database related bugs. Seems to be rather
435            useable now, i.e., consider as alpha status for now.
436    
437    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
438    
439            * R/: Fixed some bugs related to database support.
440    
441    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
442    
443            * man/: Added a lot of examples to the manuals.
444    
445    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
446    
447            * man/: Updated parts of the documentation.
448    
449            * R/textdoccol.R (asPlain): Added conversion from newsgroup
450            documents to plain text documents.
451    
452    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
453    
454            * R/textdoccol.R: Finished experimental database support. Not yet
455            intensively tested.
456    
457            * R/source.R: Now each source has a default reader.
458    
459            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
460            class anymore.
461    
462            * R/plaintextdoc.R: Custom show method for plain text documents.
463    
464            * R/aobjects.R: Added a class for structured text documents.
465    
466            * R/reader.R: Replaced remaining \code{parser} occurrences with
467            \code{reader}.
468    
469            * R/textdoccol.R (summary): Indent tags.
470    
471            * R/textdoccol.R (removePunctuation): Transform method to remove
472            punctuation marks.
473    
474    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
475    
476            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
477            using prescindMeta().
478    
479    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
480    
481            * R/textdoccol.R: Improved database support.
482    
483    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
484    
485            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
486    
487            * R/resolve.R (resolveISOcode): Extracts the language from an ISO
488            language code.
489    
490            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
491            into parserControl argument.
492    
493            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
494    
495    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
496    
497            * Work/tmDataSetup.R: The datasets acq and crude can now be
498            created on the fly.
499    
500            * R/stopwords.R: Introduced a function returning the stopwords for
501            a given language (English, German and French at the moment).
502    
503            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
504            otherwise falls back to Snowball package.
505    
506    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
507    
508            * man/dissimilarity-methods.Rd: Make clear that any method offered
509            by "dists" from package "cba" can be used.
510    
511    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
512    
513            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
514            to Kurt's LaTeX suggestion. Removed points and underscores in
515            variable names for consistent naming.
516    
517            * DESCRIPTION: Update to version 0.1-2.
518    
519            * man/TextRepository.Rd: Fixed bug in documentation.
520    
521    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
522    
523            * DESCRIPTION: Update to version 0.1-1.
524    
525    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
526    
527            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
528            wordStem.
529    
530    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
531    
532            * R/: Changes due to Kurt's review.
533    
534    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
535    
536            * R/: Implemented improvements based upon comments by David
537            Meyer.
538    
539    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
540    
541            * inst/doc/: Rewrote vignette.
542    
543            * man/: Improved documentation.
544    
545    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
546    
547            * man/: Updated documentation.
548    
549            * DESCRIPTION: Changed package name to "tm". Updated version to
550            0.1 for first CRAN release.
551    
552            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
553            list archive example.
554    
555            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
556            archive example.
557    
558            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
559            from (several mails per box) mbox format to (single mail per file)
560            eml format.
561    
562    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
563    
564            * data/crude.rda: Rebuilt.
565    
566            * data/acq.rda: Rebuilt.
567    
568            * R/reader.R: Factored out reader and parser methods from
569            textdoccol.R.
570    
571            * R/source.R: Factored out Source methods from aobjects.R and
572            textdoccol.R.
573            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
574            feeds.
575    
576            * R/textdoccol.R (DirSource): Added support for recursive
577            traversal of directories.
578    
579    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
580    
581            * R/textdoccol.R ([[): Loads the document corpus automatically
582            into memory upon access.
583            (tm_transform, tm_filter): Removed several checks whether the
584            document is already loaded ([[ ensures this now).
585            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
586            mailing list archive.
587    
588    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
589    
590            * R/aobjects.R (TextDocument): Is now a virtual class.
591            (Source): Is now a virtual class.
592    
593    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
594    
595            * R/textdoccol.R (c): Support for an arbitrary number of document
596            collections.
597    
598    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
599    
600            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
601            append_meta and remove_meta.
602    
603            * R/textdoccol.R: Removed modify_metadata method.
604    
605            * R/textrepo.R: Removed modify_metadata method.
606    
607            * R/textdoccol.R (remove_meta): Supports removal of document
608            collection metadata and document (= in data frame) metadata.
609    
610    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
611    
612            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
613    
614            * data/crude.rda: Rebuilt.
615    
616            * data/acq.rda: Rebuilt.
617    
618            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
619    
620            * R/textdoccol.R ([): Bug fix for subsetting a document
621            collection's data frame.
622    
623    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
624    
625            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
626            to s_filter.
627    
628            * R/textdoccol.R: Local text documents' metadata can now be copied
629            to a document collection's data frame with prescind_meta.
630    
631    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
632    
633            * R/: Text documents' slot metadata is now accessible in s_filter.
634    
635            * R/: Rewrote s_filter function (still has some restrictions).
636    
637    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
638    
639            * R/: Various fixes in handling metadata.
640    
641            * R/: Added update mechanism for text document collections.
642    
643    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
644    
645            * R/: Merging of document collections now creates a binary tree
646            for reconstructing merged document collections.
647    
648            * R/: Redesign of metadata for document collections.
649    
650    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
651    
652            * R/: Messages now use \code{ngettext}.
653    
654    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
655    
656            * R/: Added functions for modifying and removing metadata.
657    
658    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
659    
660            * man/: Updated some documentation.
661    
662            * R/: Corrected some connection issues.
663    
664            * inst/doc: Worked on the vignette.
665    
666    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
667    
668            * inst/: Added texts and started vignette.
669    
670            * R/: Final changes based upon David's comments.
671    
672    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
673    
674            * NAMESPACE: Corrected exports (generic methods need exportMethods
675            directives!).
676    
677    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
678    
679            * R/: Modified the TextDocCol constructor and various parsers. It
680            is now modular and supports various file formats via plugins (see
681            the new "Source" class).
682    
683    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
684    
685            * man/: Revised documentation after previous code changes.
686    
687    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
688    
689            * R/: Remaining changes as discussed with David.
690    
691    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
692    
693            * R/: Some changes as suggested by David. The rest will follow
694            within the next few days.
695    
696    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
697    
698            * man/: Finished documentation.
699    
700    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
701    
702            * man/: Wrote some documentation.
703    
704    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
705    
706            * R/: Further syntactic sugar in form of additional assignment and
707            accessor methods.
708    
709    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * R/: Syntactic sugar in form of "length", "show" and "summary"
712            operators.
713    
714    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
715    
716            * R/: Diverse updates. Mainly on default operators ("[" or "c")
717            and dissimilarities.
718    
719    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
720    
721            * R/: Added similarity functions.
722    
723            * data/: Added English stopwords.
724    
725    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
726    
727            * data/: Examples compiled for new features.
728    
729            * R/: Changes due to new structure.
730    
731            * NAMESPACE: Corrected namespace to reflect new structure.
732    
733            * R/termdocmatrix.R: Adapted for new naming scheme.
734    
735    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
736    
737            * R/textdoccol.R: Adapted code for new class structure. Wrote
738            several transform and filter functions operating on text document
739            collections (alias text document databases).
740    
741            * R/aobjects.R: Adapted class structure with inheritance,
742            repositories and additional meta data. Loading files on demand is
743            now possible.
744    
745    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
746    
747            * R/: Some cosmetic cleanups.
748    
749            * inst/: Removed vignette on clustering. That and much more is now
750            described in the JSS paper on text mining. Based upon that
751            article an elaborated vignette will be incorporated in the future.
752    
753    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
754    
755            * R/: Updated generic S4 methods to comply with signature changes
756            in newer versions of R (> 2.3).
757    
758    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
759    
760            * ext/R/importRIS.R: Automatic RIS import is now possible.
761    
762    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
763    
764            * R/textdoccol.R: Added RIS HTML input format.
765    
766    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
767    
768            * R/textdoccol.R: Removed bug that caused invalid text document
769            collections when handling many input files.
770    
771    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
772    
773            * R/textdoccol.R: Restructured and extended file import
