SCM

SCM Repository

[tm] Diff of /pkg/ChangeLog
ViewVC logotype

Diff of /pkg/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

trunk/R/trunk/ChangeLog revision 17, Sat Nov 5 14:47:12 2005 UTC pkg/ChangeLog revision 923, Fri Apr 3 08:07:20 2009 UTC
# Line 1  Line 1 
1    2009-04-03  Ingo Feinerer  <feinerer@logic.at>
2    
3            * R/termdocmatrix.R: Further work on new TermDocumentMatrix.
4    
5    2009-03-28  Ingo Feinerer  <feinerer@logic.at>
6    
7            * inst/doc/extensions.Rnw: Finished vignette.
8    
9    2009-03-27  Ingo Feinerer  <feinerer@logic.at>
10    
11            * R/termdocmatrix.R: Start to work on new TermDocumentMatrix and
12            DocumentTermMatrix representations.
13    
14    2009-03-23  Ingo Feinerer  <feinerer@logic.at>
15    
16            * R/reader.R (readXML): New reader for arbitrary XML files.
17    
18    2009-03-22  Ingo Feinerer  <feinerer@logic.at>
19    
20            * R/source.R (CSVSource): Defunct (use DataframeSource instead).
21            (XMLSource): New XMLSource class for arbitrary XML files.
22            (Source): New slot Vectorized.
23    
24    2009-03-21  Ingo Feinerer  <feinerer@logic.at>
25    
26            * R/reader.R (readCustom): Experimental reader which can be
27            customized via user-defined mappings.
28    
29            * R/reader.R: Always use UTC time zone.
30    
31            * R/AAA.R (.onLoad): No longer try to start a MPI cluster.
32    
33    2009-03-20  Ingo Feinerer  <feinerer@logic.at>
34    
35            * R/reader.R (readDOC): Options can be passed over to antiword.
36    
37            * R/reader.R (readPDF): Options can be passed over to pdfinfo and
38            pdftotext.
39    
40    2009-03-10  Ingo Feinerer  <feinerer@logic.at>
41    
42            * R/source.R (DirSource): Add pattern and ignore.case arguments
43            which are internally passed over to list.files().
44    
45    2009-03-02  Ingo Feinerer  <feinerer@logic.at>
46    
47            * inst/doc/tm.Rnw: Suppress pointless loading message.
48    
49    2009-01-29  Ingo Feinerer  <feinerer@logic.at>
50    
51            * DESCRIPTION: Speed up package loading (via moving packages not
52            strictly necessary for normal operation to Suggests instead of
53            Depends).
54    
55    2009-01-08  Ingo Feinerer  <feinerer@logic.at>
56    
57            * R/reader.R (readNewsgroup): The date format is now configurable.
58    
59    2008-12-20  Ingo Feinerer  <feinerer@logic.at>
60    
61            * R/preprocess.R (convertMboxEml): Fix off-by-one error.
62    
63    2008-12-16  Ingo Feinerer  <feinerer@logic.at>
64    
65            * R/termdocmatrix.R (TermDocMatrix): Sort row indices.
66    
67    2008-12-06  Ingo Feinerer  <feinerer@logic.at>
68    
69            * R/source.R (DataframeSource): New source class for data frames.
70    
71            * R/source.R: Fixed non-standard call evaluation.
72    
73    2008-11-29  Ingo Feinerer  <feinerer@logic.at>
74    
75            * R/source.R (URISource): New source class for a single document.
76    
77    2008-11-27  Ingo Feinerer  <feinerer@logic.at>
78    
79            * R/source.R: Refactoring.
80    
81    2008-11-25  Ingo Feinerer  <feinerer@logic.at>
82    
83            * R/AAA.R (.onLoad, .Last): Use tryCatch() to handle misconfigured
84            Rmpi installations more gracefully.
85    
86    2008-11-08  Ingo Feinerer  <feinerer@logic.at>
87    
88            * R/source.R (Source): Add Length slot.
89    
90    2008-11-06  Ingo Feinerer  <feinerer@logic.at>
91    
92            * R/AAA.R: Unify duplicated .onLoad function.
93    
94    2008-11-03  Ingo Feinerer  <feinerer@logic.at>
95    
96            * DESCRIPTION (Suggests): Added Rmpi.
97    
98    2008-11-02  Ingo Feinerer  <feinerer@logic.at>
99    
100            * R/source.R (getElem): Fix 'no visible binding' warning.
101    
102            * man/WeightFunction.Rd: Fix signature.
103    
104    2008-08-03  Ingo Feinerer  <feinerer@logic.at>
105    
106            * R/weight.R: Introduce name abbreviations for weighting functions.
107    
108    2008-07-24  Ingo Feinerer  <feinerer@logic.at>
109    
110            * R/AAA.R (.onLoad, .Last): Start and stop MPI cluster.
111    
112            * R/cluster.R: Provide convenience functions for using a MPI
113            cluster.
114    
115            * R/termdocmatrix.R (TermDocMatrix): Use MPI cluster if
116            available.
117    
118            * R/textdoccol.R (tmIndex, tmFilter, tmMap): Use MPI cluster if
119            available.
120    
121    2008-07-17  Ingo Feinerer  <feinerer@logic.at>
122    
123            * R/textdoccol.R (lapply): Removed debug print out.
124    
125    2008-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
126    
127            * R/reader.R (readRCV1): Improved meta data extraction from
128            Reuters Corpus Volume 1 documents.
129    
130    2008-05-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
131    
132            * R/transform.R: Ensure that all mappings preserve multiline
133            structures.
134    
135    2008-05-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
136    
137            * R/filter.R: Every filter has now an attribute indicating whether
138            it sould be applied to document level (doclevel).
139    
140            * R/textdoccol.R (tmFilter): Set searchFullText as new default
141            filter.
142    
143    2008-04-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
144    
145            * R/transform.R (replacePatterns): Replaced removeWords by
146            replacePatterns. Suggested by Christian Buchta.
147    
148            * R/textdoccol.R (inspect): Improved formatting.
149    
150    2008-04-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
151    
152            * inst/CITATION: Updated JSS article information.
153    
154            * R/textdoccol.R (setAs): Added coerce method from list to
155            corpus.
156    
157            * R/meta.R (meta): Improved meta data handling.
158    
159    2008-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
160    
161            * R/textdoccol.R (materialize, tmMap): Improvements suggested by
162            Christian Buchta.
163    
164            * inst/CITATION: Added template to include JSS article reference.
165    
166    2008-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
167    
168            * R/textdoccol.R (tmMap): Introduced lazy mapping.
169    
170            * R/source.R: Added VectorSource.
171    
172    2008-02-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
173    
174            * man/: Language codes should be in ISO 639-1 format.
175    
176            * R/textdoccol.R (asPlain): Preserve local meta data.
177    
178    2008-01-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
179    
180            * R/textdoccol.R (writeCorpus): Function for writing a corpus
181            containing plain text documents to disk.
182    
183    2008-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
184    
185            * R/termdocmatrix.R (TermDocMatrix): Ensure that dimnames are
186            always set correctly.
187    
188            * R/textdoccol.R: Set load = TRUE as default for load on demand
189            since in most cases this is the wanted behaviour.
190    
191    2008-01-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
192    
193            * R/: Renamed TextDocCol to Corpus, and Corpus to Content.
194    
195            * DESCRIPTION: Updated Version to 0.3 due to core name changes.
196    
197    2008-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
198    
199            * R/meta.R (meta): New function for consistent access to meta data
200            of document collections, repositories, and texts.
201    
202    2008-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
203    
204            * R/: Better support for encodings.
205    
206    2008-01-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
207    
208            * R/textdoccol.R (TextDocCol): Fixed bug regarding default reader
209            selection when no reader argument is given.
210    
211    2008-01-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
212    
213            * R/source.R (CSVSource): Now uses read.csv instead of scan
214            internally.
215    
216    2008-01-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
217    
218            * R/reader.R (getReaders): Returns available reader functions.
219    
220            * R/termdocmatrix.R (TermDocMatrix): Set new modular constructor
221            as default.
222    
223    2007-12-02  Ingo Feinerer  <h0125130@wu-wien.ac.at>
224    
225            * R/stopwords.R (stopwords): Shortened code, removed codetools
226            variable warnings.
227    
228            * man/: Documentation for showMeta, added an example for tmMap.
229    
230            * inst/doc/tm.Rnw: Updated vignette, comments on MS word reader,
231            some minor typos fixed.
232    
233    2007-12-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
234    
235            * R/aobjects.R (showMeta): Added method for pretty printing a
236            text document's meta data.
237    
238    2007-11-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
239    
240            * R/textdoccol.R (TextDocCol): Better handling of empty
241            arguments.
242    
243            * NAMESPACE: Exported readDOC.
244    
245            * man/completeStems.Rd: Added an example.
246    
247    2007-11-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
248    
249            * R/stopwords.R (stopwords): Look up .dat files at every
250            call. Allows users to modify stopword .dat files interactively.
251    
252    2007-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
253    
254            * R/termdocmatrix.R (termFreq): Correct processing of empty
255            documents.
256    
257    2007-10-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
258    
259            * man/: Updated documentation.
260    
261    2007-10-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
262    
263            * R/complete.R (completeStems): Completes (heuristically) word
264            stems.
265    
266            * R/termdocmatrix.R (TermDocMatrix2): New modular
267            constructor.
268    
269            * NAMESPACE: Exported termFreq.
270    
271    2007-10-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
272    
273            * R/reader.R (readDOC): Added MS Word reader (using antiword).
274    
275    2007-10-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
276    
277            * R/weight.R: Weighting functions for TermDocMatrix.
278    
279    2007-10-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
280    
281            * R/termdocmatrix.R (dimnames, colnames, rownames): Wrapper
282            functions for accessing dimension, column, and row names.
283    
284            * R/plot.R (plot.TermDocMatrix): Plot correlations between terms.
285    
286    2007-09-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
287    
288            * man/removePunctuation.Rd: Added documentation. Function also exported to NAMESPACE.
289    
290    2007-08-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
291    
292            * R/fungen.R: Use S4 class for function generators instead of S3 attributes.
293    
294    2007-07-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
295    
296            * R/reader.R (readPDF): Removed manual checks for pdftotext and
297            pdfinfo. The system call gives a warning anyway.
298    
299    2007-07-28  Ingo Feinerer  <h0125130@wu-wien.ac.at>
300    
301            * R/textdoccol.R (asPlain): Conversion from
302            StructuredTextDocuments to PlainTextDocuments.
303    
304    2007-07-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
305    
306            * R/termdocmatrix.R: Added convenience methods ("[", nrow, ncol)
307            for accessing term-document matrices.
308    
309            * inst/doc/tm.Rnw: readPDF is only called if pdftotext and pdfinfo
310            are installed.
311    
312    2007-07-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
313    
314            * R/termdocmatrix.R (TermDocMatrix): Improved efficiency. Kudos to
315            Christian Buchta.
316    
317    2007-07-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
318    
319            * inst/doc/tm.Rnw: Update vignette (readPDF, readHTML, preprocessReut21578XML).
320    
321    2007-07-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
322    
323            * R/reader.R (readHTML): Added very simple HTML reader to obtain StructuredTextDocuments.
324    
325            * R/reader.R (readPDF): Added PDF reader.
326    
327    2007-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
328    
329            * DESCRIPTION: Moved proxy from Depends to Imports to avoid name clashes.
330    
331            * inst/stopwords/english.dat: Added the term "yes" to stopwords.
332    
333            * R/termdocmatrix.R (dim): dim function for TermDocMatrix.
334    
335            * R/preprocess.R (convertMboxEml): Accepts gzipped mboxes.
336    
337    2007-07-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
338    
339            * R/distmeasure.R (dissimilarity): Replaced dists call from
340            package cba by new dist call from package proxy.
341    
342    2007-07-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
343    
344            * inst/doc/tm.Rnw: Described removeSparseTerms and Dictionary.
345    
346    2007-06-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
347    
348            * R/termdocmatrix.R: require() uses the quietly option to suppress
349            loading messages.
350    
351    2007-06-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
352    
353            * R/dictionary.R: Added dictionary support.
354    
355    2007-06-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
356    
357            * R/aobjects.R: Added classes for Reuters21578 XML and RCV1
358            documents. This simplifies some functions, e.g., asPlain.
359    
360    2007-06-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
361    
362            * inst/doc/tm.Rnw: Fixed some typos in vignette.
363    
364    2007-06-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
365    
366            * R/textdoccol.R (replaceWords): Added method to replace a set of
367            words by a single word. Useful for synonyms.
368    
369    2007-05-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
370    
371            * man/TermDocMatrix.Rd: Fixed documentation on Data slot.
372    
373    2007-05-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
374    
375            * R/termdocmatrix.R (textvector): Small fix for dealing with empty
376            vectors. Thanks to Ariel Maguyon for his error report.
377            (removeSparseTerms): New function to remove columns from a
378            term-document matrix exceeding a sparse factor.
379    
380    2007-05-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
381    
382            * man/tmUpdate.Rd: Corrected documentation on readerControl parameter.
383    
384    2007-05-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
385    
386            * man/sFilter.Rd: Corrected documentation on statement format (use
387            '==' instead of '=').
388    
389    2007-05-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
390    
391            * R/aobjects.R (StructuredTextDocument): Inherits from
392            TextDocument.
393    
394    2007-05-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
395    
396            * R/termdocmatrix.R (findFreqTerms): Perform efficient computation
397            on sparse matrices as proposed by Martin Maechler.
398    
399    2007-04-27  Ingo Feinerer  <h0125130@wu-wien.ac.at>
400    
401            * R/textdoccol.R: Removed \code{dbDisconnect} calls since last
402            \pkg{filehash} version makes them deprecated.
403    
404    2007-04-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
405    
406            * R/termdocmatrix.R (textvector): Stemming is now performed before
407            erasing stopwords.
408            (weightMatrix): Adapted to handle sparse matrices.
409            (TermDocMatrix): Sparse matrix is now efficiently built by
410            direct stepwise insertion of row values into it.
411    
412    2007-04-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
413    
414            * DESCRIPTION: Replaced \pkg{filehashSQLite} with \pkg{filehash}
415            due to ongoing problems. For our purposes the latter is as useful
416            as the replaced package.
417    
418    2007-04-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
419    
420            * man/TextDocCol.Rd: Replaced \code{readPlain} with \code{object@DefaultReader}.
421    
422            * man/TermDocMatrix.Rd: Remove deprecated \code{language} argument.
423    
424    2007-04-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
425    
426            * R/resolve.R (resolveISOCode): Added ISO 639-1 codes for
427            languages with available stopwords.
428    
429    2007-04-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
430    
431            * inst/doc/tm.Rnw: Minor corrections in the vignette.
432    
433    2007-04-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
434    
435            * DESCRIPTION: Update to version 0.2, since a lot of new features
436            have been integrated.
437    
438            * inst/stopwords: Updated existing stopwords and added stopwords
439            for various other languages.
440    
441    2007-04-10  Ingo Feinerer  <h0125130@wu-wien.ac.at>
442    
443            * man/: Updated documentation.
444    
445            * Work/testDb.R: Script to test database stuff.
446    
447            * R/: Fixed various database related bugs. Seems to be rather
448            useable now, i.e., consider as alpha status for now.
449    
450    2007-04-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
451    
452            * R/: Fixed some bugs related to database support.
453    
454    2007-04-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
455    
456            * man/: Added a lot of examples to the manuals.
457    
458    2007-04-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
459    
460            * man/: Updated parts of the documentation.
461    
462            * R/textdoccol.R (asPlain): Added conversion from newsgroup
463            documents to plain text documents.
464    
465    2007-04-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
466    
467            * R/textdoccol.R: Finished experimental database support. Not yet
468            intensively tested.
469    
470            * R/source.R: Now each source has a default reader.
471    
472            * R/reader.R: \code{FunctionGenerator} is now an attribute, not a
473            class anymore.
474    
475            * R/plaintextdoc.R: Custom show method for plain text documents.
476    
477            * R/aobjects.R: Added a class for structured text documents.
478    
479            * R/reader.R: Replaced remaining \code{parser} occurrences with
480            \code{reader}.
481    
482            * R/textdoccol.R (summary): Indent tags.
483    
484            * R/textdoccol.R (removePunctuation): Transform method to remove
485            punctuation marks.
486    
487    2007-03-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
488    
489            * R/textdoccol.R (sFilter): Simplified sFilter significantly by
490            using prescindMeta().
491    
492    2007-03-18  Ingo Feinerer  <h0125130@wu-wien.ac.at>
493    
494            * R/textdoccol.R: Improved database support.
495    
496    2007-03-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
497    
498            * R/termdocmatrix.R (TermDocMatrix): Uses sparse matrices.
499    
500            * R/resolve.R (resolveISOcode): Extracts the language from a ISO
501            language code.
502    
503            * R/textdoccol.R (TextDocCol): Refactored several parser arguments
504            into parserControl argument.
505    
506            * R/aobjects.R (TextDocument): Introduced the "Language" slot.
507    
508    2007-03-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
509    
510            * Work/tmDataSetup.R: The datasets acq and crude can now be
511            created on the fly.
512    
513            * R/stopwords.R: Introduced a function returning the stopwords for
514            a given language (English, German and French at the moment)
515    
516            * R/textdoccol.R (stemDoc): Stemming uses Rstem if available,
517            otherwise falls back to Snowball package.
518    
519    2007-01-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
520    
521            * man/dissimilarity-methods.Rd: Make clear that any method offered
522            by "dists" from package "cba" can be used.
523    
524    2007-01-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
525    
526            * inst/doc/tm.Rnw: Fixed quotes-appearing-as-boxes-bug according
527            to Kurt's latex suggestion. Removed points and underscores in
528            variable names for consistent naming.
529    
530            * DESCRIPTION: Update to version 0.1-2.
531    
532            * man/TextRepository.Rd: Fixed bug in documentation.
533    
534    2007-01-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
535    
536            * DESCRIPTION: Update to version 0.1-1.
537    
538    2007-01-09  Ingo Feinerer  <h0125130@wu-wien.ac.at>
539    
540            * R/textdoccol.R (stemDoc): Use Rstem::wordStem instead of
541            wordStem.
542    
543    2007-01-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
544    
545            * R/: Changes due to Kurt's review.
546    
547    2006-12-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
548    
549            * R/: Implemented improvements based upon comments by David
550            Meyer.
551    
552    2006-12-17  Ingo Feinerer  <h0125130@wu-wien.ac.at>
553    
554            * inst/doc/: Rewrote vignette.
555    
556            * man/: Improved documentation.
557    
558    2006-12-16  Ingo Feinerer  <h0125130@wu-wien.ac.at>
559    
560            * man/: Updated documentation.
561    
562            * DESCRIPTION: Changed package name to "tm". Updated version to
563            0.1 for first CRAN release.
564    
565            * inst/texts/gmane.comp.lang.r.general.mbox: mbox Gmane R mailing
566            list archive example.
567    
568            * inst/texts/gmane.comp.lang.r.gr.rdf: RSS Gmane R mailing list
569            archive example.
570    
571            * R/preprocess.R (convert_mbox_eml): A simple e-mail converter
572            from (several mails per box) mbox format to (single mail per file)
573            eml format.
574    
575    2006-12-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
576    
577            * data/crude.rda: Rebuilt.
578    
579            * data/acq.rda: Rebuilt.
580    
581            * R/reader.R: Factored out reader and parser methods from
582            textdoccol.R.
583    
584            * R/source.R: Factored out Source methods from aobjects.R and
585            textdoccol.R.
586            (GmaneRSource): Encapsulates Gmane R mailing list archive RSS
587            feeds.
588    
589            * R/textdoccol.R (DirSource): Added support for recursive
590            traversal of directories.
591    
592    2006-12-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
593    
594            * R/textdoccol.R ([[): Loads the document corpus automatically
595            into memory upon access.
596            (tm_transform, tm_filter): Removed several checks whether the
597            document is already loaded ([[ ensures this now).
598            (gmane_r_reader): Reader for RSS feeds as provided by the Gmane R
599            mailing list archive.
600    
601    2006-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
602    
603            * R/aobjects.R (TextDocument): Is now a virtual class.
604            (Source): Is now a virtual class.
605    
606    2006-12-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
607    
608            * R/textdoccol.R (c): Support for an arbitrary number of document
609            collections.
610    
611    2006-11-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
612    
613            * R/textrepo.R: Updated TextRepository (constructor), append_elem,
614            append_meta and remove_meta.
615    
616            * R/textdoccol.R: Removed modify_metadata method.
617    
618            * R/textrepo.R: Removed modify_metadata method.
619    
620            * R/textdoccol.R (remove_meta): Supports removal of document
621            collection metadata and document (= in data frame) metadata.
622    
623    2006-11-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
624    
625            * R/textdoccol.R (append_doc): Bug fix for handling empty metadata.
626    
627            * data/crude.rda: Rebuilt.
628    
629            * data/acq.rda: Rebuilt.
630    
631            * inst/doc/textmin.Rnw: Updated vignette to reflect code changes.
632    
633            * R/textdoccol.R ([): Bug fix for subsetting a document
634            collection's data frame.
635    
636    2006-11-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
637    
638            * R/textdoccol.R: Bug fixes in s_filter. Added full query support
639            to s_filter.
640    
641            * R/textdoccol.R: Local text documents' metadata can now be copied
642            to a document collection's data frame with prescind_meta.
643    
644    2006-11-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
645    
646            * R/: Text documents' slot metadata is now accessible in s_filter.
647    
648            * R/: Rewrote s_filter function (has still some restrictions).
649    
650    2006-11-20  Ingo Feinerer  <h0125130@wu-wien.ac.at>
651    
652            * R/: Various fixes in handling metadata.
653    
654            * R/: Added update mechanism for text document collections.
655    
656    2006-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
657    
658            * R/: Merging of document collections now creates a binary tree
659            for reconstructing merged document collections.
660    
661            * R/: Redesign of metadata for document collections.
662    
663    2006-11-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
664    
665            * R/: Messages now use \code{ngettext}.
666    
667    2006-11-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
668    
669            * R/: Added functions for modifying and removing metadata.
670    
671    2006-11-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
672    
673            * man/: Updated some documentation.
674    
675            * R/: Corrected some connection issues.
676    
677            * inst/doc: Worked on the vignette.
678    
679    2006-10-31  Ingo Feinerer  <h0125130@wu-wien.ac.at>
680    
681            * inst/: Added texts and started vignette.
682    
683            * R/: Final changes based upon David's comments.
684    
685    2006-10-29  Ingo Feinerer  <h0125130@wu-wien.ac.at>
686    
687            * NAMESPACE: Corrected exports (generic methods need exportMethods
688            directives!).
689    
690    2006-10-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
691    
692            * R/: Modified the TextDocCol constructur and various parsers. It
693            is now modular and supports various file formats via plugins (see
694            the new "Source" class).
695    
696    2006-10-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
697    
698            * man/: Revised documentation after previous code changes.
699    
700    2006-10-23  Ingo Feinerer  <h0125130@wu-wien.ac.at>
701    
702            * R/: Remaining changes as discussed with David.
703    
704    2006-10-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
705    
706            * R/: Some changes as suggested by David. The rest will follow
707            within the next days.
708    
709    2006-09-26  Ingo Feinerer  <h0125130@wu-wien.ac.at>
710    
711            * man/: Finished documentation.
712    
713    2006-09-25  Ingo Feinerer  <h0125130@wu-wien.ac.at>
714    
715            * man/: Wrote some documentation.
716    
717    2006-09-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
718    
719            * R/: Further syntactic sugar in form of additional assignment and
720            accessor methods.
721    
722    2006-09-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
723    
724            * R/: Syntactic sugar in form of "length", "show" and "summary"
725            operators.
726    
727    2006-08-24  Ingo Feinerer  <h0125130@wu-wien.ac.at>
728    
729            * R/: Diverse updates. Mainly on default operators ("[" or "c")
730            and dissimilarities.
731    
732    2006-08-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
733    
734            * R/: Added similarity functions.
735    
736            * data/: Added english stopwords.
737    
738    2006-08-07  Ingo Feinerer  <h0125130@wu-wien.ac.at>
739    
740            * data/: Examples compiled for new features
741    
742            * R/: Changes due to new structure.
743    
744            * NAMESPACE: Corrected namespace to reflect new structure.
745    
746            * R/termdocmatrix.R: Adapted for new naming scheme.
747    
748    2006-08-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
749    
750            * R/textdoccol.R: Adapted code for new class structure. Wrote
751            several transform and filter functions operating on text document
752            collections (alias text document databases).
753    
754            * R/aobjects.R: Adapted class structure with inheritance,
755            repositories and additional meta data. Loading files on demand is
756            now possible.
757    
758    2006-07-13  Ingo Feinerer  <h0125130@wu-wien.ac.at>
759    
760            * R/: Some cosmetic cleanups.
761    
762            * inst/: Removed vignette on clustering. That and much more is now
763            described in the JSS paper on text mining. Based upon that
764            article an elaborated vignette will be incorporated in the future.
765    
766    2006-07-01  Ingo Feinerer  <h0125130@wu-wien.ac.at>
767    
768            * R/: Updated generic S4 methods to comply with signature changes
769            in newer versions of R (> 2.3)
770    
771    2006-03-12  Ingo Feinerer  <h0125130@wu-wien.ac.at>
772    
773            * ext/R/importRIS.R: Automatic RIS import is now possible.
774    
775    2006-02-14  Ingo Feinerer  <h0125130@wu-wien.ac.at>
776    
777            * R/textdoccol.R: Added RIS HTML input format.
778    
779    2006-01-21  Ingo Feinerer  <h0125130@wu-wien.ac.at>
780    
781            * R/textdoccol.R: Removed bug that caused invalid text document
782            collections when handling many input files.
783    
784    2006-01-11  Ingo Feinerer  <h0125130@wu-wien.ac.at>
785    
786            * R/textdoccol.R: Restructured and extended file import
787            mechanism.
788    
789            * inst/doc/clustering.Rnw: Adapted vignette for use with
790            ReutNews.rda
791    
792            * man/ReutNews.Rd: Documentation for ReutNews.rda
793    
794            * data/ReutNews.rda: A tiny Reuters21578 example data set.
795    
796    2005-12-22  Ingo Feinerer  <h0125130@wu-wien.ac.at>
797    
798            * inst/doc/clustering.Rnw: Wrote a small vignette to present the
799            clustering facilities of this package.
800    
801    2005-12-15  Ingo Feinerer  <h0125130@wu-wien.ac.at>
802    
803            * R/aobjects.R: Changed package document structure to avoid class
804            dependency problems.
805    
806    2005-12-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
807    
808            *  Wrote a script for the ModLewis Split for the Reuters-21578 XML
809            data set.
810    
811            *  Finished documentation and reordered directory structure. Now "R
812            CMD check textmin" works without errors.
813    
814    2005-12-04  Ingo Feinerer  <h0125130@wu-wien.ac.at>
815    
816            * src/: Various splits can now be easily created for the
817            Reuters21578 data set.
818    
819    2005-12-03  Ingo Feinerer  <h0125130@wu-wien.ac.at>
820    
821            *  Updated documentation
822    
823    2005-11-30  Ingo Feinerer  <h0125130@wu-wien.ac.at>
824    
825            *  Wrote R documentation for some classes and methods.
826    
827    2005-11-19  Ingo Feinerer  <h0125130@wu-wien.ac.at>
828    
829            * R/textdoccol.R: Constructor of textdoccol allows import of CSV
830            files. See the questionnaire data/Umfrage.csv for such an example.
831            We are now able to import files in Reuters-21578 XML format.
832    
833            *  Changed class interfaces in various files. Weighting of the text
834            matrix is now possible.
835    
836    2005-11-08  Ingo Feinerer  <h0125130@wu-wien.ac.at>
837    
838            * R/textdoccol.R: One can build term-document matrices if
839            nessecary (with buildTDM(...)) and fill the field tdm from a text
840            document collection with it.
841    
842            * R/textmatrix.R: Wrote S4 class for term-document matrices.
843    
844    2005-11-06  Ingo Feinerer  <h0125130@wu-wien.ac.at>
845    
846            * R/textdoccol.R: We now can read in a whole XML file with several
847            news items.
848    
849  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>  2005-11-05  Ingo Feinerer  <h0125130@wu-wien.ac.at>
850    
851          * R/textdoccol.R: Set up an S4 class for a collection of text          * R/textdoccol.R: Set up an S4 class for a collection of text

Legend:
Removed from v.17  
changed lines
  Added in v.923

root@r-forge.r-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business Powered By FusionForge