medical: Dataset generated from medical reports

Description

Multilabel dataset from the text domain.

Usage

medical

Format

An mldr object with 978 instances, 1449 input attributes and 45 labels (1494 attributes in total)

Source

Crammer, K. and Dredze, M. and Ganchev, K. and Talukdar, P. P. and Carroll, S., "Automatic Code Assignment to Medical Text", in Proc. Workshop on Biological, Translational, and Clinical Language Processing, Prague, Czech Republic, BioNLP07, pp. 129-136, 2007

Examples

## Not run: 
library(mldr.datasets)
toBibtex(medical)   # BibTeX entry for the source publication
medical$measures    # summary measures of the dataset

## End(Not run)

Example output

Attaching package: 'mldr.datasets'

The following object is masked from 'package:stats':

    density

[1] "@inproceedings{,\n  title = \"Automatic Code Assignment to Medical Text\",\n  author = \"Crammer, K. and Dredze, M. and Ganchev, K. and Talukdar, P. P. and Carroll, S.\",\n  booktitle = \"Proc. Workshop on Biological, Translational, and Clinical Language Processing,  Prague, Czech Republic, BioNLP07\",\n  year = \"2007\",\n  pages = \"129--136\"\n}"
$num.attributes
[1] 1494

$num.instances
[1] 978

$num.inputs
[1] 1449

$num.labels
[1] 45

$num.labelsets
[1] 94

$num.single.labelsets
[1] 33

$max.frequency
[1] 155

$cardinality
[1] 1.245399

$density
[1] 0.02767553

$meanIR
[1] 89.50136

$scumble
[1] 0.04705599

$scumble.cv
[1] 3.043226

$tcs
[1] 15.62859
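
Several of the measures printed above are related by simple arithmetic. The following is a minimal sketch that checks those relationships. It uses values copied by hand from the example output rather than the dataset itself, so it needs no packages. The list `m` and the names `attrs_ok`, `density_from_card` and `total_assignments` are illustrative, not part of mldr.datasets. The density relationship assumes the standard multilabel definition (cardinality divided by the number of labels).

```r
# Values copied from the example output; `m` is a hypothetical stand-in
# for medical$measures so the sketch runs without mldr.datasets.
m <- list(
  num.attributes = 1494,
  num.instances  = 978,
  num.inputs     = 1449,
  num.labels     = 45,
  cardinality    = 1.245399,
  density        = 0.02767553
)

# Total attributes = input features + label columns.
attrs_ok <- m$num.attributes == m$num.inputs + m$num.labels

# Density is cardinality normalised by the number of labels
# (assumed standard definition).
density_from_card <- m$cardinality / m$num.labels

# Cardinality is the mean number of labels per instance, so multiplying
# by the instance count approximates the total number of label assignments.
total_assignments <- round(m$cardinality * m$num.instances)

print(attrs_ok)
print(density_from_card)
print(total_assignments)
```

With the package installed, the same checks could be run against `medical$measures` directly in place of the hand-copied list.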

mldr.datasets documentation built on May 2, 2019, 3:43 p.m.