Evaluation1: Evaluation of Word Alignment Quality


View source: R/Evaluation1.R

Description

Computes the Precision, Recall, AER, and F-measure metrics used to evaluate the quality of a word alignment.

Usage

Evaluation1(file_gold = "gold.RData",
            file_align = "alignment.-1.3.RData",
            agn = c("my.agn", "an.agn"), alpha = 0.3)

Arguments

file_gold

the gold standard file name.

file_align

the alignment file name.

agn

a character string, either "my.agn" or "an.agn". Use "my.agn" to evaluate a one-to-many word alignment produced by the word_alignIBM1 function in this package. Use "an.agn" to evaluate word alignment results produced by other software.

alpha

a number that sets the trade-off between Precision and Recall in the F-measure.
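
To make the role of alpha concrete, the weighted F-measure of Fraser and Marcu (2007), listed in the References, can be written as below. That Evaluation1 uses exactly this form is an assumption based on that reference, not something this page states; A denotes the alignment being evaluated and S the gold standard links.

```latex
F(A, S, \alpha) =
  \frac{1}{\dfrac{\alpha}{\mathrm{Precision}(A, S)} + \dfrac{1 - \alpha}{\mathrm{Recall}(A, S)}}
```

Under this form, alpha = 0.5 gives the balanced harmonic mean of Precision and Recall, values below 0.5 (such as the default 0.3) weight Recall more heavily, and values above 0.5 weight Precision more heavily.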

Details

To evaluate word alignment quality, we need a "reference alignment" (a gold standard for the word alignment) of a test set. The gold standard file name must be given in file_gold so that the gold standard can be read into R and compared with the word alignment results.
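
As a rough illustration of what comparing an alignment with a gold standard involves, the sketch below computes the metrics from sets of alignment links, following the definitions in Och and Ney (2003) and the weighted F-measure in Fraser and Marcu (2007). It is not the code of Evaluation1: the helper alignment_metrics, the "source-target" link encoding, and the toy data are all hypothetical, and it assumes that the gold standard separates sure links (S) from possible links (P) and that neither A nor S is empty.

```r
# Hypothetical helper for illustration only; not part of the word.alignment package.
# A: links proposed by the aligner, S: sure gold links, P: possible gold links.
# Each link is encoded as a string "sourcePosition-targetPosition" (an assumption).
alignment_metrics <- function(A, S, P = S, alpha = 0.3) {
  P   <- union(P, S)                  # every sure link is also a possible link
  a_s <- length(intersect(A, S))      # |A intersect S|
  a_p <- length(intersect(A, P))      # |A intersect P|
  precision <- a_p / length(A)        # share of proposed links that are acceptable
  recall    <- a_s / length(S)        # share of sure links that were found
  aer       <- 1 - (a_s + a_p) / (length(A) + length(S))
  f_measure <- 1 / (alpha / precision + (1 - alpha) / recall)
  list(Precision = precision, Recall = recall, AER = aer, F_measure = f_measure)
}

# Toy data: three sure links, one extra possible link, four proposed links.
S <- c("1-1", "2-2", "3-4")
P <- c(S, "3-3")
A <- c("1-1", "2-2", "3-3", "4-4")

m <- alignment_metrics(A, S, P, alpha = 0.3)
print(m)
```

With the toy data, two of the four proposed links are sure and a third is possible, so Precision is 3/4, Recall is 2/3, and AER is 1 - 5/7.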

Value

A list with the following components:

Recall

A decimal number.

Precision

A decimal number.

AER

A decimal number.

F_measure.PS

A decimal number.

F_measure.S

A decimal number.

Author(s)

Neda Daneshgar and Majid Sarmad.

References

Fraser A., Marcu D. (2007), "Measuring Word Alignment Quality for Statistical Machine Translation.", Computational Linguistics, 33(3), 293-303.

Koehn P. (2010), "Statistical Machine Translation.", Cambridge University Press, New York.

Och F., Ney H. (2003), "A Systematic Comparison of Various Statistical Alignment Models.", Computational Linguistics, 29(1), 19-51 (ACL Anthology J03-1002).

Wang X. "Evaluation of Two Word Alignment Systems.", Final Thesis, Department of Computer and Information Science.

See Also

cons.agn, align_test.set, word_alignIBM1


word.alignment documentation built on Nov. 17, 2017, 6:17 a.m.