Substring selection for biomedical document classification.