Because CLAMP-Cancer is a stand-alone Eclipse plugin, its folder structure is similar to that of other Eclipse plugins.
Configuration Folder: This folder contains CLAMP-Cancer configuration files.
StartCLAMP: This is the launching point for the CLAMP-Cancer GUI. On Windows it is an executable file; on Mac it is an application.

Workspace Folder:

This folder contains seven sub-folders:

  1. ComponentLibrary: contains the components used in machine learning feature extraction and NLP functions.
  2. MyCorpus: contains the customized corpus built by users.
  3. MyPipeline: contains the customized pipeline created by users for clinical notes processing.
  4. PipelineLibrary: contains the built-in pipelines ready to use for a series of common clinical applications.
  5. Log: contains the CLAMP-Cancer run-time log files.
  6. Metadata: contains the metadata used by CLAMP-Cancer.
  7. Resources: contains third-party libraries. Currently it has two items:
    1. CRFSuite: the CRF implementation used for Named Entity Recognition tasks
    2. Umls_index: the Lucene index built for CLAMP-Cancer based on the UMLS thesaurus.
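
As a quick sanity check of an installation, the layout above can be verified with a short script. The following is only a minimal sketch, not part of CLAMP-Cancer: the helper name `missing_workspace_folders` is hypothetical, and the folder names are taken from the list above on the assumption that they match the on-disk spelling exactly.

```python
from pathlib import Path

# Sub-folder names as listed above; the exact on-disk spelling is an assumption.
EXPECTED_SUBFOLDERS = [
    "ComponentLibrary",
    "MyCorpus",
    "MyPipeline",
    "PipelineLibrary",
    "Log",
    "Metadata",
    "Resources",
]


def missing_workspace_folders(workspace):
    """Return the expected sub-folder names that are not directories under `workspace`."""
    root = Path(workspace)
    return [name for name in EXPECTED_SUBFOLDERS if not (root / name).is_dir()]
```

Calling `missing_workspace_folders("path/to/workspace")` with the location of your own workspace folder returns an empty list when all seven sub-folders are present, and otherwise the names that could not be found.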

If you want to use UMLS terminologies, you will need a UMLS account. If you do not have one, please follow the following link to create a UMLS account.
The following table lists libraries included in CLAMP-Cancer.

Libraries included in CLAMP-Cancer
