Introduced in PET: An Annotated Dataset for Process Extraction from Natural Language Text2022
The dataset contains 45 documents containing narrative description of business process and their annotations. Annotated with activities, gateways, actors, and flow information.
Each document is composed of three files:
Doc_name.txt (Process description in CONLL format)
Doc_name.process-elements.IOB2.txt (Process elements annotated with IOB2 Schema in CONLL format)
Doc_name.relations.tsv (Process relations between process elements. Each line is a triplette (source, relation tag, target). Source and target are in the form: n_sent_x words range.)