Process Fragments Discovery from Emails: Functional, Data and Behavioral Perspectives Discovery

Introduction

    On this site, we provide more detailed experimental results on business process (BP) discovery from emails with respect to multiple perspectives (i.e., functional, data, organizational and behavioral). We first give more details about the emails in the evaluation dataset. Then, we report the results obtained for the two key phases of our approach: (1) event log generation, and (2) event log mining.

Experiments data

    To discover BP fragments, we first used Enron emails sent by nine employees to different collaborators. The following table describes the data used for discovering activities. For each employee, it shows: (1) his/her name, (2) his/her organizational role, (3) his/her business role in Enron, (4) the number of his/her sent emails (only emails with non-empty main bodies were considered), and (5) the IDs of the analysed emails.

    Then, to generate a structured event log for the event log mining phase, activity patterns were used to detect activity occurrences in: (1) emails sent by the same nine employees, (2) emails sent by employees who frequently receive activity-oriented emails, and (3) emails belonging to the same threads as the discovered activities.

    ID | Employee Name | Organizational Role | Business Role | Number of analysed emails | IDs of the analysed emails (download)
    E1 | John Forney | Managing Director | Trading | 343 | allOriginIds_m__forney.txt
    E2 | David Portz | Senior Counsel | Legal | 102 | allOriginIds_david_portz.txt
    E3 | Vince Kaminski | Managing Director | Risk Management | 2283 | allOriginIds_vince_kaminski.txt
    E4 | Shirley Crenshaw | Assistant | Risk Management | 357 | allOriginIds_shirley_crenshaw.txt
    E5 | Daren Farmer | Manager | Logistics | 738 | allOriginIds_daren_farmer.txt
    E6 | Megan Parker | Specialist | Settlements | 108 | allOriginIds_megan_parker.txt
    E7 | Mary Poorman | Specialist | Logistics | 100 | allOriginIds_mary_poorman.txt
    E8 | Robert Cotten | Employee | Employee | 158 | allOriginIds_robert_cotten.txt
    E9 | Aimee Lannou | Specialist | Logistics | 80 | allOriginIds_aimee_lannou.txt

First Step Towards Event Log Generation: Unsupervised Learning For Activity Discovery

Visualization of Activities Per Employee

    In this section, we provide the graph visualizations obtained for each employee after analysing his/her emails to discover his/her activities. We additionally provide, for each employee, a CSV file that reports for each discovered activity: (i) its frequency, (ii) the IDs of the emails where it appeared, (iii) the patterns assigned to it, and (iv) the employees receiving activity-related emails, along with their number of occurrences in the email interlocutor lists.

    Click here to display/hide the activities of the employee E1
    Click here to download the activities of the employee E1
    Click here to display/hide the activities of the employee E2
    Click here to download the activities of the employee E2
    Click here to display/hide the activities of the employee E3
    Click here to download the activities of the employee E3
    Click here to display/hide the activities of the employee E4
    Click here to download the activities of the employee E4
    Click here to display/hide the activities of the employee E5
    Click here to download the activities of the employee E5
    Click here to display/hide the activities of the employee E6
    Click here to download the activities of the employee E6
    Click here to display/hide the activities of the employee E7
    Click here to download the activities of the employee E7
    Click here to display/hide the activities of the employee E8
    Click here to download the activities of the employee E8
    Click here to display/hide the activities of the employee E9
    Click here to download the activities of the employee E9

Event Log Generation

    The second phase in our approach aims to generate a structured event log from an email log. Each event in our event log has the following attributes:

    We provide our generated event log with its events organized by thread and email ID. This event log (combined with our thread structure) is a dictionary (JSON) file with the following format: dic_allThreads[threadkey] = {emailKey: (email.timestamp, email.sender, email.to, email.cc, email.body, email.subject, email.relation, email.activities, email.mainBody, email.ID, email.BDValues, email.relation), emailKey ∈ dic_allThreads[threadkey].keys()}

    The attribute 'email.relation' denotes the relation of an email to the previous email in its thread (in chronological sending order). Each activity in email.activities is a tuple of the form: (activity label, activity pattern, activity speech act, pattern origin in email, verb position of activity in email, activity sentence).
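As an illustration only, the following Python sketch shows how a log in the format above might be traversed to list activity occurrences. It assumes that the JSON file stores each email entry as a list in the field order given above (JSON has no tuples), and that timestamps are strings. The names `Activity` and `iter_events`, the sample keys, and all sample values are hypothetical placeholders, not part of the published files.

```python
import json
from typing import NamedTuple


class Activity(NamedTuple):
    """One entry of email.activities, in the tuple order described above."""
    label: str
    pattern: str
    speech_act: str
    pattern_origin: str
    verb_position: int
    sentence: str


# Positions of the fields used here inside each email entry (order as listed above).
TIMESTAMP, SENDER, ACTIVITIES, EMAIL_ID = 0, 1, 7, 9


def iter_events(dic_all_threads):
    """Yield one (thread key, email ID, timestamp, sender, Activity) per occurrence."""
    for thread_key, emails in dic_all_threads.items():
        for fields in emails.values():
            for raw in fields[ACTIVITIES]:
                yield (thread_key, fields[EMAIL_ID], fields[TIMESTAMP],
                       fields[SENDER], Activity(*raw))


# Hypothetical one-email log; every value below is a placeholder, not real data.
sample_json = json.dumps({
    "thread_1": {
        "email_1": [
            "2001-03-01T09:30:00", "sender@example.com", ["to@example.com"], [],
            "full body", "subject", "first",
            [["create deal", "create deal{numeric}", "request", "mainBody", 3,
              "Please create deal 123."]],
            "main body", "email_1", {}, "first",
        ]
    }
})

dic_allThreads = json.loads(sample_json)
events = list(iter_events(dic_allThreads))
for thread_key, email_id, timestamp, sender, activity in events:
    print(thread_key, email_id, timestamp, sender, activity.label, activity.speech_act)
```

To use the sketch on the provided file, the `json.loads(sample_json)` call would be replaced by loading the downloaded JSON file, provided its layout matches the assumptions stated above.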

    We also provide a sublog extract for each discovered activity to give an idea of its functional aspect. The following table describes the sublogs of all activities used in our work. For each activity, it shows: (1) its ID, (2) its label (as discovered by our activity discovery approach), (3) the size of its sublog (number of events, i.e. N_Occ), (4) the precision of activity occurrence discovery (Pr_Occ), (5) the precision of speech act discovery (Pr_SA), (6) the number of senders frequently involved (N_senders), (7) the activity sublog in xlsx format, (8) a dictionary file detailing the assignment of employees to their groups (Groups), (9) a dictionary file detailing the visualization labels associated with each employee (Employee_Viz_Labels), and (10) the resulting visualization of the corresponding actor perspective (in png format).

    For each activity, the provided sublog extract reports for each event the following attributes: (1) emailID, (2) email timestamp, (3) email sender, (4) activity name, (5) activity speech act, and (6) the sentence where the activity occurred to better understand its context of use.
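The six sublog attributes listed above can be sketched as a small record type, with events grouped into one sublog per activity. This is a minimal illustration, not the procedure used to produce the provided xlsx files: the names `SublogRow` and `build_sublogs` are hypothetical, timestamps are assumed to be ISO 8601 strings (so that string order is chronological), and all sample values are placeholders.

```python
from collections import defaultdict
from typing import NamedTuple


class SublogRow(NamedTuple):
    """One event of a sublog extract, with the six attributes listed above."""
    email_id: str
    timestamp: str  # assumed ISO 8601, so string order is chronological
    sender: str
    activity: str
    speech_act: str
    sentence: str


def build_sublogs(rows):
    """Group rows by activity name and sort each group chronologically."""
    sublogs = defaultdict(list)
    for row in rows:
        sublogs[row.activity].append(row)
    for group in sublogs.values():
        group.sort(key=lambda r: r.timestamp)
    return dict(sublogs)


# Placeholder events; not taken from the published sublogs.
rows = [
    SublogRow("m3", "2001-03-02T10:15:00", "sender_a", "enter deal", "inform",
              "I entered deal 456."),
    SublogRow("m1", "2001-03-01T09:30:00", "sender_b", "enter deal", "request",
              "Please enter deal 123."),
    SublogRow("m2", "2001-03-01T11:00:00", "sender_a", "create deal", "request",
              "Can you create deal 789?"),
]

sublogs = build_sublogs(rows)
# Sublog size per activity (the N_Occ column of the table).
sizes = {name: len(group) for name, group in sublogs.items()}
print(sizes)
```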

    ID | Activity | Size (N_Occ) | Pr_Occ | Pr_SA | N_senders | Sublog | Groups | Employee_Viz_Labels | Actor Perspective Visualization
    1 | flow deal{gas}{price} | 189 | 0.95 ± 0.07 | 0.95 ± 0.04 | 11 | subLog1.xlsx | dic_groups_el1.txt | actors_labels_el1.text | el1
    2 | change_convert deal{numeric}{price}{ticket}{pricing}{counterparty} | 118 | 0.98 ± 0.02 | 0.92 ± 0.04 | 5 | subLog2.xlsx | dic_groups_el2.txt | actors_labels_el2.text | el2
    3 | enter deal{numeric}{ticket}{meter}{purchase}{wellhead} | 117 | 1.0 ± 0.0 | 0.9 ± 0.09 | 5 | subLog3.xlsx | dic_groups_el3.txt | actors_labels_el3.text | el3
    4 | create deal{numeric}{ticket} | 113 | 1.0 ± 0.0 | 0.93 ± 0.08 | 4 | subLog4.xlsx | dic_groups_el4.txt | actors_labels_el4.text | el4
    5 | sell_trade numeric0mw{pricenumeric}{hour0end}{he0numeric}{object}{power} | 99 | 1.0 ± 0.0 | 0.91 ± 0.0 | 1 | subLog5.xlsx | dic_groups_el5.txt | actors_labels_el5.text | el5
    6 | set_put_fix_determine_adjust deal{numeric} | 94 | 0.94 ± 0.05 | 0.76 ± 0.14 | 4 | subLog6.xlsx | dic_groups_el6.txt | actors_labels_el6.text | el6
    7 | conduct interview{telephone}{informal}{eb0numeric} | 93 | 0.98 ± 0.04 | 0.9 ± 0.04 | 2 | subLog7.xlsx | dic_groups_el7.txt | actors_labels_el7.text | el7
    8 | extend deal{numeric}{rest} | 90 | 1.0 ± 0.0 | 0.8 ± 0.22 | 5 | subLog8.xlsx | dic_groups_el8.txt | actors_labels_el8.text | el8
    9 | see_determine_check_view deal{numeric} | 83 | 0.89 ± 0.11 | 0.85 ± 0.1 | 5 | subLog9.xlsx | dic_groups_el9.txt | actors_labels_el9.text | el9
    10 | send_attach resume{version}{electronic} | 81 | 0.88 ± 0.03 | 0.94 ± 0.03 | 3 | subLog10.xlsx | dic_groups_el10.txt | actors_labels_el10.text | el10
    11 | allocate volume0management{deal}{contract}{meter_mtr}{numeric} | 71 | 1.0 ± 0.0 | 0.78 ± 0.01 | 3 | subLog11.xlsx | dic_groups_el11.txt | actors_labels_el11.text | el11
    12 | purchase_buy gas{plant} | 69 | 0.9 ± 0.06 | 0.92 ± 0.01 | 2 | subLog12.xlsx | dic_groups_el12.txt | actors_labels_el12.text | el12
    13 | attach spreadsheet | 62 | 1.0 ± 0.0 | 1.0 ± 0.0 | 2 | subLog13.xlsx | dic_groups_el13.txt | actors_labels_el13.text | el13
    14 | attach_transmit_send agreement{sheet}{discussion}{cover}{draft}{document} | 60 | 0.85 ± 0.24 | 0.97 ± 0.0 | 2 | subLog14.xlsx | dic_groups_el14.txt | actors_labels_el14.text | el14
    15 | set_determine_arrange interview{phone} | 60 | 0.82 ± 0.1 | 1.0 ± 0.0 | 2 | subLog15.xlsx | dic_groups_el15.txt | actors_labels_el15.text | el15
    16 | make decision{deal_trade} | 59 | 0.69 ± 0.4 | 0.9 ± 0.0 | 2 | subLog16.xlsx | dic_groups_el16.txt | actors_labels_el16.text | el16
    17 | handle_cover deal{numeric}{meter_mtr} | 58 | 1.0 ± 0.0 | 0.82 ± 0.07 | 3 | subLog17.xlsx | dic_groups_el17.txt | actors_labels_el17.text | el17
    18 | show deal | 57 | 0.82 ± 0.14 | 0.76 ± 0.18 | 4 | subLog18.xlsx | dic_groups_el18.txt | actors_labels_el18.text | el18
    19 | forward resume{request}{associate0program} | 52 | 0.93 ± 0.04 | 1.0 ± 0.0 | 2 | subLog19.xlsx | dic_groups_el19.txt | actors_labels_el19.text | el19
    20 | send_attach_ship transport0contract{deal}{term} | 49 | 0.83 ± 0.0 | 0.79 ± 0.0 | 1 | subLog20.xlsx | dic_groups_el20.txt | actors_labels_el20.text | el20
    21 | invite_receive personnelpronoun{audience_interview}{meeting}{iris}{site}{preliminary} | 49 | 0.85 ± 0.0 | 0.88 ± 0.0 | 1 | subLog21.xlsx | dic_groups_el21.txt | actors_labels_el21.text | el21
    22 | set_arrange_put_order_rank_prepare_determine_define_adjust personname{meeting}{assistant} | 44 | 0.76 ± 0.0 | 0.85 ± 0.0 | 1 | subLog22.xlsx | dic_groups_el22.txt | actors_labels_el22.text | el22
    23 | revise numeric{meter}{deal}{volume}{nomination}{effective} | 41 | 1.0 ± 0.0 | 0.95 ± 0.0 | 1 | subLog23.xlsx | dic_groups_el23.txt | actors_labels_el23.text | el23
    24 | sell_trade natural0gas{plant}{counterparty} | 41 | 1.0 ± 0.0 | 0.89 ± 0.0 | 1 | subLog24.xlsx | dic_groups_el24.txt | actors_labels_el24.text | el24
    25 | make change | 40 | 1.0 ± 0.0 | 0.82 ± 0.17 | 2 | subLog25.xlsx | dic_groups_el25.txt | actors_labels_el25.text | el25
    26 | schedule locname{zone}{load} | 37 | 1.0 ± 0.0 | 0.94 ± 0.0 | 1 | subLog26.xlsx | dic_groups_el26.txt | actors_labels_el26.text | el26
    27 | add deal{numeric} | 36 | 0.89 ± 0.09 | 0.88 ± 0.11 | 2 | subLog27.xlsx | dic_groups_el27.txt | actors_labels_el27.text | el27
    28 | use_apply model_simulation{case0study}{example_model}{case_example} | 35 | 0.52 ± 0.0 | 0.87 ± 0.0 | 1 | subLog28.xlsx | dic_groups_el28.txt | actors_labels_el28.text | el28
    29 | purchase_buy numeric0mw{pricenumeric}{hour0end} | 34 | 1.0 ± 0.0 | 0.92 ± 0.0 | 1 | subLog29.xlsx | dic_groups_el29.txt | actors_labels_el29.text | el29
    30 | make reservation{hotel}{dinner} | 33 | 1.0 ± 0.0 | 0.79 ± 0.08 | 2 | subLog30.xlsx | dic_groups_el30.txt | actors_labels_el30.text | el30
    31 | make presentation{topic}{student}{energy0derivative}{university}{progress} | 30 | 0.88 ± 0.0 | 0.9 ± 0.0 | 1 | subLog31.xlsx | dic_groups_el31.txt | actors_labels_el31.text | el31
    32 | long numeric0mw{hour0end}{orgname0book}{pricenumeric}{offpeak}{deal} | 27 | 0.96 ± 0.0 | 0.96 ± 0.0 | 1 | subLog32.xlsx | dic_groups_el32.txt | actors_labels_el32.text | el32
    33 | schedule_reschedule interview{telephone} | 27 | 0.95 ± 0.06 | 0.95 ± 0.03 | 2 | subLog33.xlsx | dic_groups_el33.txt | actors_labels_el33.text | el33
    34 | keep_hold personnelpronoun | 26 | - | - | 0 | subLog34.xlsx | - | - | -
    35 | schedule_reschedule meeting{assistant} | 26 | 1.0 ± 0.0 | 0.92 ± 0.06 | 2 | subLog35.xlsx | dic_groups_el35.txt | actors_labels_el35.text | el35
    36 | send_attach info_information{unit}{numeric} | 26 | 0.96 ± 0.07 | 0.91 ± 0.06 | 2 | subLog36.xlsx | dic_groups_el36.txt | actors_labels_el36.text | el36
    37 | wrap_roll deal{numeric} | 24 | 1.0 ± 0.0 | 0.86 ± 0.12 | 2 | subLog37.xlsx | dic_groups_el37.txt | actors_labels_el37.text | el37
    38 | reach cell0phone | 24 | 1.0 ± 0.0 | 0.82 ± 0.11 | 2 | subLog38.xlsx | dic_groups_el38.txt | actors_labels_el38.text | el38
    39 | run oomc0mean{calc} | 23 | 0.95 ± 0.0 | 0.94 ± 0.0 | 1 | subLog39.xlsx | dic_groups_el39.txt | actors_labels_el39.text | el39
    40 | see file | 21 | 0.92 ± 0.07 | 1.0 ± 0.0 | 2 | subLog40.xlsx | dic_groups_el40.txt | actors_labels_el40.text | el40
    41 | pay cost_price{balance0energy} | 20 | 1.0 ± 0.0 | 0.79 ± 0.0 | 1 | subLog41.xlsx | dic_groups_el41.txt | actors_labels_el41.text | el41
    42 | receive price{numeric0mw}{balance0energy} | 20 | 0.92 ± 0.0 | 0.82 ± 0.0 | 1 | subLog42.xlsx | dic_groups_el42.txt | actors_labels_el42.text | el42
    43 | attend meeting | 20 | 1.0 ± 0.0 | 1.0 ± 0.0 | 2 | subLog43.xlsx | dic_groups_el43.txt | actors_labels_el43.text | el43
    44 | bill pricenumeric{deal} | 19 | 1.0 ± 0.0 | 0.89 ± 0.0 | 1 | subLog44.xlsx | dic_groups_el44.txt | actors_labels_el44.text | el44
    45 | receive gas | 19 | 0.36 ± 0.0 | 0.75 ± 0.0 | 1 | subLog45.xlsx | dic_groups_el45.txt | actors_labels_el45.text | el45
    46 | settle_resolve issue | 18 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog46.xlsx | dic_groups_el46.txt | actors_labels_el46.text | el46
    47 | use_apply deal{numeric} | 18 | 0.85 ± 0.12 | 0.91 ± 0.08 | 2 | subLog47.xlsx | dic_groups_el47.txt | actors_labels_el47.text | el47
    48 | expire deal | 18 | 1.0 ± 0.0 | 1.0 ± 0.0 | 2 | subLog48.xlsx | dic_groups_el48.txt | actors_labels_el48.text | el48
    49 | deliver gas | 18 | 0.92 ± 0.0 | 1.0 ± 0.0 | 1 | subLog49.xlsx | dic_groups_el49.txt | actors_labels_el49.text | el49
    50 | receive_incur gas | 18 | 0.71 ± 0.0 | 1.0 ± 0.0 | 1 | subLog50.xlsx | dic_groups_el50.txt | actors_labels_el50.text | el50
    51 | see_understand_determine_check gas | 17 | 0.88 ± 0.0 | 0.43 ± 0.0 | 1 | subLog51.xlsx | dic_groups_el51.txt | actors_labels_el51.text | el51
    52 | develop_produce_prepare_evolve example_model{model_simulation} | 17 | 0.91 ± 0.0 | 1.0 ± 0.0 | 1 | subLog52.xlsx | dic_groups_el52.txt | actors_labels_el52.text | el52
    53 | pay pricenumeric{txu} | 16 | 1.0 ± 0.0 | 0.83 ± 0.0 | 1 | subLog53.xlsx | dic_groups_el53.txt | actors_labels_el53.text | el53
    54 | forward deal{ticket} | 16 | - | - | 0 | subLog54.xlsx | - | - | -
    55 | supply_provide info_information | 16 | 1.0 ± 0.0 | 0.86 ± 0.0 | 1 | subLog55.xlsx | dic_groups_el55.txt | actors_labels_el55.text | el55
    56 | invite personnelpronoun | 16 | 1.0 ± 0.0 | 0.62 ± 0.0 | 1 | subLog56.xlsx | dic_groups_el56.txt | actors_labels_el56.text | el56
    57 | invite personname | 16 | 1.0 ± 0.0 | 0.85 ± 0.0 | 1 | subLog57.xlsx | dic_groups_el57.txt | actors_labels_el57.text | el57
    58 | write_publish book | 16 | 1.0 ± 0.0 | 0.71 ± 0.0 | 1 | subLog58.xlsx | dic_groups_el58.txt | actors_labels_el58.text | el58
    59 | adjust_set volume{numeric}{wellhead} | 15 | 1.0 ± 0.0 | 0.79 ± 0.1 | 2 | subLog59.xlsx | dic_groups_el59.txt | actors_labels_el59.text | el59
    60 | check_break object | 15 | 0.83 ± 0.0 | 0.8 ± 0.0 | 1 | subLog60.xlsx | dic_groups_el60.txt | actors_labels_el60.text | el60
    61 | send_direct_attach request | 15 | 0.67 ± 0.0 | 1.0 ± 0.0 | 1 | subLog61.xlsx | dic_groups_el61.txt | actors_labels_el61.text | el61
    62 | replace deal_trade{sale}{power}{counterparty} | 14 | 1.0 ± 0.0 | 0.83 ± 0.0 | 1 | subLog62.xlsx | dic_groups_el62.txt | actors_labels_el62.text | el62
    63 | short numeric0mw{he0numeric} | 14 | 0.92 ± 0.0 | 0.92 ± 0.0 | 1 | subLog63.xlsx | dic_groups_el63.txt | actors_labels_el63.text | el63
    64 | read_translate_learn information | 13 | 0.64 ± 0.0 | 0.86 ± 0.0 | 1 | subLog64.xlsx | dic_groups_el64.txt | actors_labels_el64.text | el64
    65 | put_assign contract | 13 | 0.6 ± 0.0 | 0.67 ± 0.0 | 1 | subLog65.xlsx | dic_groups_el65.txt | actors_labels_el65.text | el65
    66 | submit request{access} | 13 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog66.xlsx | dic_groups_el66.txt | actors_labels_el66.text | el66
    67 | keep_continue desk | 12 | 0.57 ± 0.0 | 1.0 ± 0.0 | 1 | subLog67.xlsx | dic_groups_el67.txt | actors_labels_el67.text | el67
    68 | schedule volume | 12 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog68.xlsx | dic_groups_el68.txt | actors_labels_el68.text | el68
    69 | cut deal | 12 | 1.0 ± 0.0 | 0.9 ± 0.0 | 1 | subLog69.xlsx | dic_groups_el69.txt | actors_labels_el69.text | el69
    70 | submit schedule | 12 | 1.0 ± 0.0 | 0.67 ± 0.0 | 1 | subLog70.xlsx | dic_groups_el70.txt | actors_labels_el70.text | el70
    71 | coordinate date0time{schedule}{interview} | 12 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog71.xlsx | dic_groups_el71.txt | actors_labels_el71.text | el71
    72 | respond delay | 12 | 1.0 ± 0.0 | 0.82 ± 0.0 | 1 | subLog72.xlsx | dic_groups_el72.txt | actors_labels_el72.text | el72
    73 | leave_result line_note{middle}{majority}{area_arena} | 11 | - | - | 0 | subLog73.xlsx | - | - | -
    74 | use_apply transport0contract{pipe} | 11 | 0.86 ± 0.0 | 1.0 ± 0.0 | 1 | subLog74.xlsx | dic_groups_el74.txt | actors_labels_el74.text | el74
    75 | make business | 11 | - | - | 0 | subLog75.xlsx | - | - | -
    76 | produce_acquire_develop_grow_evolve_create cost_price{power0market}{index_power} | 11 | - | - | 0 | subLog76.xlsx | - | - | -
    77 | leave voice | 11 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog77.xlsx | dic_groups_el77.txt | actors_labels_el77.text | el77
    78 | serve_attend conference | 11 | 0.89 ± 0.0 | 1.0 ± 0.0 | 1 | subLog78.xlsx | dic_groups_el78.txt | actors_labels_el78.text | el78
    79 | send_commit_attach presentation{slide} | 11 | 0.91 ± 0.0 | 1.0 ± 0.0 | 1 | subLog79.xlsx | dic_groups_el79.txt | actors_labels_el79.text | el79
    80 | execute agreement | 10 | 1.0 ± 0.0 | 0.89 ± 0.0 | 1 | subLog80.xlsx | dic_groups_el80.txt | actors_labels_el80.text | el80
    81 | expect volume | 10 | 1.0 ± 0.0 | 0.75 ± 0.0 | 1 | subLog81.xlsx | dic_groups_el81.txt | actors_labels_el81.text | el81
    82 | pay supplier{pipeline} | 10 | 1.0 ± 0.0 | 0.75 ± 0.0 | 1 | subLog82.xlsx | dic_groups_el82.txt | actors_labels_el82.text | el82
    83 | provide_leave cost_price | 10 | 0.67 ± 0.0 | 1.0 ± 0.0 | 1 | subLog83.xlsx | dic_groups_el83.txt | actors_labels_el83.text | el83
    84 | show_indicate numeric0mw{resource0plan} | 10 | 0.88 ± 0.0 | 0.86 ± 0.0 | 1 | subLog84.xlsx | dic_groups_el84.txt | actors_labels_el84.text | el84
    85 | complete transaction{agreement} | 10 | 1.0 ± 0.0 | 0.67 ± 0.0 | 1 | subLog85.xlsx | dic_groups_el85.txt | actors_labels_el85.text | el85
    86 | cover_compensate cost_price{trip} | 10 | 0.9 ± 0.0 | 0.78 ± 0.0 | 1 | subLog86.xlsx | dic_groups_el86.txt | actors_labels_el86.text | el86
    87 | support_confirm wholesale0power{trading} | 10 | 0.57 ± 0.0 | 0.25 ± 0.0 | 1 | subLog87.xlsx | dic_groups_el87.txt | actors_labels_el87.text | el87
    88 | stay_remain_delay_continue line_pipeline{balance} | 9 | - | - | 0 | subLog88.xlsx | - | - | -
    89 | pathe deal | 9 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog89.xlsx | dic_groups_el89.txt | actors_labels_el89.text | el89
    90 | send_attach file | 9 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog90.xlsx | dic_groups_el90.txt | actors_labels_el90.text | el90
    91 | make contribution_part{field_theater}{area_field} | 9 | 0.5 ± 0.0 | 1.0 ± 0.0 | 1 | subLog91.xlsx | dic_groups_el91.txt | actors_labels_el91.text | el91
    92 | register conference | 9 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog92.xlsx | dic_groups_el92.txt | actors_labels_el92.text | el92
    93 | organize_prepare_coordinate power0point{power0market}{conference}{risk0management} | 9 | 0.5 ± 0.0 | 1.0 ± 0.0 | 1 | subLog93.xlsx | dic_groups_el93.txt | actors_labels_el93.text | el93
    94 | evaluate_assess project{model}{approximation_estimate} | 9 | 1.0 ± 0.0 | 0.89 ± 0.0 | 1 | subLog94.xlsx | dic_groups_el94.txt | actors_labels_el94.text | el94
    95 | note deal | 8 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog95.xlsx | dic_groups_el95.txt | actors_labels_el95.text | el95
    96 | review comment | 8 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog96.xlsx | dic_groups_el96.txt | actors_labels_el96.text | el96
    97 | reflect deal | 8 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog97.xlsx | dic_groups_el97.txt | actors_labels_el97.text | el97
    98 | reserve_hold eb0numeric | 8 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog98.xlsx | dic_groups_el98.txt | actors_labels_el98.text | el98
    99 | tie_attach transaction{line_note} | 8 | 0.57 ± 0.0 | 1.0 ± 0.0 | 1 | subLog99.xlsx | dic_groups_el99.txt | actors_labels_el99.text | el99
    100 | see_visit_realize_learn project | 8 | - | - | 0 | subLog100.xlsx | - | - | -
    101 | agree_match_fit_accord contract | 7 | 1.0 ± 0.0 | 0.8 ± 0.0 | 1 | subLog101.xlsx | dic_groups_el101.txt | actors_labels_el101.text | el101
    102 | attach schedule | 7 | 1.0 ± 0.0 | 1.0 ± 0.0 | 1 | subLog102.xlsx | dic_groups_el102.txt | actors_labels_el102.text | el102

Event Log Mining

    This phase aims to mine our generated event log in order to discover artifacts, BP fragments and the corresponding perspectives: functional, data, organizational and behavioral perspectives.

    In what follows, we provide our experiment results in relation to: