Tags:
Owner:
thomas.guyet@irisa.fr
Temporal (Negative) Sequential pattern mining algorithm
Please log in to perform a job with this app.
TENSE is an algorithm to extract negative sequential patterns from dataset of sequences. A negative sequential pattern is a pattern negative itemsets, ie specifying absence of event. Negative temporal patterns add numerical temporal constraints about durations between successive events.
The TENSE algorithm combines the mining of negative temporal patterns (NegPSpan) with a density clustering algorithm (MAFIA algorithm). It enables to add maxgap constraints.
Our software enables to evaluate alternative configurations of TENSE pattern extraction:
The algorithm process dataset of sequences. Possible file extensions are .dat
or .txt
.
The input format of a dataset is is the IBM format. Each line of the file represents an itemset. The line give, in that order, the sequence id, the itemset timestamp, the size of the itemset and the set of the items.
The following example illustrates the encoding of the dataset (see embeddings example):
1 5 4
0 1 1 1
0 2 1 3
0 3 1 2
0 4 1 5
0 5 1 4
1 1 1 1
1 2 2 2 3
1 3 1 5
1 4 1 4
2 1 1 1
2 2 1 2
2 3 1 5
2 4 1 4
3 1 1 1
3 2 1 5
3 3 1 4
cneg
: use NegPSpan (eNSP otherwise)f
: minimum support (number of transactions), for eNSP it is the minimum support for positive patternsn
: minimum support of negative patterns for eNSP algorithmsmg
: max gap constraintMW
: maximum number of windowsmW
: minimum number of windows alpha
: density threshold for dense unitbeta
: merging thresholdThis online version prevents from too long processes or too heavy memory requirements (for fair use of our servers). The following additional setting can not by modified:
In input :
In output :
16/03/2018 : Version 0.1, initial version
This app id is : 175
This curl command will create a job, and return your job url, and also the average execution time
files and/or dataset are optionnal, think to remove them if not wantedcurl -H 'Authorization: Token token=<your_private_token>' -X POST -F job[webapp_id]=175 -F job[param]="" -F job[queue]=standard -F files[0]=@test.txt -F files[1]=@test2.csv -F job[file_url]=<my_file_url> -F job[dataset]=<my_dataset_name> https://allgo.inria.fr/api/v1/jobs
Then, check your job to get the url files with :
curl -H 'Authorization: Token token=<your_private_token>' -X GET https://allgo.inria.fr/api/v1/jobs/<job_id>