Widgets
- Corpus
- Import Documents
- Create Corpus
- The Guardian
- NY Times
- Pubmed
- Twitter
- Wikipedia
- Preprocess Text
- Corpus to Network
- Bag of Words
- Document Embedding
- Similarity Hashing
- Sentiment Analysis
- Tweet Profiler
- Topic Modelling
- LDAvis
- Corpus Viewer
- Score Documents
- Word Cloud
- Concordance
- Document Map
- Word Enrichment
- Duplicate Detection
- Word List
- Extract Keywords
- Annotated Corpus Map
- Ontology
- Semantic Viewer
- Collocations
- Statistics
Moving Transform
Compute aggregations over a sliding window, consecutive blocks or time periods of time series.
Inputs
- Time series: Time series (e.g. one provided by As Timeseries widget).
Outputs
- Time series: Data with computed aggregations
- Define the method for forming blocks of data:
- Sliding window: use a sliding window of the specified width. The function is often used for smoothing the data.
- Window width: the number of instances in a window
- Data output: Discard original data outputs only computed columns; Keep original data also keeps the original data except for the first N-1 instances (where N is the window width), which occur before the first complete window; Include leading instances also includes the first N-1 instances, but with missing values for entries in the computed columns.
- Consecutive blocks: aggregates data within consecutive blocks
- Block width: the number of instances in block
- Data output: Discard original data will output only aggregated columns; Keep first instance, Keep middle instance and Keep last instance will take the first, middle or last data instance as the representative for the block.
- Aggregate time periods: aggregates data based on time periods (years, months, days, hours, minutes, seconds) or in the same month of year (12 instances), week of year (52 instances), day of year (365 or 366 instances), day of month (~31 instances), day of week (7 instances), hour of day (24 instances). This functionality replaces the Aggregate widget.
- Sliding window: use a sliding window of the specified width. The function is often used for smoothing the data.
- If Apply Automatically is ticked, changes are communicated automatically. Alternatively, click Apply.
- Variable selection:
- Filter provides a shortcut for searching variables by (a part of) the name. Start typing the variable name to select it from the list.
- Alternatively, select the variable from the variable list. You can use Ctrl (Cmd) key, or dragging across the list, to select multiple variables and apply the same aggregation on them.
- Show only numeric variables hides non-numeric variables from the variable list.
- A list of possible aggregations:
- Mean value
- Sum
- Product
- Minimum
- Maximum
- Span
- Median
- Mode
- Standard deviation
- Variance
- Linear MA
- Exponential MA
- Harmonic mean
- Geometric mean
- Non-zero count: count non-zero values
- Defined count: count the number of non-missing values
- Cumulative sum
- Cumulative product
Examples
A simple use case for Moving Transform is applying a smoothing function over the timeseries. We used the Yahoo Finance to retrieve the daily stock value for Amazon (AMZN) and passed the data to Moving Transform. To get a 5-day moving average, we used a rolling window with mean aggregation. In Line Chart, we can see the initial (blue) values of High variable are now smoothed (red line).
Another way of using Moving Transform is to aggregated the data by a given period. Let’s say we wish to observe the yearly averages for the Cyber security breaches data from the Datasets widget. We set breach_start as our time variable in As Timeseries. In Moving Transform, we aggregated by years, and used mode for US State and Type of Breach variables and sum for the Individuals Affected variable. In the Data Table we see the aggregated data. There are 14 instances, each representing a single year.