This plugin integrates WordPress with AWS Pinpoint and provides an extensible tracker out of the box.
It also automatically integrates with the WP Consent Level API plugin or the client side only version maintained by Human Made. At least the statistics-anonymous
category must be consented to for any tracking to occur and statistics
is required for tracking personally identifiable information.
Once installed the plugin will queue up an analytics tracker script that provides a few client side functions you can use:
Altis.Analytics.onReady( callback <function> )
Use this function to ensure analytics has loaded before making calls to registerAttribute()
or record()
.
Altis.Analytics.updateEndpoint( data <object> )
Updates the data associated with the current user. Use this to provide updated custom user attributes and metrics, a user ID, and demographic data.
Important: If used in conjunction with the WP Consent API all data passed to this function under the User
property is removed. You should only store personally identifiable information under the User.UserId
and User.UserAttributes
properties. All other endpoint data as outlined further down should only be used for anonymous demographic data.
Altis.Analytics.getEndpoint()
Returns the current endpoint data object.
Altis.Analytics.record( eventName <string> [, data <object> [, endpoint <object>]] )
Records an event. The data passed in should be an object with either or both an attributes
property and metrics
property:
{
attributes: {
name: 'value', // <string>
// ...
},
metrics: {
name: 1 // <number>
// ...
},
}
The optional 3rd parameter allows you to simulataneously update the endpoint data as if calling Altis.Analytics.updateEndpoint()
. These attributes, metrics and endpoint data can be later queried via Elasticsearch.
Altis.Analytics.updateAudiences()
Synchronises the current audiences associated with the page session. You shouldn't ever need to call this manually but it is called any time updateEndpoint()
, registerAttribute()
or registerMetric()
are called. You can hook into the updateAudiences
event to respond to changes in this data.
Altis.Analytics.getAudiences()
Retrieves an array of the audience IDs for the current page session.
Altis.Analytics.registerAttribute( name <string>, value <string | callback> )
Sometimes you may want to record a dynamic attribute value for all events on the page. The registerAttribute()
function allows this. Values must be a single string. If a function is passed as the value will be evaluated at the time an event recorded. Promises are supported as a return value.
Altis.Analytics.registerMetric( name <string>, value <number | callback> )
Similar to registerAttribute()
above but for numbers.
Altis.Analytics.on( event <string>, callback <callback> ) : EventListener
Attaches and returns an event listener. The available events and their callback arguments are:
updateEndpoint
Called any time the current endpoint data is updated. The callback receives the endpoint object.
Altis.Analytics.on( 'updateEndpoint', function ( endpoint ) { console.log( endpoint.Demographic ); // { Platform: 'Mac OS', .... } } );
record
: Called any time an event is recorded. The callback receives the pinpoint event object.
Altis.Analytics.on( 'record', function ( event ) { console.log( event.Attributes, event.event_type ); // { referer: '', ... }, 'pageView' } );
updateAudiences
Called any time the audiences are updated. The callback receives an array of audience IDs.
Altis.Analytics.on( 'updateAudiences', function ( audiences ) { console.log( audiences ); // [ 1, 2, 3, ... ] } );
Altis.Analytics.off( listener <EventListener> )
Removes an event listener returned by Altis.Analytics.on()
.
ALTIS_ANALYTICS_ELASTICSEARCH_URL
Allows you to define the Elasticsearch server URL directly.
ALTIS_ANALYTICS_LOG_QUERIES
Define as true to enable logging queries to the error log.
ALTIS_ANALYTICS_FALLBACK_CAPS
By default, Altis Analytics will grant any role who can edit pages the ability to edit audiences. You can explicitly remove the capabilities from a role (i.e. 'edit_audiences' => false
), but in some cases you may wish to remove this fallback entirely.
Define as false to disable the capability fallback to page capabilities.
ALTIS_ANALYTICS_PINPOINT_BUCKET_ARN
If you have the Pinpoint Kinesis Firehose set to back up data to S3, defining this constant allows for data to be cleaned from the backups periodically.
ALTIS_ANALYTICS_PINPOINT_BUCKET_REGION
The region for the Kinsesis Firehose S3 bucket (if configured).
The plugin provides a few hooks for you to control the default endpoint data and attributes recorded with events.
altis.analytics.consent_enabled <bool>
This defaults to true if the WP Consent API plugin or a derivative is installed and active by checking if the constant WP_CONSENT_API_URL
is defined.
altis.analytics.data.endpoint <array>
Allows you to provide server-side data used to update the visitors associated endpoint with custom attributes and demographic data. Mostly useful for providing extra data for logged in sessions such as IP based location data.
altis.analytics.data.attributes <array>
Allows you to provide server-side data recorded for all events on the page. Useful if you need to query records based on the page context.
altis.analytics.data.metrics <array>
Similar to the altis.analytics.data.attributes
filter but allows you to pass metrics based on server-side data.
altis.analytics.data <array>
Filters the entire array passed to the client side.
altis.analytics.elasticsearch.url <string>
Filters the Elasticsearch server URL.
altis.analytics.noop <bool>
Returning false
from this filter will prevent any events or updated endpoint data from being sent to Pinpoint. The built in usage for this is to prevent logging events on page previews.
altis.analytics.max_index_age <int>
Filter the maximum number of days to keep real time stats available for. The default number of days is the maximum of 90, after which data is removed. This is important for streamlining your user's privacy.
Insights and aggregated analytics data can be calculated, updated and stored in the database in cases where you wish to retain information for longer periods of time such as number of page views.
altis.analytics.max_s3_backup_age <int>
Filter the maximum number of days to keep backup data for. The default number of days is 90, in accordance with AWS Pinpoint, after which time data is removed. This is important for streamlining your users' privacy. There is no upper limit on this value, however you should make sure any long term data storage is explained to users when opting in to tracking.
altis.analytics.exclude_bots <bool>
This defaults to true but can switched off to allow tracking bots that can run JavaScript, note this will affect your data.
You can check if a recorded event was created by a bot by checking if the attributes.isBot
value exists.
altis.analytics.elasticsearch.timeout <int>
Filter the amount of seconds to wait for a repsonse from Elasticsearch. The default value is 20
seconds. The minimum value is 5
seconds and the maximum is 30
seconds.
Altis\Analytics\Utils\query( array $query, array $params = [] ) : array
Queries the analytics data with the Elasticsearch Query DSL provided in $query
. $params
will be added as query string parameters to the Elasticsearch request URL.
Altis\Analytics\Utils\get_elasticsearch_url() : string
Get the Elasticsearch server URL.
Altis\Analytics\Utils\milliseconds() : integer
Get the current time since the unix epoch in milliseconds.
Altis\Analytics\Utils\merge_aggregations( array $current, array $new, string $bucket_type = '' ) : array
A utility function for merging aggregations returned by Elasticsearch queries. This is necessary to keep a store of your analytics data as Elasticsearch indexes can be rotated or lost.
Data can be queried from Elasticsearch, providing a powerful tool for filtering and aggregating records. If you are using Altis requests are automatically signed.
It is recommended to use the Altis\Analytics\Utils\query()
function for this.
A user session covers every event recorded between opening the website and closing it. For every event recorded the following data is recorded depending on the scope.
event_type
: The type of event recorded, eg.pageView
,click
,_session.start
or_session.stop
.event_timestamp
: The timestamp in milliseconds of when the event was recorded on the site.attributes
date
: ISO-8601 standard date string.session
: Unique ID across all page views.pageSession
: Unique ID for one page view.url
: The current page URL.hash
: The current URL hash.referer
: The page referer.network
: The current network's primary URL.networkId
: The current network's Id.blog
: The current site URL.blogId
: The current blog ID.qv_utm_campaign
: The Urchin Tracker campaign from the query string if set.qv_utm_source
: The Urchin Tracker source from the query string if set.qv_utm_medium
: The Urchin Tracker medium from the query string if set.qv_*
: Any query string parameters will be recorded with the prefixqv_
.- Any attributes added via the
altis.analytics.data.attributes
filter. - Any attributes added via
Altis.Analytics.registerAttribute()
or passed toAltis.Analytics.record()
.
metrics
scrollDepthMax
: Maximum scroll depth on page so far. Percentage value between 1-100.scrollDepthNow
: Scroll depth at time of event. Percentage value between 1-100.elapsed
: Time elapsed in milliseconds since the start of the page view.day
: The day of the week, 1 being Sunday through to 7 being Saturday.hour
: The hour of the day in 24 hour format.month
: The month of the year.- Any metrics added via
Altis.Analytics.registerMetric()
or passed toAltis.Analytics.record()
.
endpoint
Id
: A unique UUID for the endpoint.Address
: An optional target for push notifications such as an email address or phone number.OptOut
: The push notification channels this visitor has opted out of. Defaults to "ALL".Attributes
- Any custom attributes associated with this endpoint.
Metrics
sessions
: Number of separate browsing sessions for this endpoint.pageViews
: Number of total page views for this endpoint.- Any custom metrics associated with the endpoint.
Demographic
AppVersion
: Current application version, can be provided via thealtis.analytics.data
filter.Locale
: Locale code of the endpoint, derived from the browser.Make
: Make of the current browser / browser engine eg. "Blink".Model
: Model of the current browser eg "Chrome"ModelVersion
: Browser version.Platform
: The device operating system.PlatformVersion
: The operating system version.
Location
Country
: The endpoint's country if known / available.City
: The endpoint's city if known or available.
User
UserAttributes
- Any custom attributes associated with the user if known.
UserId
: An ID associated with the user in your application. Useful for linking endpoints across devices.
session
session_id
: Persists for a subsession, triggered by page visibility changes. Recorded with_session.start
and_session.stop
events.start_timestamp
: Time in milliseconds when the subsession started. Recorded with_session.start
events.stop_timestamp
: Time in milliseconds when the subsession ended. Recorded with_session.stop
events.duration
: Duration in milliseconds for a subsession. Recorded with_session.stop
events.
<?php
$result = Altis\Analytics\Utils\query( [
// Don't return any hits to keep the response size small.
'size' => 0,
// Restrict the results to a single page we're interested in.
'query' => [
'bool' => [
'filter' => [
[ 'term' => [ 'attributes.url.keyword' => 'https://example.com/page' ] ]
]
]
],
// Aggregate data sets
'aggs' => [
'sessions' => [
// Create buckets by page session ID so all records in each bucket belong
// to a single user and page session.
'terms' => [
'field' => 'attributes.pageSession.keyword',
// Use data from the top 100 unique page sessions.
// By default terms aggregations return the top 10 hits.
'size' => 100
],
// Sub aggregations.
'aggs' => [
// Get the time on page by summing all session.duration values for the page session.
'time_on_page' => [
'sum' => [ 'field' => 'session.duration' ]
]
]
],
// Create a stats aggregation for all the time on page values found above.
'stats' => [
'stats_bucket' => [
'buckets_path' => 'sessions>time_in_page'
]
]
],
// Order by latest events.
'order' => [ 'event_timestamp' => 'desc' ]
] );
The output will look something like the following:
{
"took": 15,
"timed_out": false,
"_shards": {
"total": 5,
"successful": 5,
"skipped": 0,
"failed": 0
},
"hits": {
"total": 329,
"max_score": 0
},
"aggregations": {
"sessions": {
"doc_count_error_upper_bound": 0,
"sum_other_doc_count": 0,
"buckets": [
{ "...": "..." }
]
},
"stats": {
"count": 92,
"min": 0,
"max": 4963998,
"avg": 78251.14130434782,
"sum": 7199105
}
}
}
You can further trim the size of the returned response using the filter_path
query parameter. For example if we're only interested in the stats aggregation we can set filter_path=-aggregations.sessions
to remove it from the response.
Audiences are user-defined categories of users, based on conditions related to their analytics data.
Audiences allow for the creation of conditions to narrow down event queries or endpoints but also can be used for determining effects on the client side.
To enable the use of any event record data in the audience editor it needs to be mapped to a human readable label using the Altis\Analytics\Audiences\register_field()
function:
use function Altis\Analytics\Audiences\register_field;
add_action( 'init', function () {
register_field(
'endpoint.Location.Country', // The Elasticsearch field to query.
__( 'Country' ), // A label for the field.
__( 'The visitor country.' ) // Optional description for the field.
[ // Optional field arguments
'options' => '\\Altis\\Analytics\\Utils\\get_countries', // A callback to provide prepopulated list of options.
'disable_free_text' => false, // Whether to allow free text to be used or to restrict to available options.
]
);
} );
In the above example the 1st parameter endpoint.Location.Country
represents the field in the event record to query against. Other examples include attributes.qv_utm_campaign
or endpoint.User.UserAttibrutes.custom
for example.
The 2nd parameter is a human readable label for the audience field, and the 3rd is the human readable description that goes below the field UI.
The 4th parameter is an optional arguments array, which can include:
options
which is a callback that returns a list of valid options, that will complement existing data for that field.disable_free_text
is a boolean to allow/restrict the user to set custom strings rather than choose from the list.
A specific infrastructure set up is required to use this plugin:
- AWS Pinpoint Project
- Event stream configured to point to below Kinesis Firehose
- Associated Cognito Identity Pool ID
- AWS Kinesis Firehose
- Backing up to AWS S3 Bucket
- Sending data to AWS Elasticsearch Instance to an index called
analytics
To use the plugin from a direct clone you will need to run the build steps:
npm install
npm run build
To use the dev server do:
npm run start
You must define the following constants:
define( 'ALTIS_ANALYTICS_PINPOINT_ID', '...' );
define( 'ALTIS_ANALYTICS_PINPOINT_REGION', '...' );
define( 'ALTIS_ANALYTICS_COGNITO_ID', '...' );
define( 'ALTIS_ANALYTICS_COGNITO_REGION', '...' );
If you wish to test locally you can use the following constants to override the service endpoints:
define( 'ALTIS_ANALYTICS_PINPOINT_ENDPOINT', 'your-pinpoint-endpoint.com' );
define( 'ALTIS_ANALYTICS_COGNITO_ENDPOINT', 'your-cognito-endpoint.com' );
The humanmade/local-pinpoint
and humanmade/local-cognito
docker images provide a local version of the AWS Pinpoint API, limited to just the necessary methods.
docker pull humanmade/local-pinpoint
docker run -d \
--name local-pinpoint \
-e ELASTICSEARCH_HOST=<your elasticsearch instance url> \
-p 3000 \
humanmade/local-pinpoint
docker pull humanmade/local-cognito
docker run -d \
--name local-cognito \
-p 3000:3001 \
humanmade/local-cognito
You can then point the above endpoint constants to http://localhost:3000
and http://localhost:3001
respectively.
Made with ❤️ by Human Made