This application simulates a high-velocity stream of events (impressions, clickthroughs, conversions) that are enriched and ingested. The client generates these events randomly, but they represent the kind of stream that would be received from web traffic.
The "TrackEvent" stored procedure processes these events. It looks up the corresponding advertiser and campaign based on the creative ID, which identifies the ad that was shown. It also retrieves the corresponding web site and page based on the inventory ID from the event. The timestamp and event type fields are converted to aid in aggregation, and all of this data is then inserted into the impression_data table.
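The enrichment step described above can be sketched in plain Java. This is only an illustration, not the actual TrackEvent procedure: the in-memory maps stand in for the lookup tables, and the record names, the event type codes (0, 1, 2), the truncation of the timestamp to the minute, and the 0/1 flag columns are all assumptions made for the example. It needs Java 16 or later because it uses records.

```java
import java.util.HashMap;
import java.util.Map;

// Plain-Java illustration of the enrichment step; not the actual VoltDB procedure.
class TrackEventSketch {
    // Hypothetical stand-ins for rows of the creative and inventory lookup tables.
    record Creative(int campaignId, int advertiserId) {}
    record Inventory(int siteId, int pageId) {}

    // Hypothetical shape of an enriched row destined for impression_data.
    record EnrichedEvent(long minuteTs, int advertiserId, int campaignId, int creativeId,
                         int siteId, int pageId,
                         int isImpression, int isClickthrough, int isConversion) {}

    static final Map<Integer, Creative> CREATIVES = new HashMap<>();
    static final Map<Integer, Inventory> INVENTORY = new HashMap<>();

    // Assumed event type codes: 0 = impression, 1 = clickthrough, 2 = conversion.
    static EnrichedEvent enrich(long timestampMillis, int eventType,
                                int creativeId, int inventoryId) {
        Creative creative = CREATIVES.get(creativeId);
        Inventory inventory = INVENTORY.get(inventoryId);
        if (creative == null || inventory == null) {
            throw new IllegalArgumentException("unknown creative or inventory ID");
        }
        // Assumption: the timestamp is truncated to the start of its minute.
        long minuteTs = timestampMillis - Math.floorMod(timestampMillis, 60_000L);
        // Assumption: the event type becomes three 0/1 flags that are easy to sum.
        return new EnrichedEvent(minuteTs, creative.advertiserId(), creative.campaignId(),
                creativeId, inventory.siteId(), inventory.pageId(),
                eventType == 0 ? 1 : 0, eventType == 1 ? 1 : 0, eventType == 2 ? 1 : 0);
    }

    public static void main(String[] args) {
        CREATIVES.put(7, new Creative(70, 700));
        INVENTORY.put(3, new Inventory(30, 300));
        System.out.println(enrich(125_000L, 1, 7, 3));
    }
}
```

Turning the event type into separate 0/1 columns means an aggregation only has to sum each column to count impressions, clickthroughs and conversions.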
Several views maintain real-time aggregations on this table to provide a per-minute summary for each advertiser, plus drill-down reports grouped by campaign and creative to show detail-level metrics, costs and rates with real-time accuracy.
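As a rough picture of what a per-minute, per-advertiser summary holds, here is a minimal Java sketch that keeps running totals in memory. It is not how the views are defined in this application. The class and method names, the event type codes, and the two rate formulas (clickthroughs per impression, conversions per clickthrough) are assumptions for illustration.

```java
import java.util.Map;
import java.util.TreeMap;

// In-memory illustration of a per-advertiser, per-minute summary; not the actual views.
class MinuteSummarySketch {
    // Running totals for one (advertiser, minute) bucket.
    static final class Totals {
        long impressions;
        long clicks;
        long conversions;

        // Assumed definition: clickthroughs divided by impressions.
        double clickThroughRate() {
            return impressions == 0 ? 0.0 : (double) clicks / impressions;
        }

        // Assumed definition: conversions divided by clickthroughs.
        double conversionRate() {
            return clicks == 0 ? 0.0 : (double) conversions / clicks;
        }
    }

    private final Map<String, Totals> buckets = new TreeMap<>();

    private static String key(int advertiserId, long minuteTs) {
        return advertiserId + "@" + minuteTs;
    }

    // Assumed event type codes: 0 = impression, 1 = clickthrough, 2 = conversion.
    void record(int advertiserId, long minuteTs, int eventType) {
        Totals t = buckets.computeIfAbsent(key(advertiserId, minuteTs), k -> new Totals());
        switch (eventType) {
            case 0: t.impressions++; break;
            case 1: t.clicks++; break;
            case 2: t.conversions++; break;
            default: throw new IllegalArgumentException("unknown event type " + eventType);
        }
    }

    // Returns the totals for a bucket, or empty totals if nothing was recorded.
    Totals totals(int advertiserId, long minuteTs) {
        return buckets.getOrDefault(key(advertiserId, minuteTs), new Totals());
    }

    public static void main(String[] args) {
        MinuteSummarySketch summary = new MinuteSummarySketch();
        for (int i = 0; i < 4; i++) {
            summary.record(1, 0L, 0); // four impressions
        }
        summary.record(1, 0L, 1);     // one clickthrough
        System.out.println(summary.totals(1, 0L).clickThroughRate()); // prints 0.25
    }
}
```

Because the totals are updated as each event arrives, a report reads the current bucket instead of rescanning the raw events.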
Several new SQL features in VoltDB are demonstrated in this application, including:
The code is divided into projects:
See below for instructions on running these applications. For any questions, please contact firstname.lastname@example.org.
Before running these scripts, install VoltDB 4.0 (Enterprise or Community) or later and add the voltdb-$(VERSION)/bin directory to your PATH environment variable, for example:
Start the web server
Start the database and client
Open a web browser to http://hostname:8081
To stop the demo:
Stop the client (if it hasn't already completed)
Stop the database
Stop the web server
You can control various characteristics of the demo by modifying the parameters passed into the InvestmentBenchmark Java application in the "client" function of the run.sh script.
Speed & Duration:
    --duration=120 (benchmark duration in seconds)
    --autotune=true (true = ignore rate limit, run at max throughput until latency is impacted; false = run at the specified rate limit)
    --ratelimit=20000 (when autotune=false, run up to this rate of requests/second)
Metadata volumes and ratios:
    --sites=100 (number of web sites where ad events may occur)
    --pagespersite=10 (number of pages per web site)
    --advertisers=100 (number of advertisers)
    --campaignsperadvertiser=10 (number of campaigns per advertiser)
    --creativespercampaign=5 (number of creatives or banners per campaign)
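To get a feel for how much metadata these ratios produce, the totals can be worked out by multiplication. The Java sketch below does this for the values shown above. The class and method names are hypothetical, and it assumes the counts multiply out uniformly (every site has the same number of pages, and so on).

```java
// Works out metadata totals from the per-parent ratios; names are hypothetical.
class MetadataVolumes {
    static long totalPages(int sites, int pagesPerSite) {
        return (long) sites * pagesPerSite;
    }

    static long totalCampaigns(int advertisers, int campaignsPerAdvertiser) {
        return (long) advertisers * campaignsPerAdvertiser;
    }

    static long totalCreatives(int advertisers, int campaignsPerAdvertiser,
                               int creativesPerCampaign) {
        return totalCampaigns(advertisers, campaignsPerAdvertiser) * creativesPerCampaign;
    }

    public static void main(String[] args) {
        System.out.println(totalPages(100, 10));        // prints 1000
        System.out.println(totalCampaigns(100, 10));    // prints 1000
        System.out.println(totalCreatives(100, 10, 5)); // prints 5000
    }
}
```

Under that assumption, the values shown give 1,000 pages, 1,000 campaigns and 5,000 creatives.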
Before running this demo on a cluster, make the following changes:
On each server, edit the run.sh file to set the HOST variable to the name of the first server in the cluster:
On each server, edit db/deployment.xml to change hostcount from 1 to the actual number of servers:
On each server, start the database
On one server, edit the run.sh script to set the SERVERS variable to a comma-separated list of the servers in the cluster.
Run the client script: