Agile Videos

Automated testing for analytics

About this video

TDD is great for testing code logic and small parts of an application. Most teams are pretty good at this.

However, what happens when your application logic depends on data? Maybe you have a reporting/analytics system or a complex CMS. Business logic varies with the shape of the data. Unit testing alone falls short. Integration and functional testing become more difficult and costly.

To solve this problem, we developed a multi-layer approach to testing our analytics application. We combined unit, integration, and functional tests with multiple test data sets to balance test coverage and maintenance costs.

Transcript

0:00
Brief introduction. My name is John Reuning. I work for CA Technologies in the Rally Software Group. Rally Software was acquired by CA about nine months ago, and so I’m part of that organization.

0:12
To give you guys some context on what we do, I have a couple of questions for you.

0:19
How many of you love data?

0:22
Okay, how many of you hate data?

0:26
There we go. Yes, me too. I have a love-hate relationship with data, especially when it comes to how the behavior of an application varies with the shape and the nature of the data.

0:37
And I’m going to talk a little bit about the Lookback API application within the Rally stack. The Lookback API is something that allows people to access time series data for all of the user stories, defects, tasks, etc. in the Rally tool.

0:56
What this means is there’s a huge amount of data in there. And there’s an ETL application that takes all of the data from our main product database, where everybody interacts with the user stories, tasks, and defects, and moves it into the Lookback API database, which keeps almost every version of every artifact. And so what we have here is kind of an indicator of the flow of the data: it starts off in our main product database, then gets shuffled over through an ETL process into the Lookback API database, and then on into an aggregation and analytics system that has the data going out the door to users based on their requests.

1:39
So the question is, how do you test something like this? I think the community has more or less figured out testing within an application, with unit tests and integration tests, where the code is the behavior of the application. But what happens when the behavior of the application varies with the shape of the data? You have to include a bunch of data in there. And so I’m going to talk a little bit about how we have a multi-layered approach to testing a data-heavy application stack.

2:10
And so what you see here, basically, is that these circles indicate the focus for the different levels of testing. We have very traditional unit and integration tests: we cover methods, we cover classes. Some of the integration tests, though, you start to see … encompass parts of the database. This thing here is an ETL application, and we have Spock-based integration tests that set up various scenarios within just the ETL portion of the application and test the behavior of taking data from one database, transforming it, and putting it into another. And it’s all pretty well constrained at the integration test level. We set up the test cases and have the data generated and then tested.
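
As a rough illustration of a constrained ETL integration test of this kind, here is a minimal sketch in Python using in-memory SQLite databases. The tests described in the talk were written with Spock; the table names, columns, and the `run_etl` and `make_source` helpers below are hypothetical and are not taken from the Rally code base.

```python
import sqlite3


def run_etl(source, target):
    """Hypothetical ETL step: copy every artifact revision into a snapshot table."""
    target.execute(
        "CREATE TABLE IF NOT EXISTS snapshots "
        "(artifact_id INTEGER, state TEXT, valid_from TEXT)"
    )
    rows = source.execute(
        "SELECT id, state, updated_at FROM artifacts ORDER BY updated_at"
    ).fetchall()
    # Transform: normalize the state name before loading it.
    target.executemany(
        "INSERT INTO snapshots VALUES (?, ?, ?)",
        [(aid, state.strip().lower(), ts) for aid, state, ts in rows],
    )
    target.commit()
    return len(rows)


def make_source(revisions):
    """Build an in-memory source database holding one test scenario."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE artifacts (id INTEGER, state TEXT, updated_at TEXT)")
    db.executemany("INSERT INTO artifacts VALUES (?, ?, ?)", revisions)
    db.commit()
    return db


def test_etl_keeps_every_revision():
    # Scenario: one story that changes state once.
    source = make_source([
        (1, " Defined ", "2016-01-01"),
        (1, "In-Progress", "2016-01-02"),
    ])
    target = sqlite3.connect(":memory:")

    loaded = run_etl(source, target)

    snapshots = target.execute(
        "SELECT artifact_id, state, valid_from FROM snapshots ORDER BY valid_from"
    ).fetchall()
    assert loaded == 2
    assert snapshots == [
        (1, "defined", "2016-01-01"),
        (1, "in-progress", "2016-01-02"),
    ]


test_etl_keeps_every_revision()
```

Each scenario generates its own small data set, so the test stays independent of any shared database state.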

3:03
And the same holds true of the Lookback API and the aggregation piece of this, where data is coming out of another database and then, again, going through various forms of transformation, whether it’s taking fine-grained time series data, aggregating that, and sending it up in summary form, or just taking time series data and sending it straight out the door.
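
To make the aggregation case concrete, the following is a small Python sketch of rolling fine-grained snapshots up into a per-day summary and checking the result. The snapshot tuple layout and the `count_by_state_per_day` function are assumptions made for illustration; the talk does not describe the real aggregation logic.

```python
from collections import Counter


def count_by_state_per_day(snapshots):
    """Summarize (day, artifact_id, state) snapshots as {day: {state: count}}."""
    summary = {}
    for day, _artifact_id, state in snapshots:
        summary.setdefault(day, Counter())[state] += 1
    return {day: dict(counts) for day, counts in summary.items()}


# Fine-grained input: one row per artifact per day.
snapshots = [
    ("2016-01-01", 1, "defined"),
    ("2016-01-01", 2, "defined"),
    ("2016-01-02", 1, "in-progress"),
    ("2016-01-02", 2, "defined"),
]

summary = count_by_state_per_day(snapshots)
assert summary["2016-01-01"] == {"defined": 2}
assert summary["2016-01-02"] == {"in-progress": 1, "defined": 1}
```

Because the expected summary depends entirely on the shape of the input rows, a test like this needs a deliberately chosen data set for each case it covers.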

3:25
Now at the outer layer, we’ve put together something that uses JBehave and basically a BDD-style scenario approach. Here we use a combination of simulated data as well as snapshot data from a production environment to go through the entire process of interacting with a web services API to create user stories and defects and put them into the system, and then watch the transformation take place. And then on the far end of that, we query the data back out and verify that the whole process has worked.
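
The shape of such an end-to-end scenario can be sketched in Python with Given/When/Then comments. This is not JBehave and not the real system: `InMemoryStack` is a hypothetical stand-in that replaces the web services API, the ETL process, and the query side with in-memory lists, so only the structure of the scenario carries over.

```python
class InMemoryStack:
    """Hypothetical stand-in for the product API, ETL job, and query API."""

    def __init__(self):
        self.product_db = []   # revisions written through the web services API
        self.lookback_db = []  # snapshots produced by the ETL step

    def create_story(self, story_id, state):
        self.product_db.append({"id": story_id, "state": state})

    def update_story(self, story_id, state):
        # Every update is stored as a new revision rather than overwriting.
        self.product_db.append({"id": story_id, "state": state})

    def run_etl(self):
        # Copy only the revisions that have not been loaded yet, in order.
        new_revisions = self.product_db[len(self.lookback_db):]
        self.lookback_db.extend(dict(rev) for rev in new_revisions)

    def query_history(self, story_id):
        return [s["state"] for s in self.lookback_db if s["id"] == story_id]


def scenario_story_history_survives_the_pipeline():
    # Given a story created and then updated through the API
    stack = InMemoryStack()
    stack.create_story(42, "Defined")
    stack.update_story(42, "Accepted")

    # When the ETL process runs
    stack.run_etl()

    # Then the query side returns every version, in order
    assert stack.query_history(42) == ["Defined", "Accepted"]


scenario_story_history_survives_the_pipeline()
```

In a real stack each step would cross an application boundary, which is where the cost and slowness of this outer layer come from.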

4:10
Where does this work, and where does it not? Obviously, it gives pretty good coverage over the transformations, and pretty good coverage over the data model and the data structures involved. The snapshotting is kind of hard. It does tend to be brittle, a little bit like UI testing with Selenium. We occasionally see problems when a little piece of data changes that was the basis for some assumptions in the test. It’s also a little bit slow, as you might expect, going through full data transforms and interacting with multiple applications.

4:48
So that’s pretty much it. I’m open to questions. I also have a bunch of these. These are lean coffee kits; in here is a cup with Sharpies and stickies. If you’re not familiar with lean coffee, find me afterwards and I’ll be happy to explain it. We do a lot of lean coffees in our organization. Thank you very much.

John Reuning

Speaker(s) may be willing to present this session at local group meetings and other events.
