"Why is this different than a week ago?" - Customer
Worse in regulated industries (e.g. pharma)
Very time consuming
The Problem
Overview (1 of 2)
Functional programming encourages immutable data
Regulators require audit trails
End users are really bad at describing problems
Overview (2 of 2)
Database Design Considerations
Coding Style
Example Functionality
Lessons Learned
Design Considerations (1 of 9)
Design Considerations (2 of 9)
createtable movies (
id intprimarykey,
title text);createtable licenses (
id intprimarykey,
movie_id intreferences movies (id),
title text,
start_date timestamp,
end_date timestamp
);
Design Considerations (3 of 9)
createtable movies$a (
id int,
title text,
audit_date timestampwithtimezone);createtable licenses$a (
id int,
movie_id intreferences movies (id),
title text,
start_date timestamp,
end_date timestamp,
audit_date timestampwithtimezone
);
Design Considerations (4 of 9)
createorreplace function audit_movie_insert() returns triggerlanguage plpgsql AS $$
beginexecute'insert into movies$a (id, title, audit_date, audit_action)
values ($1.id, $1.title, $2, ''I'');'using new, now();
return null;
end;
$$;
createtrigger movies$t
afterinserton movies
foreachrowexecuteprocedure audit_movie_insert();
Design Considerations (5 of 9)
Design Considerations (6 of 9)
Design Considerations (7 of 9)
select txid_current();
select
split_part(application_name, ',', 1) user,
split_part(application_name, ',', 2) request
from pg_stat_activity
where procpid = pg_backend_pid()
into v_context;
Design Considerations (8 of 9)
Design Considerations (9 of 9)
Request context (user/guid)
Usage (current vs historical)
Storage (deltas vs full data)
Manner of use (readonly production access vs. planned)
select audit_user,
audit_request,
audit_txid,
audit_date,
audit_action,
id,
(casewhen audit_action = 'D'thennullelse title end) title,
lag(title) over w title$prev
from movies$a a
window w as (partition by id orderby audit_date asc)
Demo - Blame Tool (2 of 3)
with history as (...)
select history.*,
dense_rank() over w as r
from history
where ne(title, title$prev)
window w as (partition by id orderby audit_date desc)
Demo - Blame Tool (3 of 3)
with history as (...),
ranked as (...)
select * from ranked where r = 1
Undo (1 of 3)
Infrequent data changes (fix output of software)
Target change by user/request/time
Undo changed columns
Output SQL
Report on the results
Undo (2 of 3)
-- undo queryselect array_agg(title) title,
array_agg(audit_txid) audit_txid,
id
from (
select audit_txid, id, title
from movies$a audit_data
where audit_data.audit_txid <= 324101orderby audit_txid desc
) a
groupby id
update ...
Undo (3 of 3)
update movies
set
title =
(casewhen
ne(change.title[1], change.title[2])
then change.title[2]
else movies.title
end)
from change
where movies.id = change.id
and audit_txid[1] = 324101;
Principles for Data Corrections
Provide production data to developers
Scrub data for personal information
Especially remove email addresses
Produce a before/after report
Repeatable code
Time Travelling Debugger (1 of 8)
select
id, movie_title, license_start, license_end
from
movie_history_vw
where
effective <@ (now() - interval '1day')
Time Travelling Debugger (2 of 8)
Time Travelling Debugger (3 of 8)
select
m.id, m.title,
tstzrange(
coalesce(m.audit_date, '-infinity'),
coalesce(lead(m.audit_date) over w_m, 'infinity'),
'[)'
) movie_effective
from movies$a m
window w_m as (partition by m.id orderby m.audit_date asc)
Time Travelling Debugger (4 of 8)
select
l.id, l.title, movie_id,
tstzrange(
coalesce(l.audit_date, '-infinity'),
coalesce(lead(l.audit_date) over w_l, 'infinity'),
'[)'
) license_effective
from licenses$a l
window w_l as (partition by l.id orderby l.audit_date asc)
Time Travelling Debugger (5 of 8)
with movies_history as (...),
licenses_history as (...)
select m.id, m.title, l.id, l.title,
movie_effective,
coalesce(l.license_effective, '[-infinity,infinity]') license_effective
from movies_history m
leftjoin licenses_history l
on l.movie_id = m.id
Time Travelling Debugger (6 of 8)
with movies_history as (...),
licenses_history as (...),
joined_history as (...)
select
movie_id, movie_title, license_id, license_title,
movie_effective * license_effective effective
from joined_history
where movie_effective && license_effective
Time Travelling Debugger (7 of 8)
createview movie_history_vw aswith movies_history as (...),
licenses_history as (...),
joined_history as (...)
where movie_effective @> (now() - interval'1 day')
Time Travelling Debugger (8 of 8)
createorreplace function movie_history(timestampwithtimezone)
returns setof movie_history_vw as
$$
declare
r record;
query text;
begin
return query
select *
from movie_history_vw
where license_effective @> $1end
$$ language'plpgsql';select * from movie_history(now() - interval'1 day');