notifications-api

mirror of https://github.com/GSA/notifications-api.git synced 2025-12-09 14:42:24 -05:00

Author	SHA1	Message	Date
jimmoffet	181ae4c60f	deukify service and template	2022-08-29 19:10:56 -07:00
Jim Moffet	2bcae20441	initial sms provider cleanup	2022-06-25 13:05:10 -07:00
Jim Moffet	aa4ec532a4	implement SNS	2022-06-17 11:16:23 -07:00
Jim Moffet	79ba6cc1d1	disable cache persistence & env updates	2022-06-13 21:42:36 -07:00
Jim Moffet	59b72f4853	add devcontainer configs and docker network orchestration	2022-06-13 13:16:32 -07:00
Katie Smith	4961c7cefc	Rename migration file This renames the latest migration file to match the Revision ID in the file. When these names are different, our deployment pipeline tries to run migrations on the notify-api-db-migration app and runs the functional tests twice.	2022-05-27 14:58:01 +01:00
Ben Thorner	33645c7747	Use notification view for status / billing tasks This fixes a bug where (letter) notifications left in sending would temporarily get excluded from billing and status calculations once the service retention period had elapsed, and then get included once again when they finally get marked as delivered.* Status and billing tasks shouldn't need to have knowledge about which table their data is in and getting this wrong is the fundamental cause of the bug here. Adding a view across both tables abstracts this away while keeping the query complexity the same. Using a view also has the added benefit that we no longer need to care when the status / billing tasks run in comparison to the deletion task, since we will retrieve the same data irrespective (see below for a more detailed discussion on data integrity). Such a scenario is rare but has happened. A New View ========== I've included all the columns that are shared between the two tables, even though only a subset are actually needed. Having extra columns has no impact and may be useful in future. Although the view isn't actually a table, SQLAlchemy appears to wrap it without any issues, noting that the package doesn't have any direct support for "view models". Because we're never inserting data, we don't need most of the kwargs when defining columns. Note that the "default" kwarg doesn't affect data that's retrieved, only data that's written (if no value is set). Data Integrity ============== The (new) tests cover the main scenarios. We need to be careful with how the view interacts with the deletion / archiving task. There are two concerns here: - Duplicates. The deletion task inserts before it deletes [^1], so we could end up double counting. It turns out this isn't a problem because a Postgres UNION is an implicit "DISTINCT" [^2]. I've also verified this manually, just to be on the safe side. - No data. It's conceivable that the query will check the history table just before the insertion, then check the notifications table just after the deletion. It turns out this isn't a problem either because the whole query sees the same DB snapshot [^3][^4]. I can't think of a way to test this as it's a race condition, but I'm confident the Postgres docs are accurate. Performance =========== I copied the relevant (non-PII) columns from Production for data going back to 2022-04-01. I then ran several tests. Queries using the new view still make use of indices on a per-table basis, as the following query plan illustrates: QUERY PLAN ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ GroupAggregate (cost=1130820.02..1135353.89 rows=46502 width=97) (actual time=629.863..756.703 rows=72 loops=1) Group Key: notifications_all_time_view.template_id, notifications_all_time_view.sent_by, notifications_all_time_view.rate_multiplier, notifications_all_time_view.international -> Sort (cost=1130820.02..1131401.28 rows=232506 width=85) (actual time=629.756..708.914 rows=217563 loops=1) Sort Key: notifications_all_time_view.template_id, notifications_all_time_view.sent_by, notifications_all_time_view.rate_multiplier, notifications_all_time_view.international Sort Method: external merge Disk: 9320kB -> Subquery Scan on notifications_all_time_view (cost=1088506.43..1098969.20 rows=232506 width=85) (actual time=416.118..541.669 rows=217563 loops=1) -> Unique (cost=1088506.43..1096644.14 rows=232506 width=725) (actual time=416.115..513.065 rows=217563 loops=1) -> Sort (cost=1088506.43..1089087.70 rows=232506 width=725) (actual time=416.115..451.190 rows=217563 loops=1) Sort Key: notifications_no_pii.id, notifications_no_pii.job_id, notifications_no_pii.service_id, notifications_no_pii.template_id, notifications_no_pii.key_type, notifications_no_pii.billable_units, notifications_no_pii.notification_type, notifications_no_pii.created_at, notifications_no_pii.sent_by, notifications_no_pii.notification_status, notifications_no_pii.international, notifications_no_pii.rate_multiplier, notifications_no_pii.postage Sort Method: external merge Disk: 23936kB -> Append (cost=114.42..918374.12 rows=232506 width=725) (actual time=2.051..298.229 rows=217563 loops=1) -> Bitmap Heap Scan on notifications_no_pii (cost=114.42..8557.55 rows=2042 width=113) (actual time=1.405..1.442 rows=0 loops=1) Recheck Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND (notification_type = 'sms'::notification_type) AND (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[])) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone)) Filter: ((key_type)::text = ANY ('{normal,team}'::text[])) -> Bitmap Index Scan on ix_notifications_no_piiservice_id_composite (cost=0.00..113.91 rows=2202 width=0) (actual time=1.402..1.439 rows=0 loops=1) Index Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND (notification_type = 'sms'::notification_type) AND (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[])) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone)) -> Index Scan using ix_notifications_history_no_pii_service_id_composite on notifications_history_no_pii (cost=0.70..906328.97 rows=230464 width=113) (actual time=0.645..281.612 rows=217563 loops=1) Index Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND ((key_type)::text = ANY ('{normal,team}'::text[])) AND (notification_type = 'sms'::notification_type) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone)) Filter: (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[])) Planning Time: 18.032 ms Execution Time: 759.001 ms (21 rows) Queries using the new view appear to be slower than without, but the differences I've seen are minimal: the original queries execute in seconds locally and in Production, so it's not a big issue. Notes: Performance ================== I downloaded a minimal set of columns for testing: \copy ( select id, notification_type, key_type, created_at, service_id, template_id, sent_by, rate_multiplier, international, billable_units, postage, job_id, notification_status from notifications ) to 'notifications.csv' delimiter ',' csv header; CREATE TABLE notifications_no_pii ( id uuid NOT NULL, notification_type public.notification_type NOT NULL, key_type character varying(255) NOT NULL, created_at timestamp without time zone NOT NULL, service_id uuid, template_id uuid, sent_by character varying, rate_multiplier numeric, international boolean, billable_units integer NOT NULL, postage character varying, job_id uuid, notification_status text ); copy notifications_no_pii from '/Users/ben.thorner/Desktop/notifications.csv' delimiter ',' csv header; CREATE INDEX ix_notifications_no_piicreated_at ON notifications_no_pii USING btree (created_at); CREATE INDEX ix_notifications_no_piijob_id ON notifications_no_pii USING btree (job_id); CREATE INDEX ix_notifications_no_piinotification_type_composite ON notifications_no_pii USING btree (notification_type, notification_status, created_at); CREATE INDEX ix_notifications_no_piiservice_created_at ON notifications_no_pii USING btree (service_id, created_at); CREATE INDEX ix_notifications_no_piiservice_id_composite ON notifications_no_pii USING btree (service_id, notification_type, notification_status, created_at); CREATE INDEX ix_notifications_no_piitemplate_id ON notifications_no_pii USING btree (template_id); And similarly for the history table. I then created a sepatate view across both of these temporary tables using just these columns. To test performance I created some queries that reflect what is run by the billing [^5] and status [^6] tasks e.g. explain analyze select template_id, sent_by, rate_multiplier, international, sum(billable_units), count() from notifications_all_time_view where notification_status in ('sending', 'sent', 'delivered', 'pending', 'temporary-failure', 'permanent-failure') and key_type in ('normal', 'team') and created_at >= '2022-05-01 23:00' and created_at < '2022-05-02 23:00' and notification_type = 'sms' and service_id = 'c5956607-20b1-48b4-8983-85d11404e61f' group by 1,2,3,4; explain analyze select template_id, job_id, key_type, notification_status, count(*) from notifications_all_time_view where created_at >= '2022-05-01 23:00' and created_at < '2022-05-02 23:00' and notification_type = 'sms' and service_id = 'c5956607-20b1-48b4-8983-85d11404e61f' and key_type in ('normal', 'team') group by 1,2,3,4; Between running queries I restarted my local database and also ran a command to purge disk caches [^7]. I tested on a few services: - c5956607-20b1-48b4-8983-85d11404e61f on 2022-05-02 (high volume) - 0cc696c6-b792-409d-99e9-64232f461b0f on 2022-04-06 (highest volume) - 01135db6-7819-4121-8b97-4aa2d741e372 on 2022-04-14 (very low volume) All execution results are of the same magnitude using the view compared to the worst case of either table on its own. [^1]: `00a04ebf54/app/dao/notifications_dao.py (L389)` [^2]: https://stackoverflow.com/questions/49925/what-is-the-difference-between-union-and-union-all [^3]: https://www.postgresql.org/docs/current/transaction-iso.html [^4]: https://dba.stackexchange.com/questions/210485/can-sub-selects-change-in-one-single-query-in-a-read-committed-transaction [^5]: `00a04ebf54/app/dao/fact_billing_dao.py (L471)` [^6]: `00a04ebf54/app/dao/fact_notification_status_dao.py (L58)` [^7]: https://stackoverflow.com/questions/28845524/echo-3-proc-sys-vm-drop-caches-on-mac-osx	2022-05-19 15:14:32 +01:00
Leo Hemsted	51646af92e	remove provider_rates table this was added five years ago but never used. if we want to bring back variable rates per client we might as well get a fresh start since a lot has changed since then.	2022-05-03 14:42:59 +01:00
Leo Hemsted	cc285fa41e	update migration script numbers	2022-05-03 10:54:25 +01:00
Leo Hemsted	503c5e1eea	Merge pull request #3527 from alphagov/fix-apr-rate add revision to fix apr rate	2022-05-03 10:48:19 +01:00
Ben Thorner	c27107fa74	Remove support for Reach provider This provider was never active and support was never completed, so there's little value in keeping all this potentially confusing code.	2022-04-29 12:28:08 +01:00
Leo Hemsted	6d47d97bc0	add revision to fix apr rate this will update about 80-90k rows in ft billing	2022-04-28 15:54:22 +01:00
Katie Smith	7721cd26ca	Add new SMS rates for 1 May 2022 onwards This change can be merged before the new rates go live, because they won't be used until the start date.	2022-04-26 10:56:25 +01:00
Pea Tyczynska	7da0533276	Update migrations/versions/0368_move_orgs_to_nhs_branding_.py Co-authored-by: Ben Thorner <benthorner@users.noreply.github.com>	2022-04-19 11:53:54 +01:00
Pea Tyczynska	3777358287	Move existing nhs orgs without branding onto nhs branding This is done to make self-service branding easier to implement, and also because NHS branding makes much more sense for services in those orgs than GOV.UK branding.	2022-04-12 18:28:55 +01:00
Ben Thorner	015152bab2	Add boilerplate for sending SMS via Reach This works in conjunction with the new SMS provider stub [^1]. Local testing: - Run the migrations to add Reach as an inactive provider. - Activate the Reach provider locally and deactivate the others. update provider_details set priority = 100, active = false where notification_type = 'sms'; update provider_details set active = true where identifier = 'reach'; - Tweak your local environment to point at the SMS stub. export REACH_URL="http://host.docker.internal:6300/reach" - Start / restart Celery to pick up the config change. - Send a SMS via the Admin app and see the stub log it. - Reset your environment so you can send normal SMS. update provider_details set active = true where notification_type = 'sms'; update provider_details set active = false where identifier = 'reach'; [^1]: https://github.com/alphagov/notifications-sms-provider-stub/pull/10	2022-03-30 13:38:46 +01:00
Rebecca Law	00ec3ae8f6	Add new letter rates for March 1, 2022. - second class postage will go up by 2 pence, plus VAT - international postage will go up by 7 pence, plus VAT - first class postage will go down by 6 pence, plus VAT	2022-03-01 14:35:39 +00:00
Katie Smith	008cea6222	Migration to make NHS email branding ID consistent With the changes we will be making to email branding in admin it's useful to know the ID of the NHS email brand. This makes it easier to show a preview of the branding. This means that the ID will need to be consistent across environments, so this changes the details of the NHS brand in the dev, preview and staging environments to match the production data. The migration - Removes NHS branding from existing services - Removes NHS branding from any orgs which had it as their default - Deletes the NHS branding row if it exists - Inserts a new NHS branding row with details which match those in production This does have the effect of removing NHS branding from people's local and preview services and orgs (NHS branding doesn't exist on staging), but this does not affect real teams. The NHS logo with a filename of '1ac6f483-3105-4c9e-9017-dd7fb2752c44-nhs-blue_x2.png' exists in the logo S3 bucket for all environments.	2022-02-18 12:00:01 +00:00
Pea Tyczynska	e5b1a65f1e	Merge pull request #3448 from alphagov/drop_api_key_id_column_from_broadcast_message_table Drop api_key_id column from broadcast_message table	2022-02-11 14:01:25 +00:00
Pea Tyczynska	c55d917b28	Drop api_key_id column from broadcast_message table This column has been superseded by a new column named created_by_api_key_id. Also create constraint checking that we know who created broadcast Also move data so that constraint is met before instatiating it.	2022-02-11 12:12:00 +00:00
Pea Tyczynska	00b63ca5c7	Migration adding cancelled_by_api_key_id to broadcast_message table. Also add created_by_api_key_id column that will supersede api_key_id_column. We do migration and code in separate PRs to make it easier to downgrade if anything goes wrong.	2022-02-09 17:26:37 +00:00
Pea Tyczynska	a780933893	Revert "Audit api key id when cancelling broadcast via api"	2022-02-09 11:01:39 +00:00
Pea Tyczynska	f2bef7392e	api_key_id column will have to be dropped in a separate PR This is to avoid situation where we deploy migration already but don't deploy code yet and they are not compatible for a while which leads to errors.	2022-02-08 15:57:35 +00:00
Pea Tyczynska	0737eceddb	Include check constraint in migration script Add check constraint that created_by_id should not be null, unless created_by_api_key_id is not null to the migration script. It is already in the models file. Also remove check constraint for cancelled_by_id from models, as this field would only be filled for broadcasts with cancelled status. Also add some spacing in that migration script so it is easier to read.	2022-02-02 17:21:58 +00:00
Pea Tyczynska	e1a8219eb1	DB Migration to allow auditing api key id when cancelling broadcast via API.	2022-01-26 17:26:05 +00:00
Chris Hill-Scott	54bcf618da	Store the `event` field from CAP XML broadcasts We don’t store everything that comes in the CAP XML when someone creates a broadcast via the API. One thing we do store is `<identifier>` (in a column called `reference`) which is a unique (to the external system) identifier for the broadcast. We show this in the front end instead of the template name, because broadcasts created from the API don’t use templates. However this ID isn’t very friendly – the Environment Agency just supply a UUID. The Environment Agency also populate the `<event>` field with some human readable text, for example: > 013 Issue Severe Flood Warning EA (013 is an area code which will be meaningful to the Flood Warning Service team) We should show this in the UI instead of the reference. The first step towards this is storing it in the database and returning it in the REST endpoints. Later we can have the admin app prefer `cap_event` over `reference`, where `cap_event` is present. We can’t backfill this data because we don’t keep a copy of the original XML. Seems like `<event>` is a mandatory property of `<info>`, so we don’t need to worry about the field being missing (`<info>` is optional in CAP but we require it because it contains stuff like the areas which we need in order to send out the broadcast`). *** https://www.pivotaltracker.com/story/show/176927060	2021-10-26 11:12:27 +01:00
Katie Smith	d708b64dd4	Remove original permissions from broadcast users The broadcast user permissions are changing, so to avoid confusion and permissions which exist in the database but don't display on the frontend we are going to remove all existing permissions for users of broadcast services. This also updates the permissions of invited users who are still pending. The exception to this is the `view_activity` permission, which we always add for broadcast users even if they have no other permissions. (`aad017a184/app/main/forms.py (L1043)`)	2021-07-19 14:38:48 +01:00
Rebecca Law	5fcf65422c	Fix version conflicts.	2021-07-08 08:14:03 +01:00
Rebecca Law	bcd8d85569	Fix merge conflicts	2021-07-08 08:12:18 +01:00
Rebecca Law	ae29ae28e5	Adding CONCURRENTLY to the drop index statement. Drop index concurrently will drop the index without locking out concurrent selects, inserts, updates, and deletes on the index's table namely on notifications.	2021-07-08 08:12:18 +01:00
Rebecca Law	94c4a8f238	Remove scheduled_notifications All code has been removed for ScheduledNotifications. This PR just removes the table, it has never been used.	2021-07-08 08:12:18 +01:00
Katie Smith	e5fdd8ee1f	Add new broadcast related permissions We want to have new permissions which will be used specifically for broadcasts: - `create_broadcasts` - `approve_broadcasts` - `reject_broadcasts` - `cancel_broadcasts` Cancel and reject will always go together, but having separate database permissions makes things easier to change in the future. The permission column of the permissions table is an enum. We can add values in the alembic upgrade script, but removing individual values from an enum is not supported by Postgres. To remove values, we have to recreate the enum with the old values.	2021-07-07 14:54:13 +01:00
David McDonald	d18bf2c48a	Add operator channel migration Looks identical to the government channel migration 354	2021-06-09 13:48:02 +01:00
Pea Tyczynska	1f6e225a1b	Validate ck_user_has_mobile_or_other_auth constraint This is second step out of two step migration. We divided it like this to avoid potentially locking production database for extended amounts of time.	2021-05-13 17:34:35 +01:00
Pea Tyczynska	d6c3b5e0c9	Do not validate constraint when creating it To avoid locking production database for extended amounts of time.	2021-05-13 14:13:54 +01:00
Pea Tyczynska	098c6f031b	Add webauthn as an auth type. Both in our models and as a migration to add it to auth_types table. Make sure that if we downgrade, we first clean up the data.	2021-05-13 12:44:36 +01:00
Leo Hemsted	500feba50d	add name/id and consolidate webauthn types in model/table so we can be in line with what the admin handles, and keep it simple on the api side and do as little manipulation of binary data as possible. ### Minor changes * id is a UUID we can use for referencing within notify. No relation to the key itself. * name is a user viewable name that can be set/edited * fix updated_at to have onupdate, not default ### Simplify the webauthn data credential_data is the data we store about an authenticator that we'll use to identify the key when logging in. includes the credential_id, the public_key, and the aaguid (which identifies the authenticator make/model) registration_response is the data containing audit information - in the future we can use this to ensure that the authenticators used are of high quality. both of these fields are CBOR (a kind of binary json), encoded in base64 so that they can be embedded within our regular JSON api endpoints. we don't anticipate the api ever needing to interact with this data directly.	2021-05-12 17:48:37 +01:00
Pea Tyczynska	3798a3bd1d	Add webauthn_credential table This is to store data for registered webauthn credentials, so platform admins can later use them to log in.	2021-05-12 17:48:36 +01:00
Katie Smith	7eed63eb80	Add 'government' to broadcast_channel_types table	2021-05-11 16:32:03 +01:00
Katie Smith	4624328c36	Make service_broadcast_settings.provider non-nullable We set all existing null values to "all", then make the column non-nullable. Admin is already passing through the value of "all".	2021-05-10 15:59:22 +01:00
Katie Smith	aec631f208	Add a type table for broadcast providers This adds a type table for broadcast providers, which is the pattern we follow with our models (e.g. we have a `broadcast_channel_types` table). As well as the four providers, the migration populates it with `all` which is the value that will replace `null` in a later change. It should be safe to add the foreign key constraint to the `service_broadcast_settings` in the same migration since the column is still nullable and we don't have data in that column that is not in the types table.	2021-05-06 15:30:04 +01:00
Ben Thorner	618ce14842	Rewrite migration README This clarifies how we generate migration filenames, and points to the official docs and help commands, instead of repeating them.	2021-05-05 14:27:05 +01:00
Rebecca Law	490efad97a	To avoid any table locks and contention on the production database we are going to add the unique index concurrently. This only works if we execute a commit first.	2021-04-23 15:16:18 +01:00
Rebecca Law	a637c8eb92	Add unique key on annual_billing for service_id + financial_year_start. This is an extra precaution for the table to ensure data integrity. Since we only update/insert the data using the annual_billing_dao methods the integrity is in tact. I've check the data on preview, staging and prod there are no violations of this unique key.	2021-04-20 13:42:20 +01:00
Rebecca Law	a1ae220c9f	Add new SMS rate for April 1 2021. Downgrade script isn't necessary.	2021-04-01 08:20:29 +01:00
Rebecca Law	e105257925	Fix db migration merge conflicts	2021-02-26 08:38:36 +00:00
Rebecca Law	1797a993e7	Fix db migration conflict	2021-02-26 07:49:49 +00:00
Rebecca Law	0849070eca	Add created_at and updated_at columns to ft_processing_time	2021-02-26 07:49:49 +00:00
Rebecca Law	21edf7bfdd	Persist the processing time statistics to the database. The performance platform is going away soon. The only stat that we do not have in our database is the processing time. Let me clarify the only statistic we don't have in our database that we can query efficiently is the processing time. Any queries on notification_history are too inefficient to use on a web page. Processing time = the total number of normal/team emails and text messages plus the number of messages that have gone from created to sending within 10 seconds per whole day. We can then easily calculate the percentage of messages that were marked as sending under 10 seconds.	2021-02-26 07:49:49 +00:00
David McDonald	ce2f550387	Backfill services_broadcast_settings table At the moment, we currently have broadcast services that have the 'broadcast' service permission in the DB but don't have a corresponding row in the service_broadcast_settings table. It's important to give them a row in this table because this will control which broadcast channel their messages will go out on. All broadcast services will be expected to have this row so we can then remove hardcoded defaults from our code that currently set what channel a message should go out on. Whilst, when this gets deployed, we will have released https://github.com/alphagov/notifications-admin/pull/3794, which means that when a service setting is changed via that form, they will get the corresponding row in the service_broadcast_settings form, we don't want to ask every developer to go and fill in this form for every service on their local dev environment and also do the same to every service we have in preview, staging and production. Therefore a migration is the best way to backfill the data. Note, I made the decision that a service that is in trial mode will be given the 'severe' channel. This is because this is what most trial mode services should have. Only the MNOs would ever really have a trial mode service on the test channel. And given they are in trial mode, it is not too risky to give them the 'severe' channel. For services that are live though, although there are no services in production that have the broadcast permission and are live, it is not worth taking that risk and accidently setting that service to send an alert out on the 'severe' channel. We play it say by choosing 'test'.	2021-02-23 17:15:05 +00:00

1 2 3 4 5 ...

849 Commits