This deletes a big ol' chunk of code related to letters. It's not everything—there are still a few things that might be tied to SMS/email—but it's the heart of the letters functionality. SMS and email functionality should be untouched by this.
Areas affected:
- Things obviously about letters
- PDF tasks, used for precompiling letters
- Virus scanning, used for those PDFs
- FTP, used to send letters to the printer
- Postage stuff
This fixes a bug where (letter) notifications left in sending would
temporarily get excluded from billing and status calculations once
the service retention period had elapsed, and then get included once
again when they finally get marked as delivered.*
Status and billing tasks shouldn't need to know which table their
data is in, and getting this wrong is the root cause of the bug here.
Adding a view across both tables abstracts this away while keeping
the query complexity the same.
Using a view also has the added benefit that we no longer need to care
when the status / billing tasks run relative to the deletion task,
since we will retrieve the same data regardless (see below for a more
detailed discussion of data integrity).
*Such a scenario is rare but has happened.
A New View
==========
I've included all the columns that are shared between the two tables,
even though only a subset are actually needed. Having extra columns
has no impact and may be useful in future.
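
For illustration, the migration might look something like this (a
sketch: the column list is abbreviated to the ones exercised in the
tests below, and I'm assuming the history table is called
notification_history):

from alembic import op

def upgrade():
    op.execute("""
        CREATE VIEW notifications_all_time_view AS
        SELECT
            id, job_id, service_id, template_id, key_type, billable_units,
            notification_type, created_at, sent_by, notification_status,
            international, rate_multiplier, postage
        FROM notifications
        UNION
        SELECT
            id, job_id, service_id, template_id, key_type, billable_units,
            notification_type, created_at, sent_by, notification_status,
            international, rate_multiplier, postage
        FROM notification_history
    """)

def downgrade():
    op.execute("DROP VIEW notifications_all_time_view")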
Although the view isn't actually a table, SQLAlchemy appears to wrap
it without any issues, noting that the package doesn't have any direct
support for "view models". Because we're never inserting data, we don't
need most of the kwargs when defining columns.*
*Note that the "default" kwarg doesn't affect data that's retrieved,
only data that's written (if no value is set).
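
For reference, wrapping the view looks much like any other model. A
minimal sketch (column list abbreviated):

from sqlalchemy import Column, DateTime, Integer, Text
from sqlalchemy.dialects.postgresql import UUID
from sqlalchemy.orm import declarative_base

Base = declarative_base()

class NotificationAllTimeView(Base):
    # Mapped like an ordinary table; SQLAlchemy doesn't need to know
    # it's a view as long as we only ever read from it.
    __tablename__ = "notifications_all_time_view"

    id = Column(UUID(as_uuid=True), primary_key=True)
    service_id = Column(UUID(as_uuid=True))
    template_id = Column(UUID(as_uuid=True))
    notification_type = Column(Text)
    key_type = Column(Text)
    notification_status = Column(Text)
    created_at = Column(DateTime)
    billable_units = Column(Integer)
    # No "default", "nullable" etc. kwargs needed, since we never insert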
Data Integrity
==============
The (new) tests cover the main scenarios.
We need to be careful with how the view interacts with the deletion /
archiving task. There are two concerns here:
- Duplicates. The deletion task inserts before it deletes [^1], so we
could end up double counting. It turns out this isn't a problem because
a Postgres UNION applies an implicit DISTINCT [^2]. I've also verified
this manually, just to be on the safe side (see the sketch below).
- No data. It's conceivable that the query will check the history table
just before the insertion, then check the notifications table just after
the deletion. It turns out this isn't a problem either because the whole
query sees the same DB snapshot [^3][^4].*
*I can't think of a way to test this as it's a race condition, but I'm
confident the Postgres docs are accurate.
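
Something like the following is enough to convince yourself of the
UNION behaviour for the duplicates case (sketched with psycopg2; the
connection string is a placeholder):

import psycopg2

conn = psycopg2.connect("dbname=notification_api")  # placeholder
with conn.cursor() as cur:
    # UNION applies an implicit DISTINCT across the combined rows...
    cur.execute("SELECT 1 UNION SELECT 1")
    assert cur.fetchall() == [(1,)]
    # ...whereas UNION ALL would keep the duplicate row
    cur.execute("SELECT 1 UNION ALL SELECT 1")
    assert cur.fetchall() == [(1,), (1,)]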
Performance
===========
I copied the relevant (non-PII) columns from Production for data going
back to 2022-04-01. I then ran several tests.
Queries using the new view still make use of indices on a per-table basis,
as the following query plan illustrates:
QUERY PLAN
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
GroupAggregate (cost=1130820.02..1135353.89 rows=46502 width=97) (actual time=629.863..756.703 rows=72 loops=1)
Group Key: notifications_all_time_view.template_id, notifications_all_time_view.sent_by, notifications_all_time_view.rate_multiplier, notifications_all_time_view.international
-> Sort (cost=1130820.02..1131401.28 rows=232506 width=85) (actual time=629.756..708.914 rows=217563 loops=1)
Sort Key: notifications_all_time_view.template_id, notifications_all_time_view.sent_by, notifications_all_time_view.rate_multiplier, notifications_all_time_view.international
Sort Method: external merge Disk: 9320kB
-> Subquery Scan on notifications_all_time_view (cost=1088506.43..1098969.20 rows=232506 width=85) (actual time=416.118..541.669 rows=217563 loops=1)
-> Unique (cost=1088506.43..1096644.14 rows=232506 width=725) (actual time=416.115..513.065 rows=217563 loops=1)
-> Sort (cost=1088506.43..1089087.70 rows=232506 width=725) (actual time=416.115..451.190 rows=217563 loops=1)
Sort Key: notifications_no_pii.id, notifications_no_pii.job_id, notifications_no_pii.service_id, notifications_no_pii.template_id, notifications_no_pii.key_type, notifications_no_pii.billable_units, notifications_no_pii.notification_type, notifications_no_pii.created_at, notifications_no_pii.sent_by, notifications_no_pii.notification_status, notifications_no_pii.international, notifications_no_pii.rate_multiplier, notifications_no_pii.postage
Sort Method: external merge Disk: 23936kB
-> Append (cost=114.42..918374.12 rows=232506 width=725) (actual time=2.051..298.229 rows=217563 loops=1)
-> Bitmap Heap Scan on notifications_no_pii (cost=114.42..8557.55 rows=2042 width=113) (actual time=1.405..1.442 rows=0 loops=1)
Recheck Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND (notification_type = 'sms'::notification_type) AND (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[])) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone))
Filter: ((key_type)::text = ANY ('{normal,team}'::text[]))
-> Bitmap Index Scan on ix_notifications_no_piiservice_id_composite (cost=0.00..113.91 rows=2202 width=0) (actual time=1.402..1.439 rows=0 loops=1)
Index Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND (notification_type = 'sms'::notification_type) AND (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[])) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone))
-> Index Scan using ix_notifications_history_no_pii_service_id_composite on notifications_history_no_pii (cost=0.70..906328.97 rows=230464 width=113) (actual time=0.645..281.612 rows=217563 loops=1)
Index Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND ((key_type)::text = ANY ('{normal,team}'::text[])) AND (notification_type = 'sms'::notification_type) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone))
Filter: (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[]))
Planning Time: 18.032 ms
Execution Time: 759.001 ms
(21 rows)
Queries using the new view appear to be slower than querying the
tables directly, but the differences I've seen are minimal: the
original queries already take seconds locally and in Production, so
it's not a big issue.
Notes: Performance
==================
I downloaded a minimal set of columns for testing:
\copy (
select
id, notification_type, key_type, created_at, service_id,
template_id, sent_by, rate_multiplier, international,
billable_units, postage, job_id, notification_status
from notifications
) to 'notifications.csv' delimiter ',' csv header;
CREATE TABLE notifications_no_pii (
id uuid NOT NULL,
notification_type public.notification_type NOT NULL,
key_type character varying(255) NOT NULL,
created_at timestamp without time zone NOT NULL,
service_id uuid,
template_id uuid,
sent_by character varying,
rate_multiplier numeric,
international boolean,
billable_units integer NOT NULL,
postage character varying,
job_id uuid,
notification_status text
);
copy notifications_no_pii from '/Users/ben.thorner/Desktop/notifications.csv' delimiter ',' csv header;
CREATE INDEX ix_notifications_no_piicreated_at ON notifications_no_pii USING btree (created_at);
CREATE INDEX ix_notifications_no_piijob_id ON notifications_no_pii USING btree (job_id);
CREATE INDEX ix_notifications_no_piinotification_type_composite ON notifications_no_pii USING btree (notification_type, notification_status, created_at);
CREATE INDEX ix_notifications_no_piiservice_created_at ON notifications_no_pii USING btree (service_id, created_at);
CREATE INDEX ix_notifications_no_piiservice_id_composite ON notifications_no_pii USING btree (service_id, notification_type, notification_status, created_at);
CREATE INDEX ix_notifications_no_piitemplate_id ON notifications_no_pii USING btree (template_id);
And similarly for the history table. I then created a separate view
across both of these temporary tables using just these columns.
To test performance I created some queries that reflect what is run
by the billing [^5] and status [^6] tasks e.g.
explain analyze select template_id, sent_by, rate_multiplier, international, sum(billable_units), count(*)
from notifications_all_time_view
where
notification_status in ('sending', 'sent', 'delivered', 'pending', 'temporary-failure', 'permanent-failure')
and key_type in ('normal', 'team')
and created_at >= '2022-05-01 23:00'
and created_at < '2022-05-02 23:00'
and notification_type = 'sms'
and service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'
group by 1,2,3,4;
explain analyze select template_id, job_id, key_type, notification_status, count(*)
from notifications_all_time_view
where created_at >= '2022-05-01 23:00'
and created_at < '2022-05-02 23:00'
and notification_type = 'sms'
and service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'
and key_type in ('normal', 'team')
group by 1,2,3,4;
Between running queries I restarted my local database and also ran
a command to purge disk caches [^7].
I tested on a few services:
- c5956607-20b1-48b4-8983-85d11404e61f on 2022-05-02 (high volume)
- 0cc696c6-b792-409d-99e9-64232f461b0f on 2022-04-06 (highest volume)
- 01135db6-7819-4121-8b97-4aa2d741e372 on 2022-04-14 (very low volume)
All execution times using the view are of the same order of magnitude
as the worst case of querying either table on its own.
[^1]: 00a04ebf54/app/dao/notifications_dao.py (L389)
[^2]: https://stackoverflow.com/questions/49925/what-is-the-difference-between-union-and-union-all
[^3]: https://www.postgresql.org/docs/current/transaction-iso.html
[^4]: https://dba.stackexchange.com/questions/210485/can-sub-selects-change-in-one-single-query-in-a-read-committed-transaction
[^5]: 00a04ebf54/app/dao/fact_billing_dao.py (L471)
[^6]: 00a04ebf54/app/dao/fact_notification_status_dao.py (L58)
[^7]: https://stackoverflow.com/questions/28845524/echo-3-proc-sys-vm-drop-caches-on-mac-osx
This was added five years ago but never used. If we want to bring back
variable rates per client, we might as well start fresh, since a lot
has changed since then.
Changes:
53.0.0
---
* `notifications_utils.columns.Columns` has moved to
`notifications_utils.insensitive_dict.InsensitiveDict`
* `notifications_utils.columns.Rows` has moved to
`notifications_utils.recipients.Rows`
* `notifications_utils.columns.Cell` has moved to
`notifications_utils.recipients.Cell`
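
In practice this means updating imports along these lines:

# Before (<= 52.x)
from notifications_utils.columns import Columns, Rows, Cell

# After (53.0.0) - Columns is now called InsensitiveDict
from notifications_utils.insensitive_dict import InsensitiveDict
from notifications_utils.recipients import Rows, Cell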
52.0.0
---
* Deprecate the following unused `redis_client` functions:
- `redis_client.increment_hash_value`
- `redis_client.decrement_hash_value`
- `redis_client.get_all_from_hash`
- `redis_client.set_hash_and_expire`
- `redis_client.expire`
51.3.1
---
* Bump govuk-bank-holidays to cache holidays for next year.
Add a check constraint to the migration script that created_by_id
must not be null unless created_by_api_key_id is not null. It is
already in the models file.
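
For reference, the constraint in the migration looks something like
this (a sketch; the table and constraint names here are illustrative):

from alembic import op

def upgrade():
    op.create_check_constraint(
        "ck_broadcast_message_created_by",  # illustrative name
        "broadcast_message",                # assuming this table name
        "created_by_id IS NOT NULL OR created_by_api_key_id IS NOT NULL",
    )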
Also remove the check constraint for cancelled_by_id from the models,
as this field is only filled for broadcasts with cancelled status.
Also add some spacing in that migration script so it is easier
to read.
We don’t store everything that comes in the CAP XML when someone creates
a broadcast via the API.
One thing we do store is `<identifier>` (in a column called `reference`)
which is a unique (to the external system) identifier for the broadcast.
We show this in the front end instead of the template name, because
broadcasts created from the API don’t use templates.
However this ID isn’t very friendly – the Environment Agency just supply
a UUID.
The Environment Agency also populate the `<event>` field with some human
readable text, for example:
> 013 Issue Severe Flood Warning EA
(013 is an area code which will be meaningful to the Flood Warning
Service team)
We should show this in the UI instead of the reference. The first step
towards this is storing it in the database and returning it in the REST
endpoints.
Later we can have the admin app prefer `cap_event` over `reference`,
where `cap_event` is present.
We can’t backfill this data because we don’t keep a copy of the original
XML.
It seems `<event>` is a mandatory property of `<info>`, so we don’t
need to worry about the field being missing (`<info>` is optional in
CAP, but we require it because it contains things like the areas,
which we need in order to send out the broadcast).
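
A rough sketch of pulling both fields out of the incoming XML
(assuming the standard CAP 1.2 namespace; names are illustrative):

import xml.etree.ElementTree as ET

CAP_NS = {"cap": "urn:oasis:names:tc:emergency:cap:1.2"}

def extract_reference_and_event(cap_xml):
    alert = ET.fromstring(cap_xml)
    # <identifier> is what we already store as "reference"
    reference = alert.findtext("cap:identifier", namespaces=CAP_NS)
    # <event> sits inside <info>, which we already require
    cap_event = alert.findtext("cap:info/cap:event", namespaces=CAP_NS)
    return reference, cap_event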
***
https://www.pivotaltracker.com/story/show/176927060
Make sure timestamps returned from the API are always consistent.
The only place in models where we're serialising a BST timestamp is
now the Notification.serialize_for_csv method, which at least is a bit
different as it's user-facing (it also returns a formatted,
human-readable notification_status, for example).
This is so we can distinguish custom broadcasts in the Admin app
[1]. I've also extended the POST test for custom broadcasts to
check we're correctly reading data for "names", as this wasn't
being tested previously.
[1]: 411fda81c0
This is necessary until:
- The Admin app is using the new "areas(_2)" format to store and
retrieve data.
- We've migrated all existing broadcast messages to use the new
format.
Note that "areas" / "ids" isn't actually used for anything except
printing out the PagerDuty message - it's not sent to the proxy [1].
[1]: 6edc6c70aa/app/celery/broadcast_message_tasks.py (L190-L193)
Currently we have:
- An "areas" column in the DB that stores a JSON blob.
- An "areas" field inside the "areas" JSON that stores area IDs.
- Each field has to be manually copied into the JSON column.
We want to move to:
- An "areas" column in the DB (unchanged).
- An "ids" field inside the "areas" JSON (to replace "areas").
- The Admin app sending other data inside an "areas" JSON field.
The API design for areas is confusing and difficult to extend.
Here we duplicate the current API functionality using an "areas_2"
field. Once the Admin app is using this field, we'll be able to
rename it to just "areas", which is where we want to get to.
In the next commits we'll build on this to support the migration
from "areas"."areas" to "areas"."ids".
We want to have new permissions which will be used specifically for
broadcasts:
- `create_broadcasts`
- `approve_broadcasts`
- `reject_broadcasts`
- `cancel_broadcasts`
Cancel and reject will always go together, but having separate database
permissions makes things easier to change in the future.
The permission column of the permissions table is an enum. We can add values
in the alembic upgrade script, but removing individual values from an
enum is not supported by Postgres. To remove values, we have to recreate
the enum with the old values.
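
Adding the values in the upgrade is straightforward. A sketch, assuming
the enum type is called permission_types:

from alembic import op

PERMISSIONS = (
    "create_broadcasts",
    "approve_broadcasts",
    "reject_broadcasts",
    "cancel_broadcasts",
)

def upgrade():
    # ALTER TYPE ... ADD VALUE can't run inside a transaction block on
    # older Postgres versions, hence the autocommit block
    with op.get_context().autocommit_block():
        for permission in PERMISSIONS:
            op.execute(f"ALTER TYPE permission_types ADD VALUE '{permission}'")
    # The downgrade would have to recreate the enum without these
    # values, as described above.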
It's not a big deal if a user is no longer eligible to register a
security key, so we may as well let them continue using it. This
avoids putting them in a limbo state if we don't immediately change
their auth type when they're no longer eligible to use the feature.
Currently we have some data-driven roles to say who can use this
feature. Adding a flag in the API means we can avoid API calls in
the Admin app to determine the same thing.
Allowing members of the GOV.UK Notify service to use the feature
is a workaround, so we can avoid making someone a Platform Admin
before they've protected their account with it.
It looks like we were allowing broadcasts to transition from draft to
broadcasting in one go. This isn't valid any more. They should go
draft, then pending-approval, then broadcasting.
It looks like this was a leftover bit of support in our code for when we
were building stuff out and is no longer needed.
It's possible for a letter to pass our validation but still be impossible for our print provider to print. In this case the letter will be marked as a permanent failure. This typically happens with precompiled letters.
- Update the Notification and NotificationHistory models to reflect the database.
- Update data types, and remove and add indexes.
Why?
After running the `flask db migrate` command there are many deltas, because we did some work to update the notification and notification_history tables but the SQLAlchemy models were not updated to reflect those changes. This PR cleans up all those deltas.
There are still some remaining differences, but we can look at those in another PR.
Also fix tests: first add an __init__ file so the tests are discovered
correctly, then update the tests after we stopped serialising the
webauthn registration_response.