notifications-api

mirror of https://github.com/GSA/notifications-api.git synced 2025-12-15 01:32:20 -05:00

Author	SHA1	Message	Date
Kenneth Kehl	1ecb747c6d	reformat	2023-08-29 14:54:30 -07:00
Carlo Costino	206d5fccb7	Simplify test config for now This is in prep for switching to running a single test worker for the time being. We will spend time figuring out how to get multiple concurrent workers going again separately as we seem to have several issues taking place right now with that type of test setup. Signed-off-by: Carlo Costino <carlo.costino@gsa.gov>	2023-08-16 16:49:35 -04:00
Carlo Costino	17e13c3b3b	Round 2 of attempting to fix CI test runs Signed-off-by: Carlo Costino <carlo.costino@gsa.gov>	2023-08-16 16:49:35 -04:00
Carlo Costino	5519d17e09	Fix database URI for concurrent test workers Signed-off-by: Carlo Costino <carlo.costino@gsa.gov>	2023-08-16 16:49:35 -04:00
Carlo Costino	d9abca9dbe	Fix comments and imports causing build errors Signed-off-by: Carlo Costino <carlo.costino@gsa.gov>	2023-08-16 16:49:35 -04:00
Carlo Costino	114aa45f19	Update Flask plugins and related dependencies This changeset updates a few of the Flask plugins we rely on to their latest versions, as well as the psycopg2-binary library. It required a few adjustments to how our tests are configured and run, which are probably due for a larger refactor given how much has changed with these libraries and the anticipated change of going from SQLAlchemy 1.4.x to 2.0.x. Signed-off-by: Carlo Costino <carlo.costino@gsa.gov>	2023-08-16 16:49:27 -04:00
Kenneth Kehl	4940d5e93b	notify-api-332 rename organisation	2023-07-10 11:06:29 -07:00
Ryan Ahearn	041cd08097	Clean up more mmg and firetext references	2022-12-22 09:31:12 -05:00
Ryan Ahearn	6aa46da916	Create fixture that just calls create_app	2022-12-13 08:53:41 -05:00
stvnrlly	637fbdb891	broadcast flake8 cleanup	2022-10-25 11:53:24 -04:00
stvnrlly	53204c307b	tests are, uh, mostly passing	2022-10-05 01:12:35 +00:00
stvnrlly	57f4df8ed1	remove broadcast-related code, except migrations	2022-10-04 15:28:27 +00:00
Ben Thorner	33645c7747	Use notification view for status / billing tasks This fixes a bug where (letter) notifications left in sending would temporarily get excluded from billing and status calculations once the service retention period had elapsed, and then get included once again when they finally get marked as delivered.* Status and billing tasks shouldn't need to have knowledge about which table their data is in and getting this wrong is the fundamental cause of the bug here. Adding a view across both tables abstracts this away while keeping the query complexity the same. Using a view also has the added benefit that we no longer need to care when the status / billing tasks run in comparison to the deletion task, since we will retrieve the same data irrespective (see below for a more detailed discussion on data integrity). Such a scenario is rare but has happened. A New View ========== I've included all the columns that are shared between the two tables, even though only a subset are actually needed. Having extra columns has no impact and may be useful in future. Although the view isn't actually a table, SQLAlchemy appears to wrap it without any issues, noting that the package doesn't have any direct support for "view models". Because we're never inserting data, we don't need most of the kwargs when defining columns. Note that the "default" kwarg doesn't affect data that's retrieved, only data that's written (if no value is set). Data Integrity ============== The (new) tests cover the main scenarios. We need to be careful with how the view interacts with the deletion / archiving task. There are two concerns here: - Duplicates. The deletion task inserts before it deletes [^1], so we could end up double counting. It turns out this isn't a problem because a Postgres UNION is an implicit "DISTINCT" [^2]. I've also verified this manually, just to be on the safe side. - No data. It's conceivable that the query will check the history table just before the insertion, then check the notifications table just after the deletion. It turns out this isn't a problem either because the whole query sees the same DB snapshot [^3][^4]. I can't think of a way to test this as it's a race condition, but I'm confident the Postgres docs are accurate. Performance =========== I copied the relevant (non-PII) columns from Production for data going back to 2022-04-01. I then ran several tests. Queries using the new view still make use of indices on a per-table basis, as the following query plan illustrates: QUERY PLAN ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ GroupAggregate (cost=1130820.02..1135353.89 rows=46502 width=97) (actual time=629.863..756.703 rows=72 loops=1) Group Key: notifications_all_time_view.template_id, notifications_all_time_view.sent_by, notifications_all_time_view.rate_multiplier, notifications_all_time_view.international -> Sort (cost=1130820.02..1131401.28 rows=232506 width=85) (actual time=629.756..708.914 rows=217563 loops=1) Sort Key: notifications_all_time_view.template_id, notifications_all_time_view.sent_by, notifications_all_time_view.rate_multiplier, notifications_all_time_view.international Sort Method: external merge Disk: 9320kB -> Subquery Scan on notifications_all_time_view (cost=1088506.43..1098969.20 rows=232506 width=85) (actual time=416.118..541.669 rows=217563 loops=1) -> Unique (cost=1088506.43..1096644.14 rows=232506 width=725) (actual time=416.115..513.065 rows=217563 loops=1) -> Sort (cost=1088506.43..1089087.70 rows=232506 width=725) (actual time=416.115..451.190 rows=217563 loops=1) Sort Key: notifications_no_pii.id, notifications_no_pii.job_id, notifications_no_pii.service_id, notifications_no_pii.template_id, notifications_no_pii.key_type, notifications_no_pii.billable_units, notifications_no_pii.notification_type, notifications_no_pii.created_at, notifications_no_pii.sent_by, notifications_no_pii.notification_status, notifications_no_pii.international, notifications_no_pii.rate_multiplier, notifications_no_pii.postage Sort Method: external merge Disk: 23936kB -> Append (cost=114.42..918374.12 rows=232506 width=725) (actual time=2.051..298.229 rows=217563 loops=1) -> Bitmap Heap Scan on notifications_no_pii (cost=114.42..8557.55 rows=2042 width=113) (actual time=1.405..1.442 rows=0 loops=1) Recheck Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND (notification_type = 'sms'::notification_type) AND (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[])) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone)) Filter: ((key_type)::text = ANY ('{normal,team}'::text[])) -> Bitmap Index Scan on ix_notifications_no_piiservice_id_composite (cost=0.00..113.91 rows=2202 width=0) (actual time=1.402..1.439 rows=0 loops=1) Index Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND (notification_type = 'sms'::notification_type) AND (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[])) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone)) -> Index Scan using ix_notifications_history_no_pii_service_id_composite on notifications_history_no_pii (cost=0.70..906328.97 rows=230464 width=113) (actual time=0.645..281.612 rows=217563 loops=1) Index Cond: ((service_id = 'c5956607-20b1-48b4-8983-85d11404e61f'::uuid) AND ((key_type)::text = ANY ('{normal,team}'::text[])) AND (notification_type = 'sms'::notification_type) AND (created_at >= '2022-05-01 23:00:00'::timestamp without time zone) AND (created_at < '2022-05-02 23:00:00'::timestamp without time zone)) Filter: (notification_status = ANY ('{sending,sent,delivered,pending,temporary-failure,permanent-failure}'::text[])) Planning Time: 18.032 ms Execution Time: 759.001 ms (21 rows) Queries using the new view appear to be slower than without, but the differences I've seen are minimal: the original queries execute in seconds locally and in Production, so it's not a big issue. Notes: Performance ================== I downloaded a minimal set of columns for testing: \copy ( select id, notification_type, key_type, created_at, service_id, template_id, sent_by, rate_multiplier, international, billable_units, postage, job_id, notification_status from notifications ) to 'notifications.csv' delimiter ',' csv header; CREATE TABLE notifications_no_pii ( id uuid NOT NULL, notification_type public.notification_type NOT NULL, key_type character varying(255) NOT NULL, created_at timestamp without time zone NOT NULL, service_id uuid, template_id uuid, sent_by character varying, rate_multiplier numeric, international boolean, billable_units integer NOT NULL, postage character varying, job_id uuid, notification_status text ); copy notifications_no_pii from '/Users/ben.thorner/Desktop/notifications.csv' delimiter ',' csv header; CREATE INDEX ix_notifications_no_piicreated_at ON notifications_no_pii USING btree (created_at); CREATE INDEX ix_notifications_no_piijob_id ON notifications_no_pii USING btree (job_id); CREATE INDEX ix_notifications_no_piinotification_type_composite ON notifications_no_pii USING btree (notification_type, notification_status, created_at); CREATE INDEX ix_notifications_no_piiservice_created_at ON notifications_no_pii USING btree (service_id, created_at); CREATE INDEX ix_notifications_no_piiservice_id_composite ON notifications_no_pii USING btree (service_id, notification_type, notification_status, created_at); CREATE INDEX ix_notifications_no_piitemplate_id ON notifications_no_pii USING btree (template_id); And similarly for the history table. I then created a sepatate view across both of these temporary tables using just these columns. To test performance I created some queries that reflect what is run by the billing [^5] and status [^6] tasks e.g. explain analyze select template_id, sent_by, rate_multiplier, international, sum(billable_units), count() from notifications_all_time_view where notification_status in ('sending', 'sent', 'delivered', 'pending', 'temporary-failure', 'permanent-failure') and key_type in ('normal', 'team') and created_at >= '2022-05-01 23:00' and created_at < '2022-05-02 23:00' and notification_type = 'sms' and service_id = 'c5956607-20b1-48b4-8983-85d11404e61f' group by 1,2,3,4; explain analyze select template_id, job_id, key_type, notification_status, count(*) from notifications_all_time_view where created_at >= '2022-05-01 23:00' and created_at < '2022-05-02 23:00' and notification_type = 'sms' and service_id = 'c5956607-20b1-48b4-8983-85d11404e61f' and key_type in ('normal', 'team') group by 1,2,3,4; Between running queries I restarted my local database and also ran a command to purge disk caches [^7]. I tested on a few services: - c5956607-20b1-48b4-8983-85d11404e61f on 2022-05-02 (high volume) - 0cc696c6-b792-409d-99e9-64232f461b0f on 2022-04-06 (highest volume) - 01135db6-7819-4121-8b97-4aa2d741e372 on 2022-04-14 (very low volume) All execution results are of the same magnitude using the view compared to the worst case of either table on its own. [^1]: `00a04ebf54/app/dao/notifications_dao.py (L389)` [^2]: https://stackoverflow.com/questions/49925/what-is-the-difference-between-union-and-union-all [^3]: https://www.postgresql.org/docs/current/transaction-iso.html [^4]: https://dba.stackexchange.com/questions/210485/can-sub-selects-change-in-one-single-query-in-a-read-committed-transaction [^5]: `00a04ebf54/app/dao/fact_billing_dao.py (L471)` [^6]: `00a04ebf54/app/dao/fact_notification_status_dao.py (L58)` [^7]: https://stackoverflow.com/questions/28845524/echo-3-proc-sys-vm-drop-caches-on-mac-osx	2022-05-19 15:14:32 +01:00
Leo Hemsted	6b0d3860bd	make notify_db fixture private	2022-05-04 11:37:05 +01:00
Leo Hemsted	6181c60f75	remove usage of notify_db fixture in unit tests * notify_db fixture creates the database connection and ensures the test db exists and has migrations applied etc. It will run once per session (test run). * notify_db_session fixture runs after your test finishes and deletes all non static (eg type table) data. In unit tests that hit the database (ie: most of them), 99% of the time we will need to use notify_db_session to ensure everything is reset. The only time we don't need to use it is when we're querying things such as "ensure get X works when database is empty". This is such a low percentage of tests that it's easier for us to just use notify_db_session every time, and ensure that all our tests run much more consistently, at the cost of a small bit of performance when running tests. We used to use notify_db to access the session object for manually adding, committing, etc. To dissuade usage of that fixture I've moved that to the `notify_db_session`. I've then removed all uses of notify_db that I could find in the codebase. As a note, if you're writing a test that uses a `sample_x` fixture, all of those fixtures rely on notify_db_session so you'll get the teardown functionality for free. If you're just calling eg `create_x` db.py functions, then you'll need to make you add notify_db_session fixture to your test, even if you aren't manually accessing the session.	2022-05-04 11:36:54 +01:00
Katie Smith	aec631f208	Add a type table for broadcast providers This adds a type table for broadcast providers, which is the pattern we follow with our models (e.g. we have a `broadcast_channel_types` table). As well as the four providers, the migration populates it with `all` which is the value that will replace `null` in a later change. It should be safe to add the foreign key constraint to the `service_broadcast_settings` in the same migration since the column is still nullable and we don't have data in that column that is not in the types table.	2021-05-06 15:30:04 +01:00
Ben Thorner	a91fde2fda	Run auto-correct on app/ and tests/	2021-03-12 11:45:45 +00:00
David McDonald	91f5be835a	Add DB table for service broadcast settings This will allow us to store details of which channel a service should be sending to. See the comment about how all broadcast services can have a row in the table but may not at the moment. This has been done for speed as it's the quickest way to let us set up different services to send to different channels for some needed testing with the mobile handset providers in the coming week.	2021-02-01 14:10:37 +00:00
Leo Hemsted	36ae5fadf6	add broadcast_event table It's clear that we need a way to track updates to a broadcast message. It's also clear that we'll need some kind of audit log that captures exactly what was sent out in a message. This commit adds a new database table, `broadcast_event`, which maps 1:1 with CAP XML sent to the CBCs. We'll create one of these just before sending out. The main driver for this was that cancel and update messages need to contain a list of references of all previous messages that they're amending. This is of format `{sender},{identifier},{sent_timestamp}`, and the identifier itself needs to be unique for each message.	2020-07-28 12:10:18 +01:00
Leo Hemsted	eb72a0279f	remove monkeypatch it's interacting strangely with our os.environ stuff and not restoring nicely, probably due to fixture order which is hard to visualise and control.	2020-01-06 17:53:56 +00:00
Leo Hemsted	d244146638	fix weird test isolation errors with os.environ and boto3 test_config manipulates os.environ. os.environ is an `environ` object, which acts like a dict but isn't in some subtle unknowable ways. The `reload_config` fixture would create a dict copy of the env, and then just call `os.environ = old_env` afterwards. Boto3 would then complain that it couldn't load credentials (despite us using the mock_s3 fixture and also not having creds in the environment in the first place). Not entirely sure why this happens, but it does. For some reason, it being a `dict` instead of an `environ` object causes the mocking of boto3 to fail. The solution is to not overwrite os.environ entirely, rather, use the standard dictionary setitem syntax to update the values to their previous values. Use `clear` to empty the environment too.	2020-01-06 16:21:34 +00:00
Leo Hemsted	52a33f220b	make tests use mmg 100% of the time by default we randomly choose between sms providers now - this means that tests may sometimes send firetext and sometimes mmg, so we'd need to patch out different HTTP calls, expect different values in sent_by, etc etc. To ensure tests are consistent, add a new fixture that is always used by notify_db_session, which sets the priorities of the sms providers to 100% mmg 0% firetext. if you need to test other values, then you should set the values manually in the test file	2019-11-28 13:29:02 +00:00
Pea Tyczynska	38bb2c1cf6	Simpify org types migration script and introduce foreign key Don't transform org types on prod	2019-07-24 17:55:56 +01:00
Leo Hemsted	28bff28786	remove letter rates between tests we only use them in ft_billing so no reason not to delete them. makes the tests read better as it's obvious how they work too.	2019-04-03 15:34:02 +01:00
Leo Hemsted	06f9e445f7	remove dvla_organisation from database downgrade re-populates all static data but leaves all services with 001 (HM Government) branding	2019-02-14 14:27:59 +00:00
Leo Hemsted	ac5c19c3d5	remove some pytest warnings	2018-10-26 17:54:53 +01:00
Leo Hemsted	5c4f3e246c	make test dvla response file timestamps in a random order since there'll be a bunch of threads running functional test tasks at the same time, there's no point always trying to start from the same second and then stepping back to the same one-second-back file each time. Also, this leads us to an increased risk of race conditions. This change takes the same thirty second range, but shuffles it. The tests, since they're no longer deterministic, now use a new Matcher object (w/ credit to alexey) to match any filename from within that thirty second range	2018-07-20 12:09:00 +01:00
Pea Tyczynska	183aa160c6	Add callback_type column to service_callback_api table Also add service_callback_type table with allowed types	2018-07-17 16:54:00 +01:00
Katie Smith	c57f33a02d	Use real letter_rates in tests Added the letter_rate table to the list of tables which does not get deleted after each test run and changed the tests to use the real letter rates. Also removed the letter rate DAO since this was only being used in tests, so was no longer needed.	2018-07-10 11:31:18 +01:00
Athanasios Voutsadakis	6f1e4c76d5	Make test context managers more reliable Sometimes, when a test using one of the set_config[_values] context managers failed or raised an exception it would cause the context to not be able to revert its config changes, resulting in a 'spooky action at a distance' where random tests would start to fail for non-obvious reasons.	2018-03-27 17:41:05 +01:00
Leo Hemsted	57a174aeb4	add tests for org invite rest, and update conftest	2018-02-23 10:45:18 +00:00
Leo Hemsted	450deae32a	add user_to_organisation and invited_organisation_users tables similar to user_to_service and invited_users, but without auth types or permissions	2018-02-15 14:19:24 +00:00
Leo Hemsted	5466b7cd3c	fix test create_app invocation	2017-11-23 17:04:58 +00:00
Leo Hemsted	7c8586f959	add auth_type to schemas to enable updating in endpoints	2017-10-30 12:29:01 +00:00
Leo Hemsted	6acea8323b	fixup! add auth type table, currently contains sms_auth and email_auth	2017-10-30 11:59:32 +00:00
Leo Hemsted	c39ec90727	add auth type table, currently contains sms_auth and email_auth	2017-10-30 11:11:37 +00:00
Leo Hemsted	2fefe8a957	use sqlalchemy hooks rather than pyscopg2 seems to play nicer with docker?	2017-08-29 18:03:15 +01:00
Leo Hemsted	3d4dbaa632	run tests in multiple threads at once previously we didn't do this because the tests all used the same DB (test_notifications_api), however @minglis shared a snippet that simply creates one test db per thread.	2017-08-29 17:46:11 +01:00
Leo Hemsted	c36e50bef1	update dependencies	2017-08-18 17:02:31 +01:00
Leo Hemsted	7f883f1355	don't store non-strings to os.environ in tests, we were replacing os.environ with a basic dict so that we didn't overwrite the contents of the real environment during tests. However, os.environ doesn't accept non-str values, so this commit changes the fixture so that it asserts all values set are strings. We needed to change how we store ip whitelist stuff in the env because of this.	2017-07-11 15:41:44 +01:00
Ken Tsang	114d4d84d4	Add service permissions DAO and refactor user service permission mock	2017-05-15 17:28:14 +01:00
Leo Hemsted	7e52fa4d13	add new notification_status column to models.py We now have a new column in the database, but it isn't being populated. The first step is to make sure we update this column, while still keeping the old enum based column up to date as well. A couple of changes have had to happen to support this - one irritating thing is that if we're ever querying columns individually, including `Notification.status`, then we'll need to give that column a label, since under the hood it translates to `Notification._status_enum`. Accessing status through the ORM (i.e., my_noti.status = 'sending' or similar) will work fine.	2017-05-04 17:24:28 +01:00
Rebecca Law	6dc336ad6c	Created new queries to return the rate with the sum of billable units for the year totals. Once we have the new columns in notifications table, the query will need to include the rate multiplier and if the number is international. The monthly billing query will be built next.	2017-04-28 10:10:48 +01:00
Rebecca Law	266c17dfd6	Do not delete rates	2017-04-24 16:32:23 +01:00
Imdad Ahad	c4fac1d937	Revert "Revert "add DVLA organisations to API""	2017-04-21 16:05:07 +01:00
Leo Hemsted	c3e56d5d2d	Revert "add DVLA organisations to API"	2017-04-20 18:21:56 +01:00
Leo Hemsted	d514d99a67	add DVLA organisations to API when services are created, they'll have a dvla_org_id of 001, or HM Government. That can be changed later using a regular update call	2017-04-19 16:31:18 +01:00
Imdad Ahad	73d5ce4f8b	Add tests to verify correctness of the switching provider task	2017-02-24 12:23:39 +00:00
Andras Ferencz-Szabo	e10c5e28cd	Merge pull request #763 from alphagov/cloudfoundry Run on Paas	2017-01-17 11:28:38 +00:00
Rebecca Law	6c79ddbe41	This is the first deploy in series of deploys to give certain templates priority in processing. If the template is marked as priority the notification will be sent using the `notify` queue. The `notify` queue is a low volume queue, messages here will not be queue behind a large job and should be delivered with in a more consistent time frame. - Added templates.process_type and templates_history.process_type column. - Added a template_process_type table to handle the enum for templates.process_type, initial values are normal and priority https://www.pivotaltracker.com/story/show/135429147	2017-01-13 12:14:34 +00:00

1 2

85 Commits