notifications-api

mirror of https://github.com/GSA/notifications-api.git synced 2026-01-15 07:01:20 -05:00

Author	SHA1	Message	Date
stvnrlly	637fbdb891	broadcast flake8 cleanup	2022-10-25 11:53:24 -04:00
Steven Reilly	d37c2a53b8	Merge branch 'main' into stvnrlly-remove-broadcasts	2022-10-25 10:17:49 -04:00
stvnrlly	8e2b8dd7c4	keep on flakin the flake world	2022-10-21 13:29:52 +00:00
stvnrlly	9f37592b1e	cleaner flake8 cleaning	2022-10-21 00:26:37 +00:00
stvnrlly	d4e156e8ae	Merge branch 'main' into stvnrlly-remove-broadcasts	2022-10-20 19:44:20 -04:00
stvnrlly	5dfc26c1f5	pass pytest multiline preferences	2022-10-19 16:16:29 +00:00
stvnrlly	e9fdfd59f4	clean flake8 except provider code	2022-10-19 16:16:26 +00:00
jimmoffet	97aa118fee	additional type checking in process_ses_receipt_tasks	2022-10-04 18:16:19 -07:00
jimmoffet	434b7b2d08	clean up and remove redundancy	2022-10-04 16:01:30 -07:00
stvnrlly	57f4df8ed1	remove broadcast-related code, except migrations	2022-10-04 15:28:27 +00:00
jimmoffet	fc9e4107c1	all tests passing	2022-10-03 20:07:42 -07:00
jimmoffet	c04d1df6b3	fixing tests	2022-10-03 17:16:59 -07:00
jimmoffet	c7ccc3b0dd	fix conflict	2022-10-03 09:14:04 -07:00
jimmoffet	8cb6f60f04	modify inbound notif processing	2022-10-03 09:05:34 -07:00
Jim Moffet	d0bba8a8bd	Merge branch 'main' into jim/091422/deliverycallbacks	2022-09-30 11:21:46 -04:00
jimmoffet	48af6f7c23	fix tests	2022-09-30 10:59:48 -04:00
Ryan Ahearn	e3ad01119d	Replace celery[sqs] with celery[redis]	2022-09-29 08:59:17 -04:00
jimmoffet	c636eac964	replace m2crypto with oscrypto	2022-09-23 15:57:06 -07:00
jimmoffet	ea3eefa81c	test branch for notify-api-alt temporary deploy	2022-09-23 11:56:39 -07:00
jimmoffet	4c86024f21	clean up comments	2022-09-20 20:22:12 -07:00
jimmoffet	a03de0dd56	remove outdated validatesns library and replace with maintainable code	2022-09-20 20:11:09 -07:00
jimmoffet	f1aec54665	clean up comments and method dupes	2022-09-15 15:48:37 -07:00
jimmoffet	b0f819dbd9	canada UK ses callbacks monster mash	2022-09-15 14:59:13 -07:00
Ryan Ahearn	806e2ad2dc	Review and update uses of PRNG	2022-08-19 15:26:12 +00:00
Ryan Ahearn	e77cedb039	Clean up xml finding from static-scan	2022-08-18 17:52:44 +00:00
Christa Hartsock	af6495cd4c	Get tests passing locally When we cloned the repository and started making modifications, we didn't initially keep tests in step. This commit tries to get us to a clean test run by skipping tests that are failing and removing some that we no longer expect to use (MMG, Firetext), with the intention that we will come back in future and update or remove them as appropriate. To find all tests skipped, search for `@pytest.mark.skip(reason="Needs updating for TTS:`. There will be a brief description of the work that needs to be done to get them passing, if known. Delete that line to make them run in a standard test run (`make test`).	2022-07-07 15:41:15 -07:00
Jim Moffet	aa4ec532a4	implement SNS	2022-06-17 11:16:23 -07:00
Jim Moffet	59b72f4853	add devcontainer configs and docker network orchestration	2022-06-13 13:16:32 -07:00
Ben Thorner	458e997706	Recalculate billing rows for 10 days (prev. 4) This effectively reverts [^1], which was only a temporary change. I suspect the performance problem will go away with [^2]. While we've clearly been managing without this change, it resulted in several rows being left as incorrect when letter receipts were delayed. It makes sense for us to run this task for the same period as we do to aggregate statuses, as status affects billing. [^1]: `e5c76ffda7` [^2]: https://github.com/alphagov/notifications-api/pull/3542	2022-05-17 17:38:08 +01:00
Ben Thorner	c27107fa74	Remove support for Reach provider This provider was never active and support was never completed, so there's little value in keeping all this potentially confusing code.	2022-04-29 12:28:08 +01:00
Ben Thorner	779b8e941f	Rewrite broadcast Zendesk alert at approval time The new alert happens earlier but is otherwise the same: - We only create a ticket in Production. - We only create a ticket on approval. I took this opportunity to refactor the alert as a private function and test this specifically in detail to avoid lots of repetitive mocks, which are required when calling the main "update" function. One test I haven't preserved was for when the "names" array is empty, as this was added for a legacy data integrity scenario [^1]. [^1]: `bf0bf4e31c`	2022-04-05 12:57:08 +01:00
Ben Thorner	3988a6cd07	Include exception info in SMS warning log This makes it easier to debug failures when adding a new provider.	2022-03-30 13:36:56 +01:00
Ben Thorner	b439fd0718	Add boilerplate for Reach SMS callbacks This is enough to update a notification in DB: 1. First create a notification in the UI and send it. 2. Then reset its attributes to pretend it's for Reach. update notifications set sent_at = null, sent_by = null, notification_status='sending' where id='some-uuid'; 3. Change "notification_id" to "<some-uuid>" in the code. 4. Call the boilerplate endpoint for Reach callbacks. curl -X POST localhost:6011/notifications/sms/reach Interestingly there's no foreign key constraint on "sent_by" in the DB, so this just works: the notification is updated.	2022-03-24 16:56:33 +00:00
Leo Hemsted	2fbe9e85ac	Merge pull request #3479 from alphagov/auto-retry-stuck-av-letters automatically retry letters stuck in pending-virus-scan	2022-03-15 11:43:42 +00:00
Leo Hemsted	9e8df8b623	remove "letters stuck pending av" runbook there's not anything we know we need to do now that we resolve stuck letters automatically. Letters could still get into this state, so it's worth alerting us. However, we don't have anything concrete that we know how to fix these letters, so we should just remove the runbook entirely.	2022-03-10 14:10:01 +00:00
David McDonald	0d952b4d8c	Reduce timeout for service callback attempt to 5 seconds It is currently 60 seconds but we have had two incidents in the past week where there is a connection error talking to a service and the request takes up to 60 seconds before failing. When this happens, if there are a few of these callbacks then all of them will completely hog the service callback worker and build up a big queue of all the other service callbacks. 5 seconds has been chosen as that is still a pretty decent length time for a simple web request that should just be giving them a little bit of information for them to store. 5 seconds should be a sufficient enough reduction that we dramatically reduce this problem for the moment. Open to this number being changed in the future based on how we see it perform.	2022-03-08 13:05:32 +00:00
Leo Hemsted	00259893f1	automatically retry letters stuck in pending-virus-scan Since sept 2019 we've had to log on to production around once every twenty days to restart the virus scan task for a letter. Most of the time this is just a case of making sure the file is in the scan bucket, and then triggering the task. If the file isn't in the scan bucket we'd need to do some more manual investigation to find out exactly where the file got stuck, but I can only remember times when it's been in the scan bucket. So if the file is in the scan bucket, we can just check that with code and kick the task off automatically.	2022-03-07 18:31:46 +00:00
Katie Smith	514bd48614	Update flake8-bugbear from 20.11.1 to 22.1.11 And ignore a warning, since I did not think that in this case "Using .strip() with multi-character strings is misleading the reader".	2022-03-02 16:51:09 +00:00
Ben Thorner	a69d1635a1	Update FactStatus table in bulk for each service Previously we were looping over data from the Notifications/History table and then shovelling it into the status table, one row at a time - plus an extra delete to clean up any existing data. This replaces that with a batch insertion, similar to how we archive notifications [1], but using a simple subquery (via "from_select" [2]) instead of a temporary table. To make the select compatible with the insert, I've used "literal" to inject the constant pieces of data, so each row has everything it needs to go into the status table. [1]: `9ce6d2fe92/app/dao/notifications_dao.py (L295)` [2]: https://docs.sqlalchemy.org/en/14/core/dml.html#sqlalchemy.sql.expression.Insert.from_select	2022-02-16 13:40:05 +00:00
Ben Thorner	ef231d5de7	Fix task name and action in status task logs	2022-02-16 11:45:45 +00:00
Ben Thorner	7f4b140f97	Rename function to make it consistent This is consistent with the new "on_date" function. It was going off the edge of my screen before in some parts of the code.	2022-02-09 17:39:08 +00:00
Ben Thorner	1213463b8e	Only aggregate status when necessary for a service This takes a similar approach to the nightly deletion task so that we only create sub-tasks when there are actually notifications to aggregate for a given type and day [1]. We're making this change to stop the duplication errors we're getting at the moment and ensure the task can scale to more messages and more services. There are two parts to this: - Each subtask should now run within the 5 minute visibility timeout. However, they may still be duplicated if the parent task overruns [2]. - The parent task creates a minimal number of subtasks, and the query to determine this is very fast for a normal process day (milliseconds). Since all tasks will run quickly, there should be no more duplication. In order to test this more nuanced task, I rewrote the tests: - One test checks the subtask is called correctly. - One test checks we create all the right subtasks. [1]: https://github.com/alphagov/notifications-api/pull/3381 [2]: https://docs.google.com/document/d/1MaP6Nyy3nJKkuh_4lP1wuDm19X8LZITOLRd9n3Ax-xg/edit#heading=h.q3intzwqhfzl	2022-02-09 17:39:07 +00:00
Ben Thorner	c8db58d0e8	Reorder loops for creation status agg sub tasks This will help tailor the innermost loop on services.	2022-02-09 17:39:06 +00:00
Ben Thorner	d6678b6a70	Remove unnecessary logs from status aggregation These can be inferred elsewhere: - Task creation is obvious from task execution. If we're concerned about a specific service, we can check the updated times on the DB records, since all records are recreated each time this runs. - Task starting is already logged. - Task completion is already logged. The number of rows updated can also be inferred from the DB. The log I've found useful is the one about fetching the data, and I've also added another to time how long it takes to insert the data, as both could be sources of poor performance. Arguably we should use metrics for this sort of thing, but logs are easier in practice for the metric systems we have.	2022-02-09 17:39:05 +00:00
Ben Thorner	018a253b6f	Revert "Revert running status aggregation in parallel" This reverts commit `0f6dea0deb`.	2022-02-09 17:39:00 +00:00
Chris Hill-Scott	7f72d3a60f	Bump utils to 53.0.0 Changes: 53.0.0 --- * `notifications_utils.columns.Columns` has moved to `notifications_utils.insensitive_dict.InsensitiveDict` * `notifications_utils.columns.Rows` has moved to `notifications_utils.recipients.Rows` * `notifications_utils.columns.Cell` has moved to `notifications_utils.recipients.Cell` 52.0.0 --- * Deprecate the following unused `redis_client` functions: - `redis_client.increment_hash_value` - `redis_client.decrement_hash_value` - `redis_client.get_all_from_hash` - `redis_client.set_hash_and_expire` - `redis_client.expire` 51.3.1 --- * Bump govuk-bank-holidays to cache holidays for next year.	2022-02-08 09:45:10 +00:00
Rebecca Law	09c8fbe982	Merge pull request #3418 from alphagov/letters-too-long Mark letters as validation-failed if the templated letter is too long.	2022-02-02 08:30:50 +00:00
Rebecca Law	c01c81326c	Update log message to something a little easier to read and query for.	2022-01-24 12:25:53 +00:00
Leo Hemsted	246016a894	don't log if we don't delete anything for a service we try and delete for lots of services. this includes services that don't actually have anything to delete that day. that might be because they had a custom data retention so we always go to check them, or because they only sent test notifications (which we'll delete but not include in the count in the log line). we don't really need to see log lines saying that we didn't delete anything for that service - that's just a long list of boring log messages that will hide the actual interesting stuff - which services we did delete content for.	2022-01-21 11:04:37 +00:00
Ben Thorner	0f6dea0deb	Revert running status aggregation in parallel The top-level task didn't run successfully after this was deployed due to the worker being killed due to heavy disk usage. While the more parallel version does log much more, it doesn't totally explain the disk behaviour. Nonetheless, reverting it is sensible to give us the time we need to investigate more.	2022-01-20 12:22:33 +00:00

1033 Commits