notifications-api

mirror of https://github.com/GSA/notifications-api.git synced 2025-12-13 08:42:21 -05:00

Author	SHA1	Message	Date
Kenneth Kehl	8fa893895c	cleanup	2025-07-10 14:29:46 -07:00
Kenneth Kehl	905df17f65	remove datetime.utcnow()	2024-05-23 13:59:51 -07:00
Cliff Hill	3982f061b6	Made enums.py for all the enums to avoid cyclic imports. Signed-off-by: Cliff Hill <Clifford.hill@gsa.gov>	2024-02-28 12:43:31 -05:00
Cliff Hill	820ee5a942	Cleaning up a lot of things, getting Enums used everywhere. Signed-off-by: Cliff Hill <Clifford.hill@gsa.gov>	2024-02-28 12:40:52 -05:00
Kenneth Kehl	00fd3a72bb	code review feedback, fix setup.cfg and reformat	2023-08-25 08:10:33 -07:00
Kenneth Kehl	026dc14021	notify-api-412 use black to enforce python style standards	2023-08-23 10:35:43 -07:00
Kenneth Kehl	08c1ad75c8	notify-260 remove server-side timezone handling	2023-05-10 08:39:50 -07:00
Steven Reilly	ff4190a8eb	Remove letters-related code (#175 ) This deletes a big ol' chunk of code related to letters. It's not everything—there are still a few things that might be tied to sms/email—but it's the the heart of letters function. SMS and email function should be untouched by this. Areas affected: - Things obviously about letters - PDF tasks, used for precompiling letters - Virus scanning, used for those PDFs - FTP, used to send letters to the printer - Postage stuff	2023-03-02 20:20:31 -05:00
stvnrlly	b50cb4712f	tz utility swap and many test updates	2022-11-10 12:33:25 -05:00
Ben Thorner	458e997706	Recalculate billing rows for 10 days (prev. 4) This effectively reverts [^1], which was only a temporary change. I suspect the performance problem will go away with [^2]. While we've clearly been managing without this change, it resulted in several rows being left as incorrect when letter receipts were delayed. It makes sense for us to run this task for the same period as we do to aggregate statuses, as status affects billing. [^1]: `e5c76ffda7` [^2]: https://github.com/alphagov/notifications-api/pull/3542	2022-05-17 17:38:08 +01:00
Ben Thorner	a69d1635a1	Update FactStatus table in bulk for each service Previously we were looping over data from the Notifications/History table and then shovelling it into the status table, one row at a time - plus an extra delete to clean up any existing data. This replaces that with a batch insertion, similar to how we archive notifications [1], but using a simple subquery (via "from_select" [2]) instead of a temporary table. To make the select compatible with the insert, I've used "literal" to inject the constant pieces of data, so each row has everything it needs to go into the status table. [1]: `9ce6d2fe92/app/dao/notifications_dao.py (L295)` [2]: https://docs.sqlalchemy.org/en/14/core/dml.html#sqlalchemy.sql.expression.Insert.from_select	2022-02-16 13:40:05 +00:00
Ben Thorner	ef231d5de7	Fix task name and action in status task logs	2022-02-16 11:45:45 +00:00
Ben Thorner	1213463b8e	Only aggregate status when necessary for a service This takes a similar approach to the nightly deletion task so that we only create sub-tasks when there are actually notifications to aggregate for a given type and day [1]. We're making this change to stop the duplication errors we're getting at the moment and ensure the task can scale to more messages and more services. There are two parts to this: - Each subtask should now run within the 5 minute visibility timeout. However, they may still be duplicated if the parent task overruns [2]. - The parent task creates a mininal number of subtasks, and the query to determine this is very fast for a normal process day (milliseconds). Since all tasks will run quickly, there should be no more duplication. In order to test this more nuanced task, I rewrote the tests: - One test checks the subtask is called correctly. - One test checks we create all the right subtasks. [1]: https://github.com/alphagov/notifications-api/pull/3381 [2]: https://docs.google.com/document/d/1MaP6Nyy3nJKkuh_4lP1wuDm19X8LZITOLRd9n3Ax-xg/edit#heading=h.q3intzwqhfzl	2022-02-09 17:39:07 +00:00
Ben Thorner	c8db58d0e8	Reorder loops for creation status agg sub tasks This will help tailor the innermost loop on services.	2022-02-09 17:39:06 +00:00
Ben Thorner	d6678b6a70	Remove unnecessary logs from status aggreagtion These can be inferred elsewhere: - Task creation is obvious from task execution. If we're concerned about a specific service, we can check the updated times on the DB records, since all records are recreated each time this runs. - Task starting is already logged. - Task completion is already logged. The number of rows updated can also be inferred from the DB. The log I've found useful is the one about fetching the data, and I've also added another to time how long it takes to insert the data, as both could be sources of poor performance. Arguably we should use metrics for this sort of thing, but logs are easier in practice for the metric systems we have.	2022-02-09 17:39:05 +00:00
Ben Thorner	018a253b6f	Revert "Revert running status aggregation in parallel" This reverts commit `0f6dea0deb`.	2022-02-09 17:39:00 +00:00
Ben Thorner	0f6dea0deb	Revert running status aggregation in parallel The top-level task didn't run successfully after this was deployed due to the worker being killed due to heavy disk usage. While the more parallel version does log much more, it doesn't totally explain the disk behaviour. Nonetheless, reverting it is sensible to give us the time we need to investigate more.	2022-01-20 12:22:33 +00:00
Ben Thorner	9686595fa8	Minor tweaks to address comments on the PR To address: - https://github.com/alphagov/notifications-api/pull/3425#discussion_r786867994 - https://github.com/alphagov/notifications-api/pull/3425#discussion_r786853329 - https://github.com/alphagov/notifications-api/pull/3425#discussion_r786848793 - https://github.com/alphagov/notifications-api/pull/3425#discussion_r786214794	2022-01-18 16:56:53 +00:00
Ben Thorner	9182ebf4e5	Parallelise status aggregation by service and day This follows a similar approach as [1]. Recently we've seen lots of errors from this task, which we think are a consequence of it doing too much work and tripping Celery's visibility timeout. While we can optimise the query [2], it's likely the errors will return as the number of live services grows. Parallelising the aggregation now will make it more futureproof. [1]: https://github.com/alphagov/notifications-api/pull/3397 [2]: https://github.com/alphagov/notifications-api/pull/3417	2022-01-12 15:47:59 +00:00
Ben Thorner	d772ae6b46	Standardise logs for status aggregation tasks This will make it easier to parallelise by service later on.	2022-01-12 15:47:57 +00:00
Ben Thorner	4feed950c4	DRY-up loops to kick off status aggregation tasks This will make it easier to parallelise by service in the following commits, since we only have one loop to change.	2022-01-12 15:47:56 +00:00
Ben Thorner	ddbf556486	Rewrite task to aggregate status by service This is a step towards parallelising the task by service and day.	2022-01-12 15:47:53 +00:00
Ben Thorner	a7b39a930c	Add comment to explain status aggregation approach This relates to the performance optimisation work we're doing [1]. Before optimising the task, it's worth asking if we can do less - the comment explains why it has to be this way. Some references to back up the comment: - We do status updates in either table [2]. - We don't allow duplicate receipts for emails [3]. - We don't allow duplicate receipts for SMS [4]. - We don't expect duplicate receipts for letters. This is something we would need to revisit if we want to support additional status updates - we could reject based on the age of the notification, rather than the status. [1]: https://github.com/alphagov/notifications-api/pull/3417 [2]: `20ead82463/app/dao/notifications_dao.py (L538)` [3]: `20ead82463/app/celery/process_ses_receipts_tasks.py (L58)` [4]: `20ead82463/app/dao/notifications_dao.py (L129-L135)`	2022-01-10 18:15:54 +00:00
Ben Thorner	666ac1ab4f	Log activity on all periodic Celery tasks As stated in the comment, this would have been helpful during an incident to give further reassurance that a task had at least started running - at the time the only evidence for this was the Cronitor dashboard itself, which we don't often look at. I've removed other, equivalent "starting" logs, but kept those that provide additional information in the log message.	2021-11-17 09:48:03 +00:00
Ben Thorner	e3e067c795	Remove redundant @statsd timing decorators These are superseded by timing task execution generically in the NotifyTask superclass [1]. Note that we need to wait until we've gathered enough data under the new metrics before removing these. [1]: https://github.com/alphagov/notifications-api/pull/3201#pullrequestreview-633549376	2021-04-12 15:19:18 +01:00
Ben Thorner	a91fde2fda	Run auto-correct on app/ and tests/	2021-03-12 11:45:45 +00:00
David McDonald	7a9c756117	Add in lots of logging to our reporting tasks We've seen only some of these reporting tasks happen but with no log messages to indicate what happened and no app crashes. This hopefully will give us a better picture of a timeline. Note, I've tried to make our message format very consistent and good for searching for in kibana so I've changed that across this whole file for consistency.	2020-04-28 10:38:53 +01:00
Leo Hemsted	cec44f60e3	fix log line typo log lines didn't make sense because the arguments were the wrong way round. As an experiment to try and clean up some of our code a bit, this commit adds f-strings. f-strings were added in python 3.6, as a way to clean up, simplify, and improve the performance of `str.format`.	2019-12-11 13:22:28 +00:00
Leo Hemsted	6ac4595224	process letters for 10 days when updating ft_notification_status sms and emails have a very predictable 72 hour lifecycle. letters, on the other hand, have ridiculously complex lifecycles - they might not get sent because it's a weekend, they might not get sent because they're second class and are only processed on alternate days, they might not get sent because a different letter in the same batch had an error that we didn't know about. Either way, it's apparent that four days is definitely not enough time to guarantee that letters have gone from sending to delivered. Extend the amount of days we process for letters to 10 days. Keep emails and sms down at 4 to keep run-times shorter We're deliberately not thinking about returned letters here at all.	2019-12-09 16:02:43 +00:00
Leo Hemsted	884cb24bfa	remove day_start from create nightly notification status it makes less sense once we introduce different start dates for letters and emails. Also, we never use it, since we just call the day tasks ourselves from commands.py	2019-12-09 16:02:21 +00:00
Leo Hemsted	0448bca542	make create_nightly_notification_status_for_day take notification_type the nightly task won't be affected, it'll just trigger three times more sub-tasks. this doesn't need to be a two-part deploy because we only trigger this overnight, so as long as the deploy completes in daytime we don't need to worry about celery task signatures	2019-12-05 14:43:33 +00:00
Leo Hemsted	8f13697cf1	Revert "trigger nightly delete tasks from the create notification status task" This reverts commit `58f24a0a83`.	2019-08-19 16:06:25 +01:00
Leo Hemsted	36dd750637	split up reporting tasks in to separate tasks per day to try and speed up overall time by parallelising	2019-08-19 16:06:25 +01:00
Leo Hemsted	92d78956be	Merge pull request #2592 from alphagov/reporting-worker Add reporting worker	2019-08-15 17:22:27 +01:00
Leo Hemsted	e5c76ffda7	reduce days to process from 10 to 4 to try and speed it up temporarily.	2019-08-15 17:06:38 +01:00
Leo Hemsted	58f24a0a83	trigger nightly delete tasks from the create notification status task the nightly tasks need to run after the create nightly notification status task - so that test notifications are still there to record stats for, and to stop the risk of deleting notificaitons part-way through recording stats for them.	2019-08-14 18:04:45 +01:00
Rebecca Law	996dcdd88c	Increase the number of days we rebuild the tables for	2019-07-18 16:45:27 +01:00
Rebecca Law	e3ee99e70d	Reduce the number of days to recalculate billing. It's not necessary to calculate longer than 4 days.	2019-05-15 14:40:53 +01:00
Rebecca Law	1c68e0f565	Remove unused method. last_n_days was only being used in a test.	2019-04-12 10:26:46 +01:00
Leo Hemsted	1dc084be54	fix nightly ft stats tables task to respect BST the create_nightly_notification_status task runs at 00:30am UK time, however this means that in summer datetime.today() will return the wrong date as the server (which runs on UTC) will run the task at 23:30 (populating the wrong row in the table). fix this to use nice tz aware functions	2019-04-02 15:15:07 +01:00
Leo Hemsted	754c65a6a2	create cronitor decorator that alerts if tasks fail make a decorator that pings cronitor before and after each task run. Designed for use with nightly tasks, so we have visibility if they fail. We have a bunch of cronitor monitors set up - 5 character keys that go into a URL that we then make a GET to with a self-explanatory url path (run/fail/complete). the cronitor URLs are defined in the credentials repo as a dictionary of celery task names to URL slugs. If the name passed in to the decorator isn't in that dict, it won't run. to use it, all you need to do is call `@cronitor(my_task_name)` instead of `@notify_celery.task`, and make sure that the task name and the matching slug are included in the credentials repo (or locally, json dumped and stored in the CRONITOR_KEYS environment variable)	2019-01-18 15:36:53 +00:00
Pea Tyczynska	22ad14fcee	Fix logging for create_nightly_notification_status	2018-11-09 11:49:49 +00:00
Pea Tyczynska	987445f1bf	ft_notification_status now updates data for 4 days back This was done so when notification is timed out from sending/pending to temporary_failure, this change has to always be caught in the ft_notification_status	2018-11-08 11:52:40 +00:00
Rebecca Law	4fc004b00a	Increase the number of days we calculate billing from 3 to 10 days. Log exception if the billing counts for letters are different in the dvla response file than what we collected.	2018-07-24 16:28:30 +01:00
Rebecca Law	709a6c38c7	Created a task to update ft_notification_status for the last three days.	2018-06-20 16:45:20 +01:00
Rebecca Law	40d8f78b2b	Convert the day_start from a string to a datetime.	2018-05-15 14:00:06 +01:00
Rebecca Law	d98581cfe6	Added a new endpoint for yearly usage totals using ft_billing.	2018-05-11 16:25:16 +01:00
Rebecca Law	99d1357c37	Fix the logging message in the nightly task	2018-05-10 17:13:38 +01:00
Rebecca Law	8028f6cc28	We found that the reporting task failed twice because of integrity constraints. This was because the rate_multiplier was being added as 1 and 1.0 which was not resolving to the same. This updates the table to use Integrer. Also changed the logging for the task.	2018-05-10 15:35:58 +01:00
Rebecca Law	d00614205e	Adding test for fetch_biling_data_for_day.	2018-04-25 14:45:51 +01:00

1 2

60 Commits