notifications-api

mirror of https://github.com/GSA/notifications-api.git synced 2025-12-22 16:31:15 -05:00

Author	SHA1	Message	Date
Pea Tyczynska	c00f82b81b	Co-Authored-By: Chris Hill-Scott <me@quis.cc> Use .format instead of concatenation to avoid type issues Trying to concatenate uuid onto a string was throwing an error. Also it is not possible to use uuid in parametrize statements it seems as it messes up with running tests on multiple threads	2019-12-11 11:18:42 +00:00
Leo Hemsted	6ac4595224	process letters for 10 days when updating ft_notification_status sms and emails have a very predictable 72 hour lifecycle. letters, on the other hand, have ridiculously complex lifecycles - they might not get sent because it's a weekend, they might not get sent because they're second class and are only processed on alternate days, they might not get sent because a different letter in the same batch had an error that we didn't know about. Either way, it's apparent that four days is definitely not enough time to guarantee that letters have gone from sending to delivered. Extend the amount of days we process for letters to 10 days. Keep emails and sms down at 4 to keep run-times shorter We're deliberately not thinking about returned letters here at all.	2019-12-09 16:02:43 +00:00
Leo Hemsted	884cb24bfa	remove day_start from create nightly notification status it makes less sense once we introduce different start dates for letters and emails. Also, we never use it, since we just call the day tasks ourselves from commands.py	2019-12-09 16:02:21 +00:00
Pea M. Tyczynska	2019070536	Merge pull request #2667 from alphagov/warn-team-about-high-failure-rates Warn team about high failure rates	2019-12-09 11:28:25 +00:00
Pea Tyczynska	87bc86efa7	Reference dev runbook for instructions in the zendesk ticket	2019-12-06 17:05:43 +00:00
Pea Tyczynska	1b7b26bf24	Query directly for services with high failure rate	2019-12-06 16:57:56 +00:00
Pea Tyczynska	b8de67ae54	Update error message to include a url to offending service	2019-12-06 16:57:54 +00:00
Pea Tyczynska	cfbb080f57	Simplify failure rate by building separate query	2019-12-06 16:57:44 +00:00
Pea Tyczynska	53efd87e28	Check for services sending sms messages to tv numbers	2019-12-06 16:57:34 +00:00
Pea Tyczynska	d72ab4f4a6	Send zendesk ticket when services found with high failure rates	2019-12-06 16:57:04 +00:00
Leo Hemsted	0448bca542	make create_nightly_notification_status_for_day take notification_type the nightly task won't be affected, it'll just trigger three times more sub-tasks. this doesn't need to be a two-part deploy because we only trigger this overnight, so as long as the deploy completes in daytime we don't need to worry about celery task signatures	2019-12-05 14:43:33 +00:00
Leo Hemsted	f7fbd6de5b	make 500s change priorities quicker it's not acceptable for a constantly failing provider to take 50 minutes to drain (5x reducing priority by 10). But similarly, we need _some_ delay, or a handful of concurrent failures will completely turn off a provider, rendering the whole exercise kinda pointless. Setting the delay before it tries to reduce priority again to one minute is nice because it means that if one request times out and returns 502, then any other requests that are in flight at that time will time out before the one minute is up and not switch, but any requests made after the switch that take sixty seconds to time out will affect it.	2019-11-28 13:29:39 +00:00
Leo Hemsted	cfe82f8f4a	make 500 error provider switches also check for recent changes moving the logic and the test from switch provider on slow delivery to dao reduce sms provider priority	2019-11-28 13:29:39 +00:00
Leo Hemsted	2a392e7137	update switch provider scheduled task it now looks at both providers and works out whether to deprioritise one, rather than binary switching from one to the other. If anything has altered the priorities in the last ten minutes it won't take any action. If both providers are slow it also won't take any action.	2019-11-28 13:29:38 +00:00
Leo Hemsted	28da190a1c	remove get_current_provider the function no longer makes sense now that we send through both at the same time. mostly just used in old tests that we'll end up rewriting shortly anyway	2019-11-28 13:29:02 +00:00
Leo Hemsted	fa7e0a1e84	add dao_reduce_sms_provider_priority function retrieve the sms providers from the DB, and decrease the chosen provider's priority by 10, while increasing the other by 10. add a check in to ensure we never decrease below 0 or increase above 100 - this is per provider, we don't check that the two add up to 100 or anything. If the values are outside of this range (eg: set via the UI) then they'll probably* fix themselves at some point - we've added tests to document these cases. Use with_for_update to ensure that the method can only run once at a time - other invocations of the function will be held on that line until the currently running one ends and commits the transaction. This doesn't affect anyone doing things from the UI.	2019-11-28 13:29:01 +00:00
Leo Hemsted	6f38cbbcf1	randomly choose from providers based on priority todo: make sure if they don't add up to 100 we do something sensible, especially if they're both 0.	2019-11-28 13:29:01 +00:00
Rebecca Law	4fd6f33af2	Merge pull request #2658 from alphagov/fix-letters-in-created-status Alert if a letter doesn't make it past created status	2019-11-27 13:38:51 +00:00
Rebecca Law	853df6fbfb	Fix reference to old time frame for task.	2019-11-27 13:26:53 +00:00
Rebecca Law	e0b4b258aa	Shortened the length of time to check for messages with the wrong state. There is a chance that there is an outstanding retry task that has yet to run but the tasks that are replayed here protect against the task running twice. So this just means it might get sent sooner than later.	2019-11-21 15:51:27 +00:00
Rebecca Law	ac4f0e8027	After a comment from @idavidmcdonald, I asked myself why we are not creating the task to upload the pdf and update the notification. The assumption was that S3 would throw an exception if the object was uploaded twice. That's not the case: the default behaviour is that if a file already exists it will be overwritten. So it is completely safe to run the task from the alert. It can also mean that we don't need to wait 4 hours 15 minutes. Shall I decrease the amount of time before restarting the task?	2019-11-19 16:04:21 +00:00
Rebecca Law	918975b0a6	Use sender_id from CSV metadata. When we upload a CSV for a job, we add the sender_id as metadata to the file that is uploaded on S3. There is more than one place where we process rows from that CSV. - process_job - scheduled_job - check_for_missing_rows_in_completed_jobs - check_job_status All of these places need to use the sender_id, now the sender_id is always read from the file metadata. In a subsequent PR we can remove the optional sender_id parameter from process_job task.	2019-11-15 15:42:29 +00:00
Rebecca Law	6155f7666e	Testing with latest	2019-11-15 15:42:24 +00:00
Rebecca Law	516190262a	[WIP]	2019-11-15 15:41:27 +00:00
Rebecca Law	c42420c329	Add an alert when a letter is created but doesn't have a file in S3 for sending. We can tell this is the case because there is no updated_at and billable units are still 0. At this point we are just creating a zendesk ticket - perhaps we can just call the create_letter_pdf task.	2019-11-13 16:39:59 +00:00
Rebecca Law	5aaf5cd588	Add the missing format for the log message when a missing row is processed.	2019-11-07 15:01:23 +00:00
Rebecca Law	559faf3034	Fix the query. Missing the where clause to join the two tables.... OOPS	2019-11-07 10:57:31 +00:00
Rebecca Law	db5a50c5a7	Adding a scheduled task to process missing rows from job Sometimes a job finishes but has missed a row in the middle. It is a mystery why this is happening, it could be that the task to save the notifications has been dropped. So until we solve the mystery let's find missing rows and process them. A new scheduled task has been added to find any "finished" jobs that do not have enough notifications created. If there are missing notifications the job processes those rows for the job. Adding the new task to beat schedule will be done in the next commit. A unique key constraint has been added to Notifications to ensure that the row is not added twice. Any index or constraint can affect performance, but this unique constraint should not affect it enough for us to notice.	2019-11-06 10:49:46 +00:00
Leo Hemsted	975af113e4	Merge pull request #2639 from alphagov/remove-loadtesting-db-migration remove loadtesting from the database	2019-11-06 10:49:46 +00:00
Katie Smith	ceb7cee009	Pass request_id to tasks if available We want to pass the `request_id` to Celery tasks if the task is called from an HTTP request, so that we can add the `request_id` to the logs. This change overwrites `apply_async` to add the `request_id` to the kwargs if available. When we call the task, we then add the `request_id` to g on Flask's application context. Tasks called from `send_task` won't have a `request_id` for now, and this change only affects tasks called from HTTP requests (not from other tasks or from Celery Beat).	2019-10-28 10:59:25 +00:00
Rebecca Law	98c61f58b1	Merge pull request #2624 from alphagov/add-logs-for-jobs Add more logging for process_job	2019-10-28 09:53:15 +00:00
Leo Hemsted	496b6f4737	Merge pull request #2627 from alphagov/letter-alert-v3 Letter alert	2019-10-22 13:26:35 +01:00
Rebecca Law	ad14f96b8d	A small change to make the code just a little bit clearer.	2019-10-17 15:17:43 +01:00
Pea Tyczynska	6ee7ac6cac	Refactor and harmonise metadata for invalid letters with those sent from admin app	2019-10-16 14:11:50 +01:00
Pea Tyczynska	0a617379c4	Put pdf letter validation failed message in S3 metadata So it can be used to tell the user why the letter has failed validation	2019-10-11 17:24:48 +01:00
Leo Hemsted	8285ef5f89	only check for dvla response files on mon/weds/fri dvla don't process 2nd class files on tues and thurs	2019-10-08 18:16:45 +01:00
Rebecca Law	1a203b5c04	Add more logging for process_job	2019-10-01 13:05:05 +01:00
Rebecca Law	44b7b36acd	Added a command to process a row from a job.	2019-09-26 14:19:09 +01:00
Rebecca Law	a1863fa419	Update all calls to `get_folder_name` to include the parameter name. Use created_at date of the notification for precompiled letters.	2019-09-25 14:40:09 +01:00
Pea Tyczynska	1279a46b8b	Don't log address redaction failure when letter sent with test key	2019-09-17 15:55:26 +01:00
Leo Hemsted	99eb17fc29	Merge pull request #2610 from alphagov/get-pdf-contents-via-api add api endpoint to get pdf for letter	2019-09-17 14:55:34 +01:00
Katie Smith	081543a2a9	Refactor out function to get page count This has been moved to the letters utils file since it will be used in more than one place. The notification parameter has been removed so that the function can be used when we don't have a notification id.	2019-09-12 14:58:51 +01:00
Leo Hemsted	52f7620772	create pdfs for test templated letters previously, we didn't create templated letters, and just marked them as delivered straight away. However, we may need to return PDFs for these letters, so we should create them the same as live letters. Then update the functions so that they know where to look for these letters.	2019-09-11 15:02:12 +01:00
Pea Tyczynska	fecd7b5728	Copy original file to redaction_failure folder when redaction fails	2019-09-10 15:10:18 +01:00
Pea Tyczynska	8460147dfa	Handle both new and old response type from template preview's sanitise endpoint Fix tests so they accept new response handling	2019-09-06 13:18:21 +01:00
Leo Hemsted	8f13697cf1	Revert "trigger nightly delete tasks from the create notification status task" This reverts commit `58f24a0a83`.	2019-08-19 16:06:25 +01:00
Leo Hemsted	36dd750637	split up reporting tasks in to separate tasks per day to try and speed up overall time by parallelising	2019-08-19 16:06:25 +01:00
Leo Hemsted	92d78956be	Merge pull request #2592 from alphagov/reporting-worker Add reporting worker	2019-08-15 17:22:27 +01:00
Leo Hemsted	e5c76ffda7	reduce days to process from 10 to 4 to try and speed it up temporarily.	2019-08-15 17:06:38 +01:00
Leo Hemsted	58f24a0a83	trigger nightly delete tasks from the create notification status task the nightly tasks need to run after the create nightly notification status task - so that test notifications are still there to record stats for, and to stop the risk of deleting notifications part-way through recording stats for them.	2019-08-14 18:04:45 +01:00
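
The provider-priority behaviour described in commits 6f38cbbcf1 and fa7e0a1e84 can be sketched roughly as below. This is an illustration built only from the commit messages, not the repository's code: the function names `choose_provider` and `reduce_priority`, the plain-dict representation, and the provider names are hypothetical. The uniform fallback when all priorities are 0 is an assumption (the commit leaves that case as a todo), and the `with_for_update` row locking is omitted.

```python
import random

# Bounds and step size taken from the commit message for fa7e0a1e84.
MIN_PRIORITY = 0
MAX_PRIORITY = 100
STEP = 10


def choose_provider(priorities, rng=random):
    """Pick a provider name at random, weighted by its priority."""
    names = list(priorities)
    weights = [priorities[name] for name in names]
    if sum(weights) <= 0:
        # Assumption: the commit leaves "both 0" as a todo; here we
        # simply fall back to a uniform choice.
        return rng.choice(names)
    return rng.choices(names, weights=weights, k=1)[0]


def reduce_priority(priorities, failing):
    """Lower `failing` by STEP and raise every other provider by STEP.

    Each value is clamped to [MIN_PRIORITY, MAX_PRIORITY] on its own;
    as in the commit message, the totals are not forced to add up to 100.
    """
    return {
        name: max(MIN_PRIORITY, value - STEP)
        if name == failing
        else min(MAX_PRIORITY, value + STEP)
        for name, value in priorities.items()
    }


if __name__ == "__main__":
    current = {"provider_a": 50, "provider_b": 50}  # hypothetical names
    current = reduce_priority(current, "provider_a")
    print(current, choose_provider(current))
```

Clamping each provider independently mirrors the "this is per provider" note in the commit message, so a value that is already at a bound stays there.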

709 Commits