DVLA would like to be able to identify letters sent by the Insolvency
Service, so we are changing the zipfile name. They need all zipfile
names to have the same structure, so we can't just add a marker to files
sent by that service - we have to change all filenames.
The new format is like this:
`{NOTIFY}.{DATE}.{SEQUENCE_ID}.{UNIQUE_ID}.{SERVICE_ID}.{ORG_NAME}.{EXTENSION}`
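For illustration only, a name in the new scheme might be assembled like this (the date format and example values below are made up, not DVLA's spec):

```python
def build_zip_filename(date, sequence_id, unique_id, service_id, org_name, extension="ZIP"):
    # {NOTIFY}.{DATE}.{SEQUENCE_ID}.{UNIQUE_ID}.{SERVICE_ID}.{ORG_NAME}.{EXTENSION}
    return f"NOTIFY.{date}.{sequence_id}.{unique_id}.{service_id}.{org_name}.{extension}"


# e.g. NOTIFY.20200218170000.001.d8e8fca2.3ab9bb41.SOMEORG.ZIP
print(build_zip_filename("20200218170000", "001", "d8e8fca2", "3ab9bb41", "SOMEORG"))
```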
This is not required by DVLA, and since [1] we no longer care about
the end of letter filenames when collating them, removing it is safe
to do. Note that the name of the ZIP files of collated letters is
based on a hash of the filenames, so the expected names in tests
needed updating.
Before merging this we need to do a test run in Staging, so DVLA can
check that a mixture of the old / new filenames won't cause issues.
[1]: https://github.com/alphagov/notifications-api/pull/3172
Previously we did some unnecessary work:
- Collate task. This had one S3 request to get a summary of the object,
which was then used in another request to get the full object. We only
need the size of the object, which is included in the summary [1].
- Archive task. This had one S3 request to get a summary of the object,
which was then used to make another request to delete it. We still need
both requests, but we can remove the S3.Object in the middle.
[1]: https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/s3.html#objectsummary
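A rough sketch of the slimmed-down calls using boto3's resource API (the bucket name and prefix here are made up):

```python
import boto3

bucket = boto3.resource("s3").Bucket("letters-to-collate")  # illustrative name

# Collate: the ObjectSummary returned by a listing already carries the size,
# so there's no need for a second request to fetch the full object.
for summary in bucket.objects.filter(Prefix="2020-02-18/"):
    print(summary.key, summary.size)

# Archive: the summary can be deleted directly, without constructing an
# S3.Object in between; still two requests per object, but no more.
for summary in bucket.objects.filter(Prefix="2020-02-18/"):
    summary.delete()
```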
Previously, the function would just return a presumed filename. Now that
it actually checks S3, if the file doesn't exist it'll raise an
exception. By default that's a StopIteration at the end of the bucket
iterator, which isn't ideal, as it would be suppressed if the function
is called from within a generator further up the stack.
There are a couple of places where we expect the file may not exist, so
we define a custom exception that we can catch specifically there. I did
consider subclassing boto's ClientError, but this wasn't straightforward,
as the constructor expects to know the operation that failed, which for
me is a signal that it's not an appropriate (re-)use of the class.
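A minimal sketch of the approach; the exception and function names below are illustrative, not necessarily the ones in the codebase:

```python
import boto3


class LetterNotFoundError(Exception):
    """Raised when no letter PDF exists in S3 under the expected prefix."""


def find_letter_pdf(bucket_name, prefix):
    bucket = boto3.resource("s3").Bucket(bucket_name)
    # Avoid letting a bare next() raise StopIteration, which a calling
    # generator could silently swallow; raise our own exception instead.
    summary = next(iter(bucket.objects.filter(Prefix=prefix)), None)
    if summary is None:
        raise LetterNotFoundError(f"no letter found with prefix {prefix}")
    return summary
```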
Previously we generated the filename we expected a letter PDF to be
stored at in S3, and used that to retrieve it. However, the generated
filename can change over the course of a notification's lifetime e.g.
if the service changes from crown ('.C.') to non-crown ('.N.').
The prefix of the filename is stable: it's based on properties of the
notification - its reference and creation time - that don't change. This
commit changes the way we interact with letter PDFs in S3:
- Uploading uses the original method to generate the full file name.
The method is renamed with a 'generate_' prefix to distinguish it from
the new one.
- Downloading uses a new 'find_' method to get the filename using just
its prefix, which makes it agnostic to changes in the filename suffix.
Making this change helps to decouple our code from the requirements DVLA
have on the filenames. While it means more traffic to S3, we rely on S3
in any case to download the files. From experience, we know S3 is highly
reliable and performant, so we don't anticipate any issues.
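Roughly, the split looks like this; the filename format and helper details below are assumptions made for the sake of the sketch:

```python
import boto3

bucket = boto3.resource("s3").Bucket("letters-pdf")  # illustrative bucket name


def _stable_prefix(notification):
    # Based only on the reference and creation time, which never change.
    return f"NOTIFY.{notification.reference}.{notification.created_at:%Y%m%d%H%M%S}"


def generate_letter_pdf_filename(notification):
    # Upload path: build the full name, including the parts that can change
    # over the notification's lifetime (postage, crown status).
    crown = "C" if notification.service.crown else "N"
    return f"{_stable_prefix(notification)}.{notification.postage}.{crown}.pdf"


def find_letter_pdf_in_s3(notification):
    # Download path: list by the stable prefix only, so a suffix that has
    # since changed (e.g. '.C.' flipping to '.N.') doesn't stop us finding it.
    return next(iter(bucket.objects.filter(Prefix=_stable_prefix(notification))))
```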
In the tests we favour using moto to mock S3, so that the behaviour is
realistic. There are a couple of places where we just mock the method,
since what it returns isn't important for the test.
Note that, since the new method requires a notification object, we need
to change a query in one place, the columns of which were only selected
to appease the original method to generate a filename.
All other tasks in app/celery/*_tasks.py have timers on them. Some
of these timers will be useful to check before/after performance as
a way to reassure ourselves about the impact of [1].
[1]: https://github.com/alphagov/notifications-api/pull/3172
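The timers themselves are just a decorator applied to each task; a generic sketch of the idea (not the repo's own decorator, which reports to a metrics backend rather than printing):

```python
import functools
import time


def timed(namespace):
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            start = time.monotonic()
            try:
                return func(*args, **kwargs)
            finally:
                elapsed = time.monotonic() - start
                print(f"{namespace}.{func.__name__}: {elapsed:.3f}s")
        return wrapper
    return decorator


@timed(namespace="tasks")
def collate_letter_pdfs_for_day(date):
    ...
```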
Previously we were returning entire ORM objects. Returning columns
has a few benefits:
* Means we can join on to services there and then, avoiding second
queries to get the crown status of the service later in the collate
flow.
* Massively reduces the amount of data we return - particularly free
text fields like personalisation, which could potentially be quite big.
We now return 5 columns rather than 26.
* Minor thing, but it will skip some CPU cycles, as SQLAlchemy will no
longer construct an ORM object and try to keep track of changes. We
know this function doesn't change any of the values or persist them
back, so this is an unnecessary step.
Disadvantages are:
* The dao_get_letters_to_be_printed return interface is now much more
tightly coupled to the get_key_and_size_of_letters_to_be_sent_to_print
function that calls it.
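A sketch of the shape of the query; the import paths and the exact five columns are assumptions:

```python
# Assumed import paths for the app's session and models.
from app import db
from app.models import Notification, Service


def dao_get_letters_to_be_printed(print_run_deadline, postage):
    # Select only the columns the collate flow needs, joining to Service so we
    # don't need a second query for crown status later. This returns
    # lightweight row tuples rather than change-tracked ORM objects.
    return (
        db.session.query(
            Notification.id,
            Notification.reference,
            Notification.created_at,
            Notification.postage,
            Service.crown,
        )
        .join(Service, Service.id == Notification.service_id)
        .filter(
            Notification.created_at < print_run_deadline,
            Notification.postage == postage,
        )
    )
```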
We had issues where we had 150k 2nd class notifications and the collate
task never ran properly, presumably because the volume of data being
returned was too big.
To try to help with this, we can switch to streaming rather than using
`.all()` and building up lists of data. This should help, though the
initial query may still be a problem.
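A sketch of the streaming version, using SQLAlchemy's `yield_per` (the batch size is arbitrary):

```python
def letters_to_be_printed(print_run_deadline, postage):
    # Iterate the query in batches rather than calling .all() and building a
    # 150k-element list in memory; callers consume the generator lazily.
    query = dao_get_letters_to_be_printed(print_run_deadline, postage)
    for letter in query.yield_per(10_000):
        yield letter
```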
This is at request of DVLA. They would prefer to have zip files with the
same number of arguments in the name. After being offered a few
different options, such as including an org and service id for all zips,
they chose to just remove the 'INSOLVENCY' tag.
For more context, see the PR that added the tag:
https://github.com/alphagov/notifications-api/pull/3006
When letters are sent to DVLA, we will now put them in a separate
ZIP file for each service, so that if there are printing issues
due to bad files from one service, other services will hopefully
not be affected by that.
We don't name letters based on the day we send them, but on the day we
create them. If we process a letter a second time for whatever reason,
even if it's a couple of days later, it'll still go in a folder based on
the created_at timestamp. There's still a slight wrinkle, however: if
the timestamp is after 5:30pm, the folder will be for the day after. But
that's still derived from the creation time, so I think created_at still
makes the most sense.
Remove the term `sending_date` to try and make this relationship more
apparent.
`_now`? Why would we ever use a different 'now'? Instead say created_at,
because that's what it'll always be set to, even if we're replaying old
letters. We always base the folder name on when the letter was created,
or we might not know where to look to find it.
`dont_use_sending_date` doesn't really tell us what happens if we don't
use it - the answer is that we return an empty string and ignore the
folder entirely. So let's call it that.
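A sketch of the renamed helper with the 5:30pm cutoff described above; the parameter names and timezone handling are simplified here:

```python
from datetime import datetime, time, timedelta


def get_folder_name(created_at, dont_use_folders=False):
    if dont_use_folders:
        # Some callers don't want a dated folder at all, so return an empty string.
        return ""
    # Letters created after the 5:30pm print deadline belong to the next day's run.
    if created_at.time() >= time(17, 30):
        created_at += timedelta(days=1)
    return f"{created_at.date().isoformat()}/"


# A letter created at 6pm on 18 Feb goes in the folder for 19 Feb.
print(get_folder_name(datetime(2020, 2, 18, 18, 0)))  # "2020-02-19/"
```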
Also, remove the use of freezegun in the tests, to prove that we don't
use the current time in any calculations. Also add an assert to a mock
in the get_pdf_for_templated_letter test, because we were mocking but
not asserting before, so the tests didn't fail when the function
signature changed.
We were determining the filename for precompiled letters before we had
checked whether the letters were international. This meant that a letter
could have a filename indicating it was 2nd class, but once we had
sanitised the letter and checked the address we would set the
notification to international.
This stopped these letters from being picked up to be sent to the DVLA,
since the filename and postage of the letter did not match.
We now regenerate the filename after the letter has been sanitised (and when
we know the postage) and use the updated filename when moving the letter
into the live PDF letters bucket.
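The move with the corrected name is just an S3 copy-and-delete; a minimal sketch (bucket names and key layout are illustrative):

```python
import boto3

s3 = boto3.resource("s3")


def move_sanitised_letter(scan_bucket, live_bucket, old_filename, new_filename):
    # Copy under the regenerated filename (which now reflects the real postage),
    # then remove the original object from the scan bucket.
    s3.Object(live_bucket, new_filename).copy_from(
        CopySource={"Bucket": scan_bucket, "Key": old_filename}
    )
    s3.Object(scan_bucket, old_filename).delete()
```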
This will help us better understand how far through the task got if it
gets interrupted halfway (as was the case this morning, when we
struggled to understand what had happened).
We will split them into three categories:
- first class
- second class
- international - this zip file will have letters for both europe
and rest-of-world postage classes
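A sketch of the grouping, assuming the postage values are stored as 'first', 'second', 'europe' and 'rest-of-world':

```python
def zip_group_for_postage(postage):
    # First and second class each get their own zip; europe and rest-of-world
    # letters share a single "international" zip.
    if postage in ("europe", "rest-of-world"):
        return "international"
    return postage  # "first" or "second"


letters = [
    {"reference": "ref-1", "postage": "first"},
    {"reference": "ref-2", "postage": "europe"},
    {"reference": "ref-3", "postage": "rest-of-world"},
]

groups = {}
for letter in letters:
    groups.setdefault(zip_group_for_postage(letter["postage"]), []).append(letter)

print(groups)  # {"first": [...], "international": [ref-2 and ref-3]}
```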
If a service has permission to send international letters then it should
tell template preview, so that template preview knows what rule to
apply when it’s validating the address of the letter.
Depends on:
- [ ] https://github.com/alphagov/notifications-template-preview/pull/445
This has been replaced by a new task, `sanitise-letter`, so this deletes
all the code in the old task and ensures that, when antivirus is not
enabled locally, we call the new task.
This task didn't have retries before, based on the assumption that if
the task failed it was likely to be due to a Boto error, so retrying
would not help because a file was probably not in the expected bucket.
During an incident with the database, we had some letters that were
stuck in the `pending-virus-check` state because this task failed.
This change adds retries to the task for any exception that is not a
`BotoClientError`.
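A sketch of the retry behaviour using the standard Celery pattern; the task body, names and retry settings here are illustrative:

```python
from botocore.exceptions import ClientError as BotoClientError
from celery import Celery

celery_app = Celery("tasks")  # stand-in for the app's own Celery instance


def handle_letter(filename):
    ...  # stand-in for the real processing


@celery_app.task(bind=True, max_retries=15, default_retry_delay=300)
def process_virus_scan_passed(self, filename):
    try:
        handle_letter(filename)
    except BotoClientError:
        # A Boto error almost certainly means the file isn't in the expected
        # bucket, so retrying won't help; let the task fail.
        raise
    except Exception as e:
        # Anything else (e.g. a database blip) may be transient, so retry.
        self.retry(exc=e)
```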
Previously, when running the `collate_letter_pdfs_for_day` task, we
would only send letters that were created between 5:30pm yesterday and
5:30pm today.
Now we send letters that were created before 5:30pm today and that are
still waiting to be sent. This will help us automatically attempt to
send letters that may have fallen through the gaps and not been sent the
previous day when they should have been.
Previously we solved the problem of letters that had fallen through the
gap by running the task with a date parameter, for example
`collate_letter_pdfs_for_day('2020-02-18')`. We no longer need this date
parameter, as we will always look back across previous days too for
letters that still need sending.
Note that we have to change from the paginated `list_objects_v2` call to
fetching each individual notification from S3. We reduce the load by
using `HEAD` rather than `GET`, but this will still greatly increase the
number of API calls. We acknowledge there will be a small cost to this -
say 50p for 5000 letters - and think it is tolerable. Boto3 also handles
retries itself, so if there is a networking blip when making one of the
many HEAD requests, it should be retried automatically for us.
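Getting the size of each letter with a HEAD request looks roughly like this:

```python
import boto3

s3 = boto3.client("s3")


def get_letter_size(bucket_name, filename):
    # head_object sends an HTTP HEAD, so we get the metadata (including the
    # content length) without downloading the PDF itself. boto3 applies its
    # built-in retries to transient failures on these calls.
    response = s3.head_object(Bucket=bucket_name, Key=filename)
    return response["ContentLength"]
```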
If the letter passed sanitisation, the recipient address will be
returned from template preview, so we want to save this as the `to`
field of the notification.
Template preview will now send an encrypted dict containing all the args
to the `process_sanitised_letter` task, so this updates the task to
handle data in the new format.
When a precompiled letter is sent via the admin app, we now pass in the
address, which can be set in the Notifications.to field.
Once a precompiled letter sent by the API has passed validation, we can
set the address in the Notifications.to field.
The celery tasks to validate precompiled letters sent by the API will be done in another PR.
Added a task, `sanitise-letter`, that will be called from antivirus when
a letter has passed its virus scan. This calls a new task in
template-preview which will sanitise the PDF.
A second new task, `process-sanitised-letter`, will be called from the
template preview task and deals with updating the notification and
moving it to the relevant bucket.