The constraints on notifications and notification_history have already
been dropped in production, but they still exist in staging and in dev
environments.
The constraints on templates and templates_history exist in all
environments.
We did not have a JSON schema for updating a template. Since we will
remove the postage constraint from the templates table, this adds a JSON
schema for updating a template so that we can check that the postage is
one of the allowed values.
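As a rough sketch of the kind of schema this adds (assuming the allowed
postage values are "first" and "second"; the exact schema is in the diff):
```python
# Illustrative only: a minimal JSON schema for updating a template,
# assuming the allowed postage values are "first" and "second".
from jsonschema import validate

post_update_template_schema = {
    "$schema": "http://json-schema.org/draft-07/schema#",
    "description": "PUT update template schema",
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "postage": {"enum": ["first", "second"]},
    },
}

# passes; {"postage": "third"} would raise a ValidationError
validate({"name": "A letter template", "postage": "first"}, post_update_template_schema)
```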
See https://github.com/alphagov/notifications-utils/pull/752 for
implementation details. Cache DNS lookups for 15 seconds. Note: this
cache is per eventlet, so concurrent requests will resolve DNS
separately, but the cache does persist between sequential requests.
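As a rough sketch of the idea (not the actual notifications-utils code),
a 15-second TTL cache around `socket.getaddrinfo` could look like this:
```python
# Rough sketch only, not the notifications-utils implementation:
# memoise socket.getaddrinfo results for 15 seconds.
import socket
import time

DNS_CACHE_TTL_SECONDS = 15
_original_getaddrinfo = socket.getaddrinfo
_dns_cache = {}


def cached_getaddrinfo(*args, **kwargs):
    key = (args, tuple(sorted(kwargs.items())))
    now = time.monotonic()
    cached = _dns_cache.get(key)
    if cached and now - cached[0] < DNS_CACHE_TTL_SECONDS:
        return cached[1]
    result = _original_getaddrinfo(*args, **kwargs)
    _dns_cache[key] = (now, result)
    return result


# the cache is module-level state in the worker, so it persists between
# sequential requests handled by that worker
socket.getaddrinfo = cached_getaddrinfo
```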
re-enable statsd for all environments.
In the previous PR I removed the `update_notification` method to reduce the need for another update query. However, that meant the notification was marked as delivered without an updated_at timestamp.
It is weird to set the updated_at when we create the notification. So is this a better fix? Or do I put the update back now?
I recommend we push this fix now.
We have seen bad latency across all apps in the last week. After running
a canary with statsd turned off we have seen these issues disappear on
that canary instance. We will therefore turn off statsd for all
instances of the API. We will still need to investigate tomorrow what
exactly changed or is causing the issue with statsd, as we hadn't made
any changes to it ourselves.
Note: this keeps statsd on for all the other apps, but this will be a
first step. We will also need to check the performance of the other
apps after releasing this.
By default Marshmallow includes unknown properties. This means every time
a new property is added to the service model it gets included in the
JSON-serialised response sent to the admin app.
This is particularly bad because it means the response includes the ID
of every returned letter, so the JSON stored in Redis for the
Check Your State Pension service is 86kb.
Similarly the JSON stored in Redis for a big user of inbound text
messaging is 458kb(!!!) because it has the ID of every received text
message. That’s ~8,500 UUIDs.
Luckily the admin app tells us exactly which keys it’s using here:
5952d9c26d/app/models/service.py (L31-L52)
- `active`
- `contact_link`
- `email_branding`
- `email_from`
- `id`
- `inbound_api`
- `letter_branding`
- `letter_contact_block`
- `message_limit`
- `name`
- `prefix_sms`
- `research_mode`
- `service_callback_api`
- `volume_email`
- `volume_sms`
- `volume_letter`
- `consent_to_research`
- `count_as_live`
- `go_live_user`
- `go_live_at`
Plus these, which it does not get automatically:
- `email_branding`
- `letter_branding`
- `organisation`
- `organisation_type`
- `permissions`
- `restricted`
The API is returning all of these:
- `active`
- `all_template_folders`
- `annual_billing`
- `consent_to_research`
- `contact_link`
- `contact_list`
- `count_as_live`
- `created_by`
- `crown`
- `email_branding`
- `email_from`
- `go_live_at`
- `go_live_user`
- `id`
- `inbound_api`
- `inbound_number`
- `inbound_sms`
- `letter_branding`
- `letter_contact_block`
- `letter_logo_filename`
- `message_limit`
- `name`
- `organisation`
- `organisation_type`
- `permissions`
- `prefix_sms`
- `rate_limit`
- `research_mode`
- `restricted`
- `returned_letters`
- `service_callback_api`
- `users`
- `version`
- `volume_email`
- `volume_letter`
- `volume_sms`
- `whitelist`
So the ones that the admin is getting but not expecting are:
- `all_template_folders`
- `annual_billing`
- `contact_list`
- `created_by`
- `crown`
- `inbound_number`
- `inbound_sms`
- `letter_logo_filename`
- `rate_limit`
- `returned_letters`
- `users`
- `version`
- `whitelist`
Which is what this PR adds to the exclude list, except for `created_by`,
which it keeps because it’s needed to validate the JSON provided when
creating a service.
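As a minimal, self-contained sketch of the technique (not the repo's
actual service schema), Marshmallow's `Meta.exclude` keeps unwanted keys
out of the serialised output:
```python
# Minimal sketch of the technique; the fields here are only a subset of
# the real service schema.
from marshmallow import Schema, fields


class ServiceSchema(Schema):
    # keys the admin app does use
    id = fields.UUID()
    name = fields.String()
    active = fields.Boolean()
    # keys the admin app never reads
    users = fields.List(fields.UUID())
    version = fields.Integer()

    class Meta:
        # excluded fields are dropped from both load and dump
        exclude = ("users", "version")


print(ServiceSchema().dump({
    "id": "7f2d4e0a-1111-2222-3333-444455556666",
    "name": "Example service",
    "active": True,
}))
# -> only id, name and active; no 'users' or 'version'
```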
After the commit we issue two calls to the db, to get the service and to get the notification. This is because the ORM expires its objects on commit to ensure the data model objects are the latest, so touching them again issues fresh queries.
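This is SQLAlchemy's default `expire_on_commit=True` behaviour; a small standalone illustration (not the repo's code):
```python
# Standalone illustration of expire_on_commit: after commit() the object
# is expired, so the attribute access below issues a fresh SELECT.
from sqlalchemy import Column, Integer, String, create_engine
from sqlalchemy.orm import Session, declarative_base

Base = declarative_base()


class Notification(Base):
    __tablename__ = "notification"
    id = Column(Integer, primary_key=True)
    status = Column(String)


engine = create_engine("sqlite://", echo=True)
Base.metadata.create_all(engine)

with Session(engine) as session:  # expire_on_commit=True by default
    notification = Notification(id=1, status="delivered")
    session.add(notification)
    session.commit()
    # the commit expired `notification`, so this attribute access
    # triggers a fresh SELECT to reload the row (visible in the echo output)
    print(notification.status)
```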
So far this is just a proof of concept: the letter flow still needs to be updated, we should be able to get rid of research mode, and it needs some tidying up.
Two new metrics:
- `auth_db_connection_duration_seconds` (histogram): wraps the first DB
  call of post notifications. This includes waiting to get a connection
  from the pool, and also making the actual request to the db to retrieve
  the service and API keys. (I'm not sure there's an easy way to separate
  these two things.)
- `post_notification_json_parse_duration_seconds`: wraps the v2 post
  notifications JSON parsing and schema validation. Shouldn't include any
  async code.
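For context, a hedged sketch of how the first metric might be recorded
with prometheus_client (the lookup function here is a placeholder, not
the repo's real code):
```python
# Sketch only: time a block of code with a prometheus_client Histogram.
from prometheus_client import Histogram

AUTH_DB_CONNECTION_DURATION_SECONDS = Histogram(
    "auth_db_connection_duration_seconds",
    "Duration of the first DB call when posting a notification",
)


def _fetch_service_and_api_keys(service_id):
    # placeholder for the real query (pool checkout + SELECT)
    return {"service_id": service_id, "api_keys": []}


def authenticate(service_id):
    # Histogram.time() observes the wall-clock duration of the with-block
    with AUTH_DB_CONNECTION_DURATION_SECONDS.time():
        return _fetch_service_and_api_keys(service_id)
```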
We have one global metrics variable, `metrics = GDSMetrics()`, and we
then call `metrics.init_app` from within the Flask application set-up.
The v2/test_errors.py app_for_test fixture calls create_app, which would
call metrics.init_app multiple times for the same metrics instance. This
causes errors, so change the fixture to session scope so create_app is
only called once per test run.
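Roughly (the factory import is assumed, not copied from the repo):
```python
# Sketch of the fixture change: session scope means create_app, and
# therefore metrics.init_app, runs once per test run instead of per test.
import pytest

from app import create_app  # assumed application factory


@pytest.fixture(scope="session")
def app_for_test():
    app = create_app()
    yield app
```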
Grab the worker app name and task name rather than the web host and
endpoint. Also add a fallback for when we're not in a web request or a
celery task. I think that'll probably happen when we use alembic, or if
we do things from within flask shell.
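A sketch of that fallback logic (illustrative only; the real code uses
the worker app name rather than the fixed "celery" label here):
```python
# Illustrative fallback for metric labels: celery task name in a worker,
# host/endpoint in a web request, fixed values otherwise (alembic, flask shell).
from celery import current_task
from flask import has_request_context, request


def metric_labels():
    task_name = getattr(current_task, "name", None)
    if task_name:
        return "celery", task_name
    if has_request_context():
        return request.host, request.endpoint or "unknown"
    return "none", "none"
```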
Add the following Prometheus metrics to keep track of general app
instance health.
# sqs_apply_async_duration
How long the actual SQS call (a standard web request to AWS) takes.
A histogram with default bucket sizes, split up per task that was
created.
# concurrent_web_request_count
How many web requests this app is currently serving. This is split up
per process, so we'd expect multiple responses per app instance.
# db_connection_total_connected
How many connections this app (process) has open to the database. They
might be idle.
# db_connection_total_checked_out
How many connections this app (process) has open that are currently in
use by a web worker.
# db_connection_open_duration_seconds
A histogram per endpoint of how long the db connection was taken from
the pool for. Won't have any data if a connection was never opened.
These are to check whether the db was our congestion point for the soak
test. (The email stub is a bit slower to respond than Amazon SES, and
the difference could lead to us having too many connections to the db
open while we wait for responses from the stub.)
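As an illustration of how one of the connection gauges could be fed (not
the repo's implementation), SQLAlchemy pool events can drive a
prometheus_client Gauge:
```python
# Illustrative sketch: track connections checked out of the pool with
# SQLAlchemy pool events and a prometheus_client Gauge.
from prometheus_client import Gauge
from sqlalchemy import create_engine, event

DB_CONNECTION_TOTAL_CHECKED_OUT = Gauge(
    "db_connection_total_checked_out",
    "Connections currently checked out of the pool by this process",
)

engine = create_engine("sqlite://")  # stand-in for the real database URL


@event.listens_for(engine, "checkout")
def on_checkout(dbapi_connection, connection_record, connection_proxy):
    DB_CONNECTION_TOTAL_CHECKED_OUT.inc()


@event.listens_for(engine, "checkin")
def on_checkin(dbapi_connection, connection_record):
    DB_CONNECTION_TOTAL_CHECKED_OUT.dec()
```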
If we set an environment variable, we can stub out calls to SES and send
them to our own stub app. If the environment variable is not set, things
work as normal.
To be used alongside
https://github.com/alphagov/notifications-email-provider-stub
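Roughly, the switch could look like this (the `SES_STUB_URL` variable
name here is hypothetical):
```python
# Sketch only: route SES calls to the stub app when a (hypothetical)
# SES_STUB_URL environment variable is set, otherwise talk to real SES.
import os

import boto3


def get_ses_client():
    stub_url = os.environ.get("SES_STUB_URL")  # e.g. "http://localhost:6011"
    if stub_url:
        # point the SES client at notifications-email-provider-stub
        return boto3.client("ses", region_name="eu-west-1", endpoint_url=stub_url)
    return boto3.client("ses", region_name="eu-west-1")
```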
It seems like Firetext is sometimes sending us permanent-failure
without a detailed status. It could be due to:
- them really not sending any detailed status
- them sending a status code we don't recognise
- them sending the 000 code, which means 'no errors' and which we ignore
To see which one it is (us not recognising a code, or the provider not
having sent the detailed status), and to debug such issues quicker in
the future, this PR adds the status and detailed status codes to the
logs.
Also log the detailed delivery status for Firetext in the same place, in
addition to it being logged from notifications_dao.
Logging detailed delivery statuses will help us see why messages fail to
deliver. In the future we could persist the detailed delivery status in
the database.
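For illustration, the extra logging might look something like this
(field names and message format are indicative only):
```python
# Indicative example of logging the Firetext status and detailed status codes.
import logging

logger = logging.getLogger(__name__)


def log_firetext_callback(notification_id, status, code, detailed_status):
    logger.info(
        "Firetext callback for notification %s: status=%s code=%s detailed_status=%s",
        notification_id,
        status,
        code,
        detailed_status,
    )
```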