This can be calculated from the "free_allowance_used" field and the
"chargeable_units" field, but having it included separately is more
convenient as it can be used directly in Admin [^1].
[^1]: 417e7370bb/app/templates/views/usage.html (L38-L39)
This represents the number of chargeable_units that were actually
free due to the free allowance - these units won't be included in
"cost".
Although the existing calculations in Admin [^1][^2] would still be
correct if SMS rates changed - it's cost that's the problem - it
makes sense to keep all the knowledge about calculating usage
together in these two APIs.
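To illustrate the arithmetic with made-up numbers:
```
# Illustration only: a service with 100 free units remaining sends
# 150 chargeable units in total.
free_remaining = 100
chargeable_units = 150

free_allowance_used = min(free_remaining, chargeable_units)  # 100
charged_units = chargeable_units - free_allowance_used       # 50
# Only the 50 charged units contribute to "cost".
```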
Note that the Integer casting is covered by the API-level tests in
test_rest.
[^1]: 474d7dfda8/app/main/views/dashboard.py (L490)
[^2]: c63660d56d/app/main/views/dashboard.py (L350)
This will replace the manual calculations in Admin [^1][^2] for SMS
and also in API [^3] for annual letter costs.
Doing the calculation here also means we correctly attribute free
allowance to the earliest rows in the billing table - Admin doesn't
know when a given rate was applied so can't do this without making
assumptions about when we change our rates.
Since the calculation now depends on annual billing, we need to
change all the tests to make sure a suitable row exists. I've also
adjusted the test data to match the assumption that there can only
be one SMS rate per bst_date.
Note about "OVER" clause
========================
Using "rows=" ("ROWS BETWEEN") makes more sense than "range=" as
we want the remainder to be incremental within each group in a
"GROUP BY" clause, as well as between groups i.e
# ROWS BETWEEN (arbitrary numbers to illustrate)
date=2021-04-03, units=3, cost=3.29
date=2021-04-03, units=2, cost=4.17
date=2021-04-04, units=2, cost=5.10
vs.
# RANGE BETWEEN
date=2021-04-03, units=3, cost=4.17
date=2021-04-03, units=2, cost=4.17
date=2021-04-04, units=2, cost=5.10
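A minimal sketch of the window using SQLAlchemy's over() parameters
(column names are illustrative):
```
from sqlalchemy import column, func

# rows=(None, 0) renders as
# "ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW", so the running
# total advances row by row even when several rows share a bst_date.
# range_=(None, 0) would instead treat rows with the same bst_date as
# peers and give them all the same total, as in the second example.
running_cost = func.sum(column("cost")).over(
    order_by=column("bst_date"),
    rows=(None, 0),
)
```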
See [^4] for more details and examples.
[^1]: https://github.com/alphagov/notifications-admin/blob/master/app/templates/views/usage.html#L60
[^2]: 072c3b2079/app/billing/billing_schemas.py (L37)
[^3]: 474d7dfda8/app/templates/views/usage.html (L98)
[^4]: https://learnsql.com/blog/difference-between-rows-range-window-functions/
There is no such thing as a "billing unit". The data this field
contained was also a confusing mixture of two types:
- For emails and letters, it was just "notifications_sent".
- For SMS, it was the "chargeable_units" (billable * multiplier).
This replaces the single, ambiguous "billing_units" field with
"chargeable_units" and "notifications_sent" in both usage APIs.
Once Admin is using them we can remove the old field.
This makes it easier to extend each function with costs and free
allowances - especially for SMS.
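For illustration, the two fields map onto the billing table roughly
like this (a sketch using illustrative column expressions, based on
the generated SQL quoted later in this log):
```
from sqlalchemy import column, func

# A plain count of notifications, the same for every channel.
notifications_sent = func.sum(column("notifications_sent"))

# For SMS this is billable_units * rate_multiplier; for emails and
# letters it is effectively the notification count.
chargeable_units = func.sum(
    column("billable_units") * column("rate_multiplier")
)
```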
I've chosen to duplicate the "WHERE" clause in each subquery rather
than hoisting it into the top-level query. This will make more sense
in later commits
where we start adding free allowance calculations, which need to
be done on a yearly basis - knowledge the subqueries should have.
We want to query for service usage in the BST financial year:
2022-04-01T00:00:00+01:00 to 2023-03-31T23:59:59+01:00 =>
2022-04-01 to 2023-03-31 # bst_date
Previously we were only doing this explicitly for the monthly API
and it seemed like the yearly usage API was incorrectly querying:
2022-03-31T23:00:00+00:00 to 2023-03-30T23:00:00+00:00 =>
2022-03-31 to 2023-03-30 # "bst_date"
However, it turns out this isn't a problem for two reasons:
1. We've been lucky that none of our rates have changed since 2017,
which is long enough ago that no one would care.
2. There's a quirk somewhere in Sqlalchemy / Postgres that has been
compensating for the lack of explicit BST conversion.
To help ensure we do this consistently in future I've DRYed-up the
BST conversion into a new utility. I could have just hard-coded the
dates but it seemed strange to have the knowledge twice.
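A minimal sketch of what such a utility could look like (the real
name and signature may differ):
```
from datetime import date

def get_financial_year_dates(year):
    # Hypothetical helper: the inclusive bst_date bounds of a UK
    # financial year, e.g. 2022 -> (2022-04-01, 2023-03-31).
    return date(year, 4, 1), date(year + 1, 3, 31)
```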
I've also adjusted the tests so they detect if we accidentally use
data from a different financial year. (2) is why none of the test
assertions actually need changing and users won't be affected.
Sqlalchemy / Postgres quirk
===========================
The following queries were run on the same data, but the results
differ:
FactBilling.query.filter(
    FactBilling.bst_date >= datetime(2021, 3, 31, 23, 0),
    FactBilling.bst_date <= '2021-04-05',
).order_by(FactBilling.bst_date).first().bst_date
# => datetime.date(2021, 4, 1)
FactBilling.query.filter(
    FactBilling.bst_date >= '2021-03-31 23:00:00',
    FactBilling.bst_date <= '2021-04-05',
).order_by(FactBilling.bst_date).first().bst_date
# => datetime.date(2021, 3, 31)
Looking at the actual query generated for the first item above still
suggests the results should be the same, were it not for the use of
"::timestamp".
SELECT ...
FROM ft_billing
WHERE ft_billing.service_id = '16b60315-9dab-45d3-a609-e871fbbf5345'::uuid
  AND ft_billing.bst_date >= '2016-03-31T23:00:00'::timestamp
  AND ft_billing.bst_date <= '2017-03-31T22:59:59.999999'::timestamp
  AND ft_billing.notification_type IN ('email', 'letter')
GROUP BY ft_billing.rate, ft_billing.notification_type
UNION ALL
SELECT sum(ft_billing.notifications_sent) AS notifications_sent,
  sum(ft_billing.billable_units * ft_billing.rate_multiplier) AS billable_units,
  ft_billing.rate AS ft_billing_rate,
  ft_billing.notification_type AS ft_billing_notification_type
FROM ft_billing
WHERE ft_billing.service_id = '16b60315-9dab-45d3-a609-e871fbbf5345'::uuid
  AND ft_billing.bst_date >= '2016-03-31T23:00:00'::timestamp
  AND ft_billing.bst_date <= '2017-03-31T22:59:59.999999'::timestamp
  AND ft_billing.notification_type = 'sms'
GROUP BY ft_billing.rate, ft_billing.notification_type) AS anon_1
ORDER BY anon_1.notification_type, anon_1.rate
If we try some manual queries with and without '::timestamp', we get
the same result either way:
select distinct(bst_date) from ft_billing where bst_date >= '2022-04-20T23:00:00' order by bst_date desc;
bst_date
------------
2022-04-21
2022-04-20
select distinct(bst_date) from ft_billing where bst_date >= '2022-04-20T23:00:00'::timestamp order by bst_date desc;
bst_date
------------
2022-04-21
2022-04-20
It looks like this is happening because all client connections are
aware of the local timezone, and naive datetimes are interpreted as
being in UTC - not necessarily true, but saves us here!
The monthly API datetimes were pre-converted to dates, so none of
this was relevant for deciding exactly which date to use.
Daily volumes report: total volumes across the platform, aggregated
by whole business day (bst_date).
Volumes by service report: total volumes per service, aggregated
over the date range given.
NB: start and end dates are inclusive.
If a service has not sent any SMS in the financial year, the free
allowance was showing up as 0 rather than the number in annual
billing. The query has been updated to use an outer join so that the
free allowance is returned even when there are no ft_billing rows.
There is a potential performance enhancement: only return data for
the organisation's services in the
`fetch_sms_free_allowance_remainder_until_date` subquery. I will
investigate this in a subsequent PR.
The way it was done before, the remainder was incorrect in the
billing report and in the org usage query - it was the sms remainder
left at the start of the report period, not at the end of it.
This became apparent when we tried to show sms_remainder on the org
usage report, where the start date is always the start of the
financial year. We saw that sms sent by services did not reduce
their free allowance remainder according to the report. As a result,
we had to temporarily remove the sms_remainder column from the
report until we could fix the bug - it has been fixed now, yay!
I think the bug snuck in partly because our fixtures for testing
this part of the code are quite complex, so it was harder to see
that the numbers didn't add up. I have added comments to the tests
to try to make it a bit clearer why the results are as they are.
I also added comments to the code, and renamed some variables,
to make it easier to understand, as there are quite a few
moving parts in it - subqueries and the like.
I also renamed the fetch_sms_free_allowance_remainder method to
fetch_sms_free_allowance_remainder_until_date so it is clearer
what it does.
This adds total_letters to the data that is returned by the
`/platform-stats/data-for-billing-report` endpoint so that we can add
total letters as a column in the CSV file that can be downloaded.
- sqlalchemy.sql.expression.case must include an else_ clause (see
the sketch after this list).
- clearly define the list of columns for the inbound_sms_history
insert; getting the list from InboundSmsHistory.__table__.c was
causing data type errors.
- remove relationships when not needed; the foreign key relationship
is already established when the column is created. This gets rid of
the warnings referenced here: http://sqlalche.me/e/14/qzyx.
- update queries now that the user relationship in the ServiceUser
db model has been removed.
- move the check that a template is archived to the view instead of
the dao method; the check was clearing the session before the
version history could be written.
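A sketch of the first fix, with illustrative names (SQLAlchemy 1.4
positional-WHEN style):
```
from sqlalchemy import case, column

# Every case() now carries an explicit else_, so unmatched rows get
# a well-defined value instead of tripping up later type handling.
notification_label = case(
    (column("notification_type") == "sms", "text message"),
    else_=column("notification_type"),
)
```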
Deleting notifications in the night tasks still needs to be
investigated - the raw SQL is causing an error.
for the service.
The letter rates for crown and non-crown are the same. It would be
nice to remove the need for crown, but for now this is a quick fix.
`international` for letters in `ft_billing` was always False. Now that
letters can be international, this changes the column value to the value
of `international` for the notification.
Usage for all services is a platform admin report that groups letters by
postage. We want it to show `europe` and `rest-of-world` letters under a
single category of `international`, so this updates the query to do
that and to order appropriately.
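A sketch of that grouping, using illustrative table/column handles
(the real query has more columns and filters):
```
from sqlalchemy import case, column, func, select, table

ft_billing = table(
    "ft_billing", column("postage"), column("notifications_sent")
)

# Collapse europe / rest-of-world into a single "international"
# label, and use the same expression for grouping and ordering.
postage_group = case(
    (ft_billing.c.postage.in_(["europe", "rest-of-world"]),
     "international"),
    else_=ft_billing.c.postage,
)
stmt = (
    select(postage_group.label("postage"),
           func.sum(ft_billing.c.notifications_sent).label("letters"))
    .group_by(postage_group)
    .order_by(postage_group)
)
```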
This is done so that we do not use statsd on our http endpoint.
We decided we do not need the metrics this gave us. If we change
our minds, we will add Prometheus-friendly decorators instead.
This endpoint may need to change, but we'd like to see how it
performs, so we'll test it with a real data set. Then we'll come
back to make sure the format is correct and check for missing tests
for the endpoint.
The queries all return lots of columns, but each query has columns
it doesn't care about, e.g. emails don't have billable units or an
international flag, letters don't have an international flag, and
sms don't have a page count. Additionally, the query was grouping on
things that never change, like service id and notification type.
By making all of these literals (as in `select 1 as foo`) we see
times that are over 50% quicker for the gov.uk email service.
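A sketch of the idea, with illustrative names:
```
from sqlalchemy import column, func, literal, null, select, table

notifications = table("notifications", column("id"))

# Anything constant for the whole query becomes a literal (as in
# "select 1 as foo") rather than a grouped column.
email_stats = select(
    literal("email").label("notification_type"),
    null().label("international"),  # emails have no international flag
    func.count(notifications.c.id).label("notifications_sent"),
).select_from(notifications)
```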
Note: one of the tests changed because it previously involved emails
and sms with statuses they could never have (e.g. returned-letter).
Previously we checked the notifications table and, if the results
were zero, checked the notification history table to see if there
was data in there. So even when we know the data isn't in
notifications, we're still checking it. These queries take half a
second per service, and we're doing at least ten for each of the
five thousand services we have in Notify. Most of these services
have no data in either table for any given day, so we can reduce
the number of queries by only checking one table.
Check the data retention for a service and, if the date is older
than the retention, get the data from the history table.
NOTE: this requires that the delete tasks haven't run yet for the
day! If your retention is three days, this will look in the
Notification table for data from three days ago, expecting that
shortly after the task finishes, we'll delete that data.
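A minimal sketch of that rule (a hypothetical helper; the real code
may differ):
```
from datetime import date, timedelta

def table_for_day(process_day, retention_days):
    # Data older than the service's retention has been moved to
    # notification_history; anything newer should still be in
    # notifications, assuming today's delete task hasn't run yet.
    if process_day < date.today() - timedelta(days=retention_days):
        return "notification_history"
    return "notifications"
```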
The GROUP BY for the query was wrong: it could produce 2 rows with
different totals but the same unique key, so the second row would
overwrite the first, meaning we had incorrect numbers in the billing
data.
Some of the data had NULL in the sent_by column. The SELECT would
turn the NULL into 'dvla', but the same function was not used in the
GROUP BY, so any time we had missing sent_by data we would end up
with 2 rows, one of which would overwrite the other.
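A sketch of the fix, assuming the NULL -> 'dvla' mapping is a
coalesce (illustrative table handle; the real query has more
columns): use the very same expression in the SELECT and the GROUP
BY.
```
from sqlalchemy import column, func, select, table

ft_billing = table(
    "ft_billing", column("sent_by"), column("notifications_sent")
)

# NULL sent_by collapses into a single 'dvla' row because the
# coalesce is applied in both the SELECT and the GROUP BY.
sent_by = func.coalesce(ft_billing.c.sent_by, "dvla")
stmt = (
    select(sent_by.label("sent_by"),
           func.sum(ft_billing.c.notifications_sent))
    .group_by(sent_by)
)
```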
bst_date is a date field. Comparing dates with datetimes in postgres
gets confusing and dangerous. See this example, where a date evaluates
as older than midnight that same day.
```
notification_api=# select '2019-04-01' >= '2019-04-01 00:00';
?column?
----------
f
(1 row)
```
By using only dates everywhere, we reduce the chance of these bugs
happening.
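For example (illustrative table handle and arbitrary dates):
```
from datetime import date
from sqlalchemy import column, select, table

ft_billing = table("ft_billing", column("bst_date"))

# Compare the date column with dates, never datetimes.
stmt = select(ft_billing.c.bst_date).where(
    ft_billing.c.bst_date >= date(2019, 4, 1),
    ft_billing.c.bst_date <= date(2020, 3, 31),
)
```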
We now join explicitly from service to organisation, the
free_allowance_remainder subquery and the ft_billing table. Being
explicit reduces confusion about which tables we're joining and how
we're constraining those joins.
This also removes references to AnnualBilling, since we've already
got the free sms allowance from the free_allowance_remainder
subquery.
If there are no rows for a service in ft_billing, we should still
return their allowance (with 0 fragments used).
To do this, we need to build the query starting from AnnualBilling
and joining onto FactBilling, rather than the other way round. We
also need to account for the possibility of the sums being null by
coalescing them to 0.
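A sketch of the reshaped query, assuming Flask-SQLAlchemy-style
models (db.session, AnnualBilling, FactBilling) and a simplified
join condition:
```
from sqlalchemy import func

query = (
    db.session.query(
        AnnualBilling.service_id,
        AnnualBilling.free_sms_fragment_limit,
        # Coalesce so services with no ft_billing rows get 0, not
        # NULL.
        func.coalesce(
            func.sum(
                FactBilling.billable_units * FactBilling.rate_multiplier
            ),
            0,
        ).label("chargeable_units"),
    )
    # Start from AnnualBilling and outer-join FactBilling, so every
    # service keeps its allowance row. The real join also constrains
    # the financial year.
    .outerjoin(
        FactBilling, FactBilling.service_id == AnnualBilling.service_id
    )
    .group_by(
        AnnualBilling.service_id, AnnualBilling.free_sms_fragment_limit
    )
)
```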