What's New?

December 10, 2024

Graphext Release Notes 2024-12-03

🌟 New Features

  • Percentage Change Labels for Bar and Line Charts:

    Easily track trends with percentage change labels! You can now display the percentage change between consecutive bins in simple or stacked bar charts, as well as in line charts. This addition provides a clear, visual way to highlight growth, decline, or fluctuations over time, making your data storytelling more impactful and intuitive.

https://x.com/victorianoi/status/1862068925015408892
https://x.com/victorianoi/status/1862068925015408892

  • Map numerical variables to the cells in the Heatmap: Enable mapping of numerical variables to heatmap cell values. Previously limited to count or relative count, users can now display metrics such as total revenue, average age, or any other numerical aggregation.


  • New aggregations in the Summary Table: Use list, list unique & concatenate aggregations for quantitative & date columns in Summary Table. Also Unlock new aggregation possibilities by grouping date variables into lists! This feature is perfect for cases where you need to group by a customer ID and retain all the unique transaction dates, or when you want to capture individual purchase dates—even if multiple purchases occurred on the same day. Combine this with segment or gradient-based coloring for tables to make your visualizations even more insightful and impactful.
  • Color Tables by Segment or Numerical Gradients:

    You can now apply segment-specific colors directly to table rows or cells, making it easier to align visuals with existing segment definitions—such as brands, political parties, or any other categories—boosting communication clarity and impact. Additionally, numerical variables can now be colored with gradient scales, providing a powerful way to highlight patterns and trends at a glance.


November 26, 2024

Graphext Release Notes 2024-11-26

🌟 New Features

  • Save Summary Table as Dataset: Add the ability to save the summary table (group by) from plots as a new dataset, enabling users to load it in the Data tab for further exploration. This streamlines workflows and allows deeper analysis by treating the summarized data as a base dataset, for example grouping a dataset of transactions by user id
  • Conditional Cell Highlighting in Tables: Add the ability to color table cells based on conditions for all variable types. For example, highlight exam marks below 5 in red or emails containing @gmail.com in yellow. Users can choose from three highlight styles: pill, background, or text, with automatic text color adjustment (white or black) to ensure perfect readability against the background.
  • Display Value Counts: Introduce the option to show the value count next to the Y-axis value in plots, indicating how many rows or samples are represented in each group. For example, in a bar chart of job titles with the highest average annual income, users can see the sample size alongside the income to validate whether the data is representative or is just an outlier
  • New interpolation options: Add control in Plot for showing missing values in line charts. You have three options: As solid lines, as gaps or as zeros.
  • Check Connection Status & Query Preview: You can now verify if the connection to your data source is successful and preview your query results before committing to create a project. This ensures a smoother setup process and saves time by letting you address any issues upfront.
  • Segmentation Query Editing: Enable query editing directly from the UI, allowing users to refine and customize the logic behind their segmentation with ease.

🛠️ Improvements

  • Decimal Format Selection: Introduce the option to customize the decimal format for quantitative columns, providing greater control over data presentation and ensuring consistency and clarity in numerical displays.
  • Advanced Search: Enable the use of the advanced query builder UI for searching within variables by clicking on the magnifying glass


🎨 UI Updates

  • Plot Line Markers alway active: Plot Line Markers are now active by default & can be configured in General tab in the Customization panel (instead fo Annotations tab)
  • Numbers format: The current format configuration (for numbers) is now visible in the Account Settings modal

🔧 Technical Updates

  • Cluster probabilities for HDBSCAN: when a second output column is requested in ”cluster_dataset" or ”cluster_embeddings" it will contain the probability that a data point belongs to the assigned cluster:

    https://docs.graphext.com/api-docs/analyse/graph_and_map/cluster_dataset#outputs


  • Time-based validation of ML models (train on past, test on recent data)




November 12, 2024

Graphext Release Notes 2024-11-12

🌟 New Features

  • Custom Queries for Variables: You can now perform custom queries on each variable, enabling faster category searches and more advanced queries that were previously not possible, such as AND conditions in categorical columns or disjoint ranges in numerical columns (e.g., "younger than 18 or older than 65"). For basic searches, a user-friendly UI is available, making it seamless, while for more advanced queries, a query editor and detailed documentation are provided to help you learn how to craft and execute complex queries.


  • Search for Variables in Cmd+K Omnibar: You can now search for variables from any section of Graphext directly through the Cmd+K omnibar. You'll be able to preview the distribution of those variables, making it easier to explore and analyze them without navigating through multiple sections.

🛠️ Improvements

  • Improved “prompt_ai” Step: There's no longer a need to be so explicit when forcing the output, making it more intuitive to use. A new JSON schema mode has been introduced for more structured data outputs. We now ensure that only the explicitly defined categories are returned, streamlining the results and ensuring more relevant outputs.

🔧 Technical Updates

  • Add support for Date columns in replace_values step

November 5, 2024

Graphext Release Notes 2024-11-05

🌟 New Features

  • Hall Redesign Implementation: The Hall has been redesigned for better clarity and navigation. The distinction between Home and Teams is now clearer, offering a more intuitive experience. Additionally, "What's New" is updated more frequently, and recent projects now include more filtering options, such as recently createdrecently updatedrecently refreshedrecently viewed, and Trending Projects coming soon
  • Quick Variable Tagging from Other Sections: You can now tag variables quickly from other sections, not just from the VM, making it easier and more efficient to tag relevant variables across your workflow. This feature is especially useful during EDA (Exploratory Data Analysis) before training a predictive model, allowing you to tag variables, such as those with high correlation to your target, for more focused analysis.


  • Discretization for Numeric Columns in Group_By: Implement the ability to create bins when using a numeric column for group_by, allowing for more granular grouping of data by dividing numeric values into specified ranges or intervals

🛠️ Improvements

  • Metrics Formula Editor with Syntax Highlighting and Autocompletion: Edit your metric formulas with ease using a new formula editor that includes syntax highlighting and autocompletion, making it faster and more intuitive to create and modify formulas.


🐛 Bug Fixes

  • Fix for Significant Variable Navigation: We have fixed the issue where clicking on a significant variable would not take you to the crossfilter if the "Show" was set to a specific group of tags. Now, it will automatically change the "Show" to "ALL", ensuring smooth navigation and proper visualization.

🎨 UI Updates

  • Updated Table Header in Plot: The header of the tables in plots has been rearranged for improved usability. The "+" button is now positioned on the right, while the pagination is moved to the left, offering a more intuitive layout for managing and navigating tables.
  • Shortcut to Open Plot from Data Table: Easily open the plot directly from the data table with a new shortcut, streamlining the transition between data exploration and visualization for more efficient analysis.

🔧 Technical Updates

  • Optimize metrics aggregation

October 29, 2024

Graphext Release Notes 2024-10-29

🌟 New Features

  • Sort Stacked Plot by First Color: You can now sort plots by the first color, enabling more intuitive visualization and better comparison of category proportions in the chart.
  • Option to Include or Exclude Nulls in Comparisons: You now have the option to include or exclude null values when comparing a selection to the total population
  • Drag and Drop to Order Categories in Ordinal Columns: You can now reorder the categories in an ordinal category column using drag and drop, providing a more intuitive way to reorder them

🔧 Technical Updates

  • Enhanced Group_By Step for Numerical and Date Variables: The group_by step has been modified to work with both numerical and date variables, now supporting discretization. This allows for grouping based on specified ranges or time intervals, providing more flexible and detailed aggregation of your data.
  • Group By Date with Custom Aggregations: A new method allows you to group by date and select specific aggregations, such as DayMonth, or Year, enabling more flexible and insightful date-based analysis.

October 15, 2024

Graphext Release Notes 2024-10-15

🌟 New Features

  • Create and Compare Metrics in Graphext: You can now define custom metrics like MRR or NPS within Graphext. These metrics will dynamically react to your selections, allowing you to compare them across the total population versus specific segments. For example, you can easily identify if a particular client segment has a much lower NPS than the overall average, helping to uncover valuable insights.

    In this example, we can see the NPS for all the reviews of a McDonald's. The NPS of the entire dataset is shown in grey around 46, while the NPS of the reviews with negative sentiment (highlighted in blue) is -36.9, highlighting a significant difference and offering insights into customer satisfaction based on sentiment.

    You can also visualize the difference in absolute numbers or as a percentage change between metrics.

  • Use Metrics in Plots: Now, you can use custom metrics like MRR or NPS directly in your plots, allowing you to visualize and compare these metrics across different segments or selections, enhancing your data analysis and insight generation.

🎨 UI Updates

  • Updated Row Removal and Reverse Selection: The trash icon has been removed from the top right corner. You can now remove rows or reverse your selection directly from the dropdown in the top left corner. A confirmation modal will appear with an explanation of what will be removed, ensuring clarity and preventing accidental deletions.

🔧 Technical Updates

  • Sorting Parameter for "Filter Duplicates" Step: Introduce a sorting parameter in the "filter_duplicates" step, allowing you to define how duplicates are prioritized and removed, making data cleaning more customizable and efficient.

September 24, 2024

Graphext Release Notes 2024-09-24

🌟 New Features

  • Leaf Embeddings Flow in the Wizard: Add the leaf embeddings flow to the wizard, to be able to do semi-supervised clustering, enhancing the clustering process with advanced embedding techniques and improving the flexibility and accuracy of the model.
  • Dataset Switcher Dropdown: Add a dropdown menu within a project to seamlessly switch between datasets, allowing for more flexible analysis and smoother transitions when working with multiple datasets within the same project.
  • Documentation Chat Widget: Access the documentation chat directly within Graphext through a new widget, allowing you to quickly find answers and guidance without leaving the platform.
  • File Picker for Drive and Google Sheets Integrations: Easily select and import files from Google Drive and Google Sheets with the new file picker, streamlining the process of integrating external data into Graphext.

🛠️ Improvements

  • New Aggregation Option: Group By with Lists: Now, you can aggregate categorical variables in the Group By function to generate lists of values for each group, providing a more detailed and comprehensive view of your data.

🎨 UI Updates

  • New Design for Significant Variables: The design for significant variables has been updated, allowing you to easily collapse them to reduce space when they are not needed, providing a cleaner and more organized interface for better focus on the analysis.

September 10, 2024

Graphext Release Notes 2024-09-10

🛠️ Improvements

  • Row Count for Grouped Data: Calculate the number of rows for each specific group generated by a group by operation. This will help you understand the distribution of data across the different groups in your analysis.
  • Expose Categorical Stats for Table in Plot: The current categorical statistics that are already being calculated (such as Unique Values, Mode, and Median) are now accessible, allowing you to use them directly in tables within your plots for more detailed and structured data visualization.

🔧 Technical Updates

  • Tenure and Churn Step: This step calculates the tenure by using the start and end dates and returns a new column with the tenure. It also adds a boolean column indicating churn (Yes or No) 
  • New "Discretize_on_Percentiles" Step: Introduce the discretize_on_percentiles step to categorize continuous data into discrete bins based on percentiles. This step helps segment the data more evenly and allows for better comparison and analysis.