What's New?

November 12, 2024

Graphext Release Notes 2024-11-12

🌟 New Features

  • Custom Queries for Variables: You can now perform custom queries on each variable, enabling faster category searches and more advanced queries that were previously not possible, such as AND conditions in categorical columns or disjoint ranges in numerical columns (e.g., "younger than 18 or older than 65"). For basic searches, a user-friendly UI is available, making it seamless, while for more advanced queries, a query editor and detailed documentation are provided to help you learn how to craft and execute complex queries.


  • Search for Variables in Cmd+K Omnibar: You can now search for variables from any section of Graphext directly through the Cmd+K omnibar. You'll be able to preview the distribution of those variables, making it easier to explore and analyze them without navigating through multiple sections.
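
To make the two query types mentioned under Custom Queries more concrete, here is a minimal Python sketch of the logic they express: a disjoint numeric range ("younger than 18 or older than 65") and an AND condition on a multi-valued categorical column. This is not Graphext's query syntax; the sample rows, the column names `age` and `tags`, and both helper functions are assumptions made purely for illustration.

```python
# Hypothetical sample rows; column names are illustrative only.
rows = [
    {"age": 12, "tags": {"student"}},
    {"age": 30, "tags": {"employed", "parent"}},
    {"age": 70, "tags": {"retired", "parent"}},
    {"age": 66, "tags": {"retired"}},
]


def in_disjoint_age_range(row: dict) -> bool:
    """Disjoint numeric range: younger than 18 OR older than 65."""
    return row["age"] < 18 or row["age"] > 65


def has_all_categories(row: dict, required: set) -> bool:
    """AND condition: the row must contain every required category."""
    return required <= row["tags"]


disjoint = [r["age"] for r in rows if in_disjoint_age_range(r)]
both = [r["age"] for r in rows if has_all_categories(r, {"retired", "parent"})]

print(disjoint)  # [12, 70, 66]
print(both)      # [70]
```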

🛠️ Improvements

  • Improved “prompt_ai” Step: There's no longer a need to be so explicit when forcing the output, making it more intuitive to use. A new JSON schema mode has been introduced for more structured data outputs. We now ensure that only the explicitly defined categories are returned, streamlining the results and ensuring more relevant outputs.

🔧 Technical Updates

  • Add support for Date columns in replace_values step

November 5, 2024

Graphext Release Notes 2024-11-05

🌟 New Features

  • Hall Redesign Implementation: The Hall has been redesigned for better clarity and navigation. The distinction between Home and Teams is now clearer, offering a more intuitive experience. Additionally, "What's New" is updated more frequently, and recent projects now include more filtering options, such as recently created, recently updated, recently refreshed, and recently viewed, with Trending Projects coming soon.
  • Quick Variable Tagging from Other Sections: You can now tag variables quickly from other sections, not just from the VM, making it easier and more efficient to tag relevant variables across your workflow. This feature is especially useful during EDA (Exploratory Data Analysis) before training a predictive model, allowing you to tag variables, such as those with high correlation to your target, for more focused analysis.


  • Discretization for Numeric Columns in Group_By: You can now create bins when using a numeric column in group_by, dividing numeric values into specified ranges or intervals for more granular grouping.
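
As a rough illustration of what discretizing a numeric column before grouping means, the sketch below assigns each value to a fixed-width bin and groups by the bin label. It is plain Python and does not reflect how the group_by step is implemented or configured; the function names, the `width` parameter, and the sample prices are all assumptions.

```python
from collections import defaultdict


def bin_label(value: float, width: float) -> str:
    """Return the half-open interval [lower, upper) that contains value."""
    lower = (value // width) * width
    return f"[{lower:g}, {lower + width:g})"


def group_by_bins(values, width):
    """Group values by the fixed-width bin they fall into."""
    groups = defaultdict(list)
    for v in values:
        groups[bin_label(v, width)].append(v)
    return dict(groups)


# Hypothetical numeric column.
prices = [3, 12, 18, 25, 27, 41]
groups = group_by_bins(prices, 10)

for label, members in groups.items():
    print(label, members)
# [0, 10) [3]
# [10, 20) [12, 18]
# [20, 30) [25, 27]
# [40, 50) [41]
```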

🛠️ Improvements

  • Metrics Formula Editor with Syntax Highlighting and Autocompletion: Edit your metric formulas with ease using a new formula editor that includes syntax highlighting and autocompletion, making it faster and more intuitive to create and modify formulas.


🐛 Bug Fixes

  • Fix for Significant Variable Navigation: We have fixed the issue where clicking on a significant variable would not take you to the crossfilter if the "Show" was set to a specific group of tags. Now, it will automatically change the "Show" to "ALL", ensuring smooth navigation and proper visualization.

🎨 UI Updates

  • Updated Table Header in Plot: The header of the tables in plots has been rearranged for improved usability. The "+" button is now positioned on the right, while the pagination is moved to the left, offering a more intuitive layout for managing and navigating tables.
  • Shortcut to Open Plot from Data Table: Easily open the plot directly from the data table with a new shortcut, streamlining the transition between data exploration and visualization for more efficient analysis.

🔧 Technical Updates

  • Optimize metrics aggregation

October 29, 2024

Graphext Release Notes 2024-10-29

🌟 New Features

  • Sort Stacked Plot by First Color: You can now sort plots by the first color, enabling more intuitive visualization and better comparison of category proportions in the chart.
  • Option to Include or Exclude Nulls in Comparisons: You now have the option to include or exclude null values when comparing a selection to the total population.
  • Drag and Drop to Order Categories in Ordinal Columns: You can now reorder the categories in an ordinal category column using drag and drop.

🔧 Technical Updates

  • Enhanced Group_By Step for Numerical and Date Variables: The group_by step has been modified to work with both numerical and date variables, now supporting discretization. This allows for grouping based on specified ranges or time intervals, providing more flexible and detailed aggregation of your data.
  • Group By Date with Custom Aggregations: A new method allows you to group by date and select specific aggregations, such as Day, Month, or Year, enabling more flexible and insightful date-based analysis.
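
Grouping by date at Day, Month, or Year granularity can be pictured as truncating each date to the chosen period and then aggregating. The sketch below counts rows per period in plain Python; the `FORMATS` mapping, the function name, and the sample dates are assumptions for illustration and are not taken from the group_by step itself.

```python
from collections import Counter
from datetime import date

# Hypothetical mapping from a granularity name to a date format.
FORMATS = {"Day": "%Y-%m-%d", "Month": "%Y-%m", "Year": "%Y"}


def count_by_period(dates, period):
    """Truncate each date to the given period and count rows per period."""
    fmt = FORMATS[period]
    return dict(Counter(d.strftime(fmt) for d in dates))


# Hypothetical date column.
orders = [date(2024, 10, 1), date(2024, 10, 15), date(2024, 11, 5), date(2023, 11, 5)]

by_month = count_by_period(orders, "Month")
by_year = count_by_period(orders, "Year")

print(by_month)  # {'2024-10': 2, '2024-11': 1, '2023-11': 1}
print(by_year)   # {'2024': 3, '2023': 1}
```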

October 15, 2024

Graphext Release Notes 2024-10-15

🌟 New Features

  • Create and Compare Metrics in Graphext: You can now define custom metrics like MRR or NPS within Graphext. These metrics will dynamically react to your selections, allowing you to compare them across the total population versus specific segments. For example, you can easily identify if a particular client segment has a much lower NPS than the overall average, helping to uncover valuable insights.

    In this example, we can see the NPS for all the reviews of a McDonald's. The NPS of the entire dataset is shown in grey around 46, while the NPS of the reviews with negative sentiment (highlighted in blue) is -36.9, highlighting a significant difference and offering insights into customer satisfaction based on sentiment.

    You can also visualize the difference in absolute numbers or as a percentage change between metrics.

  • Use Metrics in Plots: Now, you can use custom metrics like MRR or NPS directly in your plots, allowing you to visualize and compare these metrics across different segments or selections, enhancing your data analysis and insight generation.
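
For readers unfamiliar with how a metric such as NPS reacts to a selection, the following sketch computes it for a whole dataset and for a subset, then reports the absolute and percentage difference. It uses the standard NPS definition (percentage of scores 9 to 10 minus percentage of scores 0 to 6). The sample reviews, field names, and function are invented for illustration and are unrelated to the McDonald's example or to Graphext's formula editor.

```python
def nps(scores):
    """Net Promoter Score: % promoters (9-10) minus % detractors (0-6)."""
    if not scores:
        raise ValueError("NPS is undefined for an empty selection")
    promoters = sum(1 for s in scores if s >= 9)
    detractors = sum(1 for s in scores if s <= 6)
    return 100.0 * (promoters - detractors) / len(scores)


# Hypothetical reviews with a 0-10 score and a sentiment label.
reviews = (
    [{"score": s, "sentiment": "positive"} for s in (10, 9, 10, 9, 10)]
    + [{"score": 8, "sentiment": "neutral"}]
    + [{"score": s, "sentiment": "negative"} for s in (3, 2, 9, 6)]
)

total_nps = nps([r["score"] for r in reviews])
selection_nps = nps([r["score"] for r in reviews if r["sentiment"] == "negative"])

absolute_diff = selection_nps - total_nps
percent_change = 100.0 * absolute_diff / abs(total_nps)

print(total_nps)      # 30.0
print(selection_nps)  # -50.0
print(absolute_diff)  # -80.0
```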

🎨 UI Updates

  • Updated Row Removal and Reverse Selection: The trash icon has been removed from the top right corner. You can now remove rows or reverse your selection directly from the dropdown in the top left corner. A confirmation modal will appear with an explanation of what will be removed, ensuring clarity and preventing accidental deletions.

🔧 Technical Updates

  • Sorting Parameter for "Filter Duplicates" Step: Introduce a sorting parameter in the "filter_duplicates" step, allowing you to define how duplicates are prioritized and removed, making data cleaning more customizable and efficient.
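
One way to picture how a sort order can decide which duplicate survives: sort the rows first, then keep the first row seen for each key. The sketch below does this in plain Python. The parameter names `key`, `sort_by`, and `descending` are hypothetical and are not the actual parameters of the filter_duplicates step.

```python
def filter_duplicates(rows, key, sort_by, descending=True):
    """Keep one row per key value: the first one after sorting by sort_by."""
    ordered = sorted(rows, key=lambda r: r[sort_by], reverse=descending)
    seen = set()
    kept = []
    for row in ordered:
        if row[key] not in seen:
            seen.add(row[key])
            kept.append(row)
    return kept


# Hypothetical customer records; ISO date strings sort chronologically.
rows = [
    {"email": "a@x.com", "updated": "2024-01-01"},
    {"email": "a@x.com", "updated": "2024-06-01"},
    {"email": "b@x.com", "updated": "2024-03-01"},
]

# Keep the most recently updated record for each email.
latest = filter_duplicates(rows, key="email", sort_by="updated")
print(latest)
```

Sorting ascending instead (`descending=False`) would keep the oldest record per email, which shows how the sort order determines which duplicate is treated as the one to keep.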

September 24, 2024

Graphext Release Notes 2024-09-24

🌟 New Features

  • Leaf Embeddings Flow in the Wizard: The leaf embeddings flow is now available in the wizard, enabling semi-supervised clustering and improving the flexibility and accuracy of the model.
  • Dataset Switcher Dropdown: Add a dropdown menu within a project to seamlessly switch between datasets, allowing for more flexible analysis and smoother transitions when working with multiple datasets within the same project.
  • Documentation Chat Widget: Access the documentation chat directly within Graphext through a new widget, allowing you to quickly find answers and guidance without leaving the platform.
  • File Picker for Drive and Google Sheets Integrations: Easily select and import files from Google Drive and Google Sheets with the new file picker, streamlining the process of integrating external data into Graphext.

🛠️ Improvements

  • New Aggregation Option: Group By with Lists: Now, you can aggregate categorical variables in the Group By function to generate lists of values for each group, providing a more detailed and comprehensive view of your data.
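
Conceptually, a list aggregation collects every value of a categorical column into one list per group. A minimal plain-Python sketch, with invented data and a hypothetical helper name, might look like this:

```python
from collections import defaultdict


def group_to_lists(rows, by, column):
    """Collect the values of `column` into a list for each value of `by`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[by]].append(row[column])
    return dict(groups)


# Hypothetical order rows.
orders = [
    {"customer": "ana", "product": "laptop"},
    {"customer": "luis", "product": "monitor"},
    {"customer": "ana", "product": "mouse"},
]

products_per_customer = group_to_lists(orders, by="customer", column="product")
print(products_per_customer)
# {'ana': ['laptop', 'mouse'], 'luis': ['monitor']}
```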

🎨 UI Updates

  • New Design for Significant Variables: The design for significant variables has been updated, allowing you to easily collapse them to reduce space when they are not needed, providing a cleaner and more organized interface for better focus on the analysis.

September 10, 2024

Graphext Release Notes 2024-09-10

🛠️ Improvements

  • Row Count for Grouped Data: Calculate the number of rows for each specific group generated by a group by operation. This will help you understand the distribution of data across the different groups in your analysis.
  • Expose Categorical Stats for Table in Plot: The current categorical statistics that are already being calculated (such as Unique Values, Mode, and Median) are now accessible, allowing you to use them directly in tables within your plots for more detailed and structured data visualization.

🔧 Technical Updates

  • Tenure and Churn Step: This step calculates tenure from the start and end dates and returns it in a new column. It also adds a boolean column indicating churn (Yes or No).
  • New "Discretize_on_Percentiles" Step: Introduce the discretize_on_percentiles step to categorize continuous data into discrete bins based on percentiles. This step helps segment the data more evenly and allows for better comparison and analysis.
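
The idea behind percentile-based discretization can be sketched with the Python standard library: compute cut points at evenly spaced percentiles, then assign each value to the bin its cut points define. The function below reuses the step's name only for readability; its signature, the `n_bins` parameter, and the choice of the "inclusive" quantile method are assumptions, not the step's actual behavior.

```python
from bisect import bisect_right
from statistics import quantiles


def discretize_on_percentiles(values, n_bins=4):
    """Assign each value a bin index (0..n_bins-1) using percentile cut points."""
    cuts = quantiles(values, n=n_bins, method="inclusive")
    return [bisect_right(cuts, v) for v in values]


# Hypothetical continuous column.
values = [1, 2, 3, 4, 5, 6, 7, 8]
bins = discretize_on_percentiles(values, n_bins=4)

print(bins)  # [0, 0, 1, 1, 2, 2, 3, 3]
```

Because the cut points come from the data's own percentiles, each bin ends up with roughly the same number of rows, unlike fixed-width bins, which can be very unevenly populated on skewed data.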


August 27, 2024

Graphext Release Notes 2024-08-27

🌟 New Features

  • Font Style Customization in Annotations: Customize the font style in annotations, allowing you to change the font, size, color, and other properties for better alignment with your design preferences or visual clarity.
  • Marks and Value Labels in Line Charts: Add marks at specific data points in line charts, along with value labels to display the exact values at those points, providing a clearer and more informative visualization of key data trends.
  • Totals Footer in Tables: Add a totals footer to your tables, automatically calculating and displaying the sum or other aggregate values of relevant columns, enhancing data overview and analysis.

🔧 Technical Updates

  • Create step to calibrate model probabilities

August 13, 2024

Graphext Release Notes 2024-08-13

🌟 New Features

  • New Table Plot Types: Introducing simple tables and summary tables (group by) plots. Now you can create tables from your data exploration, group by multiple variables, and calculate aggregations over other dataset variables, providing clear and structured insights.

🛠️ Improvements

  • Numerical Variables in Grouped Bar Chart Plots: Now you can incorporate numerical variables into grouped bar chart plots, allowing for more detailed and dynamic visualizations that represent quantitative data alongside categorical groupings.
  • Sorting Categories in Color Mapping: You can now sort categories in color mapping, allowing you to organize and visualize data more effectively.

🐛 Bug Fixes

  • Fixed the recipe editor appearing in the light theme when the dark theme is active.

🎨 UI Updates

  • Persistent Position and Size for Recipe Modal: The recipe modal now remembers its last position and size, ensuring a consistent and personalized user experience each time you access it.